Science-as-a-Service

Similar documents
Developing Microsoft Azure Solutions (70-532) Syllabus

Developing Microsoft Azure Solutions (70-532) Syllabus

Developing Microsoft Azure Solutions (70-532) Syllabus

At Course Completion Prepares you as per certification requirements for AWS Developer Associate.

The Materials Data Facility

The future of database technology is in the clouds

There is REST and then there is REST. Radovan Semančík November 2017

ACCI Recommendations on Long Term Cyberinfrastructure Issues: Building Future Development

Building a Data Strategy for a Digital World

Sunday, May 1,

Sentinet for Microsoft Azure SENTINET

RESEARCH DATA DEPOT AT PURDUE UNIVERSITY

Modern Data Warehouse The New Approach to Azure BI

WHITE PAPER Cloud FastPath: A Highly Secure Data Transfer Solution

Playing Outside Your Sandbox INTERACTING WITH OTHER SYSTEMS USING SHAREPOINT BCS

THE API DEVELOPER EXPERIENCE ENABLING RAPID INTEGRATION

EXPLORE MICROSOFT SHAREPOINT SERVER 2016 AND BEYOND #ILTAG70

BLACKBERRY SPARK COMMUNICATIONS PLATFORM. Getting Started Workbook

Bots. Table of Contents

StratusLab Cloud Distribution Installation. Charles Loomis (CNRS/LAL) 3 July 2014

The IBM MobileFirst Platform

GPII Security. Washington DC, November 2015

AWS Lambda + nodejs Hands-On Training

Salesforce External Identity Implementation Guide

Successfully Integrating MBSE Data Without Replication Using OSLC

Jetstream: Adding Cloud-based Computing to the National Cyberinfrastructure

Transforming the Internal IT Landscape with APIs. Scott Cranton Director, Application Platform SAs April 2018

Best Practices in Securing a Multicloud World

Overview SENTINET 3.1

Automation with Meraki Provisioning API

Single Sign-On Best Practices

The Now Platform Reference Guide

ARCHITECTING WEB APPLICATIONS FOR THE CLOUD: DESIGN PRINCIPLES AND PRACTICAL GUIDANCE FOR AWS

Distributed Systems. 27. Engineering Distributed Systems. Paul Krzyzanowski. Rutgers University. Fall 2018

TECHNICAL OVERVIEW OF NEW AND IMPROVED FEATURES OF EMC ISILON ONEFS 7.1.1

Lesson 14 SOA with REST (Part I)

Oracle API Platform Cloud Service

Salesforce External Identity Implementation Guide

What is database? Types and Examples

Content-based Comparison for Collections Identification

Managing Performance in Liferay DXP: An Overview of Liferay Connected Services

Amazon Web Services (AWS) Training Course Content

Magento Marketplace's New Extension Quality Program

Federated access to e-infrastructures worldwide

Stable currents. Nick Veenstra. Developing against the Pure API

Azure Logic Apps. The big picture. API Apps, Logic Apps & API Management

AssetWise to OpenText PoC Closeout Report

Interoperable Cloud Storage with the CDMI Standard. Mark Carlson, SNIA TC and Oracle Co-Chair, SNIA Cloud Storage TWG

Technical Overview. Version March 2018 Author: Vittorio Bertola

Techno Expert Solutions

Microsoft Architecting Microsoft Azure Solutions.

The next generation of Google APIs

Planning and Administering SharePoint 2016

The iplant Data Commons

Salesforce External Identity Implementation Guide

MAPR DATA GOVERNANCE WITHOUT COMPROMISE

Course Outline. Lesson 2, Azure Portals, describes the two current portals that are available for managing Azure subscriptions and services.

Semantic Web Company. PoolParty - Server. PoolParty - Technical White Paper.

Hosted Azure for your business. Build virtual servers, deploy with flexibility, and reduce your hardware costs with a managed cloud solution.

Full Stack Web Developer Nanodegree Syllabus

Feature: Online App Builder Studio

Leveraging the Globus Platform in your Web Applications. GlobusWorld April 26, 2018 Greg Nawrocki

This paper introduces the security policies, practices, and procedures of Lucidchart.

Unified Governance for Amazon S3 Data Lakes

Aurora, RDS, or On-Prem, Which is right for you

OpenIAM Identity and Access Manager Technical Architecture Overview

Building the Digital Media Workstream: From project pitch to program purchase

Day 1 : August (Thursday) An overview of Globus Toolkit 2.4

onem2m and its role in achieving interoperability in IoT

Oracle WebCenter Interaction: Roadmap for BEA AquaLogic User Interaction. Ajay Gandhi Sr. Director of Product Management Enterprise 2.

Developing Microsoft Azure Solutions

RESTful API Design APIs your consumers will love

About Intellipaat. About the Course. Why Take This Course?

Citizen developer tools are not just for citizen developers!

Storage Made Easy. SoftLayer

Enabling Data Control in a Multi-Cloud World

How to Secure Your Cloud with...a Cloud?

WHAT S NEW IN QLIKVIEW 11

Webinar Annotate data in the EUDAT CDI


Building on to the Digital Preservation Foundation at Harvard Library. Andrea Goethals ABCD-Library Meeting June 27, 2016

CLOUD WORKLOAD SECURITY

Microservices, Messaging and Science Gateways. Review microservices for science gateways and then discuss messaging systems.

COMP390 (Design &) Implementation

Consumerization. Copyright 2014 Trend Micro Inc. IT Work Load

BEYOND CLOUD HOSTING. Andrew Melck, Regional Manager DACH,

OSIsoft Cloud Services Core Infrastructure for Developing Partner Applications

Authentication in the Cloud. Stefan Seelmann

Inside Symantec O 3. Sergi Isasi. Senior Manager, Product Management. SR B30 - Inside Symantec O3 1

Amazon Web Services (AWS) Solutions Architect Intermediate Level Course Content

MarkLogic Server. Reference Application Architecture Guide. MarkLogic 9 May, Copyright 2017 MarkLogic Corporation. All rights reserved.

OpenShift Roadmap Enterprise Kubernetes for Developers. Clayton Coleman, Architect, OpenShift

Building High Performance Apps using NoSQL. Swami Sivasubramanian General Manager, AWS NoSQL

Community Edition Getting Started Guide. July 25, 2018

A RESTful Approach to Identity-based Web Services

Cloud Computing 4/17/2016. Outline. Cloud Computing. Centralized versus Distributed Computing Some people argue that Cloud Computing. Cloud Computing.

Fusion Registry 9 SDMX Data and Metadata Management System

Modernize Your Storage

Apache Hadoop 3. Balazs Gaspar Sales Engineer CEE & CIS Cloudera, Inc. All rights reserved.

BUSINESS DATA LAKE FADI FAKHOURI, SR. SYSTEMS ENGINEER, ISILON SPECIALIST. Copyright 2016 EMC Corporation. All rights reserved.

Transcription:

Science-as-a-Service The iplant Foundation Rion Dooley Edwin Skidmore Dan Stanzione Steve Terry Matthew Vaughn

Outline Why, why, why! When duct tape isn t enough Building an API for the web Core services Love, hate, and building user communities Future plans: Roadmap or Apple Maps?

Why, why, why! iplant is developing cyberinfrastructure that uniquely enables scientists throughout the diverse fields that comprise plant biology to address Grand Challenges in new ways, to stimulate and facilitate cross-disciplinary research, to promote biology and computer science research interactions, and to train the next generation of scientists on the use of cyberinfrastructure in research and education.

Why, why, why! Lots of use cases, lots of users. Command line Public codes High performance Workflows High throughput Long-running tasks Science gateways Local hardware Big memory Web-based Big data Long tail science Database driven Grand challenge Proprietary codes Interactive

Why, why, why! Lots of solutions deployed Atmosphere: private cloud Data Store: virtualized distributed storage Discovery Environment: rich web-based science gateway TNRS: resolution service to help researcher interact Semantic Web: data and service integration based on ontological definitions MyPlant: social media site

Lots of support given Community forums Training workshops Why, why, why! http://www.iplantcollaborative.org/learn/calendar Application porting and optimization Collaborative projects Pilot programs Seed funding 5 7 14 0 250 0 42 7500 0 Seed projects Publications Collaborators Workshops Applications Users

Why, why, why! Despite all this, there was a clear need for access to the breadth of the iplant cyberinfrastructure at an API layer.

When duct tape isn t enough

When duct tape isn t enough Great solutions if solving for X or Y or Z Our users needed solutions for X & Y & Z Patching things together would actually make the end product more complicated Still doesn t address the missing pieces

When duct tape isn t enough Federated auth Accounting Sharing Collaboration Events Notifications Namespace Provenance Common interfaces

Building an API for the web We started with a simple set of requirements from the community: Restful services Friendly URLs Abstract as many details as possible Should work the way the web works Encourage good citizenship Add value on top of existing services Comprehensive, not complete coverage

Building an API for the web We are more alike than we are different HTTP SSL Basic Auth JSON XML Webhooks

Building an API for the web Setting expectations Web vs batch environments big data vs BIG DATA Shared environments Caching Transient systems Uncontrolled infrastructure

Building an API for the web Meeting expectations Asynchronous and Synchronous endpoints Callbacks & notifications Multiple queues Preemptive monitoring Caching Elastic scaling

and this helps us how? (Mostly) everything is in place to build

Core services Production Services Auth Audit Profile Systems IO Monitor Data Apps Jobs Postit Alpha Services Env Event Mashup Meta

Auth Service Future Foundation API https://foundation.iplantcollaborative.org/auth-v1/ Move towards full OAuth2 compliance Support delegated authentication decisions

Profile Service Future Foundation API https://foundation.iplantcollaborative.org/profile-v1/ Move towards OpenID compliance Support CRUD operations Support sandboxed identity management

IO Service Future Foundation API https://foundation.iplantcollaborative.org/io-v1/ Separate out generic staging functionality into a separate service Improve partial data queries Support user-defined systems

Data Service Future Foundation API https://foundation.iplantcollaborative.org/io-v1/ Improve partial data queries

Apps Service Future Foundation API https://foundation.iplantcollaborative.org/io-v1/ Support cloning of apps. Make app publication easier. Support different application execution types Support different execution platforms Better support for individual environment setups.

Systems Service Future Foundation API https://foundation.iplantcollaborative.org/io-v1/ Better monitoring POST method for updating Support registering of private systems Support convenience endpoints for status, etc. KBase integration

Monitor Service Future Foundation API https://foundation.iplantcollaborative.org/io-v1/ Deeper testing of individual systems Add networking tests Ramp up testing frequency when possible Add application and data movement tests

Foundation API https://foundation.iplantcollaborative.org/io-v1/ Job Service Future Add support for OSG, multi-cloud, FutureGrid, and user-defined systems Support CLI apps Support submissions run under individual user accounts Support arbitrary archiving locations Improve rate limiting, provenance, and data movement efficiencies Implement a charging models KBase integration

PostIt Service Future Foundation API https://foundation.iplantcollaborative.org/io-v1/ Expand as more general authenticated url shortening service.

Audit Service Future Foundation API https://foundation.iplantcollaborative.org/audit-v1/ Add better support for Atmosphere accounting Add data movement analytics

Event Service Future Foundation API https://foundation.iplantcollaborative.org/event-v1/ Integrate directly with underlying messaging services Incorporate global IDs into registration requests Integrate API events as first class events in service

Still to come Meta Service Future Improve scalability Explore better ways to translate between schemas Directly support ontology overlays Integrate tightly with the other API services

Still to come Env Service Still thinking through this service Primarily needed when we support individual system registration. Will handle user environment management Update, query user environment Full module support Discovery endpoints Migration endpoints

Love, hate, and building user communities 140 apps 250+ clients 7000+ end users 400TB of non-replicated data 5000+ jobs 2M+ hours burned

Love, hate, and building user communities

Love, hate, and building user communities

Love, hate, and building user communities

Love, hate, and building user communities

Things we ve done well (thus far) Listened to our users Responded to user feedback, Implemented bug fixes, and feature requests quickly and openly Listened to our users Handled early scalability issues Documented the API, the concepts, and the process Kept our services running through down times Been preemptive about issues and updates

Teaching Points Address versioning early Be obsessive-compulsive about regression testing. Communicate with your users early and often. When dependencies go down, you look bad, so design with failovers in mind. Big data doesn t move well on the web. Set user expectations early and often. Don t be afraid to get creative and provide alternative solutions.

Teaching Points No one else thinks it s as simple as you do. The success is in the science. Take the time to onboard developers.

Roadmap or Apple Maps? Don t build stuff just to build stuff Don t aim at nothing, we ll hit it every time Listen to our users. They re the only reason we re here.

Roadmap or Apple Maps? 2.0 release Global identifiers New HPC, HTC, Cloud, and private systems New application types 100 s of new apps Performance improvements Stability improvements

Roadmap or Apple Maps? New services OAuth2: fully compliant with current spec Staging service: from anywhere to anywhere using any protocol Metadata: deep metadata support for the entire enterprise Mashup: store and share workflows Env: manage environments across systems

Developer s Console https://foundation.iplantcollaborative.org/

Demo App https://foundation.iplantcollaborative.org/iplant-test

More Info http://iplantcollaborative.org https://foundation.iplantcollaborative.org/docs https://foundation.iplantcollaborative.org/forums

Questions? We re hiring!!! www.tacc.utexas.edu/about/jobs