XSEDE BOF: Science Clouds

Similar documents
Beyond the Clouds: Trends in Cloud Compu4ng for Science

Sky Computing on FutureGrid and Grid 5000 with Nimbus. Pierre Riteau Université de Rennes 1, IRISA INRIA Rennes Bretagne Atlantique Rennes, France

Large Scale Sky Computing Applications with Nimbus

Distributed Systems. 31. The Cloud: Infrastructure as a Service Paul Krzyzanowski. Rutgers University. Fall 2013

CS 6393 Lecture 10. Cloud Computing. Prof. Ravi Sandhu Executive Director and Endowed Chair. April 12,

Cloud Computing with Nimbus

What is Cloud Computing? What are the Private and Public Clouds? What are IaaS, PaaS, and SaaS? What is the Amazon Web Services (AWS)?

Science Clouds: Early Experiences in Cloud Computing for Scientific Applications

Jetstream: Adding Cloud-based Computing to the National Cyberinfrastructure

ACCI Recommendations on Long Term Cyberinfrastructure Issues: Building Future Development

A Cloud Compu*ng Approach to On- Demand and Scalable CyberGIS Analy*cs

CLOUD COMPUTING. Rajesh Kumar. DevOps Architect.

Introduction to Cloud Computing

Cloud Computing for Science

PUBLIC AND HYBRID CLOUD: BREAKING DOWN BARRIERS

Clouds: An Opportunity for Scientific Applications?

Building Effective CyberGIS: FutureGrid. Marlon Pierce, Geoffrey Fox Indiana University

Introduction to data centers

HPC learning using Cloud infrastructure

Infrastructure Clouds for Science and Education: Platform Tools

Cloud Computing with Nimbus

CHAMELEON: A LARGE-SCALE, RECONFIGURABLE EXPERIMENTAL ENVIRONMENT FOR CLOUD RESEARCH

Cloud Computing 4/17/2016. Outline. Cloud Computing. Centralized versus Distributed Computing Some people argue that Cloud Computing. Cloud Computing.

Examining Public Cloud Platforms

Contextualization: Providing One-Click Virtual Clusters

Cornell Red Cloud: Campus-based Hybrid Cloud. Steven Lee Cornell University Center for Advanced Computing

DATA SHEET AlienVault USM Anywhere Powerful Threat Detection and Incident Response for All Your Critical Infrastructure

Eucalyptus Overview The most widely deployed on-premise cloud computing platform

COP Cloud Computing. Presented by: Sanketh Beerabbi University of Central Florida

Jetstream: A science & engineering cloud

Cloud Computing Lecture 4

What is Cloud Computing? Cloud computing is the dynamic delivery of IT resources and capabilities as a Service over the Internet.

Cloud Essentials for Architects using OpenStack

Getting to Know Apache CloudStack

Basics of Cloud Computing Lecture 2. Cloud Providers. Satish Srirama

SURVEY PAPER ON CLOUD COMPUTING

Cloud platforms. T Mobile Systems Programming

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme

XSEDE s Campus Bridging Project Jim Ferguson National Institute for Computational Sciences

XSEDE Campus Bridging Tools Rich Knepper Jim Ferguson

Topics II SaaS 2.0 OaSe*: OpenCms as a Service Cloud Sites SaaS 3.0 Google Apps (Gmail, Google Docs) Cloud Assessment Services Cloudify: Cloud Applica

Designing MQ deployments for the cloud generation

Cloud Computing and Service-Oriented Architectures

Virtual Appliances and Education in FutureGrid. Dr. Renato Figueiredo ACIS Lab - University of Florida

Cloud Computing. Grace A. Lewis Research, Technology and Systems Solutions (RTSS) Program System of Systems Practice (SoSP) Initiative

Automated Deployment of Private Cloud (EasyCloud)

EBOOK: VMware Cloud on AWS: Optimized for the Next-Generation Hybrid Cloud

Community Clouds And why you should care about them

NCTA-Certified CloudOps Specialist (NCO) Exam NCO-110

Science-as-a-Service

High Performance and Cloud Computing (HPCC) for Bioinformatics

No Limits Cloud Introducing the HPE Helion Cloud Suite July 28, Copyright 2016 Vivit Worldwide

Science Computing Clouds.

Cloud Computing. Amazon Web Services (AWS)

Automating Elasticity. March 2018

Apache Airavata Open Source Framework

2013 AWS Worldwide Public Sector Summit Washington, D.C.

Module Day Topic. 1 Definition of Cloud Computing and its Basics

Cloud platforms T Mobile Systems Programming

Building a Data-Friendly Platform for a Data- Driven Future

Mobile Cloud Computing

Cloud & container monitoring , Lars Michelsen Check_MK Conference #4

UVA HPC & BIG DATA COURSE. Cloud Computing. Adam Belloum

Automated Deployment of Private Cloud (EasyCloud)

Cloud Computing: Making the Right Choice for Your Organization

Ioan Raicu. Everyone else. More information at: Background? What do you want to get out of this course?

Survey on Cloud Infrastructure Service: OpenStack Compute

CLOUD COMPUTING. Lecture 4: Introductory lecture for cloud computing. By: Latifa ALrashed. Networks and Communication Department

Cloud Infrastructure

Basics of Cloud Computing Lecture 2. Cloud Providers. Satish Srirama

Fast IT - Policy Driven Infrastructure for the Intercloud World

Cloud Computing the VMware Perspective. Bogomil Balkansky Product Marketing

Cloud Computing and Service-Oriented Architectures

Innovative Solutions. Trusted Performance. Intelligently Engineered. Comparison of SD WAN Solutions. Technology Brief

Jetstream Overview A national research and education cloud

Science Clouds: Early Experiences in Cloud Computing for Scientific Applications

How to Keep UP Through Digital Transformation with Next-Generation App Development

Spatio-temporal visualization of SWAT Outputs using SWATShare

Top 40 Cloud Computing Interview Questions

Cloud solution consultant

Supporting Cloud Computing with the Virtual Block Store System

CHEM-E Process Automation and Information Systems: Applications

AWS Solution Architecture Patterns

Genomics on Cisco Metacloud + SwiftStack

Cisco Cloud Architecture with Microsoft Cloud Platform Peter Lackey Technical Solutions Architect PSOSPG-1002

IBM Bluemix compute capabilities IBM Corporation

Cloud solution consultant

Introduction to Amazon Web Services

Moving to the Cloud: Making It Happen With MarkLogic

Amazon Web Services (AWS) Solutions Architect Intermediate Level Course Content

Cloud computing and Citrix C3 Last update 28 July 2009 Feedback welcome -- Michael. IT in the Clouds. Dr Michael Harries Citrix Labs

Managing and Auditing Organizational Migration to the Cloud TELASA SECURITY

A10 HARMONY CONTROLLER

White Paper Impact of DoD Cloud Strategy and FedRAMP on CSP, Government Agencies and Integrators.

An Experimental Study of Load Balancing of OpenNebula Open-Source Cloud Computing Platform

Ian Foster, CS554: Data-Intensive Computing

OPENSTACK PRIVATE CLOUD WITH GITHUB

Cloud Computing Introduction & Offerings from IBM

FAST TRACK YOUR AMAZON AWS CLOUD TECHNICAL SKILLS. Enterprise Website Hosting with AWS

HPE GO4SAP Getting Orchestrated for SAP Automate SAP Basis Management Enterprise-scale end-to-end

Transcription:

XSEDE BOF: Science Clouds Kate Keahey, David Lifka, Manish Parashar, Warren Smith, Carol Song, Shaowen Wang 11/5/13 1

Clouds Available to Science 11/5/13 2

FutureGrid Overview A distributed infrastructure to support computer and computational science experiments Performance analysis, software testing and evaluation, interoperability A rich education and training platform University courses, multi-day training sessions, informal learning and exploration Includes Cloud, Grid, and High-Performance Computing environments Allows users to configure environments to meet their needs Typically available interactively Provides tools to support rigorous experimentation Documenting configurations, recording experiments Allocatable via XSEDE

FutureGrid Deployment Architecture All sites are connected via a core router that is a1ached to a configurable network impairment device. Dedicated network links are shown in red.

IaaS ParBBons Site # Cores* TFLOPS* Total RAM* (GB) Secondary Storage* (TB) ParRRons mostly independent (Nimbus ones share authenrcaron credenrals) Accessed via implementaron- specific interfaces and Amazon interfaces IU system includes large memory/disk nodes PlaLorms IU 1308 11 2048+ 335 Eucalyptus, OpenStack TACC 768 8 1152 30 Nimbus, OpenStack UC 672 7 1344 120 Nimbus SDSC 672 7 2688 72 Nimbus, OpenStack UF 256 3 768 Nimbus Total 3676 42 9344 557 *All parrrons on cluster

Virtual Machine Images and Appliances Provide a set of generic Linux images Provide appliance images for specific tasks Virtual cluster Condor MPI Hadoop Users create and share their own images

On-Demand Research Computing Infrastructure as a Service Software as a Service Cloud Storage Solutions www.cac.cornell.edu/redcloud 11/5/13 7 11/5/13 7

Commercial Clouds AWS (Kate) Instances From $0.02 per hour to Top500 membership Xen hypervisor Storage Transient: 160 GB-48 TB on VM Persistent: Elastic Block Storage (EBS) 1BG-1TB Networking Internal: up to 10 Gb Ethernet for Cluster Compute AWS Direct Connect (for large volume users) Sneakernet Pricing Reserved instances (up to 71% savings), 1-3 year commitment Spot instances 11/5/13 8

Other Major Commercial Offerings (Kate) Google Compute Cloud (since 06/12) Target high performance market, emphasizes consistent performance Uses KVM Pricing by the minute Windows Azure (since 2010) Started out as a platform offering Uses Windows Azure Hypervisor (can do Linux) Rackspace Based on OpenStack (since 2012) Uses Xen hypervisor See more at scienceclouds.org 11/5/13 9

Science Cloud SoluBons: from Private and Community Clouds to Commercial Clouds from Proof- of- Concept to SoluBon 11/5/13 10

Image Management 11/5/13 11

ContextualizaBon Mainstream ctx tools: Chef, Puppet Providing abstractions, scalability, repatability and control StarCluster, Nimbus Context Broker, cloudinit.d How can I automate mulr- dependency deployments? How can I monitor the changes to the environment consistently? How can I create virtual clusters automarcally? How can I automate upgrades? How can I securely create a trust layer? 11/5/13 12

MulB- Cloud, Availability and Scalability Creates and manages domains, groups of resources in the cloud Scalability and availability regulate domain properties based on system and application metrics Multi-cloud Mitigate risks by working with multiple providers Extensible monitoring VM-based, provider-based, and domain data OpenTSDB, traffic sentinel, AWS CloudWatch, etc. Policy agnostic From pre-defined policies to python programs 11/5/13 13

IntegraBng On- Demand Resources Workflow Process Dispatcher Torque Scheduler Condor Scheduler Phantom 11/5/13 14

Science Cloud ApplicaBons 11/5/13 15

Ocean Observatory IniBaBve Towards Observatory Science Sensor-driven processing An always-on service Real-time event-based data stream processing capabilities Highly volatile need for data distribution and processing Nimbus team building platform services for integrated, reactive support for on-demand science High-availability Auto-scaling From regional Nimbus clouds to commercial clouds 16

Building a Plant Observatory Diseases Hyperspectral imaging system mounted on a tower Spectral Indices* NDVI SAVI EVI NDWI Mul@- Temporal Hyperspectral Cubes and Spectral Index Images applica@on images courtesy of Yuki Hamada, ANL Joint project with Pete Beckman, Nicola Ferrier, Yuki Hamada, Rao Kotamarthi, Rajesh Sankaran and others (ANL) 11/5/13 XSEDE 2013, Science Clouds BOF 17

The iplant CollaboraBve Challenge: to build a lasting, community driven Cyberinfrastructure for the Grand Challenges of Plant Science End Users ComputaRonal Users Teragrid XSEDE ü Storage ü ComputaRon ü HosRng ü Web Services ü Scalability Building a plalorm that can support diverse and constantly evolving needs.

iplant Cloud Investments Atmosphere (pre-configured virtual machines) Private science cloud @ University of Arizona Eucalyptus -> OpenStack Custom UI & API Discovery Environment (web-based access to tools and data) Several services running in the cloud DevOps in the cloud Foundation API (programmatic access to iplant) Running on multiple clouds (FutureGrid, HP, Rackspace) Leveraging PaaS and SaaS (HP, Iron.io) DevOps in the cloud Supports execution to arbitrary IaaS providers. Courtesy of Rion Dooley, TACC

Apache Airavata Science Gateway Framework Courtesy of Suresh Marru, Indiana University

Airavata Cloud Usage Globus XSEDE Java CoG AbstracRon Portals & Science Gateways Apache Airavata EC2 API, Whirr API EC2, SSH, HDFS Amazon EC2, Hadoop Systems JSDL / BES Adaptor Unicore JUROPA (Germany) Java CoG AbstracRon Globus Virtual Cluster Infrastructure Cloud (future work)

SWATShare Online SWAT SimulaBon and Model Sharing h"p://water- hub.org An XSEDE science gateway Available to broad SWAT user community Being integrated with Hydroshare project funded by NSF SI2 award Used by researchers and classes SimulaRon of daily streamflow for water resources management in Wabash River Basin SimulaRon of monthly flows for drought predicron and crop management SimulaRon of daily, monthly and annual streamflows for nutrient loading Technical implementaron Using different resources based on job nature: SimulaRon CalibraRon SensiRvity experiment Challenges relevant to cloud compurng On- demand scalability Adaptability E.g., migrate from one resource to next 22

Discussion: How can clouds complement the exisbng XSEDE resources? 11/5/13 23

XSEDE Cloud Investigation Investigation Team Ian Foster, Steve Tuecke, ANL/University of Chicago David Lifka, Susan Mehringer, Paul Redfern, Cornell University CAC Craig Stewart, Indiana University Manish Parashar, Rutgers University Cloud Survey Motivation The goal of XSEDE is to enhance research productivity XSEDE must embrace cloud XSEDE must have a clear understanding of how researchers using the cloud today and why Based on this information XSEDE plans to integrate cloud services into its portfolio to support use cases that are not well served by its current resource offerings Survey Status (www.xsede.org/cloudsurvey - closed end of March; report by July 1) 80 cloud projects from around globe, broad participation (21 disciplines + HASS), extensive technical data (19 categories), user perspectives, e.g., cloud benefits/challenges Focused exclusively on use of cloud for research and education 24

1. Pay as You Go 2. Lower Costs 3. Compute Elasticity 4. Data Elasticity 5. Software as a Service 6. Education as a Service 7. Broader Use 8. Scientific Workflows 9. Rapid Prototyping 10. Data Analysis User Identified Benefits 25

1. Learning Curve 2. Virtual Machine 3. Bandwidth 4. Memory Limits 5. Databases 6. Interoperability 7. Security 8. Data Movement 9. Storage 10. Cost/Funding User Identified Challenges 26