The OpenCirrus TM Project: A global Testbed for Cloud Computing R&D

Similar documents
Open Cirrus TM Cloud Computing Testbed: Federated Data Centers for Open Source Systems and Services Research

Grids and Clouds. GridKa School 2009 Forschungszentrum Karlsruhe September 1st 2009 Christian Baun.

Clouds and Grids. DESY Hamburg July 6th Christian Baun.

CPSC 426/526. Cloud Computing. Ennan Zhai. Computer Science Department Yale University

DISTRIBUTED SYSTEMS [COMP9243] Lecture 8a: Cloud Computing WHAT IS CLOUD COMPUTING? 2. Slide 3. Slide 1. Why is it called Cloud?

Large Scale Sky Computing Applications with Nimbus

Cloud Computing. Grace A. Lewis Research, Technology and Systems Solutions (RTSS) Program System of Systems Practice (SoSP) Initiative

Easy Access to Grid Infrastructures

MOHA: Many-Task Computing Framework on Hadoop

Magellan Project. Jeff Broughton NERSC Systems Department Head October 7, 2009

Sky Computing on FutureGrid and Grid 5000 with Nimbus. Pierre Riteau Université de Rennes 1, IRISA INRIA Rennes Bretagne Atlantique Rennes, France

Architectural Implications of Cloud Computing

An Introduction to Virtualization and Cloud Technologies to Support Grid Computing

Efficient On-Demand Operations in Distributed Infrastructures

Matrix IT work Copyright Do not remove source or Attribution from any graphic or portion of graphic

The IBM Platform Computing HPC Cloud Service. Solution Overview

High Performance and Cloud Computing (HPCC) for Bioinformatics

HPC in Cloud. Presenter: Naresh K. Sehgal Contributors: Billy Cox, John M. Acken, Sohum Sohoni

Computing as a Service

HPC learning using Cloud infrastructure

Cloud Computing mit mathematischen Anwendungen

Cloud computing and Citrix C3 Last update 28 July 2009 Feedback welcome -- Michael. IT in the Clouds. Dr Michael Harries Citrix Labs

CLOUD COMPUTING ABSTRACT

Cloud Computing An IT Paradigm Changer

Bringing OpenStack to the Enterprise. An enterprise-class solution ensures you get the required performance, reliability, and security

Distributed Systems. 31. The Cloud: Infrastructure as a Service Paul Krzyzanowski. Rutgers University. Fall 2013

Mobile Cloud Computing

Cloud Computing introduction

Resources and Services Virtualization without Boundaries (ReSerVoir)

ECE Enterprise Storage Architecture. Fall ~* CLOUD *~. Tyler Bletsch Duke University

Improving Blade Economics with Virtualization

Data Intensive Scalable Computing. Thanks to: Randal E. Bryant Carnegie Mellon University

Migration to Cloud Computing: Roadmap for Success

Mobile Cloud Computing

CSE6331: Cloud Computing

Next Generation Storage for The Software-Defned World

EFFICIENT ALLOCATION OF DYNAMIC RESOURCES IN A CLOUD

What is Cloud Computing? What are the Private and Public Clouds? What are IaaS, PaaS, and SaaS? What is the Amazon Web Services (AWS)?

Cloud Computing Introduction & Offerings from IBM

Demystifying the Cloud With a Look at Hybrid Hosting and OpenStack

SURVEY PAPER ON CLOUD COMPUTING

Evolution of the ATLAS PanDA Workload Management System for Exascale Computational Science

Intro to Software as a Service (SaaS) and Cloud Computing

HPC Cloud at SURFsara

Some thoughts on the evolution of Grid and Cloud computing

New Fresh Storage Approach for New IT Challenges Laurent Denel Philippe Nicolas OpenIO

Title DC Automation: It s a MARVEL!

Cloud Computing: A Perspective Study

The Software Driven Datacenter

Distributed Systems COMP 212. Lecture 18 Othon Michail

Cloud Computing 2. CSCI 4850/5850 High-Performance Computing Spring 2018

Simulation of Cloud Computing Environments with CloudSim

Cloud Computing at Yahoo! Thomas Kwan Director, Research Operations Yahoo! Labs

Now SAML takes it all:

The Social Grid. Leveraging the Power of the Web and Focusing on Development Simplicity

Faculté Polytechnique

Developing, Deploying and Managing Applications on the Cloud

Community Clouds And why you should care about them

Data Center Fundamentals: The Datacenter as a Computer

ICN for Cloud Networking. Lotfi Benmohamed Advanced Network Technologies Division NIST Information Technology Laboratory

Moving to the Cloud. Developing Apps in. the New World of Cloud Computing. Dinkar Sitaram. Geetha Manjunath. David R. Deily ELSEVIER.

NC Education Cloud Feasibility Report

Large Scale Computing Infrastructures

Cloud Computing Patterns & Best Practices. Ezhil Arasan Babaraj Director of R&D Labs CSS Corp, India

Cloud Bursting: Top Reasons Your Organization will Benefit. Scott Jeschonek Director of Cloud Products Avere Systems

Lecture 7: Data Center Networks

Introduction to Cloud Computing. [thoughtsoncloud.com] 1

Cloud Computing Lecture 4

Cloud & AWS Essentials Agenda. Introduction What is the cloud? DevOps approach Basic AWS overview. VPC EC2 and EBS S3 RDS.

EGI-InSPIRE. Cloud Services. Steven Newhouse, EGI.eu Director. 23/05/2011 Cloud Services - ASPIRE - May EGI-InSPIRE RI

IT Enterprise Services. Capita Private Cloud. Cloud potential unleashed

From Internet Data Centers to Data Centers in the Cloud

Pervasive Insight. Mission Critical Platform

Chapter 3 Virtualization Model for Cloud Computing Environment

CloudLab. Updated: 5/24/16

RED HAT CLOUD STRATEGY (OPEN HYBRID CLOUD) Ahmed El-Rayess Solutions Architect

Capabilities & Offering for the Spanish Academic Community. Madrid, 28th November 2017

Module Day Topic. 1 Definition of Cloud Computing and its Basics

Cloud Computing. Ennan Zhai. Computer Science at Yale University

BUILDING A PRIVATE CLOUD. By Mark Black Jay Muelhoefer Parviz Peiravi Marco Righini

ACCI Recommendations on Long Term Cyberinfrastructure Issues: Building Future Development

Cloud Transformation: Data center usage models driving Cloud computing innovation. Jake Smith, Advanced Server Technologies Data Center Group Intel

Cloud computing Antonio Corradi

Scheduling Computational and Storage Resources on the NRP

Cloud-Security: Show-Stopper or Enabling Technology?

Data Center Management and Automation Strategic Briefing

Research Challenges in Cloud Infrastructures to Provision Virtualized Resources

Copyright 2012 EMC Corporation. All rights reserved. Obrigado

MATE-EC2: A Middleware for Processing Data with Amazon Web Services

Introduction To Cloud Computing

CSE 124: THE DATACENTER AS A COMPUTER. George Porter November 20 and 22, 2017

David Bernstein June 2012

Modern Data Warehouse The New Approach to Azure BI

ERP Solution to the Cloud

Cloud Computing. Grace A. Lewis Research, Technology and Systems Solutions (RTSS) Program System of Systems Practice (SoSP) Initiative

Cloud Services. Infrastructure-as-a-Service

Big Data and Cloud Computing

Lecture 10.1 A real SDN implementation: the Google B4 case. Antonio Cianfrani DIET Department Networking Group netlab.uniroma1.it

Self-driving Datacenter: Analytics

HPC over Cloud. July 16 th, SCENT HPC Summer GIST. SCENT (Super Computing CENTer) GIST (Gwangju Institute of Science & Technology)

Transcription:

The OpenCirrus TM Project: A global Testbed for Cloud Computing R&D Marcel Kunze Steinbuch Centre for Computing (SCC) Karlsruhe Institute of Technology (KIT) Germany KIT The cooperation of Forschungszentrum Karlsruhe GmbH und Universität Karlsruhe (TH) www.kit.edu

Karlsruhe Institute of Technology (KIT)! Cooperation between research centre Karlsruhe und Karlsruhe university! Largest scientific center in Germany! 8.000 scientists, 18.000 students! Annual budget: > 500 Million Euro! R&D focus: Energy research and nano-technology >> + 2

Agenda! What is cloud computing?! OpenCirrus TM project! Programming the cloud! HPC and big data! Summary 3

Cloud Computing: A possible Definition A computing cloud is a set of network enabled on demand IT services, scalable and QoS guaranteed, which could be accessed in a simple and pervasive way. 4

Cloud lives in Web 2.0! Everything as a Service (XaaS)! AaaS: Application as a Service! PaaS: Platform as a Service! SaaS: Software as a Service! DaaS: Data as a Service! IaaS: Infrastructure as a Service! HaaS: Hardware as a Service! Industry is pretty much engaged! Various commercial offerings exist 5

Commercial Cloud Offerings (Small Excerpt)! Problem: Commercial offerings are proprietary and usually not open for cloud systems research and development 6

Cloud Systems Research! Simple, transparent, controllable cloud computing infrastructure! What types of interfaces are appropriate for clouds?! How should cloud networks be constructed/managed?! How are security concerns addressed in the cloud?! How are various workloads most efficiently transferred?! What types of applications can run in clouds?! What types of service level agreements are appropriate/possible?! Research requirements! Perform experiments also on a low system level! Flexible cloud computing framework! Compare different methodologies and implementations 7

Cloud Computing: A new Hype following Grid OpenCirrus TM Cloud computing R&D: OpenCirrus TM project 8

Clouds vs. Grids: A Comparison Objective Infrastructure Middleware Cloud Computing Provide desired computing platform via network enabled services One or few data centers, heterogeneous/homogeneous resource under central control, Industry and Business Proprietary, several reference implementations exist (e.g. Amazon) Grid Computing Resource sharing Job execution Geographically distributed, heterogeneous resource, no central control, VO Research and academic organization Well developed, maintained and documented Application Suited for generic applications Special application domains like High Energy Physics User interface Easy to use/deploy, no complex user interface required Difficult use and deployment Need new user interface, e.g., commands, APIs, SDKs, services Business Model Commercial: Pay-as-you-go Publicly funded: Use for free Operational Model Industrialization of IT Fully automated Services Mostly Manufacture Handcrafted Services QoS Possible Little support On-demand provisioning Yes No 9

10

OpenCirrus Cloud Computing Research Testbed http://opencirrus.org! An open, internet-scale global testbed for cloud computing research! Data center management & cloud services! Systems level research! Application level research! Structure: a loose federation! Sponsors: HP Labs, Intel Research, Yahoo!! Partners: UIUC, Singapore IDA, KIT, NSF! Members: System and application development! Great opportunity for cloud R&D 11

Where are the OpenCirrus sites?! Six sites initially: HP Yahoo (sf)! Sites distributed world-wide: HP Research, Yahoo!, UIUC, Intel Research Pittsburgh, KIT, Singapore IDA! 1000-4000 processor cores per site! New CMU site coming in 2009 UIUC Intel (pgh) CMU (coming in 09) KIT (de) IDA (sg) 12

Cloud Architecture Source: S.Tai 13

OpenCirrus TM Blueprint Cloud application services Virtual Resource Sets Cloud infrastructure services Eucalyptus IT infrastructure layer (Physical Resource Sets) 14

Physical Resource Sets (PRS)! PRS service goals! Provide mini-datacenters to researchers! Isolate experiments from each other! Stable base for other research! PRS service approach! Allocate sets of physical co-located nodes, isolated inside VLANs.! Leverage existing software (e.g. Utah Emulab, HP OpsWare)! Start simple, add features as we go! Base to implement virtual resource sets! Hardware as a Service (HaaS) 15

Virtual Resource Sets (VRS)! Basic idea: Abstract from physical resource by introduction of a virtualization layer! Concept applies to all IT aspects: CPU, storage, networks and applications,! Main advantages! Implement IT services exactly fitting customer s varying need! Deploy IT services on demand! Automated resource management! Easily guarantee service levels! Live migration of services! Reduce both: CapEx and OpEx! Infrastructure as a Service (IaaS)! Implement Compute and Storage services! De-facto standard: Amazon Web Services interface 16

Amazon Web Services http://aws.amazon.com/ 17

Eucalyptus: A potential VRS layer http://eucalyptus.cs.ucsb.edu/ Amazon EC2 and S3 Interface! Client-side API! Translator! Cloud Controller! Database! Cluster Controller! Node Controller! Source: R.Wolski 18

Programming the Cloud: Hadoop! An open-source Apache software foundation project sponsored by Yahoo!! http://wiki.apache.org/hadoop/projectdescription! intent is to reproduce the proprietary software infrastructure developed by Google! Provides a parallel programming model (MapReduce), a distributed file system, and a parallel database! http://en.wikipedia.org/wiki/hadoop! http://code.google.com/edu/parallel/mapreduce-tutorial.html 19

The MapReduce Programming Model! Map computation across many objects! Extract a set of key value pairs of e.g. 10 10 Web pages! Reduce results in many different ways! Combine it with other values that share the same key! System deals with issues of resource allocation & reliability 20

How is OpenCirrus different from other testbeds?! OpenCirrus TM supports both system- and app-level research! n/a at Google/IBM and EC2/S3! OpenCirrus TM researchers will have complete access to the underlying hardware and software platform.! OpenCirrus TM allows Intel platform features that support cloud computing (e.g. DCMI, NM) to be exposed, and exploited. Map-Reduce apps Hadoop Virtual machines Google/IBM cluster Map-Reduce apps Hadoop Cluster mgmt software Virtual or physical machines Open Cirrus cluster Can be modified by users Cannot be modified by users Cloud apps and services Can be modified by users 21

How do users get access to OpenCirrus sites?! Project PIs apply to each site separately.! Contact names, email addresses, and web links for applications to each site will be available on the OpenCirrus TM Web site (which goes live Q1)! http://opencirrus.org! Each OpenCirrus TM site decides which users and projects get access to its site.! Planning to have a global sign on for all sites! Users will be able to login to each OpenCirrus TM site for which they are authorized using the same login and password. 22

Who can use the OpenCirrus Resources?! Three different types of users can use OpenCirrus TM sites: (a) Individual PIs from academic research groups (b) Industry researchers from the OpenCirrus TM partners (c) Industry researchers who have a customer relationship with the OpenCirrus TM partners! What is the expected mix of these groups?! The majority of users will be (a) academic researchers and (b) researchers who work for the OpenCirrus TM partners.! There will be a few carefully chosen users who are (c) industry researchers with a customer relationship with an OpenCirrus TM partner 23

What kinds of research projects are OpenCirrus sites looking for?! Open Cirrus TM is seeking research in the following areas (different centers will weight these differently):! Datacenter federation! Datacenter management! Web services! Data-intensive applications and systems! Hadoop map-reduce applications! The following kinds of projects are not of primary interest:! Traditional HPC application development.! Production applications that just need lots of cycles.! Closed source system development. 24

Potential Fields of Cloud System Development (1)! Virtual organizations and social networks! Science is team work, clouds are rather for individuals right now! Integration of cloud services! Standardization of APIs and protocols! Hyperclouds may integrate services of various providers (Stratosphere?)! Management of service quality! Negotiation and monitoring of SLAs! How does this work for Web service mashups?! Privacy, data protection and security! Importance of AAA and encryption! e.g. use of Trusted Platform Module (TPM) 25

Cloud Security: A possible Solution Source: IBM 26

Potential Fields of Cloud System Development (2)! New infrastructure services! HPCaaS: High Performance Computing as a Service! LSDFaaS: Large Scale Data Facility as a Service! GenomeDBaaS: Genome Database as a Service! How does this relate to Grid computing? 27

HPC vs. HTC vs. MTC (Many Task Computing) MTC HPC HTC Source: I.Foster 28

The Grid and Cloud Space glite UNICORE Traditional Cloud / Web 2.0 29

Extension of the Cloud Space to all Areas LSDFaaS Large Scale Data Facility as a Service HPCaaS High Performance Computing as a Service 30

HPCaaS! High Performance Computing as a Service! Interesting Fields for R&D in Open Cirrus TM! Flexible platform services for HPC customers! Development of MPI services for clouds! Development of scheduling services for clouds! Management of software licenses! Integration of Grid resources: Grid as a Service (GaaS) 31

LSDFaaS! Large Scale Data Facility as a Service! Actual projects at KIT in this field:! Data storage for LHC computing! Data storage for ITER (EUFORIA)! Project ANKA (synchrotron radiation source)! Activities in materials research! Long-term data filing due to legal requirements! Development of big data services 32

Big Data! Interesting applications are data hungry! The data grows over time! The data is immobile! 100 TB @ 1Gbps ~= 10 days! Compute comes to the data! Big Data clusters are the new libraries (J. Campbell, et al., Intel Research Pittsburgh, 2007) The value of a cluster is its data 33

Tashi High-Level Design http://wiki.apache.org/incubator/tashiproposal Most decisions happen in the scheduler; manages compute/storage in concert Scheduler Cluster Manager Node Node Node Services are instantiated through virtual machines Virtualization Service Storage Service Node Node Data location information is exposed to scheduler and services The storage service aggregates the capacity of the commodity nodes to house Big Data repositories. Node 34 CM maintains databases and routes messages; decision logic is limited Cluster nodes are assumed to be commodity machines

Tashi Software Architecture Centralized cluster administration Resource DB VM instance DB Node Manager DB Site Specific Plugin(s) Resource Telemetry Service Client Client API Cluster Manager (CM) Scheduling Agent Data Location Service CM-NM API Pub/Sub System VM VM VM VM Legend Tashi component non-tashi component Compute node Resource Controller Plugins (VMM, DFS, power, etc.) Node Manager (NM) nmd system software Sensor Plugins VMM iptables /vlan DFS DFS Metadata Server 35

Tashi is both! An open source software project! http://incubator.apache.org/tashi/! The implementation is intended to become worthy of production use.! Alpha deployment running on OpenCirrus TM cluster at Intel Research Pittsburgh since October 2008.! An open research project! http://www.pittsburgh.intel-research.net/projects/tashi/! Key question: How should compute, storage, and power be managed in a Big Data cluster to optimize for performance, energy, and fault-tolerance?! Initial sponsors include:! Intel Research Pittsburgh! Carnegie Mellon University! Yahoo! 36

The Way to Cloud Nirvana Source: rpath! The roadmap for cloud services! Leads to dynamic data centers! Ranges from infrastructure services to dynamic applications! Complements traditional IT services in the medium term 37

Summary! Cloud computing is the next big thing! Flexible and elastic resource provisioning! Economy of scale makes it attractive! Move from manufacture towards industrialization of IT (Everything as a Service)! OpenCirrus TM offers interesting R&D opportunities! Cloud systems development! Cloud application development! Accepting research proposals soon! OpenCirrus TM workshop at HP Palo Alto on June 8/9 38

Karlsruhe Institute of Technology Steinbuch Centre for Computing (SCC) Thank you for your attention. 39