Going Places with Distributed Computing

Size: px
Start display at page:

Download "Going Places with Distributed Computing"

Transcription

1 Going Places with Distributed Computing ESC Grid Workshop 2007 Kate Keahey University of Chicago Argonne National Laboratory

2 There is more to Grid applications than running simple batch computations

3 Galaxies Collide on the I-WAY I-WAY (SC95) Colliding Galaxies Co-scheduling components Tightly coupled simulation distributed over NCSA, PSC, SDSC Multiple components Dark matter Gas Visualization We never managed to do this again

4 Functional MRI (fmri) Large datasets 90,000 volumes / study 100s of studies Wide range of analyses Testing, production runs Data mining Ensemble, Parameter studies

5 Moral Hazard 11/29/07, ESAC Grid Workshop Kate Keahey

6 A Different Kind of Complexity Complex experimental application codes Developed over more than 10 years, by more than 100 scientists, comprises ~2 M lines of C++ and Fortran code Require complex, customized environments Rely heavily on the right combination of compiler versions and available libraries Dynamically load external libraries depending on the task to be performed Environment validation To ensure reproducibility and result uniformity across environments

7 Critical Applications The National Fusion Collaboratory: Grids for Experimental Science 11/29/07, ESAC Grid Workshop Kate Keahey

8 Urgent Resource Requirements Applications that require resources in response to urgent needs Event tracking, simulation and prediction, etc. Unexpected

9 Combating Complexity

10 The Swift System Clean separation of logical/physical concerns Logical Concerns XDTM specification of logical data structures Concise specification of parallel programs (SwiftScript, with iteration, etc.) Rigorous provenance tracking and query: Virtual data schema & automated recording Physical Concerns Efficient execution on distributed resources: Karajan threading, Falkon provisioning, Globus interfaces, pipelining, load balancing Improved usability and productivity Demonstrated in numerous applications

11 Swift Architecture Specification Scheduling Execution Provisioning Abstract computation Execution Engine (Karajan w/ Swift Runtime) Virtual Node(s) file1 Resource Provisioners Falkon/VWS SwiftScript Compiler Virtual Data Catalog SwiftScript C C C Provenance collector C Swift runtime callouts Status reporting Provenance data launcher Provenance data launcher App F1 file2 App F2 file3 Amazon EC2 Yong Zhao, Mihael Hatigan, Ioan Raicu, Mike Wilde, Ben Clifford

12 Provisioning Resources Condor Glideins Provision resources for a Condor pool by gliding in Condor daemons as jobs through a GRAM interface MyCluster Deploys personal clusters (SGE or Condor) on Teragrid resources. Unlike Condor Glideins, each user gets their own cluster, instead of having resources added to an existing pool Falkon Deploys a daemon that is specially optimized for scheduling lightweight tasks, such as those found in workflows.

13 The Underpinnings: Resource Provisioning as Central Abstraction for Grid Computing

14 Of Jobs and Resources Job management vs resource provisioning The focus of infrastructure available today it to schedule jobs Resources are provisioned as a side-effect of job deployment We need general-purpose leases Associated with the required environment With a well-defined resource quota and availability General-purpose Adaptable to many interactions: managed by customized schedulers, log in via ssh, etc. Building on the side-effect Request a resource quota Once obtained, prepare the environment and make it available via some required method

15 Two Things to Consider Lease Terms Expressive enough so that we can define a variety of leases Best-effort, advance reservations (including immediate leases), periodic, preemptible/non-preemptible, etc. A description of the required resources: leasing CPUs, storage An environment Making lease terms explicit Standards (OGF): WS-Agreement, JSDL Enforcement Corresponding to lease terms Enforcing a large set of the desired features Enforcing them efficiently

16 Virtual Workspaces A dynamically provisioned environment Environment definition: we get exactly the (software) environment we need on demand. Resource allocation: Provision the resources the workspace needs (CPUs, memory, disk, bandwidth, availability), allowing for dynamic renegotiation to reflect changing requirements and conditions. Implementation Traditional means: publishing, automated configuration, coarse-grained enforcement Virtual Machines: encapsulated configuration and fine-grained enforcement Paper: Virtual Workspaces: Achieving Quality of Service and Quality of Life in the Grid

17 The Virtues of Virtualization App App Guest OS (Linux) App Guest OS (NetBSD) Virtual Machine Monitor (VMM) / Hypervisor Hardware App Guest OS (Windows) VM VM VM App Parallels Xen VMWare UML KVM Bring your environment with you Excellent enforcement and isolation Fast to deploy, enables short-term leasing etc. Have a performance impact but it is acceptable for most modern hypervisors Suspend/resume, migration

18 Deploying Workspaces Remotely VWS Service Workspace -Workspace metadata -Pointer to the image -Logistics information -Deployment request -CPU, memory, count, etc. Virtual Workspace implementation

19 Interacting with Workspaces The workspace service publishes information on each workspace as standard WSRF Resource Properties. Users can query those properties to find out information about their workspace (e.g. what IP the workspace was bound to) VWS Service Users can interact directly with their workspaces the same way the would with a physical machine. Trusted Computing Base (TCB)

20 Cloud Computing

21 Virtual Workspaces for STAR STAR image configuration A virtual cluster composed of an OSG head and STAR worker s Using the workspace service over EC2 to provision resources Allocations of up to 100 s Dynamically contextualized for out-of-the-box cluster

22 with thanks to Jerome Lauret and Doug Olson of the STAR project Running Running Runningjobs jobs jobs:::: Running jobs 230 VWS/EC2 Running Running Runningjobs jobs jobs:::: Running jobs 300 PDSF WSU Running Running Runningjobs jobs jobs:::: Running jobs 150 Fermi Job Completion : 11/29/07, ESAC Grid Workshop BNL File Recovery : Kate Keahey Running Runningjobs jobs::: Running jobs 50

23 with thanks to Jerome Lauret and Doug Olson of the STAR project with thanks to Jerome Lauret and Doug Olson of the STAR project Nersc PDSF EC2 (via Workspace Service) WSU Accelerated display of a workflow job state Y = job number, X = job state

24 Can I Do It at Home? Challenge: how can I provide a cloud using virtualization without disrupting the current operation of my cluster? Flying Low: the Workspace Pilot Integrates with popular LRMs (such as PBS) Implements best effort leases Glidein approach: submits a pilot program that claims a resource slot Provides administrator tools Deployed on UC TeraPort

25 Workspace Pilot in Action Level 2: provision VMs Level 1: provision raw resources VWS VM VM Xen dom0 VM VM LRM/PBS Xen dom0 Xen dom0

26 Division of Labor: Decoupling Leasing and Platform Management

27 A Word from the Expert The greatest improvements in the productive powers of labour, and the greater part of the skill, dexterity, and judgment with which it is anywhere directed, or applied, seem to have been the effects of the division of labour. (Adam Smith) 11/29/07, ESAC Grid Workshop Kate Keahey

28 What Does It Mean to Provide Resources? Environment Environment Hardware Software management packages: e.g., COD, Bcfg, Pacman (requires privilege or can deal only with upper layers of software) Hardware Virtual Appliance VMM Hardware Providing isolation via e.g. Xen, KVM, Vmware, Vserver (different levels of encapsulation)

29 Where Do Appliances Come From? Marketplaces (VMWare, EC2, Workspace ) Appliance Provider (a user, a VO, a Grid ) appliance description Good but: ease-of-use? maintenance? all those formats?

30 Where Do Appliances Come From? Marketplaces (VMWare, EC2, Workspace ) Appliance Management Software (OSFarm, rpath, CohesiveFT )) appliance description Xen VMware CDROM Appliance Provider (a user, a VO, a Grid ) Better!

31 Deploying Appliances Appliances are portable It can be reused and customized to many contexts Making the appliance contextaware: Other appliances Site-specific information (e.g. a DNS server) User/group/VO/Grid-specific information (e.g. public keys, host certs, gridmapfiles, etc.) Security issues Who do I trust to provide legitimate context information? How do I make sure that appliances adhere to my site policies? VM VM VM VM site Virtual Organization

32 Where Do Appliances Come From? Marketplaces (VMWare, EC2, Workspace ) Appliance Management Software (OSFarm, rpath, CohesiveFT )) appliance description Xen VMware CDROM appliance assertions appliance contextualization Appliance Provider (a user, a VO, a Grid )

33 Contextualization Challenge: Putting a VM in the deployment context of the Grid, site, and other VMs Assigning and sharing IP addresses, name resolution, application-level configuration, etc. Solution: Management of Common Context Common Context IP hostname pk contextualization agent Configuration-dependent provides&requires Common understanding between the image vendor and deployer Mechanisms for securely delivering the required information to images across different implementations Paper: A Scalable Approach To Deploying And Managing Appliances, TeraGrid conference 2007

34 Workspace Ecosystem Appliance Providers: Virtual Organizations, groups or individual users via OSFarm, rpath, CohesiveFT, bcfg2, etc. Resource Providers: Local clusters, Grid resource providers (TeraGrid, OSG) Commercial providers: EC2, Sun, slicehost, Provisioning a resource, not a platform Middleware: appliances --> resources manage secure appliance deployment Combining networks and storage VWS EC2

35 Making Leases Cost-Effective

36 What Can We Afford? Enforceable == cost-effective Can we afford to provide desirable features? advance reservation semantics, preemption for urgent computing, renegotiation, etc. Typically utilization is a problem: draining Preemption could help but has issues Porting to use specific software required VMs to the rescue Suspend/resume, migration Policies, features, etc. Thesis work by Borja Sotomayor

37 The Draining Problem Node 3 Node 2 Node 1 SHORT-TERM LEASE With pre-emption Node 3 Node 2 Node 1 SHORT-TERM LEASE Without pre-emption

38 Combining Leasing and Job Management Lease Requests Job Requests Best-Effort Lease Requests Lease Manager Events Execution Manager Manages the VMs (image transfers, start, stop, suspend, ( resume Manages the applications running inside the VMs VMM-enabled Worker Nodes

39 Interleaving Soft and Hard Leases Not using VMs (even with backfilling) results in a noticeable hit on runtime. In this case, the scheduler cannot readily start large parallel jobs because of the resource leases. With VMs, these can be started, and suspended before the leases start. Injected leases are short (1h-2h), very frequent (every 4 to 8 hours), large ( cluster (number of s between 1/3 and ½ of the

40 Conclusions Grid applications are demanding Complex workflows, multi-component applications, complex environment requirements, critical execution needs, etc. Higher-level languages to manage locationindependent computing Resource leasing as a central abstraction of Grid computing Enables acquisition of required resources SLAs: making relationships very explicit Short-term leasing an enabling element Environment management: level the playing field for many communities Users are voting with their feet

41 Conclusions (cntd) Towards centralized resource leasing Economies of scale Load-balancing demand Division of labor Appliance providers Resource providers Leveraging diversity of leases -> economy of scale

42 Credits Workspace team: Tim Freeman, Borja Sotomayor Guest appearances Ian Foster, Frank Siebenlist With thanks to many collaborators: Jerome Lauret (STAR, BNL), Doug Olson (STAR, LBNL), Marty Wesley (rpath), Stu Gott (rpath), Ken Van Dine (rpath), Predrag Buncic (Alice, CERN), Haavard Bjerke (CERN), Rick Bradshaw (Bcfg2, ANL), Narayan Desai (Bcfg2, ANL), Duncan Penfold-Brown (Atlas,uvic), Ian Gable (Atlas, uvic), David Grundy (Atlas, uvic), Ti Leggit (University of Chicago), Greg Cross (University of Chicago), Mike Papka (University of Chicago/ANL)

Virtual Workspace Appliances

Virtual Workspace Appliances Virtual Workspace Appliances Tim Freeman, Kate Keahey Supercomputing 2006, Tampa, FL tfreeman@mcs.anl.gov Required Environments Diverse client environment requirements Library versions Application versions

More information

Cloud Computing with Nimbus

Cloud Computing with Nimbus Cloud Computing with Nimbus April 2010 Condor Week Kate Keahey keahey@mcs.anl.gov Nimbus Project University of Chicago Argonne National Laboratory Cloud Computing for Science Environment Complexity Consistency

More information

Cloud Computing for Science

Cloud Computing for Science Cloud Computing for Science August 2009 CoreGrid 2009 Workshop Kate Keahey keahey@mcs.anl.gov Nimbus project lead University of Chicago Argonne National Laboratory Cloud Computing is in the news is it

More information

Cloud Computing with Nimbus

Cloud Computing with Nimbus Cloud Computing with Nimbus March 2009, OGF25 Thilo Kielmann (slides by Kate Keahey keahey@mcs.anl.gov) Nimbus Nimbus goals Allow providers to build clouds Private clouds (privacy, expense considerations)

More information

A Data Diffusion Approach to Large Scale Scientific Exploration

A Data Diffusion Approach to Large Scale Scientific Exploration A Data Diffusion Approach to Large Scale Scientific Exploration Ioan Raicu Distributed Systems Laboratory Computer Science Department University of Chicago Joint work with: Yong Zhao: Microsoft Ian Foster:

More information

Science Clouds: Early Experiences in Cloud Computing for Scientific Applications

Science Clouds: Early Experiences in Cloud Computing for Scientific Applications Science Clouds: Early Experiences in Cloud Computing for Scientific Applications Chicago, October 2008 Kate Keahey, Renato Figueiredo, Jose Fortes, Tim Freeman, Mauricio Tsugawa University of Chicago University

More information

Contextualization: Providing One-Click Virtual Clusters

Contextualization: Providing One-Click Virtual Clusters Contextualization: Providing One-Click Virtual Clusters Kate Keahey Tim Freeman Argonne National Laboratory University of Chicago {keahey, tfreeman}@mcs.anl.gov Cloud Computing Infrastructure-as-a-Service

More information

Managing and Executing Loosely-Coupled Large-Scale Applications on Clusters, Grids, and Supercomputers

Managing and Executing Loosely-Coupled Large-Scale Applications on Clusters, Grids, and Supercomputers Managing and Executing Loosely-Coupled Large-Scale Applications on Clusters, Grids, and Supercomputers Ioan Raicu Distributed Systems Laboratory Computer Science Department University of Chicago Collaborators:

More information

Science Computing Clouds.

Science Computing Clouds. Science Computing Clouds. December 9, 2008 Chan-Hyun Youn School of Engineering/ Grid Middleware Research Center Information and Communications University COPYRIGHT@LANS Lab, Information and Communication

More information

THE UNIVERSITY OF CHICAGO A RESOURCE MANAGEMENT MODEL FOR VM-BASED VIRTUAL WORKSPACES

THE UNIVERSITY OF CHICAGO A RESOURCE MANAGEMENT MODEL FOR VM-BASED VIRTUAL WORKSPACES THE UNIVERSITY OF CHICAGO A RESOURCE MANAGEMENT MODEL FOR VM-BASED VIRTUAL WORKSPACES A PAPER SUBMITTED TO THE FACULTY OF THE DIVISION OF THE PHYSICAL SCIENCES IN CANDIDACY FOR THE DEGREE OF MASTER OF

More information

Flying Low: Simple Leases with Workspace Pilot

Flying Low: Simple Leases with Workspace Pilot Flying Low: Simple Leases with Workspace Pilot Timothy Freeman, Katarzyna Keahey Computation Institute University of Chicago {tfreeman,keahey}@uchicago.edu Abstract. As virtual machines (VMs) are used

More information

Contextualization: Providing One-Click Virtual Clusters

Contextualization: Providing One-Click Virtual Clusters Contextualization: Providing One-Click Virtual Clusters Katarzyna Keahey University of Chicago keahey@mcs.anl.gov Abstract As virtual appliances become more prevalent, we encounter the need to stop manually

More information

Ioan Raicu Distributed Systems Laboratory Computer Science Department University of Chicago

Ioan Raicu Distributed Systems Laboratory Computer Science Department University of Chicago Running 1 Million Jobs in 10 Minutes via the Falkon Fast and Light-weight Ioan Raicu Distributed Systems Laboratory Computer Science Department University of Chicago In Collaboration with: Ian Foster,

More information

Sky Computing on FutureGrid and Grid 5000 with Nimbus. Pierre Riteau Université de Rennes 1, IRISA INRIA Rennes Bretagne Atlantique Rennes, France

Sky Computing on FutureGrid and Grid 5000 with Nimbus. Pierre Riteau Université de Rennes 1, IRISA INRIA Rennes Bretagne Atlantique Rennes, France Sky Computing on FutureGrid and Grid 5000 with Nimbus Pierre Riteau Université de Rennes 1, IRISA INRIA Rennes Bretagne Atlantique Rennes, France Outline Introduction to Sky Computing The Nimbus Project

More information

Large Scale Sky Computing Applications with Nimbus

Large Scale Sky Computing Applications with Nimbus Large Scale Sky Computing Applications with Nimbus Pierre Riteau Université de Rennes 1, IRISA INRIA Rennes Bretagne Atlantique Rennes, France Pierre.Riteau@irisa.fr INTRODUCTION TO SKY COMPUTING IaaS

More information

An Introduction to Virtualization and Cloud Technologies to Support Grid Computing

An Introduction to Virtualization and Cloud Technologies to Support Grid Computing New Paradigms: Clouds, Virtualization and Co. EGEE08, Istanbul, September 25, 2008 An Introduction to Virtualization and Cloud Technologies to Support Grid Computing Distributed Systems Architecture Research

More information

Ioan Raicu Distributed Systems Laboratory Computer Science Department University of Chicago

Ioan Raicu Distributed Systems Laboratory Computer Science Department University of Chicago Falkon, a Fast and Light-weight task execution framework for Clusters, Grids, and Supercomputers Ioan Raicu Distributed Systems Laboratory Computer Science Department University of Chicago In Collaboration

More information

Ioan Raicu. Everyone else. More information at: Background? What do you want to get out of this course?

Ioan Raicu. Everyone else. More information at: Background? What do you want to get out of this course? Ioan Raicu More information at: http://www.cs.iit.edu/~iraicu/ Everyone else Background? What do you want to get out of this course? 2 Data Intensive Computing is critical to advancing modern science Applies

More information

Distributed Systems COMP 212. Lecture 18 Othon Michail

Distributed Systems COMP 212. Lecture 18 Othon Michail Distributed Systems COMP 212 Lecture 18 Othon Michail Virtualisation & Cloud Computing 2/27 Protection rings It s all about protection rings in modern processors Hardware mechanism to protect data and

More information

Clouds: An Opportunity for Scientific Applications?

Clouds: An Opportunity for Scientific Applications? Clouds: An Opportunity for Scientific Applications? Ewa Deelman USC Information Sciences Institute Acknowledgements Yang-Suk Ki (former PostDoc, USC) Gurmeet Singh (former Ph.D. student, USC) Gideon Juve

More information

Grid Scheduling Architectures with Globus

Grid Scheduling Architectures with Globus Grid Scheduling Architectures with Workshop on Scheduling WS 07 Cetraro, Italy July 28, 2007 Ignacio Martin Llorente Distributed Systems Architecture Group Universidad Complutense de Madrid 1/38 Contents

More information

ALICE Grid Activities in US

ALICE Grid Activities in US ALICE Grid Activities in US 1 ALICE-USA Computing Project ALICE-USA Collaboration formed to focus on the ALICE EMCal project Construction, installation, testing and integration participating institutions

More information

Dynamic Creation and Management of Runtime Environments in the Grid

Dynamic Creation and Management of Runtime Environments in the Grid Dynamic Creation and Management of Runtime Environments in the Grid Kate Keahey keahey@mcs.anl.gov Matei Ripeanu matei@cs.uchicago.edu Karl Doering kdoering@cs.ucr.edu 1 Introduction Management of complex,

More information

Introduction to Cloud Computing and Virtual Resource Management. Jian Tang Syracuse University

Introduction to Cloud Computing and Virtual Resource Management. Jian Tang Syracuse University Introduction to Cloud Computing and Virtual Resource Management Jian Tang Syracuse University 1 Outline Definition Components Why Cloud Computing Cloud Services IaaS Cloud Providers Overview of Virtual

More information

HPC learning using Cloud infrastructure

HPC learning using Cloud infrastructure HPC learning using Cloud infrastructure Florin MANAILA IT Architect florin.manaila@ro.ibm.com Cluj-Napoca 16 March, 2010 Agenda 1. Leveraging Cloud model 2. HPC on Cloud 3. Recent projects - FutureGRID

More information

Grid Programming: Concepts and Challenges. Michael Rokitka CSE510B 10/2007

Grid Programming: Concepts and Challenges. Michael Rokitka CSE510B 10/2007 Grid Programming: Concepts and Challenges Michael Rokitka SUNY@Buffalo CSE510B 10/2007 Issues Due to Heterogeneous Hardware level Environment Different architectures, chipsets, execution speeds Software

More information

Storage Virtualization. Eric Yen Academia Sinica Grid Computing Centre (ASGC) Taiwan

Storage Virtualization. Eric Yen Academia Sinica Grid Computing Centre (ASGC) Taiwan Storage Virtualization Eric Yen Academia Sinica Grid Computing Centre (ASGC) Taiwan Storage Virtualization In computer science, storage virtualization uses virtualization to enable better functionality

More information

VC3. Virtual Clusters for Community Computation. DOE NGNS PI Meeting September 27-28, 2017

VC3. Virtual Clusters for Community Computation. DOE NGNS PI Meeting September 27-28, 2017 VC3 Virtual Clusters for Community Computation DOE NGNS PI Meeting September 27-28, 2017 Douglas Thain, University of Notre Dame Rob Gardner, University of Chicago John Hover, Brookhaven National Lab A

More information

Virtualization. A very short summary by Owen Synge

Virtualization. A very short summary by Owen Synge Virtualization A very short summary by Owen Synge Outline What is Virtulization? What's virtulization good for? What's virtualisation bad for? We had a workshop. What was presented? What did we do with

More information

Problems for Resource Brokering in Large and Dynamic Grid Environments

Problems for Resource Brokering in Large and Dynamic Grid Environments Problems for Resource Brokering in Large and Dynamic Grid Environments Cătălin L. Dumitrescu Computer Science Department The University of Chicago cldumitr@cs.uchicago.edu (currently at TU Delft) Kindly

More information

Monitoring Grid Virtual Machine deployments

Monitoring Grid Virtual Machine deployments University of Victoria Faculty of Engineering Fall 2008 Work Term Report Monitoring Grid Virtual Machine deployments Department of Physics University of Victoria Victoria, BC Michael Paterson 0031209 Work

More information

Extreme-scale scripting: Opportunities for large taskparallel applications on petascale computers

Extreme-scale scripting: Opportunities for large taskparallel applications on petascale computers Extreme-scale scripting: Opportunities for large taskparallel applications on petascale computers Michael Wilde, Ioan Raicu, Allan Espinosa, Zhao Zhang, Ben Clifford, Mihael Hategan, Kamil Iskra, Pete

More information

On the Use of Cloud Computing for Scientific Workflows

On the Use of Cloud Computing for Scientific Workflows On the Use of Cloud Computing for Scientific Workflows Christina Hoffa 1, Gaurang Mehta 2, Timothy Freeman 3, Ewa Deelman 2, Kate Keahey 3, Bruce Berriman 4, John Good 4 1 Indiana University, 2 University

More information

STATUS OF PLANS TO USE CONTAINERS IN THE WORLDWIDE LHC COMPUTING GRID

STATUS OF PLANS TO USE CONTAINERS IN THE WORLDWIDE LHC COMPUTING GRID The WLCG Motivation and benefits Container engines Experiments status and plans Security considerations Summary and outlook STATUS OF PLANS TO USE CONTAINERS IN THE WORLDWIDE LHC COMPUTING GRID SWISS EXPERIENCE

More information

Cloud Computing the VMware Perspective. Bogomil Balkansky Product Marketing

Cloud Computing the VMware Perspective. Bogomil Balkansky Product Marketing Cloud Computing the VMware Perspective Bogomil Balkansky Product Marketing Cloud Computing - the Key Questions What is it? Why do you need it? How do you build (or leverage) one (or many)? How do you operate

More information

AutoPyFactory: A Scalable Flexible Pilot Factory Implementation

AutoPyFactory: A Scalable Flexible Pilot Factory Implementation ATL-SOFT-PROC-2012-045 22 May 2012 Not reviewed, for internal circulation only AutoPyFactory: A Scalable Flexible Pilot Factory Implementation J. Caballero 1, J. Hover 1, P. Love 2, G. A. Stewart 3 on

More information

Virtualization. Dr. Yingwu Zhu

Virtualization. Dr. Yingwu Zhu Virtualization Dr. Yingwu Zhu Virtualization Definition Framework or methodology of dividing the resources of a computer into multiple execution environments. Types Platform Virtualization: Simulate a

More information

Chapter 5 C. Virtual machines

Chapter 5 C. Virtual machines Chapter 5 C Virtual machines Virtual Machines Host computer emulates guest operating system and machine resources Improved isolation of multiple guests Avoids security and reliability problems Aids sharing

More information

Leveraging Software-Defined Storage to Meet Today and Tomorrow s Infrastructure Demands

Leveraging Software-Defined Storage to Meet Today and Tomorrow s Infrastructure Demands Leveraging Software-Defined Storage to Meet Today and Tomorrow s Infrastructure Demands Unleash Your Data Center s Hidden Power September 16, 2014 Molly Rector CMO, EVP Product Management & WW Marketing

More information

Scientific Computing on Emerging Infrastructures. using HTCondor

Scientific Computing on Emerging Infrastructures. using HTCondor Scientific Computing on Emerging Infrastructures using HT HT Week, 20th May 2015 University of California, San Diego 1 Scientific Computing LHC probes nature at 10-17cm Weak Scale Scientific instruments:

More information

Huawei FusionCloud Desktop Solution 5.1 Resource Reuse Technical White Paper HUAWEI TECHNOLOGIES CO., LTD. Issue 01.

Huawei FusionCloud Desktop Solution 5.1 Resource Reuse Technical White Paper HUAWEI TECHNOLOGIES CO., LTD. Issue 01. Huawei FusionCloud Desktop Solution 5.1 Resource Reuse Technical White Paper Issue 01 Date 2014-03-26 HUAWEI TECHNOLOGIES CO., LTD. 2014. All rights reserved. No part of this document may be reproduced

More information

Data publication and discovery with Globus

Data publication and discovery with Globus Data publication and discovery with Globus Questions and comments to outreach@globus.org The Globus data publication and discovery services make it easy for institutions and projects to establish collections,

More information

EGEE and Interoperation

EGEE and Interoperation EGEE and Interoperation Laurence Field CERN-IT-GD ISGC 2008 www.eu-egee.org EGEE and glite are registered trademarks Overview The grid problem definition GLite and EGEE The interoperability problem The

More information

What is Cloud Computing? Cloud computing is the dynamic delivery of IT resources and capabilities as a Service over the Internet.

What is Cloud Computing? Cloud computing is the dynamic delivery of IT resources and capabilities as a Service over the Internet. 1 INTRODUCTION What is Cloud Computing? Cloud computing is the dynamic delivery of IT resources and capabilities as a Service over the Internet. Cloud computing encompasses any Subscriptionbased or pay-per-use

More information

glideinwms architecture by Igor Sfiligoi, Jeff Dost (UCSD)

glideinwms architecture by Igor Sfiligoi, Jeff Dost (UCSD) glideinwms architecture by Igor Sfiligoi, Jeff Dost (UCSD) Outline A high level overview of the glideinwms Description of the components 2 glideinwms from 10k feet 3 Refresher - HTCondor A Condor pool

More information

CIS : Computational Reproducibility

CIS : Computational Reproducibility CIS 602-01: Computational Reproducibility Containers Dr. David Koop Virtual Machines Software Abstraction - Behaves like hardware - Encapsulates all OS and application state Virtualization Layer - Extra

More information

30 Nov Dec Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy

30 Nov Dec Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy Why the Grid? Science is becoming increasingly digital and needs to deal with increasing amounts of

More information

CLOUD COMPUTING IT0530. G.JEYA BHARATHI Asst.Prof.(O.G) Department of IT SRM University

CLOUD COMPUTING IT0530. G.JEYA BHARATHI Asst.Prof.(O.G) Department of IT SRM University CLOUD COMPUTING IT0530 G.JEYA BHARATHI Asst.Prof.(O.G) Department of IT SRM University What is virtualization? Virtualization is way to run multiple operating systems and user applications on the same

More information

CS 470 Spring Virtualization and Cloud Computing. Mike Lam, Professor. Content taken from the following:

CS 470 Spring Virtualization and Cloud Computing. Mike Lam, Professor. Content taken from the following: CS 470 Spring 2018 Mike Lam, Professor Virtualization and Cloud Computing Content taken from the following: A. Silberschatz, P. B. Galvin, and G. Gagne. Operating System Concepts, 9 th Edition (Chapter

More information

OpenNebula on VMware: Cloud Reference Architecture

OpenNebula on VMware: Cloud Reference Architecture OpenNebula on VMware: Cloud Reference Architecture Version 1.2, October 2016 Abstract The OpenNebula Cloud Reference Architecture is a blueprint to guide IT architects, consultants, administrators and

More information

NVIDIA DGX SYSTEMS PURPOSE-BUILT FOR AI

NVIDIA DGX SYSTEMS PURPOSE-BUILT FOR AI NVIDIA DGX SYSTEMS PURPOSE-BUILT FOR AI Overview Unparalleled Value Product Portfolio Software Platform From Desk to Data Center to Cloud Summary AI researchers depend on computing performance to gain

More information

Dynamic Resource Allocation on Virtual Machines

Dynamic Resource Allocation on Virtual Machines Dynamic Resource Allocation on Virtual Machines Naveena Anumala VIT University, Chennai 600048 anumala.naveena2015@vit.ac.in Guide: Dr. R. Kumar VIT University, Chennai -600048 kumar.rangasamy@vit.ac.in

More information

High Throughput Urgent Computing

High Throughput Urgent Computing Condor Week 2008 High Throughput Urgent Computing Jason Cope jason.cope@colorado.edu Project Collaborators Argonne National Laboratory / University of Chicago Pete Beckman Suman Nadella Nick Trebon University

More information

How it can help your organisation

How it can help your organisation How it can help your organisation History Types of Virtualisation & Hypervisors Virtualisation Features Why Virtualisation? Virtualising Oracle Performance Licensing Support Cloud 1998 VMware founded by

More information

Cluster Abstraction: towards Uniform Resource Description and Access in Multicluster Grid

Cluster Abstraction: towards Uniform Resource Description and Access in Multicluster Grid Cluster Abstraction: towards Uniform Resource Description and Access in Multicluster Grid Maoyuan Xie, Zhifeng Yun, Zhou Lei, Gabrielle Allen Center for Computation & Technology, Louisiana State University,

More information

Scheduling Computational and Storage Resources on the NRP

Scheduling Computational and Storage Resources on the NRP Scheduling Computational and Storage Resources on the NRP Rob Gardner Dima Mishin University of Chicago UCSD Second NRP Workshop Montana State University August 6-7, 2018 slides: http://bit.ly/nrp-scheduling

More information

Operating Systems 4/27/2015

Operating Systems 4/27/2015 Virtualization inside the OS Operating Systems 24. Virtualization Memory virtualization Process feels like it has its own address space Created by MMU, configured by OS Storage virtualization Logical view

More information

Science Clouds: Early Experiences in Cloud Computing for Scientific Applications

Science Clouds: Early Experiences in Cloud Computing for Scientific Applications Science Clouds: Early Experiences in Cloud Computing for Scientific Applications K. Keahey 1, R. Figueiredo 2, J. Fortes 2, T. Freeman 1, M. Tsugawa 2 1 University of Chicago, 2 University of Florida Abstract

More information

Cloud Computing Lecture 4

Cloud Computing Lecture 4 Cloud Computing Lecture 4 1/17/2012 What is Hypervisor in Cloud Computing and its types? The hypervisor is a virtual machine monitor (VMM) that manages resources for virtual machines. The name hypervisor

More information

Linux Automation.

Linux Automation. Linux Automation Using Red Hat Enterprise Linux to extract maximum value from IT infrastructure www.redhat.com Table of contents Summary statement Page 3 Background Page 4 Creating a more efficient infrastructure:

More information

High Throughput WAN Data Transfer with Hadoop-based Storage

High Throughput WAN Data Transfer with Hadoop-based Storage High Throughput WAN Data Transfer with Hadoop-based Storage A Amin 2, B Bockelman 4, J Letts 1, T Levshina 3, T Martin 1, H Pi 1, I Sfiligoi 1, M Thomas 2, F Wuerthwein 1 1 University of California, San

More information

Integration of Cloud and Grid Middleware at DGRZR

Integration of Cloud and Grid Middleware at DGRZR D- of International Symposium on Computing 2010 Stefan Freitag Robotics Research Institute Dortmund University of Technology March 12, 2010 Overview D- 1 D- Resource Center Ruhr 2 Clouds in the German

More information

What s new in HTCondor? What s coming? HTCondor Week 2018 Madison, WI -- May 22, 2018

What s new in HTCondor? What s coming? HTCondor Week 2018 Madison, WI -- May 22, 2018 What s new in HTCondor? What s coming? HTCondor Week 2018 Madison, WI -- May 22, 2018 Todd Tannenbaum Center for High Throughput Computing Department of Computer Sciences University of Wisconsin-Madison

More information

PROOF-Condor integration for ATLAS

PROOF-Condor integration for ATLAS PROOF-Condor integration for ATLAS G. Ganis,, J. Iwaszkiewicz, F. Rademakers CERN / PH-SFT M. Livny, B. Mellado, Neng Xu,, Sau Lan Wu University Of Wisconsin Condor Week, Madison, 29 Apr 2 May 2008 Outline

More information

Corral: A Glide-in Based Service for Resource Provisioning

Corral: A Glide-in Based Service for Resource Provisioning : A Glide-in Based Service for Resource Provisioning Gideon Juve USC Information Sciences Institute juve@usc.edu Outline Throughput Applications Grid Computing Multi-level scheduling and Glideins Example:

More information

Overview Demo Claudia OpenNebula

Overview Demo Claudia OpenNebula 1 Overview Demo Claudia OpenNebula RESERVOIR Reference Architecture 2 Virtual Execution Environment Manager Service Manager VMI Client Policy Engine Remote VEEMs OpenNebula Monitoring VEE Hosts 3 VEEH

More information

Magellan Project. Jeff Broughton NERSC Systems Department Head October 7, 2009

Magellan Project. Jeff Broughton NERSC Systems Department Head October 7, 2009 Magellan Project Jeff Broughton NERSC Systems Department Head October 7, 2009 1 Magellan Background National Energy Research Scientific Computing Center (NERSC) Argonne Leadership Computing Facility (ALCF)

More information

On-demand provisioning of HEP compute resources on cloud sites and shared HPC centers

On-demand provisioning of HEP compute resources on cloud sites and shared HPC centers On-demand provisioning of HEP compute resources on cloud sites and shared HPC centers CHEP 2016 - San Francisco, United States of America Gunther Erli, Frank Fischer, Georg Fleig, Manuel Giffels, Thomas

More information

Table of Contents 1.1. Introduction. Overview of vsphere Integrated Containers 1.2

Table of Contents 1.1. Introduction. Overview of vsphere Integrated Containers 1.2 Table of Contents Introduction Overview of vsphere Integrated Containers 1.1 1.2 2 Overview of vsphere Integrated Containers This document provides an overview of VMware vsphere Integrated Containers.

More information

BUILDING A PRIVATE CLOUD. By Mark Black Jay Muelhoefer Parviz Peiravi Marco Righini

BUILDING A PRIVATE CLOUD. By Mark Black Jay Muelhoefer Parviz Peiravi Marco Righini BUILDING A PRIVATE CLOUD By Mark Black Jay Muelhoefer Parviz Peiravi Marco Righini HOW PLATFORM COMPUTING'S PLATFORM ISF AND INTEL'S TRUSTED EXECUTION TECHNOLOGY CAN HELP 24 loud computing is a paradigm

More information

Virtualization. ...or how adding another layer of abstraction is changing the world. CIS 399: Unix Skills University of Pennsylvania.

Virtualization. ...or how adding another layer of abstraction is changing the world. CIS 399: Unix Skills University of Pennsylvania. Virtualization...or how adding another layer of abstraction is changing the world. CIS 399: Unix Skills University of Pennsylvania April 6, 2009 (CIS 399 Unix) Virtualization April 6, 2009 1 / 22 What

More information

University of Alberta. Zhu Pang. Master of Science. Department of Computing Science

University of Alberta. Zhu Pang. Master of Science. Department of Computing Science University of Alberta HIGH PERFORMANCE LIVE MIGRATION OVER LOW-BANDWIDTH, HIGH-DELAY NETWORK WITH LOSS PREVENTION by Zhu Pang A thesis submitted to the Faculty of Graduate Studies and Research in partial

More information

Distributed File System Support for Virtual Machines in Grid Computing

Distributed File System Support for Virtual Machines in Grid Computing Distributed File System Support for Virtual Machines in Grid Computing Ming Zhao, Jian Zhang, Renato Figueiredo Advanced Computing and Information Systems Electrical and Computer Engineering University

More information

Resources and Services Virtualization without Boundaries (ReSerVoir)

Resources and Services Virtualization without Boundaries (ReSerVoir) Resources and Services Virtualization without Boundaries (ReSerVoir) Benny Rochwerger April 14, 2008 IBM Labs in Haifa The Evolution of the Power Grid The Burden Iron Works Water Wheel http://w w w.rootsw

More information

Easy Access to Grid Infrastructures

Easy Access to Grid Infrastructures Easy Access to Grid Infrastructures Dr. Harald Kornmayer (NEC Laboratories Europe) On behalf of the g-eclipse consortium WP11 Grid Workshop Grenoble, France 09 th of December 2008 Background in astro particle

More information

Case Studies in Storage Access by Loosely Coupled Petascale Applications

Case Studies in Storage Access by Loosely Coupled Petascale Applications Case Studies in Storage Access by Loosely Coupled Petascale Applications Justin M Wozniak and Michael Wilde Petascale Data Storage Workshop at SC 09 Portland, Oregon November 15, 2009 Outline Scripted

More information

70-247: Configuring and Deploying a Private Cloud with System Center 2012

70-247: Configuring and Deploying a Private Cloud with System Center 2012 70-247: Configuring and Deploying a Private Cloud with System Center 2012 Module 01 - Understanding the Private Cloud Lesson 1: Understanding the Private Cloud Cloud Comparisons Comparing the Private and

More information

The Materials Data Facility

The Materials Data Facility The Materials Data Facility Ben Blaiszik (blaiszik@uchicago.edu), Kyle Chard (chard@uchicago.edu) Ian Foster (foster@uchicago.edu) materialsdatafacility.org What is MDF? We aim to make it simple for materials

More information

Data Centers and Cloud Computing

Data Centers and Cloud Computing Data Centers and Cloud Computing CS677 Guest Lecture Tim Wood 1 Data Centers Large server and storage farms 1000s of servers Many TBs or PBs of data Used by Enterprises for server applications Internet

More information

g-eclipse A Framework for Accessing Grid Infrastructures Nicholas Loulloudes Trainer, University of Cyprus (loulloudes.n_at_cs.ucy.ac.

g-eclipse A Framework for Accessing Grid Infrastructures Nicholas Loulloudes Trainer, University of Cyprus (loulloudes.n_at_cs.ucy.ac. g-eclipse A Framework for Accessing Grid Infrastructures Trainer, University of Cyprus (loulloudes.n_at_cs.ucy.ac.cy) EGEE Training the Trainers May 6 th, 2009 Outline Grid Reality The Problem g-eclipse

More information

Pegasus Workflow Management System. Gideon Juve. USC Informa3on Sciences Ins3tute

Pegasus Workflow Management System. Gideon Juve. USC Informa3on Sciences Ins3tute Pegasus Workflow Management System Gideon Juve USC Informa3on Sciences Ins3tute Scientific Workflows Orchestrate complex, multi-stage scientific computations Often expressed as directed acyclic graphs

More information

Managing large-scale workflows with Pegasus

Managing large-scale workflows with Pegasus Funded by the National Science Foundation under the OCI SDCI program, grant #0722019 Managing large-scale workflows with Pegasus Karan Vahi ( vahi@isi.edu) Collaborative Computing Group USC Information

More information

A Cloud-based Dynamic Workflow for Mass Spectrometry Data Analysis

A Cloud-based Dynamic Workflow for Mass Spectrometry Data Analysis A Cloud-based Dynamic Workflow for Mass Spectrometry Data Analysis Ashish Nagavaram, Gagan Agrawal, Michael A. Freitas, Kelly H. Telu The Ohio State University Gaurang Mehta, Rajiv. G. Mayani, Ewa Deelman

More information

Virtualization. Michael Tsai 2018/4/16

Virtualization. Michael Tsai 2018/4/16 Virtualization Michael Tsai 2018/4/16 What is virtualization? Let s first look at a video from VMware http://www.vmware.com/tw/products/vsphere.html Problems? Low utilization Different needs DNS DHCP Web

More information

CHAPTER 2 LITERATURE REVIEW AND BACKGROUND

CHAPTER 2 LITERATURE REVIEW AND BACKGROUND 8 CHAPTER 2 LITERATURE REVIEW AND BACKGROUND 2.1 LITERATURE REVIEW Several researches have been carried out in Grid Resource Management and some of the existing research works closely related to this thesis

More information

Multiprocessor Scheduling. Multiprocessor Scheduling

Multiprocessor Scheduling. Multiprocessor Scheduling Multiprocessor Scheduling Will consider only shared memory multiprocessor or multi-core CPU Salient features: One or more caches: cache affinity is important Semaphores/locks typically implemented as spin-locks:

More information

Taking your next integration or BPM project to the cloud WebSphere Integration User Group, 12 July 2012 IBM Hursley

Taking your next integration or BPM project to the cloud WebSphere Integration User Group, 12 July 2012 IBM Hursley Mark Tomlinson CTO, Cloud Computing, IBM UK & Ireland Taking your next integration or BPM project to the cloud WebSphere Integration User Group, 12 July 2012 IBM Hursley Today s organizations strive to

More information

Hyper-Converged Infrastructure: Providing New Opportunities for Improved Availability

Hyper-Converged Infrastructure: Providing New Opportunities for Improved Availability Hyper-Converged Infrastructure: Providing New Opportunities for Improved Availability IT teams in companies of all sizes face constant pressure to meet the Availability requirements of today s Always-On

More information

Modern Data Warehouse The New Approach to Azure BI

Modern Data Warehouse The New Approach to Azure BI Modern Data Warehouse The New Approach to Azure BI History On-Premise SQL Server Big Data Solutions Technical Barriers Modern Analytics Platform On-Premise SQL Server Big Data Solutions Modern Analytics

More information

Deploying virtualisation in a production grid

Deploying virtualisation in a production grid Deploying virtualisation in a production grid Stephen Childs Trinity College Dublin & Grid-Ireland TERENA NRENs and Grids workshop 2 nd September 2008 www.eu-egee.org EGEE and glite are registered trademarks

More information

OS Virtualization. Linux Containers (LXC)

OS Virtualization. Linux Containers (LXC) OS Virtualization Emulate OS-level interface with native interface Lightweight virtual machines No hypervisor, OS provides necessary support Referred to as containers Solaris containers, BSD jails, Linux

More information

Amazon EC2 Container Service: Manage Docker-Enabled Apps in EC2

Amazon EC2 Container Service: Manage Docker-Enabled Apps in EC2 Amazon EC2 Container Service: Manage Docker-Enabled Apps in EC2 Ian Massingham AWS Technical Evangelist @IanMmmm 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved Agenda Containers

More information

Nested Virtualization and Server Consolidation

Nested Virtualization and Server Consolidation Nested Virtualization and Server Consolidation Vara Varavithya Department of Electrical Engineering, KMUTNB varavithya@gmail.com 1 Outline Virtualization & Background Nested Virtualization Hybrid-Nested

More information

STREAMLINING THE DELIVERY, PROTECTION AND MANAGEMENT OF VIRTUAL DESKTOPS. VMware Workstation and Fusion. A White Paper for IT Professionals

STREAMLINING THE DELIVERY, PROTECTION AND MANAGEMENT OF VIRTUAL DESKTOPS. VMware Workstation and Fusion. A White Paper for IT Professionals WHITE PAPER NOVEMBER 2016 STREAMLINING THE DELIVERY, PROTECTION AND MANAGEMENT OF VIRTUAL DESKTOPS VMware Workstation and Fusion A White Paper for IT Professionals Table of Contents Overview 3 The Changing

More information

Complete Data Protection & Disaster Recovery Solution

Complete Data Protection & Disaster Recovery Solution Complete Data Protection & Disaster Recovery Solution Quadric Software 2015 We were looking at other solutions. Alike was the best with XenServer, and none of them had Alike s compression abilities. After

More information

Easily Managing Hybrid IT with Transformation Technology

Easily Managing Hybrid IT with Transformation Technology White Paper White Paper Managing Public Cloud Computing in the Enterprise Easily Managing Hybrid IT with Transformation Technology A Quick Start Guide 1 Table of Contents Abstract... 3 Traditional Approaches

More information

Globus Platform Services for Data Publication. Greg Nawrocki University of Chicago & Argonne National Lab GeoDaRRS August 7, 2018

Globus Platform Services for Data Publication. Greg Nawrocki University of Chicago & Argonne National Lab GeoDaRRS August 7, 2018 Globus Platform Services for Data Publication Greg Nawrocki greg@globus.org University of Chicago & Argonne National Lab GeoDaRRS August 7, 2018 Outline Globus Overview Globus Data Publication v1 Lessons

More information

Virtualizing Oracle 11g/R2 RAC Database on Oracle VM: Methods/Tips

Virtualizing Oracle 11g/R2 RAC Database on Oracle VM: Methods/Tips Virtualizing Oracle 11g/R2 RAC Database on Oracle VM: Methods/Tips Saar Maoz, RACPack RAC Development, Oracle Kai Yu, Oracle Solutions Engineering, Dell Inc About Authors Saar Maoz Consulting Software

More information

Overview of ATLAS PanDA Workload Management

Overview of ATLAS PanDA Workload Management Overview of ATLAS PanDA Workload Management T. Maeno 1, K. De 2, T. Wenaus 1, P. Nilsson 2, G. A. Stewart 3, R. Walker 4, A. Stradling 2, J. Caballero 1, M. Potekhin 1, D. Smith 5, for The ATLAS Collaboration

More information

CLOUD COMPUTING. Rajesh Kumar. DevOps Architect.

CLOUD COMPUTING. Rajesh Kumar. DevOps Architect. CLOUD COMPUTING Rajesh Kumar DevOps Architect @RajeshKumarIN www.rajeshkumar.xyz www.scmgalaxy.com 1 Session Objectives This session will help you to: Introduction to Cloud Computing Cloud Computing Architecture

More information