Resources and Services Virtualization without Boundaries (ReSerVoir)
Benny Rochwerger, IBM Labs in Haifa
April 14, 2008
The Evolution of the Power Grid

The Burden Iron Works Water Wheel
  http://www.rootsweb.com/~nytigs/burdenpayrollrecords.htm
The Pearl Street Station
  http://ieee-virtual-museum.org/collection/event.php?id=3456876
  http://www.pbase.com/rbenny/image/29116201
The US National Power Grid
  http://www.anl.gov/media_center/logos22-1/electricity.htm

Make your own infrastructure
- Not the company's main business but a considerable competitive advantage
- Limited reach
- Reproducible (yet costly)
The utility industry
- Metering
- Efficient distribution
Federation of providers
- The diversity factor
- Economies of scale

Reservoir @ 7th HLRS Scalable Global File System Workshop
The Evolution of the Compute Grid

RESERVOIR
Google @ The Dalles, OR
Image sources:
  http://www.smcplus.com/applications.asp?id=32
  http://www.informationweek.com/galleries/showImage.jhtml?galleryID=62&imageID=13
  http://www.by-star.net/techspeak/datacenter/

Make your own infrastructure
- Not the company's main business but a considerable competitive advantage
- Limited reach
- Reproducible (yet costly)
The utility industry
- Metering
- Efficient distribution
Federation of providers
- The diversity factor
- Economies of scale
The Evolution of the Compute Grid

"... today's commercial clouds have not been open and general purpose, but instead been mostly proprietary and specialized for the specific internal uses (e.g., large-scale data analysis) of the companies that developed them. The idea that we might want to enable interoperability between providers (as in the electric power grid) has not yet surfaced"

"... will move towards a mix of microproduction and large utilities, with increasing numbers of small-scale producers co-existing with large-scale regional producers, and load being distributed among them dynamically"

"There's Grid in them thar Clouds", Ian Foster
The RESERVOIR Vision
The Next Generation Infrastructure for Service Delivery

Provide a revolutionary foundation for a new European infrastructure where resources and services can be transparently and dynamically managed, provisioned and relocated like utilities, virtually "without borders"
- No single facility/provider can create a seemingly infinite infrastructure capable of serving massive numbers of users at all times, from all locations
- Federation of clouds
  - Leverage the diversity factor to achieve economies of scale
  - Leverage locality
- Analogies exist in areas outside IT services:
  - Electrical power delivery: capacity can be shifted to guarantee supply and lower costs
  - Roaming cellular communications: talk wherever you are

Impact
- Enable utility-like deployment of services, relieving the service consumer from awareness of the IT attributes while assuring QoS and security guarantees
- Aim to create the basis for future service products
Approach
- Focus on technologies that enable building cooperating computing clouds
  - Connect computing clouds to create an even bigger cloud
- Integration of virtualization technologies with grid computing, driven by new techniques for business service management (BSM)

The Service Oriented Infrastructure (SOI) equation:
  Virtualization-Aware Grid (e.g., VM usage/size as the unit for metering and billing)
  + Grid-Aware Virtualization (e.g., live migration across administrative domains)
  + BSM (e.g., policy-based management of service-level agreements)
  = SOI

Building on this equation, we will architect and implement a platform for supporting complex services, which
- Enables dynamic deployment of complex multi-tier services across heterogeneous administrative domains
- Uses virtualization of servers, storage and network to allow "migration without borders"
- Supports service definition, SLA management, accounting and billing
SOI: Grid Computing
[Diagram labels: Grid node or Service Site; Physical Resources; Service Tasks]
SOI: Grid Computing + Virtualization
Virtual Execution Environment (VEE): improved isolation, relaxed dependencies, well-defined billing units
SOI: Grid Computing + Virtualization + BSM
Policy 1: If possible, keep VEEs from the same organization in the same physical box
SOI: Grid Computing + Virtualization + BSM
Policy 1: If possible, keep VEEs from the same organization in the same physical box
Policy 2: Turn off underutilized physical boxes
SOI: Grid Computing + Virtualization + BSM
Policy 1: If possible, keep VEEs from the same organization in the same physical box
Policy 2: Turn off underutilized physical boxes
Local optimizations (within a single site): placement, power, etc.
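As an illustration only, Policies 1 and 2 can be read as a greedy consolidation heuristic. The sketch below is not RESERVOIR code: the function names, the one-slot-per-VEE model and the fixed per-box capacity are all assumptions made for the example.

```python
from collections import defaultdict


def place_vees(vees, hosts, capacity):
    """Greedy single-site placement (hypothetical model).

    vees: list of (vee_id, organization) pairs; each VEE uses one slot.
    hosts: list of physical box names; capacity: slots per box.
    Returns {host: [vee_id, ...]}.
    """
    placement = {host: [] for host in hosts}
    orgs_on_host = {host: set() for host in hosts}

    by_org = defaultdict(list)
    for vee_id, org in vees:
        by_org[org].append(vee_id)

    # Handle the largest organizations first so they get contiguous room.
    for org in sorted(by_org, key=lambda o: (-len(by_org[o]), o)):
        for vee_id in by_org[org]:
            free = [h for h in hosts if len(placement[h]) < capacity]
            if not free:
                raise RuntimeError(f"not enough capacity for {vee_id}")
            # Policy 1: prefer a box already running this organization.
            # Policy 2: otherwise prefer the fullest box already in use,
            # so that empty boxes stay empty and can be turned off.
            best = min(
                free,
                key=lambda h: (org not in orgs_on_host[h],
                               -len(placement[h]),
                               hosts.index(h)),
            )
            placement[best].append(vee_id)
            orgs_on_host[best].add(org)
    return placement


def boxes_to_turn_off(placement):
    """Boxes left without any VEE are candidates for power-off."""
    return [host for host, vees in placement.items() if not vees]
```

A real placement engine would also weigh migration cost and actual utilization; the sketch only shows how the two policies translate into a preference order.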
SOI: Grid Computing + Virtualization + BSM
Boundaries
Policy 3: If possible, keep VEEs in the owning organization
SOI: Grid Computing + Virtualization + BSM
Boundaries
Policy 4: If possible, keep VEEs in the least number of external organizations
SOI: Grid Computing + Virtualization + BSM
Boundaries
Policy 5: Follow the service customer
SOI: Grid Computing + Virtualization + BSM
Boundaries
Policy 5: Follow the service customer
Migration across sites
Global optimizations: placement, cost, bandwidth, etc.
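One possible reading of Policies 3 to 5 is a ranking over candidate sites. The sketch below is purely illustrative: the `Site` record, the `choose_site` function and the priority order among the three policies are assumptions, not part of the RESERVOIR design.

```python
from dataclasses import dataclass


@dataclass
class Site:
    name: str
    owner: str       # organization operating the site (hypothetical field)
    region: str      # coarse location, e.g. "eu" (hypothetical field)
    free_slots: int  # remaining VEE capacity


def choose_site(service_owner, customer_region, sites, external_in_use):
    """Pick a site for a new VEE, or None if every site is full.

    external_in_use: names of external sites that already host VEEs
    of this service.
    """
    candidates = [s for s in sites if s.free_slots > 0]
    if not candidates:
        return None

    def rank(site):
        is_external = site.owner != service_owner
        # Policy 3: stay inside the owning organization when possible.
        # Policy 4: otherwise reuse an external organization already in use.
        adds_new_external = is_external and site.name not in external_in_use
        # Policy 5: among equals, follow the service customer.
        far_from_customer = site.region != customer_region
        return (is_external, adds_new_external, far_from_customer, site.name)

    return min(candidates, key=rank)
```

Ranking Policy 5 last is only one choice; a deployment that values latency over ownership could move `far_from_customer` to the front of the tuple.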
Virtualize the Network
Create virtual networks connecting VEEs regardless of physical server location
Virtualize the Network and the Storage
Enable secure access to relevant data regardless of storage location
The next generation SOI
- Security
- Scalability
- Availability
- Reliability
- Cost Efficient
- Data Intensive
- Personalization
- Mobility
- Latency
- Manageability
Challenges

From service (high level, business concepts) to infrastructure (low level, IT concepts)
- A service definition language that captures in functional requirements of the service
- Mapping of high-level service requirements and metrics (e.g., response time) to infrastructure-level requirements and metrics (e.g., CPU utilization)

Policy-based management across administrative domains (clouds)
- Multi-level SLA management (service consumer, service provider/s, infrastructure provider)
- Separation of functional responsibilities and collaborative reconciliation
- Service-level utility, analogous to electric power: dynamically and automatically hire additional "power" from another cloud
- Enable intra-site and inter-site workload optimization, HA and SLA management

The capability of creating fully isolated virtual organizations spread across geographies and management domains, through an integrated approach to virtualization of servers, network and storage
- Introduce capabilities for mobility of virtual resources and services across different administrative domains, and for management of disparate virtualized environments

End-to-end performance of virtualized systems
- Identify typical service workloads for which a Reservoir-like infrastructure is advantageous
- Pinpoint causes of performance degradation, for the selected service workload(s), in virtualized environments
- Introduce service workload-specific optimizations to the relevant virtualization layer
The Reservoir Architecture
[Diagram, top to bottom: Service Provider; Service Manager; VEE Management System; VEE Management Enablement Layer; Virtualized Physical Resource (e.g., Hypervisor). Interface labels: SD + SLA, SLA, SLA. Infrastructure Provider = Site/Domain/Cloud]
The Reservoir Architecture
Service Manager
- Monitors the service and enforces SLA compliance by managing the capacity of Service Components (VEEs) and/or the size of Service Tiers
- Deals with translation/mapping of service concepts/metrics (response time) to infrastructure concepts/metrics (VEE size)
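To make the mapping from a service metric (response time) to an infrastructure decision (tier size) concrete, here is a minimal threshold rule. It is a sketch under stated assumptions: the function name, the one-VEE step and the `scale_in_margin` parameter are hypothetical and do not describe the actual Service Manager.

```python
def resize_tier(current_vees, measured_response_ms, sla_response_ms,
                min_vees=1, max_vees=10, scale_in_margin=0.5):
    """Return the new number of VEEs for one service tier.

    Grows the tier by one VEE when the SLA response time is violated and
    shrinks it by one when the measured time is well below the SLA target.
    """
    if measured_response_ms > sla_response_ms:
        target = current_vees + 1          # SLA violated: add capacity
    elif measured_response_ms < scale_in_margin * sla_response_ms:
        target = current_vees - 1          # ample headroom: release capacity
    else:
        target = current_vees              # within the comfort band
    # Never leave the configured bounds for the tier.
    return max(min_vees, min(max_vees, target))
```

For example, `resize_tier(2, 250, 200)` asks for a third VEE, while `resize_tier(2, 150, 200)` leaves the tier unchanged.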
The Reservoir Architecture
VEE Management System
- Monitors VEEs and finds the best VEE placement, posed as a constraint satisfaction problem
- Deals with federation of domains
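The slide does not say how the placement problem is formulated or solved. As a toy illustration only, the sketch below treats it as a small backtracking search over capacity and anti-affinity constraints and prefers solutions that use fewer hosts. The name `find_placement`, the single CPU dimension and the `apart` constraint are assumptions made for the example.

```python
def find_placement(demands, capacities, apart=()):
    """Exhaustive search for a feasible VEE-to-host assignment.

    demands: {vee: cpu_units}; capacities: {host: cpu_units};
    apart: pairs of VEEs that must not share a host.
    Returns {vee: host} using as few hosts as possible, or None.
    """
    vees = sorted(demands, key=lambda v: -demands[v])  # biggest first
    hosts = sorted(capacities)
    best = None

    def search(i, assignment, free):
        nonlocal best
        if i == len(vees):
            used = len(set(assignment.values()))
            if best is None or used < best[0]:
                best = (used, dict(assignment))
            return
        vee = vees[i]
        for host in hosts:
            if free[host] < demands[vee]:
                continue  # capacity constraint violated
            conflict = any(
                (a == vee and assignment.get(b) == host)
                or (b == vee and assignment.get(a) == host)
                for a, b in apart
            )
            if conflict:
                continue  # anti-affinity constraint violated
            assignment[vee] = host
            free[host] -= demands[vee]
            search(i + 1, assignment, free)
            free[host] += demands[vee]
            del assignment[vee]

    search(0, {}, dict(capacities))
    return None if best is None else best[1]
```

Exhaustive search is only workable for a handful of VEEs; it is used here because it keeps the constraints explicit.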
Project Structure
- A1: Architecture (IBM)
- A2: VEE Infrastructure (IBM)
- A3: VEE Management (UCM)
- A4: Service Management (TID)
- A5: Testbed and Scenarios (UniMe)
- A6: Dissemination (CETIC)
The RESERVOIR Consortium

Partner             Role
IBM HRL             Technology
Telefonica I+D      Technology
UCM                 Technology
Thales              Technology
SAP                 Use-Cases
Sun Microsystems    Use-Cases + Tech
DATAMAT             Technology
University Lugano   Technology
University UMEA     Technology
University Messina  Technology
UC London           Technology
CETIC               Technology
OGF                 Standardization

Comment (in slide order): Project Lead, Virtualization/SOA Infrastructure Service Technology, Billing Infrastructure Grid, Dynamic Allocation Technology Security, Virtualization Infrastructure, Hosting Use-Cases, Contribution to Requirement and Standards Contribution to Standards, Java Services, Monitoring Service Management Technologies Partner, Monitoring and SLA Management Monitoring, Measuring and Billing Technology Grid Experience, Testbed Development, Virtualization Technology Security Grid and Virtualization Standards
Project Status
- 3-year FP7 project, started in February
  - Kickoff meeting hosted by IBM in Haifa at the beginning of February
  - 21 participants from abroad, representing all 12 partners (+ local IBMers)
- Architecture work-package fully at work
  - First version of the architectural spec expected at the end of May
- Started working on building the testbed
  - 15 machines at UniMe, 4 machines at IBM (2 more on the way) and 8 machines on the way to UCM
- Web site up and running
  - Come visit us at http://www.reservoir-fp7.eu/
- Next F2F meeting planned for the beginning of June
  - To be hosted by Telefonica in Barcelona
  - Coincides with OGF23
A2: VEE Infrastructure

Virtual Machine Technologies (IBM)
- Improve performance of VEE execution for typical RESERVOIR workloads
- Provide VEEMS enablement layer for virtual machines

Relocation Enablement (IBM)
- Network Virtualization
- Storage Virtualization

Java Service Containers (Sun)
- Provide VEEMS enablement layer for virtual Java service containers
A3: VEE Management

VEE Provisioning and Supervision (UCM)
- Image management
- Monitoring

Allocation Policy Management (Datamat)
- Policy-based placement and migration

Federation of Management Domains (UCM)
- Built atop WSRF interfaces to access remote VEE Supervisors
- Push new and leverage existing OGF/DMTF/OASIS standards
- Interoperability between administrative domains and scheduling heuristics on federated and utility architectures
A4: Service Management

Service Definition (UCL)
- Design a new service description language that will allow the description of service interfaces, service lifecycle, interface bindings to implementations, service deployment, SLA requirements for a service, rules for VEE (re)configuration and (re)organisation, and service component distribution and configuration
- Revisit the service lifecycle definition and extend it to accommodate the influence of virtualisation
- Extend tools available for service design (for example, the Eclipse Web Tools Platform)
- Standardize the service description language

Service Management (TID)
- SLA monitoring across administrative domain settings and service-oriented architectures
- Integrate monitoring with resource allocation and scheduling, taking explicit account of the potentially synchronous nature of service invocations
- Automatic deployment of services based on complex service definitions

Accounting, Billing and Payment (TID)
- Accounting and billing arrangements for outsourced services are based on raw machine resource consumption (CPU time, storage capacity, etc.)
- RESERVOIR will pursue the definition of a framework that allows accounting and billing in terms of the services that were completed, taking into consideration the quality of service that was provided
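The difference between billing for raw resources and billing for completed services can be shown with a small calculation. The sketch below is hypothetical: the pricing model, the `late_discount` parameter and the function name are assumptions for illustration, since the slide only states that such a framework is a goal.

```python
def service_bill(requests, price_per_request, sla_response_ms, late_discount=0.5):
    """Bill per completed service request, adjusted for delivered quality.

    requests: list of (completed, response_ms) tuples.
    Failed requests cost nothing; requests slower than the SLA target are
    charged at a reduced rate.
    """
    total = 0.0
    for completed, response_ms in requests:
        if not completed:
            continue  # nothing was delivered, so nothing is billed
        if response_ms <= sla_response_ms:
            total += price_per_request
        else:
            total += price_per_request * late_discount
    return total
```

With a price of 2.0 per request and a 200 ms target, one on-time request, one late request and one failed request would be billed 2.0 + 1.0 + 0.0, regardless of how much CPU time each consumed.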
A5: Experimentation and Validation

Testbed (UniMe)
- Create the necessary environment for testing and validation
- A multi-site environment running the latest release of the RESERVOIR middleware to be evaluated (built on physical resources distributed/owned/managed by some of the project's partners)

Scenario 1: eGov application (Thales)
- Automatic adjustment of resources and cooperation between domains

Scenario 2: SAP business application (SAP)
- Business-application-oriented use cases and the opportunities to execute them on a flexible infrastructure

Scenario 3: Utility computing (Sun)
- Deploy arbitrary operating system and application stacks on remote resources
- Provide secure and seamless access to them
- Adjust resource allocation on demand without the end user noticing disruption of service

Scenario 4: Telco application (TID)
- Hosting web sites that deal with massive access (e.g., the Olympic Games)
- High degree of personalization and support for mashups