Concepts Introduced in Chapter 6. Warehouse-Scale Computers. Programming Models for WSCs. Important Design Factors for WSCs

Similar documents
ΕΠΛ372 Παράλληλη Επεξεργάσια

Warehouse-Scale Computers to Exploit Request-Level and Data-Level Parallelism

Lecture 20: WSC, Datacenters. Topics: warehouse-scale computing and datacenters (Sections )

CS 61C: Great Ideas in Computer Architecture (Machine Structures) Lecture 17 Datacenters and Cloud Compu5ng

Warehouse-Scale Computing

CS 61C: Great Ideas in Computer Architecture (Machine Structures) Warehouse-Scale Computing

Distributed Data Infrastructures, Fall 2017, Chapter 2. Jussi Kangasharju

HCI: Hyper-Converged Infrastructure

CPSC 426/526. Cloud Computing. Ennan Zhai. Computer Science Department Yale University

New Approach to Unstructured Data

CSE 124: THE DATACENTER AS A COMPUTER. George Porter November 20 and 22, 2017

Cloud Computing. What is cloud computing. CS 537 Fall 2017

Data Centers and Cloud Computing. Data Centers

Distributed Systems. 31. The Cloud: Infrastructure as a Service Paul Krzyzanowski. Rutgers University. Fall 2013

Today s Lecture. CS 61C: Great Ideas in Computer Architecture (Machine Structures) Map Reduce

PROCESS & DATA CENTER COOLING. How and Why To Prepare For What s To Come JOHN MORRIS REGIONAL DIRECTOR OF SALES

Data Centers and Cloud Computing

Data Centers and Cloud Computing. Slides courtesy of Tim Wood

Planning a Green Datacenter

S I T E I N F R A S T R U C T U R E C U R R E N T P R O B L E M S A N D P R O P O S E D S O L U T I O N S

Taking The Heat Off University of East Anglia Data Centre

Topics. CIT 470: Advanced Network and System Administration. Google DC in The Dalles. Google DC in The Dalles. Data Centers

Nutanix Tech Note. Virtualizing Microsoft Applications on Web-Scale Infrastructure

By John Kim, Chair SNIA Ethernet Storage Forum. Several technology changes are collectively driving the need for faster networking speeds.

Prefabricated Data Center Solutions: Coming of Age

Embedded Technosolutions

SMART SOLUTIONS TM INTELLIGENT, INTEGRATED INFRASTRUCTURE FOR THE DATA CENTER

Hot vs Cold Energy Efficient Data Centers. - SVLG Data Center Center Efficiency Summit

Scaling Data Center Application Infrastructure. Gary Orenstein, Gear6

Cloud Computing and Hadoop Distributed File System. UCSB CS170, Spring 2018

THE ZADARA CLOUD. An overview of the Zadara Storage Cloud and VPSA Storage Array technology WHITE PAPER

Emerging Trends of Micro Data Centers

RACKSPACE ONMETAL I/O V2 OUTPERFORMS AMAZON EC2 BY UP TO 2X IN BENCHMARK TESTING

White Paper. Platform9 ROI for Hybrid Clouds

Passive RDHx as a Cost Effective Alternative to CRAH Air Cooling. Jeremiah Stikeleather Applications Engineer

5 Reasons to Host Oracle E-Business Suite on the AWS Cloud

Level 3 Certificate in Cloud Services (for the Level 3 Infrastructure Technician Apprenticeship) Cloud Services

How is server administration different in the Cloud?

Reducing Data Center Cooling Costs through Airflow Containment

Virtualization and consolidation

Business Benefits of Policy Based Data De-Duplication Data Footprint Reduction with Quality of Service (QoS) for Data Protection

Magellan Project. Jeff Broughton NERSC Systems Department Head October 7, 2009

Case study on Green Data Center. Reliance Jio Infocomm Limited

Intelligent Placement of Datacenters for Internet Services. Íñigo Goiri, Kien Le, Jordi Guitart, Jordi Torres, and Ricardo Bianchini

Ecolibrium. Cloud control. Keeping data centres cool. OCTOBER 2018 VOLUME 17.9 RRP $14.95 PRINT POST APPROVAL NUMBER PP352532/00001

Recent Trends. CS 537 Section 11 Large Scale Systems. Effects of Google s HW philosophy. Google Design Philosophy 5/5/09

Nimble Storage vs HPE 3PAR: A Comparison Snapshot

Next-Generation Cloud Platform

ECE Enterprise Storage Architecture. Fall ~* CLOUD *~. Tyler Bletsch Duke University

Cloud Computing Economies of Scale

Community Driven Innovation and Sustainability. Revolutionary IT Infrastructures

MSYS 4480 AC Systems Winter 2015

Scalable, Secure. Jason Rylands

Veritas NetBackup Appliance Family OVERVIEW BROCHURE

Data center interconnect for the enterprise hybrid cloud

Planning for the HDP Cluster

Cloud Bursting: Top Reasons Your Organization will Benefit. Scott Jeschonek Director of Cloud Products Avere Systems

HGST: Market Creator to Market Leader

How Liquid Cooling Helped Two University Data Centers Achieve Cooling Efficiency Goals. Michael Gagnon Coolcentric October

Analytics in the Cloud Mandate or Option?

Power & Cooling Considerations for Virtualized Environments

Evaluating Cloud Storage Strategies. James Bottomley; CTO, Server Virtualization

CLOUD ECONOMICS: HOW TO QUANTIFY THE BENEFITS OF MOVING TO THE CLOUD

Green IT Project: The Business Case

CS / Cloud Computing. Recitation 3 September 9 th & 11 th, 2014

DIVING IN: INSIDE THE DATA CENTER

Clouds, Convergence & Consolidation

Greening Terremark Data Centers

CS 61C: Great Ideas in Computer Architecture. MapReduce

Enterprise Architectures The Pace Accelerates Camberley Bates Managing Partner & Analyst

Oracle Database Exadata Cloud Service Exadata Performance, Cloud Simplicity DATABASE CLOUD SERVICE

ClearPath Forward Libra 8480 / 8490

TWO FINANCIAL IMPACT STUDIES. Seal IT Equipment Cabinets for Significant Annual Cost Savings and Simple Payback in a Few Short Months

COMPTIA CLO-001 EXAM QUESTIONS & ANSWERS

Introduction To Cloud Computing

CLOUD COMPUTING. Rajesh Kumar. DevOps Architect.

Hortonworks Data Platform

Reduce Latency and Increase Application Performance Up to 44x with Adaptec maxcache 3.0 SSD Read and Write Caching Solutions

EBOOK. FROM DISASTER RECOVERY TO ACTIVE-ACTIVE: NuoDB AND MULTI-DATA CENTER DEPLOYMENTS

Lecture 7: Data Center Networks

Strong Consistency versus Weak Consistency

DATA CENTER COLOCATION BUILD VS. BUY

The Google File System

Government Data Center Modernization

Cisco Wide Area Application Services and Cisco Nexus Family Switches: Enable the Intelligent Data Center

Private Cloud Project Leads to Lower Server Footprint and US$595,000 Savings

Dell EMC Hyper-Converged Infrastructure

A Data Center Heat Removal System That Saves... 80% Discover Inertech s Award-winning Data Center Heat Removal System

EN2910A: Advanced Computer Architecture Topic 06: Supercomputers & Data Centers Prof. Sherief Reda School of Engineering Brown University

Cisco HyperFlex HX220c M4 and HX220c M4 All Flash Nodes

AMD Disaggregates the Server, Defines New Hyperscale Building Block

NewSQL Without Compromise

Atlantis Computing Adds the Ability to Address Classic Server Workloads

Key aspects of cloud computing. Towards fuller utilization. Two main sources of resource demand. Cluster Scheduling

Simplified. Software-Defined Storage INSIDE SSS

2013 AWS Worldwide Public Sector Summit Washington, D.C.

Data centers control: challenges and opportunities

WHITE PAPER. Header Title. Side Bar Copy. Header Title 5 Reasons to Consider Disaster Recovery as a Service for IBM i WHITEPAPER

Making Middleboxes Someone Else s Problem: Network Processing as a Cloud Service

Technology Changes in Data Centers

Transcription:

Concepts Introduced in Chapter 6 Warehouse-Scale Computers A cluster is a collection of desktop computers or servers connected together by a local area network to act as a single larger computer. introduction to warehouse-scale computing programming models infrastructure and costs cloud computing A warehouse-scale computer (WSC) is a cluster comprised of tens of thousands of servers. The cost may be on the order of $150M for the building, electrical and cooling infrastructure, the servers, and the networking equipment that houses 50,000 to 100,000 servers. A WSC can be used to provide internet services. search - Google social networking - Facebook video sharing - YouTube online sales - Amazon cloud computing services - Rackspace and many more applications Important Design Factors for WSCs Programming Models for WSCs WSC goals and requirements in common with servers. cost-performance - work done per dollar energy eciency - work done per joule dependability via redundancy network I/O interactive and batch processing workloads WSC aspects that are distinct from servers. Ample parallelism is always available in a WSC. Operational costs represent a greater fraction of the cost of a WSC. Customization is easier for the scale of an WSC. MapReduce (or the open source Hadoop) is the most popular framework for batch processing in a WSC. Map applies a programmer-supplied function to each logical input record to produce a set of key-value pairs. Reduce collapses these values using another programmer-supplied function. Both tasks are highly parallel.

Programming Models for WSCs (cont.) Storage for a WSC There is often a high variability in performance between the dierent WSC servers due to a variety of reasons. varying load on servers le may or may not be in a le cache distance over network can vary hardware anamolies A WSC will start backup executions on other nodes when tasks have not yet completed and take the result that nishes rst. Rely on data (le) replication to help with read performance and availability. A WSC also has to cope with variability in load. servers entire WSC Often WSC services are performed with in-house software to reduce costs and optimize for performance. A WSC uses local disks inside the servers as opposed to network attached storage (NAS). The Google le system (GFS) uses local disks and maintains at least three replicas to improve dependability by covering not only disk failures, but also power failures to a rack or a cluster of racks by placing the replicas on dierent clusters. A read is serviced by one of the three replicas, but a write has to go to all three replicas. Google uses a relaxed consistency model in that all three replicas have to eventually match, but not all at the same time. WSC Networking WSC Location A WSC uses a hierarchy of networks for interconnection. The standard rack holds 48 servers connected by a 48-port Ethernet switch. A rack switch has 2 to 8 uplinks to a higher switch. So the bandwidth leaving the rack is 6 (48/8) to 24 (48/2) times less than the bandwidth within a rack. There are array switches that are more expensive to allow higher connectivity. There may also be Layer 3 routers to connect the arrays together and to the Internet. The goal of the software is to maximize locality of communication relative to the rack. proximity to Internet backbone optical bers proximity to users of service to reduce Internet access latency electricity availability and cost property tax rate low risk from environmental disasters stability of country low temperature to decrease cooling cost

WSC Power and Cooling Measuring WSC Eciency power usage of just the WSC IT equipment 33% for processors 30% for DRAM 10% for disks 5% for networking 22% for other components within the servers Air conditioning is used to cool server room, requiring 10%-20% of IT equipment power due mostly to fans. Chilled water is often used to cool the air, requiring 30% to 50% of IT equipment power. Outside cooling towers can leverage lower outside temperature. power power utilization eectiveness (PUE) is a widely used simple metric. Median PUE reported in a 2006 study was 1.69. performance PUE = total_facility_power IT _equipment_power Bandwidth is an important metric as there may be many simultaneous user requests or metadata generation batch jobs. Latency is also an important metric as it is seen by users when they make requests. Users will use a search engine less as the response time increases. Also users are more productive in responding to interactive information when the system response time is faster as they are less distracted. Google WSC Innovations to Improve Energy Eciency Cost of a WSC Modied server containers. Separated hot and cold chambers to reduce variation in air temperative, which allows air to be delivered at higher temperatures due to less severe worst-case hot spots. Operating servers at higher temperatures allowed use of cooling towers instead of the more inecient traditional chillers. Shrunk distince of air circulation loop to reduce energy required to move air. Located WSCs in more temperate climates to allow more use of evaporative cooling. Deployed extensive monitoring to measure actual PUE. Designed motherboards that only need a single 12-volt supply so that a UPS could be provided using standard batteries with each server. Google PUE was 1.23 in 2007 and was 1.12 in 2011. capital expenditures (CAPEX) CAPEX is the cost to build a WSC, which includes the building, power and cooling infrastructure, and initial IT equipment (servers and networking equipment). operational expenditures (OPEX) OPEX is the cost to operate a WSC, which includes buying replacement equipment, electricity, and salaries.

Advent of Cloud Computing Improvements in Operational Techniques Cloud computing can be thought of as providing computing as a utility, where a customer pays for only what they use, just as we do for electricity. Cloud computing relies on increasingly larger WSCs which provide several benets if properly set up and operated. improvements in operational techniques economies of scale reduces customer risks of over-provisioning or under-provisioning WSCs have led to innovations in system software to provide high reliability. failover - Automatically restarting an application that fails without requiring administrative intervention. rewall - Examines each network packet to determine whether or not it should be forwarded to its destination. virtual machine - A software layer that executes applications like a physical machine. Protection against denial-of-service attacks. Economies of Scale Reducing Customer Risks WSCs oer economies of scale that cannot be achieved with a data center. 5.7 times reduction in storage costs 7.1 times reduction in administrative costs 7.3 times reduction in networking costs volume discount price reductions PUE of perhaps 1.2 versus PUE of 2.0 for a data center better utilization of WSC by being available to the public WSCs reduce risks of over-provisioning or under-provisioning, particularly for start-up companies. Providing too much equipment means overspending. Providing too little equipment means demand may not be able to be met, which can give a bad impression to potential new customers.

Amazon Web Services Fallacies and Pitfalls Amazon oered Amazon Simple Storage Service (Amazon S3) and Amazon Elastic Computer Cloud (Amazon EC2) in 2006. Relied on virtual machines. Provides better protection for users. Simplied software distribution within a WSC. The ability to reliably kill a virtual machine made it easier to control resource usage. Being able to limit use of resources simplied providing multiple price points for customers. Improved exibility in server conguration. Relied on open source software. Provided service at very low cost. No contract required. Fallacy: Capital costs of a WSC facility are higher than the servers that it houses. Pitfall: Trying to save power with inactive low power modes versus active low power modes. Pitfall: Using too wimpy a processor when trying to improve WSC cost-performance. Fallacy: Replacing all disks with Flash memory will improve cost-performance of a WSC.