CS 61C: Great Ideas in Computer Architecture (Machine Structures) Lecture 17 Datacenters and Cloud Compu5ng

Similar documents
Warehouse-Scale Computing

CS 61C: Great Ideas in Computer Architecture (Machine Structures) Warehouse-Scale Computing

Levels of Representation / Interpretation

Warehouse-Scale Computers to Exploit Request-Level and Data-Level Parallelism

Concepts Introduced in Chapter 6. Warehouse-Scale Computers. Programming Models for WSCs. Important Design Factors for WSCs

ΕΠΛ372 Παράλληλη Επεξεργάσια

Intro to Software as a Service (SaaS) and Cloud Computing

E.g., Google s Oregon WSC. In the news. Server, Rack, Array. Containers in WSCs 7/27/15. New- School Machine Structures (It s a bit more complicated!

CS 61C: Great Ideas in Computer Architecture Lecture 21: Warehouse- Scale Compu1ng, MapReduce, and Spark

Green Computing: Datacentres

Today s Lecture. CS 61C: Great Ideas in Computer Architecture (Machine Structures) Map Reduce

Topics. CIT 470: Advanced Network and System Administration. Google DC in The Dalles. Google DC in The Dalles. Data Centers

COEN-4730 Computer Architecture Lecture 10 Warehouse Scale Computing (WSC)

COMPUTER ORGANIZATION AND DESIGN The Hardware/Software Interface. 5 th. Edition. Chapter 1. Computer Abstractions and Technology

Distributed Data Infrastructures, Fall 2017, Chapter 2. Jussi Kangasharju

CS 110 Computer Architecture

Cloud Computing Economies of Scale

Green Computing: Datacentres

EN2910A: Advanced Computer Architecture Topic 06: Supercomputers & Data Centers Prof. Sherief Reda School of Engineering Brown University

CSE 124: THE DATACENTER AS A COMPUTER. George Porter November 20 and 22, 2017

Cloud Computing. What is cloud computing. CS 537 Fall 2017

Lecture 20: WSC, Datacenters. Topics: warehouse-scale computing and datacenters (Sections )

The Computer Revolution. Classes of Computers. Chapter 1

Data Centers and Cloud Computing. Data Centers

In the news. Request- Level Parallelism (RLP) Agenda 10/7/11

Data Centers and Cloud Computing

CS 61C: Great Ideas in Computer Architecture. MapReduce

Data Center Fundamentals: The Datacenter as a Computer

Next-Generation Cloud Platform

Computer Architecture. Fall Dongkun Shin, SKKU

Public Cloud Leverage For IT/Business Alignment Business Goals Agility to speed time to market, adapt to market demands Elasticity to meet demand whil

Data Centers and Cloud Computing. Slides courtesy of Tim Wood

Green Computing and Sustainability

Infrastructure Innovation Opportunities Y Combinator 2013

Course Overview. ECE 1779 Introduction to Cloud Computing. Marking. Class Mechanics. Eyal de Lara

CPSC 426/526. Cloud Computing. Ennan Zhai. Computer Science Department Yale University

Internet-Scale Datacenter Economics: Costs & Opportunities High Performance Transaction Systems 2011

Instructors: Randy H. Katz David A. PaHerson hhp://inst.eecs.berkeley.edu/~cs61c/fa10. Fall Lecture #9. Agenda

LECTURE 1. Introduction

Cloud Computing Economics

Datacenter Efficiency Trends. Cary Roberts Tellme, a Microsoft Subsidiary

COMPUTER ORGANIZATION AND DESIGN. 5 th Edition. The Hardware/Software Interface. Chapter 1. Computer Abstractions and Technology

Agenda. AWS Database Services Traditional vs AWS Data services model Amazon RDS Redshift DynamoDB ElastiCache

Data Transport: Defining the Problem and the Solution for Photonics in servers

COST EFFICIENCY VS ENERGY EFFICIENCY. Anna Lepak Universität Hamburg Seminar: Energy-Efficient Programming Wintersemester 2014/2015

Data Center Fundamentals: The Datacenter as a Computer

Computer Science 146. Computer Architecture

CS / Cloud Computing. Recitation 3 September 9 th & 11 th, 2014

EECS2021E EECS2021E. The Computer Revolution. Morgan Kaufmann Publishers September 12, Chapter 1 Computer Abstractions and Technology 1

1 Copyright 2012, Oracle and/or its affiliates. All rights reserved.

Optimizing the Data Center with an End to End Solutions Approach

Datacenters. Mendel Rosenblum. CS142 Lecture Notes - Datacenters

CS3350B Computer Architecture. Introduction

Introduction to Database Services

7 Best Practices for Increasing Efficiency, Availability and Capacity. XXXX XXXXXXXX Liebert North America

MSYS 4480 AC Systems Winter 2015

CS 61C: Great Ideas in Computer Architecture. Course Introduction, Number Representation

Chapter 1. The Computer Revolution

Storage Optimization with Oracle Database 11g

CS6453. Data-Intensive Systems: Rachit Agarwal. Technology trends, Emerging challenges & opportuni=es

CS 6240: Parallel Data Processing in MapReduce: Module 1. Mirek Riedewald

Cloud Computing. Ennan Zhai. Computer Science at Yale University

Case study on Green Data Center. Reliance Jio Infocomm Limited

Experience the GRID Today with Oracle9i RAC

Introduction to Data Management CSE 344

Introduction To Cloud Computing

Lecture 7: Data Center Networks

COS 318: Operating Systems. Introduction

COMPUTER ARCHITECTURE AND OPERATING SYSTEMS (CS31702)

Energy Efficient Data Centers

Warehouse- Scale Computing and the BDAS Stack

CS5950 / CS6030 Cloud Computing

Green IT and Green DC

Cooling on Demand - scalable and smart cooling solutions. Marcus Edwards B.Sc.

IBM FlashSystem. IBM FLiP Tool Wie viel schneller kann Ihr IBM i Power Server mit IBM FlashSystem 900 / V9000 Storage sein?

Internet II. CS10 : Beauty and Joy of Computing. cs10.berkeley.edu. !!Senior Lecturer SOE Dan Garcia!!! Garcia UCB!

vstart 50 VMware vsphere Solution Specification

A Computer Scientist Looks at the Energy Problem

HPC over Cloud. July 16 th, SCENT HPC Summer GIST. SCENT (Super Computing CENTer) GIST (Gwangju Institute of Science & Technology)

The Stampede is Coming Welcome to Stampede Introductory Training. Dan Stanzione Texas Advanced Computing Center

ECE 486/586. Computer Architecture. Lecture # 2

Efficient On-Demand Operations in Distributed Infrastructures

CRITICAL FACILITIES ROUNDTABLE HEAT REMOVAL IN DATA CENTERS ENGINEERED SOLUTIONS

Smart Solutions for a smarter planet. IBM System z... myths vs truth

Hot vs Cold Energy Efficient Data Centers. - SVLG Data Center Center Efficiency Summit

Lecture 9: MIMD Architectures

CSE6331: Cloud Computing

2013 AWS Worldwide Public Sector Summit Washington, D.C.

Automating Information Lifecycle Management with

Instructors Randy H. Katz hdp://inst.eecs.berkeley.edu/~cs61c/fa13. Power UVlizaVon Efficiency. Datacenter. Infrastructure.

TECHNOLOGIES CO., LTD.

CS252 S05. CMSC 411 Computer Systems Architecture Lecture 18 Storage Systems 2. I/O performance measures. I/O performance measures

CSE 451: Operating Systems Spring Module 12 Secondary Storage. Steve Gribble

COURSE OVERVIEW. Introduction to Computer Engineering 2015 Spring by Euiseong Seo

Windows Servers In Microsoft Azure

Software as a Service (SaaS), Service-Oriented Architecture (SOA), and Cloud Computing

Patterson s Projects, People, Impact

L3/L4 Multiple Level Cache concept using ADS

Improving Blade Economics with Virtualization

Computer Organization and Structure. Bing-Yu Chen National Taiwan University

Transcription:

CS 61C: Great Ideas in Computer Architecture (Machine Structures) Lecture 17 Datacenters and Cloud Compu5ng Instructor: Dan Garcia h;p://inst.eecs.berkeley.edu/~cs61c/ 2/28/13 1

In the news Google disclosed that it conhnuously uses enough electricity to power 200,000 homes, but it says that in doing so, it also makes the planet greener. Search cost per day (per person) same as running a 60- wa; bulb for 3 hours Urs Hoelzle, Google SVP Co- author of today s reading h;p://www.nyhmes.com/2011/09/09/technology/google- details- and- defends- its- use- of- electricity.html 2/28/13 2

Review Great Ideas in Computer Architecture 1. Layers of RepresentaHon/InterpretaHon 2. Moore s Law 3. Principle of Locality/Memory Hierarchy 4. Parallelism 5. Performance Measurement and Improvement 6. Dependability via Redundancy 2/28/13 3

Computer Eras: Mainframe 1950s- 60s Processor (CPU) I/O Big Iron : IBM, UNIVAC, build $1M computers for businesses è COBOL, Fortran, Hmesharing OS 2/28/13 4

Minicomputer Eras: 1970s Using integrated circuits, Digital, HP build $10k computers for labs, universihes è C, UNIX OS 2/28/13 5

PC Era: Mid 1980s - Mid 2000s Using microprocessors, Apple, IBM, build $1k computer for 1 person è Basic, Java, Windows OS 2/28/13 6

PostPC Era: Late 2000s -?? 2/28/13 Personal Mobile Devices (PMD): Relying on wireless networking, Apple, Nokia, build $500 smartphone and tablet computers for individuals è ObjecHve C, Java, Android OS + ios Cloud CompuHng: Using Local Area Networks, Amazon, Google, build $200M Warehouse Scale Computers with 100,000 servers for Internet Services for PMDs è MapReduce, Ruby on Rails 7

Why Cloud CompuHng Now? The Web Space Race : Build- out of extremely large datacenters (10,000 s of commodity PCs) Build- out driven by growth in demand (more users) Infrastructure solware and OperaHonal experhse Discovered economy of scale: 5-7x cheaper than provisioning a medium- sized (1000 servers) facility More pervasive broadband Internet so can access remote computers efficiently CommodiHzaHon of HW & SW Standardized solware stacks 8"

March 2013 AWS Instances & Prices Instance Per Hour Ratio to Small Compute Units Virtual Cores Compute Unit/ Core Memory (GiB) Disk (GiB) Address Standard Small $0.065 1.0 1.0 1 1.00 1.7 160 32 bit Standard Large $0.260 4.0 4.0 2 2.00 7.5 850 64 bit Standard Extra Large $0.520 8.0 8.0 4 2.00 15.0 1690 64 bit High-Memory Extra Large $0.460 5.9 6.5 2 3.25 17.1 420 64 bit High-Memory Double Extra Large $0.920 11.8 13.0 4 3.25 34.2 850 64 bit High-Memory Quadruple Extra Large $1.840 23.5 26.0 8 3.25 68.4 1690 64 bit High-CPU Medium $0.165 2.0 5.0 2 2.50 1.7 350 32 bit High-CPU Extra Large $0.660 8.0 20.0 8 2.50 7.0 1690 64 bit Closest computer in WSC example is Standard Extra Large @ At these low rates, Amazon EC2 can make money! even if used only 50% of Hme 2/28/13 9

Warehouse Scale Computers Massive scale datacenters: 10,000 to 100,000 servers + networks to connect them together Emphasize cost- efficiency A;enHon to power: distribuhon and cooling (relahvely) homogeneous hardware/solware Offer very large applicahons (Internet services): search, social networking, video sharing Very highly available: < 1 hour down/year Must cope with failures common at scale WSCs are no less worthy of the experhse of computer systems architects than any other class of machines Barroso and Hoelzle 2009 2/28/13 10

Design Goals of a WSC Unique to Warehouse- scale Ample parallelism: Batch apps: large number independent data sets with independent processing. Also known as Data- Level Parallelism Scale and its Opportuni5es/Problems RelaHvely small number of these make design cost expensive and difficult to amorhze But price breaks are possible from purchases of very large numbers of commodity servers Must also prepare for high # of component failures Opera5onal Costs Count: Cost of equipment purchases << cost of ownership 2/28/13 11

E.g., Google s Oregon WSC 2/28/13 12

Containers in WSCs Inside WSC 2/28/13 Inside Container 13

Equipment Inside a WSC Server (in rack format): 1 ¾ inches high 1U, x 19 inches x 16-20 inches: 8 cores, 16 GB DRAM, 4x1 TB disk 7 foot Rack: 40-80 servers + Ethernet local area network (1-10 Gbps) switch in middle ( rack switch ) Array (aka cluster): 16-32 server racks + larger local area network switch ( array switch ) 10X faster è cost 100X: cost f(n 2 ) 2/28/13 14

Server, Rack, Array 2/28/13 15

Google Server Internals Google Server 2/28/13 16

Defining Performance What does it mean to say X is faster than Y? 2009 Ferrari 599 GTB 2 passengers, 11.1 secs for quarter mile (call it 10sec) 2009 Type D school bus 54 passengers, quarter mile Hme? (let s guess 1 min) h;p://www.youtube.com/watch?v=kwycoquhuna Response Time or Latency: Hme between start and complehon of a task (Hme to move vehicle ¼ mile) Throughput or Bandwidth: total amount of work in a given Hme (passenger- miles in 1 hour) 2/28/13 17

Coping with Performance in Array Lower latency to DRAM in another server than local disk Higher bandwidth to local disk than to DRAM in another server Local Rack Array Racks -- 1 30 Servers 1 80 2400 Cores (Processors) 8 640 19,200 DRAM Capacity (GB) 16 1,280 38,400 Disk Capacity (TB) 4 320 9,600 DRAM Latency (microseconds) 0.1 100 300 Disk Latency (microseconds) 10,000 11,000 12,000 DRAM Bandwidth (MB/sec) 20,000 100 10 2/28/13 Disk Bandwidth (MB/sec) 200 100 10 18

Coping with Workload VariaHon Workload 2X Midnight Noon Midnight Online service: Peak usage 2X off- peak 2/28/13 19

Impact of latency, bandwidth, failure, varying workload on WSC solware? WSC Solware must take care where it places data within an array to get good performance WSC Solware must cope with failures gracefully WSC Solware must scale up and down gracefully in response to varying demand More elaborate hierarchy of memories, failure tolerance, workload accommodahon makes WSC solware development more challenging than solware for single computer 2/28/13 20

Power vs. Server UHlizaHon Server power usage as load varies idle to 100% Uses ½ peak power when idle! Uses ⅔ peak power when 10% uhlized! 90%@ 50%! Most servers in WSC uhlized 10% to 50% Goal should be Energy- Propor5onality: % peak load = % peak energy 2/28/13 21

Power Usage EffecHveness Overall WSC Energy Efficiency: amount of computahonal work performed divided by the total energy used in the process Power Usage EffecHveness (PUE): Total building power / IT equipment power A power efficiency measure for WSC, not including efficiency of servers, networking gear 1.0 = perfechon 2/28/13 22

PUE in the Wild (2007) 2/28/13 23

High PUE: Where Does Power Go? Power DistribuHon Unit Uninterruptable Power Supply (ba;ery) Chiller cools warm water from Air CondiHoner Servers + Networking Computer Room Air CondiHoner 2/28/13 24

Google WSC A PUE: 1.24 Careful air flow handling Don t mix server hot air exhaust with cold air (separate warm aisle from cold aisle) Short path to cooling so li;le energy spent moving cold or hot air long distances Keeping servers inside containers helps control air flow Elevated cold aisle temperatures 81 F instead of tradihonal 65-68 F Found reliability OK if run servers ho;er Use of free cooling Cool warm water outside by evaporahon in cooling towers Locate WSC in moderate climate so not too hot or too cold Per- server 12- V DC UPS Rather than WSC wide UPS, place single ba;ery per server board Increases WSC efficiency from 90% to 99% Measure vs. eshmate PUE, publish PUE, and improve operahon 2/28/13 Spring 2011 - - Lecture #1 25

Summary Parallelism is one of the Great Ideas Applies at many levels of the system from instruchons to warehouse scale computers Post PC Era: Parallel processing, smart phone to WSC WSC SW must cope with failures, varying load, varying HW latency bandwidth WSC HW sensihve to cost, energy efficiency WSCs support many of the applicahons we have come to depend on 2/28/13 26