Data Center Fundamentals: The Datacenter as a Computer

Similar documents
CSE 124: THE DATACENTER AS A COMPUTER. George Porter November 20 and 22, 2017

CSE 291: Data Center Networking. Spring 2015 Tu/Th 8:00-9:20am George Porter UC San Diego

Lecture 7: Data Center Networks

Introduction To Cloud Computing

Data Center Fundamentals: The Datacenter as a Computer

L19 Data Center Network Architectures

Data Centers and Cloud Computing

Data Centers and Cloud Computing. Slides courtesy of Tim Wood

Data Centers and Cloud Computing. Data Centers

Datacenter Backbone Enterprise Cellular Wireless

Introduction to data centers

Module Day Topic. 1 Definition of Cloud Computing and its Basics

Network Implications of Cloud Computing Presentation to Internet2 Meeting November 4, 2010

Distributed Data Infrastructures, Fall 2017, Chapter 2. Jussi Kangasharju

Intro to Software as a Service (SaaS) and Cloud Computing

Distributed Systems. 31. The Cloud: Infrastructure as a Service Paul Krzyzanowski. Rutgers University. Fall 2013

Faculté Polytechnique

CPSC 426/526. Cloud Computing. Ennan Zhai. Computer Science Department Yale University

Cloud Computing. What is cloud computing. CS 537 Fall 2017

Automating Cloud Networking with RedHat OpenStack

ECE Enterprise Storage Architecture. Fall ~* CLOUD *~. Tyler Bletsch Duke University

Introduction to Cloud Computing. [thoughtsoncloud.com] 1

In this unit we are going to look at cloud computing. Cloud computing, also known as 'on-demand computing', is a kind of Internet-based computing,

THE ZADARA CLOUD. An overview of the Zadara Storage Cloud and VPSA Storage Array technology WHITE PAPER

Big Data and Cloud Computing

CSE6331: Cloud Computing

Lecture 10.1 A real SDN implementation: the Google B4 case. Antonio Cianfrani DIET Department Networking Group netlab.uniroma1.it

Emerging Trends of Micro Data Centers

Defining Software Defined Storage Laz Vekiarides Entrepreneur and Technology Executive

Mellanox Virtual Modular Switch

MapReduce for Scalable and Cloud Computing

Data Center Networks. Networking Case Studies. Cloud CompuMng. Cloud CompuMng. Cloud Service Models. Cloud Service Models

CLOUD COMPUTING ABSTRACT

Transform Your Business To An Open Hybrid Cloud Architecture. Presenter Name Title Date

CS 6393 Lecture 10. Cloud Computing. Prof. Ravi Sandhu Executive Director and Endowed Chair. April 12,

CS 6240: Parallel Data Processing in MapReduce: Module 1. Mirek Riedewald

epldt Cloud Services 4 th National ICT Summit National ICT Confederation of the Philippines (NICP) GENSAN, November 4-6, 2011 Sonny Valdez CTO, epldt

DISTRIBUTED SYSTEMS [COMP9243] Lecture 8a: Cloud Computing WHAT IS CLOUD COMPUTING? 2. Slide 3. Slide 1. Why is it called Cloud?

Dell EMC Hyper-Converged Infrastructure

CS 61C: Great Ideas in Computer Architecture (Machine Structures) Lecture 17 Datacenters and Cloud Compu5ng

Redefining Enterprise Storage: EMC Storage Strategy

Cloud Computing Context. Image: T. Sridhar

Cloud Computing. DB Special Topics Lecture (10/5/2012) Kyle Hale Maciej Swiech

Cloud Computing. Ennan Zhai. Computer Science at Yale University

Microsoft Azure Course Content

Why Datrium DVX is Best for VDI

Windows Azure Services - At Different Levels

CHARTING THE FUTURE OF SOFTWARE DEFINED NETWORKING

CHEM-E Process Automation and Information Systems: Applications

EBOOK DATABASE CONSIDERATIONS FOR DOCKER

MapReduce for Scalable and Cloud Computing

Simulation of Cloud Computing Environments with CloudSim

Title DC Automation: It s a MARVEL!

VMWARE EBOOK. Easily Deployed Software-Defined Storage: A Customer Love Story

Cloud Infrastructure and Operations Chapter 2B/8 Page Main concept from which Cloud Computing developed

Atos Canopy Orchestrated Hybrid Cloud. Mark Nouris - Atos Head of Cloud Michael Kollar Head of Cloud engineering & TIC

How Software Defined Everything can reduce TCO

BUILD A BUSINESS CASE

Data Centres: The Infrastructure That Underpins The Cloud. May 2017

New Approach to Unstructured Data

Pocket: Elastic Ephemeral Storage for Serverless Analytics

Cloud Computing introduction

Cloud Computing. Technologies and Types

Dell EMC Hyper-Converged Infrastructure

Modernize Your Infrastructure

Composable Infrastructure for Public Cloud Service Providers

Utilizing Datacenter Networks: Centralized or Distributed Solutions?

Cloud Computing and Service-Oriented Architectures

Flash in a Hybrid Cloud World. How Cloud Shift will affect flash in the Data Center Steve Knipple: Cloud Shift Advisors

UVA HPC & BIG DATA COURSE. Cloud Computing. Adam Belloum

Popular SIEM vs aisiem

Deploying TeraVM in an OpenStack Environment

Efficient On-Demand Operations in Distributed Infrastructures

Cloud & AWS Essentials Agenda. Introduction What is the cloud? DevOps approach Basic AWS overview. VPC EC2 and EBS S3 RDS.

Programowanie w chmurze na platformie Java EE Wykład 1 - dr inż. Piotr Zając

Cloud Computing Briefing Presentation. DANU

Cloud Programming. Programming Environment Oct 29, 2015 Osamu Tatebe

MATE-EC2: A Middleware for Processing Data with Amazon Web Services

Transforming Management for Modern Scale-Out Infrastructure

New Fresh Storage Approach for New IT Challenges Laurent Denel Philippe Nicolas OpenIO

Cloud Computing An IT Paradigm Changer

Cloud Computing: Making the Right Choice for Your Organization

The OpenCirrus TM Project: A global Testbed for Cloud Computing R&D

The next step in Software-Defined Storage with Virtual SAN

DriveScale-DellEMC Reference Architecture

São Paulo. August,

Lecture 20: WSC, Datacenters. Topics: warehouse-scale computing and datacenters (Sections )

Lecture 09: VMs and VCS head in the clouds

Modern hyperconverged infrastructure. Karel Rudišar Systems Engineer, Vmware Inc.

The End of Storage. Craig Nunes. HP Storage Marketing Worldwide Hewlett-Packard

NaaS Network-as-a-Service in the Cloud

DIGGING INTO THE SOFTWARE DEFINED DATA CENTER

Cloud Computing Lecture 4

Cloud computing and Citrix C3 Last update 28 July 2009 Feedback welcome -- Michael. IT in the Clouds. Dr Michael Harries Citrix Labs

Fundamental Concepts and Models

Concepts Introduced in Chapter 6. Warehouse-Scale Computers. Programming Models for WSCs. Important Design Factors for WSCs

Cloud Computing Technologies and Types

Lesson 14: Cloud Computing

Virtual vs Physical ADC

Modelos de Negócio na Era das Clouds. André Rodrigues, Cloud Systems Engineer

Transcription:

Data Center Fundamentals: The Datacenter as a Computer George Porter CSE 124 Feb 9, 2016 *Includes material taken from Barroso et al., 2013, and UCSD 222a.

Much in our life is now on the web 2

The web is driven by data Data + = Product RecommendaGons Data + = Custom StaGons Data + = Personalized Search 3

Data-driven, per-user customizagon Data = + App... App... App... App 3 Product RecommendaGons App... App 1 App... App 2 App... App... App... App... App...

Cloud Computing! Elastic resources Expand and contract resources Pay-per-use Infrastructure on demand Multi-tenancy Multiple independent users Security and resource isolation Amortize the cost of the (shared) infrastructure Flexible service management Resiliency: isolate failure of servers and storage Workload movement: move work to other locations 5

Cloud Service Models! Software as a Service (Saas) Provider licenses applications to users as a service e.g., customer relationship management, email, Avoid costs of installation, maintenance, patches, Platform as a Service (Paas) Provider offers software platform for building applications e.g., Google s App-Engine Avoid worrying about scalability of platform Infrastructure as a Service (Iaas) Provider offers raw computing, storage, and network e.g., Amazon s Elastic Computing Cloud (EC2) Avoid buying servers and estimating resource needs 6

Data centers with 100,000+ servers MicrosoN MicrosoN Google Facebook

These things are really big 100 billion searches per month 1.15 billion users 120+ million users 8

The need for rapid growth 1989 1993 1997 2001 2005 2009 2013 9

The need for rapid growth 1989 1993 1997 2001 2005 2009 2013 Web Created 10

The need for rapid growth 1989 1993 1997 2001 2005 2009 2013 Web Created Google s 1st cluster (15 years) 11

The need for rapid growth 1989 1993 1997 2001 2005 2009 2013 Web Created Google s 1st cluster (15 years) (10 years) 12

Building blocks of modern data centers Network switch Rack

Top-of-Rack Architecture! Rack of servers Commodity servers And top-of-rack switch Modular design Preconfigured racks Power, network, and storage cabling Aggregate to the next level 14

Racks of servers (Google)

Facebook

Google

Extreme Modularity! Containers Many containers 18

Not just a collecgon of servers A data center isn t just a small internet Why? Administered as a single domain Trusted administrators No need to be compagble with the outside world Except for traffic to/from users No need for internagonal standards bodies Though why do standards help?

Front-End datacenter traffic Wide-area Internet Data center Internet Users

Front-End datacenter traffic Wide-area Internet Web Video Data sizes driven by the content that users actually consume Growth largely due to higher bitrate content (IP TV/movies, iphone FaceGme) Mobile Internet source of new users ONen constrained by the last mile Voice Music Photos

Multi-Tier Applications! Applications consist of tasks Many separate components Running on different machines Commodity computers Many general-purpose computers Not one big mainframe Easier scaling Aggregator! Aggregator Front end Server Aggregator! Aggregator! Worker 22 Worker Worker Worker Worker

Back-end datacenter traffic Back-end analygcs: ConnecGons between informagon Users who bought X also bought Y Key differengator determining success Facebook vs Friendster Amazon vs Buy.com Large-scale join computagons spanning thousands of nodes Need bandwidth as well as allto-all connecgvity SorGng / Searching CollaboraGve Filtering Map/Reduce Distributed Key/Value stores Video storage, postproducgon, and transmission

Data-intensive applicagon requirements All-to-all Gather/Scamer Performance gated on latency of shuffle phases Need high bisecgon bandwidth Performance gated on speed of slowest RPC/ parallel operagon Need low variance

Increasingly stringent network requirements Low one-way latency (10s of microseconds) 10/40 Gbps at TOR (and soon endhosts) CongesGon-free operagon/low queuing Dynamic traffic and an increasingly dynamic topology

From networks to backplanes Before: Network connects servers to users (FTP, telnet, ) Massive compugng = Gghtly coupled supercomputer Proprietary interconnects Working sets of data Today: Servers connected to each other Data-intensive, web-scale Massive compugng = Datacenters Commodity Datacenter network becomes the compu0ng backplane

Data Center Challenges Traffic load balancing Support for VM migragon Achieving bisecgon bandwidth Power savings / Cooling Network management (provisioning) Security (dealing with mulgple tenants) 27

Host Virtualization! Multiple virtual machines on one physical machine Applications run unmodified as on real machine VM can migrate from one computer to another 28

VMM Virtual Switches 29

Data Center Storage Example Finding a Needle in Haystack: Facebook's Photo Storage, OSDI 10

The storage hierarchy

Latency, bandwidth, and capacity

Performance of flash