ΕΠΛ372 Παράλληλη Επεξεργάσια
|
|
- Amice Goodwin
- 5 years ago
- Views:
Transcription
1 ΕΠΛ372 Παράλληλη Επεξεργάσια Warehouse Scale Computing and Services Γιάννος Σαζεϊδης Εαρινό Εξάμηνο 2014 READING 1. Read Barroso The Datacenter as a Computer Slides based on Hennesy and Patterson CAQA 5 th edition Morgan and Kaufman
2 Introduction Introduction Warehouse-scale computer (WSC) Provides Internet services Search, social networking, online maps, video sharing, online shopping, , cloud computing, etc. Differences with HPC clusters : Clusters have higher performance processors and network Clusters emphasize thread-level parallelism, WSCs emphasize requestlevel parallelism Differences with co-located-datacenters: Datacenters consolidate different machines and software into one location Datacenters emphasize virtual machines and hardware heterogeneity in order to serve varied customers Private and public cloud Often term DC refers to both WSC and co-located-dc
3 Introduction Introduction Important design factors for WSC: (same) Cost-performance Small savings add up Energy efficiency Affects power distribution and cooling Work per joule Dependability via redundancy Commodity HW and software based redundancy Network I/O Consistency and interface to world Interactive and batch processing workloads Search but also calculate meta data (rank pages)
4 Introduction Introduction Important design factors for WSC: (differ.) Ample computational parallelism is not important Most jobs are totally independent Request-level parallelism Mostly read and often write to not shared data Data in storage for batch Operational costs count Power consumption is a primary, not secondary, constraint when designing system Scale and its opportunities and problems Can afford to build customized systems since WSC require volume purchase. Failures more common
5 Causes of Outages and Anomalies (2400 servers 1 st yr) Number Cause 1-2 Power outage 4 Upgrades Hard-disk 1000s Dram Problematic machines 5000 Server crashes
6 Figure 6.3 Average CPU utilization of more than 5000 servers during a 6-month period at Google. Servers are rarely completely idle or fully utilized, in-stead operating most of the time at between 10% and 50% of their maximum utilization. (From Figure 1 in Barroso and Hölzle [2007].) The column the third from the right in Figure 6.4 calculates percentages plus or minus 5% to come up with the weightings; thus, 1.2% for the 90% row means that 1.2% of servers were between 85% and 95% utilized.
7 Computer Architecture of WSC WSC often use a hierarchy of networks for interconnection Each 19 rack holds 48 1U servers connected to a rack switch Rack switches are uplinked to switch higher in hierarchy Uplink has 48 / n times lower bandwidth, where n = # of uplink ports Oversubscription Goal is to maximize locality of communication relative to the rack Computer Ar4chitecture of WSC
8 Figure 6.5 Hierarchy of switches in a WSC. (Based on Figure 1.2 of Barroso and Hölzle [2009].)
9 Storage Storage options: Use disks inside the servers, or Network attached storage through Infiniband Computer Ar4chitecture of WSC WSCs generally rely on local disks Google File System (GFS) uses local disks and maintains at least three replicas
10 Array Switch Switch that connects an array of racks Array switch should have 10 X the bisection bandwidth of rack switch Cost of n-port switch grows as n 2 Often utilize content addressable memory chips and FPGAs Computer Ar4chitecture of WSC
11 WSC Memory Hierarchy Servers can access DRAM and disks on other servers using a NUMA-style interface Computer Ar4chitecture of WSC
12 Figure 6.7 Graph of latency, bandwidth, and capacity of the memory hierarchy of a WSC for data in Figure 6.6 [Barroso and Hölzle 2009]. Copyright 2011, Elsevier Inc. All rights Reserved.
13 Infrastructure and Costs of WSC Location of WSC Proximity to Internet backbones, electricity cost, property tax rates, low risk from earthquakes, floods, and hurricanes Power distribution Physcical Infrastrcuture and Costs of WSC
14 Infrastructure and Costs of WSC Cooling Air conditioning used to cool server room 64 F 71 F Keep temperature higher (closer to 71 F) Cooling towers can also be used Physcical Infrastrcuture and Costs of WSC
15 Infrastructure and Costs of WSC Cooling system also uses water (evaporation and spills) E.g. 70,000 to 200,000 gallons per day for an 8 MW facility Power cost breakdown: Chillers: 30-50% of the power used by the IT equipment Air conditioning: 10-20% of the IT power, mostly due to fans How many servers can a WSC support? Each server: Nameplate power rating gives maximum power consumption To get actual, measure power under actual workloads Oversubscribe cumulative server power by 40%, but monitor power closely Physcical Infrastrcuture and Costs of WSC
16 Measuring Efficiency of a WSC Power Utilization Effectiveness (PUE) = Total facility power / IT equipment power Median PUE on 2006 study was 1.69 Today large facilities close 1.1 Performance Latency is important metric because it is seen by users Bing study: users will use search less as response time increases Service Level Objectives (SLOs)/Service Level Agreements (SLAs) E.g. 99% of requests be below 100 ms Physcical Infrastrcuture and Costs of WSC
17 Figure 6.22 Power usage effectiveness (PUE) of 10 Google WSCs over time. Google A is the WSC described in this section. It is the highest line in Q3 07 and Q2 10. (From Facebook recently announced a new datacenter that should deliver an impressive PUE of 1.07 (see The Prineville Oregon Facility has no air conditioning and no chilled water. It relies strictly on outside air, which is brought in one side of the building, filtered, cooled via misters, pumped across the IT equipment, and then sent out the building by exhaust fans. In addition, the servers use a custom power supply that allows the power distribution system to skip one of the voltage conversion steps in Figure 6.9. Copyright 2011, Elsevier Inc. All rights Reserved.
18 Cost of a WSC Capital expenditures (CAPEX) Cost to build a WSC Acquire servers, switchets Operational expenditures (OPEX) Cost to operate a WSC Physcical Infrastrcuture and Costs of WSC
19 Cloud Computing WSCs offer economies of scale that cannot be achieved with a datacenter: 5.7 times reduction in storage costs 7.1 times reduction in administrative costs 7.3 times reduction in networking costs This has given rise to cloud services such as Amazon Web Services Utility Computing Based on using open source virtual machine and operating system software Cloud Computing
20 Prgrm g Models and Workloads Batch processing framework: MapReduce Map: applies a programmer-supplied function to each logical input record Runs on thousands of computers Provides new set of key-value pairs as intermediate values Reduce: collapses values using another programmer-supplied function Programming Models and Workloads for WSCs
21 Prgrm g Models and Workloads Example: map (String key, String value): // key: document name // value: document contents for each word w in value EmitIntermediate(w, 1 ); // Produce list of all words reduce (String key, Iterator values): // key: a word // value: a list of counts int result = 0; for each v in values: result += ParseInt(v); // get integer from key-value pair Emit(AsString(result)); Programming Models and Workloads for WSCs
22 Prgrm g Models and Workloads MapReduce runtime environment schedules map and reduce task to WSC nodes Availability: Use replicas of data across different servers If one slow or fails start on another Use relaxed consistency: No need for all replicas to always agree Programming Models and Workloads for WSCs Workload demands Often vary considerably
23 Search CloudSuite Benchmark Overview Client Multithreaded client Frontend server Tomcat running Apache Nutch to n Index servers Hadoop 0.20 IPC server running Apache Nutch to n Document servers Hadoop 0.20 IPC server running Apache Nutch 1.2 Notes Index and document dataset is distributed to the number of index and document servers Apache Nutch 1.2 gives you the choice of running index and document server seperated or on the same process together Query flow description Client send query request to frontend server Frontend asks each index server to search in it s own part of index dataset for document matches All index servers search their part of index dataset in parallel Frontend server collects, sorts the replies and asks the index servers the title and url of top results (detail request) Frontend receives the urls and asks the document servers to search in it s own part of dataset for the summaries of the top documents. Frontend receives the summaries and assemblies the reply HTML page and sends 23 the answer back to client
Warehouse-Scale Computers to Exploit Request-Level and Data-Level Parallelism
Warehouse-Scale Computers to Exploit Request-Level and Data-Level Parallelism The datacenter is the computer Luiz Andre Barroso, Google (2007) Outline Introduction to WSCs Programming Models and Workloads
More informationConcepts Introduced in Chapter 6. Warehouse-Scale Computers. Programming Models for WSCs. Important Design Factors for WSCs
Concepts Introduced in Chapter 6 Warehouse-Scale Computers A cluster is a collection of desktop computers or servers connected together by a local area network to act as a single larger computer. introduction
More informationLecture 20: WSC, Datacenters. Topics: warehouse-scale computing and datacenters (Sections )
Lecture 20: WSC, Datacenters Topics: warehouse-scale computing and datacenters (Sections 6.1-6.7) 1 Warehouse-Scale Computer (WSC) 100K+ servers in one WSC ~$150M overall cost Requests from millions of
More informationEN2910A: Advanced Computer Architecture Topic 06: Supercomputers & Data Centers Prof. Sherief Reda School of Engineering Brown University
EN2910A: Advanced Computer Architecture Topic 06: Supercomputers & Data Centers Prof. Sherief Reda School of Engineering Brown University Material from: The Datacenter as a Computer: An Introduction to
More informationWarehouse-Scale Computing
ecture 31 Computer Science 61C Spring 2017 April 7th, 2017 Warehouse-Scale Computing 1 New-School Machine Structures (It s a bit more complicated!) Software Hardware Parallel Requests Assigned to computer
More informationCS 61C: Great Ideas in Computer Architecture (Machine Structures) Warehouse-Scale Computing
CS 61C: Great Ideas in Computer Architecture (Machine Structures) Warehouse-Scale Computing Instructors: Nicholas Weaver & Vladimir Stojanovic http://inst.eecs.berkeley.edu/~cs61c/ Coherency Tracked by
More informationCS 61C: Great Ideas in Computer Architecture (Machine Structures) Lecture 17 Datacenters and Cloud Compu5ng
CS 61C: Great Ideas in Computer Architecture (Machine Structures) Lecture 17 Datacenters and Cloud Compu5ng Instructor: Dan Garcia h;p://inst.eecs.berkeley.edu/~cs61c/ 2/28/13 1 In the news Google disclosed
More informationToday s Lecture. CS 61C: Great Ideas in Computer Architecture (Machine Structures) Map Reduce
CS 61C: Great Ideas in Computer Architecture (Machine Structures) Map Reduce 8/29/12 Instructors Krste Asanovic, Randy H. Katz hgp://inst.eecs.berkeley.edu/~cs61c/fa12 Fall 2012 - - Lecture #3 1 Today
More informationHot vs Cold Energy Efficient Data Centers. - SVLG Data Center Center Efficiency Summit
Hot vs Cold Energy Efficient Data Centers - SVLG Data Center Center Efficiency Summit KC Mares November 2014 The Great Debate about Hardware Inlet Temperature Feb 2003: RMI report on high-performance data
More informationThe amount of data increases every day Some numbers ( 2012):
1 The amount of data increases every day Some numbers ( 2012): Data processed by Google every day: 100+ PB Data processed by Facebook every day: 10+ PB To analyze them, systems that scale with respect
More information2/26/2017. The amount of data increases every day Some numbers ( 2012):
The amount of data increases every day Some numbers ( 2012): Data processed by Google every day: 100+ PB Data processed by Facebook every day: 10+ PB To analyze them, systems that scale with respect to
More informationCSE 124: THE DATACENTER AS A COMPUTER. George Porter November 20 and 22, 2017
CSE 124: THE DATACENTER AS A COMPUTER George Porter November 20 and 22, 2017 ATTRIBUTION These slides are released under an Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0) Creative
More informationTopics. CIT 470: Advanced Network and System Administration. Google DC in The Dalles. Google DC in The Dalles. Data Centers
CIT 470: Advanced Network and System Administration Data Centers Topics Data Center: A facility for housing a large amount of computer or communications equipment. 1. Racks 2. Power 3. PUE 4. Cooling 5.
More informationCS 61C: Great Ideas in Computer Architecture. MapReduce
CS 61C: Great Ideas in Computer Architecture MapReduce Guest Lecturer: Justin Hsia 3/06/2013 Spring 2013 Lecture #18 1 Review of Last Lecture Performance latency and throughput Warehouse Scale Computing
More informationGreen Computing: Datacentres
Green Computing: Datacentres Simin Nadjm-Tehrani Department of Computer and Information Science (IDA) Linköping University Sweden Many thanks to Jordi Cucurull For earlier versions of this course material
More informationGreen Computing: Datacentres
Green Computing: Datacentres Simin Nadjm-Tehrani Department of Computer and Information Science (IDA) Linköping University Sweden Many thanks to Jordi Cucurull For earlier versions of this course material
More informationBig Data Programming: an Introduction. Spring 2015, X. Zhang Fordham Univ.
Big Data Programming: an Introduction Spring 2015, X. Zhang Fordham Univ. Outline What the course is about? scope Introduction to big data programming Opportunity and challenge of big data Origin of Hadoop
More informationParallel Programming Principle and Practice. Lecture 10 Big Data Processing with MapReduce
Parallel Programming Principle and Practice Lecture 10 Big Data Processing with MapReduce Outline MapReduce Programming Model MapReduce Examples Hadoop 2 Incredible Things That Happen Every Minute On The
More information7 Best Practices for Increasing Efficiency, Availability and Capacity. XXXX XXXXXXXX Liebert North America
7 Best Practices for Increasing Efficiency, Availability and Capacity XXXX XXXXXXXX Liebert North America Emerson Network Power: The global leader in enabling Business-Critical Continuity Automatic Transfer
More informationData Center Fundamentals: The Datacenter as a Computer
Data Center Fundamentals: The Datacenter as a Computer George Porter CSE 124 Feb 10, 2017 Includes material from (1) Barroso, Clidaras, and Hölzle, as well as (2) Evrard (Michigan), used with permission
More informationA Computer Scientist Looks at the Energy Problem
A Computer Scientist Looks at the Energy Problem Randy H. Katz University of California, Berkeley EECS BEARS Symposium February 12, 2009 Energy permits things to exist; information, to behave purposefully.
More informationMicrosoft s Cloud. Delivering operational excellence in the cloud Infrastructure. Erik Jan van Vuuren Azure Lead Microsoft Netherlands
Microsoft s Cloud Delivering operational excellence in the cloud Infrastructure Erik Jan van Vuuren Azure Lead Microsoft Netherlands 4 Megatrends are driving a revolution 44% of users (350M people) access
More informationLarge-Scale GPU programming
Large-Scale GPU programming Tim Kaldewey Research Staff Member Database Technologies IBM Almaden Research Center tkaldew@us.ibm.com Assistant Adjunct Professor Computer and Information Science Dept. University
More informationCloud Computing Economies of Scale
Cloud Computing Economies of Scale AWS Executive Symposium 2010 James Hamilton, 2010/7/15 VP & Distinguished Engineer, Amazon Web Services email: James@amazon.com web: mvdirona.com/jrh/work blog: perspectives.mvdirona.com
More informationMSYS 4480 AC Systems Winter 2015
MSYS 4480 AC Systems Winter 2015 Module 12: DATA CENTERS By Satwinder Singh 14/05/2015 1 Data Centers Data Centers are specialized environments that safeguard your company's most valuable equipment and
More informationIntroduction to Hadoop. Owen O Malley Yahoo!, Grid Team
Introduction to Hadoop Owen O Malley Yahoo!, Grid Team owen@yahoo-inc.com Who Am I? Yahoo! Architect on Hadoop Map/Reduce Design, review, and implement features in Hadoop Working on Hadoop full time since
More informationNext-Generation Cloud Platform
Next-Generation Cloud Platform Jangwoo Kim Jun 24, 2013 E-mail: jangwoo@postech.ac.kr High Performance Computing Lab Department of Computer Science & Engineering Pohang University of Science and Technology
More informationData centers control: challenges and opportunities
1 Data centers control: challenges and opportunities Damiano Varagnolo Luleå University of Technology Aug. 24th, 2015 First part: what is a datacenter and what is its importance in our society Data centers
More informationBigDataBench-MT: Multi-tenancy version of BigDataBench
BigDataBench-MT: Multi-tenancy version of BigDataBench Gang Lu Beijing Academy of Frontier Science and Technology BigDataBench Tutorial, ASPLOS 2016 Atlanta, GA, USA n Software perspective Multi-tenancy
More informationPROCESS & DATA CENTER COOLING. How and Why To Prepare For What s To Come JOHN MORRIS REGIONAL DIRECTOR OF SALES
PROCESS & DATA CENTER COOLING How and Why To Prepare For What s To Come JOHN MORRIS REGIONAL DIRECTOR OF SALES CONGRATULATIONS 2 BOLD PREDICTION 3 Where We Are TODAY Computer Technology NOW advances more
More informationDistributed Data Infrastructures, Fall 2017, Chapter 2. Jussi Kangasharju
Distributed Data Infrastructures, Fall 2017, Chapter 2 Jussi Kangasharju Chapter Outline Warehouse-scale computing overview Workloads and software infrastructure Failures and repairs Note: Term Warehouse-scale
More informationLecture: Networks, Disks, Datacenters, GPUs. Topics: networks wrap-up, disks and reliability, datacenters, GPU intro (Sections
Lecture: Networks, Disks, Datacenters, GPUs Topics: networks wrap-up, disks and reliability, datacenters, GPU intro (Sections 6.1-6.7, App D, Ch 4) 1 Packets/Flits A message is broken into multiple packets
More informationInternet-Scale Datacenter Economics: Costs & Opportunities High Performance Transaction Systems 2011
Internet-Scale Datacenter Economics: Costs & Opportunities High Performance Transaction Systems 2011 James Hamilton, 2011/10/24 VP & Distinguished Engineer, Amazon Web Services email: James@amazon.com
More informationDatacenters. Mendel Rosenblum. CS142 Lecture Notes - Datacenters
Datacenters Mendel Rosenblum Evolution of datacenters 1960's, 1970's: a few very large time-shared computers 1980's, 1990's: heterogeneous collection of lots of smaller machines. Today and into the future:
More informationPassive RDHx as a Cost Effective Alternative to CRAH Air Cooling. Jeremiah Stikeleather Applications Engineer
Passive RDHx as a Cost Effective Alternative to CRAH Air Cooling Jeremiah Stikeleather Applications Engineer Passive RDHx as a Cost Effective Alternative to CRAH Air Cooling While CRAH cooling has been
More informationBENEFITS OF ASETEK LIQUID COOLING FOR DATA CENTERS
BENEFITS OF ASETEK LIQUID COOLING FOR DATA CENTERS Asetek has leveraged its expertise as the world-leading provider of efficient liquid cooling systems to create its RackCDU and ISAC direct-to-chip liquid
More informationThe MapReduce Abstraction
The MapReduce Abstraction Parallel Computing at Google Leverages multiple technologies to simplify large-scale parallel computations Proprietary computing clusters Map/Reduce software library Lots of other
More informationSession 6: Data Center Energy Management Strategies
Session 6: Data Center Energy Management Strategies Data Center Energy Use and Opportunities: Applying Best Practices to meet Executive Order Requirements Dale Sartor, PE Lawrence Berkeley National Laboratory
More informationPrefabricated Data Center Solutions: Coming of Age
Prefabricated Data Center Solutions: Coming of Age Steven Carlini - Global Data Center Marketing Director April 2014 Schneider Electric 1 Schneider Electric 2 Your ability to compete relies on the design,
More informationInfrastructure Innovation Opportunities Y Combinator 2013
Infrastructure Innovation Opportunities Y Combinator 2013 James Hamilton, 2013/1/22 VP & Distinguished Engineer, Amazon Web Services email: James@amazon.com web: mvdirona.com/jrh/work blog: perspectives.mvdirona.com
More informationCase Study: Energy Systems Integration Facility (ESIF)
Case Study: Energy Systems Integration Facility (ESIF) at the U.S. Department of Energy s National Renewable Energy Laboratory (NREL) in Golden, Colorado OFFICE HIGH BAY LABORATORIES DATA CENTER 1 Case
More informationSystem Overview. Liquid Cooling for Data Centers. Negative Pressure Cooling Without Risk of Leaks
Liquid Cooling for Data Centers System Overview The Chilldyne Cool-Flo System is a direct-to-chip liquid cooling system that delivers coolant under negative pressure. Chilldyne s technologies were designed
More informationMulti-tenancy version of BigDataBench
Multi-tenancy version of BigDataBench Gang Lu Institute of Computing Technology, Chinese Academy of Sciences BigDataBench Tutorial MICRO 2014 Cambridge, UK INSTITUTE OF COMPUTING TECHNOLOGY 1 Multi-tenancy
More informationLecture 11 Hadoop & Spark
Lecture 11 Hadoop & Spark Dr. Wilson Rivera ICOM 6025: High Performance Computing Electrical and Computer Engineering Department University of Puerto Rico Outline Distributed File Systems Hadoop Ecosystem
More informationCSE 344 MAY 2 ND MAP/REDUCE
CSE 344 MAY 2 ND MAP/REDUCE ADMINISTRIVIA HW5 Due Tonight Practice midterm Section tomorrow Exam review PERFORMANCE METRICS FOR PARALLEL DBMSS Nodes = processors, computers Speedup: More nodes, same data
More informationWhere We Are. Review: Parallel DBMS. Parallel DBMS. Introduction to Data Management CSE 344
Where We Are Introduction to Data Management CSE 344 Lecture 22: MapReduce We are talking about parallel query processing There exist two main types of engines: Parallel DBMSs (last lecture + quick review)
More informationData Center Trends and Challenges
Data Center Trends and Challenges, IBM Fellow Chief Engineer - Data Center Energy Efficiency Customers challenges are driven by the following Increasing IT demand Continued cost pressure Responsive to
More informationIn the news. Request- Level Parallelism (RLP) Agenda 10/7/11
In the news "The cloud isn't this big fluffy area up in the sky," said Rich Miller, who edits Data Center Knowledge, a publicaeon that follows the data center industry. "It's buildings filled with servers
More informationRAMCloud. Scalable High-Performance Storage Entirely in DRAM. by John Ousterhout et al. Stanford University. presented by Slavik Derevyanko
RAMCloud Scalable High-Performance Storage Entirely in DRAM 2009 by John Ousterhout et al. Stanford University presented by Slavik Derevyanko Outline RAMCloud project overview Motivation for RAMCloud storage:
More informationRAMCloud and the Low- Latency Datacenter. John Ousterhout Stanford University
RAMCloud and the Low- Latency Datacenter John Ousterhout Stanford University Most important driver for innovation in computer systems: Rise of the datacenter Phase 1: large scale Phase 2: low latency Introduction
More informationEnergy Logic: Emerson Network Power. A Roadmap for Reducing Energy Consumption in the Data Center. Ross Hammond Managing Director
Energy Logic: A Roadmap for Reducing Energy Consumption in the Data Center Emerson Network Power Ross Hammond Managing Director Agenda Energy efficiency Where We Are Today What Data Center Managers Are
More informationThe End of Redundancy. Alan Wood Sun Microsystems May 8, 2009
The End of Redundancy Alan Wood Sun Microsystems May 8, 2009 Growing Demand, Shrinking Resources By 2008, 50% of current data centers will have insufficient power and cooling capacity to meet the demands
More informationPlanning a Green Datacenter
Planning a Green Datacenter Dr. Dominique Singy Contribution to: Green IT Crash Course Organisator: Green IT Special Interest Group of the Swiss Informatics Society ETH-Zurich / February 13, Green Data
More informationCloud Programming. Programming Environment Oct 29, 2015 Osamu Tatebe
Cloud Programming Programming Environment Oct 29, 2015 Osamu Tatebe Cloud Computing Only required amount of CPU and storage can be used anytime from anywhere via network Availability, throughput, reliability
More informationCPSC 426/526. Cloud Computing. Ennan Zhai. Computer Science Department Yale University
CPSC 426/526 Cloud Computing Ennan Zhai Computer Science Department Yale University Recall: Lec-7 In the lec-7, I talked about: - P2P vs Enterprise control - Firewall - NATs - Software defined network
More informationEnergy Usage in Cloud Part2. Salih Safa BACANLI
Energy Usage in Cloud Part2 Salih Safa BACANLI Cooling Virtualization Energy Proportional System Conclusion Cooling After servers, the second largest consumer of power in a data center is the cooling system.(kava,
More informationParallel Nested Loops
Parallel Nested Loops For each tuple s i in S For each tuple t j in T If s i =t j, then add (s i,t j ) to output Create partitions S 1, S 2, T 1, and T 2 Have processors work on (S 1,T 1 ), (S 1,T 2 ),
More informationParallel Partition-Based. Parallel Nested Loops. Median. More Join Thoughts. Parallel Office Tools 9/15/2011
Parallel Nested Loops Parallel Partition-Based For each tuple s i in S For each tuple t j in T If s i =t j, then add (s i,t j ) to output Create partitions S 1, S 2, T 1, and T 2 Have processors work on
More informationCISC 7610 Lecture 2b The beginnings of NoSQL
CISC 7610 Lecture 2b The beginnings of NoSQL Topics: Big Data Google s infrastructure Hadoop: open google infrastructure Scaling through sharding CAP theorem Amazon s Dynamo 5 V s of big data Everyone
More informationRecent Trends. CS 537 Section 11 Large Scale Systems. Effects of Google s HW philosophy. Google Design Philosophy 5/5/09
Recent Trends CS 537 Section 11 Large Scale Systems Michael Swift Computing is moving away from the desktop To mobile device: smart phones To data centers: cloud computing Why? Cheap communication Enables
More informationCAS 2K13 Sept Jean-Pierre Panziera Chief Technology Director
CAS 2K13 Sept. 2013 Jean-Pierre Panziera Chief Technology Director 1 personal note 2 Complete solutions for Extreme Computing b ubullx ssupercomputer u p e r c o p u t e r suite s u e Production ready
More informationCOST EFFICIENCY VS ENERGY EFFICIENCY. Anna Lepak Universität Hamburg Seminar: Energy-Efficient Programming Wintersemester 2014/2015
COST EFFICIENCY VS ENERGY EFFICIENCY Anna Lepak Universität Hamburg Seminar: Energy-Efficient Programming Wintersemester 2014/2015 TOPIC! Cost Efficiency vs Energy Efficiency! How much money do we have
More informationHow High Temperature Data Centers & Intel Technologies save Energy, Money, Water and Greenhouse Gas Emissions
Intel Intelligent Power Management Intel How High Temperature Data Centers & Intel Technologies save Energy, Money, Water and Greenhouse Gas Emissions Power savings through the use of Intel s intelligent
More informationTopics. Big Data Analytics What is and Why Hadoop? Comparison to other technologies Hadoop architecture Hadoop ecosystem Hadoop usage examples
Hadoop Introduction 1 Topics Big Data Analytics What is and Why Hadoop? Comparison to other technologies Hadoop architecture Hadoop ecosystem Hadoop usage examples 2 Big Data Analytics What is Big Data?
More informationCloud Computing. What is cloud computing. CS 537 Fall 2017
Cloud Computing CS 537 Fall 2017 What is cloud computing Illusion of infinite computing resources available on demand Scale-up for most apps Elimination of up-front commitment Small initial investment,
More informationExploiting Temporal Diversity of Water Efficiency to Make Data Center Less "Thirsty"
Exploiting Temporal Diversity of Water Efficiency to Make Data Center Less "Thirsty" M. A. Islam, K. Ahmed, Shaolei Ren, and G. Quan Florida International University 1 A massive data center Google's data
More informationCloud Computing Economics
Cloud Computing Economics Cloud for the Enterprise 2010 James Hamilton, 2010/11/11 VP & Distinguished Engineer, Amazon Web Services email: James@amazon.com web: mvdirona.com/jrh/work blog: perspectives.mvdirona.com
More informationServer room guide helps energy managers reduce server consumption
Server room guide helps energy managers reduce server consumption Jan Viegand Viegand Maagøe Nr. Farimagsgade 37 1364 Copenhagen K Denmark jv@viegandmaagoe.dk Keywords servers, guidelines, server rooms,
More informationEmbedded Technosolutions
Hadoop Big Data An Important technology in IT Sector Hadoop - Big Data Oerie 90% of the worlds data was generated in the last few years. Due to the advent of new technologies, devices, and communication
More informationMapReduce: A Programming Model for Large-Scale Distributed Computation
CSC 258/458 MapReduce: A Programming Model for Large-Scale Distributed Computation University of Rochester Department of Computer Science Shantonu Hossain April 18, 2011 Outline Motivation MapReduce Overview
More informationDatabase System Architectures Parallel DBs, MapReduce, ColumnStores
Database System Architectures Parallel DBs, MapReduce, ColumnStores CMPSCI 445 Fall 2010 Some slides courtesy of Yanlei Diao, Christophe Bisciglia, Aaron Kimball, & Sierra Michels- Slettvet Motivation:
More informationIntroduction to Data Management CSE 344
Introduction to Data Management CSE 344 Lecture 24: MapReduce CSE 344 - Fall 2016 1 HW8 is out Last assignment! Get Amazon credits now (see instructions) Spark with Hadoop Due next wed CSE 344 - Fall 2016
More informationJason Waxman General Manager High Density Compute Division Data Center Group
Jason Waxman General Manager High Density Compute Division Data Center Group Today 2015 More Users Only 25% of the world is Internet connected today 1 New technologies will connect over 1 billion additional
More informationData Center Energy Savings By the numbers
Data Center Energy Savings By the numbers Chris Kurkjian, PE HP Critical Facility Services provided by EYP Mission Critical Facilities 2008 Hewlett-Packard Development Company, L.P. The information contained
More informationBest Practices for Setting BIOS Parameters for Performance
White Paper Best Practices for Setting BIOS Parameters for Performance Cisco UCS E5-based M3 Servers May 2013 2014 Cisco and/or its affiliates. All rights reserved. This document is Cisco Public. Page
More informationHiTune. Dataflow-Based Performance Analysis for Big Data Cloud
HiTune Dataflow-Based Performance Analysis for Big Data Cloud Jinquan (Jason) Dai, Jie Huang, Shengsheng Huang, Bo Huang, Yan Liu Intel Asia-Pacific Research and Development Ltd Shanghai, China, 200241
More informationInternet Scale Storage
Internet Scale Storage SIGMOD 2011 James Hamilton, 2011/6/14 VP & Distinguished Engineer, Amazon Web Services email: James@amazon.com web: mvdirona.com/jrh/work blog: perspectives.mvdirona.com Agenda Cloud
More informationWhen Milliseconds Matter. The Definitive Buying Guide to Network Services for Healthcare Organizations
When Milliseconds Matter The Definitive Buying Guide to Network Services for Healthcare Organizations The Changing Landscape of Healthcare IT Pick any of the top trends in healthcare and you ll find both
More informationData Clustering on the Parallel Hadoop MapReduce Model. Dimitrios Verraros
Data Clustering on the Parallel Hadoop MapReduce Model Dimitrios Verraros Overview The purpose of this thesis is to implement and benchmark the performance of a parallel K- means clustering algorithm on
More informationHigh Volume Throughput Computers (HVC): An ICT View of Datacenter Computers
High Volume Throughput Computers (HVC): An ICT View of Datacenter Computers Jianfeng Zhan ( 詹剑锋 ) http://prof.ict.ac.cn/jfzhan http://weibo.com/jfzhan Outline Motivation Related work Challenges and Opportunities
More informationEcolibrium. Cloud control. Keeping data centres cool. OCTOBER 2018 VOLUME 17.9 RRP $14.95 PRINT POST APPROVAL NUMBER PP352532/00001
Ecolibrium OCTOBER 2018 VOLUME 17.9 RRP $14.95 PRINT POST APPROVAL NUMBER PP352532/00001 Cloud control Keeping data centres cool. COVER FEATURE Cloud control Such is the growth in data centres globally
More informationGreen Data Centers A Guideline
Green Data Centers A Guideline Sekhar Kondepudi Ph.D Associate Professor Smart Buildings & Smart Cities Vice Chair Focus Group on Smart Sustainable Cities ITU-TRCSL Workshop on Greening the Future: Bridging
More informationCloud & Datacenter EGA
Cloud & Datacenter EGA The Stock Exchange of Thailand Materials excerpt from SET internal presentation and virtualization vendor e.g. vmware For Educational purpose and Internal Use Only SET Virtualization/Cloud
More informationCS6030 Cloud Computing. Acknowledgements. Today s Topics. Intro to Cloud Computing 10/20/15. Ajay Gupta, WMU-CS. WiSe Lab
CS6030 Cloud Computing Ajay Gupta B239, CEAS Computer Science Department Western Michigan University ajay.gupta@wmich.edu 276-3104 1 Acknowledgements I have liberally borrowed these slides and material
More informationHow much power?! Data Centre Efficiency. How much Money?! Carbon Reduction Commitment (CRC)
How much power?! Data Centre Efficiency The US EPA found in 2006 data centres used 1.5% of total US consumption (5) For Bangor University data centres use approximately 5-10% of the total university electricity
More informationHow Liquid Cooling Helped Two University Data Centers Achieve Cooling Efficiency Goals. Michael Gagnon Coolcentric October
How Liquid Cooling Helped Two University Data Centers Achieve Cooling Efficiency Goals Michael Gagnon Coolcentric October 13 2011 Data Center Trends Energy consumption in the Data Center (DC) 2006-61 billion
More informationPLATFORM AND SOFTWARE AS A SERVICE THE MAPREDUCE PROGRAMMING MODEL AND IMPLEMENTATIONS
PLATFORM AND SOFTWARE AS A SERVICE THE MAPREDUCE PROGRAMMING MODEL AND IMPLEMENTATIONS By HAI JIN, SHADI IBRAHIM, LI QI, HAIJUN CAO, SONG WU and XUANHUA SHI Prepared by: Dr. Faramarz Safi Islamic Azad
More informationNutanix Tech Note. Virtualizing Microsoft Applications on Web-Scale Infrastructure
Nutanix Tech Note Virtualizing Microsoft Applications on Web-Scale Infrastructure The increase in virtualization of critical applications has brought significant attention to compute and storage infrastructure.
More informationCooling on Demand - scalable and smart cooling solutions. Marcus Edwards B.Sc.
Cooling on Demand - scalable and smart cooling solutions Marcus Edwards B.Sc. Schroff UK Limited Data Centre cooling what a waste of money! A 1MW data center needs 177,000,000 kwh in its lifetime of 10
More informationHybrid Warm Water Direct Cooling Solution Implementation in CS300-LC
Hybrid Warm Water Direct Cooling Solution Implementation in CS300-LC Roger Smith Mississippi State University Giridhar Chukkapalli Cray, Inc. C O M P U T E S T O R E A N A L Y Z E 1 Safe Harbor Statement
More informationProgramming Models MapReduce
Programming Models MapReduce Majd Sakr, Garth Gibson, Greg Ganger, Raja Sambasivan 15-719/18-847b Advanced Cloud Computing Fall 2013 Sep 23, 2013 1 MapReduce In a Nutshell MapReduce incorporates two phases
More informationMagellan Project. Jeff Broughton NERSC Systems Department Head October 7, 2009
Magellan Project Jeff Broughton NERSC Systems Department Head October 7, 2009 1 Magellan Background National Energy Research Scientific Computing Center (NERSC) Argonne Leadership Computing Facility (ALCF)
More informationOptimizing Apache Spark with Memory1. July Page 1 of 14
Optimizing Apache Spark with Memory1 July 2016 Page 1 of 14 Abstract The prevalence of Big Data is driving increasing demand for real -time analysis and insight. Big data processing platforms, like Apache
More informationGreen IT Project: The Business Case
Green IT Project: The Business Case Indumathi Lnu University at Albany DATE We will discuss Identifying green IT projects Funding green IT projects Building the business case Case study: University at
More informationCS / Cloud Computing. Recitation 3 September 9 th & 11 th, 2014
CS15-319 / 15-619 Cloud Computing Recitation 3 September 9 th & 11 th, 2014 Overview Last Week s Reflection --Project 1.1, Quiz 1, Unit 1 This Week s Schedule --Unit2 (module 3 & 4), Project 1.2 Questions
More informationPower Systems for Your Business
Hotel Mulia Jakarta Power Systems for Your Business Septia Sukariningrum Power Systems Technical Sales Specialist IBM Indonesia The datacenter is changing Server sprawl resulting in lack of space Datacenter
More informationDatacenter Efficiency Trends. Cary Roberts Tellme, a Microsoft Subsidiary
Datacenter Efficiency Trends Cary Roberts Tellme, a Microsoft Subsidiary Power Slashing Tools P3 Kill-A-WATT Measures min, max, and instantaneous voltage and current draw Calculates power (watts), kilowatt
More informationCS 61C: Great Ideas in Computer Architecture (Machine Structures) Warehouse-Scale Computing, MapReduce, and Spark
CS 61C: Great Ideas in Computer Architecture (Machine Structures) Warehouse-Scale Computing, MapReduce, and Spark Instructors: Krste Asanović & Randy H. Katz http://inst.eecs.berkeley.edu/~cs61c/ 11/8/17
More informationEnergy-Efficient Datacenters
Energy-Efficient Datacenters Massoud Pedram, Fellow, IEEE Abstract Pervasive use of cloud computing and the resulting rise in the number of datacenters and hosting centers (which provide platform or software
More informationcontext: massive systems
cutting the electric bill for internetscale systems Asfandyar Qureshi (MIT) Rick Weber (Akamai) Hari Balakrishnan (MIT) John Guttag (MIT) Bruce Maggs (Duke/Akamai) Éole @ flickr context: massive systems
More information