Data Management and Analysis for Energy Efficient HPC Centers

1 Data Management and Analysis for Energy Efficient HPC Centers Presented by Ghaleb Abdulla, Anna Maria Bailey and John Weaver; Copyright 2015 OSIsoft, LLC

2 Data management and analysis for energy efficient HPC centers. Ghaleb Abdulla, Anna Maria Bailey and John Weaver. LLNL-PRES-XXXXXX. This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract DE-AC52-07NA. Lawrence Livermore National Security, LLC

3 Acknowledgement: Philip Top (LLNL); Chuck Wells (OSIsoft); EEHPC D/R group; EEHPC PUE/TUE group

4 Data collection helps assess the health and efficiency of our facilities: perform event analysis (loss of power, voltage sag (dip), voltage swell, etc.); understand where and how power is used; perform energy efficiency studies; plan for exascale computing; understand current usage patterns; relate component-, system-, and facility-level data.
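
The event categories named on this slide can be illustrated with a minimal Python sketch that labels a single RMS voltage reading. The per-unit thresholds (0.1, 0.9, and 1.1) and the function name are assumptions chosen for illustration and do not come from the slides; a real event analysis would also take event duration into account.

```python
def classify_voltage_event(v_rms: float, v_nominal: float) -> str:
    """Label one RMS voltage reading relative to the nominal voltage.

    Thresholds are illustrative assumptions, expressed per unit (pu):
    below 0.1 pu -> interruption, below 0.9 pu -> sag, above 1.1 pu -> swell.
    """
    if v_nominal <= 0:
        raise ValueError("nominal voltage must be positive")
    per_unit = v_rms / v_nominal
    if per_unit < 0.1:
        return "interruption"
    if per_unit < 0.9:
        return "sag"
    if per_unit > 1.1:
        return "swell"
    return "normal"


# Hypothetical readings on a 480 V nominal feed.
labels = [classify_voltage_event(v, 480.0) for v in (0.0, 384.0, 480.0, 540.0)]
```

Here `labels` holds one category per reading, which could then be correlated with other data streams such as weather or rack power.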

5 Implement a centralized system. Manage: gather and evaluate large amounts of data. Data sources: rack, equipment, metering, building management, and utility interfaces, with tens of thousands of real-time data streams. Analyze: correlate real-time data. Notify: centralized event notification. Visualize: view data and reports. Created an operational, event, and real-time data management infrastructure for all external and internal data sources.

6 Current data sources spread across LLNL (diagram labels): Synap Cabarnet Env. Data; ALC Building Mgmt. System; Nexus HPC Rack Metering; Sequoia & Vulcan Power; OSIsoft PI System; MV90/EEM Sitewide Metering; Siemens SCADA Electric Utility Data; Forseer Env. Monitoring; PMU (B181 & B453); LLNL Weather and Env. Info.

7 A hierarchical data structure for efficient data discovery

8 Data analysis: exploratory data analysis. Use Coresight for quick data exploration tasks; use PI DataLink for reporting and some data analysis tasks; use R for more advanced or repetitive tasks (example: motif matching).

9 Agenda for the rest of the talk: 1. Demand response and dynamic power management (DPM); 2. Energy efficiency metrics; 3. Power quality and outage; 4. Power usage characteristics of Sequoia; 5. PI Data Server compression study

10 What is dynamic power management? Adjusting power parameters on the fly while ensuring that the deadlines of running software are met. There are several strategies; we are looking into one fine-grained power management strategy: power capping, that is, running the CPU under a power bound.

11 Appro system: Intel Xeon E5-2670 processors; OS: TOSS; interconnect: IB QDR; 426 TeraFLOP/s peak; memory: 41,472 GB; 1,296 nodes, 16 cores/node; power: 564 kW in 675 ft²; #94 on the November 2013 Top 500 list

12 Power bound experiment: an embarrassingly parallel application and a memory-bound application, single-socket runs. Run the application with the same power bound across the cluster's processors. Characterize processor variations across several power bounds. The data shows that dynamic power management will be challenging.
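
One way to characterize processor variation under a common power bound is to normalize each processor's runtime by the fastest one. The sketch below does that in Python. The slides do not define the normalization they used, so dividing by the fastest runtime is an assumption, and the processor names and runtimes are hypothetical.

```python
def normalized_slowdown(runtimes_s):
    """Map each processor to its runtime divided by the fastest runtime.

    A value of 1.0 marks the fastest processor; 1.25 means 25% slower.
    """
    if not runtimes_s:
        raise ValueError("need at least one runtime")
    fastest = min(runtimes_s.values())
    return {cpu: t / fastest for cpu, t in runtimes_s.items()}


# Hypothetical runtimes (seconds) for one application at one power bound.
runtimes_65w = {"cpu0": 100.0, "cpu1": 110.0, "cpu2": 125.0}
slowdown = normalized_slowdown(runtimes_65w)

# Spread between the slowest and fastest processor at this power bound.
spread = max(slowdown.values()) - min(slowdown.values())
```

Repeating the calculation for each power bound gives one spread per bound, which is a simple summary of how much the processors diverge as the bound tightens.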

13 [Figure: processor performance with 80 W and 65 W power bounds (PB); axes: effective clock frequency multiplier and normalized slow-down]

14 Energy efficiency metrics: PUE/TUE. We found a meter that was not reporting correctly (data redundancy).

15 Energy efficiency metrics

16 Sequoia parameters: IBM Blue Gene*/Q architecture; 98,304 nodes; 1,572,864 cores; 20 PF, 3rd on Top 500 June; racks; 91% liquid cooled, 30 gpm/rack at 62 °F; 9% air cooled, 1700 cfm/rack at 70 °F; 4800 square feet. *Copyright 2013 by International Business Machines Corporation

17 PUE dashboard: PUE is calculated using the metered data (not Sequoia rack power). PUE is now a tag in the DB. We found a meter that was not reporting correctly; data were verified using different sensors and interfaces. High spikes occur when Sequoia is down for maintenance. There is a regular maintenance schedule with one major outage. Daily and weekly cycles are visible.
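
The metrics on this dashboard can be sketched in a few lines of Python. The slides do not spell out the formulas, so the sketch assumes the commonly used definitions: PUE is total facility power divided by IT equipment power, and TUE is ITUE multiplied by PUE. The function names, the 5% tolerance for the redundant-meter check, and all readings are hypothetical.

```python
def pue(total_facility_kw: float, it_kw: float) -> float:
    """Power usage effectiveness: total facility power over IT power."""
    if it_kw <= 0:
        raise ValueError("IT power must be positive")
    return total_facility_kw / it_kw


def tue(pue_value: float, itue_value: float) -> float:
    """Total usage effectiveness, assuming TUE = ITUE * PUE."""
    return itue_value * pue_value


def meters_disagree(a_kw: float, b_kw: float, rel_tol: float = 0.05) -> bool:
    """Flag two redundant meters whose readings differ by more than rel_tol."""
    reference = max(abs(a_kw), abs(b_kw))
    return reference > 0 and abs(a_kw - b_kw) / reference > rel_tol


# Hypothetical readings: normal operation versus a maintenance window.
pue_running = pue(6000.0, 5000.0)      # 1.2
pue_maintenance = pue(880.0, 180.0)    # about 4.9
```

The second call shows why the dashboard spikes during maintenance: the facility overhead stays roughly constant while the IT load in the denominator collapses.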

18 [Figure: PUE and TUE for Sequoia versus date and time, July through September 2014]

19 Power quality and outage

20 Event time and date: 7:41 pm on 10/27/2013

21 Event time and date: 10:51 pm on 02/08/2015. Wind with gusts over 67 always has a SW or SSW direction; winds gusting below 67 come from other directions.

22 Power usage characteristics of Sequoia

23 Bursty energy (power) usage: user jobs. Bringing the machine down for maintenance results in dumping over 5 MW of power back to the grid in a short period of time. Real workloads are bursty, and their power fluctuations are more abrupt.

24 Characterizing the maintenance schedule: scheduled maintenance happens every Wednesday. Base load is ~5 MW and can go down to 180 kW or lower. Duration depends on the kind of maintenance taking place. It takes ~50 seconds to go from 5.5 MW down to 100 kW.
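
The shutdown figures on this slide imply an average ramp rate, which the Python sketch below works out, together with a simple scan for the largest power drop within a time window. The function names and the sample series are hypothetical; only the 5.5 MW, 100 kW, and 50-second values come from the slide.

```python
def ramp_rate_kw_per_s(start_kw: float, end_kw: float, duration_s: float) -> float:
    """Average rate of change in kW per second (negative when power falls)."""
    if duration_s <= 0:
        raise ValueError("duration must be positive")
    return (end_kw - start_kw) / duration_s


def largest_drop_kw(samples, window_s: float) -> float:
    """Largest power drop between any two samples at most window_s apart.

    samples is a list of (time_s, power_kw) pairs sorted by time.
    """
    best = 0.0
    for i, (t0, p0) in enumerate(samples):
        for t1, p1 in samples[i + 1:]:
            if t1 - t0 > window_s:
                break
            best = max(best, p0 - p1)
    return best


# Slide values: 5.5 MW down to 100 kW in about 50 seconds.
rate = ramp_rate_kw_per_s(5500.0, 100.0, 50.0)   # -108.0 kW/s

# Hypothetical time series around a shutdown event.
series = [(0, 5500.0), (25, 2800.0), (50, 100.0), (75, 100.0)]
drop = largest_drop_kw(series, window_s=50)      # 5400.0 kW
```

An average of about 108 kW shed per second is the kind of number a utility would care about when planning for these events.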

25 A closer look at the Sequoia shutdown events

26 PI Data Server Compression Study

27 The question: how does compression change the characteristics of the signal? Frequency data from a PMU device: Arbiter Systems, Model 1133A Power Sentinel.

28 Experiment: collect raw data and store it in binary format on the file system. Collect data into the PI Data Server with different levels of compression. Use FFT analysis to compare the signal with no compression against the signal at different levels of compression. Averaging covers 6 hours using 15-minute windows.
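
The comparison described in this experiment can be sketched in Python on synthetic data. Two loud assumptions apply: the `deadband_hold` function is a simple stand-in for lossy compression and is not the PI Data Server's compression algorithm, and the signal, sample rate, and deviation value are invented for illustration. A direct DFT is used so the example needs only the standard library.

```python
import cmath
import math


def amplitude_spectrum(samples):
    """Single-sided amplitude spectrum via a direct DFT (bins 0 .. n/2 - 1)."""
    n = len(samples)
    spectrum = []
    for f in range(n // 2):
        acc = sum(samples[k] * cmath.exp(-2j * math.pi * f * k / n) for k in range(n))
        scale = 1.0 if f == 0 else 2.0   # the DC bin is not doubled
        spectrum.append(scale * abs(acc) / n)
    return spectrum


def deadband_hold(samples, deviation):
    """Stand-in compression: hold the last kept value until the change exceeds deviation."""
    kept = samples[0]
    out = []
    for value in samples:
        if abs(value - kept) > deviation:
            kept = value
        out.append(kept)
    return out


# Synthetic frequency-deviation signal: 1 Hz and 8 Hz components, 32 samples/s, 2 s.
fs, n = 32, 64
raw = [0.02 * math.sin(2 * math.pi * 1 * k / fs) + 0.002 * math.sin(2 * math.pi * 8 * k / fs)
       for k in range(n)]
compressed = deadband_hold(raw, deviation=0.005)

raw_spec = amplitude_spectrum(raw)          # bin width is fs / n = 0.5 Hz
comp_spec = amplitude_spectrum(compressed)
distortion = [abs(a - b) for a, b in zip(raw_spec, comp_spec)]
```

Plotting `distortion` against frequency for several deviation values would give a rough analogue of the amplitude-distortion comparison, and averaging spectra over many windows would reduce the noise in that estimate.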

29 Amplitude distortion and compression level [Figure: uncompressed versus compressed spectrum; x-axis: frequency (Hz)]

30 Amplitude distortion and compression level [Figure: uncompressed versus compressed spectrum; x-axis: frequency (Hz)]

31 PI compression level and distortion frequency: frequencies higher than 10 Hz will be distorted with compression level above ... [Figure: distortion frequency and cutoff frequency versus PI compression level]

32 Compression Ratios

33 Distance and signal divergence [Figure: PMU separation vs. frequency difference; x-axis: distance (m); y-axis: frequency of divergence (Hz)]

34 Ghaleb Abdulla, Senior Computer Scientist; Director, Institute for Scientific Computing Research

35 Questions: please wait for the microphone before asking your questions; state your name and company.


More information

Disaster Recovery and Mitigation: Is your business prepared when disaster hits?

Disaster Recovery and Mitigation: Is your business prepared when disaster hits? 1 Disaster Recovery and Mitigation: Is your business prepared when disaster hits? 2 Our speaker today: Catherine Roy, Director of PMO at Hosting 15 years Project Management experience At HOSTING since

More information

The Hopper System: How the Largest* XE6 in the World Went From Requirements to Reality! Katie Antypas, Tina Butler, and Jonathan Carter

The Hopper System: How the Largest* XE6 in the World Went From Requirements to Reality! Katie Antypas, Tina Butler, and Jonathan Carter The Hopper System: How the Largest* XE6 in the World Went From Requirements to Reality! Katie Antypas, Tina Butler, and Jonathan Carter CUG 2011, May 25th, 2011 1 Requirements to Reality Develop RFP Select

More information

OSIsoft Cloud Services Core Infrastructure for Developing Partner Applications

OSIsoft Cloud Services Core Infrastructure for Developing Partner Applications OSIsoft Cloud Services Core Infrastructure for Developing Partner Applications Presented by Laurent Garrigues, Gregg Le Blanc, Paul Kaiser Agenda Overview Platform Tour Demo Partner Preview Program Q&A

More information

Predictive Resilience: Leveraging Integrated DCIM to Reduce Data Center Downtime

Predictive Resilience: Leveraging Integrated DCIM to Reduce Data Center Downtime July 17, 2013 Predictive Resilience: Leveraging Integrated DCIM to Reduce Data Center Downtime siemens.com/answers Predictive Resilience Table of content What is DCIM & Its Business Value Extract Maximum

More information

HPC Solutions in High Density Data Centers

HPC Solutions in High Density Data Centers Executive Report HPC Solutions in High Density Data Centers How CyrusOne s Houston West data center campus delivers the highest density solutions to customers With the ever-increasing demand on IT resources,

More information

HPC Technology Trends

HPC Technology Trends HPC Technology Trends High Performance Embedded Computing Conference September 18, 2007 David S Scott, Ph.D. Petascale Product Line Architect Digital Enterprise Group Risk Factors Today s s presentations

More information

IBM InfoSphere Streams v4.0 Performance Best Practices

IBM InfoSphere Streams v4.0 Performance Best Practices Henry May IBM InfoSphere Streams v4.0 Performance Best Practices Abstract Streams v4.0 introduces powerful high availability features. Leveraging these requires careful consideration of performance related

More information

HPC Downtime Budgets: Moving SRE Practice to the Rest of the World

HPC Downtime Budgets: Moving SRE Practice to the Rest of the World LA-UR-16-24361 HPC Downtime Budgets: Moving SRE Practice to the Rest of the World SREcon Europe 2016 Cory Lueninghoener July 12, 2016 Operated by Los Alamos National Security, LLC for the U.S. Department

More information

Roadmapping of HPC interconnects

Roadmapping of HPC interconnects Roadmapping of HPC interconnects MIT Microphotonics Center, Fall Meeting Nov. 21, 2008 Alan Benner, bennera@us.ibm.com Outline Top500 Systems, Nov. 2008 - Review of most recent list & implications on interconnect

More information

COST EFFICIENCY VS ENERGY EFFICIENCY. Anna Lepak Universität Hamburg Seminar: Energy-Efficient Programming Wintersemester 2014/2015

COST EFFICIENCY VS ENERGY EFFICIENCY. Anna Lepak Universität Hamburg Seminar: Energy-Efficient Programming Wintersemester 2014/2015 COST EFFICIENCY VS ENERGY EFFICIENCY Anna Lepak Universität Hamburg Seminar: Energy-Efficient Programming Wintersemester 2014/2015 TOPIC! Cost Efficiency vs Energy Efficiency! How much money do we have

More information

SKA. data processing, storage, distribution. Jean-Marc Denis Head of Strategy, BigData & HPC. Oct. 16th, 2017 Paris. Coyright Atos 2017

SKA. data processing, storage, distribution. Jean-Marc Denis Head of Strategy, BigData & HPC. Oct. 16th, 2017 Paris. Coyright Atos 2017 SKA data processing, storage, distribution Oct. 16th, 2017 Paris Jean-Marc Denis Head of Strategy, BigData & HPC Coyright Atos 2017 What are SKA data and compute needs? Some key numbers Source: Image Swinburn

More information

Slurm Configuration Impact on Benchmarking

Slurm Configuration Impact on Benchmarking Slurm Configuration Impact on Benchmarking José A. Moríñigo, Manuel Rodríguez-Pascual, Rafael Mayo-García CIEMAT - Dept. Technology Avda. Complutense 40, Madrid 28040, SPAIN Slurm User Group Meeting 16

More information

LAMMPS Performance Benchmark and Profiling. July 2012

LAMMPS Performance Benchmark and Profiling. July 2012 LAMMPS Performance Benchmark and Profiling July 2012 Note The following research was performed under the HPC Advisory Council activities Participating vendors: AMD, Dell, Mellanox Compute resource - HPC

More information

HP GTC Presentation May 2012

HP GTC Presentation May 2012 HP GTC Presentation May 2012 Today s Agenda: HP s Purpose-Built SL Server Line Desktop GPU Computing Revolution with HP s Z Workstations Hyperscale the new frontier for HPC New HPC customer requirements

More information

Welcome to PI World Transmission & Distribution Industry Session

Welcome to PI World Transmission & Distribution Industry Session Welcome to PI World Transmission & Distribution Industry Session Kevin P Walsh Bill McEvoy OSIsoft Power and Utilities Team Kevin P Walsh Global T&D and Smart Grids William E. McEvoy - Global Distributed

More information

A Comparative Study of High Performance Computing on the Cloud. Lots of authors, including Xin Yuan Presentation by: Carlos Sanchez

A Comparative Study of High Performance Computing on the Cloud. Lots of authors, including Xin Yuan Presentation by: Carlos Sanchez A Comparative Study of High Performance Computing on the Cloud Lots of authors, including Xin Yuan Presentation by: Carlos Sanchez What is The Cloud? The cloud is just a bunch of computers connected over

More information

Shared Infrastructure: Scale-Out Advantages and Effects on TCO

Shared Infrastructure: Scale-Out Advantages and Effects on TCO Shared Infrastructure: Scale-Out Advantages and Effects on TCO A Dell TM Technical White Paper Dell Data Center Solutions THIS WHITE PAPER IS FOR INFORMATIONAL PURPOSES ONLY, AND MAY CONTAIN TYPOGRAPHICAL

More information

Energy Logic: Emerson Network Power. A Roadmap for Reducing Energy Consumption in the Data Center. Ross Hammond Managing Director

Energy Logic: Emerson Network Power. A Roadmap for Reducing Energy Consumption in the Data Center. Ross Hammond Managing Director Energy Logic: A Roadmap for Reducing Energy Consumption in the Data Center Emerson Network Power Ross Hammond Managing Director Agenda Energy efficiency Where We Are Today What Data Center Managers Are

More information

High Scalability Resource Management with SLURM Supercomputing 2008 November 2008

High Scalability Resource Management with SLURM Supercomputing 2008 November 2008 High Scalability Resource Management with SLURM Supercomputing 2008 November 2008 Morris Jette (jette1@llnl.gov) LLNL-PRES-408498 Lawrence Livermore National Laboratory What is SLURM Simple Linux Utility

More information

Ft Worth Data Center Power Metering for Real Time PUE Calculation. Scope of Work

Ft Worth Data Center Power Metering for Real Time PUE Calculation. Scope of Work Ft Worth Data Center Power Metering for Real Time PUE Calculation Scope of Work Prepared for the General Services Administration By Lawrence Berkeley National Laboratory Rod Mahdavi December 13, 2010 Executive

More information

What s New in PI Analytics and PI Notifications

What s New in PI Analytics and PI Notifications What s New in PI Analytics and PI Notifications Beth McNeill - OSIsoft Glenn Moffett - OSIsoft Roman Schindlauer - Microsoft Analytic and Notification Landscape Visualization & DataLink for Excel Performance

More information

CP2K Performance Benchmark and Profiling. April 2011

CP2K Performance Benchmark and Profiling. April 2011 CP2K Performance Benchmark and Profiling April 2011 Note The following research was performed under the HPC Advisory Council activities Participating vendors: AMD, Dell, Mellanox Compute resource - HPC

More information

Using Alluxio to Improve the Performance and Consistency of HDFS Clusters

Using Alluxio to Improve the Performance and Consistency of HDFS Clusters ARTICLE Using Alluxio to Improve the Performance and Consistency of HDFS Clusters Calvin Jia Software Engineer at Alluxio Learn how Alluxio is used in clusters with co-located compute and storage to improve

More information

Improving Situational Awareness for Utilities Operators and Energy Managers

Improving Situational Awareness for Utilities Operators and Energy Managers Improving Situational Awareness for Utilities Operators and Energy Managers Presented by: Mary-Ann Ibeziako PI World Conference September 24-27, 2018 Barcelona, Spain 1 University of Maryland, College

More information

Exploring Hardware Overprovisioning in Power-Constrained, High Performance Computing

Exploring Hardware Overprovisioning in Power-Constrained, High Performance Computing Exploring Hardware Overprovisioning in Power-Constrained, High Performance Computing Tapasya Patki 1 David Lowenthal 1 Barry Rountree 2 Martin Schulz 2 Bronis de Supinski 2 1 The University of Arizona

More information

The State and Needs of IO Performance Tools

The State and Needs of IO Performance Tools The State and Needs of IO Performance Tools Scalable Tools Workshop Lake Tahoe, CA August 6 12, 2017 This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National

More information