Energy Efficiency Tuning: READEX. Madhura Kumaraswamy Technische Universität München



2 Project Overview
  READEX: Runtime Exploitation of Application Dynamism for Energy-efficient exascale computing
  Starting date: 1 September 2015
  Duration: 3 years
  Funding: European Commission Horizon 2020 grant agreement

3 Project Partners
  Technische Universität Dresden, Germany
  Norwegian University of Science and Technology, Norway
  Technische Universität München, Germany
  IT4Innovations National Supercomputing Center, Czech Republic
  National University of Ireland, Ireland
  Intel Corporation, France
  GNS, Germany

4 Motivation
  Challenges: energy consumption, extreme scale, dynamism
  Problems: awareness, ability, effort
  Solution: automatic tuning, dynamic, design-time/runtime

5 Systems Scenario Methodology
  Formalism for dynamic auto-tuning in embedded systems
  Detect and analyze dynamism at design-time, switch configurations at runtime
  Figure labels: Design-time, Runtime

6 Periscope Tuning Framework (PTF)
  Automatic application analysis & tuning
  Tune performance and energy (statically)
  Plug-in-based architecture
  Evaluate alternatives online
  Scalable and distributed framework
  Support for a variety of parallel paradigms: MPI, OpenMP, OpenCL, parallel patterns
  Developed in the AutoTune EU-FP7 project
  Plugins explore the tuning space and run performance analysis strategies during a phase

7 Score-P
  Scalable Performance Measurement Infrastructure for Parallel Codes
  Common instrumentation and measurement infrastructure
  Periscope accesses the Score-P infrastructure via the online access interface

8 ENOPT Library
  Support for OpenMP, MPI, hybrid and sequential code
  Provides socket-level and node-level counter measurements

9 Tuning Plugins
  DVFS
    Frequency tuning for energy-delay product
    Region-level tuning
    Uses governors and daemons to set static or dynamic power settings
  Parallelism capping
    Thread number tuning for energy-delay product
    Exhaustive and curve-fitting-based prediction
    For OpenMP applications
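
Both plugins above tune for the energy-delay product. A minimal sketch of the exhaustive variant, assuming the usual definition EDP = energy × time. The function names and the measurement values are made up for illustration and are not part of the PTF plugin interface:

```python
from typing import Dict, Hashable, Tuple


def energy_delay_product(energy_joules: float, time_seconds: float) -> float:
    """EDP = energy * execution time (lower is better)."""
    return energy_joules * time_seconds


def best_setting(measurements: Dict[Hashable, Tuple[float, float]]) -> Hashable:
    """Exhaustive search: return the setting with the smallest EDP.

    `measurements` maps a setting (for example a core frequency in GHz or a
    thread count) to an (energy_joules, time_seconds) pair for one region.
    """
    if not measurements:
        raise ValueError("no measurements given")
    return min(measurements, key=lambda s: energy_delay_product(*measurements[s]))


# Hypothetical measurements for one region at three core frequencies (GHz).
example = {
    1.2: (900.0, 14.0),   # EDP = 12600
    1.8: (1000.0, 10.0),  # EDP = 10000
    2.4: (1300.0, 9.0),   # EDP = 11700
}
print(best_setting(example))
```

The same search applies to thread counts by using the thread number as the dictionary key; the curve-fitting prediction mentioned on the slide would replace the exhaustive loop and is not shown here.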

10 Variation of Measurements
  Figure: Energy consumption of the SeisSol application at different compute nodes
  Figure: Normalized energy consumption of the SeisSol application at different compute nodes

11 Tuning with the Periscope Tuning Framework

12 Dynamism
  Intra-phase
  Inter-phase

13 PEPC Benchmark of the DEISA Benchmark Suite
  Figure: MPI_Alltoall performance over 2048 phases

14 Scenario-Based Tuning
  Design Time Analysis: Periscope Tuning Framework (PTF)
  Runtime Scenarios with Tuning Model
  Runtime Tuning: READEX Runtime Library (RRL)

15 Terminology
  Significant Regions: Coarse-granular code regions
  Runtime Situations (rts's): Instances of significant regions
  Identifiers: Distinguish rts's with different characteristics
    Region identifiers, phase identifiers, input identifiers
  Scenarios: rts's with the same characteristics
  Classifier: Groups rts's into scenarios
  Selector: Returns a best configuration for a scenario
  Tuning Model:
    Set of scenarios
    Classifier based on the identifiers
    Selector for each scenario
  Yury Oleynik, oleynik@in.tum.de
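
The terminology above can be read as a small data structure. The following is only an illustrative sketch, assuming that identifiers are a (region, phase, input) triple and that both classifier and selector are plain lookup tables. The class and field names are hypothetical and do not reflect the READEX file format or API:

```python
from dataclasses import dataclass, field
from typing import Dict, Tuple

# An rts is described here by its identifiers: (region, phase, input).
Identifiers = Tuple[str, str, str]
# A configuration is a set of tuning parameter values (names are invented).
Configuration = Dict[str, float]


@dataclass
class TuningModel:
    # Classifier: maps the identifiers of an rts to a scenario id.
    classifier: Dict[Identifiers, int] = field(default_factory=dict)
    # Selector: maps each scenario id to its best-found configuration.
    selector: Dict[int, Configuration] = field(default_factory=dict)

    def classify(self, rts: Identifiers) -> int:
        return self.classifier[rts]

    def select(self, scenario: int) -> Configuration:
        return self.selector[scenario]

    def configuration_for(self, rts: Identifiers) -> Configuration:
        # Classifier first, then the selector of the resulting scenario.
        return self.select(self.classify(rts))


model = TuningModel(
    classifier={
        ("solver", "phase_a", "small"): 0,
        ("solver", "phase_b", "small"): 0,   # same characteristics, same scenario
        ("io_step", "phase_a", "small"): 1,
    },
    selector={
        0: {"core_freq_ghz": 2.4, "threads": 24},
        1: {"core_freq_ghz": 1.2, "threads": 4},
    },
)
print(model.configuration_for(("io_step", "phase_a", "small")))
```

Two rts's that map to the same scenario share one configuration, which is what keeps the tuning model smaller than a per-rts table.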

16 Design Time Analysis with PTF
  Preparation steps
  Analysis
  Tuning Model

17 Runtime Tuning with the READEX Runtime Library
  Detect phase identifiers
  Calibration if needed?
  Post process region measurements
  Store objective measurements
  Detect scenarios based on region identifiers
  Detect configurations
  Switching decision making
  Switch configuration
  Initiate objective measurements
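
The scenario detection, configuration lookup, switching decision and switch steps above can be sketched as follows. This is a toy stand-in under stated assumptions: scenarios are keyed by region identifier only, calibration and objective measurements are left out, and `RuntimeTuner`, `on_region_enter` and the `apply` callback are hypothetical names rather than the RRL interface:

```python
from typing import Callable, Dict, List, Optional

Configuration = Dict[str, float]


class RuntimeTuner:
    """Toy stand-in for a runtime tuning library; not the RRL API."""

    def __init__(self, scenarios: Dict[str, int],
                 configs: Dict[int, Configuration],
                 apply: Callable[[Configuration], None]) -> None:
        self.scenarios = scenarios   # region identifier -> scenario id
        self.configs = configs       # scenario id -> configuration
        self.apply = apply           # callback that performs the switch
        self.current: Optional[Configuration] = None

    def on_region_enter(self, region_id: str) -> bool:
        """Return True if a configuration switch was performed."""
        scenario = self.scenarios.get(region_id)   # detect scenario
        if scenario is None:
            return False                           # unknown rts: keep current setting
        target = self.configs[scenario]            # detect configuration
        if target == self.current:                 # switching decision
            return False
        self.apply(target)                         # switch configuration
        self.current = target
        return True


applied: List[Configuration] = []
tuner = RuntimeTuner(
    scenarios={"solver": 0, "exchange": 1, "assemble": 0},
    configs={0: {"core_freq_ghz": 2.4}, 1: {"core_freq_ghz": 1.2}},
    apply=applied.append,
)
switches = [tuner.on_region_enter(r)
            for r in ["solver", "assemble", "exchange", "unknown"]]
print(switches)
```

Skipping the switch when the target configuration is already active avoids paying the switching cost for consecutive regions that belong to the same scenario.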

18 Validation and Project Goals
  Energy measurements using the HDEEM infrastructure

19 Conclusion
  Energy-efficiency at exascale
    Application developers and users will have to care
  Lack of capabilities
    Awareness
    Expertise
    Resources
  Proposed solution
    READEX: Exploit dynamism
    Detect at design-time, exploit at runtime
    Tools-aided auto-tuning methodology

20 Thank you! Questions?

21 Additional Slides

22 Predicted vs Measured Time for SeisSol

23 Plugin Evaluation Status

24 Inter-phase Dynamism
  Indeed application of GNS
  Identifiers for adaptation strategy
  Valleys vs hills

READEX: A Tool Suite for Dynamic Energy Tuning. Michael Gerndt Technische Universität München

READEX: A Tool Suite for Dynamic Energy Tuning. Michael Gerndt Technische Universität München READEX: A Tool Suite for Dynamic Energy Tuning Michael Gerndt Technische Universität München Campus Garching 2 SuperMUC: 3 Petaflops, 3 MW 3 READEX Runtime Exploitation of Application Dynamism for Energy-efficient

More information

READEX Runtime Exploitation of Application Dynamism for Energyefficient

READEX Runtime Exploitation of Application Dynamism for Energyefficient READEX Runtime Exploitation of Application Dynamism for Energyefficient exascale computing EnA-HPC @ ISC 17 Robert Schöne TUD Project Motivation Applications exhibit dynamic behaviour Changing resource

More information

AutoTune Workshop. Michael Gerndt Technische Universität München

AutoTune Workshop. Michael Gerndt Technische Universität München AutoTune Workshop Michael Gerndt Technische Universität München AutoTune Project Automatic Online Tuning of HPC Applications High PERFORMANCE Computing HPC application developers Compute centers: Energy

More information

Automatic Tuning of HPC Applications with Periscope. Michael Gerndt, Michael Firbach, Isaias Compres Technische Universität München

Automatic Tuning of HPC Applications with Periscope. Michael Gerndt, Michael Firbach, Isaias Compres Technische Universität München Automatic Tuning of HPC Applications with Periscope Michael Gerndt, Michael Firbach, Isaias Compres Technische Universität München Agenda 15:00 15:30 Introduction to the Periscope Tuning Framework (PTF)

More information

READEX: Linking Two Ends of the Computing Continuum to Improve Energy-efficiency in Dynamic Applications

READEX: Linking Two Ends of the Computing Continuum to Improve Energy-efficiency in Dynamic Applications READEX: Linking Two Ends of the Computing Continuum to Improve Energy-efficiency in Dynamic Applications Per Gunnar Kjeldsberg, Andreas Gocht, Michael Gerndt, Lubomir Riha, Joseph Schuchart, and Umbreen

More information

Code Auto-Tuning with the Periscope Tuning Framework

Code Auto-Tuning with the Periscope Tuning Framework Code Auto-Tuning with the Periscope Tuning Framework Renato Miceli, SENAI CIMATEC renato.miceli@fieb.org.br Isaías A. Comprés, TUM compresu@in.tum.de Project Participants Michael Gerndt, TUM Coordinator

More information

Tuning Alya with READEX for Energy-Efficiency

Tuning Alya with READEX for Energy-Efficiency Tuning Alya with READEX for Energy-Efficiency Venkatesh Kannan 1, Ricard Borrell 2, Myles Doyle 1, Guillaume Houzeaux 2 1 Irish Centre for High-End Computing (ICHEC) 2 Barcelona Supercomputing Centre (BSC)

More information

The READEX formalism for automatic tuning for energy efficiency

The READEX formalism for automatic tuning for energy efficiency Computing DOI 10.1007/s00607-016-0532-7 The READEX formalism for automatic tuning for energy efficiency Joseph Schuchart 1 Michael Gerndt 2 Per Gunnar Kjeldsberg 3 Michael Lysaght 4 David Horák 5 Lubomír

More information

MULTITHERMAN: Out-of-band High-Resolution HPC Power and Performance Monitoring Support for Big-Data Analysis

MULTITHERMAN: Out-of-band High-Resolution HPC Power and Performance Monitoring Support for Big-Data Analysis MULTITHERMAN: Out-of-band High-Resolution HPC Power and Performance Monitoring Support for Big-Data Analysis EU H2020 FETHPC project ANTAREX (g.a. 671623) EU FP7 ERC Project MULTITHERMAN (g.a.291125) EETHPC,

More information

MULTITHERMAN: Out-of-band High-Resolution HPC Power and Performance Monitoring Support for Big-Data Analysis

MULTITHERMAN: Out-of-band High-Resolution HPC Power and Performance Monitoring Support for Big-Data Analysis MULTITHERMAN: Out-of-band High-Resolution HPC Power and Performance Monitoring Support for Big-Data Analysis EU H2020 FETHPC project ANTAREX (g.a. 671623) EU FP7 ERC Project MULTITHERMAN (g.a.291125) HPC

More information

Analyzing I/O Performance on a NEXTGenIO Class System

Analyzing I/O Performance on a NEXTGenIO Class System Analyzing I/O Performance on a NEXTGenIO Class System holger.brunst@tu-dresden.de ZIH, Technische Universität Dresden LUG17, Indiana University, June 2 nd 2017 NEXTGenIO Fact Sheet Project Research & Innovation

More information

Managing Hardware Power Saving Modes for High Performance Computing

Managing Hardware Power Saving Modes for High Performance Computing Managing Hardware Power Saving Modes for High Performance Computing Second International Green Computing Conference 2011, Orlando Timo Minartz, Michael Knobloch, Thomas Ludwig, Bernd Mohr timo.minartz@informatik.uni-hamburg.de

More information

A Simple Framework for Energy Efficiency Evaluation and Hardware Parameter. Tuning with Modular Support for Different HPC Platforms

A Simple Framework for Energy Efficiency Evaluation and Hardware Parameter. Tuning with Modular Support for Different HPC Platforms A Simple Framework for Energy Efficiency Evaluation and Hardware Parameter Tuning with Modular Support for Different HPC Platforms Ondrej Vysocky, Jan Zapletal and Lubomir Riha IT4Innovations, VSB Technical

More information

An Exascale Programming, Multi objective Optimisation and Resilience Management Environment Based on Nested Recursive Parallelism.

An Exascale Programming, Multi objective Optimisation and Resilience Management Environment Based on Nested Recursive Parallelism. This project has received funding from the European Union s Horizon 2020 research and innovation programme under grant agreement No. 671603 An Exascale Programming, ulti objective Optimisation and Resilience

More information

[Scalasca] Tool Integrations

[Scalasca] Tool Integrations Mitglied der Helmholtz-Gemeinschaft [Scalasca] Tool Integrations Aug 2011 Bernd Mohr CScADS Performance Tools Workshop Lake Tahoe Contents Current integration of various direct measurement tools Paraver

More information

AUTOMATIC SMT THREADING

AUTOMATIC SMT THREADING AUTOMATIC SMT THREADING FOR OPENMP APPLICATIONS ON THE INTEL XEON PHI CO-PROCESSOR WIM HEIRMAN 1,2 TREVOR E. CARLSON 1 KENZO VAN CRAEYNEST 1 IBRAHIM HUR 2 AAMER JALEEL 2 LIEVEN EECKHOUT 1 1 GHENT UNIVERSITY

More information

Potentials and Limitations for Energy Efficiency Auto-Tuning

Potentials and Limitations for Energy Efficiency Auto-Tuning Center for Information Services and High Performance Computing (ZIH) Potentials and Limitations for Energy Efficiency Auto-Tuning Parco Symposium Application Autotuning for HPC (Architectures) Robert Schöne

More information

Accelerating sequential computer vision algorithms using commodity parallel hardware

Accelerating sequential computer vision algorithms using commodity parallel hardware Accelerating sequential computer vision algorithms using commodity parallel hardware Platform Parallel Netherlands GPGPU-day, 28 June 2012 Jaap van de Loosdrecht NHL Centre of Expertise in Computer Vision

More information

Performance Analysis for Large Scale Simulation Codes with Periscope

Performance Analysis for Large Scale Simulation Codes with Periscope Performance Analysis for Large Scale Simulation Codes with Periscope M. Gerndt, Y. Oleynik, C. Pospiech, D. Gudu Technische Universität München IBM Deutschland GmbH May 2011 Outline Motivation Periscope

More information

I/O Profiling Towards the Exascale

I/O Profiling Towards the Exascale I/O Profiling Towards the Exascale holger.brunst@tu-dresden.de ZIH, Technische Universität Dresden NEXTGenIO & SAGE: Working towards Exascale I/O Barcelona, NEXTGenIO facts Project Research & Innovation

More information

Performance Cockpit: An Extensible GUI Platform for Performance Tools

Performance Cockpit: An Extensible GUI Platform for Performance Tools Performance Cockpit: An Extensible GUI Platform for Performance Tools Tianchao Li and Michael Gerndt Institut für Informatik, Technische Universität München, Boltzmannstr. 3, D-85748 Garching bei Mu nchen,

More information

Scalasca support for Intel Xeon Phi. Brian Wylie & Wolfgang Frings Jülich Supercomputing Centre Forschungszentrum Jülich, Germany

Scalasca support for Intel Xeon Phi. Brian Wylie & Wolfgang Frings Jülich Supercomputing Centre Forschungszentrum Jülich, Germany Scalasca support for Intel Xeon Phi Brian Wylie & Wolfgang Frings Jülich Supercomputing Centre Forschungszentrum Jülich, Germany Overview Scalasca performance analysis toolset support for MPI & OpenMP

More information

D4.7 Multi-Objective Dynamic Optimizer (b)

D4.7 Multi-Objective Dynamic Optimizer (b) H2020 FETHPC-1-2014 An Exascale Programming, Multi-objective Optimisation and Resilience Management Environment Based on Nested Recursive Parallelism Project Number 671603 D4.7 Multi-Objective Dynamic

More information

MPI RUNTIMES AT JSC, NOW AND IN THE FUTURE

MPI RUNTIMES AT JSC, NOW AND IN THE FUTURE , NOW AND IN THE FUTURE Which, why and how do they compare in our systems? 08.07.2018 I MUG 18, COLUMBUS (OH) I DAMIAN ALVAREZ Outline FZJ mission JSC s role JSC s vision for Exascale-era computing JSC

More information

Overview of research activities Toward portability of performance

Overview of research activities Toward portability of performance Overview of research activities Toward portability of performance Do dynamically what can t be done statically Understand evolution of architectures Enable new programming models Put intelligence into

More information

DICE: a Model-Driven DevOps Framework for Big Data

DICE: a Model-Driven DevOps Framework for Big Data DICE: a Model-Driven DevOps Framework for Big Data Giuliano Casale Imperial College London DICE Horizon 2020 Project Grant Agreement no. 644869 http://www.dice-h2020.eu Funded by the Horizon 2020 Framework

More information

Programming Support for Heterogeneous Parallel Systems

Programming Support for Heterogeneous Parallel Systems Programming Support for Heterogeneous Parallel Systems Siegfried Benkner Department of Scientific Computing Faculty of Computer Science University of Vienna http://www.par.univie.ac.at Outline Introduction

More information

Performance analysis with Periscope

Performance analysis with Periscope Performance analysis with Periscope M. Gerndt, V. Petkov, Y. Oleynik, S. Benedict Technische Universität petkovve@in.tum.de March 2010 Outline Motivation Periscope (PSC) Periscope performance analysis

More information

Score-P A Joint Performance Measurement Run-Time Infrastructure for Periscope, Scalasca, TAU, and Vampir

Score-P A Joint Performance Measurement Run-Time Infrastructure for Periscope, Scalasca, TAU, and Vampir Score-P A Joint Performance Measurement Run-Time Infrastructure for Periscope, Scalasca, TAU, and Vampir Andreas Knüpfer, Christian Rössel andreas.knuepfer@tu-dresden.de, c.roessel@fz-juelich.de 2011-09-26

More information

Recent Developments in Score-P and Scalasca V2

Recent Developments in Score-P and Scalasca V2 Mitglied der Helmholtz-Gemeinschaft Recent Developments in Score-P and Scalasca V2 Aug 2015 Bernd Mohr 9 th Scalable Tools Workshop Lake Tahoe YOU KNOW YOU MADE IT IF LARGE COMPANIES STEAL YOUR STUFF August

More information

HPC IN EUROPE. Organisation of public HPC resources

HPC IN EUROPE. Organisation of public HPC resources HPC IN EUROPE Organisation of public HPC resources Context Focus on publicly-funded HPC resources provided primarily to enable scientific research and development at European universities and other publicly-funded

More information

HPC projects. Grischa Bolls

HPC projects. Grischa Bolls HPC projects Grischa Bolls Outline Why projects? 7th Framework Programme Infrastructure stack IDataCool, CoolMuc Mont-Blanc Poject Deep Project Exa2Green Project 2 Why projects? Pave the way for exascale

More information

Powernightmares: The Challenge of Efficiently Using Sleep States on Multi-Core Systems

Powernightmares: The Challenge of Efficiently Using Sleep States on Multi-Core Systems Powernightmares: The Challenge of Efficiently Using Sleep States on Multi-Core Systems Thomas Ilsche, Marcus Hähnel, Robert Schöne, Mario Bielert, and Daniel Hackenberg Technische Universität Dresden Observation

More information

IRMOS Newsletter. Issue N 5 / January Editorial. In this issue... Dear Reader, Editorial p.1

IRMOS Newsletter. Issue N 5 / January Editorial. In this issue... Dear Reader, Editorial p.1 IRMOS Newsletter Issue N 5 / January 2011 In this issue... Editorial Editorial p.1 Highlights p.2 Special topic: The IRMOS Repository p.5 Recent project outcomes p.6 Keep in touch with IRMOS p.8 Dear Reader,

More information

Performance Analysis of Large-Scale OpenMP and Hybrid MPI/OpenMP Applications with Vampir NG

Performance Analysis of Large-Scale OpenMP and Hybrid MPI/OpenMP Applications with Vampir NG Performance Analysis of Large-Scale OpenMP and Hybrid MPI/OpenMP Applications with Vampir NG Holger Brunst Center for High Performance Computing Dresden University, Germany June 1st, 2005 Overview Overview

More information

WP6 D6.2 Project website

WP6 D6.2 Project website WP6 D6.2 Project website Project title: Project Acronym: Promoting Youth Scientific Career Awareness and it Attractiveness through Multistakeholder Cooperation MultiCO Project ID: 665100 Prepared by: University

More information

Scope of activities scientific and research work in informatics, information technology, control theory, robotics and artificial intelligence

Scope of activities scientific and research work in informatics, information technology, control theory, robotics and artificial intelligence 1956-2016 Scope of activities scientific and research work in informatics, information technology, control theory, robotics and artificial intelligence Departments: Parallel and distributed information

More information

Extending SLURM with Support for GPU Ranges

Extending SLURM with Support for GPU Ranges Available on-line at www.prace-ri.eu Partnership for Advanced Computing in Europe Extending SLURM with Support for GPU Ranges Seren Soner a, Can Özturana,, Itir Karac a a Computer Engineering Department,

More information

DG CONNECT (Unit H5) Update on Data Centre Activities

DG CONNECT (Unit H5) Update on Data Centre Activities DG CONNECT (Unit H5) Update on Data Centre Activities Svetoslav Mihaylov Scientific/Technical Project Officer Smart Cities and Sustainability Directorate-General Communications Networks, Content and Technology

More information

Dynamic Performance Tuning for Speculative Threads

Dynamic Performance Tuning for Speculative Threads Dynamic Performance Tuning for Speculative Threads Yangchun Luo, Venkatesan Packirisamy, Nikhil Mungre, Ankit Tarkas, Wei-Chung Hsu, and Antonia Zhai Dept. of Computer Science and Engineering Dept. of

More information

Partner in a European project & how to get there - View from Russian Insider on project ISTOK.Ru

Partner in a European project & how to get there - View from Russian Insider on project ISTOK.Ru Information Society Technologies to Open Knowledge. Russia Partner in a European project & how to get there - View from Russian Insider on project ISTOK.Ru Oleg Luksha Russian Technology Transfer Network

More information

D6.1 AllScale Computing Infrastructure

D6.1 AllScale Computing Infrastructure H2020 FETHPC-1-2014 An Exascale Programming, Multi-objective Optimisation and Resilience Management Environment Based on Nested Recursive Parallelism Project Number 671603 D6.1 AllScale Computing Infrastructure

More information

GOING ARM A CODE PERSPECTIVE

GOING ARM A CODE PERSPECTIVE GOING ARM A CODE PERSPECTIVE ISC18 Guillaume Colin de Verdière JUNE 2018 GCdV PAGE 1 CEA, DAM, DIF, F-91297 Arpajon, France June 2018 A history of disruptions All dates are installation dates of the machines

More information

IBM High Performance Computing Toolkit

IBM High Performance Computing Toolkit IBM High Performance Computing Toolkit Pidad D'Souza (pidsouza@in.ibm.com) IBM, India Software Labs Top 500 : Application areas (November 2011) Systems Performance Source : http://www.top500.org/charts/list/34/apparea

More information

Empowering the Service Economy with SLA-aware Infrastructures in the project

Empowering the Service Economy with SLA-aware Infrastructures in the project Empowering the Service Economy with SLA-aware Infrastructures in the project SLA@SOI ETSI Workshop Grids, Clouds & Service Infrastructures, Sophia Antipolis, Dec 2-3, 2009 Ramin Yahyapour Technische Universität

More information

Application Example Running on Top of GPI-Space Integrating D/C

Application Example Running on Top of GPI-Space Integrating D/C Application Example Running on Top of GPI-Space Integrating D/C Tiberiu Rotaru Fraunhofer ITWM This project is funded from the European Union s Horizon 2020 Research and Innovation programme under Grant

More information

Presenting: Comparing the Power and Performance of Intel's SCC to State-of-the-Art CPUs and GPUs

Presenting: Comparing the Power and Performance of Intel's SCC to State-of-the-Art CPUs and GPUs Presenting: Comparing the Power and Performance of Intel's SCC to State-of-the-Art CPUs and GPUs A paper comparing modern architectures Joakim Skarding Christian Chavez Motivation Continue scaling of performance

More information

Approaches to I/O Scalability Challenges in the ECMWF Forecasting System

Approaches to I/O Scalability Challenges in the ECMWF Forecasting System Approaches to I/O Scalability Challenges in the ECMWF Forecasting System PASC 16, June 9 2016 Florian Rathgeber, Simon Smart, Tiago Quintino, Baudouin Raoult, Stephan Siemen, Peter Bauer Development Section,

More information

Performance Analysis with Periscope

Performance Analysis with Periscope Performance Analysis with Periscope M. Gerndt, V. Petkov, Y. Oleynik, S. Benedict Technische Universität München periscope@lrr.in.tum.de October 2010 Outline Motivation Periscope overview Periscope performance

More information

European IPR Helpdesk. Service Offer. Get your ticket to innovation! Jörg Scherer European IPR Helpdesk CEO Eurice GmbH Gijon 17/04/2018

European IPR Helpdesk. Service Offer. Get your ticket to innovation! Jörg Scherer European IPR Helpdesk CEO Eurice GmbH Gijon 17/04/2018 European IPR Helpdesk Service Offer Get your ticket to innovation! Jörg Scherer European IPR Helpdesk CEO Eurice GmbH Gijon 17/04/2018 The European IPR Helpdesk.an Innovation Support Action offering support

More information

ECMWF s Next Generation IO for the IFS Model

ECMWF s Next Generation IO for the IFS Model ECMWF s Next Generation IO for the Model Part of ECMWF s Scalability Programme Tiago Quintino, B. Raoult, P. Bauer ECMWF tiago.quintino@ecmwf.int ECMWF January 14, 2016 ECMWF s HPC Targets What do we do?

More information

H2020 EUB EU-Brazil Research and Development Cooperation in Advanced Cyber Infrastructure. NCP Training Brussels, 18 September 2014

H2020 EUB EU-Brazil Research and Development Cooperation in Advanced Cyber Infrastructure. NCP Training Brussels, 18 September 2014 H2020 EUB 2015 EU-Brazil Research and Development Cooperation in Advanced Cyber Infrastructure NCP Training Brussels, 18 September 2014 H2020 EUB 2015 This topic is a major element for the implementation

More information

Crossing the Architectural Barrier: Evaluating Representative Regions of Parallel HPC Applications

Crossing the Architectural Barrier: Evaluating Representative Regions of Parallel HPC Applications Crossing the Architectural Barrier: Evaluating Representative Regions of Parallel HPC Applications Alexandra Ferrerón (University of Zaragoza), Radhika Jagtap, Sascha Bischoff, Roxana Rușitoru (ARM) Senior

More information

Deadline-based Scheduling for GPU with Preemption Support

Deadline-based Scheduling for GPU with Preemption Support Deadline-based Scheduling for GPU with Preemption Support N. Capodieci, R. Cavicchioli, M. Bertogna, A. Paramakuru. University of Modena and Reggio Emilia NVIDIA Corp. 12/12/2018 RTSS 2018, NASHVILLE 1

More information

Introduction to VI-HPS

Introduction to VI-HPS Introduction to VI-HPS José Gracia HLRS Virtual Institute High Productivity Supercomputing Goal: Improve the quality and accelerate the development process of complex simulation codes running on highly-parallel

More information

Project Proposals. Advanced Operating Systems / Embedded Systems (2016/2017)

Project Proposals. Advanced Operating Systems / Embedded Systems (2016/2017) Project Proposals / Embedded Systems (2016/2017) Giuseppe Massari, Federico Terraneo giuseppe.massari@polimi.it federico.terraneo@polimi.it Project Rules 2/40 General rules Two types of project: Code development

More information

The DEEP (and DEEP-ER) projects

The DEEP (and DEEP-ER) projects The DEEP (and DEEP-ER) projects Estela Suarez - Jülich Supercomputing Centre BDEC for Europe Workshop Barcelona, 28.01.2015 The research leading to these results has received funding from the European

More information

The Mont-Blanc approach towards Exascale

The Mont-Blanc approach towards Exascale http://www.montblanc-project.eu The Mont-Blanc approach towards Exascale Alex Ramirez Barcelona Supercomputing Center Disclaimer: Not only I speak for myself... All references to unavailable products are

More information

QoS-aware resource allocation and load-balancing in enterprise Grids using online simulation

QoS-aware resource allocation and load-balancing in enterprise Grids using online simulation QoS-aware resource allocation and load-balancing in enterprise Grids using online simulation * Universität Karlsruhe (TH) Technical University of Catalonia (UPC) Barcelona Supercomputing Center (BSC) Samuel

More information

Exploiting Task-Parallelism on GPU Clusters via OmpSs and rcuda Virtualization

Exploiting Task-Parallelism on GPU Clusters via OmpSs and rcuda Virtualization Exploiting Task-Parallelism on Clusters via Adrián Castelló, Rafael Mayo, Judit Planas, Enrique S. Quintana-Ortí RePara 2015, August Helsinki, Finland Exploiting Task-Parallelism on Clusters via Power/energy/utilization

More information

EUMETSAT EXPERIENCE WITH MULTICAST ACROSS GÉANT

EUMETSAT EXPERIENCE WITH MULTICAST ACROSS GÉANT 1 EUMETSAT EXPERIENCE WITH MULTICAST ACROSS GÉANT Lothar.Wolf@eumetsat.int Competence Area Manager for Data Services OVERVIEW EUMETSAT Background WAN links Multicast accross GÉANT infrastructure Summary

More information

The EuroHPC strategic initiative

The EuroHPC strategic initiative Amsterdam, 12 December 2017 The EuroHPC strategic initiative Thomas Skordas Director, DG CONNECT-C, European Commission The European HPC strategy in Horizon 2020 Infrastructure Capacity of acquiring leadership-class

More information

On the Path towards Exascale Computing in the Czech Republic and in Europe

On the Path towards Exascale Computing in the Czech Republic and in Europe MARTIN PALKOVIČ Martin Palkovič is the managing director of IT4Innovations, the national supercomputing center in the Czech Republic. Before joining IT4Innovations, he was working for 12 years at imec

More information

EuroHPC: the European HPC Strategy

EuroHPC: the European HPC Strategy HPC User Forum Campus Teratec, Bruyères-le-Châtel (F) 6-7 March 2018 EuroHPC: the European HPC Strategy Leonardo Flores Añover Senior Expert, HPC and Quantum Computing Unit DG CONNECT, European Commission

More information

Design, Development and Improvement of Nagios System Monitoring for Large Clusters

Design, Development and Improvement of Nagios System Monitoring for Large Clusters Available online at www.prace-ri.eu Partnership for Advanced Computing in Europe Design, Development and Improvement of Nagios System Monitoring for Large Clusters Daniela Galetti 1, Federico Paladin 2

More information

Portable Power/Performance Benchmarking and Analysis with WattProf

Portable Power/Performance Benchmarking and Analysis with WattProf Portable Power/Performance Benchmarking and Analysis with WattProf Amir Farzad, Boyana Norris University of Oregon Mohammad Rashti RNET Technologies, Inc. Motivation Energy efficiency is becoming increasingly

More information

Ateles performance assessment report

Ateles performance assessment report Ateles performance assessment report Document Information Reference Number Author Contributor(s) Date Application Service Level Keywords AR-4, Version 0.1 Jose Gracia (USTUTT-HLRS) Christoph Niethammer,

More information

COL862: Low Power Computing Maximizing Performance Under a Power Cap: A Comparison of Hardware, Software, and Hybrid Techniques

COL862: Low Power Computing Maximizing Performance Under a Power Cap: A Comparison of Hardware, Software, and Hybrid Techniques COL862: Low Power Computing Maximizing Performance Under a Power Cap: A Comparison of Hardware, Software, and Hybrid Techniques Authors: Huazhe Zhang and Henry Hoffmann, Published: ASPLOS '16 Proceedings

More information

GPU Debugging Made Easy. David Lecomber CTO, Allinea Software

GPU Debugging Made Easy. David Lecomber CTO, Allinea Software GPU Debugging Made Easy David Lecomber CTO, Allinea Software david@allinea.com Allinea Software HPC development tools company Leading in HPC software tools market Wide customer base Blue-chip engineering,

More information

Welcome to the TIPS Workshop , Brussels

Welcome to the TIPS Workshop , Brussels Welcome to the TIPS Workshop 24.10.2013, Brussels The TIPS project is supported by the European Commission within the Seventh Framework Programme for TIPS goals Provide support to FP7 transport R&D projects

More information

EU Liaison Update. General Assembly. Matthew Scott & Edit Herczog. Reference : GA(18)021. Trondheim. 14 June 2018

EU Liaison Update. General Assembly. Matthew Scott & Edit Herczog. Reference : GA(18)021. Trondheim. 14 June 2018 EU Liaison Update General Assembly Matthew Scott & Edit Herczog Reference : GA(18)021 Trondheim 14 June 2018 Agenda Introduction Outcomes of meetings with DG CNECT & RTD, & other stakeholders Latest FP9

More information

Challenges in HPC I/O
