Leveraging Parallelware in MAESTRO and EPEEC

Similar documents
Joachim Biercamp Deutsches Klimarechenzentrum (DKRZ) With input from Peter Bauer, Reinhard Budich, Sylvie Joussaume, Bryan Lawrence.

Barcelona Supercomputing Center

The DEEP (and DEEP-ER) projects

Earth System Sciences in the Times of Brilliant Technologies

Overview Introduction to Ambient Networks (AN) Migration and Deployment Issues Migration Principles Migration Roadmap Phases Overlay vs. Interworking

EU Research Infra Integration: a vision from the BSC. Josep M. Martorell, PhD Associate Director

HPC IN EUROPE. Organisation of public HPC resources

Building supercomputers from embedded technologies

DEISA. An European GRID-empowered infrastructure for Science and Industry" Pisa 11 maggio 2005 Angelo De Florio

Debugging at Scale Lindon Locks

The EOSC A personal vision (supported by some facts)

AMD ACCELERATING TECHNOLOGIES FOR EXASCALE COMPUTING FELLOW 3 OCTOBER 2016

Non-Conventional and Low-Cost Wireless Access Concepts in WWI Ambient Networks

1. Publishable Summary

Tuning Alya with READEX for Energy-Efficiency

Workshop: Innovation Procurement in Horizon 2020 PCP Contractors wanted

SPEED-5G overview Quality of Service Provision and capacity Expansion through Extended-DSA for 5G

GOING ARM A CODE PERSPECTIVE

Towards Exascale Programming Models HPC Summit, Prague Erwin Laure, KTH

Future Internet Experiments over National Research & Education Networks: The Use Cases of FEDERICA & NOVI over European NRENs - GÉANT

The Mont-Blanc approach towards Exascale

Victor Alessandrini IDRIS - CNRS

Pedraforca: a First ARM + GPU Cluster for HPC

EuroEXA Driving the technology towards exascale

EuroHPC Bologna 23 Marzo Gabriella Scipione

EEPOS automation and energy management platform Technical documentation of the implemented EEPOS gateway and the related middleware

The EuroHPC strategic initiative

Pre-announcement of upcoming procurement, NWP18, at National Supercomputing Centre at Linköping University

Update of Post-K Development Yutaka Ishikawa RIKEN AICS

Performance-oriented development

International Association for Hydrogen Safety

HPC SERVICE PROVISION FOR THE UK

NEW_InnoNet stakeholder analysis

Generating SVG weather maps and meteorological graphs using Magics++

Hybrid KAUST Many Cores and OpenACC. Alain Clo - KAUST Research Computing Saber Feki KAUST Supercomputing Lab Florent Lebeau - CAPS

Administrative Status

Percipient StorAGe for Exascale Data Centric Computing Computing for the Exascale

The challenges of the ECMWF graphics packages

Architecture, Programming and Performance of MIC Phi Coprocessor

GPU Debugging Made Easy. David Lecomber CTO, Allinea Software

OpenMP 4.0: A Significant Paradigm Shift in Parallelism

ETP4HPC IN A NUTSHELL

SPACE SITUATIONAL AWARENESS

GIGAS. GEOSS INSPIRE & GMES an Action in Support

USER MANUAL FOR V1.0.

High Performance Computing : Code_Saturne in the PRACE project

Extended Dataflow Model For Automated Parallel Execution Of Algorithms

Programming Models for Multi- Threading. Brian Marshall, Advanced Research Computing

AUTOMATIC SMT THREADING

Addressing the Increasing Challenges of Debugging on Accelerated HPC Systems. Ed Hinkel Senior Sales Engineer

Conference Radisson Blu Centrum Hotel - WARSAW 04 / 05 October 2011

Next Generation CEA Computing Centres

PRESS RELEASE. PhoxTroT: Optical Interconnect Technologies Revolutionized Data Centers and HPC Systems

OVERVIEW OF MPC JUNE 24 TH LLNL Meeting June 15th, 2015 PAGE 1

From the latency to the throughput age. Prof. Jesús Labarta Director Computer Science Dept (BSC) UPC

Výpočetní zdroje IT4Innovations a PRACE pro využití ve vědě a výzkumu

Approaches to I/O Scalability Challenges in the ECMWF Forecasting System

NULIFE Network of excellence Nuclear plant life prediction. Rauno Rintamaa and Irina Aho-Mantila VTT, Espoo, Finland Coordinator of NULIFE

CLAW FORTRAN Compiler source-to-source translation for performance portability

BSC and integrating persistent data and parallel programming models

Overview of research activities Toward portability of performance

Open Archives Forum - Technical Validation -

Innovative SOFC Architecture based on Triode Operation. Deliverable D6.2 Web site dedicated to the scientific community

Géraud Guilloud. Horizon-2020 appels Atelier 5G & Internet Technologies & Digital Security

Federated Services and Data Management in PRACE

Arm's role in co-design for the next generation of HPC platforms

NEXTGenIO Performance Tools for In-Memory I/O

Project overview. Gelsomina Pappalardo National Research Council of Italy. AEROCOM - ACTRIS Meeting Hamburg, Germany 26 September 2013

F-Interop Online Platform of Interoperability and Performance Tests for the Internet of Things

The Digitising European Industry strategy & H2020 calls related to Cyber-Physical Systems

Compiler Tools for HighLevel Parallel Languages

The Heterogeneous Programming Jungle. Service d Expérimentation et de développement Centre Inria Bordeaux Sud-Ouest

MPI in 2020: Opportunities and Challenges. William Gropp

IT Innovation Centre, University of Southampton, UK. Deutsches Geo-Forschungs-Zentrum - GFZ, Germany. Fraunhofer IOSB, Germany

HUMBOLDT Application Scenario: Protected Areas

Advances in Time-Parallel Four Dimensional Data Assimilation in a Modular Software Framework

Designing a System Engineering Environment in a structured way

IRRIIS - Integrated Risk Reduction of Information-based Infrastructure Systems

An Introduction to OpenACC

D9.3 STREETLIFE Website

A Breakthrough in Non-Volatile Memory Technology FUJITSU LIMITED

CASCOM. Context-Aware Business Application Service Co-ordination ordination in Mobile Computing Environments

IS-ENES2 Kick-off meeting Sergi Girona, Chair of the Board of Directors

Energy Efficient Computing Systems (EECS) Magnus Jahre Coordinator, EECS

Energy Efficiency Tuning: READEX. Madhura Kumaraswamy Technische Universität München

The Go4IT project. Toward a TTCN-3 open environment for IPv6 protocols testing. Project identity card

FPGA-based Supercomputing: New Opportunities and Challenges

Eurocode Basis of structural design

Cloud-based Rapid Elastic MAnufacturing

EU-TW Partnership. l l

This document is a preview generated by EVS

Interconnection of Armenian e- Infrastructures with the pan- Euroepan Integrated Environments

EO Ground Segment Evolution Reflections by

The Mont-Blanc project Updates from the Barcelona Supercomputing Center

CSD3 The Cambridge Service for Data Driven Discovery. A New National HPC Service for Data Intensive science

Digital Single Market Technologies and Public Service Modernisation Package -DSM. Grazyna Wojcieszko DG CONNECT

SEVENTH FRAMEWORK PROGRAMME THEME [ICT ] [Photonics]

TEDS BEE TEST OF DIGITAL SERVICES FOR BUILDING ENERGY EFFICIENCY A PROJECT TOWARDS THE ENERGY SAVING AND SUSTAINABILITY OF PUBLIC BUILDINGS

Towards a codelet-based runtime for exascale computing. Chris Lauderdale ET International, Inc.

Data Replication: Automated move and copy of data. PRACE Advanced Training Course on Data Staging and Data Movement Helsinki, September 10 th 2013

Transcription:

Leveraging Parallelware in MAESTRO and EPEEC and Enhancements to Parallelware Manuel Arenaz manuel.arenaz@appentra.com PRACE booth #2033 Thursday, 15 November 2018 Dallas, US http://www.prace-ri.eu/praceatsc18/

Index European Research: H2020 FETHPC programme Parallelware Software before MAESTRO & EPEEC EPEEC: European joint Effort toward a Highly Productive Programming Environment for Heterogeneous Exascale Computing Benefits and known limitations MAESTRO: Middleware for memory and data-awareness in workflows Projects MAESTRO and EPEEC Enhancements to Parallelware Software due to Maestro & EPEEC

Index European Research: H2020 FETHPC programme Parallelware Software before MAESTRO & EPEEC EPEEC: European joint Effort toward a Highly Productive Programming Environment for Heterogeneous Exascale Computing Benefits and known limitations MAESTRO: Middleware for memory and data-awareness in workflows Projects MAESTRO and EPEEC Enhancements to Parallelware Software due to Maestro & EPEEC

European Research: H2020 FETHPC Programme Subtopic b) Exascale system software and management, to advance the state of the art in system software and management for node architectures. Subtopic a) High productivity programming environments for exascale, to simplify application software development for large- and extreme-scale systems. How can Parallelware tools help to address these R&D challenges?

Index European Research: H2020 FETHPC programme Parallelware Software before MAESTRO & EPEEC EPEEC: European joint Effort toward a Highly Productive Programming Environment for Heterogeneous Exascale Computing Benefits and known limitations MAESTRO: Middleware for memory and data-awareness in workflows Projects MAESTRO and EPEEC Enhancements to Parallelware Software due to Maestro & EPEEC

Parallelware Software before MAESTRO & EPEEC Parallelware (libpw) Parallelware front-end Parallelware middle-end C OpenMP 4.5 Semantic Analysis Engine GUI Desktop Parallelware back-end OpenACC 2.0 Multi-Threading Offloading

Parallelware Software: Benefits Parallelware (libpw) Parallelware front-end Parallelware middle-end C OpenMP 4.5 Semantic Analysis Engine GUI Desktop Parallelware back-end OpenACC 2.0 Multi-Threading Offloading Good training tool for parallel programming, particularly for GPUs. Support for multithreading and offloading in OpenMP 4.5. Support for offloading in OpenACC. Good user messages explaining why and how a loop can be parallelized. Management of data scoping of variables (eg. private, shared, reduction) Good code examples for training. Multi-platform: Linux, Windows, MacOSX

Parallelware Software: Known Limitations Parallelware (libpw) Parallelware front-end Parallelware middle-end C OpenMP 4.5 Semantic Analysis Engine GUI Desktop Parallelware back-end OpenACC 2.0 Multi-Threading Offloading Needs enhancements to analyze real applications (eg. MPI+X). Lack of support for structs in C. Lack of support for C++ and Fortran. Lack of support for tasking and SIMD. Lack of support for multiple files and procedure calls. More precise support for data-scoping.

Index European Research: H2020 FETHPC programme Parallelware Software before MAESTRO & EPEEC EPEEC: European joint Effort toward a Highly Productive Programming Environment for Heterogeneous Exascale Computing Benefits and known limitations MAESTRO: Middleware for memory and data-awareness in workflows Projects MAESTRO and EPEEC Enhancements to Parallelware Software due to Maestro & EPEEC

www.maestro-data.eu Middleware for memory and data-awareness in workflows Objective: Build a data-aware and memory-aware middleware framework that addresses ubiquitous problems of data movement in complex memory hierarchies and at many levels of the HPC software stack. Partners: Forschungszentrum Julich Gmbh (Juelich, Germany) - Coordinator Commissariat à l'énergie atomique et aux Énergies alternatives (CEA, France) Appentra Solutions SL (Appentra, Spain) Eidgenoessische Technische Hochschule Zuerich (ETH Zürich, Switzerland) European Centre for Medium-range Weather Forecasts (ECMWF, United Kingdom) Seagate Systems UK Limited (Seagate Systems, United Kingdom) Cray Computer Gmbh (Cray, Switzerland)

www.maestro-data.eu Middleware for memory and data-awareness in workflows The Maestro project has been set up to tackle one of the most important and difficult problems in HPC, namely the orchestration of data across multiple levels of the memory and storage hardware as well as the software stack. Although data movement is now recognized as the primary obstacle to performance efficiency, much of the software stack is not well suited to optimizing data movement, and was instead designed in an age where optimizing arithmetic operations was the priority. The Maestro project aims to capture the data- and memory-aware aspects of applications and the software stack into a new middleware layer which will perform basic data movement and optimisation on behalf of the application, also making use of modern memory systems. The Maestro project will provide a unique opportunity to challenge traditional approaches for handling data objects and data movements in complex HPC applications and workflows, which will be key for efficient exploitation of future exascale level supercomputers. Prof. Dirk Pleiter, Coordinator of the Maestro project.

www.maestro-data.eu Appentra will enhance the Parallelware software for data awareness to support the Maestro middleware. Appentra brings to the project its long experience in tools for static analysis of HPC codes, which will be used to analyze the application and workflow requirements of the Maestro middleware, and to co-design the Maestro middleware. Appentra brings to the project its long experience in tools for static analysis of HPC codes, which will be used to develop new components of the Maestro middleware concerned with data access and dataflow as well as data-aware execution and orchestration. Appentra will contribute to the validation and demonstration of the Maestro middleware. Appentra leads the coordination of the dissemination activities in Maestro.

www.maestro-data.eu Parallelware (libpw) Parallelware front-end Parallelware middle-end OpenMP 4.5 C Semantic Analysis Engine GUI Desktop Parallelware back-end OpenACC 2.0 Multi-Threading Offloading The Parallelware middle-end will be enhanced by adding the new source code static analysis capabilities needed by the Maestro data orchestration middleware.

www.maestro-data.eu Parallelware (libpw) Parallelware front-end Parallelware middle-end OpenMP 4.5 C Semantic Analysis Engine 1. GUI Desktop 2. OpenACC 2.0 Multi-Threading Offloading Preparation of the Parallelware software to expose the information available in the middle-end: Parallelware back-end Hidden in Parallelware Trainer as the amount of information would be overwhelming. Two new ways to expose the information of Parallelware middle-end: Parallelware Analyzer, new command-line tool. Extension to libpw, new API for third-party tools to access to Parallelware capabilities.

www.maestro-data.eu Parallelware (libpw) Parallelware front-end Parallelware middle-end Parallelware back-end OpenMP 4.5 C OpenACC 2.0 Semantic Analysis Engine Multi-Threading Offloading on going GUI Desktop Command Line Tool 3. First release of Parallelware Analyzer version BETA. a. b. Proposed first set of analyses: --datascoping, --functions, --datalayout, --code and --overview Enhancements to the Parallelware middle-end under development: i. ii. Tracking of scalars across multiple files and multiple procedures Tracking of fields of structs

Index European Research: H2020 FETHPC programme Parallelware Software before MAESTRO & EPEEC EPEEC: European joint Effort toward a Highly Productive Programming Environment for Heterogeneous Exascale Computing Benefits and known limitations MAESTRO: Middleware for memory and data-awareness in workflows Projects MAESTRO and EPEEC Enhancements to Parallelware Software due to Maestro & EPEEC

European joint Effort toward a Highly Productive Programming Environment for Heterogeneous Exascale Computing Objective: Develop and deploy a production-ready parallel programming environment that turns upcoming overwhelmingly-heterogeneous exascale supercomputers into manageable platforms for domain application developers. Partners: Barcelona Supercomputing Center - Centro Nacional de Supercomputacion (BSC, Spain) - Coordinator Fraunhofer Gesellschaft Zur Foerderung der Angewandten Forschung E.V. (Fraunhofer, Germany) Inesc ID - Instituto de Engenharia de Sistemas e Computadores, Investigação e Desenvolvimento em Lisboa (Inesc ID, Portugal) Institut National de Recherche en Informatique et Automatique (Inria, France) Appentra Solutions SL (Appentra, Spain) Cineca Consorzio Interuniversitario (Cineca, Italy) Eta Scale Ab (Eta Scale, Sweden) Centre Europeen de Recherche et de Formation Avancée en Calcul Scientifique (Cerfacs, France) Interuniversitair Micro-electronica Centrum (Imec, Belgium) Uppsala Universitet (Uu, Sweden)

European joint Effort toward a Highly Productive Programming Environment for Heterogeneous Exascale Computing The consortium will significantly advance and integrate existing state-of-the-art components based on European technology (programming models, runtime systems, and tools) with key features enabling 3 overarching objectives: high coding productivity, high performance, and energy awareness. An automatic generator of compiler directives will provide outstanding coding productivity from the very beginning of the application developing/porting process. Developers will be able to leverage either shared memory or distributed-shared memory programming flavours, and code in their preferred language: C, Fortran, or C++. EPEEC will ensure the composability and interoperability of its programming models and runtimes, which will incorporate specific features to handle data-intensive and extreme-data applications. Enhanced leading-edge performance tools will offer integral profiling, performance prediction, and visualisation of traces. Five applications representative of different relevant scientific domains will serve as part of a strong inter-disciplinary co-design approach and as technology demonstrators. EPEEC exploits results from past FET projects that led to the cutting-edge software components it builds upon, and pursues influencing the most relevant parallel programming standardisation bodies.

Parallelware (libpw) Parallelware front-end Parallelware middle-end Parallelware back-end OpenMP 4.5 C Semantic Analysis Engine OpenACC 2.0 Multi-Threading Offloading on going GUI Desktop Command Line Tool The Parallelware front-end and back-end will be enhanced in order to meet the needs of the EPEEC highly productive programming environment for heterogeneous exascale computing. 19

Parallelware (libpw) Parallelware front-end Parallelware middle-end Parallelware back-end OpenMP 4.5 C Semantic Analysis Engine C++ Fortran OpenACC 2.0 OmpSs Multi-Threading Tasking Offloading FPGAs on going GUI Desktop Command Line Tool 1. Currently conducting studies to understand the requirements of the EPEEC applications, EPEEC programming models and EPEEC target hardware platforms. a. b. c. Programming languages: C++ (ie. C enriched with vector, algorithm and templates), Fortran Programming models: OmpSs, OpenMP (tasking) Hardware platforms: GPUs, FPGAs 20

Index European Research: H2020 FETHPC programme Parallelware Software before MAESTRO & EPEEC EPEEC: European joint Effort toward a Highly Productive Programming Environment for Heterogeneous Exascale Computing Benefits and known limitations MAESTRO: Middleware for memory and data-awareness in workflows Projects MAESTRO and EPEEC Enhancements to Parallelware Software due to Maestro & EPEEC

Enhancements to Parallelware Software Parallelware (libpw) Parallelware front-end Parallelware middle-end Parallelware back-end OpenMP 4.5 C Semantic Analysis Engine C++ Fortran OpenACC 2.0 OmpSs Multi-Threading Tasking Offloading FPGAs on going GUI Desktop Command Line Tool The projects MAESTRO and EPEEC have just started, so these capabilities will be developed incrementally following a co-design approach guided by (pre-)exascale applications.

Leveraging Parallelware in MAESTRO and EPEEC and Enhancements to Parallelware Manuel Arenaz manuel.arenaz@appentra.com PRACE booth #2033 Thursday, 15 November 2018 Dallas, US