The DEEP (and DEEP-ER) projects

1 The DEEP (and DEEP-ER) projects
Estela Suarez, Jülich Supercomputing Centre
BDEC for Europe Workshop, Barcelona
The research leading to these results has received funding from the European Community's Seventh Framework Programme (FP7/ ) under Grant Agreement n and n

2 Topics
DEEP:
  Cluster-Booster concept
  Software stack
  Programming environment
  Performance tools
  Energy efficiency
  Applications: co-design, evaluation/demonstration
DEEP-ER:
  Improve I/O
  Improve resiliency
  New memory technology
  Applications: co-design, evaluation/demonstration

3 Positioning
[Figure labels: Constellation Systems; Cluster Systems; Graphic-card accelerated; IBM Blue Gene family (IBM Blue Gene/L); Infiniband fat tree; Torus 3D/5D/6D; low to medium scalable architecture; highly scalable architecture; applications with complex / regular communication pattern]

4 Positioning
[Figure labels: Constellation Systems; Cluster Systems; Graphic-card accelerated; IBM Blue Gene family (IBM Blue Gene/L); Infiniband fat tree; Torus 3D/5D/6D; low to medium scalable architecture; highly scalable architecture]

5 DEEP hardware
  128 Xeon (Sandy Bridge)
  384 Xeon Phi (KNC)

6 DEEP hardware
  DEEP Cluster at JSC
  ASIC Evaluator (32 KNCs)
  Booster Chassis (32 KNCs)

7 Warm-water cooling
[Diagram labels: Dry Coolers; 100 kW; DEEP Cluster; kW; DEEP Booster; 40 °C; 45 °C]

8 Enhance DEEP architecture
[Diagram labels: Disk; NAM; CN; EXTOLL; NIC; NVM; MEM]
Legend:
  CN: Cluster Node
  BN: Booster Node
  BI: Booster Interface
  NAM: Network Attached Memory
  NVM: Non Volatile Memory

9 Programming environment
[Diagram labels: Cluster; Booster Interface; Booster; ParaStation MPI; Infiniband; Extoll; Cluster Booster Protocol; MPI_Comm_spawn]
OmpSs on top of MPI provides pragmas to ease the offload process
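The offload idea on this slide can be illustrated with a minimal Python sketch. It is not MPI or OmpSs code: a standard-library thread pool stands in for the Booster processes that the real system would start through MPI_Comm_spawn, and the names run_on_cluster, booster_kernel and n_booster_workers are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor


def booster_kernel(chunk):
    """Highly parallel part of the work: a sum of squares over one chunk."""
    return sum(x * x for x in chunk)


def run_on_cluster(data, n_booster_workers=4):
    """Cluster side: split the data, offload the chunks, reduce the results.

    The thread pool is only a stand-in (an assumption of this sketch) for
    processes spawned on the Booster.
    """
    # Strided split so every worker gets a similar share of the data.
    chunks = [data[i::n_booster_workers] for i in range(n_booster_workers)]
    with ThreadPoolExecutor(max_workers=n_booster_workers) as booster:
        partials = list(booster.map(booster_kernel, chunks))
    # The reduction of partial results stays on the Cluster side.
    return sum(partials)


total = run_on_cluster(list(range(1000)))
```

The point of the pattern is the division of labour: the caller keeps the serial control flow and the final reduction, and only the data-parallel kernel is handed off.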

10 Software Architecture
[Diagram labels: Source code; Compiler; Application binaries; DEEP Runtime]

11 Scalable I/O
Improve I/O scalability on all usage levels
  BeeGFS leverages the DEEP architecture and novel memory technology
  Extended I/O APIs (SIONlib, E10) combine performance with ease of use
Synthetic I/O benchmarks, the resiliency scheme and real-world applications guide the development
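SIONlib is a library for task-local I/O that maps the logical per-task files onto one or a few shared physical files. The sketch below shows that general idea only, in plain Python; it does not use the SIONlib API, and CHUNK_SIZE, create_container, write_task_chunk and read_task_chunk are hypothetical names chosen for illustration.

```python
import os
import tempfile

CHUNK_SIZE = 64  # bytes reserved per task (assumed fixed for this sketch)


def create_container(path, n_tasks):
    """Create one shared file with a fixed-size chunk for every task."""
    with open(path, "wb") as f:
        f.truncate(n_tasks * CHUNK_SIZE)


def write_task_chunk(path, rank, payload):
    """Write a task's data into its own chunk of the shared file."""
    if len(payload) > CHUNK_SIZE:
        raise ValueError("payload does not fit into the task chunk")
    with open(path, "r+b") as f:
        f.seek(rank * CHUNK_SIZE)
        # Pad with zero bytes so the chunk is always fully overwritten.
        f.write(payload.ljust(CHUNK_SIZE, b"\0"))


def read_task_chunk(path, rank):
    """Read a task's data back, dropping the zero padding."""
    with open(path, "rb") as f:
        f.seek(rank * CHUNK_SIZE)
        return f.read(CHUNK_SIZE).rstrip(b"\0")


with tempfile.TemporaryDirectory() as workdir:
    container = os.path.join(workdir, "container.bin")
    create_container(container, n_tasks=4)
    for rank in (3, 0, 2, 1):  # tasks may write in any order
        write_task_chunk(container, rank, f"data from task {rank}".encode())
    first = read_task_chunk(container, 0)
```

Because every task owns a disjoint byte range, the file system sees one file instead of one file per task, which is where the metadata savings at scale come from.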

12 Resiliency
Develop a hierarchical, distributed checkpoint/restart scheme leveraging the DEEP-ER architecture
  Stage checkpoints in NVM and NAM close to the Booster Nodes
  Provide checkpoint/restart APIs for the task-based MPI offload model
  Develop OmpSs extensions for automatic task resiliency
Applications guide the development and validate the results
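A hierarchical checkpoint scheme of the kind described on this slide can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the DEEP-ER API: two ordinary directories stand in for node-local NVM and for a more remote level such as NAM, and the class TwoLevelCheckpoint and its parameter remote_every are hypothetical.

```python
import json
import os
import shutil


class TwoLevelCheckpoint:
    """Frequent local checkpoints, less frequent copies to a remote level."""

    def __init__(self, local_dir, remote_dir, remote_every=3):
        self.local_dir = local_dir      # stand-in for node-local NVM
        self.remote_dir = remote_dir    # stand-in for NAM or global storage
        self.remote_every = remote_every

    def save(self, step, state):
        path = os.path.join(self.local_dir, "ckpt.json")
        tmp = path + ".tmp"
        with open(tmp, "w") as f:
            json.dump({"step": step, "state": state}, f)
        # Rename last, so a crash never leaves a half-written checkpoint.
        os.replace(tmp, path)
        if step % self.remote_every == 0:
            shutil.copy2(path, os.path.join(self.remote_dir, "ckpt.json"))

    def restore(self):
        """Prefer the local copy; fall back to the remote one if it is gone."""
        for directory in (self.local_dir, self.remote_dir):
            path = os.path.join(directory, "ckpt.json")
            if os.path.exists(path):
                with open(path) as f:
                    return json.load(f)
        return None
```

In use, a solver loop would call save(step, state) every iteration and restore() once at start-up. Losing the node-local level then costs at most remote_every - 1 steps of recomputation.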

13 Application-driven approach
DEEP+DEEP-ER applications:
  Brain simulation (EPFL)
  Space weather simulation (KULeuven)
  Climate simulation (CYI)
  Computational fluid engineering (CERFACS)
  High temperature superconductivity (CINECA)
  Seismic imaging (CGGVS)
  Human exposure to electromagnetic fields (INRIA)
  Geoscience (BADW-LRZ)
  Radio astronomy (Astron)
  Oil exploration (BSC)
  Lattice QCD (UREG)
Goals:
  Co-design and evaluation of the DEEP architecture and its programmability
  Analysis of the I/O and resiliency requirements of HPC codes

14 DEEP Status (M38)
Hardware status:
  DEEP Cluster
  Booster Chassis (32 KNCs)
  ASIC Evaluator (32 KNCs)
  Energy Efficiency Evaluator (16 KNCs)
Software status:
  System software implemented
  Validation on DEEP hardware ongoing
  Programming model completed: global MPI + OmpSs (offload of highly parallel tasks)
Scientific applications:
  Optimised (vectorisation, threading)
  Application division implemented
[Photos: Booster Chassis; ASIC Evaluator]

15 DEEP-ER Status (M16)
Hardware status:
  Overall architecture design finished
  NVM under evaluation with applications
  NAM in development
Software status:
  Same environment as in DEEP
  Extensions for I/O and resiliency:
    I/O: BeeGFS, SIONlib, Exascale10
    Resiliency: application-based + task-based checkpoint
Scientific applications:
  Applications analysed, optimisations ongoing
[Photo: on-node NVM (Intel DC P3700)]

16 Take aways
Exascale poses challenges
  Energy, resiliency, scalability, programmability
  We have to face more and larger levels of parallelism
  Computing will become (even more) heterogeneous
Some new ideas are around
DEEP
  allows mapping an application's levels of scalability onto hardware
  follows new approaches for the programming paradigm
  handles heterogeneity in an innovative way
DEEP-ER
  also addresses I/O and resiliency
More info:

17 DEEP and DEEP-ER
EU-Exascale projects
  20 partners
  Total budget: 28.3 M€
  EU funding: 14.5 M€
  Nov 2011 to Sept 2016
Visit us at PRACE Days 15, Dublin
