GraS. An Environmental Application on Windows-HPC. Christian Terboven

Similar documents
Hybrid Parallelization: Performance from SMP Building Blocks

Performance Tools for Technical Computing

HPC Tools on Windows. Christian Terboven Center for Computing and Communication RWTH Aachen University.

HPC on Windows. Visual Studio 2010 and ISV Software

Tasking and OpenMP Success Stories

Windows-HPC Environment at RWTH Aachen University

Debugging Serial and Parallel Programs with Visual Studio

First Experiences with Intel Cluster OpenMP

OpenMP for Accelerators

Getting Performance from OpenMP Programs on NUMA Architectures

NUMA-aware OpenMP Programming

Peta-Scale Simulations with the HPC Software Framework walberla:

C++ and OpenMP. 1 ParCo 07 Terboven C++ and OpenMP. Christian Terboven. Center for Computing and Communication RWTH Aachen University, Germany

Runtime Correctness Checking for Emerging Programming Paradigms

Using Intel VTune Amplifier XE for High Performance Computing

Making Supercomputing More Available and Accessible Windows HPC Server 2008 R2 Beta 2 Microsoft High Performance Computing April, 2010

Windows RWTH Aachen University

Introduction to OpenMP

"Charting the Course to Your Success!" MOC A Developing High-performance Applications using Microsoft Windows HPC Server 2008

Optimising MPI Applications for Heterogeneous Coupled Clusters with MetaMPICH

aixcelerate 2016 Paul Kapinos

Intel VTune Amplifier XE

simulation framework for piecewise regular grids

RWTH GPU-Cluster. Sandra Wienke March Rechen- und Kommunikationszentrum (RZ) Fotos: Christian Iwainsky

Bei Wang, Dmitry Prohorov and Carlos Rosales

OpenMP Overview. in 30 Minutes. Christian Terboven / Aachen, Germany Stand: Version 2.

Verification and Validation Introducing Simulink Design Verifier

VLPL-S Optimization on Knights Landing

High Performance Computing on Windows. Debugging with VS2005 Debugging parallel programs. Christian Terboven

walberla: Developing a Massively Parallel HPC Framework

New Features Winter-Release 2018/2019

Parallel Algorithm Design. Parallel Algorithm Design p. 1

The GPU-Cluster. Sandra Wienke Rechen- und Kommunikationszentrum (RZ) Fotos: Christian Iwainsky

Evaluation of Asynchronous Offloading Capabilities of Accelerator Programming Models for Multiple Devices

Tutorial: Application MPI Task Placement

ELP. Effektive Laufzeitunterstützung für zukünftige Programmierstandards. Speaker: Tim Cramer, RWTH Aachen University

Spatial Data Structures

Microsoft Windows HPC Server 2008 R2 for the Cluster Developer

Innovative Tools for the Future Coordinated Operation of the Pan-European Electricity Transmission System FP-7 Project Umbrella

Intersection Acceleration

The Project 12/06/2016 PUBLIC ELYNTEGRATION G.A

Sample some Pi Monte. Introduction. Creating the Simulation. Answers & Teacher Notes

Maximum Entropy (Maxent)

Towards NUMA Support with Distance Information

Profiling: Understand Your Application

Spatial Data Structures

Figure 1: Workflow of object-based classification

Segmentation

Keywords: DSM, DMM, Process simulation, product development, process planning

Object-Based Classification & ecognition. Zutao Ouyang 11/17/2015

Intel Parallel Studio XE Cluster Edition - Intel MPI - Intel Traceanalyzer & Collector

Binding Nested OpenMP Programs on Hierarchical Memory Architectures

IMPORTANCE OF TOPOLOGY IN A GIS PROJECT OF MONITORING THE SOILS IN AGRICULTURAL LAND

Introduction to OpenMP

Installing Visual Studio for Report Design

Mapping Distance and Density

HPC Application Porting to CUDA at BSC

CS 475: Parallel Programming Introduction

Cover Page. Abstract ID Paper Title. Automated extraction of linear features from vehicle-borne laser data

Parallel Computer Architecture - Basics -

HPC on Sun Today and Tomorrow VIRACOCHA: An Efficient Parallelization Framework Processing in Virtual Environments. Andreas Gerndt

Topographic Lidar Data Employed to Map, Preserve U.S. History

How to scale Nested OpenMP Applications on the ScaleMP vsmp Architecture

Accelerators in Technical Computing: Is it Worth the Pain?

Spatial Data Structures

Ray Tracing Acceleration Data Structures

Parallel I/O Libraries and Techniques

Massively Parallel Phase Field Simulations using HPC Framework walberla

Suitability Modeling with GIS

Technical Computing Suite supporting the hybrid system

Package lhs. R topics documented: January 4, 2018

Introduction to OpenMP

Recent Developments in Score-P and Scalasca V2

All-Terminal Reliability Evaluation through a Monte Carlo simulation based on an MPI implementation

The lhs Package. October 22, 2006

GROMACS Performance Benchmark and Profiling. September 2012

TOOLS FOR IMPROVING CROSS-PLATFORM SOFTWARE DEVELOPMENT

Spatial Data Structures

Capturing Topology in Graph Pattern Matching

Adventures in Load Balancing at Scale: Successes, Fizzles, and Next Steps

Measures of Dispersion

Computational Fluid Dynamics with the Lattice Boltzmann Method KTH SCI, Stockholm

HPC learning using Cloud infrastructure

Splotch: High Performance Visualization using MPI, OpenMP and CUDA

ICON Performance Benchmark and Profiling. March 2012

Performance Tools and Holistic HPC Workflows

Exploring 3D visualisation of vegetation

The Potential of Diffusive Load Balancing at Large Scale

What else is available besides OpenMP?

Segmentation

Evaluating the Performance and Energy Efficiency of the COSMO-ART Model System

Approximation Algorithms for Geometric Intersection Graphs

HPC code modernization with Intel development tools

Programming Models for Supercomputing in the Era of Multicore

Photon Mapping. Kadi Bouatouch IRISA

PART ONE: Learn About Area of a Parallelogram

Chapter 1. Introduction

Let s say I give you a homework assignment today with 100 problems. Each problem takes 2 hours to solve. The homework is due tomorrow.

LLGrid: On-Demand Grid Computing with gridmatlab and pmatlab

Sensor Models / Occupancy Mapping / Density Mapping

Transcription:

An Environmental Application on Windows-HPC Christian Terboven terboven@rz.rwth-aachen.de Center for Computing and Communication RWTH Aachen University, Germany Windows-HPC User Group March 12th, Fraunhofer SCAI, Germany

Agenda o HPC off the beaten track : Grassland Succession (HPC on Windows with Delphi) o An HPC-Environment with Delphi?! 2

: Simulation of Grassland Succession o Institute for Environmental Research (Bio5) and Research Institute for Ecosystem Analysis and Assessment (GAIAC) Simulation of grassland development under var. land use forms decision support for grassland mgmt. on the landscape scale Dataset: Eifel National Park established in 2004, includes open grasslands of former military training site Vogelsang 3 Simulation tool developed at Bio5 / GAIAC, developed in Delphi on and for Windows.

: Model Concept o : Spatial Explicit Dynamic Model Model is divided up into cell in which all processes take place Species may immigrate from neighboring cells, or may spread over further distances via seed dispersal Since not every species present can be modeled, a set of representatives was chosen: 10 kinds of grasses or herbs 3 kinds of bushes 4 kinds of trees 4

: Delphi for HPC o The simulation code was not suitable for the Eifel dataset: Delphi currently supports only 32bit (not enough memory) Dataset was manually partitioned to conduct simulation No means of interaction between the partitions o Work done so far: Creation of an MPI application including seed exchange for the whole Eifel dataset. o Seed-funded by JARA-SIM. 5 Left: bad Right: good load balancing.

: Current state of the Parallelization o Landscape map is flattened and separated in squared cells of 10 meters edge length rectangular grid. Several fields are empty, since the Eifel NP is not of a rectangular shape. o A simulated day consists of steps, performed independently for every cell: 1. Growth simulation for every plant contained in the cell 2. Calculation of the amount of seeds to be exchanged with other cells 3. Execution of the seed exchange 4. Calculation of the impact of seed received by other cells 6

Agenda o HPC off the beaten track : Grassland Succession (HPC on Windows with Delphi) o An HPC-Environment with Delphi?! 7

o Well, obviously we cannot use Visual Studio. Tooling for Delphi (1/2) 8 o For Delphi there is a Sampling Profiler: It shows the time spent per time (not quite mapped to the source code), see the image on the right. o No support for any kind of parallelism.

Tooling for Delphi (2/2) o Intel VTune works with any binary (when using Sampling) and can event map results to source code, as long as there is a PDB file good to get a hardware-oriented overview. 9

: An HPC-Environment for Delphi o Delphi comes with virtually no support for parallel programming, so we created / modified a suitable environment: Parallel program Delphi MPI-Binding Parallel program Delphi MPI-Binding Intel-MPI / Microsoft-MPI InfiniBand Network MPI Performance Analysis: I-MPI: Intel Trace Analyzer and Collector MS-MPI (w/ Tracing): Vampir on Windows 10 Multicore Compute Node o No support for Shared- Memory parallelization, we are thinking of rewriting some components in C++... Simulation Time per Year [sec] 90 80 70 60 50 40 30 20 10 0 1 2 4 8 #Nodes 1 ppn 2 ppn 4 ppn 8 ppn

Open Questions o Shall they continue to bet on Delphi? Tool support for Delphi is limited There is no real Delphi for Linux o How to handle Monte-Carlo verification runs, where a model is evaluated a 1000 times in parallel? Use MPI? Use SOA? Use parametric sweep / array job + collection tool? 11 o Note: The Windows HPC Server 2008 enabled HPC as a tool for this user community.

The End Thank you for your attention! 12