A dynamic load-balancing strategy for large scale CFD-applications
|
|
- Hester Wendy King
- 5 years ago
- Views:
Transcription
1 A dynamic load-balancing strategy for large scale CFD-applications Philipp Offenhäuser /20 :: A dynamic load-balancing strategy for large scale CFD-applications :: ::
2 Outline Motivation and Background Load-Balancing strategy Results Further Challenges Conclusion 2/20 :: A dynamic load-balancing strategy for large scale CFD-applications :: ::
3 Motivation - Top500 from 1993 to Rmax Rank 1 Rmax Rank 10 #Cores Rank 1 #Cores Rank Performance [GF lop/s] # Cores Year 10 0 (Source: top500.org) 3/20 :: A dynamic load-balancing strategy for large scale CFD-applications :: ::
4 Motivation HPC systems generate their power by facilitating hundreds of thousands of cores Massive parallel algorithms are implemented Computational effort must be distributed evenly across all cores Initial domain decomposition techniques are well know Well balanced applications for scientific use-cases are developed Performance [GF lop/s] Rmax Rank Rmax Rank #Cores Rank #Cores Rank Year # Cores (Source: top500.org) 4/20 :: A dynamic load-balancing strategy for large scale CFD-applications :: ::
5 Definitions Element-weight: Specific computational effort of an element. 5/20 :: A dynamic load-balancing strategy for large scale CFD-applications :: ::
6 Definitions Element-weight: Specific computational effort of an element. Load of a MPI-domain: Sum of the element-weights of a MPI-domain. 5/20 :: A dynamic load-balancing strategy for large scale CFD-applications :: ::
7 Definitions Element-weight: Specific computational effort of an element. Load of a MPI-domain: Sum of the element-weights of a MPI-domain. Load-imbalance: Load-imbalance L = Load-imbalance T = maximum load average load maximum computation time average computation time 5/20 :: A dynamic load-balancing strategy for large scale CFD-applications :: ::
8 Example 1 - Finite-Volume-Code FS3D References: Institute of Aerospace Thermodynamics - University of Stuttgart 6/20 :: A dynamic load-balancing strategy for large scale CFD-applications :: ::
9 Examples for load-imbalance - Volume of Fluid Method y ,7 0, , , ,7 0 Treatment of multiple phases 0 liquid f (x, t) = (0; 1) interface 1 solid Reconstruction of the interface Additional computational effort for the interface elements Source: ITLR x 7/20 :: A dynamic load-balancing strategy for large scale CFD-applications :: ::
10 Examples for load-imbalance - Volume of Fluid Method y ,7 0, , , ,7 0 Treatment of multiple phases 0 liquid f (x, t) = (0; 1) interface 1 solid Reconstruction of the interface Additional computational effort for the interface elements Source: ITLR x 7/20 :: A dynamic load-balancing strategy for large scale CFD-applications :: ::
11 Example 2 - Discontinuous Galerkin Spectral Element Method (DG SEM) based code FLEXI References: Institute of Aerodynamics and Gas Dynamics - University of Stuttgart Numerical research group FLEXI on github: 8/20 :: A dynamic load-balancing strategy for large scale CFD-applications :: ::
12 FLEXI - a high-order numerical framework Hight-order numerical approach Massively parallel CFD-code designed for HPC-systems DG SEM enables efficient communication-pattern (communication hiding) FLEXI tested on different HPC-systems: Hornet, HazelHen, JUQueen, Marconi FLEXI is used for very large academic use-cases 9/20 :: A dynamic load-balancing strategy for large scale CFD-applications :: ::
13 Example - gas injection Numerical approach was extended to simulate complex fluid-flow The FV-Sub-Cell-Method is used for shock capturing Code is used for industrial use cases Depending on the local solution the FV-Sub-Cell-Method is turned on/off FV-Sub-Cell-Method adds local computational effort 10/20 :: A dynamic load-balancing strategy for large scale CFD-applications :: ::
14 Local computational effort T = 0,00 ms T = 0,25 ms T = 0,50 ms 11/20 :: A dynamic load-balancing strategy for large scale CFD-applications :: ::
15 Motivation for dynamic load-balancing load / average load core id Load distribution of a test-case on 96 cores. Small number of cores are over-loaded Huge number of cores are under-loaded Performance of the application suffers from the overload on a small number of cores 12/20 :: A dynamic load-balancing strategy for large scale CFD-applications :: ::
16 A dynamic load-balancing strategy Step 1: Calculate a new optimized load distribution Step 2: Distribute the load between MPI-processes 13/20 :: A dynamic load-balancing strategy for large scale CFD-applications :: ::
17 A dynamic load-balancing strategy Step 1: Calculate a new optimized load distribution Step 2: Distribute the load between MPI-processes Challenges: Global optimization problem Simple and fast calculation of the optimized load distribution Keep the communication overhead small 13/20 :: A dynamic load-balancing strategy for large scale CFD-applications :: ::
18 A dynamic load-balancing strategy Step 1: Calculate a new optimized load distribution Step 2: Distribute the load between MPI-processes Challenges: Global optimization problem Simple and fast calculation of the optimized load distribution Keep the communication overhead small Challenges: Communication-structure originates during run-time Communication-structure changes during run-time Keep the communication overhead small 13/20 :: A dynamic load-balancing strategy for large scale CFD-applications :: ::
19 A dynamic load-balancing strategy - approach w2 w1 Elementweight Core Core Using a space-filling-curve for the domain-decomposition Mapping the 3D-problem to a 1D-problem Minimal communication (only one time) Simple communication pattern 14/20 :: A dynamic load-balancing strategy for large scale CFD-applications :: ::
20 1. Calculate a new optimized load distribution An optimization algorithm based on the element specific computational effort is used for the load-distribution. cavg Pn = min( c P n 1 avg + Pn 1, cp n 1 avg + Pn 2 ) load / average load load / average load core id core id Load distribution without load-balancing Load distribution with load-balancing Load distribution of a test-case on 96 cores. 15/20 :: A dynamic load-balancing strategy for large scale CFD-applications :: ::
21 Step 2: Distributing the load between MPI-processes Using again the 1D-structure of the space-filling-curve Intra-node-communication via MPI-Shared-Memory-Window Shared-Memory used for communication and to store the results during the load-balancing MPI-communication only between nodes 16/20 :: A dynamic load-balancing strategy for large scale CFD-applications :: ::
22 Results - Element distribution T 0 = 0,00ms T 1 = 0,25ms T 2 = 0,50ms Element distribution depends on the computational effort Redistrubution of the elements every 1,000 timesteps 10% reduction of the wall-time 17/20 :: A dynamic load-balancing strategy for large scale CFD-applications :: ::
23 Dynamic load distribution 18/20 :: A dynamic load-balancing strategy for large scale CFD-applications :: ::
24 Further Challenges - detection of load-imbalance at run-time State of the art: I Different powerful performance measuring tools are available I All tools need an instrumentation and produce overhead I Results of the measurement only available after simulation I No feedback loop to the application 19/20 :: A dynamic load-balancing strategy for large scale CFD-applications :: ::
25 Further Challenges - detection of load-imbalance at run-time State of the art: I Different powerful performance measuring tools are available I All tools need an instrumentation and produce overhead I Results of the measurement only available after simulation I No feedback loop to the application Challenges: I Measurement has to be integrated in the application I Feedback loop to the application I Measurement of the load-imbalance with a minimal overhead 19/20 :: A dynamic load-balancing strategy for large scale CFD-applications :: ::
26 Conclusion HPC systems generate their power by facilitating hundreds of thousands of cores Massively parallel CFD-codes are implemented Initial domain decomposition techniques are well know Well balanced CFD-applications Simulation of complex fluid flow varying load distribution during run-time occurrence of the load distribution hard to predict spatially and chronologically simulation specific load-imbalance occurs For efficient application execution dynamic load-balance strategies are indispensable Efficient redistribution of the computational load Detection of load-imbalance with a very low overhead 20/20 :: A dynamic load-balancing strategy for large scale CFD-applications :: ::
27 Thank you for your attention! 20/20 :: A dynamic load-balancing strategy for large scale CFD-applications :: ::
Unstructured Grid Numbering Schemes for GPU Coalescing Requirements
Unstructured Grid Numbering Schemes for GPU Coalescing Requirements Andrew Corrigan 1 and Johann Dahm 2 Laboratories for Computational Physics and Fluid Dynamics Naval Research Laboratory 1 Department
More informationHPC Considerations for Scalable Multidiscipline CAE Applications on Conventional Linux Platforms. Author: Correspondence: ABSTRACT:
HPC Considerations for Scalable Multidiscipline CAE Applications on Conventional Linux Platforms Author: Stan Posey Panasas, Inc. Correspondence: Stan Posey Panasas, Inc. Phone +510 608 4383 Email sposey@panasas.com
More informationScalable Dynamic Load Balancing of Detailed Cloud Physics with FD4
Center for Information Services and High Performance Computing (ZIH) Scalable Dynamic Load Balancing of Detailed Cloud Physics with FD4 Minisymposium on Advances in Numerics and Physical Modeling for Geophysical
More informationKeisuke Sawada. Department of Aerospace Engineering Tohoku University
March 29th, 213 : Next Generation Aircraft Workshop at Washington University Numerical Study of Wing Deformation Effect in Wind-Tunnel Testing Keisuke Sawada Department of Aerospace Engineering Tohoku
More informationPerformance of the 3D-Combustion Simulation Code RECOM-AIOLOS on IBM POWER8 Architecture. Alexander Berreth. Markus Bühler, Benedikt Anlauf
PADC Anual Workshop 20 Performance of the 3D-Combustion Simulation Code RECOM-AIOLOS on IBM POWER8 Architecture Alexander Berreth RECOM Services GmbH, Stuttgart Markus Bühler, Benedikt Anlauf IBM Deutschland
More informationAsynchronous OpenCL/MPI numerical simulations of conservation laws
Asynchronous OpenCL/MPI numerical simulations of conservation laws Philippe HELLUY 1,3, Thomas STRUB 2. 1 IRMA, Université de Strasbourg, 2 AxesSim, 3 Inria Tonus, France IWOCL 2015, Stanford Conservation
More informationThe Potential of Diffusive Load Balancing at Large Scale
Center for Information Services and High Performance Computing The Potential of Diffusive Load Balancing at Large Scale EuroMPI 2016, Edinburgh, 27 September 2016 Matthias Lieber, Kerstin Gößner, Wolfgang
More informationSPEED-UP GEARBOX SIMULATIONS BY INTEGRATING SCORG. Dr. Christine Klier, Sahand Saheb-Jahromi, Ludwig Berger*
SPEED-UP GEARBOX SIMULATIONS BY INTEGRATING SCORG Dr. Christine Klier, Sahand Saheb-Jahromi, Ludwig Berger* CFD SCHUCK ENGINEERING Engineering Services in computational fluid Dynamics (CFD) 25 employees
More informationRecent results with elsa on multi-cores
Michel Gazaix (ONERA) Steeve Champagneux (AIRBUS) October 15th, 2009 Outline Short introduction to elsa elsa benchmark on HPC platforms Detailed performance evaluation IBM Power5, AMD Opteron, INTEL Nehalem
More informationPerformance Optimization of a Massively Parallel Phase-Field Method Using the HPC Framework walberla
Performance Optimization of a Massively Parallel Phase-Field Method Using the HPC Framework walberla SIAM PP 2016, April 13 th 2016 Martin Bauer, Florian Schornbaum, Christian Godenschwager, Johannes Hötzer,
More informationParallel Algorithm Design. Parallel Algorithm Design p. 1
Parallel Algorithm Design Parallel Algorithm Design p. 1 Overview Chapter 3 from Michael J. Quinn, Parallel Programming in C with MPI and OpenMP Another resource: http://www.mcs.anl.gov/ itf/dbpp/text/node14.html
More informationsimulation framework for piecewise regular grids
WALBERLA, an ultra-scalable multiphysics simulation framework for piecewise regular grids ParCo 2015, Edinburgh September 3rd, 2015 Christian Godenschwager, Florian Schornbaum, Martin Bauer, Harald Köstler
More informationPeta-Scale Simulations with the HPC Software Framework walberla:
Peta-Scale Simulations with the HPC Software Framework walberla: Massively Parallel AMR for the Lattice Boltzmann Method SIAM PP 2016, Paris April 15, 2016 Florian Schornbaum, Christian Godenschwager,
More informationLoad Balancing and Data Migration in a Hybrid Computational Fluid Dynamics Application
Load Balancing and Data Migration in a Hybrid Computational Fluid Dynamics Application Esteban Meneses Patrick Pisciuneri Center for Simulation and Modeling (SaM) University of Pittsburgh University of
More informationPublic Release: Native Overset Mesh in FOAM-Extend
Public Release: Native Overset Mesh in FOAM-Extend Vuko Vukčević and Hrvoje Jasak Faculty of Mechanical Engineering and Naval Architecture, Uni Zagreb, Croatia Wikki Ltd. United Kingdom 5th OpenFOAM UK
More informationCHRONO::HPC DISTRIBUTED MEMORY FLUID-SOLID INTERACTION SIMULATIONS. Felipe Gutierrez, Arman Pazouki, and Dan Negrut University of Wisconsin Madison
CHRONO::HPC DISTRIBUTED MEMORY FLUID-SOLID INTERACTION SIMULATIONS Felipe Gutierrez, Arman Pazouki, and Dan Negrut University of Wisconsin Madison Support: Rapid Innovation Fund, U.S. Army TARDEC ASME
More informationNUMERICAL INVESTIGATION OF THE FLOW BEHAVIOR INTO THE INLET GUIDE VANE SYSTEM (IGV)
University of West Bohemia» Department of Power System Engineering NUMERICAL INVESTIGATION OF THE FLOW BEHAVIOR INTO THE INLET GUIDE VANE SYSTEM (IGV) Publication was supported by project: Budování excelentního
More informationRyan C. Hulguin TACC-Intel Highly Parallel Computing Symposium April 10th-11th, 2012 Austin, TX
Ryan C. Hulguin TACC-Intel Highly Parallel Computing Symposium April 10th-11th, 2012 Austin, TX Outline Introduction Knights Ferry Technical Specifications CFD Governing Equations Numerical Algorithm Solver
More informationEFFICIENT SOLVER FOR LINEAR ALGEBRAIC EQUATIONS ON PARALLEL ARCHITECTURE USING MPI
EFFICIENT SOLVER FOR LINEAR ALGEBRAIC EQUATIONS ON PARALLEL ARCHITECTURE USING MPI 1 Akshay N. Panajwar, 2 Prof.M.A.Shah Department of Computer Science and Engineering, Walchand College of Engineering,
More informationSimulation of Turbulent Particulate Flow on HPC Systems
Simulation of Turbulent Particulate Flow on HPC Systems Konstantin Fröhlich, Lennart Schneiders, Matthias Meinke, Wolfgang Schröder k.froehlich@aia.rwth-aachen.de Institute of Aerodynamics RWTH Aachen
More informationMulti-Domain Pattern. I. Problem. II. Driving Forces. III. Solution
Multi-Domain Pattern I. Problem The problem represents computations characterized by an underlying system of mathematical equations, often simulating behaviors of physical objects through discrete time
More informationHigh-Order Methods for Turbulent Transport in Engineering and Geosciences.
High-Order Methods for Turbulent Transport in Engineering and Geosciences. PI: Paul Fischer, University of Illinois, Urbana-Champaign, fischerp@illinois.edu Collaborators: Ananias Tomboulides, University
More informationRadial Basis Function-Generated Finite Differences (RBF-FD): New Opportunities for Applications in Scientific Computing
Radial Basis Function-Generated Finite Differences (RBF-FD): New Opportunities for Applications in Scientific Computing Natasha Flyer National Center for Atmospheric Research Boulder, CO Meshes vs. Mesh-free
More informationUNIVERSITY OF CASTILLA-LA MANCHA. Computing Systems Department
UNIVERSITY OF CASTILLA-LA MANCHA Computing Systems Department A case study on implementing virtual 5D torus networks using network components of lower dimensionality HiPINEB 2017 Francisco José Andújar
More informationTeam 194: Aerodynamic Study of Airflow around an Airfoil in the EGI Cloud
Team 194: Aerodynamic Study of Airflow around an Airfoil in the EGI Cloud CFD Support s OpenFOAM and UberCloud Containers enable efficient, effective, and easy access and use of MEET THE TEAM End-User/CFD
More informationUncertainty Analysis: Parameter Estimation. Jackie P. Hallberg Coastal and Hydraulics Laboratory Engineer Research and Development Center
Uncertainty Analysis: Parameter Estimation Jackie P. Hallberg Coastal and Hydraulics Laboratory Engineer Research and Development Center Outline ADH Optimization Techniques Parameter space Observation
More informationComputational Fluid Dynamics with the Lattice Boltzmann Method KTH SCI, Stockholm
Computational Fluid Dynamics with the Lattice Boltzmann Method KTH SCI, Stockholm March 17 March 21, 2014 Florian Schornbaum, Martin Bauer, Simon Bogner Chair for System Simulation Friedrich-Alexander-Universität
More informationLarge Scale Simulation of Cloud Cavitation Collapse
PRACEdays17 Barcelona, May 16-18, 2017 Large Scale Simulation of Cloud Cavitation Collapse Ursula Rasthofer with: Fabian Wermelinger, Petr Karnakov, Panagiotis Hadjidoukas, Petros Koumoutsakos CSElab Computational
More informationICON for HD(CP) 2. High Definition Clouds and Precipitation for Advancing Climate Prediction
ICON for HD(CP) 2 High Definition Clouds and Precipitation for Advancing Climate Prediction High Definition Clouds and Precipitation for Advancing Climate Prediction ICON 2 years ago Parameterize shallow
More informationApplied Reservoir Simulation. Course. Workshop Problems
Applied Reservoir Simulation Course Workshop Problems Reservoir Simulation Application Training Course and (Eclipse) Workshop GeoQuest Training and Development, And NExT Denver and Houston March 06 Applied
More informationDesign Verification of Hydraulic Performance. for a Centrifugal Pump using. Computational Fluid Dynamics (CFD)
Design Verification of Hydraulic Performance for a Centrifugal Pump using Computational Fluid Dynamics (CFD) Anil S. Akole (Asst. Manager) Center for Design & Research Pentair Water India Private Limited
More informationMassively Parallel Phase Field Simulations using HPC Framework walberla
Massively Parallel Phase Field Simulations using HPC Framework walberla SIAM CSE 2015, March 15 th 2015 Martin Bauer, Florian Schornbaum, Christian Godenschwager, Johannes Hötzer, Harald Köstler and Ulrich
More informationIntroduction to High-Performance Computing
Introduction to High-Performance Computing 2 What is High Performance Computing? There is no clear definition Computing on high performance computers Solving problems / doing research using computer modeling,
More informationFourteen years of Cactus Community
Fourteen years of Cactus Community Frank Löffler Center for Computation and Technology Louisiana State University, Baton Rouge, LA September 6th 2012 Outline Motivation scenario from Astrophysics Cactus
More informationLS-DYNA Best-Practices: Networking, MPI and Parallel File System Effect on LS-DYNA Performance
11 th International LS-DYNA Users Conference Computing Technology LS-DYNA Best-Practices: Networking, MPI and Parallel File System Effect on LS-DYNA Performance Gilad Shainer 1, Tong Liu 2, Jeff Layton
More informationTowards Efficient MapReduce Using MPI
Towards Efficient MapReduce Using MPI Torsten Hoefler¹, Andrew Lumsdaine¹, Jack Dongarra² ¹Open Systems Lab Indiana University Bloomington ²Dept. of Computer Science University of Tennessee Knoxville 09/09/09
More informationSpeedup Altair RADIOSS Solvers Using NVIDIA GPU
Innovation Intelligence Speedup Altair RADIOSS Solvers Using NVIDIA GPU Eric LEQUINIOU, HPC Director Hongwei Zhou, Senior Software Developer May 16, 2012 Innovation Intelligence ALTAIR OVERVIEW Altair
More informationAdaptive-Mesh-Refinement Pattern
Adaptive-Mesh-Refinement Pattern I. Problem Data-parallelism is exposed on a geometric mesh structure (either irregular or regular), where each point iteratively communicates with nearby neighboring points
More informationA Scalable Adaptive Mesh Refinement Framework For Parallel Astrophysics Applications
A Scalable Adaptive Mesh Refinement Framework For Parallel Astrophysics Applications James Bordner, Michael L. Norman San Diego Supercomputer Center University of California, San Diego 15th SIAM Conference
More informationDynamic Load Balancing for Weather Models via AMPI
Dynamic Load Balancing for Eduardo R. Rodrigues IBM Research Brazil edrodri@br.ibm.com Celso L. Mendes University of Illinois USA cmendes@ncsa.illinois.edu Laxmikant Kale University of Illinois USA kale@cs.illinois.edu
More informationTwo-Phase flows on massively parallel multi-gpu clusters
Two-Phase flows on massively parallel multi-gpu clusters Peter Zaspel Michael Griebel Institute for Numerical Simulation Rheinische Friedrich-Wilhelms-Universität Bonn Workshop Programming of Heterogeneous
More informationLoad Balancing for Parallel Multi-core Machines with Non-Uniform Communication Costs
Load Balancing for Parallel Multi-core Machines with Non-Uniform Communication Costs Laércio Lima Pilla llpilla@inf.ufrgs.br LIG Laboratory INRIA Grenoble University Grenoble, France Institute of Informatics
More informationOpenFOAM on POWER8. Stretching the performance envelope. A White Paper by OCF
OpenFOAM on POWER8 Stretching the performance envelope A White Paper by OCF Executive Summary In this white paper, we will show that the IBM Power architecture provides a uniquely powerful platform for
More informationPreliminary Experiences with the Uintah Framework on on Intel Xeon Phi and Stampede
Preliminary Experiences with the Uintah Framework on on Intel Xeon Phi and Stampede Qingyu Meng, Alan Humphrey, John Schmidt, Martin Berzins Thanks to: TACC Team for early access to Stampede J. Davison
More informationGE Usage & Trends
CFD @ GE Usage & Trends Dr. Senior Principal Engineer GE Global Research 06 January 2018 Overview of CFD at GE Wide penetration Aviation, Power, Oil & Gas, Renewables Aerodynamics, heat transfer, aeromechanics,
More informationDebojyoti Ghosh. Adviser: Dr. James Baeder Alfred Gessow Rotorcraft Center Department of Aerospace Engineering
Debojyoti Ghosh Adviser: Dr. James Baeder Alfred Gessow Rotorcraft Center Department of Aerospace Engineering To study the Dynamic Stalling of rotor blade cross-sections Unsteady Aerodynamics: Time varying
More informationMultigrid Solvers in CFD. David Emerson. Scientific Computing Department STFC Daresbury Laboratory Daresbury, Warrington, WA4 4AD, UK
Multigrid Solvers in CFD David Emerson Scientific Computing Department STFC Daresbury Laboratory Daresbury, Warrington, WA4 4AD, UK david.emerson@stfc.ac.uk 1 Outline Multigrid: general comments Incompressible
More informationSimulation and Optimization of Biomedical Devices. Mike Nicolai Jülich
Simulation and Optimization of Biomedical Devices. Mike Nicolai Jülich 7.5.2013 Contents SimLab: Highly Scalable Fluids and Solids Engineering Goals Projects The XNS flow solver Features Code Performance
More informationPreliminary Spray Cooling Simulations Using a Full-Cone Water Spray
39th Dayton-Cincinnati Aerospace Sciences Symposium Preliminary Spray Cooling Simulations Using a Full-Cone Water Spray Murat Dinc Prof. Donald D. Gray (advisor), Prof. John M. Kuhlman, Nicholas L. Hillen,
More informationMPI Optimizations via MXM and FCA for Maximum Performance on LS-DYNA
MPI Optimizations via MXM and FCA for Maximum Performance on LS-DYNA Gilad Shainer 1, Tong Liu 1, Pak Lui 1, Todd Wilde 1 1 Mellanox Technologies Abstract From concept to engineering, and from design to
More informationAcknowledgments. Amdahl s Law. Contents. Programming with MPI Parallel programming. 1 speedup = (1 P )+ P N. Type to enter text
Acknowledgments Programming with MPI Parallel ming Jan Thorbecke Type to enter text This course is partly based on the MPI courses developed by Rolf Rabenseifner at the High-Performance Computing-Center
More informationPerformance and Accuracy of Lattice-Boltzmann Kernels on Multi- and Manycore Architectures
Performance and Accuracy of Lattice-Boltzmann Kernels on Multi- and Manycore Architectures Dirk Ribbrock, Markus Geveler, Dominik Göddeke, Stefan Turek Angewandte Mathematik, Technische Universität Dortmund
More informationA Scalable GPU-Based Compressible Fluid Flow Solver for Unstructured Grids
A Scalable GPU-Based Compressible Fluid Flow Solver for Unstructured Grids Patrice Castonguay and Antony Jameson Aerospace Computing Lab, Stanford University GTC Asia, Beijing, China December 15 th, 2011
More informationcomputational Fluid Dynamics - Prof. V. Esfahanian
Three boards categories: Experimental Theoretical Computational Crucial to know all three: Each has their advantages and disadvantages. Require validation and verification. School of Mechanical Engineering
More informationHFSS: Optimal Phased Array Modeling Using Domain Decomposition
HFSS: Optimal Phased Array Modeling Using Domain Decomposition 15. 0 Release Authors: Dane Thompson Nick Hirth Irina Gordion Sara Louie Presenter: Dane Thompson Motivation Electronically scannable antenna
More informationCharacterizing Imbalance in Large-Scale Parallel Programs. David Bo hme September 26, 2013
Characterizing Imbalance in Large-Scale Parallel Programs David o hme September 26, 2013 Need for Performance nalysis Tools mount of parallelism in Supercomputers keeps growing Efficient resource usage
More informationHPC Methods for Coupling Spectral Cloud Microphysics with the COSMO Model
Center for Information Services and High Performance Computing (ZIH) HPC Methods for Coupling Spectral Cloud Microphysics with the COSMO Model Max Planck Institute for Meteorology 17 March 2011 Matthias
More informationOverview. Idea: Reduce CPU clock frequency This idea is well suited specifically for visualization
Exploring Tradeoffs Between Power and Performance for a Scientific Visualization Algorithm Stephanie Labasan & Matt Larsen (University of Oregon), Hank Childs (Lawrence Berkeley National Laboratory) 26
More informationOptimizing LS-DYNA Productivity in Cluster Environments
10 th International LS-DYNA Users Conference Computing Technology Optimizing LS-DYNA Productivity in Cluster Environments Gilad Shainer and Swati Kher Mellanox Technologies Abstract Increasing demand for
More information3D Finite Difference Time-Domain Modeling of Acoustic Wave Propagation based on Domain Decomposition
3D Finite Difference Time-Domain Modeling of Acoustic Wave Propagation based on Domain Decomposition UMR Géosciences Azur CNRS-IRD-UNSA-OCA Villefranche-sur-mer Supervised by: Dr. Stéphane Operto Jade
More informationCMSC 714 Lecture 6 MPI vs. OpenMP and OpenACC. Guest Lecturer: Sukhyun Song (original slides by Alan Sussman)
CMSC 714 Lecture 6 MPI vs. OpenMP and OpenACC Guest Lecturer: Sukhyun Song (original slides by Alan Sussman) Parallel Programming with Message Passing and Directives 2 MPI + OpenMP Some applications can
More informationNumerical Investigation of Transonic Shock Oscillations on Stationary Aerofoils
Numerical Investigation of Transonic Shock Oscillations on Stationary Aerofoils A. Soda, T. Knopp, K. Weinman German Aerospace Center DLR, Göttingen/Germany Symposium on Hybrid RANS-LES Methods Stockholm/Sweden,
More informationSOLIDWORKS Flow Simulation Options
SOLIDWORKS Flow Simulation Options SOLIDWORKS Flow Simulation includes an options dialogue window that allows for defining default options to use for a new project. Some of the options included are unit
More informationMassively Parallel Finite Element Simulations with deal.ii
Massively Parallel Finite Element Simulations with deal.ii Timo Heister, Texas A&M University 2012-02-16 SIAM PP2012 joint work with: Wolfgang Bangerth, Carsten Burstedde, Thomas Geenen, Martin Kronbichler
More informationComputational Fluid Dynamics
Computational Fluid Dynamics Prof. Dr.-Ing. Siegfried Wagner Institut für Aerodynamik und Gasdynamik, Universität Stuttgart, Pfaffenwaldring 21, 70550 Stuttgart A large number of highly qualified papers
More informationA Large-Scale Study of Soft- Errors on GPUs in the Field
A Large-Scale Study of Soft- Errors on GPUs in the Field Bin Nie*, Devesh Tiwari +, Saurabh Gupta +, Evgenia Smirni*, and James H. Rogers + *College of William and Mary + Oak Ridge National Laboratory
More informationSuccessful Applications of CFD within the Aircraft Certification Process TLG Aerospace. Unrestricted Siemens AG 2018
Successful Applications of CFD within the Aircraft Certification Process TLG Aerospace Unrestricted Siemens AG 2018 About TLG Aerospace TLG is an engineering services company providing design, analysis,
More informationTechniques to improve the scalability of Checkpoint-Restart
Techniques to improve the scalability of Checkpoint-Restart Bogdan Nicolae Exascale Systems Group IBM Research Ireland 1 Outline A few words about the lab and team Challenges of Exascale A case for Checkpoint-Restart
More informationA Scalable Parallel LSQR Algorithm for Solving Large-Scale Linear System for Seismic Tomography
1 A Scalable Parallel LSQR Algorithm for Solving Large-Scale Linear System for Seismic Tomography He Huang, Liqiang Wang, Po Chen(University of Wyoming) John Dennis (NCAR) 2 LSQR in Seismic Tomography
More informationNumerical Verification of Large Scale CFD Simulations: One Way to Prepare the Exascale Challenge
Numerical Verification of Large Scale CFD Simulations: One Way to Prepare the Exascale Challenge Christophe DENIS Christophe.Denis@edf.fr EDF Resarch and Development - EDF Lab Clamart August 22, 2014 16
More informationDevelopment of a Maxwell Equation Solver for Application to Two Fluid Plasma Models. C. Aberle, A. Hakim, and U. Shumlak
Development of a Maxwell Equation Solver for Application to Two Fluid Plasma Models C. Aberle, A. Hakim, and U. Shumlak Aerospace and Astronautics University of Washington, Seattle American Physical Society
More informationIntro to Parallel Computing
Outline Intro to Parallel Computing Remi Lehe Lawrence Berkeley National Laboratory Modern parallel architectures Parallelization between nodes: MPI Parallelization within one node: OpenMP Why use parallel
More informationGTC 2017 S7672. OpenACC Best Practices: Accelerating the C++ NUMECA FINE/Open CFD Solver
David Gutzwiller, NUMECA USA (david.gutzwiller@numeca.com) Dr. Ravi Srinivasan, Dresser-Rand Alain Demeulenaere, NUMECA USA 5/9/2017 GTC 2017 S7672 OpenACC Best Practices: Accelerating the C++ NUMECA FINE/Open
More informationOzenCloud Case Studies
OzenCloud Case Studies Case Studies, April 20, 2015 ANSYS in the Cloud Case Studies: Aerodynamics & fluttering study on an aircraft wing using fluid structure interaction 1 Powered by UberCloud http://www.theubercloud.com
More informationIntroducing Overdecomposition to Existing Applications: PlasComCM and AMPI
Introducing Overdecomposition to Existing Applications: PlasComCM and AMPI Sam White Parallel Programming Lab UIUC 1 Introduction How to enable Overdecomposition, Asynchrony, and Migratability in existing
More informationCharm++ Workshop 2010
Charm++ Workshop 2010 Eduardo R. Rodrigues Institute of Informatics Federal University of Rio Grande do Sul - Brazil ( visiting scholar at CS-UIUC ) errodrigues@inf.ufrgs.br Supported by Brazilian Ministry
More informationAnalysis of fluid-solid coupling vibration characteristics of probe based on ANSYS Workbench
Analysis of fluid-solid coupling vibration characteristics of probe based on ANSYS Workbench He Wang 1, a, Changzheng Zhao 1, b and Hongzhi Chen 1, c 1 Shandong University of Science and Technology, Qingdao
More informationCME 345: MODEL REDUCTION
CME 345: MODEL REDUCTION Parameterized Partial Differential Equations Charbel Farhat Stanford University cfarhat@stanford.edu 1 / 19 Outline 1 Initial Boundary Value Problems 2 Typical Parameters of Interest
More informationCommunication and Optimization Aspects of Parallel Programming Models on Hybrid Architectures
Communication and Optimization Aspects of Parallel Programming Models on Hybrid Architectures Rolf Rabenseifner rabenseifner@hlrs.de Gerhard Wellein gerhard.wellein@rrze.uni-erlangen.de University of Stuttgart
More informationImplementation of an integrated efficient parallel multiblock Flow solver
Implementation of an integrated efficient parallel multiblock Flow solver Thomas Bönisch, Panagiotis Adamidis and Roland Rühle adamidis@hlrs.de Outline Introduction to URANUS Why using Multiblock meshes
More informationAteles performance assessment report
Ateles performance assessment report Document Information Reference Number Author Contributor(s) Date Application Service Level Keywords AR-4, Version 0.1 Jose Gracia (USTUTT-HLRS) Christoph Niethammer,
More informationOptimize HPC - Application Efficiency on Many Core Systems
Meet the experts Optimize HPC - Application Efficiency on Many Core Systems 2018 Arm Limited Florent Lebeau 27 March 2018 2 2018 Arm Limited Speedup Multithreading and scalability I wrote my program to
More informationPerformance Benefits of NVIDIA GPUs for LS-DYNA
Performance Benefits of NVIDIA GPUs for LS-DYNA Mr. Stan Posey and Dr. Srinivas Kodiyalam NVIDIA Corporation, Santa Clara, CA, USA Summary: This work examines the performance characteristics of LS-DYNA
More informationFault tolerant issues in large scale applications
Fault tolerant issues in large scale applications Romain Teyssier George Lake, Ben Moore, Joachim Stadel and the other members of the project «Cosmology at the petascale» SPEEDUP 2010 1 Outline Computational
More informationDirect Numerical Simulation and Turbulence Modeling for Fluid- Structure Interaction in Aerodynamics
Available online at www.prace-ri.eu Partnership for Advanced Computing in Europe Direct Numerical Simulation and Turbulence Modeling for Fluid- Structure Interaction in Aerodynamics Thibaut Deloze a, Yannick
More informationFast Dynamic Load Balancing for Extreme Scale Systems
Fast Dynamic Load Balancing for Extreme Scale Systems Cameron W. Smith, Gerrett Diamond, M.S. Shephard Computation Research Center (SCOREC) Rensselaer Polytechnic Institute Outline: n Some comments on
More informationAdaptive Refinement Tree (ART) code. N-Body: Parallelization using OpenMP and MPI
Adaptive Refinement Tree (ART) code N-Body: Parallelization using OpenMP and MPI 1 Family of codes N-body: OpenMp N-body: MPI+OpenMP N-body+hydro+cooling+SF: OpenMP N-body+hydro+cooling+SF: MPI 2 History:
More informationParallelization of Scientific Applications (II)
Parallelization of Scientific Applications (II) Parallelization of Particle Based Methods Russian-German School on High Performance Computer Systems, June, 27 th until July, 6 th 2005, Novosibirsk 4. Day,
More informationVisualization and Data Analysis using VisIt - In Situ Visualization -
Mitglied der Helmholtz-Gemeinschaft Visualization and Data Analysis using VisIt - In Situ Visualization - Jens Henrik Göbbert 1, Herwig Zilken 1 1 Jülich Supercomputing Centre, Forschungszentrum Jülich
More informationSimulation of Liquid-Gas-Solid Flows with the Lattice Boltzmann Method
Simulation of Liquid-Gas-Solid Flows with the Lattice Boltzmann Method June 21, 2011 Introduction Free Surface LBM Liquid-Gas-Solid Flows Parallel Computing Examples and More References Fig. Simulation
More informationA-posteriori Diffusion Analysis of Numerical Schemes in Wavenumber Domain
2th Annual CFD Symposium, August 9-1, 218, Bangalore A-posteriori Diffusion Analysis of Numerical Schemes in Wavenumber Domain S. M. Joshi & A. Chatterjee Department of Aerospace Engineering Indian Institute
More informationEvaluating the Performance of the Community Atmosphere Model at High Resolutions
Evaluating the Performance of the Community Atmosphere Model at High Resolutions Soumi Manna MS candidate, University of Wyoming Mentor: Dr. Ben Jamroz National Center for Atmospheric Research Boulder,
More informationAir Force Materiel Command
Air Force Materiel Command Developing, Fielding, and Sustaining America s Aerospace Force High-Fidelity Physics- Based Modeling in Support of Test & Evaluation Dr. Ed Kraft AEDC/CZ NDIA M&S Conference
More informationPerformance Metrics of a Parallel Three Dimensional Two-Phase DSMC Method for Particle-Laden Flows
Performance Metrics of a Parallel Three Dimensional Two-Phase DSMC Method for Particle-Laden Flows Benzi John* and M. Damodaran** Division of Thermal and Fluids Engineering, School of Mechanical and Aerospace
More informationIntroduction to Parallel Performance Engineering
Introduction to Parallel Performance Engineering Markus Geimer, Brian Wylie Jülich Supercomputing Centre (with content used with permission from tutorials by Bernd Mohr/JSC and Luiz DeRose/Cray) Performance:
More informationA New Approach to Modeling Physical Systems: Discrete Event Simulations of Grid-based Models
A New Approach to Modeling Physical Systems: Discrete Event Simulations of Grid-based Models H. Karimabadi 1, Y. Omelchenko 1, J. Driscoll 1, N. Omidi 1, R. Fujimoto 2, S. Pande 2, and K. S. Perumalla
More informationTowards Large Scale Predictive Flow Simulations
Towards Large Scale Predictive Flow Simulations using ITAPS Tools and Services Onkar Sahni a a Present address: Center for Predictive Engineering and Computational Sciences (PECOS), ICES, UT Austin ollaborators:
More information60x Computational Fluid Dynamics and Visualisation
60x Computational Fluid Dynamics and Visualisation Jamil Appa BAE Systems Advanced Technology Centre 1 Outline BAE Systems - Introduction Aerodynamic Design Challenges Why GPUs? CFD on GPUs Example Kernel
More informationReal Application Performance and Beyond
Real Application Performance and Beyond Mellanox Technologies Inc. 2900 Stender Way, Santa Clara, CA 95054 Tel: 408-970-3400 Fax: 408-970-3403 http://www.mellanox.com Scientists, engineers and analysts
More information