Unstructured Grid Numbering Schemes for GPU Coalescing Requirements


1 Unstructured Grid Numbering Schemes for GPU Coalescing Requirements
Andrew Corrigan (1) and Johann Dahm (2)
(1) Laboratories for Computational Physics and Fluid Dynamics, Naval Research Laboratory
(2) Department of Aerospace Engineering, University of Michigan

2 Acknowledgements
Rainald Löhner (George Mason University), K. Kailasanath, Gopal Patnaik, Junhui Liu, Ravi Ramamurti, Douglas Schwer, David Kessler
Work sponsored through ONR/NRL 6.1

3 Motivation: JENRE
JENRE: Jet Engine Noise Reduction code
- Navy-developed simulation tool for jet engine noise prediction
- Requires accurate representation of complex nozzle geometry

4 Motivation: JENRE
Computationally demanding
- Hundreds of millions of grid cells
- Long time duration
- Increasingly complex physics
J. Liu, "Numerical Investigation of Advanced Military Aircraft Noise Reduction Concepts," DoD HPC Challenge Project for FY2012
- Consumes millions of CPU hours per year

5 Motivation: JENRE
Physics
- Euler equations
- Navier-Stokes equations
Numerics
- Cell-centered finite volume
- Node-centered finite element
Grid
- Unstructured (tetrahedral, hexahedral, prism, pyramid)
- Structured (hexahedral)
- Cartesian (hexahedral)
- Hybrid
Parallelism
- Distributed memory parallelism via MPI
- Shared memory parallelism via Thrust
Single codebase
- Extensive use of generic programming via C++ templates
- Recompile for particular physics, numerics, grid, computational architecture
- Individual components can be specialized as required (e.g., grid numbering)

6 Unstructured Grids
- Accurately represent complex geometry
- Provide precise control over grid spacing
- Grid entities can be arbitrarily numbered
- Lead to an indirect, scattered memory access pattern
- Potentially lead to a significant reduction of performance

7 Memory Access Pattern
Gather-scatter memory access pattern
- Edge/face-based solvers
- Common non-trivial memory access pattern in CFD and other areas
- Reflects physical symmetry of conservation laws at discrete level
- Sparse solvers are often implemented directly in terms of grid connectivity
For each edge:
1. Gather from points
2. Compute on edge
3. Scatter back to points
Edges must be colored for parallel execution
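The gather, compute, scatter loop over colored edges can be sketched in a few lines. This is a minimal serial illustration, not the JENRE implementation: the function name `edge_loop`, the placeholder flux (a plain difference), and the list-based data layout are all assumptions made for the example.

```python
def edge_loop(edges, colors, point_values):
    """Accumulate a placeholder edge flux into a per-point residual, one color at a time."""
    residual = [0.0] * len(point_values)
    for color in sorted(set(colors)):
        group = [edge for edge, c in zip(edges, colors) if c == color]
        # Within one color no two edges share a point, so on a GPU every edge
        # of the group could be handled by its own thread without write
        # conflicts. Here the group is simply traversed serially.
        touched = set()
        for left, right in group:
            assert left not in touched and right not in touched, "invalid coloring"
            touched.update((left, right))
            # 1. Gather from points.
            u_left, u_right = point_values[left], point_values[right]
            # 2. Compute on edge (hypothetical flux: the difference of the values).
            flux = u_right - u_left
            # 3. Scatter back to points with opposite signs.
            residual[left] += flux
            residual[right] -= flux
    return residual


if __name__ == "__main__":
    # A chain of four points; alternating colors keep each group conflict-free.
    print(edge_loop([(0, 1), (1, 2), (2, 3)], [0, 1, 0], [0, 1, 3, 6]))
```

Because each edge adds and subtracts the same flux, the residual sums to zero, which mirrors the discrete conservation property mentioned on the slide.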

8 Numbering
- Reorder points to improve memory access pattern as edges are traversed
- Should be tailored to the unique requirements of each computational architecture

9 Bandwidth Minimization
- Intended for minimizing CPU cache misses as edges are traversed
- Examples: Reverse Cuthill-McKee (RCM), space-filling curve, wavefront
- Strives to keep data that is close in physical space close in memory space
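To make "bandwidth" concrete, the sketch below measures the largest index jump across any edge and applies a basic Reverse Cuthill-McKee pass. It is a simplified textbook variant written for illustration (start point chosen by lowest degree, no pseudo-peripheral search); the function names are assumptions and it is not the numbering code used for the benchmarks.

```python
from collections import deque


def bandwidth(edges, number):
    """Largest difference between the numbers of the two end points of any edge."""
    return max(abs(number[a] - number[b]) for a, b in edges)


def reverse_cuthill_mckee(num_points, edges):
    """Return number[p], the new index of point p, from a simple RCM ordering."""
    neighbors = [[] for _ in range(num_points)]
    for a, b in edges:
        neighbors[a].append(b)
        neighbors[b].append(a)
    degree = [len(n) for n in neighbors]
    visited = [False] * num_points
    order = []
    # Start every connected component from one of its lowest-degree points.
    for start in sorted(range(num_points), key=lambda p: degree[p]):
        if visited[start]:
            continue
        visited[start] = True
        queue = deque([start])
        while queue:
            p = queue.popleft()
            order.append(p)
            # Breadth-first search, visiting low-degree neighbors first.
            for q in sorted(neighbors[p], key=lambda q: degree[q]):
                if not visited[q]:
                    visited[q] = True
                    queue.append(q)
    order.reverse()  # the "reverse" in Reverse Cuthill-McKee
    number = [0] * num_points
    for new_index, p in enumerate(order):
        number[p] = new_index
    return number


if __name__ == "__main__":
    # A chain of six points whose labels were scrambled.
    edges = [(3, 0), (0, 5), (5, 1), (1, 4), (4, 2)]
    print(bandwidth(edges, list(range(6))))
    print(bandwidth(edges, reverse_cuthill_mckee(6, edges)))
```

On the scrambled chain, the renumbering brings every pair of connected points next to each other in memory, which is the cache-oriented goal described above.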

10 Coalescing
- Memory transactions are serviced on a per-segment basis
- Data stored at {L,R} points are accessed simultaneously
- Adjacency in physical space does NOT imply adjacency in memory space
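A rough way to reason about coalescing is to count how many memory segments a group of simultaneous accesses touches. The sketch below does that for one group of 32 accesses; the 128-byte segment size and the 4-byte value size are assumptions chosen for illustration, and the function name is hypothetical.

```python
def segments_touched(indices, bytes_per_value=4, segment_bytes=128):
    """Count the distinct memory segments hit when each thread reads values[index]."""
    return len({(index * bytes_per_value) // segment_bytes for index in indices})


if __name__ == "__main__":
    # 32 threads, each gathering the left point of its own edge.
    unit_stride = range(32)            # consecutive point indices
    stride_two = range(0, 64, 2)       # every other point
    scattered = range(0, 32 * 40, 40)  # points that are far apart in memory
    for name, indices in [("unit stride", unit_stride),
                          ("stride two", stride_two),
                          ("scattered", scattered)]:
        print(name, segments_touched(indices))
```

Under these assumptions, unit-stride indices fall into a single segment, while a numbering that merely keeps neighbors "nearby" can still spread one group of accesses over many segments.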

11 Unstructured Grid Numbering
1. Detect lines
2. Alternate numbering between points on each line
3. Color edges, first along, then between lines
4. Sort edges by their color and point index tuple

12 Detect Lines
- An external direction field is imposed
  - User input parameter to algorithm
  - A coordinate direction often works very well
- Detect lines along the edges most aligned with this direction
- Lines may merge or branch
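One simple way to follow the edges most aligned with an imposed direction is a greedy pass over the points. The sketch below is an illustrative guess at such a procedure, not the authors' algorithm: it uses a single constant direction, a hypothetical `min_alignment` threshold, and it avoids merging lines by letting each point have at most one predecessor. Points are assumed to have distinct coordinates.

```python
import math


def detect_lines(coords, edges, direction, min_alignment=0.7):
    """Chain points into lines by following the edge best aligned with `direction`."""
    norm = math.hypot(*direction)
    d = [c / norm for c in direction]
    neighbors = {p: [] for p in range(len(coords))}
    for a, b in edges:
        neighbors[a].append(b)
        neighbors[b].append(a)

    def alignment(p, q):
        # Cosine of the angle between the edge p -> q and the imposed direction.
        delta = [xq - xp for xp, xq in zip(coords[p], coords[q])]
        return sum(di * ei for di, ei in zip(d, delta)) / math.hypot(*delta)

    successor, has_predecessor = {}, set()
    for p in range(len(coords)):
        # Only link to points that do not already continue another line.
        candidates = [q for q in neighbors[p] if q not in has_predecessor]
        if not candidates:
            continue
        best = max(candidates, key=lambda q: alignment(p, q))
        if alignment(p, best) >= min_alignment:
            successor[p] = best
            has_predecessor.add(best)

    lines = []
    for start in range(len(coords)):
        if start in has_predecessor:
            continue
        line = [start]
        while line[-1] in successor:
            line.append(successor[line[-1]])
        lines.append(line)
    return lines


if __name__ == "__main__":
    # A 3 x 2 block of points split into triangles, with the x axis as direction.
    coords = [(0, 0), (1, 0), (2, 0), (0, 1), (1, 1), (2, 1)]
    edges = [(0, 1), (1, 2), (3, 4), (4, 5), (0, 3), (1, 4), (2, 5), (0, 4), (1, 5)]
    print(detect_lines(coords, edges, direction=(1, 0)))
```

Isolated points come back as lines of length one; a production version would need a policy for the merging and branching lines mentioned on the slide.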

13 Detect Lines
[Figure panels: 2D triangular grid; detected lines]

14 Alternate Numbering Along Lines
- The numbering is alternated along each line
- Good coalescence for intra-line edges
- Requires lines of sufficient length
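One plausible reading of "alternated" is that points at even positions along a line receive one consecutive block of numbers and points at odd positions receive the next block. The sketch below implements that reading only; it is an assumption, not a confirmed description of the authors' scheme, and the function name is hypothetical.

```python
def alternate_numbering(lines, num_points):
    """Number each line so that even-position points come first, then odd-position points."""
    number = [None] * num_points  # points on no line keep None in this sketch
    next_index = 0
    for line in lines:
        for p in line[0::2] + line[1::2]:
            number[p] = next_index
            next_index += 1
    return number


if __name__ == "__main__":
    line = [3, 0, 5, 1, 4, 2]  # point labels in the order they appear along one line
    number = alternate_numbering([line], 6)
    pairs = list(zip(line, line[1:]))
    # Edges along the line split into two conflict-free groups (two colors).
    for group in (pairs[0::2], pairs[1::2]):
        print([(number[a], number[b]) for a, b in group])
```

Under this reading, each of the two groups of intra-line edges sees consecutive numbers at its first end points and consecutive numbers at its second end points. A line must then be long enough to fill a whole group of simultaneous accesses, which is consistent with the length requirement stated above.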

15 Color Edges Along Lines
- The first edge colors are constrained to edges along lines
- The edge groups within these edge colors will typically exhibit good coalescence


17 Color Edges Between Lines
- Edges between lines are colored between pairs of lines
  - Avoids large jumps between lines as much as possible
- Coalescence will be optimal when these edges are oriented consistently
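The two-stage coloring can be approximated with a greedy pass that colors edges along lines first and reserves later colors for edges between lines. This is a generic greedy edge coloring used for illustration; it does not reproduce the pairing of lines or the orientation handling described on the slide, and the function name is hypothetical.

```python
def color_edges(line_edges, cross_edges):
    """Greedy edge coloring: colors used along lines come before colors used between lines."""
    colors = {}
    used_at_point = {}

    def assign(edge, first_color):
        a, b = edge
        taken = used_at_point.setdefault(a, set()) | used_at_point.setdefault(b, set())
        color = first_color
        while color in taken:  # smallest allowed color not yet used at either end point
            color += 1
        colors[edge] = color
        used_at_point[a].add(color)
        used_at_point[b].add(color)
        return color

    highest = -1
    for edge in line_edges:
        highest = max(highest, assign(edge, 0))
    for edge in cross_edges:
        # Edges between lines start after the last color used along lines.
        assign(edge, highest + 1)
    return colors


if __name__ == "__main__":
    line_edges = [(0, 1), (1, 2), (3, 4), (4, 5)]            # two lines of three points
    cross_edges = [(0, 3), (1, 4), (2, 5), (0, 4), (1, 5)]   # edges joining the two lines
    for edge, color in color_edges(line_edges, cross_edges).items():
        print(edge, color)
```

Keeping the first colors for intra-line edges means the groups that benefit most from the line numbering are not diluted by edges that jump between lines.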


21 Number Edges
- Sort edges within each color group according to the point numbering
- Consecutive points across edges are numbered with a unit stride
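Sorting by a (color, point index tuple) key is a one-liner once colors and point numbers exist. In the sketch below, storing each edge with its smaller point number first is an assumption made to get a consistent orientation; the function name and the sample data are hypothetical.

```python
def sort_edges(edges, colors, number):
    """Return (color, (i, j)) records sorted by color, then by renumbered end points."""
    records = [
        (colors[k], tuple(sorted((number[a], number[b]))))
        for k, (a, b) in enumerate(edges)
    ]
    records.sort()
    return records


if __name__ == "__main__":
    number = [3, 4, 5, 0, 2, 1]                       # new index of each point
    edges = [(3, 0), (0, 5), (5, 1), (1, 4), (4, 2)]  # edges along one line
    colors = [0, 1, 0, 1, 0]
    for record in sort_edges(edges, colors, number):
        print(record)
```

After sorting, neighboring edges within a color group reference neighboring point numbers, so consecutive threads read consecutive memory locations.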

22 Benchmarks
JENRE high-speed compressible flow solver
- Fully-coupled, finite element spatial discretization
- Flux-corrected transport limiting
- Explicit Taylor-Galerkin time-stepping
Single GeForce GTX 580
Computational throughput measured
- To compute one time step (averaged over many)
- Tetrahedral cells / second
- Includes non-edge loops
FEFLO-GEN3D advancing front grid generator
- With Cartesian core option

23 Benchmark: Shock Tube
- 2.5M tetrahedral cells, based on a fully structured grid
- Bandwidth minimization numbering: 27M tetrahedral cells / second
- Line-based numbering: 41M tetrahedral cells / second

24 Benchmark: Shock Tube
- 1.9M tetrahedral cells, advancing front + Cartesian core grid generation
- Bandwidth minimization numbering: 24M tetrahedral cells / second
- Line-based numbering: 35M tetrahedral cells / second

25 Benchmark: NACA M Tetrahedral cells
- Bandwidth minimization numbering: 18M tetrahedral cells / second
- Line-based numbering: 21M tetrahedral cells / second

26 Benchmark: Nozzle
- 5.8M tetrahedral cells
- Bandwidth minimization numbering: 19M tetrahedral cells / second
- Line-based numbering: 28M tetrahedral cells / second
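The relative gain of line-based numbering over bandwidth minimization follows directly from the throughput figures on the four benchmark slides. The short script below only restates that arithmetic; the case labels are shorthand chosen here.

```python
# (bandwidth minimization, line-based) throughput, in millions of tetrahedral cells per second
throughput = {
    "shock tube, based on structured grid": (27, 41),
    "shock tube, advancing front + Cartesian core": (24, 35),
    "NACA": (18, 21),
    "nozzle": (19, 28),
}

# Ratio of line-based throughput to the bandwidth-minimization baseline.
speedup = {
    case: line_based / baseline
    for case, (baseline, line_based) in throughput.items()
}

if __name__ == "__main__":
    for case, ratio in speedup.items():
        print(f"{case}: {ratio:.2f}x")
```

The ratios range from roughly 1.2x for the NACA case to roughly 1.5x for the shock tube on the grid derived from a structured grid.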

27 Conclusions
A significant performance improvement is possible if unstructured grid numbering schemes are tailored for GPU coalescing requirements.
Bandwidth minimization numbering
- Is not directly relevant to achieving coalesced memory access
- Does not achieve optimal coalescence in simple cases
- Does provide reliable baseline computational performance
Line-based numbering
- Directly relevant to achieving coalesced memory access
- Near-optimal coalescence in simple cases
- Dependent on presence of long parallel lines in grid connectivity
  - Should not be an issue for highly-refined LES grids

A Hybrid Grid Compressible Flow Solver for Large-Scale Supersonic Jet Noise Simulations on Multi-GPU Clusters

A Hybrid Grid Compressible Flow Solver for Large-Scale Supersonic Jet Noise Simulations on Multi-GPU Clusters 50th AIAA Aerospace Sciences Meeting including the New Horizons Forum and Aerospace Exposition 09-12 January 2012, Nashville, Tennessee AIAA 2012-0564 A Hybrid Grid Compressible Flow Solver for Large-Scale

More information

Unstructured Finite Volume Code on a Cluster with Mul6ple GPUs per Node

Unstructured Finite Volume Code on a Cluster with Mul6ple GPUs per Node Unstructured Finite Volume Code on a Cluster with Mul6ple GPUs per Node Keith Obenschain & Andrew Corrigan Laboratory for Computa;onal Physics and Fluid Dynamics Naval Research Laboratory Washington DC,

More information

A Scalable GPU-Based Compressible Fluid Flow Solver for Unstructured Grids

A Scalable GPU-Based Compressible Fluid Flow Solver for Unstructured Grids A Scalable GPU-Based Compressible Fluid Flow Solver for Unstructured Grids Patrice Castonguay and Antony Jameson Aerospace Computing Lab, Stanford University GTC Asia, Beijing, China December 15 th, 2011

More information

Developing the TELEMAC system for HECToR (phase 2b & beyond) Zhi Shang

Developing the TELEMAC system for HECToR (phase 2b & beyond) Zhi Shang Developing the TELEMAC system for HECToR (phase 2b & beyond) Zhi Shang Outline of the Talk Introduction to the TELEMAC System and to TELEMAC-2D Code Developments Data Reordering Strategy Results Conclusions

More information

Radial Basis Function-Generated Finite Differences (RBF-FD): New Opportunities for Applications in Scientific Computing

Radial Basis Function-Generated Finite Differences (RBF-FD): New Opportunities for Applications in Scientific Computing Radial Basis Function-Generated Finite Differences (RBF-FD): New Opportunities for Applications in Scientific Computing Natasha Flyer National Center for Atmospheric Research Boulder, CO Meshes vs. Mesh-free

More information

A DRAG PREDICTION VALIDATION STUDY FOR AIRCRAFT AERODYNAMIC ANALYSIS

A DRAG PREDICTION VALIDATION STUDY FOR AIRCRAFT AERODYNAMIC ANALYSIS A DRAG PREDICTION VALIDATION STUDY FOR AIRCRAFT AERODYNAMIC ANALYSIS Akio OCHI, Eiji SHIMA Kawasaki Heavy Industries, ltd Keywords: CFD, Drag prediction, Validation Abstract A CFD drag prediction validation

More information

Introduction to C omputational F luid Dynamics. D. Murrin

Introduction to C omputational F luid Dynamics. D. Murrin Introduction to C omputational F luid Dynamics D. Murrin Computational fluid dynamics (CFD) is the science of predicting fluid flow, heat transfer, mass transfer, chemical reactions, and related phenomena

More information

Parallel Mesh Multiplication for Code_Saturne

Parallel Mesh Multiplication for Code_Saturne Parallel Mesh Multiplication for Code_Saturne Pavla Kabelikova, Ales Ronovsky, Vit Vondrak a Dept. of Applied Mathematics, VSB-Technical University of Ostrava, Tr. 17. listopadu 15, 708 00 Ostrava, Czech

More information

Meshing of flow and heat transfer problems

Meshing of flow and heat transfer problems Meshing of flow and heat transfer problems Luyao Zou a, Zhe Li b, Qiqi Fu c and Lujie Sun d School of, Shandong University of science and technology, Shandong 266590, China. a zouluyaoxf@163.com, b 1214164853@qq.com,

More information

Asynchronous OpenCL/MPI numerical simulations of conservation laws

Asynchronous OpenCL/MPI numerical simulations of conservation laws Asynchronous OpenCL/MPI numerical simulations of conservation laws Philippe HELLUY 1,3, Thomas STRUB 2. 1 IRMA, Université de Strasbourg, 2 AxesSim, 3 Inria Tonus, France IWOCL 2015, Stanford Conservation

More information

Large-scale Gas Turbine Simulations on GPU clusters

Large-scale Gas Turbine Simulations on GPU clusters Large-scale Gas Turbine Simulations on GPU clusters Tobias Brandvik and Graham Pullan Whittle Laboratory University of Cambridge A large-scale simulation Overview PART I: Turbomachinery PART II: Stencil-based

More information

Techniques for Using the Method of Manufactured Solutions for Verification and Uncertainty Quantification of CFD Simulations Having Discontinuities

Techniques for Using the Method of Manufactured Solutions for Verification and Uncertainty Quantification of CFD Simulations Having Discontinuities Techniques for Using the Method of Manufactured Solutions for Verification and Uncertainty Quantification of CFD Simulations Having Discontinuities Ben Grier Clemson University Richard Figliola, Larry

More information

Multigrid Solvers in CFD. David Emerson. Scientific Computing Department STFC Daresbury Laboratory Daresbury, Warrington, WA4 4AD, UK

Multigrid Solvers in CFD. David Emerson. Scientific Computing Department STFC Daresbury Laboratory Daresbury, Warrington, WA4 4AD, UK Multigrid Solvers in CFD David Emerson Scientific Computing Department STFC Daresbury Laboratory Daresbury, Warrington, WA4 4AD, UK david.emerson@stfc.ac.uk 1 Outline Multigrid: general comments Incompressible

More information

Modeling External Compressible Flow

Modeling External Compressible Flow Tutorial 3. Modeling External Compressible Flow Introduction The purpose of this tutorial is to compute the turbulent flow past a transonic airfoil at a nonzero angle of attack. You will use the Spalart-Allmaras

More information

Structured Grid Generation for Turbo Machinery Applications using Topology Templates

Structured Grid Generation for Turbo Machinery Applications using Topology Templates Structured Grid Generation for Turbo Machinery Applications using Topology Templates January 13th 2011 Martin Spel martin.spel@rtech.fr page 1 Agenda: R.Tech activities Grid Generation Techniques Structured

More information

NIA CFD Seminar, October 4, 2011 Hyperbolic Seminar, NASA Langley, October 17, 2011

NIA CFD Seminar, October 4, 2011 Hyperbolic Seminar, NASA Langley, October 17, 2011 NIA CFD Seminar, October 4, 2011 Hyperbolic Seminar, NASA Langley, October 17, 2011 First-Order Hyperbolic System Method If you have a CFD book for hyperbolic problems, you have a CFD book for all problems.

More information

HARNESSING IRREGULAR PARALLELISM: A CASE STUDY ON UNSTRUCTURED MESHES. Cliff Woolley, NVIDIA

HARNESSING IRREGULAR PARALLELISM: A CASE STUDY ON UNSTRUCTURED MESHES. Cliff Woolley, NVIDIA HARNESSING IRREGULAR PARALLELISM: A CASE STUDY ON UNSTRUCTURED MESHES Cliff Woolley, NVIDIA PREFACE This talk presents a case study of extracting parallelism in the UMT2013 benchmark for 3D unstructured-mesh

More information

1.2 Numerical Solutions of Flow Problems

1.2 Numerical Solutions of Flow Problems 1.2 Numerical Solutions of Flow Problems DIFFERENTIAL EQUATIONS OF MOTION FOR A SIMPLIFIED FLOW PROBLEM Continuity equation for incompressible flow: 0 Momentum (Navier-Stokes) equations for a Newtonian

More information

Optimization with Gradient and Hessian Information Calculated Using Hyper-Dual Numbers

Optimization with Gradient and Hessian Information Calculated Using Hyper-Dual Numbers Optimization with Gradient and Hessian Information Calculated Using Hyper-Dual Numbers Jeffrey A. Fike and Juan J. Alonso Department of Aeronautics and Astronautics, Stanford University, Stanford, CA 94305,

More information

Real Application Performance and Beyond

Real Application Performance and Beyond Real Application Performance and Beyond Mellanox Technologies Inc. 2900 Stender Way, Santa Clara, CA 95054 Tel: 408-970-3400 Fax: 408-970-3403 http://www.mellanox.com Scientists, engineers and analysts

More information

Computational Fluid Dynamics for Engineers

Computational Fluid Dynamics for Engineers Tuncer Cebeci Jian P. Shao Fassi Kafyeke Eric Laurendeau Computational Fluid Dynamics for Engineers From Panel to Navier-Stokes Methods with Computer Programs With 152 Figures, 19 Tables, 84 Problems and

More information

NUMERICAL SIMULATIONS OF FLOW THROUGH AN S-DUCT

NUMERICAL SIMULATIONS OF FLOW THROUGH AN S-DUCT NUMERICAL SIMULATIONS OF FLOW THROUGH AN S-DUCT 1 Pravin Peddiraju, 1 Arthur Papadopoulos, 2 Vangelis Skaperdas, 3 Linda Hedges 1 BETA CAE Systems USA, Inc., USA, 2 BETA CAE Systems SA, Greece, 3 CFD Consultant,

More information

Towards a complete FEM-based simulation toolkit on GPUs: Geometric Multigrid solvers

Towards a complete FEM-based simulation toolkit on GPUs: Geometric Multigrid solvers Towards a complete FEM-based simulation toolkit on GPUs: Geometric Multigrid solvers Markus Geveler, Dirk Ribbrock, Dominik Göddeke, Peter Zajac, Stefan Turek Institut für Angewandte Mathematik TU Dortmund,

More information

3D ADI Method for Fluid Simulation on Multiple GPUs. Nikolai Sakharnykh, NVIDIA Nikolay Markovskiy, NVIDIA

3D ADI Method for Fluid Simulation on Multiple GPUs. Nikolai Sakharnykh, NVIDIA Nikolay Markovskiy, NVIDIA 3D ADI Method for Fluid Simulation on Multiple GPUs Nikolai Sakharnykh, NVIDIA Nikolay Markovskiy, NVIDIA Introduction Fluid simulation using direct numerical methods Gives the most accurate result Requires

More information

Simulation of the Dynamics of Micro Air Vehicles. Ravi Ramamurti and William Sandberg

Simulation of the Dynamics of Micro Air Vehicles. Ravi Ramamurti and William Sandberg AIAA 2000-0896 Simulation of the Dynamics of Micro Air Vehicles Ravi Ramamurti and William Sandberg Laboratory for Computational Physics and Fluid Dynamics Naval Research Laboratory Washington, D.C. 20375

More information

Numerical Methods for PDEs. SSC Workgroup Meetings Juan J. Alonso October 8, SSC Working Group Meetings, JJA 1

Numerical Methods for PDEs. SSC Workgroup Meetings Juan J. Alonso October 8, SSC Working Group Meetings, JJA 1 Numerical Methods for PDEs SSC Workgroup Meetings Juan J. Alonso October 8, 2001 SSC Working Group Meetings, JJA 1 Overview These notes are meant to be an overview of the various memory access patterns

More information

FAST ALGORITHMS FOR CALCULATIONS OF VISCOUS INCOMPRESSIBLE FLOWS USING THE ARTIFICIAL COMPRESSIBILITY METHOD

FAST ALGORITHMS FOR CALCULATIONS OF VISCOUS INCOMPRESSIBLE FLOWS USING THE ARTIFICIAL COMPRESSIBILITY METHOD TASK QUARTERLY 12 No 3, 273 287 FAST ALGORITHMS FOR CALCULATIONS OF VISCOUS INCOMPRESSIBLE FLOWS USING THE ARTIFICIAL COMPRESSIBILITY METHOD ZBIGNIEW KOSMA Institute of Applied Mechanics, Technical University

More information

Verification and Validation in CFD and Heat Transfer: ANSYS Practice and the New ASME Standard

Verification and Validation in CFD and Heat Transfer: ANSYS Practice and the New ASME Standard Verification and Validation in CFD and Heat Transfer: ANSYS Practice and the New ASME Standard Dimitri P. Tselepidakis & Lewis Collins ASME 2012 Verification and Validation Symposium May 3 rd, 2012 1 Outline

More information

Efficient Finite Element Geometric Multigrid Solvers for Unstructured Grids on GPUs

Efficient Finite Element Geometric Multigrid Solvers for Unstructured Grids on GPUs Efficient Finite Element Geometric Multigrid Solvers for Unstructured Grids on GPUs Markus Geveler, Dirk Ribbrock, Dominik Göddeke, Peter Zajac, Stefan Turek Institut für Angewandte Mathematik TU Dortmund,

More information

Recent applications of overset mesh technology in SC/Tetra

Recent applications of overset mesh technology in SC/Tetra Recent applications of overset mesh technology in SC/Tetra NIA CFD Seminar October 6, 2014 Tomohiro Irie Software Cradle Co., Ltd. 1 Contents Introduction Software Cradle SC/Tetra Background of Demands

More information

Multi-Mesh CFD. Chris Roy Chip Jackson (1 st year PhD student) Aerospace and Ocean Engineering Department Virginia Tech

Multi-Mesh CFD. Chris Roy Chip Jackson (1 st year PhD student) Aerospace and Ocean Engineering Department Virginia Tech Multi-Mesh CFD Chris Roy Chip Jackson (1 st year PhD student) Aerospace and Ocean Engineering Department Virginia Tech cjroy@vt.edu May 21, 2014 CCAS Program Review, Columbus, OH 1 Motivation Automated

More information

SENSEI / SENSEI-Lite / SENEI-LDC Updates

SENSEI / SENSEI-Lite / SENEI-LDC Updates SENSEI / SENSEI-Lite / SENEI-LDC Updates Chris Roy and Brent Pickering Aerospace and Ocean Engineering Dept. Virginia Tech July 23, 2014 Collaborations with Math Collaboration on the implicit SENSEI-LDC

More information

Studies of the Continuous and Discrete Adjoint Approaches to Viscous Automatic Aerodynamic Shape Optimization

Studies of the Continuous and Discrete Adjoint Approaches to Viscous Automatic Aerodynamic Shape Optimization Studies of the Continuous and Discrete Adjoint Approaches to Viscous Automatic Aerodynamic Shape Optimization Siva Nadarajah Antony Jameson Stanford University 15th AIAA Computational Fluid Dynamics Conference

More information

HPC Computer Aided CINECA

HPC Computer Aided CINECA HPC Computer Aided Engineering @ CINECA Raffaele Ponzini Ph.D. CINECA SuperComputing Applications and Innovation Department SCAI 16-18 June 2014 Segrate (MI), Italy Outline Open-source CAD and Meshing

More information

A dynamic load-balancing strategy for large scale CFD-applications

A dynamic load-balancing strategy for large scale CFD-applications A dynamic load-balancing strategy for large scale CFD-applications Philipp Offenhäuser 10.10.2017 1/20 :: A dynamic load-balancing strategy for large scale CFD-applications :: 10.10.2017 :: Outline Motivation

More information

Analysis, extensions and applications of the Finite-Volume Particle Method (FVPM) PN-II-RU-TE Synthesis of the technical report -

Analysis, extensions and applications of the Finite-Volume Particle Method (FVPM) PN-II-RU-TE Synthesis of the technical report - Analysis, extensions and applications of the Finite-Volume Particle Method (FVPM) PN-II-RU-TE-2011-3-0256 - Synthesis of the technical report - Phase 1: Preparation phase Authors: Delia Teleaga, Eliza

More information

Memory Hierarchy Management for Iterative Graph Structures

Memory Hierarchy Management for Iterative Graph Structures Memory Hierarchy Management for Iterative Graph Structures Ibraheem Al-Furaih y Syracuse University Sanjay Ranka University of Florida Abstract The increasing gap in processor and memory speeds has forced

More information

Development of a Maxwell Equation Solver for Application to Two Fluid Plasma Models. C. Aberle, A. Hakim, and U. Shumlak

Development of a Maxwell Equation Solver for Application to Two Fluid Plasma Models. C. Aberle, A. Hakim, and U. Shumlak Development of a Maxwell Equation Solver for Application to Two Fluid Plasma Models C. Aberle, A. Hakim, and U. Shumlak Aerospace and Astronautics University of Washington, Seattle American Physical Society

More information

Aerodynamic Design Optimization of UAV Rotor Blades using a Genetic Algorithm

Aerodynamic Design Optimization of UAV Rotor Blades using a Genetic Algorithm Aerodynamic Design Optimization of UAV Rotor Blades using a Genetic Algorithm Hak-Min Lee 1), Nahm-Keon Hur 2) and *Oh-Joon Kwon 3) 1), 3) Department of Aerospace Engineering, KAIST, Daejeon 305-600, Korea

More information

Flux Vector Splitting Methods for the Euler Equations on 3D Unstructured Meshes for CPU/GPU Clusters

Flux Vector Splitting Methods for the Euler Equations on 3D Unstructured Meshes for CPU/GPU Clusters Flux Vector Splitting Methods for the Euler Equations on 3D Unstructured Meshes for CPU/GPU Clusters Manfred Liebmann Technische Universität München Chair of Optimal Control Center for Mathematical Sciences,

More information

Performance Benefits of NVIDIA GPUs for LS-DYNA

Performance Benefits of NVIDIA GPUs for LS-DYNA Performance Benefits of NVIDIA GPUs for LS-DYNA Mr. Stan Posey and Dr. Srinivas Kodiyalam NVIDIA Corporation, Santa Clara, CA, USA Summary: This work examines the performance characteristics of LS-DYNA

More information

Preliminary Spray Cooling Simulations Using a Full-Cone Water Spray

Preliminary Spray Cooling Simulations Using a Full-Cone Water Spray 39th Dayton-Cincinnati Aerospace Sciences Symposium Preliminary Spray Cooling Simulations Using a Full-Cone Water Spray Murat Dinc Prof. Donald D. Gray (advisor), Prof. John M. Kuhlman, Nicholas L. Hillen,

More information

The Finite Element Method

The Finite Element Method The Finite Element Method A Practical Course G. R. Liu and S. S. Quek Chapter 1: Computational modeling An overview 1 CONTENTS INTRODUCTION PHYSICAL PROBLEMS IN ENGINEERING COMPUTATIONAL MODELLING USING

More information

CPU/GPU COMPUTING FOR AN IMPLICIT MULTI-BLOCK COMPRESSIBLE NAVIER-STOKES SOLVER ON HETEROGENEOUS PLATFORM

CPU/GPU COMPUTING FOR AN IMPLICIT MULTI-BLOCK COMPRESSIBLE NAVIER-STOKES SOLVER ON HETEROGENEOUS PLATFORM Sixth International Symposium on Physics of Fluids (ISPF6) International Journal of Modern Physics: Conference Series Vol. 42 (2016) 1660163 (14 pages) The Author(s) DOI: 10.1142/S2010194516601630 CPU/GPU

More information

APPLIED COMPUTATIONAL FLUID DYNAMICS TECHNIQUES

APPLIED COMPUTATIONAL FLUID DYNAMICS TECHNIQUES APPLIED COMPUTATIONAL FLUID DYNAMICS TECHNIQUES AN INTRODUCTION BASED ON FINITE ELEMENT METHODS Second Edition Rainald Löhner Center for Computational Fluid Dynamics, Department of Computational and Data

More information

Numerical and theoretical analysis of shock waves interaction and reflection

Numerical and theoretical analysis of shock waves interaction and reflection Fluid Structure Interaction and Moving Boundary Problems IV 299 Numerical and theoretical analysis of shock waves interaction and reflection K. Alhussan Space Research Institute, King Abdulaziz City for

More information

Generic Refinement and Block Partitioning enabling efficient GPU CFD on Unstructured Grids

Generic Refinement and Block Partitioning enabling efficient GPU CFD on Unstructured Grids Generic Refinement and Block Partitioning enabling efficient GPU CFD on Unstructured Grids Matthieu Lefebvre 1, Jean-Marie Le Gouez 2 1 PhD at Onera, now post-doc at Princeton, department of Geosciences,

More information

An efficient method for predicting zero-lift or boundary-layer drag including aeroelastic effects for the design environment

An efficient method for predicting zero-lift or boundary-layer drag including aeroelastic effects for the design environment The Aeronautical Journal November 2015 Volume 119 No 1221 1451 An efficient method for predicting zero-lift or boundary-layer drag including aeroelastic effects for the design environment J. A. Camberos

More information

Turbostream: A CFD solver for manycore

Turbostream: A CFD solver for manycore Turbostream: A CFD solver for manycore processors Tobias Brandvik Whittle Laboratory University of Cambridge Aim To produce an order of magnitude reduction in the run-time of CFD solvers for the same hardware

More information

Virtual EM Inc. Ann Arbor, Michigan, USA

Virtual EM Inc. Ann Arbor, Michigan, USA Functional Description of the Architecture of a Special Purpose Processor for Orders of Magnitude Reduction in Run Time in Computational Electromagnetics Tayfun Özdemir Virtual EM Inc. Ann Arbor, Michigan,

More information

Predictive Engineering and Computational Sciences. Data Structures and Methods for Unstructured Distributed Meshes. Roy H. Stogner

Predictive Engineering and Computational Sciences. Data Structures and Methods for Unstructured Distributed Meshes. Roy H. Stogner PECOS Predictive Engineering and Computational Sciences Data Structures and Methods for Unstructured Distributed Meshes Roy H. Stogner The University of Texas at Austin May 23, 2012 Roy H. Stogner Distributed

More information

Speculative Atomics Case-study of the GPU Optimization of the Material Point Method for Graphics

Speculative Atomics Case-study of the GPU Optimization of the Material Point Method for Graphics Speculative Atomics Case-study of the GPU Optimization of the Material Point Method for Graphics Gergely Klar UCLA Computer Graphics & Vision Laboratory Motivation A. Stomakhin, C. Schroeder, L. Chai,

More information

computational Fluid Dynamics - Prof. V. Esfahanian

computational Fluid Dynamics - Prof. V. Esfahanian Three boards categories: Experimental Theoretical Computational Crucial to know all three: Each has their advantages and disadvantages. Require validation and verification. School of Mechanical Engineering

More information

EXPLICIT AND IMPLICIT TVD AND ENO HIGH RESOLUTION ALGORITHMS APPLIED TO THE EULER AND NAVIER-STOKES EQUATIONS IN THREE-DIMENSIONS RESULTS

EXPLICIT AND IMPLICIT TVD AND ENO HIGH RESOLUTION ALGORITHMS APPLIED TO THE EULER AND NAVIER-STOKES EQUATIONS IN THREE-DIMENSIONS RESULTS EXPLICIT AND IMPLICIT TVD AND ENO HIGH RESOLUTION ALGORITHMS APPLIED TO THE EULER AND NAVIER-STOKES EQUATIONS IN THREE-DIMENSIONS RESULTS Edisson Sávio de Góes Maciel, edissonsavio@yahoo.com.br Mechanical

More information

GPU PROGRESS AND DIRECTIONS IN APPLIED CFD

GPU PROGRESS AND DIRECTIONS IN APPLIED CFD Eleventh International Conference on CFD in the Minerals and Process Industries CSIRO, Melbourne, Australia 7-9 December 2015 GPU PROGRESS AND DIRECTIONS IN APPLIED CFD Stan POSEY 1*, Simon SEE 2, and

More information

Application of GPU-Based Computing to Large Scale Finite Element Analysis of Three-Dimensional Structures

Application of GPU-Based Computing to Large Scale Finite Element Analysis of Three-Dimensional Structures Paper 6 Civil-Comp Press, 2012 Proceedings of the Eighth International Conference on Engineering Computational Technology, B.H.V. Topping, (Editor), Civil-Comp Press, Stirlingshire, Scotland Application

More information

Three dimensional meshless point generation technique for complex geometry

Three dimensional meshless point generation technique for complex geometry Three dimensional meshless point generation technique for complex geometry *Jae-Sang Rhee 1), Jinyoung Huh 2), Kyu Hong Kim 3), Suk Young Jung 4) 1),2) Department of Mechanical & Aerospace Engineering,

More information

Algorithmic Design for Exaflop Machines

Algorithmic Design for Exaflop Machines Algorithmic Design for Exaflop Machines Rainald Löhner Center for Computational Fluid Dynamics College of Science, cfd.gmu.edu/~rlohner Outline Problems Targeted Foreseeable Machine Architecture(s) Current

More information

Driven Cavity Example

Driven Cavity Example BMAppendixI.qxd 11/14/12 6:55 PM Page I-1 I CFD Driven Cavity Example I.1 Problem One of the classic benchmarks in CFD is the driven cavity problem. Consider steady, incompressible, viscous flow in a square

More information

Optimization of HOM Couplers using Time Domain Schemes

Optimization of HOM Couplers using Time Domain Schemes Optimization of HOM Couplers using Time Domain Schemes Workshop on HOM Damping in Superconducting RF Cavities Carsten Potratz Universität Rostock October 11, 2010 10/11/2010 2009 UNIVERSITÄT ROSTOCK FAKULTÄT

More information

GEOMETRY MODELING & GRID GENERATION

GEOMETRY MODELING & GRID GENERATION GEOMETRY MODELING & GRID GENERATION Dr.D.Prakash Senior Assistant Professor School of Mechanical Engineering SASTRA University, Thanjavur OBJECTIVE The objectives of this discussion are to relate experiences

More information

Adaptive Mesh Astrophysical Fluid Simulations on GPU. San Jose 10/2/2009 Peng Wang, NVIDIA

Adaptive Mesh Astrophysical Fluid Simulations on GPU. San Jose 10/2/2009 Peng Wang, NVIDIA Adaptive Mesh Astrophysical Fluid Simulations on GPU San Jose 10/2/2009 Peng Wang, NVIDIA Overview Astrophysical motivation & the Enzo code Finite volume method and adaptive mesh refinement (AMR) CUDA

More information
