Case Study - Computational Fluid Dynamics (CFD) using Graphics Processing Units
|
|
- Hester Cross
- 6 years ago
- Views:
Transcription
1 - Computational Fluid Dynamics (CFD) using Graphics Processing Units Aaron F. Shinn Mechanical Science and Engineering Dept., UIUC Summer School 2009: Many-Core Processors for Science and Engineering Applications, A.F. Shinn CFD using GPUs 1 / 30
2 What is CFD? Introduction Computational Fluid Dynamics: solve governing equations of fluid motion numerically - Conservation of Mass (Continuity Equation) - Conservation of Momentum (Newton s 2nd Law) - Conservation of Energy (1st Law of Thermodynamics) Coupled set of nonlinear Partial Differential Equations (PDEs) Solution time can be very long makes GPUs very attractive A.F. Shinn CFD using GPUs 2 / 30
3 General Governing Equations Conservation of Mass Conservation of Momentum Conservation of Energy ρc p DT Dt ρ t + ρu = 0 ρ Du Dt = p + τ = βt Dp Dt + (k T ) + Φ viscous stress tensor: τ = µ ( ui x j + u j x i ) + δ ij λ( u) substantial derivative: D( ) Dt = ( ) t + u ( ) A.F. Shinn CFD using GPUs 3 / 30
4 of Illustrate CFD implementation issues with real research example CU-FLOW: general-purpose Cartesian-based 3D Navier-Stokes solver written in C/CUDA for GPUs First implementation of fractional-step/multigrid Navier-Stokes solver for Large-Eddy Simulations (LES) of turbulence on GPUs Many different variations of this code were created Countless hours spent on algorithm design, optimizations, and debugging! A.F. Shinn CFD using GPUs 4 / 30
5 Governing Equations for this Study 3D Incompressible Navier-Stokes equations Conservation of Mass Conservation of Momentum u = 0 u t + u u = 1 ρ p + ν 2 u A.F. Shinn CFD using GPUs 5 / 30
6 Numerical Methodology Discretized via Finite-Volume Method on a staggered Cartesian mesh. Smagorinsky SGS model used for turbulence modeling. Solved equations with fractional-step procedure. - Pressure-Poisson equation (PPE) solved using red-black Gauss-Seidel. - Geometric multigrid used for convergence acceleration of PPE solution. - Temporal advancement: explicit 2nd-order Adams-Bashforth scheme. - Spatial derivatives: 2nd-order central differencing. A.F. Shinn CFD using GPUs 6 / 30
7 Geometric Multigrid: V-cycle Figure: Multigrid V-cycle, where S=smooth, R=restrict residual, P=prolongate. Only three mesh levels are shown for simplicity. A.F. Shinn CFD using GPUs 7 / 30
8 Multigrid: How good is it? Consider a unit square 2D domain, solve Laplace equation 2 φ = 0 on that domain Multigrid converges in just a few iterations, whereas using a single grid takes thousands! Figure: Residuals of multigrid and single grid for solution of the Laplace equation on a 256x256 grid, tolerance = A.F. Shinn CFD using GPUs 8 / 30
9 Layout of CU-FLOW code Preprocessing on CPU set I.C. and B.C. generate mesh copy data to GPU Time-stepping loop controlled on CPU for(n=1; n<=nsteps; n++) { Processing solution on GPU (call kernels) advance velocity from u n to u (Adams-Bashforth) advance p n to p n+1 (Multigrid V-cycle) advance u to u n+1 } // end time-stepping loop Postprocessing on CPU copy data from GPU write plot files A.F. Shinn CFD using GPUs 9 / 30
10 Mapping between threads and cells A.F. Shinn CFD using GPUs 10 / 30
11 Multithreading Multigrid Optimal block size may conflict with mesh level dimensions. Example: would like a 4x4x4 mesh as coarsest level, but 32x1x8 is optimal block size. Cannot map one-to-one due to dimensions of block exceeding mesh. Question: how to resolve this conflict? Possible solution: set block size based on mesh level. A.F. Shinn CFD using GPUs 11 / 30
12 Multithreading Multigrid Host code for calling a kernel // define *fine mesh* dimensions of the blocks #define bx_f 32 #define by_f 1 #define bz_f 8 // define *coarse mesh* dimensions of the blocks #define bx_c 4 #define by_c 4 #define bz_c 4... for( n = 1; n<=ngrid; n++) { // use block size for coarse mesh by default bx = bx_c; by = by_c; bz = bz_c; // for finer meshes, use better block size if ( nx[n]%bx_f == 0 && ny[n]%by_f == 0 ) { bx = bx_f; by = by_f; bz = bz_f; } }... dim3 block(bx,by,bz); dim3 grid(nx[n]/bx,ny[n]/by); kernel<<<grid, block>>>(..., n,...); A.F. Shinn CFD using GPUs 12 / 30
13 Multithreading Multigrid Device code for kernel global void kernel(..., n,...) { // i = tx + 2, j = ty + 2 (offset thread indices to mesh indices) i = threadidx.x + blockidx.x * blockdim.x + 2; j = threadidx.y + blockidx.y * blockdim.y + 2; for (slice=0; slice<=nz[n]/blockdim.z-1; slice++) { k = threadidx.z + slice * blockdim.z + 2; m = i + (j-1)*(nx[n]+2) + \ (k-1)*(nx[n]+2)*(ny[n]+2) + begin[n] - 1; } }... kernel computations... A.F. Shinn CFD using GPUs 13 / 30
14 CUDA implementation of Red-Black Gauss-Seidel Color the grid like a checkerboard to enable parallel processing of pressure First update the red pressures, then update the black pressures Figure: 2D example of red-black coloring of a mesh A.F. Shinn CFD using GPUs 14 / 30
15 CUDA implementation of Red-Black Gauss-Seidel Updating pressure: host code for( icyc = 1; icyc<=ncyc; icyc++) { // go through all V-cycles for( n = ngrid; n>=1; n--) { // downleg of V-cycle // use block size for coarse mesh by default bx = bx_c; by = by_c; bz = bz_c; // for finer meshes, use better block size if ( nx[n]%bx_f == 0 && ny[n]%by_f == 0 ) {bx = bx_f; by = by_f; bz = bz_f;} dim3 block(bx,by,bz); dim3 grid(nx[n]/bx,ny[n]/by); for( iswp = 1; iswp<=nswp; iswp++) { red_kernel<<<grid, block>>>(..., n,...); black_kernel<<<grid, block>>>(..., n,...); }... A.F. Shinn CFD using GPUs 15 / 30
16 CUDA implementation of Red-Black Gauss-Seidel red kernel: device code global void red_kernel(... ) { i = threadidx.x + blockidx.x * blockdim.x + 2; j = threadidx.y + blockidx.y * blockdim.y + 2; for (slice=0; slice<=nz_d[n]/blockdim.z-1; slice++) { k = threadidx.z + slab * blockdim.z + 2; if( (i+j+k)%2==0 ) { // test if red cell m = i + (j-1)*(nx[n]+2)+(k-1)*(nx[n]+2)*(ny[n]+2)+begin[n]-1; xm = xm[m]; xp = xp[m]; ym = ym[m]; yp = yp[m]; zm = zm[m]; zp = zp[m]; res = (aw_d[m] * pressure_d[xm] + ae_d[m] * pressure_d[xp] + \ as_d[m] * pressure_d[ym] + an_d[m] * pressure_d[yp] + \ al_d[m] * pressure_d[zm] + ah_d[m] * pressure_d[zp] + \ resc_d[m]) / ap_d[m]; pressure_d[m] = relxp*(res) + (1.0-relxp)*pressure_d[m]; } // end if } //end slice } //end kernel A.F. Shinn CFD using GPUs 16 / 30
17 Profiling of CU-FLOW Red-black Gauss-Seidel kernels consume over 2/3 of GPU time! Must optimize red-black Gauss-Seidel kernels A.F. Shinn CFD using GPUs 17 / 30
18 CUDA implementation of Red-Black Gauss-Seidel Memory management in red-black kernels - Global memory: easiest, but slow - Shared memory: gives marginally better performance, perhaps due to low data reuse or handling of boundary halos for each sub-domain in shared memory. - Texture memory: fetch device memory through textures instead of expensive global memory load. Currently working on this. This is an alternative to avoid uncoalesed memory loads. A.F. Shinn CFD using GPUs 18 / 30
19 Computational Resources GPU verison: CUDA, CPU version: Fortran. Single-precision used for all calculations. Dell Precision 690 Workstation (Linux: Red Hat Enterprise 5) CPU: 3.0 GHz Intel Xeon GPU: NVIDIA Tesla C1060 ( 1 teraflop) A.F. Shinn CFD using GPUs 19 / 30
20 Laminar Flow in 3D Lid-Driven Cube Figure: Computational domain for 3D lid-driven cube. Re L =1000 mesh: 128x128x128, constant mesh spacing. A.F. Shinn CFD using GPUs 20 / 30
21 Laminar Flow in 3D Lid-Driven Cube A.F. Shinn CFD using GPUs 21 / 30
22 Turbulent Flow in 3D square duct Figure: Computational domain for 3D square duct. Re τ =360 mesh: 256x64x64, 3% geometric stretching in y-z plane. A.F. Shinn CFD using GPUs 22 / 30
23 3D square duct (Re τ =360) Figure: Contours and velocity vectors of instantaneous streamwise velocity in cross-flow plane at x = 2.0. A.F. Shinn CFD using GPUs 23 / 30
24 3D square duct (Re τ =360) (a) present GPU simulation (b) Madabhushi and Vanka Figure: Velocity vectors of mean flowfield in cross-flow plane. A.F. Shinn CFD using GPUs 24 / 30
25 Speedup of GPU vs. CPU Performance of GPU versus CPU for first 100 time-steps of simulation, with block size bx=by=bz=4 Table 1: Laminar flow in lid-driven cube. mesh Fortran code (sec) CUDA code (sec) speedup (CPU/GPU) 16x16x x32x x64x x128x Table 2: Turbulent flow in a square duct. mesh Fortran code (sec) CUDA code (sec) speedup (CPU/GPU) 256x64x x64x A.F. Shinn CFD using GPUs 25 / 30
26 Speedup of GPU vs. CPU Performance of GPU versus CPU for first 100 time-steps of simulation, with block size bx=by=bz=4 on coarser meshes and bx=32,by=1,bz=8 on finer meshes. Table 1: Laminar flow in lid-driven cube mesh Fortran code (sec) GPU code (sec) speedup (CPU/GPU) 16x16x x32x x64x x128x Speedup improved by factor of 2.8 for 128x128x128 case Table 2: Turbulent flow in square duct mesh Fortran code (sec) GPU code (sec) speedup (CPU/GPU) 256x64x Speedup improved by factor of 2.4 for 256x64x64 case A.F. Shinn CFD using GPUs 26 / 30
27 Introduction Speedup of GPU scaled with the problem size; largest problem size yielded maximum speedup. Single precision did not appreciably affect the results, even for turbulent flows. Global memory easiest to use, but worst for memory latency. Need global residuals to observe convergence. This requires cudamemcpy between CPU/GPU. Very expensive, so decide when you really need to see the residuals. A.F. Shinn CFD using GPUs 27 / 30
28 Introduction Optimization can be a time drain. Need to decide when code is good enough Two possibilities: - Code is complete, just needs porting to CUDA and tuning. Maybe have more time to optimize - Code is not complete, need to add physics features, write in CUDA, and tune. Maybe need to spend more time on physics algorithm and get what you can get out of minimal time coding in CUDA A.F. Shinn CFD using GPUs 28 / 30
29 Future Work Introduction Model complex geometries in flow using the Immersed Boundary Method (IBM) Multi-GPU capability - collaborating with John Stone, UIUC A.F. Shinn CFD using GPUs 29 / 30
30 References Introduction [1] H. Ku, R. Hirsh, and T. Taylor. A Pseudospectral Method for Solution of the Three-Dimensional Incompressible Navier-Stokes Equations. Journal of Computational Physics, 70: , [2] R.K. Madabhushi and S.P. Vanka. Large eddy simulation of turbulence-driven secondary flow in a square duct. Phys. Fluids, 3(11): , A.F. Shinn CFD using GPUs 30 / 30
Computational Fluid Dynamics (CFD) using Graphics Processing Units
Computational Fluid Dynamics (CFD) using Graphics Processing Units Aaron F. Shinn Mechanical Science and Engineering Dept., UIUC Accelerators for Science and Engineering Applications: GPUs and Multicores
More informationCOMPUTATIONAL FLUID DYNAMICS USING GRAPHICS PROCESSING UNITS: CHALLENGES AND OPPORTUNITIES
Proceedings of the ASME 2011 International Mechanical Engineering Congress & Exposition IMECE2011 November 11-17, 2011, Denver, Colorado, USA IMECE2011-65260 COMPUTATIONAL FLUID DYNAMICS USING GRAPHICS
More informationStream Function-Vorticity CFD Solver MAE 6263
Stream Function-Vorticity CFD Solver MAE 66 Charles O Neill April, 00 Abstract A finite difference CFD solver was developed for transient, two-dimensional Cartesian viscous flows. Flow parameters are solved
More informationTwo-Phase flows on massively parallel multi-gpu clusters
Two-Phase flows on massively parallel multi-gpu clusters Peter Zaspel Michael Griebel Institute for Numerical Simulation Rheinische Friedrich-Wilhelms-Universität Bonn Workshop Programming of Heterogeneous
More informationcuibm A GPU Accelerated Immersed Boundary Method
cuibm A GPU Accelerated Immersed Boundary Method S. K. Layton, A. Krishnan and L. A. Barba Corresponding author: labarba@bu.edu Department of Mechanical Engineering, Boston University, Boston, MA, 225,
More information1.2 Numerical Solutions of Flow Problems
1.2 Numerical Solutions of Flow Problems DIFFERENTIAL EQUATIONS OF MOTION FOR A SIMPLIFIED FLOW PROBLEM Continuity equation for incompressible flow: 0 Momentum (Navier-Stokes) equations for a Newtonian
More informationMultigrid Solvers in CFD. David Emerson. Scientific Computing Department STFC Daresbury Laboratory Daresbury, Warrington, WA4 4AD, UK
Multigrid Solvers in CFD David Emerson Scientific Computing Department STFC Daresbury Laboratory Daresbury, Warrington, WA4 4AD, UK david.emerson@stfc.ac.uk 1 Outline Multigrid: general comments Incompressible
More informationComputational Fluid Dynamics using OpenCL a Practical Introduction
19th International Congress on Modelling and Simulation, Perth, Australia, 12 16 December 2011 http://mssanz.org.au/modsim2011 Computational Fluid Dynamics using OpenCL a Practical Introduction T Bednarz
More informationDriven Cavity Example
BMAppendixI.qxd 11/14/12 6:55 PM Page I-1 I CFD Driven Cavity Example I.1 Problem One of the classic benchmarks in CFD is the driven cavity problem. Consider steady, incompressible, viscous flow in a square
More informationCUDA. Fluid simulation Lattice Boltzmann Models Cellular Automata
CUDA Fluid simulation Lattice Boltzmann Models Cellular Automata Please excuse my layout of slides for the remaining part of the talk! Fluid Simulation Navier Stokes equations for incompressible fluids
More informationACCELERATION OF A COMPUTATIONAL FLUID DYNAMICS CODE WITH GPU USING OPENACC
Nonlinear Computational Aeroelasticity Lab ACCELERATION OF A COMPUTATIONAL FLUID DYNAMICS CODE WITH GPU USING OPENACC N I C H O L S O N K. KO U K PA I Z A N P H D. C A N D I D AT E GPU Technology Conference
More informationRealistic Animation of Fluids
Realistic Animation of Fluids p. 1/2 Realistic Animation of Fluids Nick Foster and Dimitri Metaxas Realistic Animation of Fluids p. 2/2 Overview Problem Statement Previous Work Navier-Stokes Equations
More informationIntroduction to Multigrid and its Parallelization
Introduction to Multigrid and its Parallelization! Thomas D. Economon Lecture 14a May 28, 2014 Announcements 2 HW 1 & 2 have been returned. Any questions? Final projects are due June 11, 5 pm. If you are
More informationA Scalable GPU-Based Compressible Fluid Flow Solver for Unstructured Grids
A Scalable GPU-Based Compressible Fluid Flow Solver for Unstructured Grids Patrice Castonguay and Antony Jameson Aerospace Computing Lab, Stanford University GTC Asia, Beijing, China December 15 th, 2011
More informationA Study of the Development of an Analytical Wall Function for Large Eddy Simulation of Turbulent Channel and Rectangular Duct Flow
University of Wisconsin Milwaukee UWM Digital Commons Theses and Dissertations August 2014 A Study of the Development of an Analytical Wall Function for Large Eddy Simulation of Turbulent Channel and Rectangular
More informationTeam 194: Aerodynamic Study of Airflow around an Airfoil in the EGI Cloud
Team 194: Aerodynamic Study of Airflow around an Airfoil in the EGI Cloud CFD Support s OpenFOAM and UberCloud Containers enable efficient, effective, and easy access and use of MEET THE TEAM End-User/CFD
More informationInvestigation of cross flow over a circular cylinder at low Re using the Immersed Boundary Method (IBM)
Computational Methods and Experimental Measurements XVII 235 Investigation of cross flow over a circular cylinder at low Re using the Immersed Boundary Method (IBM) K. Rehman Department of Mechanical Engineering,
More informationAn Embedded Boundary Method with Adaptive Mesh Refinements
An Embedded Boundary Method with Adaptive Mesh Refinements Marcos Vanella and Elias Balaras 8 th World Congress on Computational Mechanics, WCCM8 5 th European Congress on Computational Methods in Applied
More informationPhD Student. Associate Professor, Co-Director, Center for Computational Earth and Environmental Science. Abdulrahman Manea.
Abdulrahman Manea PhD Student Hamdi Tchelepi Associate Professor, Co-Director, Center for Computational Earth and Environmental Science Energy Resources Engineering Department School of Earth Sciences
More information2.7 Cloth Animation. Jacobs University Visualization and Computer Graphics Lab : Advanced Graphics - Chapter 2 123
2.7 Cloth Animation 320491: Advanced Graphics - Chapter 2 123 Example: Cloth draping Image Michael Kass 320491: Advanced Graphics - Chapter 2 124 Cloth using mass-spring model Network of masses and springs
More informationEfficient Tridiagonal Solvers for ADI methods and Fluid Simulation
Efficient Tridiagonal Solvers for ADI methods and Fluid Simulation Nikolai Sakharnykh - NVIDIA San Jose Convention Center, San Jose, CA September 21, 2010 Introduction Tridiagonal solvers very popular
More informationPossibility of Implicit LES for Two-Dimensional Incompressible Lid-Driven Cavity Flow Based on COMSOL Multiphysics
Possibility of Implicit LES for Two-Dimensional Incompressible Lid-Driven Cavity Flow Based on COMSOL Multiphysics Masanori Hashiguchi 1 1 Keisoku Engineering System Co., Ltd. 1-9-5 Uchikanda, Chiyoda-ku,
More informationLarge-scale Gas Turbine Simulations on GPU clusters
Large-scale Gas Turbine Simulations on GPU clusters Tobias Brandvik and Graham Pullan Whittle Laboratory University of Cambridge A large-scale simulation Overview PART I: Turbomachinery PART II: Stencil-based
More informationCFD-1. Introduction: What is CFD? T. J. Craft. Msc CFD-1. CFD: Computational Fluid Dynamics
School of Mechanical Aerospace and Civil Engineering CFD-1 T. J. Craft George Begg Building, C41 Msc CFD-1 Reading: J. Ferziger, M. Peric, Computational Methods for Fluid Dynamics H.K. Versteeg, W. Malalasekara,
More information3D ADI Method for Fluid Simulation on Multiple GPUs. Nikolai Sakharnykh, NVIDIA Nikolay Markovskiy, NVIDIA
3D ADI Method for Fluid Simulation on Multiple GPUs Nikolai Sakharnykh, NVIDIA Nikolay Markovskiy, NVIDIA Introduction Fluid simulation using direct numerical methods Gives the most accurate result Requires
More informationENERGY-224 Reservoir Simulation Project Report. Ala Alzayer
ENERGY-224 Reservoir Simulation Project Report Ala Alzayer Autumn Quarter December 3, 2014 Contents 1 Objective 2 2 Governing Equations 2 3 Methodolgy 3 3.1 BlockMesh.........................................
More informationThe Development of a Navier-Stokes Flow Solver with Preconditioning Method on Unstructured Grids
Proceedings of the International MultiConference of Engineers and Computer Scientists 213 Vol II, IMECS 213, March 13-15, 213, Hong Kong The Development of a Navier-Stokes Flow Solver with Preconditioning
More informationMESHLESS SOLUTION OF INCOMPRESSIBLE FLOW OVER BACKWARD-FACING STEP
Vol. 12, Issue 1/2016, 63-68 DOI: 10.1515/cee-2016-0009 MESHLESS SOLUTION OF INCOMPRESSIBLE FLOW OVER BACKWARD-FACING STEP Juraj MUŽÍK 1,* 1 Department of Geotechnics, Faculty of Civil Engineering, University
More informationAdaptive-Mesh-Refinement Hydrodynamic GPU Computation in Astrophysics
Adaptive-Mesh-Refinement Hydrodynamic GPU Computation in Astrophysics H. Y. Schive ( 薛熙于 ) Graduate Institute of Physics, National Taiwan University Leung Center for Cosmology and Particle Astrophysics
More informationBackward facing step Homework. Department of Fluid Mechanics. For Personal Use. Budapest University of Technology and Economics. Budapest, 2010 autumn
Backward facing step Homework Department of Fluid Mechanics Budapest University of Technology and Economics Budapest, 2010 autumn Updated: October 26, 2010 CONTENTS i Contents 1 Introduction 1 2 The problem
More informationAn added mass partitioned algorithm for rigid bodies and incompressible flows
An added mass partitioned algorithm for rigid bodies and incompressible flows Jeff Banks Rensselaer Polytechnic Institute Overset Grid Symposium Mukilteo, WA October 19, 216 Collaborators Bill Henshaw,
More informationPerformance of Implicit Solver Strategies on GPUs
9. LS-DYNA Forum, Bamberg 2010 IT / Performance Performance of Implicit Solver Strategies on GPUs Prof. Dr. Uli Göhner DYNAmore GmbH Stuttgart, Germany Abstract: The increasing power of GPUs can be used
More informationALE Seamless Immersed Boundary Method with Overset Grid System for Multiple Moving Objects
Tenth International Conference on Computational Fluid Dynamics (ICCFD10), Barcelona,Spain, July 9-13, 2018 ICCFD10-047 ALE Seamless Immersed Boundary Method with Overset Grid System for Multiple Moving
More informationHigh-Order Finite-Element Earthquake Modeling on very Large Clusters of CPUs or GPUs
High-Order Finite-Element Earthquake Modeling on very Large Clusters of CPUs or GPUs Gordon Erlebacher Department of Scientific Computing Sept. 28, 2012 with Dimitri Komatitsch (Pau,France) David Michea
More informationWhy Use the GPU? How to Exploit? New Hardware Features. Sparse Matrix Solvers on the GPU: Conjugate Gradients and Multigrid. Semiconductor trends
Imagine stream processor; Bill Dally, Stanford Connection Machine CM; Thinking Machines Sparse Matrix Solvers on the GPU: Conjugate Gradients and Multigrid Jeffrey Bolz Eitan Grinspun Caltech Ian Farmer
More informationDevelopment of an Integrated Computational Simulation Method for Fluid Driven Structure Movement and Acoustics
Development of an Integrated Computational Simulation Method for Fluid Driven Structure Movement and Acoustics I. Pantle Fachgebiet Strömungsmaschinen Karlsruher Institut für Technologie KIT Motivation
More informationSENSEI / SENSEI-Lite / SENEI-LDC Updates
SENSEI / SENSEI-Lite / SENEI-LDC Updates Chris Roy and Brent Pickering Aerospace and Ocean Engineering Dept. Virginia Tech July 23, 2014 Collaborations with Math Collaboration on the implicit SENSEI-LDC
More informationComputational Study of Laminar Flowfield around a Square Cylinder using Ansys Fluent
MEGR 7090-003, Computational Fluid Dynamics :1 7 Spring 2015 Computational Study of Laminar Flowfield around a Square Cylinder using Ansys Fluent Rahul R Upadhyay Master of Science, Dept of Mechanical
More information3D Helmholtz Krylov Solver Preconditioned by a Shifted Laplace Multigrid Method on Multi-GPUs
3D Helmholtz Krylov Solver Preconditioned by a Shifted Laplace Multigrid Method on Multi-GPUs H. Knibbe, C. W. Oosterlee, C. Vuik Abstract We are focusing on an iterative solver for the three-dimensional
More informationThree Dimensional Numerical Simulation of Turbulent Flow Over Spillways
Three Dimensional Numerical Simulation of Turbulent Flow Over Spillways Latif Bouhadji ASL-AQFlow Inc., Sidney, British Columbia, Canada Email: lbouhadji@aslenv.com ABSTRACT Turbulent flows over a spillway
More informationShape optimisation using breakthrough technologies
Shape optimisation using breakthrough technologies Compiled by Mike Slack Ansys Technical Services 2010 ANSYS, Inc. All rights reserved. 1 ANSYS, Inc. Proprietary Introduction Shape optimisation technologies
More informationAdarsh Krishnamurthy (cs184-bb) Bela Stepanova (cs184-bs)
OBJECTIVE FLUID SIMULATIONS Adarsh Krishnamurthy (cs184-bb) Bela Stepanova (cs184-bs) The basic objective of the project is the implementation of the paper Stable Fluids (Jos Stam, SIGGRAPH 99). The final
More informationMiddle East Technical University Mechanical Engineering Department ME 485 CFD with Finite Volume Method Fall 2017 (Dr. Sert)
Middle East Technical University Mechanical Engineering Department ME 485 CFD with Finite Volume Method Fall 2017 (Dr. Sert) ANSYS Fluent Tutorial Developing Laminar Flow in a 2D Channel 1 How to use This
More informationFlow and Heat Transfer in a Mixing Elbow
Flow and Heat Transfer in a Mixing Elbow Objectives The main objectives of the project are to learn (i) how to set up and perform flow simulations with heat transfer and mixing, (ii) post-processing and
More informationD036 Accelerating Reservoir Simulation with GPUs
D036 Accelerating Reservoir Simulation with GPUs K.P. Esler* (Stone Ridge Technology), S. Atan (Marathon Oil Corp.), B. Ramirez (Marathon Oil Corp.) & V. Natoli (Stone Ridge Technology) SUMMARY Over the
More informationAirfoil Design Optimization Using Reduced Order Models Based on Proper Orthogonal Decomposition
Airfoil Design Optimization Using Reduced Order Models Based on Proper Orthogonal Decomposition.5.5.5.5.5.5.5..5.95.9.85.8.75.7 Patrick A. LeGresley and Juan J. Alonso Dept. of Aeronautics & Astronautics
More informationImplementation of Adaptive Coarsening Algorithm on GPU using CUDA
Implementation of Adaptive Coarsening Algorithm on GPU using CUDA 1. Introduction , In scientific computing today, the high-performance computers grow
More informationSELECTIVE ALGEBRAIC MULTIGRID IN FOAM-EXTEND
Student Submission for the 5 th OpenFOAM User Conference 2017, Wiesbaden - Germany: SELECTIVE ALGEBRAIC MULTIGRID IN FOAM-EXTEND TESSA UROIĆ Faculty of Mechanical Engineering and Naval Architecture, Ivana
More informationCUDA/OpenGL Fluid Simulation. Nolan Goodnight
CUDA/OpenGL Fluid Simulation Nolan Goodnight ngoodnight@nvidia.com Document Change History Version Date Responsible Reason for Change 0.1 2/22/07 Nolan Goodnight Initial draft 1.0 4/02/07 Nolan Goodnight
More informationDevelopment of New Method for Flow Computations in Vehicle Ventilation
2005:110 CIV MASTER S THESIS Development of New Method for Flow Computations in Vehicle Ventilation FRIDA NORDIN MASTER OF SCIENCE PROGRAMME Luleå University of Technology Department of Applied Physics
More informationStudies of the Continuous and Discrete Adjoint Approaches to Viscous Automatic Aerodynamic Shape Optimization
Studies of the Continuous and Discrete Adjoint Approaches to Viscous Automatic Aerodynamic Shape Optimization Siva Nadarajah Antony Jameson Stanford University 15th AIAA Computational Fluid Dynamics Conference
More informationHigh-Fidelity Simulation of Unsteady Flow Problems using a 3rd Order Hybrid MUSCL/CD scheme. A. West & D. Caraeni
High-Fidelity Simulation of Unsteady Flow Problems using a 3rd Order Hybrid MUSCL/CD scheme ECCOMAS, June 6 th -11 th 2016, Crete Island, Greece A. West & D. Caraeni Outline Industrial Motivation Numerical
More informationA FULLY COUPLED FINITE VOLUME SOLVER FOR THE SOLUTION OF INCOMPRESSIBLE FLOWS ON LOCALLY REFINED NON-MATCHING BLOCK-STRUCTURED GRIDS
VI International Conference on Adaptive Modeling and Simulation ADMOS 2013 J. P. Moitinho de Almeida, P. Díez, C. Tiago and N. Parés (Eds) A FULLY COUPLED FINITE VOLUME SOLVER FOR THE SOLUTION OF INCOMPRESSIBLE
More informationStrömningslära Fluid Dynamics. Computer laboratories using COMSOL v4.4
UMEÅ UNIVERSITY Department of Physics Claude Dion Olexii Iukhymenko May 15, 2015 Strömningslära Fluid Dynamics (5FY144) Computer laboratories using COMSOL v4.4!! Report requirements Computer labs must
More informationA TALENTED CPU-TO-GPU MEMORY MAPPING TECHNIQUE
A TALENTED CPU-TO-GPU MEMORY MAPPING TECHNIQUE Abu Asaduzzaman, Deepthi Gummadi, and Chok M. Yip Department of Electrical Engineering and Computer Science Wichita State University Wichita, Kansas, USA
More informationParallelization study of a VOF/Navier-Stokes model for 3D unstructured staggered meshes
Parallelization study of a VOF/Navier-Stokes model for 3D unstructured staggered meshes L. Jofre, O. Lehmkuhl, R. Borrell, J. Castro and A. Oliva Corresponding author: cttc@cttc.upc.edu Centre Tecnològic
More informationIntroduction to Parallel Computing with CUDA. Oswald Haan
Introduction to Parallel Computing with CUDA Oswald Haan ohaan@gwdg.de Schedule Introduction to Parallel Computing with CUDA Using CUDA CUDA Application Examples Using Multiple GPUs CUDA Application Libraries
More informationMultigrid Pattern. I. Problem. II. Driving Forces. III. Solution
Multigrid Pattern I. Problem Problem domain is decomposed into a set of geometric grids, where each element participates in a local computation followed by data exchanges with adjacent neighbors. The grids
More informationCalculate a solution using the pressure-based coupled solver.
Tutorial 19. Modeling Cavitation Introduction This tutorial examines the pressure-driven cavitating flow of water through a sharpedged orifice. This is a typical configuration in fuel injectors, and brings
More informationSoftware and Performance Engineering for numerical codes on GPU clusters
Software and Performance Engineering for numerical codes on GPU clusters H. Köstler International Workshop of GPU Solutions to Multiscale Problems in Science and Engineering Harbin, China 28.7.2010 2 3
More informationIntroduction to C omputational F luid Dynamics. D. Murrin
Introduction to C omputational F luid Dynamics D. Murrin Computational fluid dynamics (CFD) is the science of predicting fluid flow, heat transfer, mass transfer, chemical reactions, and related phenomena
More informationModeling External Compressible Flow
Tutorial 3. Modeling External Compressible Flow Introduction The purpose of this tutorial is to compute the turbulent flow past a transonic airfoil at a nonzero angle of attack. You will use the Spalart-Allmaras
More informationNumerical Study of Turbulent Flow over Backward-Facing Step with Different Turbulence Models
Numerical Study of Turbulent Flow over Backward-Facing Step with Different Turbulence Models D. G. Jehad *,a, G. A. Hashim b, A. K. Zarzoor c and C. S. Nor Azwadi d Department of Thermo-Fluids, Faculty
More informationHigh-order solutions of transitional flow over the SD7003 airfoil using compact finite-differencing and filtering
High-order solutions of transitional flow over the SD7003 airfoil using compact finite-differencing and filtering Daniel J. Garmann and Miguel R. Visbal Air Force Research Laboratory, Wright-Patterson
More informationAeroacoustic computations with a new CFD solver based on the Lattice Boltzmann Method
Aeroacoustic computations with a new CFD solver based on the Lattice Boltzmann Method D. Ricot 1, E. Foquet 2, H. Touil 3, E. Lévêque 3, H. Machrouki 4, F. Chevillotte 5, M. Meldi 6 1: Renault 2: CS 3:
More informationSimulation of Flow Development in a Pipe
Tutorial 4. Simulation of Flow Development in a Pipe Introduction The purpose of this tutorial is to illustrate the setup and solution of a 3D turbulent fluid flow in a pipe. The pipe networks are common
More informationAdaptive Mesh Astrophysical Fluid Simulations on GPU. San Jose 10/2/2009 Peng Wang, NVIDIA
Adaptive Mesh Astrophysical Fluid Simulations on GPU San Jose 10/2/2009 Peng Wang, NVIDIA Overview Astrophysical motivation & the Enzo code Finite volume method and adaptive mesh refinement (AMR) CUDA
More informationChapter 1 Introduction
Chapter 1 Introduction Ibrahim Sezai Department of Mechanical Engineering Eastern Mediterranean University Fall 2009-2010 What is CFD? CFD is the simulation of fluids engineering systems using modeling
More informationAn Overview of Computational Fluid Dynamics
An Overview of Computational Fluid Dynamics Dr. Nor Azwadi bin Che Sidik Faculty of Mechanical Engineering Universiti Teknologi Malaysia INSPIRING CREATIVE AND INNOVATIVE MINDS 1 What is CFD? C computational
More informationOverview of Traditional Surface Tracking Methods
Liquid Simulation With Mesh-Based Surface Tracking Overview of Traditional Surface Tracking Methods Matthias Müller Introduction Research lead of NVIDIA PhysX team PhysX GPU acc. Game physics engine www.nvidia.com\physx
More informationNumerical Methods for PDEs. SSC Workgroup Meetings Juan J. Alonso October 8, SSC Working Group Meetings, JJA 1
Numerical Methods for PDEs SSC Workgroup Meetings Juan J. Alonso October 8, 2001 SSC Working Group Meetings, JJA 1 Overview These notes are meant to be an overview of the various memory access patterns
More informationDirected Optimization On Stencil-based Computational Fluid Dynamics Application(s)
Directed Optimization On Stencil-based Computational Fluid Dynamics Application(s) Islam Harb 08/21/2015 Agenda Motivation Research Challenges Contributions & Approach Results Conclusion Future Work 2
More informationLattice Boltzmann with CUDA
Lattice Boltzmann with CUDA Lan Shi, Li Yi & Liyuan Zhang Hauptseminar: Multicore Architectures and Programming Page 1 Outline Overview of LBM An usage of LBM Algorithm Implementation in CUDA and Optimization
More informationFEMLAB Exercise 1 for ChE366
FEMLAB Exercise 1 for ChE366 Problem statement Consider a spherical particle of radius r s moving with constant velocity U in an infinitely long cylinder of radius R that contains a Newtonian fluid. Let
More informationMaximize automotive simulation productivity with ANSYS HPC and NVIDIA GPUs
Presented at the 2014 ANSYS Regional Conference- Detroit, June 5, 2014 Maximize automotive simulation productivity with ANSYS HPC and NVIDIA GPUs Bhushan Desam, Ph.D. NVIDIA Corporation 1 NVIDIA Enterprise
More informationcomputational Fluid Dynamics - Prof. V. Esfahanian
Three boards categories: Experimental Theoretical Computational Crucial to know all three: Each has their advantages and disadvantages. Require validation and verification. School of Mechanical Engineering
More informationNUMERICAL VISCOSITY. Convergent Science White Paper. COPYRIGHT 2017 CONVERGENT SCIENCE. All rights reserved.
Convergent Science White Paper COPYRIGHT 2017 CONVERGENT SCIENCE. All rights reserved. This document contains information that is proprietary to Convergent Science. Public dissemination of this document
More informationThe 3D DSC in Fluid Simulation
The 3D DSC in Fluid Simulation Marek K. Misztal Informatics and Mathematical Modelling, Technical University of Denmark mkm@imm.dtu.dk DSC 2011 Workshop Kgs. Lyngby, 26th August 2011 Governing Equations
More informationIntroduction to Numerical General Purpose GPU Computing with NVIDIA CUDA. Part 1: Hardware design and programming model
Introduction to Numerical General Purpose GPU Computing with NVIDIA CUDA Part 1: Hardware design and programming model Dirk Ribbrock Faculty of Mathematics, TU dortmund 2016 Table of Contents Why parallel
More informationNon-Newtonian Transitional Flow in an Eccentric Annulus
Tutorial 8. Non-Newtonian Transitional Flow in an Eccentric Annulus Introduction The purpose of this tutorial is to illustrate the setup and solution of a 3D, turbulent flow of a non-newtonian fluid. Turbulent
More informationT6: Position-Based Simulation Methods in Computer Graphics. Jan Bender Miles Macklin Matthias Müller
T6: Position-Based Simulation Methods in Computer Graphics Jan Bender Miles Macklin Matthias Müller Jan Bender Organizer Professor at the Visual Computing Institute at Aachen University Research topics
More informationA Novel Approach to High Speed Collision
A Novel Approach to High Speed Collision Avril Slone University of Greenwich Motivation High Speed Impact Currently a very active research area. Generic projectile- target collision 11 th September 2001.
More informationCUDA Memory Types All material not from online sources/textbook copyright Travis Desell, 2012
CUDA Memory Types All material not from online sources/textbook copyright Travis Desell, 2012 Overview 1. Memory Access Efficiency 2. CUDA Memory Types 3. Reducing Global Memory Traffic 4. Example: Matrix-Matrix
More informationA 3D VOF model in cylindrical coordinates
A 3D VOF model in cylindrical coordinates Marmar Mehrabadi and Markus Bussmann Department of Mechanical and Industrial Engineering, University of Toronto Recently, volume of fluid (VOF) methods have improved
More informationSolver Settings. Introductory FLUENT Training ANSYS, Inc. All rights reserved. ANSYS, Inc. Proprietary
Solver Settings Introductory FLUENT Training 2006 ANSYS, Inc. All rights reserved. 2006 ANSYS, Inc. All rights reserved. 5-2 Outline Using the Solver Setting Solver Parameters Convergence Definition Monitoring
More informationModule Memory and Data Locality
GPU Teaching Kit Accelerated Computing Module 4.4 - Memory and Data Locality Tiled Matrix Multiplication Kernel Objective To learn to write a tiled matrix-multiplication kernel Loading and using tiles
More informationGradient Free Design of Microfluidic Structures on a GPU Cluster
Gradient Free Design of Microfluidic Structures on a GPU Cluster Austen Duffy - Florida State University SIAM Conference on Computational Science and Engineering March 2, 2011 Acknowledgements This work
More informationUnstructured Mesh Generation for Implicit Moving Geometries and Level Set Applications
Unstructured Mesh Generation for Implicit Moving Geometries and Level Set Applications Per-Olof Persson (persson@mit.edu) Department of Mathematics Massachusetts Institute of Technology http://www.mit.edu/
More informationOpenACC programming for GPGPUs: Rotor wake simulation
DLR.de Chart 1 OpenACC programming for GPGPUs: Rotor wake simulation Melven Röhrig-Zöllner, Achim Basermann Simulations- und Softwaretechnik DLR.de Chart 2 Outline Hardware-Architecture (CPU+GPU) GPU computing
More informationMETHODS FOR MULTILEVEL PARALLELISM ON GPU CLUSTERS: APPLICATION TO A MULTIGRID ACCELERATED NAVIER-STOKES SOLVER
METHODS FOR MULTILEVEL PARALLELISM ON GPU CLUSTERS: APPLICATION TO A MULTIGRID ACCELERATED NAVIER-STOKES SOLVER by Dana A. Jacobsen A thesis submitted in partial fulfillment of the requirements for the
More informationNumerical Analysis of Shock Tube Problem by using TVD and ACM Schemes
Numerical Analysis of Shock Tube Problem by using TVD and Schemes Dr. Mukkarum Husain, Dr. M. Nauman Qureshi, Syed Zaid Hasany IST Karachi, Email: mrmukkarum@yahoo.com Abstract Computational Fluid Dynamics
More informationTurbulence Modeling. Gilles Eggenspieler, Ph.D. Senior Product Manager
Turbulence Modeling Gilles Eggenspieler, Ph.D. Senior Product Manager 1 Overview The Role of Steady State (RANS) Turbulence Modeling Overview of Reynolds-Averaged Navier Stokes (RANS) Modeling Capabilities
More information60x Computational Fluid Dynamics and Visualisation
60x Computational Fluid Dynamics and Visualisation Jamil Appa BAE Systems Advanced Technology Centre 1 Outline BAE Systems - Introduction Aerodynamic Design Challenges Why GPUs? CFD on GPUs Example Kernel
More informationGeneric Refinement and Block Partitioning enabling efficient GPU CFD on Unstructured Grids
Generic Refinement and Block Partitioning enabling efficient GPU CFD on Unstructured Grids Matthieu Lefebvre 1, Jean-Marie Le Gouez 2 1 PhD at Onera, now post-doc at Princeton, department of Geosciences,
More informationA High-Order Accurate Unstructured GMRES Solver for Poisson s Equation
A High-Order Accurate Unstructured GMRES Solver for Poisson s Equation Amir Nejat * and Carl Ollivier-Gooch Department of Mechanical Engineering, The University of British Columbia, BC V6T 1Z4, Canada
More informationRecent developments for the multigrid scheme of the DLR TAU-Code
www.dlr.de Chart 1 > 21st NIA CFD Seminar > Axel Schwöppe Recent development s for the multigrid scheme of the DLR TAU-Code > Apr 11, 2013 Recent developments for the multigrid scheme of the DLR TAU-Code
More informationFlow in an Intake Manifold
Tutorial 2. Flow in an Intake Manifold Introduction The purpose of this tutorial is to model turbulent flow in a simple intake manifold geometry. An intake manifold is a system of passages which carry
More informationCartoon parallel architectures; CPUs and GPUs
Cartoon parallel architectures; CPUs and GPUs CSE 6230, Fall 2014 Th Sep 11! Thanks to Jee Choi (a senior PhD student) for a big assist 1 2 3 4 5 6 7 8 9 10 11 12 13 14 ~ socket 14 ~ core 14 ~ HWMT+SIMD
More informationWhat is Multigrid? They have been extended to solve a wide variety of other problems, linear and nonlinear.
AMSC 600/CMSC 760 Fall 2007 Solution of Sparse Linear Systems Multigrid, Part 1 Dianne P. O Leary c 2006, 2007 What is Multigrid? Originally, multigrid algorithms were proposed as an iterative method to
More informationFAST ALGORITHMS FOR CALCULATIONS OF VISCOUS INCOMPRESSIBLE FLOWS USING THE ARTIFICIAL COMPRESSIBILITY METHOD
TASK QUARTERLY 12 No 3, 273 287 FAST ALGORITHMS FOR CALCULATIONS OF VISCOUS INCOMPRESSIBLE FLOWS USING THE ARTIFICIAL COMPRESSIBILITY METHOD ZBIGNIEW KOSMA Institute of Applied Mechanics, Technical University
More information