Stan Posey NVIDIA, Santa Clara, CA, USA;

Size: px
Start display at page:

Download "Stan Posey NVIDIA, Santa Clara, CA, USA;"

Transcription

1 Stan Posey NVIDIA, Santa Clara, CA, USA;

2 Agenda: GPU Progress and Directions for CAE Introduction of GPUs in HPC Progress of CFD on GPUs Review of OpenFOAM on GPUs Discussion on WRF Developments 2

3 CFD Algorithm Suitability for GPUs CFD Speed-Ups Demonstrated in Range of Time Schemes and Spatial Discretization Explicit [usually compressible] ~15x Stencil operations, uniform memory refs ~5x Stencil operations, renumbering schemes ~x Factors Based on Comparisons with Xeon 8-core Sandy Bridge CPU Strategy: Directives Strategy: Directives Implicit [usually incompressible] ~5x ~2-3x Linear algebra solver, uniform memory refs Strategy: Libraries ISVs Linear algebra solver, renumbering schemes Strategy: Libraries Structured Grid Unstructured 3

4 Turbostream: CFD for Turbomachinery Source: Sample Turbostream GPU Simulations Typical Routine Simulation Large-scale Simulation ~19x Speedup 4

5 SD++ and Jameson Aerodynamics Research Stanford University Aerospace Computing Lab Prof. Antony Jameson GPU Application Jameson-developed CFD software SD++ for high order method aerodynamic simulations GPU Benefit Use of 16 x Tesla M2070: 15 hrs vs. 202 hrs for 16 x Xeon X5670 Fast turnaround of complex LES simulations that would otherwise be impractical for CPU-only use 15 hours on 16 x M2070s 202 hours ( > one week) on 16 Xeon x5670 CPUs Transitional flow over SD70053 airfoil, 21M DOF, Ma =.2, Re=60K, AoA=4, 4 th order, 400K RK iters 5

6 Fighter Jet Engine Noise Reduction on GPUs U.S. DoD Naval Research Lab Lab for Computational Physics and Fluid Dynamics GPU Application GPU Benefit NRL-developed CFD software JENRE for simulation of jet engine acoustics Use of Tesla M2070: 3x vs. Hex core Intel (Westmere) CPU More detailed mesh simulations possible for longer durations of jet engine transient conditions 6

7 Commercial Aircraft Wing Design on GPUs COMAC and SJTU Commercial Aircraft Corporation of China GPU Application SJTU-developed CFD software NUS3D for aerodynamic simulations of wing shapes COMAC Wing Candidate GPU Benefit Use of Tesla C2070: 20x 37x vs. single core Intel core i7 CPU Faster simulations for more wing design candidates vs. wind tunnel testing Expanding to multi-gpu and full aircraft ONERA M6 Wing CFD Simulation 7

8 GPU Development Status for CFD Particle CFD (LBM, SPH, etc.) generally better fit vs. continuum Fully deployed explicit solvers generally outperform implicit Explicit i,j,k stencil operations good fit for massively parallel threads Most CFD is distributed parallel across CPU multicores/nodes Fits GPU parallel model well and preserves costly MPI investment Focus on hybrid parallel schemes that utilize all CPU cores + GPU GPU development strategy depends on profile starting point: Legacy explicit scheme: compiler directives such as OpenACC New explicit scheme: CUDA and stencil libraries Legacy implicit scheme: CUDA and libs for solver; OpenACC for rest New implicit scheme: CUDA and libs for solver, matrix assembly, etc. 8

9 FluiDyna and Aerodynamic-Aware Surface Design RTT DeltaGen for photo realistic 3D visualization Integrates FluiDyna LBultra CFD functionality as plug-in Designer only must specify resolution and velocity Simulation data displayed live with GPU performance Courtesy of FluiDyna and Lbultra CFD Software: 9

10 Prometech and Particleworks for Multiphase Flow Oil Flow in HB Gearbox MPS-based method developed at the University of Tokyo [Prof. Koshizuka] Particleworks 3.0 GPU vs. 4 core i7 Courtesy of Prometech Software and Particleworks CFD Software 10

11 Availability of Commercial DSFD-Based Software ISV Software Application Method GPU Status PowerFLOW Aerodynamics LBM Evaluation Lbultra Aerodynamics LBM Available v2.0 XFlow Aerodynamics LBM Evaluation Project Falcon Aerodynamics LBM Evaluation Particleworks Multiphase/FS MPS (~SPH) Available v3.1 BARRACUDA Multiphase/FS MP-PIC In development EDEM Discrete phase DEM In development ANSYS Fluent DDPM Multiphase/FS DEM In development STAR-CCM+ Multiphase/FS DEM Evaluation AFEA High impact SPH Available v2.0 ESI High impact SPH, ALE In development LSTC High impact SPH, ALE Evaluation Altair High impact SPH, ALE Evaluation 11

12 Grid-Based Commercial CFD and GPU Progress ISV Primary Applications (Green color indicates CUDA-ready during 2013) ANSYS ANSYS Mechanical; ANSYS Fluent; ANSYS HFSS DS SIMULIA Abaqus/Standard; Abaqus/Explicit; Abaqus/CFD MSC Software Altair CD-adapco Autodesk ESI Group Siemens LSTC Mentor Metacomp MSC Nastran; Marc; Adams RADIOSS; AcuSolve STAR-CD; STAR-CCM+ AS Mechanical, Moldflow, AS CFD PAM-CRASH imp; CFD-ACE+ NX Nastran LS-DYNA; LS-DYNA CFD FloEFD, FloTherm CFD++ 12

13 Additional Commercial GPU Developments ISV Domain Location Primary Applications FluiDyna CFD Germany Culises for OpenFOAM; LBultra Vratis CFD Poland Speed-IT for OpenFOAM; ARAEL Prometech CFD Japan Particleworks Turbostream CFD England, UK Turbostream IMPETUS Explicit FEA Sweden AFEA AVL CFD Austria FIRE CoreTech CFD (molding) Taiwan Moldex3D Intes Implicit FEA Germany PERMAS Next Limit CFD Spain XFlow CPFD CFD USA BARRACUDA Flow Science CFD USA FLOW-3D 13

14 Status Summary of ISVs and GPU Computing Every primary ISV has products available on GPUs or undergoing evaluation The 4 largest ISVs all have products based on GPUs, some at 3rd generation #1 ANSYS, #2 DS SIMULIA, #3 MSC Software, and #4 Altair The top 4 out of 5 ISV applications are available on GPUs today ANSYS Fluent, ANSYS Mechanical, Abaqus/Standard, MSC Nastran, (LS-DYNA implicit only) Several new ISVs were founded with GPUs as a primary competitive strategy Prometech, FluiDyna, Vratis, IMPETUS, Turbostream Open source CFD OpenFOAM available on GPUs today with many options Commercial options: FluiDyna, Vratis; Open source options: Cufflink, Symscape ofgpu, RAS, etc. 14

15 CFD Algorithm Characterization: Discretization Structured Grid FV Unstructured FV Unstructured FE Finite Volume Finite Element: 15

16 CFD Algorithm Characterization: Time Integration Structured Grid FV Unstructured FV Unstructured FE Explicit Usually Compressible Finite Volume Finite Element: Implicit Usually Incompressible 16

17 CFD Algorithm Characterization: Time Integration Structured Grid FV Unstructured FV Unstructured FE Explicit Numerical operations on I,J,K stencil, no solver [Typically flat profiles: GPU strategy of directives (OpenACC)] Usually Compressible Finite Volume Finite Element: Implicit Usually Incompressible 17

18 GPU Acceleration Relative to Single 8-Core CPU Structured Grid FV Unstructured FV Unstructured FE Explicit Usually Compressible Implicit Usually Incompressible ~15x ~5x Turbostream Veloxi SJTU RANS Finite Volume - SD++ Stanford (Jameson) - FEFLO (Lohner) Finite Element: 18

19 GPU Acceleration Relative to Single 8-Core CPU Structured Grid FV Unstructured FV Unstructured FE Explicit Usually Compressible ~15x ~5x Turbostream Veloxi SJTU RANS Finite Volume - SD++ Stanford (Jameson) - FEFLO (Lohner) Finite Element: Implicit Usually Incompressible Sparse matrix linear algebra iterative solvers [Hot spot ~50%, small % LoC: GPU strategy of CUDA and libs] 19

20 GPU Acceleration Relative to Single 8-Core CPU Structured Grid FV Unstructured FV Unstructured FE Explicit Usually Compressible ~15x ~5x Turbostream Veloxi SJTU RANS - SD++ Stanford (Jameson) - FEFLO (Lohner) Implicit Usually Incompressible Finite Volume - ANSYS Fluent - Culises for OpenFOAM - SpeedIT for OpenFOAM - CFD-ACE+ - FIRE ~2x Finite Element: - Moldflow - AcuSolve - Moldex3D 20

21 Commercial CFD Focus on Sparse Solvers for GPU CFD Application Software Read input, matrix Set-up GPU Implicit Sparse Matrix Operations - Hand-CUDA Parallel - GPU Libraries, CUBLAS 50% - 65% of Profile time, Small % LoC Implicit Sparse Matrix Operations (Investigating OpenACC for more tasks on GPU) CPU - OpenACC Directives Global solution, write output + 21

22 NVIDIA-Developed Library of Linear Solvers Library of nested solvers for large sparse Ax=b Nesting creates a solver hierarchy, e.g. BiCGstab AMG Jacobi Example solvers MC-DILU Jacobi, simple local (neighbor) operations, no/little setup BiCGStab, local and global operations, no setup MC-DILU, graph coloring and factorization at setup AMG, multi-level scheme, on each level: graph coarsening and matrixmatrix products at setup 22

23 ISV Progress with NVIDIA CFD Solver Library Committed: ANSYS ANSYS Fluent and ANSYS CFD : #1 in CFD FluiDyna Culises library use in OpenFOAM: OpenFOAM is #2 in CFD for leveraged hardware Evaluation: Autodesk AS Moldflow: the leader in plastic mold injection simulation Autodesk AS CFD: important to the design engineering market and being hosted on Autodesk cloud Discussion: CD-adapco STAR-CCM+: the # 2 CFD code for software rev, either #2 or #3 for leveraged hardware ESI CFD-ACE+: important CFD code in the semiconductor/electronics industry along with others Cradle SC/Tetra: #3 CFD in Japan (behind ANSYS Fluent and STAR-CCM+) and primary CFD code at Toyota Targets: Altair AcuSolve: GMRES Metacomp CFD++: AMG Mentor FloEFD: AMG SIMULIA Abaqus/CFD: use ML from Petsc LSTC LS-DYNA CFD: AMG AVL FIRE: AMG Convergent Technologies Converge CFD: GMRES 23

24 ANSYS and NVIDIA Technical Collaboration Release ANSYS Mechanical ANSYS Fluent ANSYS EM 13.0 Dec 2010 SMP, Single GPU, Sparse and PCG/JCG Solvers ANSYS Nexxim 14.0 Dec Distributed ANSYS; + Multi-node Support Radiation Heat Transfer (beta) ANSYS Nexxim 14.5 Nov Multi-GPU Support; + Hybrid PCG; + Kepler GPU Support + Radiation HT; + GPU AMG Solver (beta), Single GPU ANSYS Nexxim 15.0 Q CUDA 5 Kepler Tuning + Multi-GPU AMG Solver; + CUDA 5 Kepler Tuning ANSYS Nexxim ANSYS HFSS (Transient) 24

25 ANSYS Fluent 14.5 and Radiation HT on GPU VIEWFAC Utility: Use on CPUs, GPUs or both ~2x speedup Radiation HT Applications: - Underhood cooling - Cabin comfort HVAC - Furnace simulations RAY TRACING Utility: Uses OptiX library from NVIDIA with up to ~15x speedup (Use on GPU only) - Solar loads on buildings - Combustor in turbine - Electronics passive cooling 25

26 ANSYS Fluent CPU Job Profile for Coupled PBNS Non-linear iterations Assemble Linear System of Equations Runtime: ~ 35% Accelerate this first Solve Linear System of Equations: Ax = b ~ 65% No Converged? Yes Stop 26

27 ANSYS Fluent AMG Solver Time per Iteration (Sec) ANSYS Fluent GPU-Based AMG Solver from NVIDIA ANSYS Fluent 14.5 Performance Results by NVIDIA, Nov Airfoil and Aircraft Models with Hexahedral Cells K20X 3930K(6) Lower is Better 2 x Core-i7 3930K, Only 6 Cores Used 6 2.4x Solver settings: x Airfoil (hex 784K) Aircraft (hex 1798K) CPU Fluent solver: F-cycle, agg8, DILU, 0pre, 3post GPU nvamg solver: V-cycle, agg8, MC-DILU, 0pre, 3post NOTE: Times for solver only 27

28 Comparison of AMG Cycles on CPU and GPU 2D Convection Case: F-cycle best for both CPU and GPU Lower is Better CPU-F GPU-F 28

29 GPUs and Distributed Cluster Computing Partition on CPU Geometry decomposed: partitions put on independent cluster nodes; CPU distributed parallel processing Nodes distributed parallel using MPI N1 N1 N2 N3 N4 1 Global Solution 29

30 GPUs and Distributed Cluster Computing Partition on CPU Geometry decomposed: partitions put on independent cluster nodes; CPU distributed parallel processing Nodes distributed parallel using MPI N1 1 Execution on CPU + GPU N1 N2 N3 N4 G1 G2 G3 G4 GPUs shared memory parallel using OpenMP under distributed parallel Global Solution 30

31 ANSYS Fluent Preview for 2 x CPU + 2 x Tesla K20X ANSYS Fluent 15.0 Preview Performance Results by NVIDIA, Feb x K20X E5_2680(16) Lower is Better 2 x E5_2680 SB CPUs, 16 cores total, only 2 cores used with GPUs 1 2.1x Solver settings: x CPU Fluent solver: F-cycle, agg8, DILU, 0pre, 3post 0 Helix (tet 1173K) Airfoil (hex 784K) GPU nvamg solver: V-cycle, agg8, MC-DILU, 0pre, 3post NOTE: Times for solver only 31

32 ANSYS Fluent Scaling Results for 4 x Tesla K20X ANSYS Fluent 15.0 Preview Performance Results by NVIDIA, Mar Helix (Tet 1.2M) Airfoil (Hex.78M) Sedan (Mixed 3.6M) Perfect Scaling Higher is Better Hardware Setup: 2 server nodes 2 GPUs each node Infiniband network GPU Solver Settings: V-cycle, agg8/2, MC-DILU, 0pre, 3post 1 K20X(1) K20X(2) K20X(3) K20X(4) NOTES: Results for solver only Sedan case starts with 2 GPUs 32

33 ANSYS Fluent 15.0 Multi-GPU Demonstration Multi-GPU Acceleration of a 16-Core ANSYS Fluent Simulation of External Aero 2.9X Solver Speedup Click to Launch Movie Xeon E CPUs + Tesla K20X GPUs CPU Configuration CPU + GPU Configuration 16-Core Server Node 8-Cores 8-Cores G1 G2 G3 G4 33

34 Summary: Opportunity for Advanced CFD Problem Statement: CFD demand for increased levels of CFD model resolution for improved simulation accuracy CFD use is 80% steady state RANS today rather a short-cut to faster turn-around Fluid flow is inherently unsteady and in need of better turbulence treatment CPU-based HPC limits advanced CFD Opportunity: CFD ISVs have developed URANS, DES, and LES capabilities which undergo very limited use CPU-based turnaround times are impractical for many product development workflows Large Eddy Simulation (LES) is of most interest and has a high degree of arithmetic intensity GPU computing can offer a practical solution for LES that doesn t exist today with CPUs 34

35 Conclusions For CAE on GPUs Opportunities exist for GPUs to provide significant performance acceleration for solver intensive large jobs Improved product quality Shorten product engineering cycles (Faster Time-to-Market) Better Total Cost of Ownership (TCO) Cut down energy consumption in the CAE process Simulations recently considered intractable are now possible Large Eddy Simulation (LES) with a high degree of arithmetic intensity Parameter optimization with highly increased number of jobs 35

36 Stan Posey NVIDIA, Santa Clara, CA, USA;

Accelerated ANSYS Fluent: Algebraic Multigrid on a GPU. Robert Strzodka NVAMG Project Lead

Accelerated ANSYS Fluent: Algebraic Multigrid on a GPU. Robert Strzodka NVAMG Project Lead Accelerated ANSYS Fluent: Algebraic Multigrid on a GPU Robert Strzodka NVAMG Project Lead A Parallel Success Story in Five Steps 2 Step 1: Understand Application ANSYS Fluent Computational Fluid Dynamics

More information

The Visual Computing Company

The Visual Computing Company The Visual Computing Company GPU Acceleration Benefits for Applied CAE Axel Koehler, Senior Solutions Architect HPC, NVIDIA HPC Advisory Council Meeting, April 2014, Lugano Outline General overview about

More information

ANSYS Improvements to Engineering Productivity with HPC and GPU-Accelerated Simulation

ANSYS Improvements to Engineering Productivity with HPC and GPU-Accelerated Simulation ANSYS Improvements to Engineering Productivity with HPC and GPU-Accelerated Simulation Ray Browell nvidia Technology Theater SC12 1 2012 ANSYS, Inc. nvidia Technology Theater SC12 HPC Revolution Recent

More information

Stan Posey, CAE Industry Development NVIDIA, Santa Clara, CA, USA

Stan Posey, CAE Industry Development NVIDIA, Santa Clara, CA, USA Stan Posey, CAE Industry Development NVIDIA, Santa Clara, CA, USA NVIDIA and HPC Evolution of GPUs Public, based in Santa Clara, CA ~$4B revenue ~5,500 employees Founded in 1999 with primary business in

More information

The Fermi GPU and HPC Application Breakthroughs

The Fermi GPU and HPC Application Breakthroughs The Fermi GPU and HPC Application Breakthroughs Peng Wang, PhD HPC Developer Technology Group Stan Posey HPC Industry Development NVIDIA, Santa Clara, CA, USA NVIDIA Corporation 2009 Overview GPU Computing:

More information

GPU-Acceleration of CAE Simulations. Bhushan Desam NVIDIA Corporation

GPU-Acceleration of CAE Simulations. Bhushan Desam NVIDIA Corporation GPU-Acceleration of CAE Simulations Bhushan Desam NVIDIA Corporation bdesam@nvidia.com 1 AGENDA GPUs in Enterprise Computing Business Challenges in Product Development NVIDIA GPUs for CAE Applications

More information

Maximize automotive simulation productivity with ANSYS HPC and NVIDIA GPUs

Maximize automotive simulation productivity with ANSYS HPC and NVIDIA GPUs Presented at the 2014 ANSYS Regional Conference- Detroit, June 5, 2014 Maximize automotive simulation productivity with ANSYS HPC and NVIDIA GPUs Bhushan Desam, Ph.D. NVIDIA Corporation 1 NVIDIA Enterprise

More information

ACCELERATING CFD AND RESERVOIR SIMULATIONS WITH ALGEBRAIC MULTI GRID Chris Gottbrath, Nov 2016

ACCELERATING CFD AND RESERVOIR SIMULATIONS WITH ALGEBRAIC MULTI GRID Chris Gottbrath, Nov 2016 ACCELERATING CFD AND RESERVOIR SIMULATIONS WITH ALGEBRAIC MULTI GRID Chris Gottbrath, Nov 2016 Challenges What is Algebraic Multi-Grid (AMG)? AGENDA Why use AMG? When to use AMG? NVIDIA AmgX Results 2

More information

HPC and IT Issues Session Agenda. Deployment of Simulation (Trends and Issues Impacting IT) Mapping HPC to Performance (Scaling, Technology Advances)

HPC and IT Issues Session Agenda. Deployment of Simulation (Trends and Issues Impacting IT) Mapping HPC to Performance (Scaling, Technology Advances) HPC and IT Issues Session Agenda Deployment of Simulation (Trends and Issues Impacting IT) Discussion Mapping HPC to Performance (Scaling, Technology Advances) Discussion Optimizing IT for Remote Access

More information

ANSYS HPC Technology Leadership

ANSYS HPC Technology Leadership ANSYS HPC Technology Leadership 1 ANSYS, Inc. November 14, Why ANSYS Users Need HPC Insight you can t get any other way It s all about getting better insight into product behavior quicker! HPC enables

More information

GPU PROGRESS AND DIRECTIONS IN APPLIED CFD

GPU PROGRESS AND DIRECTIONS IN APPLIED CFD Eleventh International Conference on CFD in the Minerals and Process Industries CSIRO, Melbourne, Australia 7-9 December 2015 GPU PROGRESS AND DIRECTIONS IN APPLIED CFD Stan POSEY 1*, Simon SEE 2, and

More information

ANSYS HPC. Technology Leadership. Barbara Hutchings ANSYS, Inc. September 20, 2011

ANSYS HPC. Technology Leadership. Barbara Hutchings ANSYS, Inc. September 20, 2011 ANSYS HPC Technology Leadership Barbara Hutchings barbara.hutchings@ansys.com 1 ANSYS, Inc. September 20, Why ANSYS Users Need HPC Insight you can t get any other way HPC enables high-fidelity Include

More information

Speedup Altair RADIOSS Solvers Using NVIDIA GPU

Speedup Altair RADIOSS Solvers Using NVIDIA GPU Innovation Intelligence Speedup Altair RADIOSS Solvers Using NVIDIA GPU Eric LEQUINIOU, HPC Director Hongwei Zhou, Senior Software Developer May 16, 2012 Innovation Intelligence ALTAIR OVERVIEW Altair

More information

A Scalable GPU-Based Compressible Fluid Flow Solver for Unstructured Grids

A Scalable GPU-Based Compressible Fluid Flow Solver for Unstructured Grids A Scalable GPU-Based Compressible Fluid Flow Solver for Unstructured Grids Patrice Castonguay and Antony Jameson Aerospace Computing Lab, Stanford University GTC Asia, Beijing, China December 15 th, 2011

More information

GPU-Accelerated Algebraic Multigrid for Commercial Applications. Joe Eaton, Ph.D. Manager, NVAMG CUDA Library NVIDIA

GPU-Accelerated Algebraic Multigrid for Commercial Applications. Joe Eaton, Ph.D. Manager, NVAMG CUDA Library NVIDIA GPU-Accelerated Algebraic Multigrid for Commercial Applications Joe Eaton, Ph.D. Manager, NVAMG CUDA Library NVIDIA ANSYS Fluent 2 Fluent control flow Accelerate this first Non-linear iterations Assemble

More information

HPC Considerations for Scalable Multidiscipline CAE Applications on Conventional Linux Platforms. Author: Correspondence: ABSTRACT:

HPC Considerations for Scalable Multidiscipline CAE Applications on Conventional Linux Platforms. Author: Correspondence: ABSTRACT: HPC Considerations for Scalable Multidiscipline CAE Applications on Conventional Linux Platforms Author: Stan Posey Panasas, Inc. Correspondence: Stan Posey Panasas, Inc. Phone +510 608 4383 Email sposey@panasas.com

More information

Performance of Implicit Solver Strategies on GPUs

Performance of Implicit Solver Strategies on GPUs 9. LS-DYNA Forum, Bamberg 2010 IT / Performance Performance of Implicit Solver Strategies on GPUs Prof. Dr. Uli Göhner DYNAmore GmbH Stuttgart, Germany Abstract: The increasing power of GPUs can be used

More information

Why HPC for. ANSYS Mechanical and ANSYS CFD?

Why HPC for. ANSYS Mechanical and ANSYS CFD? Why HPC for ANSYS Mechanical and ANSYS CFD? 1 HPC Defined High Performance Computing (HPC) at ANSYS: An ongoing effort designed to remove computing limitations from engineers who use computer aided engineering

More information

GE Usage & Trends

GE Usage & Trends CFD @ GE Usage & Trends Dr. Senior Principal Engineer GE Global Research 06 January 2018 Overview of CFD at GE Wide penetration Aviation, Power, Oil & Gas, Renewables Aerodynamics, heat transfer, aeromechanics,

More information

CUDA Accelerated Compute Libraries. M. Naumov

CUDA Accelerated Compute Libraries. M. Naumov CUDA Accelerated Compute Libraries M. Naumov Outline Motivation Why should you use libraries? CUDA Toolkit Libraries Overview of performance CUDA Proprietary Libraries Address specific markets Third Party

More information

Faster Innovation - Accelerating SIMULIA Abaqus Simulations with NVIDIA GPUs. Baskar Rajagopalan Accelerated Computing, NVIDIA

Faster Innovation - Accelerating SIMULIA Abaqus Simulations with NVIDIA GPUs. Baskar Rajagopalan Accelerated Computing, NVIDIA Faster Innovation - Accelerating SIMULIA Abaqus Simulations with NVIDIA GPUs Baskar Rajagopalan Accelerated Computing, NVIDIA 1 Engineering & IT Challenges/Trends NVIDIA GPU Solutions AGENDA Abaqus GPU

More information

System Level Cooling, Fatigue, and Durability. Co-Simulation. Stuart A. Walker, Ph.D.

System Level Cooling, Fatigue, and Durability. Co-Simulation. Stuart A. Walker, Ph.D. System Level Cooling, Fatigue, and Durability Analysis via Multiphysics Co-Simulation Stuart A. Walker, Ph.D. swalker@altair.com Outline Motivation Presentation of process Presentation of tools Presentation

More information

Particleworks: Particle-based CAE Software fully ported to GPU

Particleworks: Particle-based CAE Software fully ported to GPU Particleworks: Particle-based CAE Software fully ported to GPU Introduction PrometechVideo_v3.2.3.wmv 3.5 min. Particleworks Why the particle method? Existing methods FEM, FVM, FLIP, Fluid calculation

More information

Performance Benefits of NVIDIA GPUs for LS-DYNA

Performance Benefits of NVIDIA GPUs for LS-DYNA Performance Benefits of NVIDIA GPUs for LS-DYNA Mr. Stan Posey and Dr. Srinivas Kodiyalam NVIDIA Corporation, Santa Clara, CA, USA Summary: This work examines the performance characteristics of LS-DYNA

More information

AmgX 2.0: Scaling toward CORAL Joe Eaton, November 19, 2015

AmgX 2.0: Scaling toward CORAL Joe Eaton, November 19, 2015 AmgX 2.0: Scaling toward CORAL Joe Eaton, November 19, 2015 Agenda Introduction to AmgX Current Capabilities Scaling V2.0 Roadmap for the future 2 AmgX Fast, scalable linear solvers, emphasis on iterative

More information

Multi-GPU simulations in OpenFOAM with SpeedIT technology.

Multi-GPU simulations in OpenFOAM with SpeedIT technology. Multi-GPU simulations in OpenFOAM with SpeedIT technology. Attempt I: SpeedIT GPU-based library of iterative solvers for Sparse Linear Algebra and CFD. Current version: 2.2. Version 1.0 in 2008. CMRS format

More information

Dell EMC Ready Bundle for HPC Digital Manufacturing Dassault Systѐmes Simulia Abaqus Performance

Dell EMC Ready Bundle for HPC Digital Manufacturing Dassault Systѐmes Simulia Abaqus Performance Dell EMC Ready Bundle for HPC Digital Manufacturing Dassault Systѐmes Simulia Abaqus Performance This Dell EMC technical white paper discusses performance benchmarking results and analysis for Simulia

More information

SIDACTGmbH. D ata Analysis and Compression Technologies. femzip: compression of simulation results

SIDACTGmbH. D ata Analysis and Compression Technologies. femzip: compression of simulation results SIDACTGmbH Simulation D D ata Analysis and Compression Technologies femzip: compression of simulation results Compression with FEMZIP FEMZIP tools are specialized for the compression of simulation results.

More information

Large-scale Gas Turbine Simulations on GPU clusters

Large-scale Gas Turbine Simulations on GPU clusters Large-scale Gas Turbine Simulations on GPU clusters Tobias Brandvik and Graham Pullan Whittle Laboratory University of Cambridge A large-scale simulation Overview PART I: Turbomachinery PART II: Stencil-based

More information

Real Application Performance and Beyond

Real Application Performance and Beyond Real Application Performance and Beyond Mellanox Technologies Inc. 2900 Stender Way, Santa Clara, CA 95054 Tel: 408-970-3400 Fax: 408-970-3403 http://www.mellanox.com Scientists, engineers and analysts

More information

ANSYS EnSight. Advanced PostProcessing capabilities. Janos Szegletes

ANSYS EnSight. Advanced PostProcessing capabilities. Janos Szegletes ANSYS EnSight Advanced PostProcessing capabilities Janos Szegletes 2018.05.03. www.econengineering.com1 Fluid Packages - Overview ANSYS CFD Premium ANSYS Ensight SpaceClaim 4 HPC DesignXplorer CFX Fluent

More information

Turbostream: A CFD solver for manycore

Turbostream: A CFD solver for manycore Turbostream: A CFD solver for manycore processors Tobias Brandvik Whittle Laboratory University of Cambridge Aim To produce an order of magnitude reduction in the run-time of CFD solvers for the same hardware

More information

Recent Advances in Modelling Wind Parks in STAR CCM+ Steve Evans

Recent Advances in Modelling Wind Parks in STAR CCM+ Steve Evans Recent Advances in Modelling Wind Parks in STAR CCM+ Steve Evans Introduction Company STAR-CCM+ Agenda Wind engineering at CD-adapco STAR-CCM+ & EnviroWizard Developments for Offshore Simulation CD-adapco:

More information

Application of GPU technology to OpenFOAM simulations

Application of GPU technology to OpenFOAM simulations Application of GPU technology to OpenFOAM simulations Jakub Poła, Andrzej Kosior, Łukasz Miroslaw jakub.pola@vratis.com, www.vratis.com Wroclaw, Poland Agenda Motivation Partial acceleration SpeedIT OpenFOAM

More information

Solving Large Complex Problems. Efficient and Smart Solutions for Large Models

Solving Large Complex Problems. Efficient and Smart Solutions for Large Models Solving Large Complex Problems Efficient and Smart Solutions for Large Models 1 ANSYS Structural Mechanics Solutions offers several techniques 2 Current trends in simulation show an increased need for

More information

Unstructured Grid Numbering Schemes for GPU Coalescing Requirements

Unstructured Grid Numbering Schemes for GPU Coalescing Requirements Unstructured Grid Numbering Schemes for GPU Coalescing Requirements Andrew Corrigan 1 and Johann Dahm 2 Laboratories for Computational Physics and Fluid Dynamics Naval Research Laboratory 1 Department

More information

Digital-X. Towards Virtual Aircraft Design and Testing based on High-Fidelity Methods - Recent Developments at DLR -

Digital-X. Towards Virtual Aircraft Design and Testing based on High-Fidelity Methods - Recent Developments at DLR - Digital-X Towards Virtual Aircraft Design and Testing based on High-Fidelity Methods - Recent Developments at DLR - O. Brodersen, C.-C. Rossow, N. Kroll DLR Institute of Aerodynamics and Flow Technology

More information

First Steps of YALES2 Code Towards GPU Acceleration on Standard and Prototype Cluster

First Steps of YALES2 Code Towards GPU Acceleration on Standard and Prototype Cluster First Steps of YALES2 Code Towards GPU Acceleration on Standard and Prototype Cluster YALES2: Semi-industrial code for turbulent combustion and flows Jean-Matthieu Etancelin, ROMEO, NVIDIA GPU Application

More information

Directed Optimization On Stencil-based Computational Fluid Dynamics Application(s)

Directed Optimization On Stencil-based Computational Fluid Dynamics Application(s) Directed Optimization On Stencil-based Computational Fluid Dynamics Application(s) Islam Harb 08/21/2015 Agenda Motivation Research Challenges Contributions & Approach Results Conclusion Future Work 2

More information

Recent Developments and Roadmap Part 0: Introduction. 12 th International LS-DYNA User s Conference June 5, 2012

Recent Developments and Roadmap Part 0: Introduction. 12 th International LS-DYNA User s Conference June 5, 2012 Recent Developments and Roadmap Part 0: Introduction 12 th International LS-DYNA User s Conference June 5, 2012 1 Outline Introduction Recent developments. See the separate PDFs for: LS-PrePost Dummies

More information

Appendix P. Multi-Physics Simulation Technology in NX. Christian Ruel (Maya Htt, Canada)

Appendix P. Multi-Physics Simulation Technology in NX. Christian Ruel (Maya Htt, Canada) 251 Appendix P Multi-Physics Simulation Technology in NX Christian Ruel (Maya Htt, Canada) 252 Multi-Physics Simulation Technology in NX Abstract As engineers increasingly rely on simulation models within

More information

Reproducibility of Complex Turbulent Flow Using Commercially-Available CFD Software

Reproducibility of Complex Turbulent Flow Using Commercially-Available CFD Software Reports of Research Institute for Applied Mechanics, Kyushu University No.150 (47 59) March 2016 Reproducibility of Complex Turbulent Using Commercially-Available CFD Software Report 1: For the Case of

More information

SENSEI / SENSEI-Lite / SENEI-LDC Updates

SENSEI / SENSEI-Lite / SENEI-LDC Updates SENSEI / SENSEI-Lite / SENEI-LDC Updates Chris Roy and Brent Pickering Aerospace and Ocean Engineering Dept. Virginia Tech July 23, 2014 Collaborations with Math Collaboration on the implicit SENSEI-LDC

More information

OpenFOAM + GPGPU. İbrahim Özküçük

OpenFOAM + GPGPU. İbrahim Özküçük OpenFOAM + GPGPU İbrahim Özküçük Outline GPGPU vs CPU GPGPU plugins for OpenFOAM Overview of Discretization CUDA for FOAM Link (cufflink) Cusp & Thrust Libraries How Cufflink Works Performance data of

More information

GTC 2013: DEVELOPMENTS IN GPU-ACCELERATED SPARSE LINEAR ALGEBRA ALGORITHMS. Kyle Spagnoli. Research EM Photonics 3/20/2013

GTC 2013: DEVELOPMENTS IN GPU-ACCELERATED SPARSE LINEAR ALGEBRA ALGORITHMS. Kyle Spagnoli. Research EM Photonics 3/20/2013 GTC 2013: DEVELOPMENTS IN GPU-ACCELERATED SPARSE LINEAR ALGEBRA ALGORITHMS Kyle Spagnoli Research Engineer @ EM Photonics 3/20/2013 INTRODUCTION» Sparse systems» Iterative solvers» High level benchmarks»

More information

Industrial achievements on Blue Waters using CPUs and GPUs

Industrial achievements on Blue Waters using CPUs and GPUs Industrial achievements on Blue Waters using CPUs and GPUs HPC User Forum, September 17, 2014 Seattle Seid Korić PhD Technical Program Manager Associate Adjunct Professor koric@illinois.edu Think Big!

More information

CMSC 714 Lecture 6 MPI vs. OpenMP and OpenACC. Guest Lecturer: Sukhyun Song (original slides by Alan Sussman)

CMSC 714 Lecture 6 MPI vs. OpenMP and OpenACC. Guest Lecturer: Sukhyun Song (original slides by Alan Sussman) CMSC 714 Lecture 6 MPI vs. OpenMP and OpenACC Guest Lecturer: Sukhyun Song (original slides by Alan Sussman) Parallel Programming with Message Passing and Directives 2 MPI + OpenMP Some applications can

More information

Fluid-Structure Interaction in STAR-CCM+ Alan Mueller CD-adapco

Fluid-Structure Interaction in STAR-CCM+ Alan Mueller CD-adapco Fluid-Structure Interaction in STAR-CCM+ Alan Mueller CD-adapco What is FSI? Air Interaction with a Flexible Structure What is FSI? Water/Air Interaction with a Structure Courtesy CFD Marine Courtesy Germanischer

More information

GPU Computing Ecosystem

GPU Computing Ecosystem GPU Computing Ecosystem CUDA 5 Enterprise level GPU Development GPU Development Paths Libraries, Directives, Languages GPU Tools Tools, libraries and plug-ins for GPU codes Tesla K10 Kepler! Tesla K20

More information

Impact of STAR-CCM+ v7.0 in the Automotive Industry Frederick J. Ross, CD-adapco Director, Ground Transportation

Impact of STAR-CCM+ v7.0 in the Automotive Industry Frederick J. Ross, CD-adapco Director, Ground Transportation Impact of STAR-CCM+ v7.0 in the Automotive Industry Frederick J. Ross, CD-adapco Director, Ground Transportation Vehicle Simulation Components Vehicle Aerodynamics Design Studies Aeroacoustics Water/Dirt

More information

GPU Computing fuer rechenintensive Anwendungen. Axel Koehler NVIDIA

GPU Computing fuer rechenintensive Anwendungen. Axel Koehler NVIDIA GPU Computing fuer rechenintensive Anwendungen Axel Koehler NVIDIA GeForce Quadro Tegra Tesla 2 Continued Demand for Ever Faster Supercomputers First-principles simulation of combustion for new high-efficiency,

More information

Two-Phase flows on massively parallel multi-gpu clusters

Two-Phase flows on massively parallel multi-gpu clusters Two-Phase flows on massively parallel multi-gpu clusters Peter Zaspel Michael Griebel Institute for Numerical Simulation Rheinische Friedrich-Wilhelms-Universität Bonn Workshop Programming of Heterogeneous

More information

PARALUTION - a Library for Iterative Sparse Methods on CPU and GPU

PARALUTION - a Library for Iterative Sparse Methods on CPU and GPU - a Library for Iterative Sparse Methods on CPU and GPU Dimitar Lukarski Division of Scientific Computing Department of Information Technology Uppsala Programming for Multicore Architectures Research Center

More information

Recent results with elsa on multi-cores

Recent results with elsa on multi-cores Michel Gazaix (ONERA) Steeve Champagneux (AIRBUS) October 15th, 2009 Outline Short introduction to elsa elsa benchmark on HPC platforms Detailed performance evaluation IBM Power5, AMD Opteron, INTEL Nehalem

More information

Missile External Aerodynamics Using Star-CCM+ Star European Conference 03/22-23/2011

Missile External Aerodynamics Using Star-CCM+ Star European Conference 03/22-23/2011 Missile External Aerodynamics Using Star-CCM+ Star European Conference 03/22-23/2011 StarCCM_StarEurope_2011 4/6/11 1 Overview 2 Role of CFD in Aerodynamic Analyses Classical aerodynamics / Semi-Empirical

More information

Dell HPC System for Manufacturing System Architecture and Application Performance

Dell HPC System for Manufacturing System Architecture and Application Performance Dell HPC System for Manufacturing System Architecture and Application Performance This Dell technical white paper describes the architecture of the Dell HPC System for Manufacturing and discusses performance

More information

ACCELERATION OF A COMPUTATIONAL FLUID DYNAMICS CODE WITH GPU USING OPENACC

ACCELERATION OF A COMPUTATIONAL FLUID DYNAMICS CODE WITH GPU USING OPENACC Nonlinear Computational Aeroelasticity Lab ACCELERATION OF A COMPUTATIONAL FLUID DYNAMICS CODE WITH GPU USING OPENACC N I C H O L S O N K. KO U K PA I Z A N P H D. C A N D I D AT E GPU Technology Conference

More information

Recent applications of overset mesh technology in SC/Tetra

Recent applications of overset mesh technology in SC/Tetra Recent applications of overset mesh technology in SC/Tetra NIA CFD Seminar October 6, 2014 Tomohiro Irie Software Cradle Co., Ltd. 1 Contents Introduction Software Cradle SC/Tetra Background of Demands

More information

Recent & Upcoming Features in STAR-CCM+ for Aerospace Applications Deryl Snyder, Ph.D.

Recent & Upcoming Features in STAR-CCM+ for Aerospace Applications Deryl Snyder, Ph.D. Recent & Upcoming Features in STAR-CCM+ for Aerospace Applications Deryl Snyder, Ph.D. Outline Introduction Aerospace Applications Summary New Capabilities for Aerospace Continuity Convergence Accelerator

More information

CDA Workshop Physical & Numerical Hydraulic Modelling. STAR-CCM+ Presentation

CDA Workshop Physical & Numerical Hydraulic Modelling. STAR-CCM+ Presentation CDA Workshop Physical & Numerical Hydraulic Modelling STAR-CCM+ Presentation ENGINEERING SIMULATION CFD FEA Mission Increase the competitiveness of companies through optimization of their product development

More information

Ansys Fluent R Michele Andreoli

Ansys Fluent R Michele Andreoli Ansys Fluent R 17.0 Michele Andreoli (m.andreoli@enginsoft.it) Table of contents User Interface Fluent Meshing Solver Numerics New features Innovative Solutions New User Interface: Ribbon-Driven Solver

More information

ANSYS High. Computing. User Group CAE Associates

ANSYS High. Computing. User Group CAE Associates ANSYS High Performance Computing User Group 010 010 CAE Associates Parallel Processing in ANSYS ANSYS offers two parallel processing methods: Shared-memory ANSYS: Shared-memory ANSYS uses the sharedmemory

More information

DNV GL s 16th Technology Week

DNV GL s 16th Technology Week OIL & GAS DNV GL s 16th Technology Week Advanced Simulation for Offshore Application: Application of CFD for Computing VIM of Floating Structures 1 SAFER, SMARTER, GREENER OUTLINE Introduction Elements

More information

Reproducibility of Complex Turbulent Flow Using Commercially-Available CFD Software

Reproducibility of Complex Turbulent Flow Using Commercially-Available CFD Software Reports of Research Institute for Applied Mechanics, Kyushu University, No.150 (60-70) March 2016 Reproducibility of Complex Turbulent Flow Using Commercially-Available CFD Software Report 2: For the Case

More information

HPC Usage for Aerodynamic Flow Computation with Different Levels of Detail

HPC Usage for Aerodynamic Flow Computation with Different Levels of Detail DLR.de Folie 1 HPCN-Workshop 14./15. Mai 2018 HPC Usage for Aerodynamic Flow Computation with Different Levels of Detail Cornelia Grabe, Marco Burnazzi, Axel Probst, Silvia Probst DLR, Institute of Aerodynamics

More information

High-level Abstraction for Block Structured Applications: A lattice Boltzmann Exploration

High-level Abstraction for Block Structured Applications: A lattice Boltzmann Exploration High-level Abstraction for Block Structured Applications: A lattice Boltzmann Exploration Jianping Meng, Xiao-Jun Gu, David R. Emerson, Gihan Mudalige, István Reguly and Mike B Giles Scientific Computing

More information

Reproducibility of Complex Turbulent Flow Using Commercially-Available CFD Software

Reproducibility of Complex Turbulent Flow Using Commercially-Available CFD Software Reports of Research Institute for Applied Mechanics, Kyushu University No.150 (71 83) March 2016 Reproducibility of Complex Turbulent Flow Using Commercially-Available CFD Software Report 3: For the Case

More information

OzenCloud Case Studies

OzenCloud Case Studies OzenCloud Case Studies Case Studies, April 20, 2015 ANSYS in the Cloud Case Studies: Aerodynamics & fluttering study on an aircraft wing using fluid structure interaction 1 Powered by UberCloud http://www.theubercloud.com

More information

The State of Accelerated Applications. Michael Feldman

The State of Accelerated Applications. Michael Feldman The State of Accelerated Applications Michael Feldman Accelerator Market in HPC Nearly half of all new HPC systems deployed incorporate accelerators Accelerator hardware performance has been advancing

More information

GPU COMPUTING WITH MSC NASTRAN 2013

GPU COMPUTING WITH MSC NASTRAN 2013 SESSION TITLE WILL BE COMPLETED BY MSC SOFTWARE GPU COMPUTING WITH MSC NASTRAN 2013 Srinivas Kodiyalam, NVIDIA, Santa Clara, USA THEME Accelerated computing with GPUs SUMMARY Current trends in HPC (High

More information

Validation of an Unstructured Overset Mesh Method for CFD Analysis of Store Separation D. Snyder presented by R. Fitzsimmons

Validation of an Unstructured Overset Mesh Method for CFD Analysis of Store Separation D. Snyder presented by R. Fitzsimmons Validation of an Unstructured Overset Mesh Method for CFD Analysis of Store Separation D. Snyder presented by R. Fitzsimmons Stores Separation Introduction Flight Test Expensive, high-risk, sometimes catastrophic

More information

NVIDIA Update and Directions on GPU Acceleration for Earth System Models

NVIDIA Update and Directions on GPU Acceleration for Earth System Models NVIDIA Update and Directions on GPU Acceleration for Earth System Models Stan Posey, HPC Program Manager, ESM and CFD, NVIDIA, Santa Clara, CA, USA Carl Ponder, PhD, Applications Software Engineer, NVIDIA,

More information

Contents. 1 CoreTech System Co., Ltd.

Contents. 1 CoreTech System Co., Ltd. Contents Advanced Support for Intelligent Workflow Improved User Interface 2 Expanded Gate Types.. 2 Enhanced Runner Wizard. 2 Customized Cooling Channel Templates. 3 Parameterized Mesh Generator... 3

More information

Second Symposium on Hybrid RANS-LES Methods, 17/18 June 2007

Second Symposium on Hybrid RANS-LES Methods, 17/18 June 2007 1 Zonal-Detached Eddy Simulation of Transonic Buffet on a Civil Aircraft Type Configuration V.BRUNET and S.DECK Applied Aerodynamics Department The Buffet Phenomenon Aircraft in transonic conditions Self-sustained

More information

INNOVATIVE CFD FOR SUPER-COMPUTER RESULTS ON YOUR DESKTOP

INNOVATIVE CFD FOR SUPER-COMPUTER RESULTS ON YOUR DESKTOP INNOVATIVE CFD FOR SUPER-COMPUTER RESULTS ON YOUR DESKTOP XFlow is a next generation CFD software that uses a proprietary, particle-based, meshless approach which can easily handle traditionally complex

More information

The Effect of In-Network Computing-Capable Interconnects on the Scalability of CAE Simulations

The Effect of In-Network Computing-Capable Interconnects on the Scalability of CAE Simulations The Effect of In-Network Computing-Capable Interconnects on the Scalability of CAE Simulations Ophir Maor HPC Advisory Council ophir@hpcadvisorycouncil.com The HPC-AI Advisory Council World-wide HPC non-profit

More information

Accelerating Implicit LS-DYNA with GPU

Accelerating Implicit LS-DYNA with GPU Accelerating Implicit LS-DYNA with GPU Yih-Yih Lin Hewlett-Packard Company Abstract A major hindrance to the widespread use of Implicit LS-DYNA is its high compute cost. This paper will show modern GPU,

More information

JSAE Benchmark of Automotive Aerodynamic Test Measurements

JSAE Benchmark of Automotive Aerodynamic Test Measurements Automotive JSAE Benchmark of Automotive Aerodynamic Test Measurements Ahmed-Type Car Body Versus CFD Software Predictions By Boris Marovic, Automotive Industry Manager, Mentor Graphics T he Society of

More information

Challenges Simulating Real Fuel Combustion Kinetics: The Role of GPUs

Challenges Simulating Real Fuel Combustion Kinetics: The Role of GPUs Challenges Simulating Real Fuel Combustion Kinetics: The Role of GPUs M. J. McNenly and R. A. Whitesides GPU Technology Conference March 27, 2014 San Jose, CA LLNL-PRES-652254! This work performed under

More information

MSC Nastran Explicit Nonlinear (SOL 700) on Advanced SGI Architectures

MSC Nastran Explicit Nonlinear (SOL 700) on Advanced SGI Architectures MSC Nastran Explicit Nonlinear (SOL 700) on Advanced SGI Architectures Presented By: Dr. Olivier Schreiber, Application Engineering, SGI Walter Schrauwen, Senior Engineer, Finite Element Development, MSC

More information

Speed and Accuracy of CFD: Achieving Both Successfully ANSYS UK S.A.Silvester

Speed and Accuracy of CFD: Achieving Both Successfully ANSYS UK S.A.Silvester Speed and Accuracy of CFD: Achieving Both Successfully ANSYS UK S.A.Silvester 2010 ANSYS, Inc. All rights reserved. 1 ANSYS, Inc. Proprietary Content ANSYS CFD Introduction ANSYS, the company Simulation

More information

EnSight October 2013 Japan EnSight User Meeting Tokyo, Japan. Darin McKinnis VP Sales and Marketing

EnSight October 2013 Japan EnSight User Meeting Tokyo, Japan. Darin McKinnis VP Sales and Marketing EnSight 10.1 October 2013 Japan EnSight User Meeting Tokyo, Japan Darin McKinnis VP Sales and Marketing darin@ceisoftware.com Business Reports 2012 was our best year in company history > 95% renewal rate

More information

A Comprehensive Study on the Performance of Implicit LS-DYNA

A Comprehensive Study on the Performance of Implicit LS-DYNA 12 th International LS-DYNA Users Conference Computing Technologies(4) A Comprehensive Study on the Performance of Implicit LS-DYNA Yih-Yih Lin Hewlett-Packard Company Abstract This work addresses four

More information

FRAUNHOFER INSTITUTE FOR ALGORITHMS AND SCIENTIFIC COMPUTING SCAI

FRAUNHOFER INSTITUTE FOR ALGORITHMS AND SCIENTIFIC COMPUTING SCAI FRAUNHOFER INSTITUTE FOR ALGORITHMS AND SCIENTIFIC COMPUTING SCAI MpCCI 4.5.2 Release Notes MpCCI 4.5.2 GUI Improve the start and error management of simulation codes in MpCCI GUI. Enhanced rules lists

More information

FUSION1200 Scalable x86 SMP System

FUSION1200 Scalable x86 SMP System FUSION1200 Scalable x86 SMP System Introduction Life Sciences Departmental System Manufacturing (CAE) Departmental System Competitive Analysis: IBM x3950 Competitive Analysis: SUN x4600 / SUN x4600 M2

More information

Scalable x86 SMP Server FUSION1200

Scalable x86 SMP Server FUSION1200 Scalable x86 SMP Server FUSION1200 Challenges Scaling compute-power is either Complex (scale-out / clusters) or Expensive (scale-up / SMP) Scale-out - Clusters Requires advanced IT skills / know-how (high

More information

Fundamentals of Modeling with Simcenter 3D Robin Boeykens

Fundamentals of Modeling with Simcenter 3D Robin Boeykens Fundamentals of Modeling with Simcenter 3D Robin Boeykens robin.boeykens@siemens.com Realize innovation. 3D CAE for the digital twin Simcenter 3D Page 2 Simcenter 3D Engineering Desktop Simcenter 3D Engineering

More information

Adjoint Solver Workshop

Adjoint Solver Workshop Adjoint Solver Workshop Why is an Adjoint Solver useful? Design and manufacture for better performance: e.g. airfoil, combustor, rotor blade, ducts, body shape, etc. by optimising a certain characteristic

More information

Aerodynamics of a hi-performance vehicle: a parallel computing application inside the Hi-ZEV project

Aerodynamics of a hi-performance vehicle: a parallel computing application inside the Hi-ZEV project Workshop HPC enabling of OpenFOAM for CFD applications Aerodynamics of a hi-performance vehicle: a parallel computing application inside the Hi-ZEV project A. De Maio (1), V. Krastev (2), P. Lanucara (3),

More information

Click to edit Master title style

Click to edit Master title style Click to edit Master title style LES LES Applications for for Internal Internal Combustion Engines Engines David Gosman & Richard Johns CD-adapco, June 2011 Some Qs and As Why would we use LES calculations

More information

Porting Scalable Parallel CFD Application HiFUN on NVIDIA GPU

Porting Scalable Parallel CFD Application HiFUN on NVIDIA GPU Porting Scalable Parallel CFD Application NVIDIA D. V., N. Munikrishna, Nikhil Vijay Shende 1 N. Balakrishnan 2 Thejaswi Rao 3 1. S & I Engineering Solutions Pvt. Ltd., Bangalore, India 2. Aerospace Engineering,

More information

High Performance Computing

High Performance Computing High Performance Computing ADVANCED SCIENTIFIC COMPUTING Dr. Ing. Morris Riedel Adjunct Associated Professor School of Engineering and Natural Sciences, University of Iceland Research Group Leader, Juelich

More information

S0432 NEW IDEAS FOR MASSIVELY PARALLEL PRECONDITIONERS

S0432 NEW IDEAS FOR MASSIVELY PARALLEL PRECONDITIONERS S0432 NEW IDEAS FOR MASSIVELY PARALLEL PRECONDITIONERS John R Appleyard Jeremy D Appleyard Polyhedron Software with acknowledgements to Mark A Wakefield Garf Bowen Schlumberger Outline of Talk Reservoir

More information

TAU mesh deformation. Thomas Gerhold

TAU mesh deformation. Thomas Gerhold TAU mesh deformation Thomas Gerhold The parallel mesh deformation of the DLR TAU-Code Introduction Mesh deformation method & Parallelization Results & Applications Conclusion & Outlook Introduction CFD

More information

Efficient Multi-GPU CUDA Linear Solvers for OpenFOAM

Efficient Multi-GPU CUDA Linear Solvers for OpenFOAM Efficient Multi-GPU CUDA Linear Solvers for OpenFOAM Alexander Monakov, amonakov@ispras.ru Institute for System Programming of Russian Academy of Sciences March 20, 2013 1 / 17 Problem Statement In OpenFOAM,

More information

Algorithms, System and Data Centre Optimisation for Energy Efficient HPC

Algorithms, System and Data Centre Optimisation for Energy Efficient HPC 2015-09-14 Algorithms, System and Data Centre Optimisation for Energy Efficient HPC Vincent Heuveline URZ Computing Centre of Heidelberg University EMCL Engineering Mathematics and Computing Lab 1 Energy

More information

A Hybrid Grid Compressible Flow Solver for Large-Scale Supersonic Jet Noise Simulations on Multi-GPU Clusters

A Hybrid Grid Compressible Flow Solver for Large-Scale Supersonic Jet Noise Simulations on Multi-GPU Clusters 50th AIAA Aerospace Sciences Meeting including the New Horizons Forum and Aerospace Exposition 09-12 January 2012, Nashville, Tennessee AIAA 2012-0564 A Hybrid Grid Compressible Flow Solver for Large-Scale

More information

Dynamic Mode Decomposition analysis of flow fields from CFD Simulations

Dynamic Mode Decomposition analysis of flow fields from CFD Simulations Dynamic Mode Decomposition analysis of flow fields from CFD Simulations Technische Universität München Thomas Indinger Lukas Haag, Daiki Matsumoto, Christoph Niedermeier in collaboration with Agenda Motivation

More information

XFlow HIGH FIDELITY COMPUTATIONAL FLUID DYNAMICS

XFlow HIGH FIDELITY COMPUTATIONAL FLUID DYNAMICS XFlow HIGH FIDELITY COMPUTATIONAL FLUID DYNAMICS XFlow OVERVIEW In the traditional mesh-based approach to solving Computational Fluid Dynamics (CFD) problems, reliability is highly dependent on the quality

More information

Krishnan Suresh Associate Professor Mechanical Engineering

Krishnan Suresh Associate Professor Mechanical Engineering Large Scale FEA on the GPU Krishnan Suresh Associate Professor Mechanical Engineering High-Performance Trick Computations (i.e., 3.4*1.22): essentially free Memory access determines speed of code Pick

More information