Particleworks: Particle-based CAE Software fully ported to GPU

Similar documents
Two-Phase flows on massively parallel multi-gpu clusters

ANSYS Improvements to Engineering Productivity with HPC and GPU-Accelerated Simulation

Applications of ICFD /SPH Solvers by LS-DYNA to Solve Water Splashing Impact to Automobile Body. Abstract

Maximize automotive simulation productivity with ANSYS HPC and NVIDIA GPUs

Example Simulations in OpenFOAM

The Cray CX1 puts massive power and flexibility right where you need it in your workgroup

Twin screw extrusion Image courtesy of Japan Steel Works, Ltd.

Solution Recording and Playback: Vortex Shedding

ANSYS HPC Technology Leadership

HPC and IT Issues Session Agenda. Deployment of Simulation (Trends and Issues Impacting IT) Mapping HPC to Performance (Scaling, Technology Advances)

Coupled Analysis of FSI

Express Introductory Training in ANSYS Fluent Workshop 07 Tank Flushing

Speedup Altair RADIOSS Solvers Using NVIDIA GPU

CFD in COMSOL Multiphysics

Team 194: Aerodynamic Study of Airflow around an Airfoil in the EGI Cloud

High-Fidelity Simulation of Unsteady Flow Problems using a 3rd Order Hybrid MUSCL/CD scheme. A. West & D. Caraeni

ANSYS HPC. Technology Leadership. Barbara Hutchings ANSYS, Inc. September 20, 2011

Applications of ICFD solver by LS-DYNA in Automotive Fields to Solve Fluid-Solid-Interaction (FSI) Problems

Divergence-Free Smoothed Particle Hydrodynamics

Computational Fluid Dynamics PRODUCT SHEET

Investigation and Feasibility Study of Linux and Windows in the Computational Processing Power of ANSYS Software

Recent applications of overset mesh technology in SC/Tetra

Efficient Tridiagonal Solvers for ADI methods and Fluid Simulation

Axial Channel Water Jacket Cooling

S0432 NEW IDEAS FOR MASSIVELY PARALLEL PRECONDITIONERS

OzenCloud Case Studies

Tutorial: Riser Simulation Using Dense Discrete Phase Model

JSAE Benchmark of Automotive Aerodynamic Test Measurements

Simulation of Turbulent Flow around an Airfoil

Investigation of cross flow over a circular cylinder at low Re using the Immersed Boundary Method (IBM)

A new hybrid slurry CFD model compared with experimental results

Coupled Simulation of the Fluid Flow and Conjugate Heat Transfer in Press Hardening Processes

An Overview of Computational Fluid Dynamics

High performance Computing and O&G Challenges

Swapnil Nimse Project 1 Challenge #2

ACCELERATING CFD AND RESERVOIR SIMULATIONS WITH ALGEBRAIC MULTI GRID Chris Gottbrath, Nov 2016

OpenFOAM GUIDE FOR BEGINNERS

TFLOP Performance for ANSYS Mechanical

Robust Simulation of Sparsely Sampled Thin Features in SPH-Based Free Surface Flows

How to set up a 2D CFD case in LS-DYNA. Marcus Timgren DYNAmore Nordic

Object-Oriented CFD Solver Design

Adaptive-Mesh-Refinement Hydrodynamic GPU Computation in Astrophysics

Large scale Imaging on Current Many- Core Platforms

ONE DIMENSIONAL (1D) SIMULATION TOOL: GT-POWER

Advances of parallel computing. Kirill Bogachev May 2016

Tutorial: Hydrodynamics of Bubble Column Reactors

SPC 307 Aerodynamics. Lecture 1. February 10, 2018

Speed and Accuracy of CFD: Achieving Both Successfully ANSYS UK S.A.Silvester

Flow in an Intake Manifold

midas NFX 2017R1 Release Note

Free Surface Flow Simulations

Using a Single Rotating Reference Frame

Ashwin Shridhar et al. Int. Journal of Engineering Research and Applications ISSN : , Vol. 5, Issue 6, ( Part - 5) June 2015, pp.

Figure 2: Water Into Kerosene, Volume Fraction (Left) And Total Density Of Mixture (Right)

Computations of Jet Cooling

Water Sloshing Tutorial (Particleworks)

Pressure Drop Evaluation in a Pilot Plant Hydrocyclone

Accelerating Implicit LS-DYNA with GPU

Introduction to C omputational F luid Dynamics. D. Murrin

Navier-Stokes & Flow Simulation

Coastal impact of a tsunami Review of numerical models

Project 2 Solution. General Procedure for Model Setup

NVIDIA. Interacting with Particle Simulation in Maya using CUDA & Maximus. Wil Braithwaite NVIDIA Applied Engineering Digital Film

A Massively Parallel Two-Phase Solver for Incompressible Fluids on Multi-GPU Clusters

Performance of Implicit Solver Strategies on GPUs

A laboratory-dualsphysics modelling approach to support landslide-tsunami hazard assessment

A hybrid slurry CFD model: Euler-Euler to Euler-Lagrange

SPEED-UP GEARBOX SIMULATIONS BY INTEGRATING SCORG. Dr. Christine Klier, Sahand Saheb-Jahromi, Ludwig Berger*

Real Application Performance and Beyond

ANSYS High. Computing. User Group CAE Associates

Modeling Evaporating Liquid Spray

Shape optimization and active flow control for improved aerodynamic properties Siniša Krajnovic

Comparison of Classic and Finned Piston Reciprocating Linear Air Compressor Using COMSOL Multiphysics

Maximizing Memory Performance for ANSYS Simulations

NUMERICAL SIMULATION OF FALLING FILM THICKNESS FLOWING OVER HORIZONTAL TUBES

Modeling Unsteady Compressible Flow

Isotropic Porous Media Tutorial

Flow Over Cylinder. Gönül Öykü Güngör Department of Mechanical Engineering Bilkent University Ankara, Turkey

KEY STAR TECHNOLOGIES: DISPERSED MULTIPHASE FLOW AND LIQUID FILM MODELLING DAVID GOSMAN EXEC VP TECHNOLOGY, CD-adapco

Automotive Fluid-Structure Interaction (FSI) Concepts, Solutions and Applications. Laz Foley, ANSYS Inc.

Why HPC for. ANSYS Mechanical and ANSYS CFD?

CFD projects at CERN Detector cooling and beyond

ANSYS AIM 16.0 Overview. AIM Program Management

Modeling Evaporating Liquid Spray

Simulation of Turbulent Flow around an Airfoil

A Scalable GPU-Based Compressible Fluid Flow Solver for Unstructured Grids

Keywords: flows past a cylinder; detached-eddy-simulations; Spalart-Allmaras model; flow visualizations

Simulation Advances. Antenna Applications

Modeling and Simulation of Single Phase Fluid Flow and Heat Transfer in Packed Beds

Simulation and Validation of Turbulent Pipe Flows

Auto Injector Syringe. A Fluent Dynamic Mesh 1DOF Tutorial

A Comprehensive Study on the Performance of Implicit LS-DYNA

Strömningslära Fluid Dynamics. Computer laboratories using COMSOL v4.4

Optimization of an Axial Pump using CFturbo, PumpLinx & optislang

2008 International ANSYS Conference

Water. Notes. Free surface. Boundary conditions. This week: extend our 3D flow solver to full 3D water We need to add two things:

The viscous forces on the cylinder are proportional to the gradient of the velocity field at the

Tutorial 2: Particles convected with the flow along a curved pipe.

MPI Optimizations via MXM and FCA for Maximum Performance on LS-DYNA

FLUENT Secondary flow in a teacup Author: John M. Cimbala, Penn State University Latest revision: 26 January 2016

Transcription:

Particleworks: Particle-based CAE Software fully ported to GPU

Introduction PrometechVideo_v3.2.3.wmv 3.5 min.

Particleworks

Why the particle method? Existing methods FEM, FVM, FLIP, Fluid calculation points strictly depends on conservation laws. In each time step Surface capturing & tracking Mesh generation Marker creation - annihilation

Particle based approach The number of fluid calculation points does not decrease In some of our client s cases, it is difficult to solve with other methods. In each time step Surface capturing & tracking Mesh generation Marker creation - annihilation 5

Comparison with other software For pipe flow problems (no free surface), Particleworks may take longer calculation time than other mesh based simulators. Calculation Points: 28339 Calculation Time: 7728 sec. Particleworks (Particle based) Inflow: 0.2 m/s KOZO KEIKAKU ENGINEERING Inc. Calculation Points: 27440 Calculation Time: 336 sec. FloEFD (Mesh based) Length: 10 mm Diameter: 1.4 mm Time: 0.5 s

Comparison with other software Particleworks can solve free-surface problems, while most of other mesh based simulators have difficulties. (e.g. Pipe flow with a spray nozzle) Calculation Points: 42000 Calculation Time: 9000 sec. KOZO KEIKAKU ENGINEERING Inc. Particleworks (Particle based) Inflow: 0.2 m/s FloEFD (Mesh based) Difficult to simulate Length: 10 mm Diameter: 1.4 mm Time: 0.5 s

u(r)[m/s] Appendix: Hagen-Poiseuille Flow of Newtonian Fluid In Particleworks simulation on Hagen-Poiseuille flow of Newtonian fluid, the velocity distribution obtained in the model coincides well with theoretical results. Simulation model Precondition Injecting fluid into circular cylinder with 1.5 mm diameter and 15mm long KOZO KEIKAKU ENGINEERING Inc. Result The maximum velocity in the centre of cylinder is coincided well with the theoretical results. The decrease trend of velocity towards the edge of cylinder is also well-coincided. Theoretical velocity distribution u(r) u r u 3n 1 n 1 0 1 r R Setting u 0 =0.1m/s in this model n 1 n Simulation result Particle distance 1.0 10-4 m Time interval Kinematic viscosity coefficient Analysis model image 5.0 10-5 s 1.5 10-6 m 2 / s Comparison of velocity distribution *Particle colour indicates velocity Red line shows theoretical figures and yellow dots show velocity of each particle 0.25 0.2 0.15 0.1 0.05 0-7.5E-04-5.0E-04-2.5E-04 5.0E-19 2.5E-04 5.0E-04 7.5E-04 *Measures velocity of particles r[m] located between 10 13mm from inlet

Particleworks v3.0.2 Demo Particleworks.cmd 3 min.

Particleworks Spec. Sheet Computational Fluid Dynamics Incompressible fluid Weakly compressible gas Free surface flow Non-Newtonian fluid Applicable Phenomena Newtonian and non-newtonian flow Laminar and turbulent flow Thermal conduction Moving boundary Fluid - Rigid coupling Fluid - Powder coupling Surface tension Physics Models Based on MPS Pressure calculation (Implicit / Explicit) Viscosity calculation (Implicit / Explicit) Turbulent model: RANS + LES 2D analysis Calculation technique SMP and MPP GPU Tsunami Drain water fall Back step flow

Next update We plan to release ver. 3.1 June, 2012 Features One-way coupling with existing air flow software Pressure boundary condition applied to free surface Python JIT for customizing non-newtonian solver DEM module ported to GPU And more

Applications

Automotive industry Particle simulation is mainly applied to oil and water behavior. 1. Engine crank case 2. Oil jet simulation in cylinder [Other Application] Gas tank sloshing Auto painting Water immersion etc. 3. Power-train transmission 4. Mud & water splashing

Chemical industry Particle simulation is used for highly viscous fluid and powder simulation, especially for material mixing purpose. Twin extruders Mixer From JAPAN STEEL WORKS, Ltd

About our GPU solver

CPU vs GPU Speed-ups Particleworks ver. 3.0 (2011) Fully ported to GPU CPU: Intel Core i7 920 4 Cores GPU: NVIDIA Tesla C2070 8.0-0.3 M particles 6.8 7.4 7.9 6.0 4.0 2.0 1.0 1.0 1.0 0.0 Japan Steel Works, Ltd.

Energy consumption comparison for 1 calculation job Relative Energy Consumption 100% 80% 60% 40% 20% 100% 45% Hardware PC:MAS-i7WS (G-DEP) OS:Windows HPC server 2008r2 CPU:Intel Core i7 X990 @3.47GHz GPU:NVIDIA C2075 Main memory:24gb Simulation settings 1.46 M particles Simulation time: 1.0[s] 0% CPU 6 cores CPU 1 core GPU 1card

Speed-ups CPUs vs GPUs CPU (Xeon X5637 dual socket) vs. GPU (M2090) comparison results. CPU GPU 5.0 4.0 3.0 3.0 4.0 5.0 4.0 3.0 2.4 3.4 3.9 2.0 1.7 2.0 1.0 1.0 1.0 1.0 0.0 Intel Xeon 6 cores 1 (12 cores) 2 (24 cores) 3 (36 cores) 0.0 Intel Xeon 6 cores 1 2 3 # of Nodes (with 2 CPUs) # of Cards CPU: Intel Xeon X5637 (6 cores) @3.06GHz GPU: NVIDIA M2090

Porting resource, schedule & strategy Motivation High performance gain reasonably Goal Full GPU porting Collision detection, Physics models & Iterative solver. Resource 2 persons (Full time) + 2 students (1 day / week) Schedule 2010/5 start 2011/2 release Strategy Test driven We needed CPU porting before GPU porting

OpenCL : JIT Customize Function (Prototype & In-house) Motivation Marketing sales & Consulting Particle interaction customization Goal JIT customization function Using OpenCL Resource 1 person (1 week) Strategy User can use these Particleworks variables at OpenCL kernel Buffer Function Parameter 20

Our GPU Cluster

Linux Spec. 4 nodes 8 CPUs + 12 GPUs 192 GB RAM Infiniband QDR dual port 5000W PSU CentOS

Linux Spec. / Compute node CPU x 2 Intel Xeon X5685/3.06GHz/6cores 48GB RAM GPU x 3 NVIDIA M2090 512cores 6GB DRAM Infiniband x 2 QDR 40 Gbps Power Supply Unit 1000W

Software configuration Client node Head node Compute nodes Shared directory License Solver modules Input data Output data

Results: Mixing About 10M particles 12 GPUs Calculation time : 100 hours 30 min. Simulation time : 1 sec.

Results: TSUNAMI About 5 M particles Simulation time: 10 min. Calculation time: GPU: 60 hours 2 min. CPU: 82 hours 29 min. 0min. 6min. 12min. 33min. 26min. 21min. 4.3 km 30 m 38min. 42min. 60min. 3.1 km

Thank you