Recent Advances in ANSYS Toward RDO Practices Using optislang. Wim Slagter, ANSYS Inc. Herbert Güttler, MicroConsult GmbH

Similar documents
TFLOP Performance for ANSYS Mechanical

Maximize automotive simulation productivity with ANSYS HPC and NVIDIA GPUs

ANSYS Improvements to Engineering Productivity with HPC and GPU-Accelerated Simulation

Solving Large Complex Problems. Efficient and Smart Solutions for Large Models

Design Exploration and Robust Design. Judd Kaiser Product Manager, ANSYS Workbench Platform

ANSYS HPC Technology Leadership

HPC and IT Issues Session Agenda. Deployment of Simulation (Trends and Issues Impacting IT) Mapping HPC to Performance (Scaling, Technology Advances)

ANSYS HPC. Technology Leadership. Barbara Hutchings ANSYS, Inc. September 20, 2011

HPC and IT Issues Session Agenda. Deployment of Simulation (Trends and Issues Impacting IT) Mapping HPC to Performance (Scaling, Technology Advances)

Why HPC for. ANSYS Mechanical and ANSYS CFD?

The Cray CX1 puts massive power and flexibility right where you need it in your workgroup

TECHNICAL GUIDELINES FOR APPLICANTS TO PRACE 13 th CALL (T ier-0)

DYNARDO Dynardo GmbH Technology update. optislang v4.1. Robust Design Optimization. Johannes Will Dynardo GmbH

Faster Innovation - Accelerating SIMULIA Abaqus Simulations with NVIDIA GPUs. Baskar Rajagopalan Accelerated Computing, NVIDIA

Understanding Hardware Selection to Speedup Your CFD and FEA Simulations

ACCELERATING CFD AND RESERVOIR SIMULATIONS WITH ALGEBRAIC MULTI GRID Chris Gottbrath, Nov 2016

optislang recent developments

Computer Aided Engineering with Today's Multicore, InfiniBand-Based Clusters ANSYS, Inc. All rights reserved. 1 ANSYS, Inc.

Accelerated ANSYS Fluent: Algebraic Multigrid on a GPU. Robert Strzodka NVAMG Project Lead

HETEROGENEOUS HPC, ARCHITECTURAL OPTIMIZATION, AND NVLINK STEVE OBERLIN CTO, TESLA ACCELERATED COMPUTING NVIDIA

Enhancing Analysis-Based Design with Quad-Core Intel Xeon Processor-Based Workstations

Engineers can be significantly more productive when ANSYS Mechanical runs on CPUs with a high core count. Executive Summary

Introduction to Workbench Scripting & Customization ANSYS, Inc. November 29, 2012

Stan Posey, CAE Industry Development NVIDIA, Santa Clara, CA, USA

Parametric. Practices. Patrick Cunningham. CAE Associates Inc. and ANSYS Inc. Proprietary 2012 CAE Associates Inc. and ANSYS Inc. All rights reserved.

Structural Mechanics With ANSYS Pierre THIEFFRY Lead Product Manager ANSYS, Inc.

Mesh Morphing and the Adjoint Solver in ANSYS R14.0. Simon Pereira Laz Foley

Optimization of an Axial Pump using CFturbo, PumpLinx & optislang

LBRN - HPC systems : CCT, LSU

DYNARDO Dynardo GmbH CFD Examples. Dr.-Ing. Johannes Will President Dynardo GmbH

Accelerating Implicit LS-DYNA with GPU

Making Supercomputing More Available and Accessible Windows HPC Server 2008 R2 Beta 2 Microsoft High Performance Computing April, 2010

Webinar optislang & ANSYS Workbench. Dynardo GmbH

Agenda. 9:00 Welcome. 1:00 - Computing Utilities. 9:15 - Mechanical Demonstration. 1:30 - CFD Update. 3:00 Break 3:15 ANSYS Customization Toolkit

HPC Architectures. Types of resource currently in use

Optimising the Mantevo benchmark suite for multi- and many-core architectures

HA-PACS/TCA: Tightly Coupled Accelerators for Low-Latency Communication between GPUs

Fast Radial Basis Functions for Engineering Applications. Prof. Marco Evangelos Biancolini University of Rome Tor Vergata

Speedup Altair RADIOSS Solvers Using NVIDIA GPU

GPU-Acceleration of CAE Simulations. Bhushan Desam NVIDIA Corporation

Managing CAE Simulation Workloads in Cluster Environments

Agenda. 9:00 Welcome. 1:00 - Computing Utilities. 9:15 - Mechanical Demonstration. 1:30 - CFD Update. 3:00 Break 3:15 ANSYS Customization Toolkit

HARNESSING IRREGULAR PARALLELISM: A CASE STUDY ON UNSTRUCTURED MESHES. Cliff Woolley, NVIDIA

Why Supercomputing Partnerships Matter for CFD Simulations

OzenCloud Case Studies

Optimate CFD Evaluation Optimate Glider Optimization Case

ANSYS AIM 16.0 Overview. AIM Program Management

New developments in Statistics on Structures. Sebastian Wolff

Shape optimisation using breakthrough technologies

Pump Modeler Template Documentation

Thread and Data parallelism in CPUs - will GPUs become obsolete?

Tightly Coupled Accelerators Architecture

2008 International ANSYS Conference

Analysis of low cycle fatigue considering geometric manufacturing tolerances

MELLANOX EDR UPDATE & GPUDIRECT MELLANOX SR. SE 정연구

RECENT TRENDS IN GPU ARCHITECTURES. Perspectives of GPU computing in Science, 26 th Sept 2016

SIMPLIFYING HPC SIMPLIFYING HPC FOR ENGINEERING SIMULATION WITH ANSYS

System Design of Kepler Based HPC Solutions. Saeed Iqbal, Shawn Gao and Kevin Tubbs HPC Global Solutions Engineering.

Improving Your Structural Mechanics Simulations with Release 14.0

14MMFD-34 Parallel Efficiency and Algorithmic Optimality in Reservoir Simulation on GPUs

ANSYS Fluent 14 Performance Benchmark and Profiling. October 2012

Illinois Proposal Considerations Greg Bauer

Dell EMC Ready Bundle for HPC Digital Manufacturing Dassault Systѐmes Simulia Abaqus Performance

How то Use HPC Resources Efficiently by a Message Oriented Framework.

CAD-parametric optimization with optislang-ansys workbench

MSC Nastran Explicit Nonlinear (SOL 700) on Advanced SGI Architectures

ANSYS 14.0 Geometry and Meshing Update Steve Varnam ANSYS UK Ltd.

Gateways to Discovery: Cyberinfrastructure for the Long Tail of Science

Large scale Imaging on Current Many- Core Platforms

Applica'ons So-ware Example

Technical guide. Windows HPC server 2016 for LS-DYNA How to setup. Reference system setup - v1.0

Headline in Arial Bold 30pt. Visualisation using the Grid Jeff Adie Principal Systems Engineer, SAPK July 2008

Commercial Implementations of Optimization Software and its Application to Fluid Dynamics Problems

University at Buffalo Center for Computational Research

Introduction to ANSYS DesignXplorer

Big Data Analytics Performance for Large Out-Of- Core Matrix Solvers on Advanced Hybrid Architectures

HPC Enabling R&D at Philip Morris International

Tiny GPU Cluster for Big Spatial Data: A Preliminary Performance Evaluation

TECHNICAL GUIDELINES FOR APPLICANTS TO PRACE 16 th CALL (T ier-0)

AN INTRODUCTION TO CLUSTER COMPUTING

Topology-Optimization for internal Fluid Flow: TOSCA Fluid FE-DESIGN the optimization company

TECHNICAL GUIDELINES FOR APPLICANTS TO PRACE 11th CALL (T ier-0)

Vectorisation and Portable Programming using OpenCL

CST STUDIO SUITE R Supported GPU Hardware

Simulation Advances for RF, Microwave and Antenna Applications

Intel Many Integrated Core (MIC) Matt Kelly & Ryan Rawlins

The Stampede is Coming: A New Petascale Resource for the Open Science Community

GPU computing at RZG overview & some early performance results. Markus Rampp

ANSYS Fluid Structure Interaction for Thermal Management and Aeroelasticity

ANSYS Workbench as an Integration Platform for CAE Simulation

Performance Optimizations via Connect-IB and Dynamically Connected Transport Service for Maximum Performance on LS-DYNA

Dell EMC Ready Bundle for HPC Digital Manufacturing ANSYS Performance

Simulation Advances. Antenna Applications

Aerodynamics of a hi-performance vehicle: a parallel computing application inside the Hi-ZEV project

APENet: LQCD clusters a la APE

New Technologies in CST STUDIO SUITE CST COMPUTER SIMULATION TECHNOLOGY

An advanced RBF Morph application: coupled CFD-CSM Aeroelastic Analysis of a Full Aircraft Model and Comparison to Experimental Data

Topology Optimization in Fluid Dynamics

HPC-CINECA infrastructure: The New Marconi System. HPC methods for Computational Fluid Dynamics and Astrophysics Giorgio Amati,

The determination of the correct

Transcription:

Recent Advances in ANSYS Toward RDO Practices Using optislang Wim Slagter, ANSYS Inc. Herbert Güttler, MicroConsult GmbH 1

Product Development Pressures Source: Engineering Simulation & HPC Usage Survey with over 1,800 ANSYS respondents (Feb 2013) 2

The Cost of Being Wrong The cost of failure has never been so high, even for successful companies Source: The Detroit News, April 27, 2013 3

Practices toward Robust Design Best in Class 4 Practice Maturity Level Using six sigma and robust design optimization analysis; seeking a design with a probabilistic goal Using proprietary or third-party design optimization algorithms or tools Integrated system design and optimization of hardware, electronics and software Deploying adjoint solver techniques 3 Simultaneous execution of automated updates of multiple design points for design optimization studies Established job scheduling strategy for optimized use of both local and remote hardware Multi-goal analyses with multiple design input parameters 2 Multiple physics, design point analysis for conceptual design studies Parameterized models for what-if analyses, with automation Input/output parameter relationship based on design exploration tools 1 Single analysis for validation purposes Manual adjustment of design parameters Single physics 4 Beginner Increasing impact on product integrity

Challenges to Adopt Robust Design Practices Full series of simulations takes too long Difficult to build a parametric geometry or mesh Lack of simulation resources (hw/sw) Difficult to string various tools together Which parameters are relevant Baseline model is difficult to solve Lack of faith in simulation Algorithm choices Other Difficult to understand results My design is not suitable for optimization 0 5 10 15 20 25 30 35 40 Source: ANSYS Survey, Q1 2011 5

Recent Advances to Overcome Top Challenges 6

Reduced Time to Insight 7

Software Performance Faster startup, geometry import, meshing, solving, parallel, Rating 1000 4000 900 3500 800 700 3000 600 500 2500 400 2000 300 200 1500 100 1000 0 500 Shared Topology Time (min) 0 ANSYS Fluent 15.0 using GPU s Time (Min) 4 3 Tube Bundle 32x Faster! R14.5 R15.0 2 0 Swept Meshing 14.5 15 13.0.0 14.0.0 15.0.0 68% Faster! 0 2048 4096 6144 8192 1024012288 1 Number of Cores 111 million cell (truck) model Pre-release results Scalable at ~10K cells per core! 8

optislang inside ANSYS Workbench optislang modules Sensitivity, Optimization and Robustness are directly available in ANSYS Workbench Easy to use: Minimize user input Best practice default modules Pre-defined post processing modes 9

Some Recent optislang/workbench Updates Interrupt, save, send & continue: If needed stop your analysis, save Workbench, and continue analysis later Recalculate Failed Design Points: Restart when design evaluations may fail 10

Building a Parametric Model 11

Parametric Model Parametric CAD Bi-directional CAD interfaces CAD Model Workbench Model 12 Workbench is a Parametric and Persistent platform Parameterize with just a click

What if Your Model is Dead? Use SpaceClaim to easily create parameters from neutral files STEP, IGES, Parasolid, ACIS, etc. 13 Use Mesh Morphing to modify geometry without parameters

Mesh Morphing Adjust the Mesh for each design variation! 14

Smart Optimization with the Adjoint Solver The Adjoint solver directly computes a more optimal shape depending upon the optimization goal The Adjoint solver directly predicts the gain in performance The mesh is morphed to the more optimal shape specified by the Adjoint solver Iteration 1 DP = -232.8 Expect change 10.0 Iteration 2 Actual change 9.0 DP = -223.8 Expect change 8.9 Iteration 3 Actual change 6.9 DP = -216.9 Expect change 7.0 Iteration 4 Actual change 3.1 DP = -213.8 Goal: Minimizing Pressure Drop Total improvement of 8% 15

Fast and Affordable Design Studies 16

HPC Hardware/Software Partnerships with IT industry leaders, ensuring optimized HPC performance, a roadmap to the future, and wrap-around support ANSYS and Intel 60% speed-up on Xeon E5-serie processors; ANSYS Mechanical 15.0 is the 1 st release on Intel Xeon Phi ANSYS and NVIDIA GPU acceleration of ANSYS Mechanical and Fluent; AMG solver of ANSYS Fluent 15.0 will support GPU s ANSYS and HP Benchmarking, HPC Best Practices 17 HPC performance optimizes the utilization of licenses, hardware, and people

Sequential Design Point Update Serial Queue dp1 dp2 dp3 Serial queues can be time prohibitive Dpn 18

Simultaneous Design Point Update dp1 Project on client dp2 Simultaneous Solve can dramatically reduce time to insight RSM Remote Solve Manager dp3 Dpn 19

Simultaneous Design Point Update dp1 Project on client dp2 Simultaneous License usage can be cost prohibitive But! RSM License Server dp3 Dpn 20

HPC Parametric Packs Project on client Rapid and affordable updates dp1 Number of Simultaneous Design Points Enabled 64 RSM dp2 32 dp3 21 16 8 4 1 2 3 4 5 Number 2013 ANSYS, of HPC Inc. Parametric November 22, Pack 2013 Licenses Dpn License Server With HPC Parametric Packs

Advances in Workbench R15.0 - For Managing Large Number of Design Configurations RSM Enhancements Improved efficiency of RSM Design Point updates Improved robustness and scalability Added support for Univa Grid Engine Added component override for design point update Added support for Mechanical/MAPDL restart Non-root users on Linux can now use RSM wizard Enriched support for RSM customization 22

Advances in Workbench R15.0 - Enriched Project Report Content Projects with Design Points will include a sub-report for each Design Point Link shown in Report column in the Design Points table in the main report Get detailed results for every Design Point! 23

Examples Using ANSYS Workbench 14.5.7 and optislang 4.0.6 on a HPC Cluster Herbert Güttler, MicroConsult 24

Tools (Hardware: Oct 2013) 160 E5 V2 Ivy Bridge cores @ 3.0 GHz 304 E5 Sandy Bridge cores @2.9 GHz 6..16 GB / core RAM (4,0 TB Total) Accelerators: 22 Fermi M207x, 10 Kepler K20x 2 Xeon Phi 7210P Peak Performance ANSYS single job: 3.1 TFLOPs accumulated / 24 Jobs: 10 TFLOPs Infiniband interconnect Compute servers SSD only Remote Access: 3x HP-RGS SLES 11 SP02 for compute nodes Closed loop aircooled rack MicroConsult H. Güttler, 22.11.2013, page 25

Numerical Effort for a random selection of MCE Projects ANSYS MAPDL, sparse solver 260 s on a 1 TFLOPs machine 40 s on a 2 TFLOPs machine Source: AnandTech MicroConsult H. Güttler, 22.11.2013, page 26

Performance Results MicroConsult H. Güttler, 22.11.2013, page 27

Benchmarking (ANSYS Mechanical) ideal scaling MicroConsult H. Güttler, 22.11.2013, page 28

Benchmarking (ANSYS Fluent) 4000 3500 3000 2500 Rating 2000 1500 13.0.0 14.0.0 15.0.0 1000 500 0 0 2048 4096 6144 8192 10240 12288 Number of Cores MicroConsult H. Güttler, 22.11.2013, page 29

Essentials: Performance is very case dependent Looking at DOFs won t tell you much about the actual performance GPUs accelerate numbercrunching Scaling for ANSYS Mechanical is much different compared to CFD A cluster can run a single big job or many small jobs Optimization requires solving many designs Many design require many licenses With R14.5 came HPC Parametric Pack licenses (license multipliers) HPC Parametric Pack licensing works only via Workbench Design Points MicroConsult H. Güttler, 22.11.2013, page 30

How it s done No feature to activate GPUs MicroConsult H. Güttler, 22.11.2013, page 31

How it s done ANSYS DesignModeler Licenses cannot be multiplied MicroConsult H. Güttler, 22.11.2013, page 32

Cases used for benchmarking Power Window Actuator: 6 bodies, 15 contacts, 3.3 MDOF, 18 TFLOP / iteration Sensitivity study, Uses Geometry Updates Mountain Bike Frame: 1 body, no contacts, 2.1 MDOF, 0.8 TFLOP / iteration Sensitivity study, Uses Geometry Updates Beam in Bending: 1 body, no contacts, 4.0 MDOF, 88 TFLOP / iteration Sensitivity study, No Geometry Updates MicroConsult H. Güttler, 22.11.2013, page 33

Power Window Actuator, Single Design 2x speedup with GPUs MicroConsult H. Güttler, 22.11.2013, page 34

Note: For a single solution, GPU are controlled via Advanced Properties MicroConsult H. Güttler, 22.11.2013, page 35

Power Window Actuator, Sensitivity Analysis Time limit Time limit Time limit Zeit Generation Designs 1-4 Solution Designs 1-4 Generation Designs 5-8 Solution Designs 5-8 Generation Designs 9-12 Solution Designs 9-12 Running 4 design points on 4 compute nodes simultaneously: Designs are created sequentially in batches A new set of design points is sent to RSM for processing only after the previous set is completed Since we had at least one non-converging design in each set, the runtime is completely controlled by the (user defined) time limit MicroConsult H. Güttler, 22.11.2013, page 36

Mountain Bike Frame: Single Design Hardware: 64x Performance: 4x MicroConsult H. Güttler, 22.11.2013, page 37

Mountain Bike Frame: Sensitivity Analysis 64 Designs MicroConsult H. Güttler, 22.11.2013, page 38

Beam in Bending: Single Design 2P-1N-0G: 2.92h Hardware: 64x Performance: 24x MicroConsult H. Güttler, 22.11.2013, page 39

Beam in Bending: Sensitivity Analysis 48 Designs 3x speedup with GPUs MicroConsult H. Güttler, 22.11.2013, page 40

Summary: The Power Window Actuator case suffers from instability of the model. Chances are good to achieve speedups when going parallel or using GPUs. The Mountain Bike Frame is too small (TFLOP / iteration) to benefit from going parallel. The total runtime is dominated from the preparation stage, not by the solution. The Beam in Bending is a synthetic case that demonstrates what is possible when the times for preparation are negligible and there is a lot of number crunching to do. MicroConsult H. Güttler, 22.11.2013, page 41

Lessons learned I HPC Parametric Pack licenses can only be used when the designs are submitted via the Design Point table Geometry updates have to be done upfront/sequentially, because DesignModeler is not supported by HPC Parametric Pack Licenses You have to know your model very well to avoid bad designs Efficiency of HPC / GPUs is case dependent Running many design simultaneously will most likely help, unless the case is dominated by geometry preparation GPUs are not supported by R14.5 when running jobs via update all design points. We had to modify the Python Scripts directly to add the command for using GPUs (-acc nvidia -na 2) MicroConsult H. Güttler, 22.11.2013, page 42

Outlook ANSYS Release 15 is just around the corner (Dec. 2013) optislang 4.1 was released on Nov. 18 optislang 4.1 and ANSYS 15 should enable updating the Design Point Table on the fly MicroConsult H. Güttler, 22.11.2013, page 43

Acknowledgements Andreas Grosche, Dynardo Jochen Haesemeyer, CADFEM GmbH Holger Mai, MicroConsult Engineering GmbH Jeff Beisheim, Simon Cross, ANSYS Inc. MicroConsult H. Güttler, 22.11.2013, page 44