OpenFOAM on POWER8. Stretching the performance envelope. A White Paper by OCF
Executive Summary

In this white paper, we show that the IBM Power architecture provides a uniquely powerful platform for computational fluid dynamics (CFD) simulations, in particular when combined with the open source OpenFOAM application and OCF's integration and support services. Illustrative benchmark results are provided in this document as proof points.

If you work in engineering or science, whether from a commercial or an academic perspective, and have a requirement to simulate fluid flows, you will gain a deeper understanding of the benefits that OCF's OpenFOAM on POWER8 package can deliver.

Introduction

Computational fluid dynamics simulations remain a powerful tool in a wide range of engineering and scientific disciplines, including aerospace, automotive, power generation, chemical manufacturing, medical research, and astrophysics. For those serious about achieving the right results using CFD, the IBM Power platform offers up to 3x the performance of the x86 platform in the benchmarks reported here.

High performance computing expert OCF believes that OpenFOAM running on IBM's POWER8 server offers the right combination of performance, reliability and cost-effectiveness, especially when combined with OCF's integration service and support offering, which brings together the benefits of OpenFOAM with the high performance of POWER8, for faster, more reliable modelling.
The power and the pain

The OpenFOAM (Open Field Operation and Manipulation) CFD Toolbox is a free, open source CFD software package with a large user base across most areas of engineering and science, in both commercial and academic organisations. It offers a wide range of features to solve anything from complex fluid flows involving chemical reactions, turbulence and heat transfer, to solid dynamics and electromagnetics.

As with all CFD applications, reliability and performance are key. Calculating the millions of data points involved in simulating reality at high resolution consumes vast amounts of computation time. It is not uncommon for simulations to take days or even longer to run. In a world where a faster time to market can be business-critical, this is a huge bottleneck in the process.

A variety of factors influence computation time, for example time step size, under-relaxation factors (URF) or discretisation parameters, but there is a limit to how far these parameters can be adjusted before simulation results no longer meet quality requirements.

OCF's POWER8 solution

Ultimately, the performance of a CFD simulation using OpenFOAM is heavily dependent on the quality and performance of the underlying hardware. IBM Power Systems are designed for big data and deliver the performance and throughput of POWER8 combined with the cost savings that the use of industry standards can bring.

The Power S822LC server delivers high application performance and throughput based on its built-for-big-data architecture, incorporating POWER8 processors, tightly coupled FPGAs and accelerators, and faster I/O using CAPI. It delivers more processing power while increasing workload density and reducing floor space requirements, and offers a modular design that scales from a single rack to hundreds, simplicity of ordering, and a strong innovation roadmap for GPUs.

At the heart of the Power S822LC server is IBM's POWER8 processor. It is a family of superscalar symmetric multi-processors based on the Power architecture, designed to be a massively multi-threaded chip. Each core can handle eight hardware threads simultaneously, for a total of 96 threads executed simultaneously on a 12-core chip. The processor also makes use of very large amounts of on- and off-chip eDRAM cache, and of on-chip memory controllers that enable very high bandwidth to memory and system I/O.

High performance computing integrator OCF provides installation, integration and support for the IBM POWER8 server. Whether you are modelling CFD in a commercial or an academic organisation, fast, reliable processing of your analyses is key. The OCF high speed platform delivers benefits across a wide range of disciplines, including aerospace, automotive, power generation, chemical manufacturing, medical research, and astrophysics. OCF combines the benefits of OpenFOAM, which offers users complete freedom to customise and extend its existing functionality, with the high performance of POWER8, for faster, more reliable modelling.
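The thread count quoted for the POWER8 processor in this section follows from a simple product of cores and hardware threads per core. A minimal sketch of that arithmetic is given here; the function name and the `chips` parameter are illustrative assumptions, and only the figures of eight threads per core and 12 cores per chip come from the text.

```python
def hardware_threads(cores_per_chip: int, smt_threads_per_core: int, chips: int = 1) -> int:
    """Return the number of hardware threads exposed by the given configuration."""
    if min(cores_per_chip, smt_threads_per_core, chips) < 1:
        raise ValueError("all counts must be positive")
    return cores_per_chip * smt_threads_per_core * chips


# Figures from the text: a 12-core chip with eight hardware threads per core.
print(hardware_threads(12, 8))

# Illustration only: how the total changes if fewer threads per core are used.
for smt in (1, 2, 4, 8):
    print(f"{smt} thread(s) per core: {hardware_threads(12, smt)} threads per chip")
```

A count like this is only an upper bound on useful parallelism. As the benchmark section notes, the best threads-per-socket setting was chosen separately for every data point.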
Benchmarks

OpenFOAM on POWER8 is a true high performance computing solution for the analysis of problems that involve fluid flows. A recent benchmark study showed that OpenFOAM on POWER8 was up to 3x faster than on the x86 platform.

For benchmarking, OpenFOAM was compiled on x86 and POWER8 (with only minor configuration changes) and two benchmarks were run. One used a motorbike example and the other used a car example, and the results illustrated a number of benefits.

Motorbike Performance

- Different problem sizes were run on P8, Ivybridge and Haswell
- The best threads/socket setting was used for every data point

Conclusion: POWER8 is 1.5-3x faster than x86 due to:

- Memory bandwidth: 3x
- Caches: 2x
- SMT

[Figure: simpleFoam runtimes. Runtime (s) against number of cells, up to 1E+08, for P8, x86 Ivybridge and x86 Haswell]

Largest case (100M cells):

- Ivybridge seems to reach its memory limit (96 GB), although it is not swapping
- A similar but less obvious effect is seen on P8 (128 GB)
- The Haswell box has more memory (256 GB), hence its linear behaviour

The smaller cases show the dependence on memory bandwidth.
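The speedup figures quoted in these benchmarks are ratios of wall-clock runtimes for the same case on two platforms. A minimal sketch of that calculation follows. The runtimes in it are hypothetical placeholders chosen for illustration, not values from the OCF benchmark, and the function and case names are assumptions.

```python
def speedup(baseline_runtime_s: float, candidate_runtime_s: float) -> float:
    """Speedup of the candidate over the baseline: baseline time / candidate time."""
    if baseline_runtime_s <= 0 or candidate_runtime_s <= 0:
        raise ValueError("runtimes must be positive")
    return baseline_runtime_s / candidate_runtime_s


# HYPOTHETICAL runtimes in seconds (x86 baseline, POWER8 candidate).
# These are NOT measured values from the white paper.
hypothetical_runs = {
    "small case": (3000.0, 2000.0),
    "large case": (9000.0, 3000.0),
}

for case, (x86_s, p8_s) in hypothetical_runs.items():
    print(f"{case}: {speedup(x86_s, p8_s):.1f}x")
```

Defining speedup as baseline time divided by candidate time means a value above 1 indicates that the candidate platform finished sooner, which matches how the "x" figures in this document are used.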
Car Performance

Conclusion:

- Similar behaviour to the motorbike case
- Speedup: [value missing]x
- Larger advantage for P8 on larger cases

[Figure: simpleFoam runtime [s] for P8 and Ivybridge at 9M and 40M cells. Speedup labels: [value missing]x at 9M, 2.4x at 40M]

Dynamic Simulation

pisoFoam:

- Simulates transient behaviour
- Runtimes are even longer than for simpleFoam, which hinders industrial use

pisoFoam conclusion: similar speedup to simpleFoam.

[Figure: pisoFoam runtime for P8 and Ivybridge. Cases: [size missing]M cells with 1000 steps, 25M cells with 300 steps, 40M cells with 300 steps. Speedup labels: [value missing]x, 1.7x, 2.2x]

The key benefits of OpenFOAM on Power are:

- Achieve up to 3x better performance
- Make huge productivity gains
- POWER8 allows GPU acceleration
- OpenFOAM scales well on POWER8

The benchmark details and results are available in full as a separate download.
In summary

- POWER8 offers up to 3x better performance than x86 on typical benchmarks
- OpenFOAM scales well on POWER8
- GPU acceleration with POWER8 is possible

OpenFOAM remains the application of choice for its reliability and openness, which gives users complete freedom to customise and extend its existing functionality, either themselves or with the help of others. Almost everything (including meshing, and pre- and post-processing) runs in parallel as standard, enabling users to take full advantage of the computer hardware at their disposal.

HPC experts OCF can install, customise and optimise OpenFOAM on POWER8 for your application in order to deliver the right solution. You are invited to take a test drive to see for yourself what this powerful combination can offer.

OCF PLC
5 Rotunda Business Centre
Thorncliffe Park
Chapeltown
Sheffield S35 2PG
Tel:
info@ocf.co.uk
Website:
Specialty Power Systems
LOW LATENCY DATA DISTRIBUTION IN CAPITAL MARKETS: GETTING IT RIGHT PATRICK KUSTER Head of Business Development, Enterprise Capabilities, Thomson Reuters +358 (40) 840 7788; patrick.kuster@thomsonreuters.com
More informationHPC-CINECA infrastructure: The New Marconi System. HPC methods for Computational Fluid Dynamics and Astrophysics Giorgio Amati,
HPC-CINECA infrastructure: The New Marconi System HPC methods for Computational Fluid Dynamics and Astrophysics Giorgio Amati, g.amati@cineca.it Agenda 1. New Marconi system Roadmap Some performance info
More informationOptimising the Mantevo benchmark suite for multi- and many-core architectures
Optimising the Mantevo benchmark suite for multi- and many-core architectures Simon McIntosh-Smith Department of Computer Science University of Bristol 1 Bristol's rich heritage in HPC The University of
More informationExascale: challenges and opportunities in a power constrained world
Exascale: challenges and opportunities in a power constrained world Carlo Cavazzoni c.cavazzoni@cineca.it SuperComputing Applications and Innovation Department CINECA CINECA non profit Consortium, made
More informationParallel Computing Ideas
Parallel Computing Ideas K. 1 1 Department of Mathematics 2018 Why When to go for speed Historically: Production code Code takes a long time to run Code runs many times Code is not end in itself 2010:
More informationAccelerating Virtual Product Design with Quad-Core Intel Xeon Processor-Based Workstations and ESI Group Software
Performance rief Quad-Core Intel/ESI Group Workstation ccelerating Virtual Product Design with Quad-Core Intel Xeon Processor-ased Workstations and ESI Group Software Workstation supercomputers powered
More informationHypervisor Part 1- What is a Hypervisor and How Does it Work?
WHITEPAPER Hypervisor Part 1- What is a Hypervisor and How Does it Work? Randy Martin Product Manager 5/3/17 BlackBerry QNX Not for general distribution. Intended for Advanced Sterilization Products 1
More informationIBM DS8880F All-flash Data Systems
IBM DS8880F All-flash Data Systems Gary F Albert Offering Manager and Business Line Manager IBM DS8880 Release 8..1 and Roadmap November 016 NAND Flash technology is used across the IBM Systems Flash Storage
More information