AMD EPYC and NAMD Powering the Future of HPC February, 2019


Exceptional Core Performance
NAMD is a compute-intensive workload that benefits from the high core IPC (instructions per clock) and high core counts of AMD EPYC processors.

Standards Based
AMD is committed to industry standards, offering you a choice in x86 processors with design innovations that target the evolving needs of modern datacenters.

High Density, Low Cost
Compute requirements are increasing; datacenter space is not. AMD's EPYC processor offers high core density with full access to all features. Its innovative architecture means outstanding performance at a low cost.

Partner Ecosystem
AMD's broad partner ecosystem and collaborative engineering provide tested and validated solutions that help lower your risk and total cost of ownership.

NAMD
NAMD, recipient of a 2002 Gordon Bell Award and a 2012 Sidney Fernbach Award, is a parallel molecular dynamics code designed for high-performance simulation of large biomolecular systems.

AMD EPYC: Nanoscale Molecular Dynamics
Designed from the ground up for a new generation of solutions, AMD EPYC processors implement a philosophy of choice without restriction. Choose the number of cores and sockets that meet your needs without sacrificing key features like memory and I/O. Each EPYC processor can have from 8 to 32 cores, with access to an exceptional amount of I/O and memory regardless of the number of cores in use, including 128 PCIe lanes and support for up to 2 TB of high-speed memory per socket.

EPYC's innovative architecture translates to tremendous performance at a low cost. More importantly, the performance you are paying for is appropriate to the performance you need. I/O-intensive workloads can use the plentiful I/O bandwidth with the right number of cores and avoid overpaying for unneeded compute power, while compute-intensive workloads can make use of fully loaded core counts, dual sockets, and plenty of memory.
AMD EPYC processors help enable more performance, flexibility, and security

PERFORMANCE. AMD EPYC processors bring a new balance to the datacenter. Built on an x86 architecture, the AMD EPYC processor brings together high core counts, large memory capacity, ample memory bandwidth, and massive I/O in the right ratios to help performance reach new heights.

FLEXIBILITY. Match core count with application needs without compromising processor features. EPYC's balanced set of resources means more freedom to right-size the server configuration to the workload.

SECURITY. AMD EPYC features the industry's first dedicated security processor embedded in an x86-architecture server processor. It manages secure boot, memory encryption, and secure virtualization on the processor itself. Encryption keys never leave the processor, so they are not exposed to intruders.

SCALABILITY. Scale up or scale out: AMD and its ecosystem partners offer high-performance network connectivity options for applications at massive scale.

2018 Advanced Micro Devices, Inc.

AMD EPYC for Molecular Dynamics

NAMD
Core IPC is a critical factor in optimizing the performance of NAMD. AMD EPYC server processors employing the Zen microarchitecture help ensure that you get the most out of your system, minimizing execution time and increasing overall utilization of your deployment. Based on Charm++ parallel objects, NAMD scales to hundreds of cores for typical simulations and beyond 500,000 cores for the largest simulations. NAMD uses the popular molecular graphics program VMD for simulation setup and trajectory analysis, but is also file-compatible with AMBER, CHARMM, and X-PLOR.

The EPYC Advantage: AMD EPYC server processors offer 8 memory channels of DDR4-2666 and up to 2 TB of memory per processor, yielding exceptional memory bandwidth and capacity.

Many high-performance computing (HPC) workloads require you to balance performance against per-core license costs to manage your overall cost. AMD EPYC processors offer a consistent set of features across the product line, allowing users to choose the number of cores their workloads require without sacrificing features, memory channels, memory capacity, or I/O lanes. Whether you need 8, 16, 24, or 32 physical cores per socket, you will have access to 8 channels of memory per processor across all EPYC server processors.

The EPYC Advantage: Performance. The AMD EPYC processor brings new balance to the datacenter. The highest core count yet in an AMD x86-architecture server processor, large memory capacity, memory bandwidth, and I/O density are all brought together in the right ratios to help performance reach new heights.

As workloads demand more processor cores, communication between processor cores becomes critical to efficiently solving the complex problems customers face. As cluster sizes increase, the communication requirements between nodes rise quickly and can limit scaling at large node counts.
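As a rough illustration of what "8 memory channels of DDR4-2666" means in bandwidth terms, the sketch below computes the theoretical peak per socket and per dual-socket system. It assumes the standard 64-bit (8-byte) DDR4 channel width and treats DDR4-2666 as 2666 mega-transfers per second; the function name is hypothetical, and sustained bandwidth in practice is lower than this peak.

```python
def peak_bandwidth_gbs(channels: int, mega_transfers_per_s: int, bus_bytes: int = 8) -> float:
    """Theoretical peak memory bandwidth in GB/s (1 GB = 1e9 bytes).

    Assumes each DDR4 channel moves `bus_bytes` bytes per transfer
    (8 bytes for a 64-bit channel). This is an upper bound, not a
    measured figure.
    """
    return channels * mega_transfers_per_s * 1e6 * bus_bytes / 1e9


# 8 channels of DDR4-2666 per processor, two processors per system.
per_socket = peak_bandwidth_gbs(channels=8, mega_transfers_per_s=2666)
per_system = 2 * per_socket

print(f"per socket: {per_socket:.1f} GB/s")
print(f"per dual-socket system: {per_system:.1f} GB/s")
```

Because every EPYC part in the product line exposes the same 8 channels, this peak does not change with core count; only the bandwidth available per core does.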
NAMD is distributed free of charge with source code. You can build NAMD yourself or download binaries for a wide variety of platforms.

NAMD Benchmarks
The industry-standard benchmarks for NAMD include STMV, ApoA1, and f1atpase. These are well-established NAMD workloads that let users compare the performance of different hardware solutions and determine which best fits their needs.

Performance Benchmarks and Testing

NAMD benchmarks provide a basis for evaluating hardware performance. Standard models are provided that represent typical usage. The benchmarks used were STMV, ApoA1, and f1atpase.

NAMD testing was performed on a cluster of dual-socket systems. Tests were run using AMD EPYC 7351, AMD EPYC 7371, AMD EPYC 7451, and AMD EPYC 7601 processors. Both the EPYC 7351 and the EPYC 7371 have 16 cores, but the EPYC 7371 runs at a higher frequency. The EPYC 7351 processor has a base frequency of 2.4 GHz and a boost frequency of 2.9 GHz. The EPYC 7371 processor runs at a base frequency of 3.1 GHz and a boost frequency of 3.6 GHz. Each EPYC 7451 processor has 24 cores with a base frequency of 2.3 GHz and a boost frequency of 2.9 GHz. Finally, the EPYC 7601 processor has 32 cores and runs at a base frequency of 2.2 GHz and a boost frequency of 2.7 GHz. Each system has a total of 16 channels of dual-rank DDR4-2666 memory, 8 channels per processor.

Tested Hardware/Software Configuration

Compute Nodes
CPUs: 2 x EPYC 7351 / 2 x EPYC 7371 / 2 x EPYC 7451 / 2 x EPYC 7601
Cores: 16 cores, 32 threads per CPU (64 threads per system) / 16 cores, 32 threads per CPU (64 threads per system) / 24 cores, 48 threads per CPU (96 threads per system) / 32 cores, 64 threads per CPU (128 threads per system)
Memory: 256GB (16x) Dual-Rank DDR4-2666
NIC: Mellanox ConnectX-5 EDR 100Gb Infiniband x16 PCIe
Storage (OS): 1 x 256 GB NVMe
Storage (Data): 1 x 1 TB NVMe

Software
OS: RHEL 7.5 (el7.x86_64)
Mellanox OFED Driver: MLNX_OFED_LINUX
MPI Version: OpenMPI
Application: NAMD_2.13b

Network
Switch: Mellanox EDR 100Gb/s Managed Switch (MSB7800-ES2F)

Configuration Options
BIOS Setting: SMT=ON, Boost=ON, SMEE=Disabled, Determinism Slider=Power, SVM=Disabled, Global C State Control=Enabled
OS Settings: Governor=Performance, CC6 Disabled

NAMD Compilation
NAMD version 2.13b was compiled from source on RHEL 7.5 using the gcc compiler and OpenMPI. The default optimization flags were used, and no further compile-time optimizations were done. The FFTW and TCL libraries used are the precompiled versions from the NAMD website. The same binary was used for all benchmarks across all configurations.
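The per-CPU and per-system thread counts in the configuration follow directly from the core counts, SMT=ON (two hardware threads per core), and the dual-socket layout. The sketch below reproduces that arithmetic and extends it to whole-cluster totals; the class and function names are hypothetical and exist only for illustration.

```python
from dataclasses import dataclass

SOCKETS_PER_NODE = 2  # all tested systems are dual-socket
SMT_WAYS = 2          # SMT=ON: two hardware threads per physical core


@dataclass(frozen=True)
class Cpu:
    model: str
    cores: int  # physical cores per socket, as listed in the configuration

    @property
    def threads(self) -> int:
        """Hardware threads per socket with SMT enabled."""
        return self.cores * SMT_WAYS


def node_totals(cpu: Cpu) -> tuple:
    """Return (physical cores, hardware threads) for one dual-socket node."""
    return cpu.cores * SOCKETS_PER_NODE, cpu.threads * SOCKETS_PER_NODE


def cluster_totals(cpu: Cpu, nodes: int) -> tuple:
    """Return (physical cores, hardware threads) across `nodes` identical nodes."""
    cores, threads = node_totals(cpu)
    return cores * nodes, threads * nodes


CPUS = [
    Cpu("EPYC 7351", 16),
    Cpu("EPYC 7371", 16),
    Cpu("EPYC 7451", 24),
    Cpu("EPYC 7601", 32),
]

for cpu in CPUS:
    cores, threads = node_totals(cpu)
    print(f"{cpu.model}: {cpu.threads} threads per CPU, "
          f"{cores} cores and {threads} threads per system")
```

With one MPI rank bound to each hardware thread, the per-system thread count is also the number of ranks launched per node.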

NAMD Performance and Scaling: Single Node Performance

Single-node performance was compared across the 7351, 7371, 7451, and 7601 processors using three industry-standard NAMD models: STMV, ApoA1, and f1atpase. STMV is one of the most common benchmarks for comparing NAMD performance across platforms and is useful for demonstrating scaling to thousands of processors. ApoA1 has been the standard NAMD cross-platform benchmark for years. Another commonly used NAMD benchmark is f1atpase.

STMV: EPYC 7351 vs 7371 vs 7451 vs 7601
Figure 1 details the performance of the STMV benchmark on a single two-socket system across 4 different AMD EPYC processors. The 7351 and 7371 processors both have 16 cores per socket, but the 7371 runs at higher clock speeds. The 7451 has 24 cores per socket, and the 7601 has 32 cores per socket. As expected, performance increases with more cores, higher frequency, or both.

Figure 1: NAMD STMV Performance, EPYC 7351 vs 7371 vs 7451 vs 7601. Axis: ns/day (higher is better). Subtitle: Single socket node performance.

ApoA1: EPYC 7351 vs 7371 vs 7451 vs 7601
Figure 2 details the performance of the ApoA1 benchmark using the same configurations. Again, the 7351 and 7371 both have 16 cores per socket, but the 7371 runs at higher clock speeds. The 7451 has 24 cores per socket, and the 7601 has 32 cores per socket. As expected, performance increases with more cores, higher frequency, or both.

Figure 2: NAMD ApoA1 Performance, EPYC 7351 vs 7371 vs 7451 vs 7601. Axis: ns/day (higher is better). Subtitle: Single socket node performance.

f1atpase: EPYC 7351 vs 7371 vs 7451 vs 7601
Finally, Figure 3 details the performance of the f1atpase benchmark using the same configurations. The 7351 and 7371 both have 16 cores per socket, but the 7371 runs at higher clock speeds. The 7451 has 24 cores per socket, and the 7601 has 32 cores per socket. As expected, performance increases with more cores, higher frequency, or both.

Figure 3: NAMD f1atpase Performance, EPYC 7351 vs 7371 vs 7451 vs 7601. Axis: ns/day (higher is better). Subtitle: Single socket node performance.
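The charts report performance in ns/day. The paper does not say how those values were extracted, so the following is only a minimal sketch of one common approach: reading the "Benchmark time" lines that NAMD 2.x writes to its log and inverting the reported days/ns. The log line layout in the regular expression is an assumption about the output format, and the sample log text is made up for illustration, not taken from the paper's runs.

```python
import re
from statistics import mean

# Assumed layout of a NAMD 2.x benchmark line, for example:
#   Info: Benchmark time: 64 CPUs 0.0432 s/step 0.5 days/ns 1024 MB memory
BENCHMARK_LINE = re.compile(
    r"Info:\s+Benchmark time:\s+(\d+)\s+CPUs\s+"
    r"([0-9.eE+-]+)\s+s/step\s+([0-9.eE+-]+)\s+days/ns"
)


def ns_per_day(log_text: str) -> float:
    """Average the days/ns samples in a log and return the inverse (ns/day)."""
    days_per_ns = [float(m.group(3)) for m in BENCHMARK_LINE.finditer(log_text)]
    if not days_per_ns:
        raise ValueError("no 'Benchmark time' lines found in log")
    return 1.0 / mean(days_per_ns)


# Hypothetical log excerpt; the numbers are illustrative only.
SAMPLE_LOG = """\
Info: Benchmark time: 64 CPUs 0.0432 s/step 0.5 days/ns 1024 MB memory
Info: Benchmark time: 64 CPUs 0.02592 s/step 0.3 days/ns 1024 MB memory
"""

print(f"{ns_per_day(SAMPLE_LOG):.2f} ns/day")
```

Averaging the time per nanosecond before inverting weights each sample by the time it actually took, which is usually what a throughput comparison wants.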

NAMD Performance and Scaling: Multi Node Scaling

Each of the 3 benchmarks (STMV, ApoA1, and f1atpase) was then scaled out to multiple nodes and hundreds of cores to see how well the workload scales.

EPYC 7351 Multi Node Scaling STMV
Figure 4 details the performance of the STMV model when running across multiple systems, showing how performance improves through 16 nodes (512 cores, 1024 threads). Each node has 32 physical cores with SMT (simultaneous multi-threading) enabled, for 64 logical threads. Each node runs 64 MPI ranks, with each rank bound to one thread. The data shows exceptional scaling for this workload.

Figure 4: NAMD STMV EPYC 7351 Scaling. Axes: Scaling Factor vs Number of Threads.

EPYC 7451 Multi Node Scaling STMV
Figure 5 details the performance of the STMV benchmark as nodes are added, showing how performance improves through 8 nodes (384 cores, 768 threads). Each node has 48 cores with SMT enabled, for 96 logical threads. Each node runs 96 MPI ranks, with each rank bound to one thread. The data shows exceptional scaling on this processor as well.

Figure 5: NAMD STMV EPYC 7451 Scaling. Axes: Scaling Factor vs Number of Threads.

EPYC 7351 Multi Node Scaling f1atpase
Figures 6 and 7 show the performance of the f1atpase model as systems are added. Each 7351 system has 32 cores with SMT enabled, for 64 logical threads, with 1 MPI rank per logical thread. The 7451 systems have 48 physical cores with SMT enabled, for a total of 96 logical threads, again with 1 MPI rank per logical thread. Scaling remains very good, even though this model is smaller than STMV.

Figure 6: NAMD f1atpase EPYC 7351 Scaling. Axes: Scaling Factor vs Number of Threads.

Figure 7: NAMD f1atpase EPYC 7451 Scaling. Axes: Scaling Factor vs Number of Threads.
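The scaling figures plot a "Scaling Factor" against thread count. The paper does not define the term, so the sketch below assumes the usual meaning: throughput at N nodes divided by throughput at the smallest node count, with parallel efficiency being that ratio divided by the ideal (linear) speedup. The function name is hypothetical and the ns/day values are placeholders, not results from the paper.

```python
from typing import Dict, Tuple


def scaling_factors(ns_per_day_by_nodes: Dict[int, float]) -> Dict[int, Tuple[float, float]]:
    """Map node count -> (scaling factor, parallel efficiency).

    The scaling factor is relative to the smallest node count present;
    efficiency is the scaling factor divided by the ideal linear speedup.
    """
    base_nodes = min(ns_per_day_by_nodes)
    base_perf = ns_per_day_by_nodes[base_nodes]
    result = {}
    for nodes in sorted(ns_per_day_by_nodes):
        factor = ns_per_day_by_nodes[nodes] / base_perf
        ideal = nodes / base_nodes
        result[nodes] = (factor, factor / ideal)
    return result


# Placeholder measurements (ns/day per node count), for illustration only.
HYPOTHETICAL_RUNS = {1: 1.0, 2: 1.9, 4: 3.6, 8: 6.8, 16: 12.8}

for nodes, (factor, efficiency) in scaling_factors(HYPOTHETICAL_RUNS).items():
    print(f"{nodes:>2} nodes: scaling factor {factor:5.2f}, efficiency {efficiency:.0%}")
```

An efficiency that stays close to 100% as nodes are added corresponds to the near-linear curves described above; smaller models such as f1atpase typically lose efficiency sooner because there is less work per rank to offset communication.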

Summary
NAMD benchmarks were conducted on single two-socket systems running AMD EPYC 7351, 7371, 7451, and 7601 processors. Scaling tests were run on the STMV and f1atpase models on both the 7351 and 7451 processors. Results were as expected, with higher core frequencies and more cores resulting in better overall performance. NAMD scales very well across all benchmarks to a large number of cores.

Conclusion
AMD empowers the development of fast, accurate molecular dynamics simulations running on cost-effective clustered systems. Scale-out testing on the EPYC cluster shows impressive results on these benchmarks. Pure performance was highest with the 32-core EPYC 7601. Per-core performance was highest with the 16-core EPYC 7371. Whether you need the dominating system-level performance and density of the EPYC 7601 or the equally dominating per-core performance of the EPYC 7371, all products offer exceptional core IPC, and both provide your organization a significant benefit. Customers can pick the best part for their unique requirements.

For more information about AMD's EPYC line of processors visit:
For more information about NAMD visit:

Authors
This paper is authored by Anre Kashyap in collaboration with Marc Baker and Kevin Mayo.

DISCLAIMER
The information contained herein is for informational purposes only and is subject to change without notice. While every precaution has been taken in the preparation of this document, it may contain technical inaccuracies, omissions and typographical errors, and AMD is under no obligation to update or otherwise correct this information. Advanced Micro Devices, Inc. makes no representations or warranties with respect to the accuracy or completeness of the contents of this document, and assumes no liability of any kind, including the implied warranties of noninfringement, merchantability or fitness for particular purposes, with respect to the operation or use of AMD hardware, software or other products described herein. No license, including implied or arising by estoppel, to any intellectual property rights is granted by this document. Terms and limitations applicable to the purchase or use of AMD's products are as set forth in a signed agreement between the parties or in AMD's Standard Terms and Conditions of Sale. GD Advanced Micro Devices, Inc. All rights reserved. AMD, the AMD Arrow logo, EPYC, and combinations thereof are trademarks of Advanced Micro Devices, Inc. Other product names used in this publication are for identification purposes only and may be trademarks of their respective companies.

2019 Advanced Micro Devices, Inc.

Memory Population Guidelines for AMD EPYC Processors

Memory Population Guidelines for AMD EPYC Processors Memory Population Guidelines for AMD EPYC Processors Publication # 56301 Revision: 0.70 Issue Date: July 2018 Advanced Micro Devices 2018 Advanced Micro Devices, Inc. All rights reserved. The information

More information

EPYC VIDEO CUG 2018 MAY 2018

EPYC VIDEO CUG 2018 MAY 2018 AMD UPDATE CUG 2018 EPYC VIDEO CRAY AND AMD PAST SUCCESS IN HPC AMD IN TOP500 LIST 2002 TO 2011 2011 - AMD IN FASTEST MACHINES IN 11 COUNTRIES ZEN A FRESH APPROACH Designed from the Ground up for Optimal

More information

Microsoft Windows 2016 Mellanox 100GbE NIC Tuning Guide

Microsoft Windows 2016 Mellanox 100GbE NIC Tuning Guide Microsoft Windows 2016 Mellanox 100GbE NIC Tuning Guide Publication # 56288 Revision: 1.00 Issue Date: June 2018 2018 Advanced Micro Devices, Inc. All rights reserved. The information contained herein

More information

NAMD Performance Benchmark and Profiling. February 2012

NAMD Performance Benchmark and Profiling. February 2012 NAMD Performance Benchmark and Profiling February 2012 Note The following research was performed under the HPC Advisory Council activities Participating vendors: AMD, Dell, Mellanox Compute resource -

More information

NAMD Performance Benchmark and Profiling. January 2015

NAMD Performance Benchmark and Profiling. January 2015 NAMD Performance Benchmark and Profiling January 2015 2 Note The following research was performed under the HPC Advisory Council activities Participating vendors: Intel, Dell, Mellanox Compute resource

More information

NAMD GPU Performance Benchmark. March 2011

NAMD GPU Performance Benchmark. March 2011 NAMD GPU Performance Benchmark March 2011 Note The following research was performed under the HPC Advisory Council activities Participating vendors: Dell, Intel, Mellanox Compute resource - HPC Advisory

More information

NUMA Topology for AMD EPYC Naples Family Processors

NUMA Topology for AMD EPYC Naples Family Processors NUMA Topology for AMD EPYC Naples Family Publication # 56308 Revision: 0.70 Issue Date: May 2018 Advanced Micro Devices 2018 Advanced Micro Devices, Inc. All rights reserved. The information contained

More information

AMD EPYC Processors Showcase High Performance for Network Function Virtualization (NFV)

AMD EPYC Processors Showcase High Performance for Network Function Virtualization (NFV) White Paper December, 2018 AMD EPYC Processors Showcase High Performance for Network Function Virtualization (NFV) Executive Summary Data centers and cloud service providers are creating a technology shift

More information

CAUTIONARY STATEMENT 1 EPYC PROCESSOR ONE YEAR ANNIVERSARY JUNE 2018

CAUTIONARY STATEMENT 1 EPYC PROCESSOR ONE YEAR ANNIVERSARY JUNE 2018 CAUTIONARY STATEMENT This presentation contains forward-looking statements concerning Advanced Micro Devices, Inc. (AMD) including, but not limited to, the features, functionality, availability, timing,

More information

Altair OptiStruct 13.0 Performance Benchmark and Profiling. May 2015

Altair OptiStruct 13.0 Performance Benchmark and Profiling. May 2015 Altair OptiStruct 13.0 Performance Benchmark and Profiling May 2015 Note The following research was performed under the HPC Advisory Council activities Participating vendors: Intel, Dell, Mellanox Compute

More information

NVMe SSD Performance Evaluation Guide for Windows Server 2016 and Red Hat Enterprise Linux 7.4

NVMe SSD Performance Evaluation Guide for Windows Server 2016 and Red Hat Enterprise Linux 7.4 NVMe SSD Performance Evaluation Guide for Windows Server 2016 and Red Hat Enterprise Linux 7.4 Publication # 56367 Revision: 0.70 Issue Date: August 2018 Advanced Micro Devices 2018 Advanced Micro Devices,

More information

Linux Network Tuning Guide for AMD EPYC Processor Based Servers

Linux Network Tuning Guide for AMD EPYC Processor Based Servers Linux Network Tuning Guide for AMD EPYC Processor Application Note Publication # 56224 Revision: 1.00 Issue Date: November 2017 Advanced Micro Devices 2017 Advanced Micro Devices, Inc. All rights reserved.

More information

LAMMPS Performance Benchmark and Profiling. July 2012

LAMMPS Performance Benchmark and Profiling. July 2012 LAMMPS Performance Benchmark and Profiling July 2012 Note The following research was performed under the HPC Advisory Council activities Participating vendors: AMD, Dell, Mellanox Compute resource - HPC

More information

NAMD Performance Benchmark and Profiling. November 2010

NAMD Performance Benchmark and Profiling. November 2010 NAMD Performance Benchmark and Profiling November 2010 Note The following research was performed under the HPC Advisory Council activities Participating vendors: HP, Mellanox Compute resource - HPC Advisory

More information

AMBER 11 Performance Benchmark and Profiling. July 2011

AMBER 11 Performance Benchmark and Profiling. July 2011 AMBER 11 Performance Benchmark and Profiling July 2011 Note The following research was performed under the HPC Advisory Council activities Participating vendors: AMD, Dell, Mellanox Compute resource -

More information

CP2K Performance Benchmark and Profiling. April 2011

CP2K Performance Benchmark and Profiling. April 2011 CP2K Performance Benchmark and Profiling April 2011 Note The following research was performed under the HPC Advisory Council activities Participating vendors: AMD, Dell, Mellanox Compute resource - HPC

More information

AMD EPYC PRESENTS OPPORTUNITY TO SAVE ON SOFTWARE LICENSING COSTS

AMD EPYC PRESENTS OPPORTUNITY TO SAVE ON SOFTWARE LICENSING COSTS AMD EPYC PRESENTS OPPORTUNITY TO SAVE ON SOFTWARE LICENSING COSTS BUSINESS SELECTION OF PROCESSOR SHOULD FACTOR IN SOFTWARE COSTS EXECUTIVE SUMMARY Software licensing models for many server applications

More information

Broadcast-Quality, High-Density HEVC Encoding with AMD EPYC Processors

Broadcast-Quality, High-Density HEVC Encoding with AMD EPYC Processors Solution Brief December, 2018 2018 Broadcast-Quality, High-Density HEVC Encoding with AMD EPYC Processors HIGHLIGHTS o The AMD EPYC SoC brings a new balance to the datacenter. Utilizing an x86-architecture,

More information

AMD Radeon ProRender plug-in for Unreal Engine. Installation Guide

AMD Radeon ProRender plug-in for Unreal Engine. Installation Guide AMD Radeon ProRender plug-in for Unreal Engine Installation Guide This document is a guide on how to install and configure AMD Radeon ProRender plug-in for Unreal Engine. DISCLAIMER The information contained

More information

CAUTIONARY STATEMENT This presentation contains forward-looking statements concerning Advanced Micro Devices, Inc. (AMD) including, but not limited to

CAUTIONARY STATEMENT This presentation contains forward-looking statements concerning Advanced Micro Devices, Inc. (AMD) including, but not limited to CAUTIONARY STATEMENT This presentation contains forward-looking statements concerning Advanced Micro Devices, Inc. (AMD) including, but not limited to AMD s strategy and focus, expected datacenter total

More information

NVMe Performance Testing and Optimization Application Note

NVMe Performance Testing and Optimization Application Note NVMe Performance Testing and Optimization Application Note Publication # 56163 Revision: 0.72 Issue Date: December 2017 Advanced Micro Devices 2017 Advanced Micro Devices, Inc. All rights reserved. The

More information

LAMMPS-KOKKOS Performance Benchmark and Profiling. September 2015

LAMMPS-KOKKOS Performance Benchmark and Profiling. September 2015 LAMMPS-KOKKOS Performance Benchmark and Profiling September 2015 2 Note The following research was performed under the HPC Advisory Council activities Participating vendors: Intel, Dell, Mellanox, NVIDIA

More information

Fan Control in AMD Radeon Pro Settings. User Guide. This document is a quick user guide on how to configure GPU fan speed in AMD Radeon Pro Settings.

Fan Control in AMD Radeon Pro Settings. User Guide. This document is a quick user guide on how to configure GPU fan speed in AMD Radeon Pro Settings. Fan Control in AMD Radeon Pro Settings User Guide This document is a quick user guide on how to configure GPU fan speed in AMD Radeon Pro Settings. DISCLAIMER The information contained herein is for informational

More information

STAR-CCM+ Performance Benchmark and Profiling. July 2014

STAR-CCM+ Performance Benchmark and Profiling. July 2014 STAR-CCM+ Performance Benchmark and Profiling July 2014 Note The following research was performed under the HPC Advisory Council activities Participating vendors: CD-adapco, Intel, Dell, Mellanox Compute

More information

FOR ENTERPRISE 18.Q3. August 8 th, 2018

FOR ENTERPRISE 18.Q3. August 8 th, 2018 18.Q3 August 8 th, 2018 AMD RADEON PRO SOFTWARE TM Making the Best AMD RADEON PRO SOFTWARE TM Making the Best Quality Performance Simplicity Virtualization AMD RADEON PRO SOFTWARE TM Your Workstation Virtually

More information

Enhance your Cloud Security with AMD EPYC Hardware Memory Encryption

Enhance your Cloud Security with AMD EPYC Hardware Memory Encryption Enhance your Cloud Security with AMD EPYC Hardware Memory Encryption White Paper October, 2018 Introduction Consumers and enterprises are becoming increasingly concerned about the security of their digital

More information

Scheduling Strategies for HPC as a Service (HPCaaS) for Bio-Science Applications

Scheduling Strategies for HPC as a Service (HPCaaS) for Bio-Science Applications Scheduling Strategies for HPC as a Service (HPCaaS) for Bio-Science Applications Sep 2009 Gilad Shainer, Tong Liu (Mellanox); Jeffrey Layton (Dell); Joshua Mora (AMD) High Performance Interconnects for

More information

CPMD Performance Benchmark and Profiling. February 2014

CPMD Performance Benchmark and Profiling. February 2014 CPMD Performance Benchmark and Profiling February 2014 Note The following research was performed under the HPC Advisory Council activities Special thanks for: HP, Mellanox For more information on the supporting

More information

GROMACS (GPU) Performance Benchmark and Profiling. February 2016

GROMACS (GPU) Performance Benchmark and Profiling. February 2016 GROMACS (GPU) Performance Benchmark and Profiling February 2016 2 Note The following research was performed under the HPC Advisory Council activities Participating vendors: Dell, Mellanox, NVIDIA Compute

More information

Changing your Driver Options with Radeon Pro Settings. Quick Start User Guide v2.1

Changing your Driver Options with Radeon Pro Settings. Quick Start User Guide v2.1 Changing your Driver Options with Radeon Pro Settings Quick Start User Guide v2.1 This guide will show you how to switch between Professional Mode and Gaming Mode when using Radeon Pro Software. DISCLAIMER

More information

AMD EPYC Delivers Linear Scalability for Docker with Bare-Metal Performance

AMD EPYC Delivers Linear Scalability for Docker with Bare-Metal Performance Solution Brief February, 2019 AMD EPYC Delivers Linear Scalability for Docker with Bare-Metal Performance The AMD EPYC SoC brings a new balance to the datacenter. Utilizing x86 architecture, the AMD EPYC

More information

Thermal Design Guide for Socket SP3 Processors

Thermal Design Guide for Socket SP3 Processors Thermal Design Guide for Socket SP3 Processors Publication # 55423 Rev: 3.00 Issue Date: November 2017 2017 Advanced Micro Devices, Inc. All rights reserved. The information contained herein is for informational

More information

Forza Horizon 4 Benchmark Guide

Forza Horizon 4 Benchmark Guide Forza Horizon 4 Benchmark Guide Copyright 2018 Playground Games Limited. The Playground Games name and logo, the Forza Horizon 4 name and logo and the Forza Horizon 4 insignia are trademarks of Playground

More information

Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Intel Xeon Processor E7 v2 Family-Based Platforms

Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Intel Xeon Processor E7 v2 Family-Based Platforms Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Family-Based Platforms Executive Summary Complex simulations of structural and systems performance, such as car crash simulations,

More information

OpenFOAM Performance Testing and Profiling. October 2017

OpenFOAM Performance Testing and Profiling. October 2017 OpenFOAM Performance Testing and Profiling October 2017 Note The following research was performed under the HPC Advisory Council activities Participating vendors: Huawei, Mellanox Compute resource - HPC

More information

White Paper AMD64 TECHNOLOGY SPECULATIVE STORE BYPASS DISABLE

White Paper AMD64 TECHNOLOGY SPECULATIVE STORE BYPASS DISABLE White Paper AMD64 TECHNOLOGY SPECULATIVE STORE BYPASS DISABLE 2018 Advanced Micro Devices Inc. All rights reserved. The information contained herein is for informational purposes only, and is subject to

More information

Changing your Driver Options with Radeon Pro Settings. Quick Start User Guide v3.0

Changing your Driver Options with Radeon Pro Settings. Quick Start User Guide v3.0 Changing your Driver Options with Radeon Pro Settings Quick Start User Guide v3.0 This guide will show you how to switch between Professional Mode and Gaming Mode when using Radeon Pro Software. DISCLAIMER

More information

GROMACS Performance Benchmark and Profiling. August 2011

GROMACS Performance Benchmark and Profiling. August 2011 GROMACS Performance Benchmark and Profiling August 2011 Note The following research was performed under the HPC Advisory Council activities Participating vendors: Intel, Dell, Mellanox Compute resource

More information

DR. LISA SU

DR. LISA SU CAUTIONARY STATEMENT This presentation contains forward-looking statements concerning Advanced Micro Devices, Inc. (AMD) including, but not limited to AMD s strategy and focus, expected datacenter total

More information

OCTOPUS Performance Benchmark and Profiling. June 2015

OCTOPUS Performance Benchmark and Profiling. June 2015 OCTOPUS Performance Benchmark and Profiling June 2015 2 Note The following research was performed under the HPC Advisory Council activities Special thanks for: HP, Mellanox For more information on the

More information

SNAP Performance Benchmark and Profiling. April 2014

SNAP Performance Benchmark and Profiling. April 2014 SNAP Performance Benchmark and Profiling April 2014 Note The following research was performed under the HPC Advisory Council activities Participating vendors: HP, Mellanox For more information on the supporting

More information

Driver Options in AMD Radeon Pro Settings. User Guide

Driver Options in AMD Radeon Pro Settings. User Guide Driver Options in AMD Radeon Pro Settings User Guide This guide will show you how to switch between Professional Mode and Gaming Mode when using Radeon Pro Software. DISCLAIMER The information contained

More information

Performance Tuning Guidelines for Low Latency Response on AMD EPYC -Based Servers Application Note

Performance Tuning Guidelines for Low Latency Response on AMD EPYC -Based Servers Application Note Performance Tuning Guidelines for Low Latency Response on AMD EPYC -Based Servers Publication # 56263 Revision: 3.00 Issue Date: January 2018 Advanced Micro Devices 2018 Advanced Micro Devices, Inc. All

More information

Linux Network Tuning Guide for AMD EPYC Processor Based Servers

Linux Network Tuning Guide for AMD EPYC Processor Based Servers Linux Network Tuning Guide for AMD EPYC Processor Application Note Publication # 56224 Revision: 1.10 Issue Date: May 2018 Advanced Micro Devices 2018 Advanced Micro Devices, Inc. All rights reserved.

More information
