Innovative Alternate Architecture for Exascale Computing. Surya Hotha Director, Product Marketing

Similar documents
Arm Processor Technology Update and Roadmap

Atos ARM solutions for HPC

Arm's role in co-design for the next generation of HPC platforms

Comparative Benchmarking of the First Generation of HPC-Optimised Arm Processors on Isambard

The Mont-Blanc project Updates from the Barcelona Supercomputing Center

Arm in HPC. Toshinori Kujiraoka Sales Manager, APAC HPC Tools Arm Arm Limited

The Arm Technology Ecosystem: Current Products and Future Outlook

SUSE Linux Entreprise Server for ARM

European energy efficient supercomputer project

Top 5 Reasons HPE Delivers the Best Microsoft Azure Stack Solution

ARM High Performance Computing

CCR. ISC18 June 28, Kevin Pedretti, Jim H. Laros III, Si Hammond SAND C. Photos placed in horizontal env

Jay Kruemcke Sr. Product Manager, HPC, Arm,

The Mont-Blanc approach towards Exascale

Standardized Firmware for ARMv8 based Volume Servers

Building supercomputers from embedded technologies

S8765 Performance Optimization for Deep- Learning on the Latest POWER Systems

GOING ARM A CODE PERSPECTIVE

HPE ProLiant DL360 Gen P 16GB-R P408i-a 8SFF 500W PS Performance Server (P06453-B21)

Agenda. Sun s x Sun s x86 Strategy. 2. Sun s x86 Product Portfolio. 3. Virtualization < 1 >

Ampere emag Processor Optimized for the Cloud Kumar Sankaran Vice President, Software & Platforms, Ampere

ARISTA: Improving Application Performance While Reducing Complexity

April 2 nd, Bob Burroughs Director, HPC Solution Sales

Software Ecosystem for Arm-based HPC

The Mont-Blanc Project

Beyond Hardware IP An overview of Arm development solutions

Performance and Energy Efficiency of the 14 th Generation Dell PowerEdge Servers

OpenPOWER Performance

Transforming the Data Center with ARM

HPE ProLiant ML350 Gen P 16GB-R E208i-a 8SFF 1x800W RPS Solution Server (P04674-S01)

HPE ProLiant Gen10. Franz Weberberger Presales Consultant Server

OCTOPUS Performance Benchmark and Profiling. June 2015

Building supercomputers from commodity embedded chips

Intel Many Integrated Core (MIC) Architecture

Looking ahead with IBM i. 10+ year roadmap

Predicting Service Outage Using Machine Learning Techniques. HPE Innovation Center

HPE ProLiant ML350 Gen10 Server

IBM Power AC922 Server

Power Systems AC922 Overview. Chris Mann IBM Distinguished Engineer Chief System Architect, Power HPC Systems December 11, 2017

Fast-track Hybrid IT Transformation with Intel Data Center Blocks for Cloud

Smarter Systems In Your Cloud Deployment

HPE ProLiant DL580 Gen10 Server

Japan s post K Computer Yutaka Ishikawa Project Leader RIKEN AICS

Pedraforca: a First ARM + GPU Cluster for HPC

Toward Building up Arm HPC Ecosystem --Fujitsu s Activities--

unleashed the future Intel Xeon Scalable Processors for High Performance Computing Alexey Belogortsev Field Application Engineer

OpenPOWER Performance

Design a Remote-Office or Branch-Office Data Center with Cisco UCS Mini

ARM mbed Towards Secure, Scalable, Efficient IoT of Scale

Update of Post-K Development Yutaka Ishikawa RIKEN AICS

World s most advanced data center accelerator for PCIe-based servers

IBM Power 9 надежная платформа для развертывания облаков. Ташкент. Юрий Кондратенко Cross-Brand Sales Specialist

Erkenntnisse aus aktuellen Performance- Messungen mit LS-DYNA

Industrial-Strength High-Performance RISC-V Processors for Energy-Efficient Computing

Post-K: Building the Arm HPC Ecosystem

Intel Select Solutions for Professional Visualization with Advantech Servers & Appliances

Goro Watanabe. Bill King. OOW 2013 The Best Platform for Big Data and Oracle Database 12c. EVP Fujitsu R&D Center North America

Design a Remote-Office or Branch-Office Data Center with Cisco UCS Mini

ARM instruction sets and CPUs for wide-ranging applications

IBM Power Advanced Compute (AC) AC922 Server

SGI Overview. HPC User Forum Dearborn, Michigan September 17 th, 2012

Post-K Development and Introducing DLU. Copyright 2017 FUJITSU LIMITED

RapidIO.org Update.

ARM processors driving automotive innovation

Huawei KunLun Mission Critical Server. KunLun 9008/9016/9032 Technical Specifications

RapidIO.org Update. Mar RapidIO.org 1

Cisco UCS B230 M2 Blade Server

Resource Saving: Latest Innovation in Optimized Cloud Infrastructure

IBM System x family brochure

Huawei KunLun Mission Critical Server. KunLun 9008/9016/9032 Technical Specifications

INCREASE IT EFFICIENCY, REDUCE OPERATING COSTS AND DEPLOY ANYWHERE

Broadberry. Artificial Intelligence Server for Fraud. Date: Q Application: Artificial Intelligence

Data Sheet FUJITSU Server PRIMERGY BX924 S3 Dual Socket Server Blade

Barcelona Supercomputing Center

IBM Power Systems: Open Innovation to put data to work. Juan López-Vidriero Mata Director técnico de ventas de servidores

Interconnect Your Future

Next Generation Enterprise Solutions from ARM

Building the Ecosystem for ARM Servers

W H I T E P A P E R S e r v e r R e f r e s h t o M e e t t h e C h a n g i n g N e e d s o f I T?

S THE MAKING OF DGX SATURNV: BREAKING THE BARRIERS TO AI SCALE. Presenter: Louis Capps, Solution Architect, NVIDIA,

Highest Levels of Scalability Simplified Network Manageability Maximum System Productivity

Sugon TC6600 blade server

Intel Workstation Platforms

IBM System x family brochure

EPYC VIDEO CUG 2018 MAY 2018

Supercomputing with Commodity CPUs: Are Mobile SoCs Ready for HPC?

NERSC Site Update. National Energy Research Scientific Computing Center Lawrence Berkeley National Laboratory. Richard Gerber

Find the right platform for your server needs

Data Sheet FUJITSU Server PRIMERGY CX400 M4 Scale out Server

Gen-Z Memory-Driven Computing

TECHNOLOGIES CO., LTD.

Interconnect Your Future

Data Sheet Fujitsu Server PRIMERGY CX250 S2 Dual Socket Server Node

Comparison and analysis of parallel tasking performance for an irregular application

FUJITSU Server PRIMERGY CX400 M4 Workload-specific power in a modular form factor. 0 Copyright 2018 FUJITSU LIMITED

Optimising the Mantevo benchmark suite for multi- and many-core architectures

Maximizing heterogeneous system performance with ARM interconnect and CCIX

Lenovo Enterprise Portfolio

ACCELERATED COMPUTING: THE PATH FORWARD. Jen-Hsun Huang, Co-Founder and CEO, NVIDIA SC15 Nov. 16, 2015

Open storage architecture for private Oracle database clouds

Transcription:

Innovative Alternate Architecture for Exascale Computing Surya Hotha Director, Product Marketing

Cavium Corporate Overview Enterprise Mobile Infrastructure Data Center and Cloud Service Provider Cloud Multi-Core MIPS, ARM Processors, Security, SDN Switch and Server/Storage Connectivity ~$10B TAM

Arm & Cavium Enabling Alternate Architecture for HPC Most widely used CPU architecture out ships x86 by 20X Viable alternative for x86 Broad ecosystem Provides optimized compilers, tool chains & libraries for HPC The processor company for the infrastructure 16+ years of high perf multi core SoC development expertise ARM architecture licensee, broad portfolio of IP Established leadership in Arm server space: Two generations of custom Armv8-A dual socket server SoC, Xeon E5 class perf

Key Press at SC17 for HPE Helps Businesses Capitalize on High Performance Computing and Artificial Intelligence Applications with New High-Density Compute and Storage GIGABYTE Announces Production Availability of Cavium's ThunderX2-based Server Portfolio Cray Catapults Arm-Based Processors Into Supercomputing Ingrasys Announces Production Systems Based on Cavium's ThunderX2 Processor ARM Benchmarks Show HPC Ripe for Processor Shakeup Red Hat introduces Arm server support for Red Hat Enterprise Linux Microsemi Announces Adaptec Smart Storage Adapter Support for Cavium ThunderX2 ARM-Based CPUs 2016 Cavium, Inc. Confidential and Proprietary Information

Cavium Receives Honors in 2017 HPCwire Readers and Editors Choice Awards Best HPC Collaboration (Academia/Government/Industry) -- Editors Choice European R&D project Mont-Blanc investigates a new type of energyefficient computer architecture for HPC, leveraging Cavium s ThunderX2 Arm-based processor Top 5 New Products or Technologies to Watch -- Editors Choice Bull Sequana X1310, with Cavium s ThunderX2 Arm-based processor 2016 Cavium, Inc. Confidential and Proprietary Information

ThunderX Momentum in HPC Continues to Grow Memory Bandwidth Floating Point Integer 1.0 2.2X 2.5X 3X 2-3X better HPC performance Server platforms at World s premier HPC Labs Significant HPC Engagements

OEM Partners HPE Apollo 70 2U Industry Standard Form Factor using HPEs scalable HPC enclosure. 1U Tray for highest compute density (4x 2P Nodes per Enclosure) Hot pluggable trays, HPE Gen-10 Power supplies. Reliable and proven infrastructure. 2U Tray for addition of acceleration (2x 2P + 2GPU Nodes Per Enclosure) X1310- Bull Sequana Compute Blade TundraTM Extreme Scale XC50 Supercomputer Hot pluggable trays, 4x 2P Nodes per blade Chassis with 16 compute blades 128 Sockets Inter-Aries communication over backplane

LLC Distributed Cache Cavium Coherent Processor Interconnect Gen2 Cavium CN99XX - 1 st member of Family 24/28/32 Custom ARMv8 cores IO Subsystem Trust Zone Power Management Fully Out-Of-Order (OOO) Execution 1S and 2S Configuration CORES CORES CCPI2 Up to 8 Memory Controllers Up to 16 DIMMs per Socket Server Class RAS features Integrated IO CORES CORES CCPI2 Server class virtualization Integrated IOs Extensive Power Management Memory Controllers 2 nd gen Arm server SoC Delivers 2-3X higher performance

Key Differentiators of Higher Core & Thread Count Rich I/O Connectivity 33% Higher Memory Bandwidth Dual-Socket Support 33% Higher Memory Capacity

Comprehensive HPC Ecosystem Applications AWP-ODC SNAP RMG VPIC Software Stack Tools & Dev Environment Allinea Studio OS, Firmware IHV 10 Commercial Compiler, Tool Chain, Libraries available from Arm (Allinea Studio) Support for community codes rapidly increasing

ThunderX2 Memory Bandwidth Stream Triad @2666 Intel Xeon Gold 6148 Processor 98 GB/s 132 GB/s 0 20 40 60 80 100 120 140 ThunderX2 is 33% higher performance compared to Skylake

ThunderX2 SPEC CPU 2017 Rate Performance 0.78X Relative SPECCPU2017_Rate INT 1X 0.8X Relative SPECCPU2017_Rate FP 1X 0.76X 0.75X GNU ICC GNU GNU ICC GNU Intel Xeon Gold 6148 Processor Intel Xeon Gold 6148 Processor Cavium + Arm working on optimized compilers Target: ~15% higher perf icc18 -O3 -xhost -ipo -O3 -no-prec-div -auto-p32 -qopt-prefetch -qopt-mem-layout-trans=3 Gcc 12 7.2 -Ofast -flto -mabi=ilp32 *Measured ThunderX2 perf projected for production silicon w/ Turbo

Normalized Performance NAMD VPIC AWP-ODC OpenFOAM CloverLeaf TeaLeaf ThunderX2 Performance vs Intel Xeon Gold 1.3X 1X 1X 1X 1X 1X 1X Intel Cavium ThunderX2: 32 core 2.2GHz using gcc 7.2 and open source libraries (OpenBLAS, FFTW, OpenMPI ) Intel Xeon Gold 6148 Processor: 20 core, 2.4 GHz, with ICC18 and Intel optimized libraries (MKL, IntelMPI, Intel FFTW ) 13 Cavium + Arm + Partners working on vectorizing compiler & optimized libraries Target: ~15% higher perf

Cavium at SC17 Booth#349 Presentation @Arm HPC User Group, Cavium ThunderX2 Applications & Performance 2.30p-2.45p, Monday: Aspen Ballroom, Grand Hyatt Hotel Presentation, Innovative Alternate Architecture for Exascale 10.30a-11a, Tuesday: RedHat mini-theater Presentation @Exhibitor Forum, Innovative Alternate Architecture for Exascale 2.30p-3p, Tuesday: SC Room 501-502 Presentation @HPE, Innovative Alternate Architecture for Exascale 2.30p-3p, Tuesday: HPE mini-theater #925 Partner Panel: The Arm Software Ecosystem: Are We There Yet? 1:30-3 pm, Tuesday: SC room 201-203 Speakers include: nvidia, CERN, U. Bristol and U. Michigan BoF : Arm User Experience: Testbeds and deployment at HPC centers 5:15 pm, Tuesday: SC room 701 Speakers include: Oak Ridge, RIKEN and Bristol Presentation, Innovative Alternate Architecture for Exascale 2.30p-3p, Wednesday: SUSE booth Also look out on the exhibition floor for booths from Cavium partners. To schedule a private meeting, e-mail: Lilly.Ly@cavium.com

Innovations For Exascale Computing