GPU ACCELERATED COMPUTING. 1 st AlsaCalcul GPU Challenge, 14-Jun-2016, Strasbourg Frédéric Parienté, Tesla Accelerated Computing, NVIDIA Corporation

Similar documents
RECENT TRENDS IN GPU ARCHITECTURES. Perspectives of GPU computing in Science, 26 th Sept 2016

TECHNICAL OVERVIEW ACCELERATED COMPUTING AND THE DEMOCRATIZATION OF SUPERCOMPUTING

TECHNICAL OVERVIEW ACCELERATED COMPUTING AND THE DEMOCRATIZATION OF SUPERCOMPUTING

ACCELERATED COMPUTING: THE PATH FORWARD. Jensen Huang, Founder & CEO SC17 Nov. 13, 2017

ACCELERATED COMPUTING: THE PATH FORWARD. Jen-Hsun Huang, Co-Founder and CEO, NVIDIA SC15 Nov. 16, 2015

The State of Accelerated Applications. Michael Feldman

GPUs and the Future of Accelerated Computing Emerging Technology Conference 2014 University of Manchester

TECHNICAL OVERVIEW ACCELERATED COMPUTING AND THE DEMOCRATIZATION OF SUPERCOMPUTING

Accelerating High Performance Computing.

The Effect of In-Network Computing-Capable Interconnects on the Scalability of CAE Simulations

SUPERCHARGE DEEP LEARNING WITH DGX-1. Markus Weber SC16 - November 2016

NVIDIA GPU TECHNOLOGY UPDATE

TESLA V100 PERFORMANCE GUIDE. Life Sciences Applications

TESLA P100 PERFORMANCE GUIDE. Deep Learning and HPC Applications

Deep Learning mit PowerAI - Ein Überblick

TESLA P100 PERFORMANCE GUIDE. HPC and Deep Learning Applications

A NEW COMPUTING ERA. Shanker Trivedi Senior Vice President Enterprise Business at NVIDIA

SYNERGIE VON HPC UND DEEP LEARNING MIT NVIDIA GPUS

World s most advanced data center accelerator for PCIe-based servers

Building the Most Efficient Machine Learning System

The Tesla Accelerated Computing Platform

Interconnect Your Future

Is your IT Infrastructure Ready for Machine Learning & Artificial Intelligence?

TESLA V100 PERFORMANCE GUIDE May 2018

MACHINE LEARNING WITH NVIDIA AND IBM POWER AI

IBM Deep Learning Solutions

TESLA PLATFORM. Jan 2018

DEEP NEURAL NETWORKS CHANGING THE AUTONOMOUS VEHICLE LANDSCAPE. Dennis Lui August 2017

HETEROGENEOUS HPC, ARCHITECTURAL OPTIMIZATION, AND NVLINK STEVE OBERLIN CTO, TESLA ACCELERATED COMPUTING NVIDIA

Characterization and Benchmarking of Deep Learning. Natalia Vassilieva, PhD Sr. Research Manager

Autonomous Driving Solutions

HPC with the NVIDIA Accelerated Computing Toolkit Mark Harris, November 16, 2015

DEEP LEARNING ALISON B LOWNDES. Deep Learning Solutions Architect & Community Manager EMEA

Scaling in a Heterogeneous Environment with GPUs: GPU Architecture, Concepts, and Strategies

unleashed the future Intel Xeon Scalable Processors for High Performance Computing Alexey Belogortsev Field Application Engineer

Making Supercomputing More Available and Accessible Windows HPC Server 2008 R2 Beta 2 Microsoft High Performance Computing April, 2010

ENDURING DIFFERENTIATION. Timothy Lanfear

ENDURING DIFFERENTIATION Timothy Lanfear

TESLA ACCELERATED COMPUTING. Mike Wang Solutions Architect NVIDIA Australia & NZ

Introduction to High Performance Computing. Shaohao Chen Research Computing Services (RCS) Boston University

Technologies and application performance. Marc Mendez-Bermond HPC Solutions Expert - Dell Technologies September 2017

GROMACS (GPU) Performance Benchmark and Profiling. February 2016

Accelerated Platforms: The Future of Computing. Marc Hamilton, VP Solutions Architecture & Engineering, NVIDIA Korea AI Conference 2018

Deploying remote GPU virtualization with rcuda. Federico Silla Technical University of Valencia Spain

Building the Most Efficient Machine Learning System

Hybrid KAUST Many Cores and OpenACC. Alain Clo - KAUST Research Computing Saber Feki KAUST Supercomputing Lab Florent Lebeau - CAPS

DGX SYSTEMS: DEEP LEARNING FROM DESK TO DATA CENTER. Markus Weber and Haiduong Vo

WHAT S NEW IN CUDA 8. Siddharth Sharma, Oct 2016

THE HIGH-END VIRTUALIZATION COMPANY SERVER AGGREGATION CREATING THE POWER OF ONE

Cyclone SGI Cloud Computing for HPC. Christian Tanasescu Vice President Software Engineering

Interconnect Your Future Enabling the Best Datacenter Return on Investment. TOP500 Supercomputers, November 2017

INTRODUCING THE DGX FAMILY. Marc Domenech May 8, 2017

Fast Hardware For AI

CALMIP : HIGH PERFORMANCE COMPUTING

Accelerating Insights In the Technical Computing Transformation

A NEW COMPUTING ERA. DAVID B. KIRK, FELLOW NVIDIA AI Conference Singapore 2017

Clustering Optimizations How to achieve optimal performance? Pak Lui

Is remote GPU virtualization useful? Federico Silla Technical University of Valencia Spain

TACKLING THE CHALLENGES OF NEXT GENERATION HEALTHCARE

IBM CORAL HPC System Solution

Trends in systems and how to get efficient performance

Deep Learning Performance and Cost Evaluation

NVIDIA DGX SYSTEMS PURPOSE-BUILT FOR AI

TOWARDS ACCELERATED DEEP LEARNING IN HPC AND HYPERSCALE ARCHITECTURES Environnement logiciel pour l apprentissage profond dans un contexte HPC

Deep Learning: Transforming Engineering and Science The MathWorks, Inc.

Remote GPU virtualization: pros and cons of a recent technology. Federico Silla Technical University of Valencia Spain

Dell EMC Ready Bundle for HPC Digital Manufacturing Dassault Systѐmes Simulia Abaqus Performance

Deep Learning Performance and Cost Evaluation

HPC Solution. Technology for a New Era in Computing

FUJITSU Server PRIMERGY CX400 M4 Workload-specific power in a modular form factor. 0 Copyright 2018 FUJITSU LIMITED

APPROACHES. TO GPU COMPUTING Libraries, OpenACC Directives, and Languages

In partnership with. VelocityAI REFERENCE ARCHITECTURE WHITE PAPER

STRATEGIES TO ACCELERATE VASP WITH GPUS USING OPENACC. Stefan Maintz, Dr. Markus Wetzstein

Object recognition and computer vision using MATLAB and NVIDIA Deep Learning SDK

19. prosince 2018 CIIRC Praha. Milan Král, IBM Radek Špimr

GPU-Accelerated Deep Learning

LAMMPS-KOKKOS Performance Benchmark and Profiling. September 2015

FUSION1200 Scalable x86 SMP System

Erkenntnisse aus aktuellen Performance- Messungen mit LS-DYNA

The rcuda technology: an inexpensive way to improve the performance of GPU-based clusters Federico Silla

Gen-Z Memory-Driven Computing

Game-changing Extreme GPU computing with The Dell PowerEdge C4130

S THE MAKING OF DGX SATURNV: BREAKING THE BARRIERS TO AI SCALE. Presenter: Louis Capps, Solution Architect, NVIDIA,

The Exascale Era Has Arrived

DDN About Us Solving Large Enterprise and Web Scale Challenges

NAMD GPU Performance Benchmark. March 2011

GPU Computing and Its Applications

Arm in HPC. Toshinori Kujiraoka Sales Manager, APAC HPC Tools Arm Arm Limited

DEEP NEURAL NETWORKS AND GPUS. Julie Bernauer

GPU Computing Ecosystem

Introduc)on to High Performance Compu)ng Advanced Research Computing

HP GTC Presentation May 2012

S CUDA on Xavier

Unlock business value with HPC & Artificial Intelligence. FORUM TERATEC June 19, 2018 José RODRIGUES HPC Sales Manager

POWERING THE AI REVOLUTION JENSEN HUANG, FOUNDER & CEO GTC 2017

A NEW COMPUTING ERA JENSEN HUANG, FOUNDER & CEO GTC CHINA 2017

ENABLING NEW SCIENCE GPU SOLUTIONS

IBM Power AC922 Server

DEEP LEARNING WITH GPUS Maxim Milakov, Senior HPC DevTech Engineer, NVIDIA

Headline in Arial Bold 30pt. Visualisation using the Grid Jeff Adie Principal Systems Engineer, SAPK July 2008

Transcription:

GPU ACCELERATED COMPUTING 1 st AlsaCalcul GPU Challenge, 14-Jun-2016, Strasbourg Frédéric Parienté, Tesla Accelerated Computing, NVIDIA Corporation

GAMING PRO ENTERPRISE VISUALIZATION DATA CENTER AUTO THE WORLD LEADER IN VISUAL COMPUTING

It s time to start planning for the end of Moore s Law, and it s worth pondering how it will end, not just when. Robert Colwell Director, Microsystems Technology Office, DARPA

HOW GPU ACCELERATION WORKS Application Code Compute-Intensive Functions GPU 5% of Code Rest of Sequential CPU Code CPU +

TESLA ACCELERATED COMPUTING PLATFORM Focused on Co-Design from Top to Bottom TFLOPS Fast GPU Engineered for High Throughput NVIDIA GPU x86 CPU Productive Programming Model & Tools Expert Co-Design Accessibility 5,5 5,0 4,5 P100 APPLICATION 4,0 3,5 MIDDLEWARE 3,0 2,5 2,0 1,5 1,0 0,5 0,0 M1060 M2090 K20 K80 Fast GPU + Strong CPU 2008 2010 2012 2014 2016 SYS SW LARGE SYSTEMS PROCESSOR

DATA CENTER TODAY Well-suited For Transactional Workloads Running on Lots of Nodes THE DREAM For Important Workloads with Infinite Need for Computing Network Fabric Server Racks Commodity Computers Interconnected with Vast Network Overhead Few Lightning-Fast Nodes with Performance of Thousands of Commodity Computers

70% OF TOP HPC APPS ACCELERATED INTERSECT360 SURVEY OF TOP APPS TOP 25 APPS IN SURVEY GROMACS SIMULIA Abaqus NAMD AMBER ANSYS Mechanical Exelis IDL MSC NASTRAN LAMMPS NWChem LS-DYNA Schrodinger Gaussian GAMESS 9 of top 10 Apps Accelerated 35 of top 50 Apps Accelerated ANSYS Fluent WRF VASP OpenFOAM CHARMM Quantum Espresso ANSYS CFX Star-CD CCSM COMSOL Star-CCM+ BLAST Intersect360, Nov 2015 HPC Application Support for GPU Computing = All popular functions accelerated = Some popular functions accelerated = In development = Not supported

400+ GPU-Accelerated Applications www.nvidia.com/appscatalog

TESLA K80: 10X FASTER ON REAL-WORLD APPS 15x K80 CPU 10x 5x 0x Benchmarks Molecular Dynamics Quantum Chemistry Physics CPU: 12 cores, E5-2697v2 @ 2.70GHz. 64GB System Memory, CentOS 6.2 GPU: Single Tesla K80, Boost enabled

ns/day BIG PROBLEMS NEED FAST COMPUTERS 2.5x Faster than the Largest CPU Data Center 50 45 AMBER Simulation of CRISPR, Nature s Tool for Genome Editing 1 Node with 4 P100 GPUs 40 35 30 25 20 15 10 5 48 CPU Nodes Comet Supercomputer Biotech discovery of the century -MIT Technology Review 12/2014 0 0 20 40 60 80 100 AMBER 16 Pre-release, CRSPR based on PDB ID 5f9r, 336,898 atoms CPU: Dual Socket Intel E5-2680v3 12 cores, 128 GB DDR4 per node, FDR IB # of Processors (CPUs and GPUs)

Tesla Accelerates Discoveries Using a supercomputer powered by the Tesla Platform with over 3,000 Tesla accelerators, University of Illinois scientists performed the first all-atom simulation of the HIV virus and discovered the chemical structure of its capsid the perfect target for fighting the infection. Without GPU, the supercomputer would need to be 5x larger for similar performance.

A NEW COMPUTING MODEL 100% 90% 80% 70% 60% 50% 40% 30% 20% 10% 0% ImageNet Traditional CV Deep Learning 2009 2010 2011 2012 2013 2014 2015 2016 Traditional Computer Vision Experts + Time Deep Learning Object Detection DNN + Data + HPC Deep Learning Achieves Superhuman Results

EVERY INDUSTRY WANTS DEEP LEARNING Cloud Service Provider Medicine Media & Entertainment Security & Defense Autonomous Machines Image/Video classification Cancer cell detection Video captioning Face recognition Pedestrian detection Speech recognition Diabetic grading Content based search Video surveillance Lane tracking Natural language processing Drug discovery Real time translation Cyber security Recognize traffic sign

TESLA FOR SIMULATION LIBRARIES DIRECTIVES LANGUAGES ACCELERATED COMPUTING TOOLKIT TESLA ACCELERATED COMPUTING

END-TO-END PRODUCT FAMILY HYPERSCALE HPC MIXED-APPS HPC STRONG-SCALING HPC FULLY INTEGRATED DL SUPERCOMPUTER Tesla M4, M40 Tesla K80 Tesla P100 DGX-1 Hyperscale deployment for DL training, inference, video & image processing HPC data centers running mix of CPU and GPU workloads Hyperscale & HPC data centers running apps that scale to multiple GPUs For customers who need to get going now with fully integrated solution

NVIDIA COMPUTEWORKS COMPUTEWORKS GAMEWORKS VRWORKS DESIGNWORKS DRIVEWORKS JETPACK CUDA 8 cudnn 5 nvgraph IndeX plug-in for ParaView and other technologies such as: AMGx, cusolver, cusparse, OpenACC, NSIGHT, THRUST

Sep 28-29, 2016 Amsterdam www.gputechconf.eu #GTC16 EUROPE S BRIGHTEST MINDS & BEST IDEAS DEEP LEARNING & ARTIFICIAL INTELLIGENCE SELF-DRIVING CARS VIRTUAL REALITY & AUGMENTED REALITY SUPERCOMPUTING & HPC GTC Europe is a two-day conference designed to expose the innovative ways developers, businesses and academics are using parallel computing to transform our world. 2 Days 800 Attendees 50+ Exhibitors 50+ Speakers 15+ Tracks 15+ Workshops 1-to-1 Meetings

Q&A