GTC 2018: Jensen Huang, Founder & CEO

1 GTC 2018. Jensen Huang, Founder & CEO

5 SCREEN-SPACE AMBIENT OCCLUSION; BAKED LIGHTING

6 GLOBAL ILLUMINATION

7 SCREEN-SPACE REFLECTIONS; ENVIRONMENT MAPS

8 RAY TRACED REFLECTIONS

9 SCREEN-SPACE REFRACTION; DEPTH SORTING

10 CAUSTICS

11 SUBSURFACE SHADING APPROXIMATION

12 SUBSURFACE SCATTERING

14 ANNOUNCING NVIDIA RTX TECHNOLOGY

15 ANNOUNCING QUADRO GV100 WITH NVIDIA RTX TECHNOLOGY: GIANT LEAP FOR REAL-TIME COMPUTER GRAPHICS. 2 GV100s Connected by NVLink2; 64GB HBM2 Memory; 10,240 CUDA Cores; 236 TFLOPS Tensor Cores. NVIDIA OptiX, Vulkan, Microsoft DXR; NVIDIA RTX Technology; NVIDIA Volta GPU.

16 ONE BILLION IMAGES RENDERED EVERY YEAR. GAMING: 400 Games. MEDIA & ENTERTAINMENT: 500 Movies. PRODUCT DESIGN: 12M Designers. ARCHITECTURE: 150,000 Architects.

17 TRADITIONAL RENDER FARM: 280 Dual-CPU Servers; 168 kW.

18 NVIDIA RTX QUADRO GV100, BIG SAVINGS FOR RENDERING: 14 Quad-GPU Servers; 24 kW; 1/5 the Cost, 1/7 the Space, 1/7 the Power.

19 NVIDIA RTX EXCITEMENT. TOOLS; ENGINES; GAMING; MEDIA & ENTERTAINMENT; PRODUCT DESIGN; ARCHITECTURE. "With RTX we can now do ray tracing renders interactively. It's just fantastic!" (Sébastien Guichou, CTO, Isotropix). "NVIDIA RTX opens the door to make ray tracing a reality!" (Kim Libreri, CTO, Epic Games).

20 RISE OF GPU COMPUTING. 820,000 GPU Developers (10X in 5 Yrs); 8M CUDA Downloads (5X in 5 Yrs); GTC Registrations (4X in 5 Yrs); Total GPU FLOPS of Top 50 Systems (15X in 5 Yrs). [Chart: 40 Years of CPU Trend Data, with series labeled "GPU-Accelerated Computing" and "Single-threaded perf, 1.5X per year". Original data up to the year 2010 collected and plotted by M. Horowitz, F. Labonte, O. Shacham, K. Olukotun, L. Hammond, and C. Batten; new plot and data collected by K. Rupp.]

21 SCIENCE NEEDS SUPERCHARGED COMPUTERS. Reinventing the Lithium-Ion Battery: 7 Days on Titan. Mapping the Earth's Core: 17 Days on Titan. Cloud-Resolving Climate Simulation: 840 Days on Piz Daint. Understanding HIV's Structure: 16 Days on Blue Waters.

22 SUPERCHARGED COMPUTING. Fermi GPU Server 2013. HPC Applications: Amber 12, NAMD 2.9. GPU Acceleration Stack: cuBLAS 5.0, cuFFT 5.0, NPP 5.0, CUDA 5.0, cuRAND 5.0, cuSPARSE 5.0; Res Mgr R304; BaseOS CentOS. Volta GPU Server 2018. HPC Applications: Amber 16, CHROMA 2018, Gyrokinetic TC 2017, LAMMPS 2018, MILC 2018, NAMD 2.13, Quantum Esp. 6.1, SPECFEM3D 2018. GPU Acceleration Stack: cuBLAS 9.0, cuFFT 9.0, NPP 9.0, CUDA 9.0, cuRAND 9.0, cuSPARSE 9.0; Res Mgr R384; BaseOS Ubuntu. [Chart labels: GPU Accelerated Computing; Moore's Law; CPU.] Measured performance of Amber, CHROMA, GTC, LAMMPS, MILC, NAMD, Quantum Espresso, SPECFEM3D.

23 TRADITIONAL HPC CLUSTER: 600 Dual-CPU Servers; 360 kW.

24 NVIDIA TESLA V100, BIG SAVINGS FOR HPC: 30 Quad-GPU Servers; 48 kW; 1/5 the Cost, 1/7 the Space, 1/7 the Power.

25 CLARA MEDICAL IMAGING SUPERCOMPUTER. IMAGING & VISUALIZATION APPS; CUDA, CUDNN, TENSORRT, OGL, RTX; GPU CONTAINERS; VGPU; NVIDIA GPU SERVER.

26 CLARA MEDICAL IMAGING SUPERCOMPUTER. IMAGING & VISUALIZATION APPS; CUDA, CUDNN, TENSORRT, OGL, RTX; GPU CONTAINERS; VGPU; NVIDIA GPU SERVER. DL-BASED IMAGE RECONSTRUCTION; DL-BASED BRAIN SEGMENTATION; CINEMATIC RENDERING.

28 IMAGING DEVELOPMENT PARTNERS. ULTRASOUND, MRI, CT, X-RAY, MAMMO, PET. HEALTHCARE PROVIDERS; STARTUPS; IMAGING COMPANIES.

29 NVIDIA AI PLATFORM. Announcing NEW 32GB (2X): Tesla V100; DGX-1 and DGX Station. Every Cloud; Every Computer Maker; NVIDIA GPU Cloud; NVIDIA AI Inference; TITAN V.

30 AlexNet

31 CAMBRIAN EXPLOSION. Convolutional Networks; Recurrent Networks; Generative Adversarial Networks; Reinforcement Learning; New Species. Capsule Nets, Encoder/Decoder, ReLu, BatchNorm, LSTM, GRU, Beam Search, 3D-GAN, MedGAN, Conditional GAN, DQN, Simulation, Mixture of Experts, Neural Collaborative Filtering, Concat, Dropout, Pooling, WaveNet, CTC, Attention, Coupled GAN, Speech Enhancement GAN, DDPG, Block Sparse LSTM.

33 THE WORLD WANTS A GIGANTIC GPU

34 THE WORLD'S LARGEST GPU. 16 Tesla V100 32GB Connected by NVSwitch; On-chip Memory Fabric Semantic Extended Across All GPUs; 512GB HBM2 and 14.4TB/sec Aggregate; 81,920 CUDA Cores; 2,000 TFLOPS Tensor Cores.

35 THE WORLD'S LARGEST GPU. 2B Transistors; TSMC 12FFN.

36 THE WORLD'S LARGEST GPU. 18 Links; 25Gbps * 8; Bi-directional.

37 THE WORLD'S LARGEST GPU. 7.2 Terabits/sec or 900 GB/sec.

38 THE WORLD'S LARGEST GPU. Every GPU-to-GPU at 300 GB/sec.
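
The link figures on slides 36 to 38 agree with one another, which a few lines of arithmetic can confirm. The sketch below takes 8 lanes at 25 Gbps per direction from slide 36 and assumes that each Tesla V100 exposes 6 NVLink links, a figure the slides do not state; the function name is illustrative only.

```python
# Back-of-the-envelope check of the NVSwitch bandwidth figures on slides 36-38.
# Assumption (not on the slides): each Tesla V100 has 6 NVLink links.

LANE_GBPS = 25          # gigabits per second per lane, per direction (slide 36)
LANES_PER_LINK = 8      # lanes per link (slide 36)
DIRECTIONS = 2          # links are bi-directional (slide 36)


def link_bandwidth_gbps(num_links: int) -> int:
    """Aggregate bi-directional bandwidth in gigabits per second."""
    return num_links * LANES_PER_LINK * LANE_GBPS * DIRECTIONS


switch_gbps = link_bandwidth_gbps(18)   # 18 links per NVSwitch (slide 36)
gpu_gbps = link_bandwidth_gbps(6)       # assumed 6 links per V100

switch_tbps = switch_gbps / 1000        # terabits per second
switch_gbytes = switch_gbps / 8         # gigabytes per second
gpu_gbytes = gpu_gbps / 8               # gigabytes per second

print(f"NVSwitch: {switch_tbps} Tb/s = {switch_gbytes} GB/s")
print(f"Per GPU:  {gpu_gbytes} GB/s")
```

Under these assumptions the switch total matches the 7.2 Terabits/sec (900 GB/sec) on slide 37, and the per-GPU total matches the 300 GB/sec on slide 38.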

39 ANNOUNCING NVIDIA DGX-2, THE LARGEST GPU EVER CREATED: 2 PFLOPS; 512GB HBM2; 10 kW; 350 lbs.
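
The aggregate DGX-2 figures can be read as 16 times the per-GPU numbers. The following sketch checks that reading; only the 32 GB memory size comes from the slides, while the other per-GPU values (5,120 CUDA cores, 900 GB/s HBM2 bandwidth, 125 Tensor TFLOPS) are assumptions taken from published Tesla V100 specifications.

```python
# Sanity check: DGX-2 aggregates as 16x per-GPU Tesla V100 32GB figures.
# Per-GPU values other than the 32 GB memory size are assumptions taken
# from published V100 specifications, not from the slides.

NUM_GPUS = 16

per_gpu = {
    "hbm2_gb": 32,            # from the slides: "Tesla V100 32GB"
    "hbm2_gb_per_s": 900,     # assumed per-GPU memory bandwidth
    "cuda_cores": 5120,       # assumed per-GPU CUDA core count
    "tensor_tflops": 125,     # assumed per-GPU Tensor Core throughput
}

# Multiply every per-GPU figure by the number of GPUs in the system.
aggregate = {name: value * NUM_GPUS for name, value in per_gpu.items()}

for name, value in aggregate.items():
    print(f"{name}: {value}")
```

With these inputs the totals line up with the slide: 512 GB of HBM2, 14,400 GB/s (14.4 TB/sec), 81,920 CUDA cores, and 2,000 TFLOPS (2 PFLOPS).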

40 10X IN 6 MONTHS. DGX-1 V100 16GB, SEPT '17. Framework: PyTorch 0.2, TensorFlow 1.3, MXNet 0.11, Caffe, CNTK 2.0, Python 2.7. System Software Stack: NCCL, cuDNN, cuBLAS 9.0, cuFFT 9.0, NPP 9.0, CUDA 9.0; Res Mgr R384; BaseOS. DGX-2 V100 32GB, MAR '18. Framework: PyTorch 0.3, TensorFlow 1.7, MXNet 1.0, Caffe, CNTK 2.3, Python 2.7 or 3.6. System Software Stack: NCCL 2.2, cuDNN 7.1, cuBLAS 9.2, cuFFT 9.2, NPP 9.2, CUDA 9.2; Res Mgr R396; BaseOS. [Chart: Time to Train FAIRSEQ, in days, DGX-1 vs. DGX-2; 1.5 days on DGX-2.] Fairseq is a neural machine translation network, published by Facebook in May '17. Fairseq is trained with the WMT '14 English-French dataset in 55 epochs.

41 ANNOUNCING NVIDIA DGX-2: $399K; Available in Q3.

42 TRADITIONAL HYPERSCALE CLUSTER: 300 Dual-CPU Servers; $3M; 180 kW.

43 NVIDIA DGX-2 FOR DEEP LEARNING, Big Savings for Deep Learning: 1 DGX-2; 10 kW; 1/8 the Cost, 1/60 the Space, 1/18 the Power.
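
The cost and power fractions on slide 43 can be reproduced from the figures on slides 41 and 42. This is a minimal sketch using only those numbers; the dictionary layout and variable names are illustrative, and the space fraction is left out because the slides give no rack dimensions.

```python
# Ratio check for the DGX-2 comparison, using only figures from slides 41-43.

cluster = {"cost_usd": 3_000_000, "power_kw": 180}   # 300 dual-CPU servers
dgx2 = {"cost_usd": 399_000, "power_kw": 10}         # 1 DGX-2

cost_ratio = cluster["cost_usd"] / dgx2["cost_usd"]
power_ratio = cluster["power_kw"] / dgx2["power_kw"]

print(f"cost:  1/{cost_ratio:.1f}")
print(f"power: 1/{power_ratio:.0f}")
```

The power ratio comes out at exactly 18, and the cost ratio at roughly 7.5, which the slide appears to round to 1/8.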

44 500X IN 5 YEARS. 2 GTX 580s, DEC '12: AlexNet. Framework: cuda-convnet. System Software Stack: NCCL N/A, cuDNN N/A, cuBLAS 5.0, cuFFT 5.0, NPP 5.0, CUDA 5.0; Res Mgr R. DGX-2, MAR '18. Framework: NV Caffe. System Software Stack: NCCL 2.2, cuDNN 7.1, cuBLAS 9.2, cuFFT 9.2, NPP 9.2, CUDA 9.2; Res Mgr R. [Chart: Time to Train AlexNet, 2 GTX 580s (days) vs. DGX-2 (min).]

45 NVIDIA GPU CLOUD: Optimized Stacks for Every Cloud. 20,000+ Registered Organizations; 30 Containers; NOW on AWS, GCP, AliCloud, Oracle Cloud, DGX.

46 PLASTER

47 NVIDIA AI INFERENCE. ASR: RNN++. SPEECH SYNTH: DGN, S2S. RECOMMENDER: MLP-NCF. NLP: RNN. IMAGE / VIDEO: CNN. 30M HYPERSCALE SERVERS. TensorRT (CNNs); TensorRT 2 (INT8); TensorRT 3 (Tensor Core); TensorRT 4 (TensorFlow Integration, Kaldi Optimization, ONNX, WinML). Timeline: Sept '16, Apr '17, Sept '17, Apr '18. 190X IMAGE / VIDEO (ResNet-50 with TensorFlow Integration); 50X NLP (GNMT); 45X RECOMMENDER (Neural Collaborative Filtering); 36X SPEECH SYNTH (WaveNet); 60X ASR (DeepSpeech 2 DNN). All speed-ups are chip-to-chip CPU to GV

48 ANNOUNCING KUBERNETES ON NVIDIA GPUS. Scale-up Thousands of GPUs Instantly; Multi-region, Self-healing Cluster Orchestration; GPU Optimized Out-of-the-Box. KUBERNETES GPU ACCELERATED NVIDIA GPU CLOUD APPLICATIONS NVIDIA GPU CONTAINERS DOCKER NVIDIA GPUs; AWS, GCP, AZURE; NVIDIA GPU SERVERS.

50 NVIDIA AI INFERENCE. CSPs; VIDEO ANALYTICS; SPEECH; RECOMMENDATION SYSTEMS; MAPPING; AUTOMOTIVE; ROBOTICS; SMART CITIES; ETAIL; HEALTHCARE; MANUFACTURING. "NVIDIA's inference platform made it possible to derive real-time understanding of live videos." (Nicolas Koumchatzky, Head of Cortex, Twitter). "We believe TensorRT could dramatically improve productivity for our enterprise customers." (Markus Noga, Head of Machine Learning, SAP).

51 NVIDIA AI PLATFORM. Tesla V100: NEW 32GB. DGX Systems: NEW with V100 32GB; NEW DGX-2. Every Cloud: NGC Now on AWS, GCP, AliCloud, Oracle. NVIDIA GPU Cloud: 30 GPU-Optimized Containers. NVIDIA AI Inference: NEW TensorRT 4, TensorFlow, Kaldi, ONNX, WinML. TITAN V: Out of stock!

52 NVIDIA RESEARCH: RECENT WORK. 200 RESEARCHERS. Seattle, Redmond, Santa Clara, Salt Lake City, St. Louis, Austin, Westford, Charlottesville, Durham, Lund, Berlin, Helsinki. Bill Dally, NVIDIA Chief Scientist. Graphics; Deep Learning; Robotics; Computer Vision; Parallel Architectures; Programming Systems; Circuits; VLSI; Networks. RTX; CNN Image Inpainting; NVSwitch; Noise-to-Noise Denoising; CuDNN; Progressive GAN.

53 NVIDIA RESEARCH: CONDITIONAL GAN. 200 RESEARCHERS. Seattle, Redmond, Santa Clara, Salt Lake City, St. Louis, Austin, Westford, Charlottesville, Durham, Lund, Berlin, Helsinki. Bill Dally, NVIDIA Chief Scientist. Graphics; Deep Learning; Robotics; Computer Vision; Parallel Architectures; Programming Systems; Circuits; VLSI; Networks.

54 EVERYTHING THAT MOVES WILL BE AUTONOMOUS: Cars, Robotaxis, Trucks, Delivery Vans, Buses, Tractors.

55 NVIDIA DRIVE END-TO-END PLATFORM: COLLECT DATA; TRAIN MODELS; SIMULATE; DRIVE. Cars, Pedestrians, Path; Lanes, Signs, Lights.

56 NVIDIA PERCEPTION INFRASTRUCTURE: LARGE-SCALE DEEP LEARNING MODEL DEVELOPMENT. Data Factory; Train on NVIDIA DGX; Library of Labeled Data. Workflow, Tools, Supercomputing Infrastructure. Data Ingest, Labeling, Training, Validation, Adaptation. Automation, Best Model Discovery, Traceability, Reproducibility. Purpose-built for Safety Standards of Automotive. Data is the new source code. DRIVE Pegasus; Validate/Verify; Test Data.

59 NVIDIA DRIVE ROADMAP: ONE ARCHITECTURE. DRIVE Pegasus; Orin; Auto-Grade; Super Energy-Efficient; ASIL-D Functional Safety. DRIVE PX 2; DRIVE Xavier; DRIVE PX Parker.

60 SIMULATION: THE PATH TO BILLIONS OF MILES. World drives trillions of miles each year. U.S. has 770 accidents per billion miles. A fleet of 20 test cars covers 1 million miles per year.
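
The three figures on slide 60 imply how slowly a physical test fleet accumulates statistically meaningful mileage. The sketch below works through that arithmetic using only the slide's numbers; the variable names are illustrative, and treating the accident rate as uniform across all miles is a simplifying assumption.

```python
# Arithmetic behind slide 60, using only the figures stated on the slide.
# Simplifying assumption: accidents are spread uniformly over miles driven.

BILLION = 1_000_000_000
ACCIDENTS_PER_BILLION_MILES = 770   # U.S. figure from the slide
FLEET_SIZE = 20                     # test cars
FLEET_MILES_PER_YEAR = 1_000_000    # whole fleet, per year

# Average distance between accidents at the stated rate.
miles_per_accident = BILLION / ACCIDENTS_PER_BILLION_MILES

# Time for the test fleet to log one billion miles.
years_per_billion_miles = BILLION / FLEET_MILES_PER_YEAR

# Accidents a fleet would expect per year if it matched the stated rate.
expected_accidents_per_year = (
    FLEET_MILES_PER_YEAR * ACCIDENTS_PER_BILLION_MILES / BILLION
)

print(f"miles per accident:          {miles_per_accident:,.0f}")
print(f"years to drive 1B miles:     {years_per_billion_miles:,.0f}")
print(f"miles per car per year:      {FLEET_MILES_PER_YEAR // FLEET_SIZE:,}")
print(f"expected accidents per year: {expected_accidents_per_year:.2f}")
```

On these numbers a 20-car fleet would need on the order of a thousand years to reach a billion miles, which is the gap that the slide's title points to simulation to close.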

61 ANNOUNCING NVIDIA DRIVE SIM AND CONSTELLATION, AV VALIDATION SYSTEM. Virtual Reality AV Simulator; Same Architecture as DRIVE Computer. Simulate Rare and Difficult Conditions, Recreate Scenarios, Run Regression Tests, Drive Billions of Virtual Miles. 10,000 Constellations Drive 3B Miles per Year.
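
The closing claim, 10,000 Constellations driving 3B miles per year, can be broken down per unit. The sketch below does so under the assumption that every unit runs continuously all year, which the slide does not state; the variable names are illustrative.

```python
# Per-unit breakdown of "10,000 Constellations Drive 3B Miles per Year".
# Assumption (not on the slide): each unit runs 24 hours a day, 365 days a year.

CONSTELLATIONS = 10_000
VIRTUAL_MILES_PER_YEAR = 3_000_000_000
HOURS_PER_YEAR = 365 * 24

miles_per_unit_per_year = VIRTUAL_MILES_PER_YEAR / CONSTELLATIONS
average_mph_per_unit = miles_per_unit_per_year / HOURS_PER_YEAR

print(f"miles per unit per year: {miles_per_unit_per_year:,.0f}")
print(f"average speed per unit:  {average_mph_per_unit:.1f} mph")
```

Under that assumption each unit would average a little over 34 miles per hour around the clock, a speed in the range of ordinary driving.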

66 370 PARTNERS DEVELOPING ON NVIDIA DRIVE. CARS; TRUCKS; MOBILITY SERVICES; SUPPLIERS; MAPPING; LIDAR; CAMERA / RADAR; STARTUPS.

67 ROBOTICS BOOSTS EVERY INDUSTRY: Delivery, Consumer, Healthcare, Agriculture, Retail, Logistics, Manufacturing.

68 NVIDIA ISAAC ROBOTICS PLATFORM: SIMULATION; TRAINING; DEPLOYMENT; SDK.

71 THE GPU COMPUTING REVOLUTION CONTINUES. QUADRO GV100; NEW TESLA V100 32GB; NEW TENSORRT 4 AND MORE; DRIVE SIM & CONSTELLATION; ISAAC; NVIDIA RTX; NEW DGX-2, 1ST 2PF COMPUTER, 300 SERVERS IN A BOX; Kubernetes On NVIDIA GPUs; ONE ARCHITECTURE: XAVIER - PEGASUS - ORIN; CLARA. GRAPHICS; AI; AUTO; NEW PLATFORMS.


More information

Deploying Deep Learning Networks to Embedded GPUs and CPUs

Deploying Deep Learning Networks to Embedded GPUs and CPUs Deploying Deep Learning Networks to Embedded GPUs and CPUs Rishu Gupta, PhD Senior Application Engineer, Computer Vision 2015 The MathWorks, Inc. 1 MATLAB Deep Learning Framework Access Data Design + Train

More information

MIXED PRECISION TRAINING OF NEURAL NETWORKS. Carl Case, Senior Architect, NVIDIA

MIXED PRECISION TRAINING OF NEURAL NETWORKS. Carl Case, Senior Architect, NVIDIA MIXED PRECISION TRAINING OF NEURAL NETWORKS Carl Case, Senior Architect, NVIDIA OUTLINE 1. What is mixed precision training with FP16? 2. Considerations and methodology for mixed precision training 3.

More information

NVIDIA TESLA V100 GPU ARCHITECTURE THE WORLD S MOST ADVANCED DATA CENTER GPU

NVIDIA TESLA V100 GPU ARCHITECTURE THE WORLD S MOST ADVANCED DATA CENTER GPU NVIDIA TESLA V100 GPU ARCHITECTURE THE WORLD S MOST ADVANCED DATA CENTER GPU WP-08608-001_v1.1 August 2017 WP-08608-001_v1.1 TABLE OF CONTENTS Introduction to the NVIDIA Tesla V100 GPU Architecture...

More information

WELCOME. Shawn Simmons, Investor Relations May 10, 2017

WELCOME. Shawn Simmons, Investor Relations May 10, 2017 WELCOME Shawn Simmons, Investor Relations May 10, 2017 Safe Harbor Forward-Looking Statements Except for the historical information contained therein, certain matters in these presentations including,

More information

NVIDIA DEEP LEARNING PLATFORM

NVIDIA DEEP LEARNING PLATFORM TECHNICAL OVERVIEW NVIDIA DEEP LEARNING PLATFORM Giant Leaps in Performance and Efficiency for AI Services, From the Data Center to the Network s Edge Introduction Artificial intelligence (AI), the dream

More information

DEEP LEARNING AND DIGITS DEEP LEARNING GPU TRAINING SYSTEM

DEEP LEARNING AND DIGITS DEEP LEARNING GPU TRAINING SYSTEM DEEP LEARNING AND DIGITS DEEP LEARNING GPU TRAINING SYSTEM AGENDA 1 Introduction to Deep Learning 2 What is DIGITS 3 How to use DIGITS Practical DEEP LEARNING Examples Image Classification, Object Detection,

More information

The Exascale Era Has Arrived

The Exascale Era Has Arrived Technology Spotlight The Exascale Era Has Arrived Sponsored by NVIDIA Steve Conway, Earl Joseph, Bob Sorensen, and Alex Norton November 2018 EXECUTIVE SUMMARY Earlier this year, scientists broke the exascale

More information

Advancing State-of-the-Art of Autonomous Vehicles and Robotics Research using AWS GPU Instances

Advancing State-of-the-Art of Autonomous Vehicles and Robotics Research using AWS GPU Instances Advancing State-of-the-Art of Autonomous Vehicles and Robotics Research using AWS GPU Instances Adrien Gaidon - Machine Learning Lead, Toyota Research Institute Mike Garrison - Senior Systems Engineer,

More information

NVIDIA Accelerators Models HPE NVIDIA GV100 Nvlink Bridge Kit HPE NVIDIA Tesla V100 FHHL 16GB Computational Accelerator

NVIDIA Accelerators Models HPE NVIDIA GV100 Nvlink Bridge Kit HPE NVIDIA Tesla V100 FHHL 16GB Computational Accelerator Overview Hewlett Packard supports, on select HPE ProLiant servers, computational accelerator modules based on NVIDIA Tesla, NVIDIA GRID, and NVIDIA Quadro Graphical Processing Unit (GPU) technology. The

More information

INTRODUCING THE DGX FAMILY. Marc Domenech May 8, 2017

INTRODUCING THE DGX FAMILY. Marc Domenech May 8, 2017 INTRODUCING THE DGX FAMILY Marc Domenech May 8, 2017 NVIDIA Pioneered GPU Computing Founded 1993 $7B 9,500 Employees 100M NVIDIA GeForce Gamers The world s largest gaming platform Pioneering AI computing

More information

Cisco UCS C480 ML M5 Rack Server Performance Characterization

Cisco UCS C480 ML M5 Rack Server Performance Characterization White Paper Cisco UCS C480 ML M5 Rack Server Performance Characterization The Cisco UCS C480 ML M5 Rack Server platform is designed for artificial intelligence and machine-learning workloads. 2018 Cisco

More information

HOW LEADING-EDGE COMPUTING TECHNOLOGIES ARE HELPING REIMAGINE CITIES OF THE FUTURE. Andrew Rink, AEC Industry Marketing GTC China - November 22, 2018

HOW LEADING-EDGE COMPUTING TECHNOLOGIES ARE HELPING REIMAGINE CITIES OF THE FUTURE. Andrew Rink, AEC Industry Marketing GTC China - November 22, 2018 HOW LEADING-EDGE COMPUTING TECHNOLOGIES ARE HELPING REIMAGINE CITIES OF THE FUTURE Andrew Rink, AEC Industry Marketing GTC China - November 22, 2018 COMPUTING TECHNOLOGY TRENDS IN AEC GPU-Accelerated Workflows

More information

Shrinath Shanbhag Senior Software Engineer Microsoft Corporation

Shrinath Shanbhag Senior Software Engineer Microsoft Corporation Accelerating GPU inferencing with DirectML and DirectX 12 Shrinath Shanbhag Senior Software Engineer Microsoft Corporation Machine Learning Machine learning has become immensely popular over the last decade

More information

TESLA P100 PERFORMANCE GUIDE. Deep Learning and HPC Applications

TESLA P100 PERFORMANCE GUIDE. Deep Learning and HPC Applications TESLA P PERFORMANCE GUIDE Deep Learning and HPC Applications SEPTEMBER 217 TESLA P PERFORMANCE GUIDE Modern high performance computing (HPC) data centers are key to solving some of the world s most important

More information

POINT CLOUD DEEP LEARNING

POINT CLOUD DEEP LEARNING POINT CLOUD DEEP LEARNING Innfarn Yoo, 3/29/28 / 57 Introduction AGENDA Previous Work Method Result Conclusion 2 / 57 INTRODUCTION 3 / 57 2D OBJECT CLASSIFICATION Deep Learning for 2D Object Classification

More information

NVIDIA DATA LOADING LIBRARY (DALI)

NVIDIA DATA LOADING LIBRARY (DALI) NVIDIA DATA LOADING LIBRARY (DALI) RN-09096-001 _v01 September 2018 Release Notes TABLE OF CONTENTS Chapter Chapter Chapter Chapter Chapter 1. 2. 3. 4. 5. DALI DALI DALI DALI DALI Overview...1 Release

More information

Xilinx ML Suite Overview

Xilinx ML Suite Overview Xilinx ML Suite Overview Yao Fu System Architect Data Center Acceleration Xilinx Accelerated Computing Workloads Machine Learning Inference Image classification and object detection Video Streaming Frame

More information

TECHNICAL OVERVIEW ACCELERATED COMPUTING AND THE DEMOCRATIZATION OF SUPERCOMPUTING

TECHNICAL OVERVIEW ACCELERATED COMPUTING AND THE DEMOCRATIZATION OF SUPERCOMPUTING TECHNICAL OVERVIEW ACCELERATED COMPUTING AND THE DEMOCRATIZATION OF SUPERCOMPUTING Table of Contents: The Accelerated Data Center Optimizing Data Center Productivity Same Throughput with Fewer Server Nodes

More information

HPE Deep Learning Cookbook: Recipes to Run Deep Learning Workloads. Natalia Vassilieva, Sergey Serebryakov

HPE Deep Learning Cookbook: Recipes to Run Deep Learning Workloads. Natalia Vassilieva, Sergey Serebryakov HPE Deep Learning Cookbook: Recipes to Run Deep Learning Workloads Natalia Vassilieva, Sergey Serebryakov Deep learning ecosystem today Software Hardware 2 HPE s portfolio for deep learning Government,

More information

TENSORRT. RN _v01 January Release Notes

TENSORRT. RN _v01 January Release Notes TENSORRT RN-08624-030_v01 January 2018 Release Notes TABLE OF CONTENTS Chapter Chapter Chapter Chapter 1. 2. 3. 4. Overview...1 Release 3.0.2... 2 Release 3.0.1... 4 Release 2.1... 10 RN-08624-030_v01

More information

GPU Coder: Automatic CUDA and TensorRT code generation from MATLAB

GPU Coder: Automatic CUDA and TensorRT code generation from MATLAB GPU Coder: Automatic CUDA and TensorRT code generation from MATLAB Ram Kokku 2018 The MathWorks, Inc. 1 GPUs and CUDA programming faster Performance CUDA OpenCL C/C++ GPU Coder MATLAB Python Ease of programming

More information

NVIDIA Update and Directions on GPU Acceleration for Earth System Models

NVIDIA Update and Directions on GPU Acceleration for Earth System Models NVIDIA Update and Directions on GPU Acceleration for Earth System Models Stan Posey, HPC Program Manager, ESM and CFD, NVIDIA, Santa Clara, CA, USA Carl Ponder, PhD, Applications Software Engineer, NVIDIA,

More information

HETEROGENEOUS HPC, ARCHITECTURAL OPTIMIZATION, AND NVLINK STEVE OBERLIN CTO, TESLA ACCELERATED COMPUTING NVIDIA

HETEROGENEOUS HPC, ARCHITECTURAL OPTIMIZATION, AND NVLINK STEVE OBERLIN CTO, TESLA ACCELERATED COMPUTING NVIDIA HETEROGENEOUS HPC, ARCHITECTURAL OPTIMIZATION, AND NVLINK STEVE OBERLIN CTO, TESLA ACCELERATED COMPUTING NVIDIA STATE OF THE ART 2012 18,688 Tesla K20X GPUs 27 PetaFLOPS FLAGSHIP SCIENTIFIC APPLICATIONS

More information

Demystifying Deep Learning

Demystifying Deep Learning Demystifying Deep Learning Mandar Gujrathi Mandar.Gujrathi@mathworks.com.au 2015 The MathWorks, Inc. 1 2 Deep Learning Applications Voice assistants (speech to text) Teaching character to beat video game

More information

NVIDIA TURING GPU ARCHITECTURE. Graphics Reinvented

NVIDIA TURING GPU ARCHITECTURE. Graphics Reinvented NVIDIA TURING GPU ARCHITECTURE Graphics Reinvented WP-09183-001_v01 TABLE OF CONTENTS Introduction to the NVIDIA Turing Architecture...1 NVIDIA Turing Key Features... 3 New Streaming Multiprocessor (SM)...

More information

Characterization and Benchmarking of Deep Learning. Natalia Vassilieva, PhD Sr. Research Manager

Characterization and Benchmarking of Deep Learning. Natalia Vassilieva, PhD Sr. Research Manager Characterization and Benchmarking of Deep Learning Natalia Vassilieva, PhD Sr. Research Manager Deep learning applications Vision Speech Text Other Search & information extraction Security/Video surveillance

More information

S8688 : INSIDE DGX-2. Glenn Dearth, Vyas Venkataraman Mar 28, 2018

S8688 : INSIDE DGX-2. Glenn Dearth, Vyas Venkataraman Mar 28, 2018 S8688 : INSIDE DGX-2 Glenn Dearth, Vyas Venkataraman Mar 28, 2018 Why was DGX-2 created Agenda DGX-2 internal architecture Software programming model Simple application Results 2 DEEP LEARNING TRENDS Application

More information

SUPERCHARGED COMPUTING FOR THE DA VINCIS AND EINSTEINS OF OUR TIME

SUPERCHARGED COMPUTING FOR THE DA VINCIS AND EINSTEINS OF OUR TIME SUPERCHARGED COMPUTING FOR THE DA VINCIS AND EINSTEINS OF OUR TIME We pioneered a supercharged form of computing loved by the most demanding computer users in the world scientists, designers, artists,

More information

OPENSEQ2SEQ: A DEEP LEARNING TOOLKIT FOR SPEECH RECOGNITION, SPEECH SYNTHESIS, AND NLP

OPENSEQ2SEQ: A DEEP LEARNING TOOLKIT FOR SPEECH RECOGNITION, SPEECH SYNTHESIS, AND NLP OPENSEQ2SEQ: A DEEP LEARNING TOOLKIT FOR SPEECH RECOGNITION, SPEECH SYNTHESIS, AND NLP Boris Ginsburg, Vitaly Lavrukhin, Igor Gitman, Oleksii Kuchaiev, Jason Li, Vahid Noroozi, Ravi Teja Gadde, Chip Nguyen

More information

GPUs and the Future of Accelerated Computing Emerging Technology Conference 2014 University of Manchester

GPUs and the Future of Accelerated Computing Emerging Technology Conference 2014 University of Manchester NVIDIA GPU Computing A Revolution in High Performance Computing GPUs and the Future of Accelerated Computing Emerging Technology Conference 2014 University of Manchester John Ashley Senior Solutions Architect

More information

Turing Architecture and CUDA 10 New Features. Minseok Lee, Developer Technology Engineer, NVIDIA

Turing Architecture and CUDA 10 New Features. Minseok Lee, Developer Technology Engineer, NVIDIA Turing Architecture and CUDA 10 New Features Minseok Lee, Developer Technology Engineer, NVIDIA Turing Architecture New SM Architecture Multi-Precision Tensor Core RT Core Turing MPS Inference Accelerated,

More information

CUDA: NEW AND UPCOMING FEATURES

CUDA: NEW AND UPCOMING FEATURES May 8-11, 2017 Silicon Valley CUDA: NEW AND UPCOMING FEATURES Stephen Jones, GTC 2018 CUDA ECOSYSTEM 2018 CUDA DOWNLOADS IN 2017 3,500,000 CUDA REGISTERED DEVELOPERS 800,000 GTC ATTENDEES 8,000+ 2 CUDA

More information

WELCOME. Simona Jankowski, March 19, 2019

WELCOME. Simona Jankowski, March 19, 2019 WELCOME Simona Jankowski, March 19, SAFE HARBOR Forward-Looking Statements Except for the historical information contained herein, certain matters in this presentation including, but not limited to, statements

More information

Accelerating your Embedded Vision / Machine Learning design with the revision Stack. Giles Peckham, Xilinx

Accelerating your Embedded Vision / Machine Learning design with the revision Stack. Giles Peckham, Xilinx Accelerating your Embedded Vision / Machine Learning design with the revision Stack Giles Peckham, Xilinx Xilinx Foundation at the Edge Vision Customers Using Xilinx >80 ADAS Models From 23 Makers >80

More information

Unified Deep Learning with CPU, GPU, and FPGA Technologies

Unified Deep Learning with CPU, GPU, and FPGA Technologies Unified Deep Learning with CPU, GPU, and FPGA Technologies Allen Rush 1, Ashish Sirasao 2, Mike Ignatowski 1 1: Advanced Micro Devices, Inc., 2: Xilinx, Inc. Abstract Deep learning and complex machine

More information

How GPUs Power Comcast's X1 Voice Remote and Smart Video Analytics. Jan Neumann Comcast Labs DC May 10th, 2017

How GPUs Power Comcast's X1 Voice Remote and Smart Video Analytics. Jan Neumann Comcast Labs DC May 10th, 2017 How GPUs Power Comcast's X1 Voice Remote and Smart Video Analytics Jan Neumann Comcast Labs DC May 10th, 2017 Comcast Applied Artificial Intelligence Lab Media & Video Analytics Smart TV Deep Learning

More information