DGX UPDATE. Customer Presentation Deck May 8, 2017

Similar documents
DGX SYSTEMS: DEEP LEARNING FROM DESK TO DATA CENTER. Markus Weber and Haiduong Vo

NVIDIA DGX SYSTEMS PURPOSE-BUILT FOR AI

INTRODUCING THE DGX FAMILY. Marc Domenech May 8, 2017

S INSIDE NVIDIA GPU CLOUD DEEP LEARNING FRAMEWORK CONTAINERS

NVIDIA GPU CLOUD DEEP LEARNING FRAMEWORKS

S8765 Performance Optimization for Deep- Learning on the Latest POWER Systems

SUPERCHARGE DEEP LEARNING WITH DGX-1. Markus Weber SC16 - November 2016

IBM Deep Learning Solutions

DGX-1 DOCKER USER GUIDE Josh Park Senior Solutions Architect Contents created by Jack Han Solutions Architect

SYNERGIE VON HPC UND DEEP LEARNING MIT NVIDIA GPUS

ACCELERATED COMPUTING: THE PATH FORWARD. Jensen Huang, Founder & CEO SC17 Nov. 13, 2017

Deep Learning mit PowerAI - Ein Überblick

TESLA V100 PERFORMANCE GUIDE. Life Sciences Applications

Autonomous Driving Solutions

NVDIA DGX Data Center Reference Design

Cisco UCS C480 ML M5 Rack Server Performance Characterization

IBM CORAL HPC System Solution

IBM Power AC922 Server

POWERING THE AI REVOLUTION JENSEN HUANG, FOUNDER & CEO GTC 2017

RECENT TRENDS IN GPU ARCHITECTURES. Perspectives of GPU computing in Science, 26 th Sept 2016

Looking ahead with IBM i. 10+ year roadmap

Building the Most Efficient Machine Learning System

NVIDIA DLI HANDS-ON TRAINING COURSE CATALOG

Optimizing Efficiency of Deep Learning Workloads through GPU Virtualization

TACKLING THE CHALLENGES OF NEXT GENERATION HEALTHCARE

MICROWAY S NVIDIA TESLA V100 GPU SOLUTIONS GUIDE

IBM SpectrumAI with NVIDIA Converged Infrastructure Solutions for AI workloads

Object recognition and computer vision using MATLAB and NVIDIA Deep Learning SDK

#techsummitch

EFFICIENT INFERENCE WITH TENSORRT. Han Vanholder

TOWARDS ACCELERATED DEEP LEARNING IN HPC AND HYPERSCALE ARCHITECTURES Environnement logiciel pour l apprentissage profond dans un contexte HPC

TESLA P100 PERFORMANCE GUIDE. HPC and Deep Learning Applications

OpenPOWER Performance

The Path to GPU as a Service in Kubernetes Renaud Gaubert Lead Kubernetes Engineer

Building the Most Efficient Machine Learning System

ACCELERATED COMPUTING: THE PATH FORWARD. Jen-Hsun Huang, Co-Founder and CEO, NVIDIA SC15 Nov. 16, 2015

Onto Petaflops with Kubernetes

NVIDIA DEEP LEARNING INSTITUTE

S THE MAKING OF DGX SATURNV: BREAKING THE BARRIERS TO AI SCALE. Presenter: Louis Capps, Solution Architect, NVIDIA,

NVIDIA GRID. Ralph Stocker, GRID Sales Specialist, Central Europe

HP GTC Presentation May 2012

IBM Power Advanced Compute (AC) AC922 Server

TECHNICAL OVERVIEW ACCELERATED COMPUTING AND THE DEMOCRATIZATION OF SUPERCOMPUTING

IBM Power Systems HPC Cluster

Deep Learning Inference on Openshift with GPUs

USING NGC WITH GOOGLE CLOUD PLATFORM

Voice, Image, Video : AI in action with AWS. 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.

World s most advanced data center accelerator for PCIe-based servers

Graph Database and Analytics in a GPU- Accelerated Cloud Offering

NGC CONTAINER. DU _v02 November User Guide

Beyond Training The next steps of Machine Learning. Chris /in/chrisparsonsdev

Making Supercomputing More Available and Accessible Windows HPC Server 2008 R2 Beta 2 Microsoft High Performance Computing April, 2010

Hyper-Convergence De-mystified. Francis O Haire Group Technology Director

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme

DataON and Intel Select Hyper-Converged Infrastructure (HCI) Maximizes IOPS Performance for Windows Server Software-Defined Storage

Scalable AI Infrastructure

Fast Hardware For AI

Vectordash. Vectordash Primer. A GPU accelerated cloud computing marketplace.

Shifter: Fast and consistent HPC workflows using containers

Unlock business value with HPC & Artificial Intelligence. FORUM TERATEC June 19, 2018 José RODRIGUES HPC Sales Manager

Docker Universal Control Plane Deploy and Manage On-Premises, Your Dockerized Distributed Applications

Microsoft Azure Stack Hybrid Cloud. The Modern System Architecture

WELCOME! TODAY S WEBINAR: RECIPES FOR PRODUCT DESIGN & AEC WORKSTATION SUCCESS. Mike Leach. July 25, Senior Workstation Technologist Lenovo

PART-I (B) (TECHNICAL SPECIFICATIONS & COMPLIANCE SHEET) Supply and installation of High Performance Computing System

HPE Deep Learning Cookbook: Recipes to Run Deep Learning Workloads. Natalia Vassilieva, Sergey Serebryakov

VEXATA FOR ORACLE. Digital Business Demands Performance and Scale. Solution Brief

A NEW COMPUTING ERA. Shanker Trivedi Senior Vice President Enterprise Business at NVIDIA

Hyper Converged Systems 250 and 380

Advancing State-of-the-Art of Autonomous Vehicles and Robotics Research using AWS GPU Instances

The Tesla Accelerated Computing Platform

MACHINE LEARNING WITH NVIDIA AND IBM POWER AI

NVIDIA Accelerators Models HPE NVIDIA GV100 Nvlink Bridge Kit HPE NVIDIA Tesla V100 FHHL 16GB Computational Accelerator

Please give me your feedback

IBM POWER SYSTEMS: YOUR UNFAIR ADVANTAGE

A NEW COMPUTING ERA JENSEN HUANG, FOUNDER & CEO GTC CHINA 2017

IBM Leading High Performance Computing and Deep Learning Technologies

TESLA P100 PERFORMANCE GUIDE. Deep Learning and HPC Applications

IBM Power Systems: Open innovation to put data to work Dexter Henderson Vice President IBM Power Systems

BEST PRACTICES FOR DOCKER

DEEP LEARNING ALISON B LOWNDES. Deep Learning Solutions Architect & Community Manager EMEA

Enterprise Network Compute System (ENCS)

Building NVLink for Developers

TENSORFLOW. DU _v1.8.0 June User Guide

Predicting Service Outage Using Machine Learning Techniques. HPE Innovation Center

Genomics on Cisco Metacloud + SwiftStack

HPE SimpliVity 380. Simplyfying Hybrid IT with HPE Wolfgang Privas Storage Category Manager

HPE ProLiant DL580 Gen10 Server

Preparing GPU-Accelerated Applications for the Summit Supercomputer

DEEP NEURAL NETWORKS AND GPUS. Julie Bernauer

BEST PRACTICES FOR DOCKER

HETEROGENEOUS HPC, ARCHITECTURAL OPTIMIZATION, AND NVLINK STEVE OBERLIN CTO, TESLA ACCELERATED COMPUTING NVIDIA

NVIDIA AI BRAIN OF SELF DRIVING AND HD MAPPING. September 13, 2016

Understanding the latent value in all content

Managing Deep Learning Workflows

HPE ProLiant DL360 Gen P 16GB-R P408i-a 8SFF 500W PS Performance Server (P06453-B21)

NVIDIA Update and Directions on GPU Acceleration for Earth System Models

CIO Forum Maximize the value of IT in today s economy

Bright Cluster Manager: Using the NVIDIA NGC Deep Learning Containers

The Cloud Evolution. Tom Kilroy, Vice President General Manager, Digital Enterprise Group

Kuberiter White Paper. Kubernetes. Cloud Provider Comparison Chart. Lawrence Manickam Kuberiter Inc

Transcription:

DGX UPDATE Customer Presentation Deck May 8, 2017

NVIDIA DGX-1: The World s Fastest AI Supercomputer FASTEST PATH TO DEEP LEARNING EFFORTLESS PRODUCTIVITY REVOLUTIONARY AI PERFORMANCE Fully-integrated and pre-optimized Insights in hours instead of weeks Optimized frameworks and cloud managed for faster insights DGX software stack for fastest GPU performance in the industry 2

ONE YEAR LATER NVIDIA DGX-1 Barriers Toppled, the Unsolvable Solved a Sampling of DGX-1 Impact DGX-1 launch UC Berkley CSIRO MIT CMU Fidelity Skymind RIKEN April 2016 April 2017 OpenAI NYU Mass. General Hosp. DFK IDSIA Microsoft SAP NVIDIA SATURNV launch Nimbix Noodle.ai 3

INTRODUCING THE DGX FAMILY AI WORKSTATION AI DATA CENTER CLOUD-SCALE AI DGX Station DGX-1 NVIDIA GPU Cloud with with The Personal AI Supercomputer Tesla P100 Tesla V100 Cloud service with the highest deep learning efficiency The World s First AI Supercomputer in a Box The Essential Instrument for AI Research 4

REVOLUTIONARY AI PERFORMANCE 3X system performance over prior generation Software stack delivers additional 30% faster training performance vs other GPU systems NVIDIA DGX unlocks the full potential of NVIDIA GPU s powered by software innovation 10X I/O performance with 2nd generation NVLink vs PCIe-connected GPU s New Tensor Core architecture inspired by the demands of deep learning 5

EFFORTLESS PRODUCTIVITY Save $x00,000 s on software engineering of DL frameworks Depend on NVIDIA-optimized frameworks instead of evolving open source software Save $100k+/yr in admin OpEx with cloud management, streamlined collaboration Monthly framework releases ensure maximized performance for DL ROI NVIDIA DGX software stack delivers immediate productivity that saves time and money 6

COMMON SOFTWARE STACK ACROSS DGX FAMILY Single, unified stack for deep learning frameworks Predictable execution across platforms Pervasive reach NVIDIA GPU Cloud DEEP LEARNING FRAMEWORKS DEEP LEARNING USER SOFTWARE NVIDIA DIGITS THIRD PARTY ACCELERATED SOLUTIONS CONTAINERIZATION TOOL NVIDIA Docker GPU DRIVER NVIDIA Driver DGX Station DGX-1 NVIDIA Cloud Service SYSTEM Host OS 7

ENTERPRISE BENEFITS OF DGX SOFTWARE NVIDIA Investments in Deep Learning Performance and Manageability Popular deep-learning frameworks - GPU-tuned by NVIDIA Engineering Practitioner productivity with minimal setup Driver and library independence for each framework Clean, minimal O/S base image Optimized drivers and libraries for maximized multi-gpu performance Non-disruptive updates for software and security 8

DGX CLOUD SERVICES Start Faster, Stay Productive Benefits for Deep Learning Workflow Features Container Registry Single software stack Develop once, deploy anywhere Scale across teams of practitioners Web UI and CLI Job Scheduling and Management Host Telemetry User Management Python SDK and REST API 9

10 STEPS TO SETUP A DIY SYSTEM 380 PAGES OF DOCS TO READ Step 1. Install Ubuntu linux (10 pg) Step 2. Install CUDA (41 pg) Step 3. Install CUDNN (154 pg) Step 4. Install and Upgrade PIP (20 pg) Step 5. Install BAZEL (build TF source) (50 pg) Step 6. Install TensorFlow (15 pg) Step 7. Upgrade Protobuf (15 pg) Step 8. Install Docker (75 pg) Step 9. Test the installation Step 10. Debug and fix install NVIDIA CONFIDENTIAL. DO NOT DISTRIBUTE. 10

DL FROM DEVELOPMENT TO PRODUCTION Accelerated Deep Learning Value with DGX Solutions From Desk DGX Station DGX-1/SATURNV/Cloud To Data Center or To Cloud installed optimized scaled Fast Bring-up Productive Experimentation Training at Scale Procure DGX Station Install / Compile Experiment Tune/ Optimize Deploy Train Insights 11

OUR STRATEGY IN THE DATACENTER: NVIDIA DGX-1 Highest Performance, Fully Integrated HW System 8 TB SSD 8 x Tesla V100 16GB 960 TFLOPS 8x Tesla V100 16GB 300 GB/s NVLink Hybrid Cube Mesh 2x Xeon 8 TB RAID 0 Quad IB 100Gbps, Dual 10GbE 3U 3200W 12

DEEP LEARNING FRAMEWORKS DEEP LEARNING USER SOFTWARE NVIDIA DIGITS THIRD PARTY ACCELERATED SOLUTIONS CONTAINERIZATION TOOL NVIDIA Docker GPU DRIVER NVIDIA Driver NVIDIA DGX-1 SOFTWARE STACK Fully Integrated Software for Instant Productivity Advantages: Instant productivity with NVIDIA optimized deep learning frameworks Caffe, CNTK, MXNet, PyTorch, TensorFlow, Theano, and Torch Performance optimized across the entire stack SYSTEM Host OS DGX SOFTWARE STACK Faster Time-to-Insight with pre-built, tested, and ready to run framework containers Flexibility to use different versions of libraries like libc, cudnn in each framework container 13

SIMPLIFY PORTABILITY WITH NVIDIA DOCKER CONTAINERS Benefits of Containers: Simplify deployment of GPU-accelerated applications Isolate individual frameworks or applications Share, collaborate, and test applications across different environments 14

THE POWER TO RUN MULTIPLE FRAMEWORKS AT ONCE Container Images portable across new driver versions Containerized Applications NVIDIA Docker NVIDIA Docker NVIDIA Docker NVIDIA Docker NVIDIA Docker Microsoft Cognitive Toolkit... Other Frameworks and Apps TF Tuned SW CNTK Tuned SW Caffe2 Tuned SW Pytorch Tuned SW Tuned SW CUDA RT CUDA RT CUDA RT CUDA RT CUDA RT Linux Kernel + CUDA Driver NVIDIA DGX-1 15

DGX-1: 96X FASTER THAN CPU DGX-1 7.4 hours 8-way GPU Server 18 hours Dual Socket CPU 711 hours 1X 40X 96X Workload: ResNet50, 90 epochs to solution CPU Server: Dual Xeon E5-2699 v4, 2.6GHz NVIDIA CONFIDENTIAL. DO NOT DISTRIBUTE. 16

RIKEN SUCCESS STORY Fujitsu and NVIDIA Build AI Supercomputer With 24 DGX-1s CHALLENGE Enterprises and research organizations embracing AI/DL Needed to accelerated research in areas including medicine, manufacturing and healthcare Conventional HPC architectures too costly and inefficient SOLUTION Partnered with Fujitsu for scale-out AI architecture built on DGX-1 24 DGX-1 s deliver 4 petaflops powering the RIKEN supercomputer NVIDIA COSMOS streamlines AI researcher workflow, helping accelerate RIKEN productivity IMPACT Accelerated real-world implementation of scale-out AI Enables RIKEN team to take advantage of next-gen DL algorithms Helping create future in which AI finds solutions to societal issues 17

BENEVOLENTAI: TRAINING REDUCED TO DAYS Technology Review Article on DGX-1: The Pint-Sized Supercomputer That Companies Are Scrambling to Get https://www.technologyreview.com/s/603075/the-pint-sized-supercomputer-that-companies-are-scrambling-to-get/ SYSTEM INSTALLATION TRAINING MODELS NVIDIA DGX-1 Days Other GPU System Weeks of Training The cost of renting enough servers on Amazon Web Services would surpass the system s $129,000 price tag within a year. -Jackie Hunter, CEO, BenevolentAI DGX-1 3x-4x FASTER TRAINING 18

INTRODUCING NVIDIA DGX STATION Groundbreaking AI at your desk The Personal AI Supercomputer for Researchers and Data Scientists Revolutionary form factor - designed for the desk, whisper-quiet Start experimenting in hours, not weeks, powered by DGX Stack Productivity that goes from desk to data center to cloud Breakthrough performance and precision powered by Volta 19

3X FASTER THAN THE FASTEST WORKSTATIONS Supercomputing performance at your desk 480 TFLOPS Water-cooled performance the only workstation built on 4 Tesla V100 s 3X 30% 5X 3X the performance of today s fastest GPU workstations with 30% faster training over non-dgx stack solutions 5X increase in I/O performance with 4-way next generation NVLink vs. PCIe-connected GPU s 20

ANNOUNCING NVIDIA GPU CLOUD GPU-ACCELERATED CLOUD PLATFORM OPTIMIZED FOR DEEP LEARNING Containerized in NVDocker Optimization across the full stack Always up-to-date Fully tested and maintained by NVIDIA NVIDIA GPU CLOUD Registry of Containers, Datasets, and Pre-trained models CSPs 21

DGX CLOUD SERVICES Detailed Feature Walkthrough 22

DGX CLOUD SERVICES Management Workflows Plug-in, Power up Authenticate appliances View appliances Node events (cloud connect, change of master, IP, etc.) Node state (connect, disconnect, ready, faulty) Recovery ISO Image Factory PXE Boot Image Container updates On-premises setup Cloud portal connectivity Cluster / Node Management Software Management 23

DGX CLOUD SERVICES Management Workflows Container Management Job Execution Job Status Hardware Health CPU, CPU, RAM Util. Alerts System Image System Software Docker Image Application / Job Scheduling Metrics / Notifications System Updates 24

DL FRAMEWORK WORKFLOW - NGC DGX Management in 7 Easy Steps BUILD MANAGE SCALE 1 2 3 4 5 6 7 User Mgmt Create Accounts User Mapping Assign Users to Projects Container Repo Pull/Push Frameworks to Node(s) Config Job Resources Assign GPU/CPU/RAM Review / Submit Job Ready / Warnings Job Mgmt Status / Detail / Clone Schedule FIFO Scheduler 25

NVIDIA DGX SYSTEMS Faster AI Innovation and Insight The World s First Portfolio of Purpose-Built AI Supercomputers Powered by NVIDIA GPU Cloud Get Started in AI Faster Effortless Productivity Performance Without Compromise For More Information: nvidia.com/dgx-systems 26