S INSIDE NVIDIA GPU CLOUD DEEP LEARNING FRAMEWORK CONTAINERS

Similar documents
NVIDIA GPU CLOUD DEEP LEARNING FRAMEWORKS

DGX UPDATE. Customer Presentation Deck May 8, 2017

NGC CONTAINER. DU _v02 November User Guide

NVIDIA DGX SYSTEMS PURPOSE-BUILT FOR AI

DGX SYSTEMS: DEEP LEARNING FROM DESK TO DATA CENTER. Markus Weber and Haiduong Vo

Deep Learning Inference on Openshift with GPUs

USING NGC WITH GOOGLE CLOUD PLATFORM

NVIDIA DLI HANDS-ON TRAINING COURSE CATALOG

PREPARING TO USE CONTAINERS

IBM Deep Learning Solutions

NVIDIA DEEP LEARNING INSTITUTE

Embedded GPGPU and Deep Learning for Industrial Market

Managing Deep Learning Workflows

The Path to GPU as a Service in Kubernetes Renaud Gaubert Lead Kubernetes Engineer

Supporting GPUs in Docker Containers on Apache Mesos

BEST PRACTICES FOR DOCKER

INTRODUCING THE DGX FAMILY. Marc Domenech May 8, 2017

EFFICIENT INFERENCE WITH TENSORRT. Han Vanholder

Autonomous Driving Solutions

HPE Deep Learning Cookbook: Recipes to Run Deep Learning Workloads. Natalia Vassilieva, Sergey Serebryakov

DGX-1 DOCKER USER GUIDE Josh Park Senior Solutions Architect Contents created by Jack Han Solutions Architect

Shifter at CSCS Docker Containers for HPC

NVIDIA VOLTA DEEP LEARNING AMI

NVIDIA GPU CLOUD. DU _v02 July Getting Started Guide

USING NGC WITH AZURE. DU _v01 September Setup Guide

NVDIA DGX Data Center Reference Design

BEST PRACTICES FOR DOCKER

SYNERGIE VON HPC UND DEEP LEARNING MIT NVIDIA GPUS

Introduction to Deep Learning in Signal Processing & Communications with MATLAB

CUDNN. DU _v07 December Installation Guide

NVIDIA PLATFORM FOR AI

A NEW COMPUTING ERA. Shanker Trivedi Senior Vice President Enterprise Business at NVIDIA

Bright Cluster Manager: Using the NVIDIA NGC Deep Learning Containers

Deep Learning: Transforming Engineering and Science The MathWorks, Inc.

NVIDIA FOR DEEP LEARNING. Bill Veenhuis

Shifter: Fast and consistent HPC workflows using containers

S8765 Performance Optimization for Deep- Learning on the Latest POWER Systems

Arup Nanda VP, Data Services Priceline.com

Deep Learning on AWS with TensorFlow and Apache MXNet

Shrinath Shanbhag Senior Software Engineer Microsoft Corporation

Containerizing GPU Applications with Docker for Scaling to the Cloud

Machine Learning. Bridging the OT IT Gap for Machine Learning with Ignition and AWS Greengrass

A NEW COMPUTING ERA JENSEN HUANG, FOUNDER & CEO GTC CHINA 2017

Onto Petaflops with Kubernetes

CS 470 Spring Virtualization and Cloud Computing. Mike Lam, Professor. Content taken from the following:

SUPERCHARGE DEEP LEARNING WITH DGX-1. Markus Weber SC16 - November 2016

BUILDING A GPU-FOCUSED CI SOLUTION

Getting Started with OpenStack

Advancing State-of-the-Art of Autonomous Vehicles and Robotics Research using AWS GPU Instances

Nvidia Jetson TX2 and its Software Toolset. João Fernandes 2017/2018

Snapdragon NPE Overview

Deep learning in MATLAB From Concept to CUDA Code

NVIDIA T4 FOR VIRTUALIZATION

World s most advanced data center accelerator for PCIe-based servers

Deploying Deep Learning Networks to Embedded GPUs and CPUs

TENSORFLOW. DU _v1.8.0 June User Guide

NCCL 2.0. Sylvain Jeaugey

SQL Server inside a docker container. Christophe LAPORTE SQL Server MVP/MCM SQL Saturday 735 Helsinki 2018

A Hands on Introduction to Docker

Inference Optimization Using TensorRT with Use Cases. Jack Han / 한재근 Solutions Architect NVIDIA

End to End Optimization Stack for Deep Learning

GPU-Accelerated Deep Learning

MIOVISION DEEP LEARNING TRAFFIC ANALYTICS SYSTEM FOR REAL-WORLD DEPLOYMENT. Kurtis McBride CEO, Miovision

NVIDIA DATA LOADING LIBRARY (DALI)

Xilinx ML Suite Overview

Deep Learning Inferencing on IBM Cloud with NVIDIA TensorRT

POWERING THE AI REVOLUTION JENSEN HUANG, FOUNDER & CEO GTC 2017

Any platform. Achieve more. Team agility

DEEP NEURAL NETWORKS CHANGING THE AUTONOMOUS VEHICLE LANDSCAPE. Dennis Lui August 2017

IBM SpectrumAI with NVIDIA Converged Infrastructure Solutions for AI workloads

Object recognition and computer vision using MATLAB and NVIDIA Deep Learning SDK

Designing GPU-accelerated applications with RTMaps (Real-Time Multisensor Applications) Framework and NVIDIA DriveWorks

Cisco UCS C480 ML M5 Rack Server Performance Characterization

TACKLING THE CHALLENGES OF NEXT GENERATION HEALTHCARE

TESLA V100 PERFORMANCE GUIDE. Life Sciences Applications

What s inside: What is deep learning Why is deep learning taking off now? Multiple applications How to implement a system.

Containers, Serverless and Functions in a nutshell. Eugene Fedorenko

A DEVOPS STATE OF MIND. Chris Van Tuin Chief Technologist, West

DEFECT INSPECTION FROM SCRATCH TO PRODUCTION. Andrew Liu, Ryan Shen Deep Learning Solution Architect

Docker 101 Workshop. Eric Smalling - Solution Architect, Docker

HOW TO BUILD A MODERN AI

Deep Learning mit PowerAI - Ein Überblick

State of Containers. Convergence of Big Data, AI and HPC

Intelligent System for AI. 清大資工周志遠 AII Workshop

GPU FOR DEEP LEARNING. 周国峰 Wuhan University 2017/10/13

GTC Jensen Huang Founder & CEO

S8901 Quadro for AI, VR and Simulation

Getting Started with Hadoop

Building the Most Efficient Machine Learning System

Welcome to Docker Birthday # Docker Birthday events (list available at Docker.Party) RSVPs 600 mentors Big thanks to our global partners:

Vinnie Saini Cloud Solution Architect Big Data & AI

How to Put Your AF Server into a Container

USING NGC WITH YOUR NVIDIA TITAN PC

Securing Microservices Containerized Security in AWS

MACHINE LEARNING WITH NVIDIA AND IBM POWER AI

Allowing Users to Run Services at the OLCF with Kubernetes

TENSORRT 4.0 RELEASE CANDIDATE (RC)

IBM Bluemix compute capabilities IBM Corporation

ArcGIS Enterprise in the Amazon Cloud

ACCELERATED COMPUTING: THE PATH FORWARD. Jensen Huang, Founder & CEO SC17 Nov. 13, 2017

Transcription:

S8497 - INSIDE NVIDIA GPU CLOUD DEEP LEARNING FRAMEWORK CONTAINERS Chris Lamb CUDA and NGC Engineering, NVIDIA John Barco NGC Product Management, NVIDIA

NVIDIA GPU Cloud (NGC) overview AGENDA Using NGC on your PC, workstation or cloud provider Understanding best practices for working with containers Learn more: NGC meetup and sessions Q&A 2

CHALLENGES WITH COMPLEX SOFTWARE Current DIY GPU-accelerated AI and HPC deployments can be complex and time consuming to build, test and maintain Development of software frameworks by the community is moving very fast Requires high level of expertise to manage driver, library, framework dependencies Open Source Frameworks NVIDIA Libraries NVIDIA Docker NVIDIA Driver NVIDIA GPU 3

WHY CONTAINERS? Benefits of Containers: Simplify deployment of GPU-accelerated software, eliminating time-consuming software integration work Isolate individual deep learning frameworks and applications Share, collaborate, and test applications across different environments 4

DEPLOY ACROSS MULTIPLE PLATFORMS NVIDIA TITAN (powered by NVIDIA Volta or NVIDIA Pascal) NVIDIA DGX-1 and DGX Station Amazon EC2 P3 instances with NVIDIA Volta 5

VIRTUAL MACHINES VS. CONTAINERS Motivation Packaging and deployment mechanism for applications Consistent and reproducible deployment Lightweight and lower overhead than VMs Logical isolation from other applications Image credits 6

EXAMPLE NGC CONTAINER WORKFLOW NVIDIA builds application image composed of layers of files NGC Image(s) tested and released to NGC repository hosted at URLs like nvcr.io/nvidia/tensorflow $ docker run nvcr.io/ User pulls image to a machine and runs it 101 010 Image cached and OS isolated set of resources allocated (container) in which to execute Data & results accessed as a filesystem volume 7

ANATOMY OF AN NGC CONTAINER IMAGE R/W Layer fb91e851e672 Examples & Scripts 0c395732af81 DL Framework & Source 145c1bf7947a NVIDIA DeepLearning SDK Image Layers (R/O) f2233041f557 NVIDIA CUDA SDK ubuntu:16.04 8

ALWAYS UP-TO-DATE Monthly Releases from NVIDIA 9

BEST NVIDIA PERFORMANCE Over 6 months, up to 1.5X improvement with mixed-precision on ResNet-50 10

TARGET SYSTEM SETUP NGC Virtual Machine Images NVIDIA Deep Learning for Volta (AWS EC2 AMI) NGC Examples and Management Scripts https://github.com/nvidia/ngc-examples Pre-installed Up-to-date Ubuntu Server OS CUDA Drivers NVIDIA Container Runtime NGC Container Ready BaseOS On all DGX Systems Self-Install Setup Guide 11

LOG INTO NGC, PULL AND RUN 1 Create Account / Log In 2 Get API Key 3 Browse For Image 4 Log in on Machine & Run $ docker login nvcr.io Username: $oauthtoken Password: ******* $ docker run -it nvcr.io/nvidia/pytorch:18.02 12

RUNNING CONTAINERS WITH DATA $ docker run -rm it nvcr.io/ -volume /mnt/ssd/large_dataset:/workspace/large_dataset nvcri.io/nvidia/tensorflow:18.02 /workspace/large_dataset 101010 /mnt/ssd/large_dataset 13

NVIDIA GPU CLOUD APPLICATIONS Access to a Comprehensive Catalog of GPU-Accelerated Software 14

NGC MEETUP SESSIONS Meetup Tues 7:30-9:30PM Room 210E Frameworks 101 - the NVIDIA special sauce Verizon Case Study The Future of Multi-GPU Training Cloud Inferencing 101 with TensorRT Kubernetes on NVIDIA GPUs Joey Conway Bryan Larish (Verizon) Mike O Connor David Goodwin Ryan Olson Sessions NGC Sessions 1. How to Use NGC Containers on AWS 2. Inside NGC Deep Learning Framework Containers 3. Connect with the NGC Deep Learning Experts 4. Predicting 4G Wireless Network Quality with Deep-Learning Algorithm 5. NVIDIA IndeX 2.0 - Advanced Large-Scale Data Visualizations 6. GE's Evolution from HPC to AI in Healthcare 7. Quick and Easy DL Workflow Proof of Concept 8. Building Smart Handheld 3D Ultrasound Imaging System with GPU and NGC 15

S8497 - INSIDE NVIDIA GPU CLOUD DEEP LEARNING FRAMEWORK CONTAINERS

GPU-ACCELERATED DEEP LEARNING SOFTWARE CONTAINERS Deep Learning Everywhere, for Everyone Innovation for Every Industry Quickly tap into the power of NVIDIA AI, from automotive, to healthcare, to fintech, and more Say Goodbye to DIY Ready-to-run deep learning software containers, tuned, tested, and certified by NVIDIA Stay Up To Date Monthly updates to deep learning containers NVIDIA GPU Cloud integrates GPU-optimized deep learning frameworks, runtimes, libraries, and OS into a ready-to-run container, available at no charge 17

GPU-OPTIMIZED DEEP LEARNING SOFTWARE Tuned, Tested, Certified, and Maintained by NVIDIA NVCaffe Caffe2 Microsoft Cognitive Toolkit (CNTK) DIGITS MXNet PyTorch TensorFlow Theano Torch CUDA (base level container for developers) NVIDIA TensorRT inference accelerator with ONNX support 18

ALWAYS UP-TO-DATE Monthly Updates from NVIDIA to Deep Learning Containers Containerized Applications Docker Engine Utility for NVIDIA GPUs Docker Engine Utility for NVIDIA GPUs Docker Engine Utility for NVIDIA GPUs Docker Engine Utility for NVIDIA GPUs Docker Engine Utility for NVIDIA GPUs... Other Frameworks and Apps TF Tuned SW CNTK Tuned SW Caffe2 Tuned SW CUDA RT CUDA RT CUDA RT PyTorch Tuned SW CUDA RT Tuned SW CUDA RT Linux Kernel + CUDA Driver 19