NVIDIA DEEP LEARNING INSTITUTE

Similar documents
NVIDIA DLI HANDS-ON TRAINING COURSE CATALOG

NVIDIA GPU CLOUD DEEP LEARNING FRAMEWORKS

A NEW COMPUTING ERA JENSEN HUANG, FOUNDER & CEO GTC CHINA 2017

Voice, Image, Video : AI in action with AWS. 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.

Shrinath Shanbhag Senior Software Engineer Microsoft Corporation

Deep learning in MATLAB From Concept to CUDA Code

S INSIDE NVIDIA GPU CLOUD DEEP LEARNING FRAMEWORK CONTAINERS

MIOVISION DEEP LEARNING TRAFFIC ANALYTICS SYSTEM FOR REAL-WORLD DEPLOYMENT. Kurtis McBride CEO, Miovision

Deploying Deep Learning Networks to Embedded GPUs and CPUs

Tutorial on Keras CAP ADVANCED COMPUTER VISION SPRING 2018 KISHAN S ATHREY

NVIDIA DGX SYSTEMS PURPOSE-BUILT FOR AI

Deep Learning mit PowerAI - Ein Überblick

Embedded GPGPU and Deep Learning for Industrial Market

NVIDIA FOR DEEP LEARNING. Bill Veenhuis

HPE Deep Learning Cookbook: Recipes to Run Deep Learning Workloads. Natalia Vassilieva, Sergey Serebryakov

Machine Learning in WAN Research

Introduction to Deep Learning in Signal Processing & Communications with MATLAB

Machine Learning in WAN Research

Event: PASS SQL Saturday - DC 2018 Presenter: Jon Tupitza, CTO Architect

What s inside: What is deep learning Why is deep learning taking off now? Multiple applications How to implement a system.

DGX SYSTEMS: DEEP LEARNING FROM DESK TO DATA CENTER. Markus Weber and Haiduong Vo

Inference Optimization Using TensorRT with Use Cases. Jack Han / 한재근 Solutions Architect NVIDIA

IBM Deep Learning Solutions

Fast Hardware For AI

Demystifying Deep Learning

Code Mania Artificial Intelligence: a. Module - 1: Introduction to Artificial intelligence and Python:

EFFICIENT INFERENCE WITH TENSORRT. Han Vanholder

The OpenVX Computer Vision and Neural Network Inference

Autonomous Driving Solutions

Embarquez votre Intelligence Artificielle (IA) sur CPU, GPU et FPGA

Building the Most Efficient Machine Learning System

DEFECT INSPECTION FROM SCRATCH TO PRODUCTION. Andrew Liu, Ryan Shen Deep Learning Solution Architect

World s most advanced data center accelerator for PCIe-based servers

IMAGERY FOR ARCGIS. Manage and Understand Your Imagery. Credit: Image courtesy of DigitalGlobe

Deep Learning Frameworks with Spark and GPUs

Deep Learning Frameworks. COSC 7336: Advanced Natural Language Processing Fall 2017

TensorFlow: A System for Learning-Scale Machine Learning. Google Brain

What s New in MATLAB and Simulink

A NEW COMPUTING ERA. Shanker Trivedi Senior Vice President Enterprise Business at NVIDIA

Practical Applications of Machine Learning for Image and Video in the Cloud

Characterization and Benchmarking of Deep Learning. Natalia Vassilieva, PhD Sr. Research Manager

Predicting Service Outage Using Machine Learning Techniques. HPE Innovation Center

Deep Learning: Transforming Engineering and Science The MathWorks, Inc.

NVIDIA PLATFORM FOR AI

DGX UPDATE. Customer Presentation Deck May 8, 2017

Managing Deep Learning Workflows

OpenACC/CUDA/OpenMP... 1 Languages and Libraries... 3 Multi-GPU support... 4 How OpenACC Works... 4

2015 The MathWorks, Inc. 1

CloudSwyft Learning-as-a-Service Course Catalog 2018 (Individual LaaS Course Catalog List)

AI & Machine Learning at Amazon

Review: The best frameworks for machine learning and deep learning

Neural Network Exchange Format

Defense Data Generation in Distributed Deep Learning System Se-Yoon Oh / ADD-IDAR

National College of Ireland Project Submission Sheet 2015/2016 School of Computing

MoonRiver: Deep Neural Network in C++

Building the Most Efficient Machine Learning System

A NEW COMPUTING ERA. DAVID B. KIRK, FELLOW NVIDIA AI Conference Singapore 2017

TESLA V100 PERFORMANCE GUIDE. Life Sciences Applications

Pervasive Computing. OpenLab Jan 14 04pm L Institute of Networked and Embedded Systems

SUPERCHARGE DEEP LEARNING WITH DGX-1. Markus Weber SC16 - November 2016

Deep Learning Inferencing on IBM Cloud with NVIDIA TensorRT

In partnership with. VelocityAI REFERENCE ARCHITECTURE WHITE PAPER

Pouya Kousha Fall 2018 CSE 5194 Prof. DK Panda

Presenter: Felix Goldberg, Ph.D. Chief AI Scientist Artificial Intelligence Cart AIC

CMPE 665:Multiple Processor Systems CUDA-AWARE MPI VIGNESH GOVINDARAJULU KOTHANDAPANI RANJITH MURUGESAN

Machine Learning on VMware vsphere with NVIDIA GPUs

GPU Coder: Automatic CUDA and TensorRT code generation from MATLAB

Application of Deep Learning Techniques in Satellite Telemetry Analysis.

Realtime Object Detection and Segmentation for HD Mapping

Xilinx ML Suite Overview

Open Standards for Vision and AI Peter McGuinness NNEF WG Chair CEO, Highwai, Inc May 2018

DEEP NEURAL NETWORKS AND GPUS. Julie Bernauer

Keras: Handwritten Digit Recognition using MNIST Dataset

GPU Computing and Its Applications

TENSORRT. RN _v01 January Release Notes

Data-Centric Innovation Summit DAN MCNAMARA SENIOR VICE PRESIDENT GENERAL MANAGER, PROGRAMMABLE SOLUTIONS GROUP

What s New in MATLAB and Simulink The MathWorks, Inc. 1

Deep Learning on AWS with TensorFlow and Apache MXNet

Accelerating Implementation of Low Power Artificial Intelligence at the Edge

COMP 551 Applied Machine Learning Lecture 16: Deep Learning

An introduction to Machine Learning silicon

OpenACC Course. Office Hour #2 Q&A

Big Data Specialized Studies

Convolutional Neural Network based Medical Imaging Segmentation: Recent Progress and Challenges. Jiaxing Tan

Demystifying Deep Learning

Deep learning prevalence. first neuroscience department. Spiking Neuron Operant conditioning First 1 Billion transistor processor

How to Build Optimized ML Applications with Arm Software

Vinnie Saini Cloud Solution Architect Big Data & AI

Understanding the latent value in all content

Accelerate AI with Cisco Computing Solutions

What s New in MATLAB and Simulink

OPERATIONALIZING MACHINE LEARNING USING GPU ACCELERATED, IN-DATABASE ANALYTICS

CERN openlab & IBM Research Workshop Trip Report

Why data science is the new frontier in software development

Deep Learning Accelerators

GPU ACCELERATED COMPUTING. 1 st AlsaCalcul GPU Challenge, 14-Jun-2016, Strasbourg Frédéric Parienté, Tesla Accelerated Computing, NVIDIA Corporation

DEEP NEURAL NETWORKS CHANGING THE AUTONOMOUS VEHICLE LANDSCAPE. Dennis Lui August 2017

Analytics and Visualization

DEEP LEARNING AND DIGITS DEEP LEARNING GPU TRAINING SYSTEM

Actionable Insights USE CASES. Plantronics All Rights Reserved. 1

Transcription:

NVIDIA DEEP LEARNING INSTITUTE TRAINING CATALOG Valid Through July 31, 2018

INTRODUCTION The NVIDIA Deep Learning Institute (DLI) trains developers, data scientists, and researchers on how to use artificial intelligence and accelerated computing to solve real-world problems across a wide range of domains. These include autonomous vehicles, digital content creation, finance, healthcare, intelligent video analytics, and more. In the deep learning courses, you ll learn how to train, optimize, and deploy neural networks using the latest tools, frameworks, and techniques for deep learning. In the accelerated computing courses, you ll learn how to assess, parallelize, optimize, and deploy GPUaccelerated computing applications across a wide range of application domains. DLI offers training in three formats: 1. Instructor-Led Workshops Full-day workshops include hands-on training and lectures delivered onsite by DLI certified instructors. To view upcoming workshops near you or request a workshop for your organization, visit www.nvidia.com/dli. 2. Online Courses Online courses teach you how to implement and deploy an end-to-end project through handson training in 6 hours. 3. Online Mini Courses Online mini courses teach a specific technology or development technique through hands-on training in 2 hours. DLI online training is designed as a self-paced learning experience for independent learners. Online training can be taken anytime from anywhere with access to a fully-configured GPUaccelerated workstation in the cloud. Get started with online training at www.nvidia.com/dlilabs. Certification Through built-in assessments, participants can earn certification to prove subject matter competency and support professional career growth. Certification is available for a handful of workshops and online courses today. Enterprise Training Solutions For organizations interested in transforming their workforce with deep learning and accelerated computing, DLI offers enterprise solutions that include hands-on online and onsite training for employees, executive briefings, and enterprise-level reporting. DLI enterprise solutions help organizations solve challenging problems, improve staff productivity, and become leaders in AI. Contact dli-request@nvidia.com for more information. 2

WORKSHOPS BY INDUSTRY AT A GLANCE FUNDAMENTALS OF DEEP LEARNING >>Fundamentals of Deep Learning for Computer Vision >>Fundamentals of Deep Learning for Multiple Data Types AUTONOMOUS VEHICLES >>Deep Learning for Autonomous Vehicles Perception GAME DEVELOPMENT AND DIGITAL CONTENT >>Deep Learning for Digital Content Creation using GANs and Autoencoders HEALTHCARE >>Deep Learning for Healthcare Image Analysis >>Deep Learning for Healthcare Genomics FINANCE >>Deep Learning for Finance Trading Strategy INTELLIGENT VIDEO ANALYTICS >>Deep Learning for Full Motion Video Analytics ACCELERATED COMPUTING >>Fundamentals of Accelerated Computing with CUDA C/C++ 3

ONLINE TRAINING BY INDUSTRY AT A GLANCE FUNDAMENTALS OF DEEP LEARNING Courses >>Fundamentals of Deep Learning for Computer Vision Mini Courses >>Image Classification with DIGITS >>Object Detection with DIGITS >>Neural Network Deployment with DIGITS and TensorRT >>Applications of Deep Learning with Caffe, Theano, and Torch >>Deep Learning Workflows with TensorFlow, MXNet, and NVIDIA-Docker >>Image Segmentation with TensorFlow >>Image Classification with Microsoft Cognitive Toolkit >>Linear Classification with TensorFlow >>Signal Processing with DIGITS GAME DEVELOPMENT AND DIGITAL CONTENT Mini Courses >>Image Creation using GANs with TensorFlow and DIGITS >>Image Style Transfer with Torch >>Rendered Image Denoising using Autoencoders HEALTHCARE Courses >>Deep Learning for Healthcare Image Analysis >>Deep Learning for Healthcare Genomics Mini Courses >>Modeling Time Series Data with Recurrent Neural Networks in Keras INTELLIGENT VIDEO ANALYTICS Mini Courses >>Object Detection for Full Motion Video >>Object Tracking for Large Scale Full Motion Video >>Deployment for Intelligent Video Analytics using TensorRT 4

ACCELERATED COMPUTING Courses >>Fundamentals of Accelerated Computing with CUDA C/C++ >>Fundamentals of Accelerated Computing with CUDA Python >>Fundamentals of Accelerated Computing with OpenACC Mini Courses >>Accelerated Applications with CUDA C/C++ >>GPU Memory Optimizations with C/C++ >>Accelerating Applications with GPU-Accelerated Libraries in C/C++ >>Using Thrust to Accelerate C++ >>Accelerating Applications with GPU-Accelerated Libraries in Python >>Accelerating Applications with CUDA Fortran >>GPU Memory Optimizations with Fortran >>Accelerating Applications with GPU-Accelerated Libraries in Fortran >>Introduction to Accelerated Computing >>Profile-Driven Approach to Accelerate Seismic Applications with OpenACC >>OpenACC 2X in 4 Steps >>Introduction to Multi-GPU Programming with MPI and OpenACC >>Advanced Multi-GPU Programming with MPI and OpenACC >>Profiling and Parallelizing with OpenACC >>Expressing Data Movement and Optimizing Loops with OpenACC >>Pipelining Work on the GPU with OpenACC 5

WORKSHOPS FUNDAMENTALS OF DEEP LEARNING FOR COMPUTER VISION PREREQUISITES: Technical background FRAMEWORKS: Caffe, DIGITS, Chinese Explore the fundamentals of deep learning by training neural networks and using results to improve performance and capabilities. In this course, you ll learn the basics of deep learning by training and deploying neural networks. You ll learn how to: >>Implement common deep learning workflows, such as image classification and object detection. >>Experiment with data, training parameters, network structure, and other strategies to increase performance and capability. >>Deploy your neural networks to start solving real-world problems. Upon completion, you ll be able to start solving problems on your own with deep learning. FUNDAMENTALS OF DEEP LEARNING FOR MULTIPLE DATA TYPES PREREQUISITES: Fundamentals of Deep Learning for Computer Vision or similar experience FRAMEWORKS: TensorFlow This course explores how convolutional and recurrent neural networks can be combined to generate effective descriptions of content within images and video clips. Learn how to train a network using TensorFlow and the MSCOCO dataset to generate captions from images and video by: >>Implementing deep learning workflows like image segmentation and text generation >>Comparing and contrasting data types, workflows, and frameworks >>Combining computer vision and natural language processing Upon completion, you ll be able to solve deep learning problems that require multiple types of data inputs. 6

DEEP LEARNING FOR AUTONOMOUS VEHICLES PERCEPTION PREREQUISITES: Fundamentals of Deep Learning for Computer Vision or similar experience FRAMEWORKS: TensorFlow, DIGITS, TensorRT In this course, you ll learn how to design, train, and deploy deep neural networks for autonomous vehicles using the NVIDIA DRIVE PX2 development platform. Learn how to: >>Integrate sensor input using the DriveWorks software stack >>Train a semantic segmentation neural network >>Optimize, validate, and deploy a trained neural network using TensorRT Upon completion, students will be able to create and optimize perception components for autonomous vehicles using DRIVE PX2. DEEP LEARNING FOR DIGITAL CONTENT CREATION USING GANS AND AUTOENCODERS PREREQUISITES: Fundamentals of Deep Learning for Computer Vision or similar experience FRAMEWORKS: TensorFlow, Theano, DIGITS Explore the latest techniques for designing, training, and deploying neural networks for digital content creation. You ll learn how to: >>Train a Generative Adversarial Network (GAN) to generate images >>Explore the architectural innovations and training techniques used to make arbitrary video style transfer >>Train your own denoiser for rendered images Upon completion, you ll be able to start creating digital assets using deep learning approaches. 7

DEEP LEARNING FOR HEALTHCARE IMAGE ANALYSIS PREREQUISITES: Fundamentals of Deep Learning for Computer Vision or similar experience FRAMEWORKS: Caffe, MXNet, TensorFlow This course explores how to apply Convolutional Neural Networks (CNNs) to MRI scans to perform a variety of medical tasks and calculations. You ll learn how to: >>Perform image segmentation on MRI images to determine the location of the left ventricle. >>Calculate ejection fractions by measuring differences between diastole and systole using CNNs applied to MRI scans to detect heart disease. >>Apply CNNs to MRI scans of LGGs to determine 1p/19q chromosome co-deletion status. Upon completion, you ll be able to apply CNNs to MRI scans to conduct a variety of medical tasks. DEEP LEARNING FOR HEALTHCARE GENOMICS PREREQUISITES: Fundamentals of Deep Learning for Computer Vision or similar experience FRAMEWORKS: Caffe, TensorFlow, Theano This course teaches you how to apply deep learning to detect chromosome co-deletion and search for motifs in genomic sequences. You ll learn how to: >>Understand the basics of Convolutional Neural Networks (CNNs) and how they work. >>Apply CNNs to MRI scans of LGGs to determine 1p/19q chromosome co-deletion status. >>Use the DragoNN toolkit to simulate genomic data and to search for motifs. Upon completion, you ll be able to: understand how CNNs work, evaluate MRI images using CNNs, and use real regulatory genomic data to research new motifs. 8

DEEP LEARNING FOR FINANCE TRADING STRATEGY PREREQUISITES: Fundamentals of Deep Learning for Computer Vision or similar experience FRAMEWORKS: TensorFlow Linear techniques like principal component analysis (PCA) are the workhorses of creating eigenportfolios for use in statistical arbitrage strategies. Other techniques using time series financial data are also prevalent. But now, trading strategies can be advanced with the power of deep neural networks. In this course, you ll learn how to: >>Prepare time series data and test network performance using training and test datasets >>Structure and train a LSTM network to accept vector inputs and make predictions >>Use the Autoencoder as anomaly detector to create an arbitrage strategy Upon completion, you ll be able to use time series financial data to make predictions and exploit arbitrage using neural networks. DEEP LEARNING FOR FULL MOTION VIDEO ANALYTICS PREREQUISITES: Fundamentals of Deep Learning for Computer Vision or similar experience FRAMEWORKS: TensorFlow Traffic cameras, drones, and aerial sensor platforms are collecting huge amounts of video footage, which requires advanced deep learning techniques to transform data into actionable insights. The first step in more complex deep learning workflows is detecting specific types of objects, which involves identification, classification, segmentation, prediction, and recommendation. In this course, you ll learn how to: >>Train and evaluate deep learning models using the TensorFlow Object Detection API >>Explore the strategies and trade-offs involved in developing high-quality neural network models for track moving objects in large-scale video datasets >>Optimize inference times using TensorRT for real-time applications Upon completion, you ll be able to deploy object detection and tracking networks to work on real-time, large-scale video streams. 9

FUNDAMENTALS OF ACCELERATED COMPUTING WITH CUDA C/C++ PREREQUISITES: Basic C/C++ competency The CUDA computing platform enables the acceleration of CPU-only applications to run on the world s fastest massively parallel GPUs. Experience C/C++ application acceleration by: >>Accelerating CPU-only applications to run their latent parallelism on GPUs >>Utilizing essential CUDA memory management techniques to optimize accelerated applications >>Exposing accelerated application potential for concurrency and exploiting it with CUDA streams >>Leveraging command line and visual profiling to guide and check your work Upon completion, you ll be able to accelerate and optimize existing C/C++ CPU-only applications using the most essential CUDA tools and techniques. You ll understand an iterative style of CUDA development that will allow you to ship accelerated applications fast. 10

ONLINE TRAINING FUNDAMENTALS OF DEEP LEARNING COURSES FUNDAMENTALS OF DEEP LEARNING FOR COMPUTER VISION PREREQUISITES: Technical background FRAMEWORKS: Caffe, DIGITS, Chinese PRICE: $90 Explore the fundamentals of deep learning by training neural networks and using results to improve performance and capabilities. In this course, you ll learn the basics of deep learning by training and deploying neural networks. You ll learn how to: >>Implement common deep learning workflows, such as image classification and object detection. >>Experiment with data, training parameters, network structure, and other strategies to increase performance and capability. >>Deploy your neural networks to start solving real-world problems. Upon completion, you ll be able to start solving problems on your own with deep learning. MINI COURSES IMAGE CLASSIFICATION WITH DIGITS PREREQUISITES: Technical background FRAMEWORKS: DIGITS, Chinese, Japanese PRICE: Free Learn how to train a deep neural network to recognize handwritten digits by loading image data into a training environment, choosing and training a network, testing with new data, and iterating to improve performance. 11

OBJECT DETECTION WITH DIGITS PREREQUISITES: Technical background FRAMEWORKS: DIGITS, Chinese PRICE: Free Learn how to detect objects using computer vision and deep learning by identifying a purposebuilt network and using end-to-end labeled data. NEURAL NETWORK DEPLOYMENT WITH DIGITS AND TENSORRT PREREQUISITES: Technical background FRAMEWORKS: DIGITS, TensorRT, Chinese Learn to deploy deep learning to applications that recognize and detect images in real-time. APPLICATIONS OF DEEP LEARNING WITH CAFFE, THEANO, AND TORCH PREREQUISITES: None FRAMEWORKS: Caffe, Theano, Torch Explore how deep learning works and how it will change the future of computing. DEEP LEARNING WORKFLOWS WITH TENSORFLOW, MXNET, AND NVIDIA DOCKER PREREQUISITES: Bash terminal familiarity FRAMEWORKS: TensorFlow and MXNet Learn how to use the NVIDIA Docker plug-in to containerize production-grade deep learning workflows using GPUs. 12

IMAGE SEGMENTATION WITH TENSORFLOW PREREQUISITES: Fundamentals of Deep Learning with Computer Vision or similar experience FRAMEWORKS: TensorFlow Learn to combine computer vision and natural language processing to describe scenes using deep learning. IMAGE CLASSIFICATION WITH MICROSOFT COGNITIVE TOOLKIT PREREQUISITES: None FRAMEWORKS: Microsoft Cognitive Toolkit Learn to train a neural network using the Microsoft Cognitive Toolkit framework. LINEAR CLASSIFICATION WITH TENSORFLOW PREREQUISITES: None FRAMEWORKS: TensorFlow Learn to make predictions from structured data using TensorFlow s TFLearn application programming interface (API). SIGNAL PROCESSING WITH DIGITS PREREQUISITES: Fundamentals of Deep Learning with Computer Vision or similar experience Learn how to classify both image and image-like data using deep learning by converting radio frequency (RF) signals into images to detect a weak signal corrupted by noise. 13

GAME DEVELOPMENT AND DIGITAL CONTENT MINI COURSES IMAGE CREATION USING GANS WITH TENSORFLOW AND DIGITS PREREQUISITES: Fundamentals of Deep Learning for Computer Vision or similar deep learning experience FRAMEWORKS: TensorFlow, DIGITS Discover how to train a generative adversarial network (GAN) to generate image content in DIGITS. IMAGE STYLE TRANSFER WITH TORCH PREREQUISITES: Fundamentals of Deep Learning for Computer Vision or similar deep learning experience FRAMEWORKS: Torch Learn how to transfer the look and feel of one image to another image by extracting distinct visual features using convolutional neural networks (CNNs). RENDERED IMAGE DENOISING USING AUTOENCODERS PREREQUISITES: Fundamentals of Deep Learning for Computer Vision or similar deep learning experience FRAMEWORKS: TensorFlow Explore how a neural network with an autoencoder can be used to dramatically speed up the removal of noise in ray-traced images. 14

HEALTHCARE COURSES DEEP LEARNING FOR HEALTHCARE IMAGE ANALYSIS PREREQUISITES: Fundamentals of Deep Learning for Computer Vision or similar experience FRAMEWORKS: Caffe, MXNet, TensorFlow PRICE: $90 This course explores how to apply Convolutional Neural Networks (CNNs) to MRI scans to perform a variety of medical tasks and calculations. You ll learn how to: >>Perform image segmentation on MRI images to determine the location of the left ventricle. >>Calculate ejection fractions by measuring differences between diastole and systole using CNNs applied to MRI scans to detect heart disease. >>Apply CNN s to MRI scans of LGGs to determine 1p/19q chromosome co-deletion status. Upon completion, you ll be able to apply CNNs to MRI scans to conduct a variety of medical tasks. DEEP LEARNING FOR HEALTHCARE GENOMICS PREREQUISITES: Fundamentals of Deep Learning for Computer Vision or similar experience FRAMEWORKS: Caffe, TensorFlow, Theano PRICE: $60 This course teaches you how to apply deep learning to detect chromosome co-deletion and search for motifs in genomic sequences. You ll learn how to: >>Understand the basics of Convolutional Neural Networks (CNNs) and how they work. >>Apply CNNs to MRI scans of LGGs to determine 1p/19q chromosome co-deletion status. >>Use the DragoNN toolkit to simulate genomic data and to search for motifs. Upon completion, you ll be able to: understand how CNNs work, evaluate MRI images using CNNs, and use real regulatory genomic data to research new motifs. 15

MINI COURSES MODELING TIME SERIES DATA WITH RECURRENT NEURAL NETWORKS IN KERAS PREREQUISITES: Fundamentals of Deep Learning for Computer Vision or similar deep learning experience FRAMEWORKS: Keras PRICE: Free Explore how to classify and forecast time-series data using recurrent neural networks (RNNs), such as modeling a patient s health over time. 16

INTELLIGENT VIDEO ANALYTICS MINI COURSES OBJECT DETECTION FOR FULL MOTION VIDEO PREREQUISITES: Fundamentals of Deep Learning for Computer Vision or similar deep learning experience FRAMEWORKS: TensorFlow Discover how to analyze video data by implementing object detection methods using deep learning. OBJECT TRACKING FOR LARGE SCALE FULL MOTION VIDEO PREREQUISITES: Object Detection for Full-Motion Video or similar deep learning experience FRAMEWORKS: TensorFlow Explore how to track moving objects in large-scale video datasets using a deep neural network model. DEPLOYMENT FOR INTELLIGENT VIDEO ANALYTICS USING TENSORRT PREREQUISITES: Fundamentals of Deep Learning for Computer Vision or similar deep learning experience FRAMEWORKS: TensorRT Learn how to use TensorRT to accelerate inferencing performance for neural networks. 17

ACCELERATED COMPUTING COURSES FUNDAMENTALS OF ACCELERATED COMPUTING WITH CUDA C/C++ PREREQUISITES: Basic C/C++ competency PRICE: $90 The CUDA computing platform enables the acceleration of CPU-only applications to run on the world s fastest massively parallel GPUs. Experience C/C++ application acceleration by: >>Accelerating CPU-only applications to run their latent parallelism on GPUs >>Utilizing essential CUDA memory management techniques to optimize accelerated applications >>Exposing accelerated application potential for concurrency and exploiting it with CUDA streams >>Leveraging command line and visual profiling to guide and check your work Upon completion, you ll be able to accelerate and optimize existing C/C++ CPU-only applications using the most essential CUDA tools and techniques. FUNDAMENTALS OF ACCELERATED COMPUTING WITH CUDA PYTHON PREREQUISITES: Basic Python and NumPy competency PRICE: $90 This course explores how to use Numba the just-in-time, type-specializing, Python function compiler to accelerate Python programs to run on massively parallel NVIDIA GPUs. You ll learn how to: >>Use Numba to compile CUDA kernels from NumPy universal functions (ufuncs) >>Use Numba to create and launch custom CUDA kernels >>Apply key GPU memory management techniques Upon completion, you ll be able to use Numba to compile and launch CUDA kernels to accelerate your Python applications on NVIDIA GPUs. 18

FUNDAMENTALS OF ACCELERATED COMPUTING WITH OPENACC PREREQUISITES: Basic C/C++ competency PRICE: $116 Learn the basics of OpenACC, a high-level programming language for programming on GPUs. This course is for anyone with some C/C++ experience who is interested in accelerating the performance of their applications beyond the limits of CPU-only programming. In this course, you ll learn: >>Four simple steps to accelerating your already existing application with OpenACC >>How to profile and optimize your OpenACC codebase >>How to program on multi-gpu systems by combining OpenACC with MPI Upon completion, you ll be able to build and optimize accelerated heterogeneous applications on multiple GPU clusters using a combination of OpenACC, CUDA-aware MPI, and NVIDIA profiling tools. MINI COURSES ACCELERATING APPLICATIONS WITH CUDA C/C++ PREREQUISITES: Basic C/C++ competency PRICE: Free Learn how to accelerate your C/C++ application using CUDA to harness the massively parallel power of NVIDIA GPUs. OPENACC 2X IN 4 STEPS PREREQUISITES: Basic C/C++ competency Learn how to accelerate C/C++ or Fortran applications using OpenACC to harness the massively parallel power of NVIDIA GPUs. 19

INTRODUCTION TO ACCELERATED COMPUTING PREREQUISITES: None Explore a variety of techniques for accelerating applications, including CUDA and OpenACC. GPU MEMORY OPTIMIZATIONS WITH C/C++ PREREQUISITES: Basic CUDA C/C++ competency Learn useful memory optimization techniques for programming with CUDA C/C++ on an NVIDIA GPU and how to use the NVIDIA Visual Profiler (NVVP) to support these optimizations. ACCELERATING APPLICATIONS WITH GPU-ACCELERATED LIBRARIES IN C/C++ PREREQUISITES: Basic CUDA C/C++ competency Learn how to accelerate your C/C++ application using CUDA-optimized libraries to harness the massively parallel power of NVIDIA GPUs. ACCELERATING APPLICATIONS WITH GPU-ACCELERATED LIBRARIES IN PYTHON PREREQUISITES: None Learn how to accelerate your Python application using CUDA-optimized libraries to harness the massively parallel power of NVIDIA GPUs. USING THRUST TO ACCELERATE C++ PREREQUISITES: Basic CUDA C/C++ competency Discover how to build GPU-accelerated applications in C/C++ that utilize the powerful Thrust library. 20

PROFILING AND PARALLELIZING WITH OPENACC PREREQUISITES: OpenACC 2X in 4 Steps PRICE: $10 Get hands-on experience with the first two steps of the OpenACC programming cycle. EXPRESSING DATA MOVEMENT AND OPTIMIZING LOOPS WITH OPENACC PREREQUISITES: OpenACC - 2X in 4 Steps, Profiling and Parallelizing with OpenACC PRICE: $15 Learn how to add data management and loop directives to optimize OpenACC accelerated code. INTRODUCTION TO MULTI-GPU PROGRAMMING WITH MPI AND OPENACC PREREQUISITES: OpenACC 2X in 4 Steps Explore how to program multi-gpu systems or GPU clusters using the Message Passing Interface (MPI) and OpenACC. ADVANCED MULTI-GPU PROGRAMMING WITH MPI AND OPENACC PREREQUISITES: Introduction to Multi-GPU Programming with MPI and OpenACC Learn how to improve multi-gpu MPI+OpenACC programs by overlapping communication with computation and handling noncontiguous halo updates. 21

PIPELINING WORK ON THE GPU WITH OPENACC PREREQUISITES: OpenACC - 2X in 4 Steps, Profiling and Parallelizing with OpenACC, and Expressing Data Movement and Optimizing Loops with OpenACC Learn how to optimize data copies in OpenACC applications to overlap with GPU computation using a simple technique called pipelining. PROFILE-DRIVEN APPROACH TO ACCELERATE SEISMIC APPLICATIONS WITH OPENACC PREREQUISITES: None Learn how to use PGI Profiler (PGPROF), a host and GPU profiling tool, with OpenACC to accelerate your C/C++ applications. ACCELERATING APPLICATIONS WITH CUDA FORTRAN PREREQUISITES: None Learn how to accelerate your Fortran application using CUDA to harness the massively parallel power of NVIDIA GPUs. GPU MEMORY OPTIMIZATIONS WITH CUDA FORTRAN PREREQUISITES: Accelerating Applications with CUDA Fortran Discover useful memory optimization techniques for programming with CUDA Fortran on an NVIDIA GPU and how to use the NVIDIA Visual Profiler (NVVP) to support these optimizations. 22

ACCELERATING APPLICATIONS WITH GPU-ACCELERATED LIBRARIES IN FORTRAN PREREQUISITES: Basic CUDA Fortran competency Learn how to accelerate your Fortran application using CUDA-optimized libraries. Visit www.nvidia.com/dli to get started. 23