Snapdragon NPE Overview

1 Snapdragon NPE Overview
March 2018, Linaro Connect Hong Kong
Mark Charlebois, Director, Engineering, Qualcomm Technologies, Inc.

2 Snapdragon Neural Processing Engine
Software accelerated runtime for the execution of deep neural networks on device.
- Efficient execution on Snapdragon
- Model framework/network support
- Developer tools
Available at: developer.qualcomm.com
What's new?
- Fixed and floating point optimizations
- Supports Caffe2, CNTK, MXNet
- New optimizations for networks
TensorFlow, the TensorFlow logo and any related marks are trademarks of Google Inc.

3 Elements of Snapdragon NPE SDK
API
- C++ library in binary form and header files
- Java library for Android integration
- C++ and Python API support for interacting with DLC
DLC
- Snapdragon NPE DNN model format
- Network is a collection of connected layers
- DNN models are stored in DLC files
Snapdragon NPE SDK Tools
- Model converters to create Snapdragon NPE compatible DNN models from popular training framework formats
- Optimization and debugging support tools
Support Assets
- Development host (x86 Ubuntu 14.04)
- User and reference documentation
- Tutorials and examples
- Benchmarking

4 Snapdragon NPE SW Diagram
[Diagram: Snapdragon NPE software stack. Recoverable labels, grouped approximately by layer:]
Applications: 3rd party apps
SDK productivity components: benchmarking, network debug tools, tutorial samples, user and reference docs
SDK API and core runtime: profiling, logging, model debug, networks runtime engine, User Defined Layers (UDL) API, UDL plugin, DL Container model loader
Compute runtimes: CPU; GPU (libOpenCL.so); DSP (libsnpe_[a,c]dsp*.so, libsnpe_dsp_*skel.so)
DNN model conversion tools: Caffe/2 -> DLC fixed, TensorFlow -> DLC fixed, Caffe/2 -> DLC float, TensorFlow -> DLC float
OS: Android and Linux (x86_64, Armv7, Armv7hf, AArch64), QuRT, drivers
HW: CPU, Adreno GPU, Hexagon DSP

5 Snapdragon NPE SDK
The SDK can be downloaded from Qualcomm Developer Network.
The NPE SDK supports Qualcomm Snapdragon 845, 820, 835, 625, 626, 650, 652, 653, 660, 630, 636, and 450, as well as the Qualcomm Snapdragon 820Am automotive platform and Qualcomm Snapdragon Flight.
For Qualcomm Adreno GPU support, libOpenCL.so must be present on device.
Toolchains:
- Android (armv7, aarch64): GCC and Clang toolchains
- Linux (armv7, armv7hf, aarch64, x86_64*): GCC on ARM, Clang on x86_64
* CPU only

6 NPE SDK Developer Tools
- snpe-net-run
- snpe-caffe-to-dlc
- snpe-caffe2-to-dlc
- snpe-tensorflow-to-dlc
- snpe-onnx-to-dlc*
- snpe-diagview
- snpe-dlc-info
- snpe-dlc-quantize
- snpe_bench.py
* Coming soon

7 Using the Snapdragon NPE

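8 Snapdragon NPE Workflow
[Diagram: workflow from trained model to on-device execution. Recoverable labels:]
Example networks: GoogleNet, Inception, SSD, AlexNet, ResNet, MobileNet, SqueezeNet, Faster RCNN
Frameworks shown: Caffe2, TensorFlow
User Defined Layer (UDL) enables prototyping of layers not yet supported.
TensorFlow, the TensorFlow logo and any related marks are trademarks of Google Inc.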

9 Input Image Formatting
In the Snapdragon NPE, images must be presented as a tensor of shape (height x width x channel), where channel is the fastest-changing dimension.
[Figure: Caffe format (NCHW) -> mean image -> NCHW -> convert -> SNPE format (NHWC); pixels are ordered (1,1), (1,2), ..., (H,W).]
See $SNPE_ROOT/models/alexnet/scripts/create_alexnet_raws.py in the SDK.
For the current Snapdragon NPE SDK release, N=1. Batch support is coming in a future release.
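The layout change described on this slide can be sketched in C++ as follows. This is a minimal illustration rather than code from the SDK: the helper name chwToHwc and the flat std::vector<float> image representation are assumptions made here, and a single image (N=1) is assumed, as the slide states.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper (not part of the SNPE SDK): reorders one image from
// channel-major (C x H x W) layout to (H x W x C) layout, so that channel
// becomes the fastest-changing dimension.
std::vector<float> chwToHwc(const std::vector<float>& chw,
                            std::size_t channels,
                            std::size_t height,
                            std::size_t width) {
    assert(chw.size() == channels * height * width);
    std::vector<float> hwc(chw.size());
    for (std::size_t c = 0; c < channels; ++c) {
        for (std::size_t h = 0; h < height; ++h) {
            for (std::size_t w = 0; w < width; ++w) {
                // Source index in C x H x W order.
                const std::size_t src = (c * height + h) * width + w;
                // Destination index in H x W x C order.
                const std::size_t dst = (h * width + w) * channels + c;
                hwc[dst] = chw[src];
            }
        }
    }
    return hwc;
}
```

After the conversion, all channel values of pixel (1,1) are adjacent in memory, followed by those of pixel (1,2), and so on up to (H,W).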

10 Quantized vs Non-Quantized Models
Non-quantized DLC files use 32-bit floating point representations of network parameters.
Quantized DLC files use 8-bit fixed point representations of network parameters and are smaller.
[Figure: a .dlc file and a quantized .dlc file feeding the DSP runtime (8-bit) and the CPU/GPU runtime (32-bit).]
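To make the size difference concrete, the sketch below maps 32-bit floats to 8-bit codes with a simple min/max affine scheme. It is a generic illustration only: the slide does not describe the exact encoding that snpe-dlc-quantize produces, and the names QuantParams, quantize and dequantize are hypothetical.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstdint>
#include <vector>

// Hypothetical parameters of a generic affine 8-bit encoding
// (an assumption, not the SNPE file format).
struct QuantParams {
    float min;    // real value represented by code 0
    float scale;  // real-value step between consecutive codes
};

// Derives the encoding range from the data and maps each float to 0..255.
std::vector<std::uint8_t> quantize(const std::vector<float>& values,
                                   QuantParams& params) {
    assert(!values.empty());
    const auto range = std::minmax_element(values.begin(), values.end());
    params.min = *range.first;
    const float span = *range.second - *range.first;
    // Guard against a zero span when all values are equal.
    params.scale = span > 0.0f ? span / 255.0f : 1.0f;

    std::vector<std::uint8_t> codes;
    codes.reserve(values.size());
    for (float v : values) {
        const float q = std::round((v - params.min) / params.scale);
        codes.push_back(
            static_cast<std::uint8_t>(std::min(255.0f, std::max(0.0f, q))));
    }
    return codes;
}

// Recovers an approximation of the original float from its 8-bit code.
float dequantize(std::uint8_t code, const QuantParams& params) {
    return params.min + params.scale * static_cast<float>(code);
}
```

In this scheme each parameter occupies one byte instead of four, at the cost of a rounding error of roughly half a step (scale / 2) per value.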

11 Making a Snapdragon NPE Enabled Application
App setup

    bool useUserSuppliedBuffers = false;

    // Set the runtime: use the GPU if it is available, otherwise the CPU
    static zdl::DlSystem::Runtime_t runtime =
        zdl::SNPE::SNPEFactory::isRuntimeAvailable(zdl::DlSystem::Runtime_t::GPU)
            ? zdl::DlSystem::Runtime_t::GPU
            : zdl::DlSystem::Runtime_t::CPU;

    // Load DLC container
    std::unique_ptr<zdl::DlContainer::IDlContainer> container =
        zdl::DlContainer::IDlContainer::open(dlcPath);

    // Build SNPE instance (udlBundle is assumed to be defined elsewhere)
    zdl::SNPE::SNPEBuilder snpeBuilder(container.get());
    std::unique_ptr<zdl::SNPE::SNPE> snpe =
        snpeBuilder.setOutputLayers({})
                   .setRuntimeProcessor(runtime)
                   .setUdlBundle(udlBundle)
                   .setUseUserSuppliedBuffers(useUserSuppliedBuffers)
                   .build();
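The GPU-else-CPU choice in the setup code generalizes to a preference list. The sketch below is self-contained and does not use the SDK headers: the Runtime enum and the isAvailable callback are hypothetical stand-ins for the SDK's runtime type and availability query.

```cpp
#include <cassert>
#include <functional>
#include <vector>

// Hypothetical stand-in for the SDK's runtime enumeration.
enum class Runtime { CPU, GPU, DSP };

// Returns the first runtime in `preferred` that `isAvailable` accepts,
// falling back to the CPU when none of them is available.
Runtime selectRuntime(const std::vector<Runtime>& preferred,
                      const std::function<bool(Runtime)>& isAvailable) {
    for (Runtime candidate : preferred) {
        if (isAvailable(candidate)) {
            return candidate;
        }
    }
    return Runtime::CPU;
}
```

Calling selectRuntime({Runtime::GPU}, query) reproduces the behavior of the slide's ternary expression; a longer list lets an application try several accelerators in order before settling on the CPU.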

12 Making a Snapdragon NPE Enabled Application
Running the network (ITensor)

    // Load the inputs
    std::unique_ptr<zdl::DlSystem::ITensor> inputTensor =
        loadInputTensor(snpe, fileLine); // See SDK docs
    static zdl::DlSystem::TensorMap outputTensorMap;

    // Run the network
    snpe->execute(inputTensor.get(), outputTensorMap);
    zdl::DlSystem::StringList tensorNames = outputTensorMap.getTensorNames();

    // Access the results
    std::for_each(tensorNames.begin(), tensorNames.end(), [&](const char* name) {
        auto tensorPtr = outputTensorMap.getTensor(name);
        for (auto it = tensorPtr->cbegin(); it != tensorPtr->cend(); ++it) {
            float f = *it;
            ...
        }
    });
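What happens to each float inside the loop depends on the network. For a classifier, one common step is to pick the index of the highest score. The sketch below assumes the output values have already been copied into a std::vector<float>; the function name topClass is hypothetical and is not part of the SDK.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <iterator>
#include <vector>

// Hypothetical post-processing step: returns the index of the largest
// score, i.e. the top-1 class of a classification output.
std::size_t topClass(const std::vector<float>& scores) {
    assert(!scores.empty());
    const auto best = std::max_element(scores.begin(), scores.end());
    return static_cast<std::size_t>(std::distance(scores.begin(), best));
}
```

If several entries share the maximum score, the first of them is returned.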

13 Making a SNPE Enabled Application
Running the network (UserBuffer)

    // Load the inputs
    loadInputUserBuffer(applicationInputBuffers, snpe, fileLine); // See SDK docs

    // Run the network
    snpe->execute(inputMap, outputMap);
    const zdl::DlSystem::StringList& outputBufferNames = outputMap.getUserBufferNames();

    // Access the results
    std::for_each(outputBufferNames.begin(), outputBufferNames.end(), [&](const char* name) {
        const auto& buffer = applicationOutputBuffers.at(name);
        const float* f;
        for (size_t i = 0; i < buffer.size(); i += sizeof(float)) {
            f = reinterpret_cast<const float*>(&buffer[i]);
            ...
        }
    });
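Reading floats out of a raw output buffer can also be done with a copy rather than a pointer cast. The sketch below assumes the application buffer is a std::vector of bytes holding packed 32-bit floats in native byte order; that layout and the helper name bytesToFloats are assumptions for illustration, not something the slide specifies.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical helper: interprets a raw byte buffer as packed 32-bit floats.
// std::memcpy is used instead of reinterpret_cast so that the read does not
// depend on the alignment of the byte buffer.
std::vector<float> bytesToFloats(const std::vector<std::uint8_t>& buffer) {
    assert(buffer.size() % sizeof(float) == 0);
    std::vector<float> values(buffer.size() / sizeof(float));
    if (!values.empty()) {
        std::memcpy(values.data(), buffer.data(), buffer.size());
    }
    return values;
}
```

The copy costs one extra pass over the data, which is usually negligible next to network execution time.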

14 Thank you!
Nothing in these materials is an offer to sell any of the components or devices referenced herein.
Qualcomm Technologies, Inc. and/or its affiliated companies. All Rights Reserved.
Qualcomm is a trademark of Qualcomm Incorporated, registered in the United States and other countries. Other products and brand names may be trademarks or registered trademarks of their respective owners.
References in this presentation to Qualcomm may mean Qualcomm Incorporated, Qualcomm Technologies, Inc., and/or other subsidiaries or business units within the Qualcomm corporate structure, as applicable. Qualcomm Incorporated includes Qualcomm's licensing business, QTL, and the vast majority of its patent portfolio. Qualcomm Technologies, Inc., a wholly-owned subsidiary of Qualcomm Incorporated, operates, along with its subsidiaries, substantially all of Qualcomm's engineering, research and development functions, and substantially all of its product and services businesses, including its semiconductor business, QCT.


More information

Profiling and Debugging OpenCL Applications with ARM Development Tools. October 2014

Profiling and Debugging OpenCL Applications with ARM Development Tools. October 2014 Profiling and Debugging OpenCL Applications with ARM Development Tools October 2014 1 Agenda 1. Introduction to GPU Compute 2. ARM Development Solutions 3. Mali GPU Architecture 4. Using ARM DS-5 Streamline

More information

Neural Network Exchange Format

Neural Network Exchange Format Copyright Khronos Group 2017 - Page 1 Neural Network Exchange Format Deploying Trained Networks to Inference Engines Viktor Gyenes, specification editor Copyright Khronos Group 2017 - Page 2 Outlook The

More information

Beyond Training The next steps of Machine Learning. Chris /in/chrisparsonsdev

Beyond Training The next steps of Machine Learning. Chris /in/chrisparsonsdev Beyond Training The next steps of Machine Learning Chris Parsons chrisparsons@uk.ibm.com @chrisparsonsdev /in/chrisparsonsdev What is this talk? Part 1 What is Machine Learning? AI Infrastructure PowerAI

More information

QPSI. Qualcomm Technologies Countermeasures Update

QPSI. Qualcomm Technologies Countermeasures Update QPSI Qualcomm Technologies Countermeasures Update 1 Introduction Sometime back in 2010 Let s have exploit countermeasures on our products Why? Hard to fix all bugs. We might as well make them more fun

More information

Hear, Listen now Speak

Hear, Listen now Speak Qualcomm Snapdragon Tech Summit 2017 #SnapdragonSummit Hear, Listen now Speak Hi-Fi Audio & Voice UI Evolution Ravi Satya Sr. Director, Product Management Qualcomm Technologies, Inc. 12/7/2017 @qualcomm

More information

ITU-R Handbook on Global Trends in IMT

ITU-R Handbook on Global Trends in IMT 1 Dr. Bienvenu AGBOKPONTO SOGLO Gov t Affairs Director, West and Central Africa ITU-R Handbook on Global Trends in IMT ATU Workshop on Implementation of WRC-15 Outcomes and Capacity Building on Development

More information

Bring Intelligence to the Edge with Intel Movidius Neural Compute Stick

Bring Intelligence to the Edge with Intel Movidius Neural Compute Stick Bring Intelligence to the Edge with Intel Movidius Neural Compute Stick Darren Crews Principal Engineer, Lead System Architect, Intel NTG Legal Disclaimer INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION

More information

Qualcomm AllPlay Smart Media Platform

Qualcomm AllPlay Smart Media Platform Presenter Name Presenter Title Qualcomm Connected Experiences, Inc. Qualcomm AllPlay Smart Media Platform Qualcomm AllPlay smart media platform 2014 ACE Award winner Internet of Things Technology, Service

More information

Deep Learning on Arm Cortex-M Microcontrollers. Rod Crawford Director Software Technologies, Arm

Deep Learning on Arm Cortex-M Microcontrollers. Rod Crawford Director Software Technologies, Arm Deep Learning on Arm Cortex-M Microcontrollers Rod Crawford Director Software Technologies, Arm What is Machine Learning (ML)? Artificial Intelligence Machine Learning Deep Learning Neural Networks Additional

More information

Relay: a high level differentiable IR. Jared Roesch TVMConf December 12th, 2018

Relay: a high level differentiable IR. Jared Roesch TVMConf December 12th, 2018 Relay: a high level differentiable IR Jared Roesch TVMConf December 12th, 2018!1 This represents months of joint work with lots of great folks:!2 TVM Stack Optimization Relay High-Level Differentiable

More information

CNN optimization. Rassadin A

CNN optimization. Rassadin A CNN optimization Rassadin A. 01.2017-02.2017 What to optimize? Training stage time consumption (CPU / GPU) Inference stage time consumption (CPU / GPU) Training stage memory consumption Inference stage

More information

Spectrum for 4G and 5G. Qualcomm Technologies, Inc. July, 2017

Spectrum for 4G and 5G. Qualcomm Technologies, Inc. July, 2017 Spectrum for 4G and 5G Qualcomm Technologies, Inc. July, 2017 Using all available spectrum types and spectrum bands Licensed spectrum Exclusive use Over 40 bands globally for LTE Shared spectrum New shared

More information

ITU Workshop Combating grey devices. Audrey Scozzaro Ferrazzini Standardisation and Industrial Policy Lead, EMENA Government Affairs 28 June 2016

ITU Workshop Combating grey devices. Audrey Scozzaro Ferrazzini Standardisation and Industrial Policy Lead, EMENA Government Affairs 28 June 2016 ITU Workshop Combating grey devices Audrey Scozzaro Ferrazzini Standardisation and Industrial Policy Lead, EMENA Government Affairs 28 June 2016 Security solutions Qualcomm s vision Theft Deterrence Enterprise

More information

Autonomous Driving Solutions

Autonomous Driving Solutions Autonomous Driving Solutions Oct, 2017 DrivePX2 & DriveWorks Marcus Oh (moh@nvidia.com) Sr. Solution Architect, NVIDIA This work is licensed under a Creative Commons Attribution-Share Alike 4.0 (CC BY-SA

More information

Speculations about Computer Architecture in Next Three Years. Jan. 20, 2018

Speculations about Computer Architecture in Next Three Years. Jan. 20, 2018 Speculations about Computer Architecture in Next Three Years shuchang.zhou@gmail.com Jan. 20, 2018 About me https://zsc.github.io/ Source-to-source transformation Cache simulation Compiler Optimization

More information

World s most advanced data center accelerator for PCIe-based servers

World s most advanced data center accelerator for PCIe-based servers NVIDIA TESLA P100 GPU ACCELERATOR World s most advanced data center accelerator for PCIe-based servers HPC data centers need to support the ever-growing demands of scientists and researchers while staying

More information

LINARO CONNECT 23 HKG18 George Grey, Linaro CEO

LINARO CONNECT 23 HKG18 George Grey, Linaro CEO LINARO CONNECT 23 HKG18 George Grey, Linaro CEO Welcome China Electronics Corporation (CEC) Joining Linaro as a Core Member Major Chinese Global IT Supplier China GCC Board Member 100+ companies, 14 listed

More information

Tutorial on Keras CAP ADVANCED COMPUTER VISION SPRING 2018 KISHAN S ATHREY

Tutorial on Keras CAP ADVANCED COMPUTER VISION SPRING 2018 KISHAN S ATHREY Tutorial on Keras CAP 6412 - ADVANCED COMPUTER VISION SPRING 2018 KISHAN S ATHREY Deep learning packages TensorFlow Google PyTorch Facebook AI research Keras Francois Chollet (now at Google) Chainer Company

More information

Deep learning in MATLAB From Concept to CUDA Code

Deep learning in MATLAB From Concept to CUDA Code Deep learning in MATLAB From Concept to CUDA Code Roy Fahn Applications Engineer Systematics royf@systematics.co.il 03-7660111 Ram Kokku Principal Engineer MathWorks ram.kokku@mathworks.com 2017 The MathWorks,

More information

CafeGPI. Single-Sided Communication for Scalable Deep Learning

CafeGPI. Single-Sided Communication for Scalable Deep Learning CafeGPI Single-Sided Communication for Scalable Deep Learning Janis Keuper itwm.fraunhofer.de/ml Competence Center High Performance Computing Fraunhofer ITWM, Kaiserslautern, Germany Deep Neural Networks

More information