IMAGE AND VISION PROCESSING ON TEGRA K1. Elif Albuz

Size: px
Start display at page:

Download "IMAGE AND VISION PROCESSING ON TEGRA K1. Elif Albuz"

Transcription

1 IMAGE AND VISION PROCESSING ON TEGRA K1 Elif Albuz

2 IMAGE AND VISION USE CASES Driven by using camera as a sensor Computational Photography and Videography Face, Body and Gesture Tracking 3D Scene/Object Reconstruction Augmented Reality

3 Processing Demands MOBILE VISION COMPUTING Photography Input = 2D Camera Processors = ISP + CPU Product = Static Images Mobile Vision Computing Input = MEMS + Depth Camera Processors = ISP + CPU + GPU Result = Data for advanced user interface and environment modeling Computational Photography Input = MEMS + 2D Camera Processors = ISP + CPU + GPU Result = Enhance Images and Videos Time

4 VISION COMPUTING APP CATEGORIES Vision Computing 3D Reconstruction (constructs 3D geometry) Tracking (constructs positions and motions) Facial Modeling Object Reconstruction Face and gesture tracking Environmental Feature Tracking Body Modeling Scene Reconstruction Body Tracking Indoor/Outdoor Positional Tracking 3D Grid with SFM Ped/car detection & tracking

5 AUGMENTED REALITY HYPER-REALISM Ray-tracing and light-field calculations running today on CUDA laptop PC 50+ Watts Ongoing research to use depth cameras to reconstruct global illumination model in real-time Need on mobile devices at 100x less power = 0.5W High-Quality Reflections, Refractions, and Caustics in Augmented Reality and their Contribution to Visual Coherence P. Kán, H. Kaufmann, Institute of Software Technology and Interactive Systems, Vienna University of Technology, Vienna, Austria

6 SIMULTANEOUS LOCALIZATION AND MAPPING

7 WHAT IS NEEDED? Accelerated image & vision processing Tegra K1: CPU, GPU, ISP Camera flexibility and handling of new sensor types Android HAL V3, V4L Low latency routing of image streams to GPU EGLStreams Integrated handling of various sensors and camera Global Time Stamps Effective image & vision programming frameworks VisionWorks

8 TEGRA K1 PLATFORM Desktop GPGPU features and tools on mobile GPGPU Compute CUDA Tools and Libraries Advanced Graphics

9 THREE GENERATIONS OF REFINEMENT Tesla Fermi Kepler

10 Audio Processor ARM7 28 nm HPM 23x23mm, 0.7mm pitch HS-FCBGA Quad Cortex-A15 4x Cores (1+ GHz) NEON SIMD 2 MB L2 (Shared) ARM Trust Zone Shadow LP C-A15 CPU HD Video Processor 1080p24/30 Video Decode 1080p24/30 Video Encode H.264 MPEG4 VC1 MPEG2 VP8 TEGRA K1: A MAJOR LEAP FORWARD FOR MOBILE & EMBEDDED APPLICATIONS SATA2 x1 USB 2.0 x3 PCIe* G2 x4 + x1 Image Processor 25MP Sensor Support ISP 1080p60 Enhanced JPEG Engine Kepler GeForce GPU w/cuda OpenGL-ES nextgen 192 Stream Processors 2D Graphics/Scaling KEPLER GPU, 192 CORES CUDA USB 3.0* x2 CSI x4 + x4 UART x4 I2C x5 NOR Flash SPI x4 SDIO/MMC x4 DDR3 Ctlr 64b 800+ MHz Display x2 Security Engine HDMI edp/lvds DAP x5 (1 2 S/TDM) 12GB/S BANDWIDTH VIDEO IMAGE COMPOSITOR (VIC)

11 Audio Processor ARM7 CPU 28 nm HPM 23x23mm, 0.7mm pitch HS-FCBGA SATA2 x1 USB 2.0 x Quad Cortex-A15 4x Cores (1+ GHz) NEON SIMD 2 MB L2 (Shared) ARM Trust Zone Shadow LP C-A15 CPU Image Processor HD Video Processor 1080p24/30 Video Decode 1080p24/30 Video Encode H.264 MPEG4 VC1 MPEG2 VP8 Kepler GeForce GPU w/cuda Quad-core A15, NEON Unified memory, access from CPU and GPU PCIe* G2 x4 + x1 25MP Sensor Support ISP 1080p60 Enhanced JPEG Engine OpenGL-ES nextgen 192 Stream Processors 2D Graphics/Scaling low-power Shadow core USB 3.0* x2 UART x4 I2C x5 SPI x4 SDIO/MMC x4 Display x2 HDMI edp/lvds CSI x4 + x4 NOR Flash DDR3 Ctlr 64b 800+ MHz Security Engine DAP x5 (1 2 S/TDM)

12 Audio Processor ARM7 IMAGE PROCESSOR 28 nm HPM 23x23mm, 0.7mm pitch HS-FCBGA SATA2 x1 USB 2.0 x3 Quad Cortex-A15 4x Cores (1+ GHz) NEON SIMD 2 MB L2 (Shared) ARM Trust Zone Shadow LP C-A15 CPU x2 Image Processor HD Video Processor 1080p24/30 Video Decode 1080p24/30 Video Encode H.264 MPEG4 VC1 MPEG2 VP8 Kepler GeForce GPU w/cuda 2 ISPs, independently programmable 25Mp camera support (upto 250Mp) PCIe* G2 x4 + x1 25MP Sensor Support ISP 1080p60 Enhanced JPEG Engine OpenGL-ES nextgen 192 Stream Processors 2D Graphics/Scaling 1.2Gp throughput USB 3.0* x2 CSI x4 + x4 UART x4 I2C x5 NOR Flash SPI x4 SDIO/MMC x4 DDR3 Ctlr 64b 800+ MHz Display x2 Security Engine HDMI edp/lvds DAP x5 (1 2 S/TDM) GPGPU interoperability

13 Audio Processor ARM7 HD DEC/ENC PROCESSOR 28 nm HPM 23x23mm, 0.7mm pitch HS-FCBGA Quad Cortex-A15 4x Cores (1+ GHz) NEON SIMD 2 MB L2 (Shared) ARM Trust Zone Shadow LP C-A15 CPU HD Video Processor 1080p24/30 Video Decode 1080p24/30 Video Encode H.264 MPEG4 VC1 MPEG2 VP8 SATA2 x1 USB 2.0 x3 PCIe* G2 x4 + x1 USB 3.0* x2 CSI x4 + x4 Image Processor 25MP Sensor Support ISP 1080p60 Enhanced JPEG Engine UART x4 I2C x5 NOR Flash SPI x4 SDIO/MMC x4 DDR3 Ctlr 64b 800+ MHz Kepler GeForce GPU w/cuda OpenGL-ES nextgen 192 Stream Processors 2D Graphics/Scaling Display x2 Security Engine HDMI edp/lvds DAP x5 (1 2 S/TDM) VIDEO ENCODE/DECODE 2X 1920X1080@30FPS CUVID/CUVENC VIDEO ENCODE/DECODE INTERFACE MOTION ESTIMATION ONLY MODE

14 VIDEO ENCODER Dedicated hardware accelerator Current frame Ref. frame Encoder Compressed video (h.264,..) Motion Vectors/Track info

15 Audio Processor ARM7 GPU 28 nm HPM 23x23mm, 0.7mm pitch HS-FCBGA Quad Cortex-A15 4x Cores (1+ GHz) NEON SIMD 2 MB L2 (Shared) ARM Trust Zone Shadow LP C-A15 CPU HD Video Processor 1080p24/30 Video Decode 1080p24/30 Video Encode H.264 MPEG4 VC1 MPEG2 VP8 KEPLER Architecture 192 CUDA Cores, SM3.2 ISA Compatible to GeForce, Quadro, Tesla SATA2 x1 USB 2.0 x3 PCIe* G2 x4 + x1 Image Processor 25MP Sensor Support ISP 1080p60 Enhanced JPEG Engine Kepler GeForce GPU w/cuda OpenGL-ES nextgen 192 Stream Processors 2D Graphics/Scaling 64kb L1 Cache and Shared Memory 128kb L2 Cache 128 kb Register File USB 3.0* x2 UART x4 I2C x5 SPI x4 SDIO/MMC x4 Display x2 HDMI edp/lvds CSI x4 + x4 NOR Flash DDR3 Ctlr 64b 800+ MHz Security Engine DAP x5 (1 2 S/TDM)

16 WHAT IS NEEDED? Accelerated image & vision processing Tegra K1: CPU, GPU, ISP Camera flexibility and handling of new sensor types Android HAL V3, V4L, Camera API Low latency routing of image streams to GPU EGLStreams Integrated handling of various sensors and camera Global Time Stamps Effective image & vision programming frameworks VisionWorks

17 CAMERA IMAGE PROCESSING Camera ISP (Image Signal Processor) Little or no programmability Data flows thru compact hardware pipe Scan-line-based - no global memory Best perf/watt ~760 math Ops ~42K vals = 670Kb 300MHz ~250Gops

18 VISUAL SENSOR REVOLUTION Single RGB sensors just the start of mobile visual revolution IR sensors LEAP Motion, eye-trackers Active illumination depth sensors TOF and structured light Multi-sensors: Stereo pairs -> Plenoptic array -> Depth cameras Stereo pair can enable object scaling and enhanced depth extraction Plenoptic Field processing needs FFTs and ray-casting Hybrid visual sensing solutions Different sensors mixed for different distances and lighting conditions Dual Camera LG Electronics Plenoptic Array Pelican imaging Capri Structured Light 3D Camera PrimeSense

19 ANDROID CAMERA HAL V3 Camera HAL v1 focused on simplifying basic camera apps Difficult or impossible to do much else New features require proprietary driver extensions Extensions not portable - restricted growth of third party app ecosystem Camera HAL v3 is a fundamentally different API Flexible primitives for building sophisticated use-cases Interface is clean and easily extensible Apps can have more control, and more responsibility Enables sophisticated camera applications Faster time to market and higher quality

20 KHRONOS CAMERA API Specification available 2015 Also FCAM-Based Will be available for any OS FCAM running on NVIDIA Linux today No global state State travels with image requests Every stage in the pipeline may have different state Enables fast, deterministic state changes Synchronize devices Lens, flash, sound capture, gyro Devices can schedule Actions E.g. to be triggered on exposure change

21 KHRONOS CAMERA API REQUIREMENTS Application control over ISP processing (including 3A) Including multiple, re-entrant ISPs Control multiple sensors with synch and alignment E.g. Stereo pairs, Plenoptic arrays, TOF/structured light depth cameras Enhanced per frame detailed control Format flexibility, Region of Interest (ROI) selection Global timing & synchronization E.g. Between cameras and MEMS sensors Flexible processing/streaming Multiple input and output streams RAW, Bayer or YUV Processing Streaming of rows (not just frames) Enable new camera functionality not available on current platforms and align with future platform directions for easy adoption

22 TEGRA K1 CAMERA DATAFLOW Flexible routing of sensor data to ISPs and unified memory ISP-A Enables sensor processing by any combination of GPUs, CPUs and ISPs Camera One Camera Two Camera Routing Unified Memory CPU CPU CPU CPU Kepler GPGPU ISP-B

23 COMPUTATIONAL PHOTOGRAPHY Camera ISP (Image Signal Processor) Little or no programmability CPU GPU Single processor or Neon SIMD - running fast Makes heavy use of general memory Non-optimal performance and power Programmable and flexible Many way parallelism - run at lower frequency Efficient image caching close to processors BUT cycles frames in and out of memory ~760 math Ops ~42K vals = 670Kb 300MHz ~250Gops

24 WHAT IS NEEDED? Accelerated image & vision processing Tegra K1: CPU, GPU, ISP Camera flexibility and handling of new sensor types Android HAL V3, V4L Low latency routing of image streams to GPU EGLStreams Integrated handling of various sensors and camera Global Time Stamps Effective image & vision programming frameworks VisionWorks

25 EGL 1.5 RELEASED EGL 1.5 brings functionality from multiple extensions into core Increased reliability and portability EGLImages Sharing textures and renderbuffers Context Robustness Defending against malicious code EGLSync objects Improved OpenGL /OpenCL interop Platform extensions Standardized interactions for multiple OS e.g. Android and 64-bit platforms srgb colorspace rendering API Interop EGL provides efficient transfer of data and events between Khronos APIs Applications OS and Display Platforms Application Portability EGL abstracts graphics context management, surface and buffer binding and rendering synchronization

26 WHAT IS NEEDED? Accelerated image & vision processing Tegra K1: CPU, GPU, ISP Camera flexibility and handling of new sensor types Android HAL V3, V4L Low latency routing of image streams to GPU EGLStreams Integrated handling of various sensors and camera Global Time Stamps Effective image & vision programming frameworks VisionWorks

27 HOW MANY SENSORS ARE IN A SMARTPHONE? Light Proximity 2 cameras 3 microphones Touch Position GPS WiFi (fingerprint) Cellular (tri-lateration) NFC, Bluetooth (beacons) Accelerometer Magnetometer Gyroscope Pressure Temperature Humidity 19 27

28 STREAMINPUT SENSOR ABSTRACTION API Apps request semantic sensor information StreamInput defines possible requests, e.g. Read Physical or Virtual Sensors e.g. Game Quaternion Context detection e.g. Am I in an elevator? Apps Need Sophisticated Access to Sensor Data Without coding to specific sensor hardware Sensor Discoverability Sensor Code Portability Advanced Sensors Everywhere Multi-axis motion/position, quaternions, context-awareness, gestures, activity monitoring, health and environmental sensors StreamInput processing graph provides optimized sensor data stream High-value, smart sensor fusion middleware can connect to apps in a portable way Apps can gain magical situational awareness

29 WHAT IS NEEDED? Accelerated image & vision processing Tegra K1: CPU, GPU, ISP Camera flexibility and handling of new sensor types Android HAL V3, V4L Low latency routing of image streams to GPU EGLStreams Integrated handling of various sensors and camera Global Time Stamps Effective image & vision programming frameworks VisionWorks

30 MOBILE & EMBEDDED DEVELOPERS NEED HELP! Control, coordinate and synchronize a diverse array of mobile sensors Handle a diverse selection of emerging depth camera technologies Write maintainable code for a heterogeneous mix of CPUs, GPUs and DSPs Write code that is deployable across multiple devices, platforms and OS Leverage dedicated vision hardware for minimized power Create fluid 60Hz experiences on batterypowered mobile devices

31 DESKTOP TO MOBILE DEVELOPMENT Image & Vision Processing code development starts at desktop PC Tegra K1 enables easy migration through libraries and tools available across platforms

32 TEGRA K1 CUDA DEVELOPMENT CUDA-Aware Editor Automated CPU to GPU code refactoring Semantic highlighting of CUDA code Integrated code samples & docs Nsight Debugger Simultaneously debug of CPU and GPU Inspect variables across CUDA threads Use breakpoints & single-step debugging Nsight Profiler Quickly identifies performance issues Integrated expert system Source line correlation Cross platform development Native memcheck, GDB, nvprof

33 CUDA LIBRARIES VisionWorks NPP OpenCV CUFFT CUBLAS CUDA Math Lib

34 NPP LIBRARY (CUDA) Data exchange & initialization Set, Convert, CopyConstBorder, Copy, Transpose, SwapChannels Arithmetic & Logical Ops Add, Sub, Mul, Div, AbsDiff Threshold & Compare Threshold, Compare Color Conversion JPEG RGB To YCbCr (& vice versa), ColorTwist, LUT_Linear DCTQuantInv/Fwd, QuantizationTable Functions FilterBox, Row, Column, Max, Min, Median, Dilate, Erode, SumWindowColumn/Row Geometry Transforms Mirror, WarpAffine / Back/ Quad, WarpPerspective / Back / Quad, Resize Statistics Mean, StdDev, NormDiff, MinMax, Histogram, SqrIntegral, RectStdDev Computer Vision ApplyHaarClassifier, GraphCuts

35 OPENCV LIBRARY Initially developed by Intel for single-core x86 CPUs Version >900 functions (x the datatypes) OpenCV4Tegra - Accelerated CUDA+NEON+GLSL+TBB multithreading OpenCV Image processing General Image Processing Segmentation Machine Learning, Detection Image Pyramids Transforms Fitting Video, Stereo, and 3D Camera Calibration Features Depth Maps Optical Flow Inpainting Tracking

36 OPENCV-GPU VALUE ADD ON LOGAN Speedup 7 6 Jetson Kepler GPU /Quadcore A15 Public OCV for Mobile/Embedded core filter imgproc objdetect Average speedup for different function categories with Logan GPU compared to public source code.

37 VISIONWORKS MOTIVATION + = Advanced Silicon VisionWorks Simplify vision programming Fully optimized and accelerated Modular and Extensible Widespread vision processing in embedded, mobile and automotive devices and applications

38 VISIONWORKS Power Efficient Computer Vision Powered with CUDA Supported on Tegra K1 Linux and Android ACCELERATING Advanced Driver Assistance Computational Photography Augmented Reality Robotics Deep Learning and more Version 0.10 is available for registered partners! ADAS Advanced Driver Assistance Systems Augmented Reality Computational Photography Robotics

39 VISIONWORKS SOFTWARE STACK Application Code Applications use combination of direct primitives, the OpenVX framework and supplied pipelines NVIDIA provides sample pipelines for common use cases Object Detection Sample Pipelines SfM 3rd Party Pipelines NVIDIA supplied vision primitives using CUDA and Tegra processing resources Classifier VisionWorks Primitives Corner Detection 3 rd Party Primitives Customers and developers can create their own primitives e.g. using CUDA CUDA Framework OpenVX enables power efficient and flexible chaining of primitives Tegra K1

40 OPENVX POWER EFFICIENT VISION ACCELERATION Khronos, open, cross-vendor vision API Focus on mobile and embedded systems Application Foundational API for vision acceleration Useful for middleware or by applications Enables diverse efficient implementations OpenCV OpenCV open source library VisionWorks Sample Pipelines Complementary to OpenCV Which is great for prototyping Open source sample implementation Hardware vendor implementations

41 OPENVX GRAPHS THE KEY TO EFFICIENCY Directed graphs for processing power and efficiency Each Node can be implemented in software or accelerated hardware Nodes may be fused to eliminate memory transfers Processing can be tiled to keep data entirely in local memory/cache EGLStreams route data from camera and to application Can extend with VisionWorks nodes using CUDA Native Camera Control OpenVX Node VisionWorks Node OpenVX Node VisionWorks Node Application Example OpenVX Graph

42 OPENVX AND OPENCV ARE COMPLEMENTARY Governance Community driven open source with no formal specification Defined and implemented by Khronos Portability APIs can vary depending on processor Tegra K1 mobile platforms Scope Efficiency Very wide 1000s of imaging and vision functions Multiple camera APIs/interfaces Memory-based architecture Each operation reads and writes memory Tight focus on hardware accelerated functions for mobile vision Use external camera API Graph-based execution Optimizable computation, data transfer Use Case Rapid experimentation Production development & deployment

43 OPENVX 1.0 FUNCTION OVERVIEW Core data structures Images and Image Pyramids Processing Graphs, Kernels, Parameters Image Processing Arithmetic, Logical, and statistical operations Multichannel Color and BitDepth Extraction and Conversion 2D Filtering and Morphological operations Image Resizing and Warping Core Computer Vision Pyramid computation Integral Image computation Feature Extraction and Tracking Histogram Computation and Equalization Canny Edge Detection Harris and FAST Corner detection Sparse Optical Flow Widely used extensions adopted into future versions of the core OpenVX Specification Evolution OpenVX 1.0 defines framework for creating, managing and executing graphs Focused set of widely used functions that are readily accelerated Implementers can add functions as extensions

44 VISIONWORKS PRIMITIVES JAN 2014 Sobel Convolve Bilateral Filter Integral Image Integral Histogram Corner Harris Corner FAST Image Pyramid Optical Flow PyrLK Optical Flow Farneback Warp Perspective Hough Lines Fast NLM Denoising Stereo Block Matching IME (Iterative Motion Estimation) HOG (Histogram of Oriented Gradients) Soft Cascade Detector Object Tracker TLD Object Tracker SLAM Path Estimator MedianFlow Estimator

45 OPENVX AND CUDA ARE COMPLEMENTARY Use Case Architecture Target Hardware Precision Ease of Use GPGPU Programming Language-based Exposed architected memory model programmer manages memory Full IEEE floating point mandated General-purpose math and other libraries Domain targeted Vision processing Library-based - no separate compiler required Abstracted node and memory model - diverse implementations can be optimized for power and performance Minimal floating point requirements optimized for vision operators Fully implemented vision operators and framework out of the box Use CUDA to build new VisionWorks OpenVX Nodes

46 VISIONWORKS SAMPLE PIPELINES (V0.10) Structure From Motion/SLAM Pedestrian Detection Vehicle detection Object tracking Dense optical flow Active Shape Model Denoising

47 VISIONWORKS LOOKING FORWARD Enable multi-camera applications 3D sensors Conformance with OpenVX once specification finalized

48 TEGRA K1 DEVELOPMENT PLATFORMS Coming to Android K1 & Other Linux Devices soon.. JETSON X3 (TK1 PRO) gige, usb3.0, HDMI, CANBUS running Vibrante Linux AUTOMOTIVE GRADE

49 QUESTIONS?

50 BACKUP

INTEGRATING COMPUTER VISION SENSOR INNOVATIONS INTO MOBILE DEVICES. Eli Savransky Principal Architect - CTO Office Mobile BU NVIDIA corp.

INTEGRATING COMPUTER VISION SENSOR INNOVATIONS INTO MOBILE DEVICES. Eli Savransky Principal Architect - CTO Office Mobile BU NVIDIA corp. INTEGRATING COMPUTER VISION SENSOR INNOVATIONS INTO MOBILE DEVICES Eli Savransky Principal Architect - CTO Office Mobile BU NVIDIA corp. Computer Vision in Mobile Tegra K1 It s time! AGENDA Use cases categories

More information

Vision Acceleration. Launch Briefing October Neil Trevett Vice President Mobile Ecosystem, NVIDIA President, Khronos Group

Vision Acceleration. Launch Briefing October Neil Trevett Vice President Mobile Ecosystem, NVIDIA President, Khronos Group Copyright Khronos Group 2014 - Page 1 Vision Acceleration Launch Briefing October 2014 Neil Trevett Vice President Mobile Ecosystem, NVIDIA President, Khronos Group Copyright Khronos Group 2014 - Page

More information

TEGRA K1 AND THE AUTOMOTIVE INDUSTRY. Gernot Ziegler, Timo Stich

TEGRA K1 AND THE AUTOMOTIVE INDUSTRY. Gernot Ziegler, Timo Stich TEGRA K1 AND THE AUTOMOTIVE INDUSTRY Gernot Ziegler, Timo Stich Previously: Tegra in Automotive Infotainment / Navigation Digital Instrument Cluster Passenger Entertainment TEGRA K1 with Kepler GPU GPU:

More information

Open API Standards for Mobile Graphics, Compute and Vision Processing GTC, March 2014

Open API Standards for Mobile Graphics, Compute and Vision Processing GTC, March 2014 Open API Standards for Mobile Graphics, Compute and Vision Processing GTC, March 2014 Neil Trevett Vice President Mobile Ecosystem, NVIDIA President Khronos Copyright Khronos Group 2014 - Page 1 Khronos

More information

Open Standard APIs for Augmented Reality

Open Standard APIs for Augmented Reality Copyright Khronos Group 2014 - Page 1 Open Standard APIs for Augmented Reality Neil Trevett Vice President Mobile Ecosystem, NVIDIA President, Khronos Group Copyright Khronos Group 2014 - Page 2 Khronos

More information

Open Standard APIs for Embedded Vision Processing

Open Standard APIs for Embedded Vision Processing Copyright Khronos Group 2014 - Page 1 Open Standard APIs for Embedded Vision Processing Neil Trevett Vice President Mobile Ecosystem, NVIDIA President, Khronos Group Copyright Khronos Group 2014 - Page

More information

THE LEADER IN VISUAL COMPUTING

THE LEADER IN VISUAL COMPUTING MOBILE EMBEDDED THE LEADER IN VISUAL COMPUTING 2 TAKING OUR VISION TO REALITY HPC DESIGN and VISUALIZATION AUTO GAMING 3 BEST DEVELOPER EXPERIENCE Tools for Fast Development Debug and Performance Tuning

More information

April 4-7, 2016 Silicon Valley VISIONWORKS A CUDA ACCELERATED COMPUTER VISION LIBRARY S6783. Elif Albuz, April 4, 2016

April 4-7, 2016 Silicon Valley VISIONWORKS A CUDA ACCELERATED COMPUTER VISION LIBRARY S6783. Elif Albuz, April 4, 2016 April 4-7, 2016 Silicon Valley VISIONWORKS A CUDA ACCELERATED COMPUTER VISION LIBRARY S6783 Elif Albuz, April 4, 2016 Motivation Introduction to VisionWorks AGENDA VisionWorks Software Stack VisionWorks

More information

WebGL Meetup GDC Copyright Khronos Group, Page 1

WebGL Meetup GDC Copyright Khronos Group, Page 1 WebGL Meetup GDC 2012 Copyright Khronos Group, 2012 - Page 1 Copyright Khronos Group, 2012 - Page 2 Khronos API Ecosystem Trends Neil Trevett Vice President Mobile Content, NVIDIA President, The Khronos

More information

Mobile AR Hardware Futures

Mobile AR Hardware Futures Copyright Khronos Group, 2010 - Page 1 Mobile AR Hardware Futures Neil Trevett Vice President Mobile Content, NVIDIA President, The Khronos Group Two Perspectives NVIDIA - Tegra 2 mobile processor Khronos

More information

TEGRA K1 による GPU コンピューティング

TEGRA K1 による GPU コンピューティング TEGRA K1 による GPU コンピューティング COMPUTE WITH TEGRA K1 馬路徹 シニア ソリューション アーキテクト NVIDIA AGENDA Introducing Tegra K1 Tegra K1 Compute Software Capabilities OpenGL GLSL OpenCL CUDA/Unified Memory Google Renderscript

More information

Khronos and the Mobile Ecosystem

Khronos and the Mobile Ecosystem Copyright Khronos Group, 2011 - Page 1 Khronos and the Mobile Ecosystem Neil Trevett VP Mobile Content, NVIDIA President, Khronos Copyright Khronos Group, 2011 - Page 2 Topics It s not just about individual

More information

AR Standards Update Austin, March 2012

AR Standards Update Austin, March 2012 AR Standards Update Austin, March 2012 Neil Trevett President, The Khronos Group Vice President Mobile Content, NVIDIA Copyright Khronos Group, 2012 - Page 1 Topics Very brief overview of Khronos Update

More information

Advanced Imaging Applications on Smart-phones Convergence of General-purpose computing, Graphics acceleration, and Sensors

Advanced Imaging Applications on Smart-phones Convergence of General-purpose computing, Graphics acceleration, and Sensors Advanced Imaging Applications on Smart-phones Convergence of General-purpose computing, Graphics acceleration, and Sensors Sriram Sethuraman Technologist & DMTS, Ittiam 1 Overview Imaging on Smart-phones

More information

GTC 2013 March San Jose, CA The Smartest People. The Best Ideas. The Biggest Opportunities. Opportunities for Participation:

GTC 2013 March San Jose, CA The Smartest People. The Best Ideas. The Biggest Opportunities. Opportunities for Participation: GTC 2013 March 18-21 San Jose, CA The Smartest People. The Best Ideas. The Biggest Opportunities. Opportunities for Participation: SPEAK - Showcase your work among the elite of graphics computing - Call

More information

Enabling a Richer Multimedia Experience with GPU Compute. Roberto Mijat Visual Computing Marketing Manager

Enabling a Richer Multimedia Experience with GPU Compute. Roberto Mijat Visual Computing Marketing Manager Enabling a Richer Multimedia Experience with GPU Compute Roberto Mijat Visual Computing Marketing Manager 1 What is GPU Compute Operating System and most application processing continue to reside on the

More information

The OpenVX Computer Vision and Neural Network Inference

The OpenVX Computer Vision and Neural Network Inference The OpenVX Computer and Neural Network Inference Standard for Portable, Efficient Code Radhakrishna Giduthuri Editor, OpenVX Khronos Group radha.giduthuri@amd.com @RadhaGiduthuri Copyright 2018 Khronos

More information

Copyright Khronos Group, Page 1. Khronos Overview. Taiwan, February 2012

Copyright Khronos Group, Page 1. Khronos Overview. Taiwan, February 2012 Copyright Khronos Group, 2012 - Page 1 Khronos Overview Taiwan, February 2012 Copyright Khronos Group, 2012 - Page 2 Khronos - Connecting Software to Silicon Creating open, royalty-free API standards -

More information

Open Standards for Today s Gaming Industry

Open Standards for Today s Gaming Industry Copyright Khronos Group 2013 - Page 1 Open Standards for Today s Gaming Industry Erik Noreke VP of Business Development, Khronos Group Gaming Evolution Copyright Khronos Group 2013 - Page 3 DESKTOP MOBILE

More information

OpenCV on Zynq: Accelerating 4k60 Dense Optical Flow and Stereo Vision. Kamran Khan, Product Manager, Software Acceleration and Libraries July 2017

OpenCV on Zynq: Accelerating 4k60 Dense Optical Flow and Stereo Vision. Kamran Khan, Product Manager, Software Acceleration and Libraries July 2017 OpenCV on Zynq: Accelerating 4k60 Dense Optical Flow and Stereo Vision Kamran Khan, Product Manager, Software Acceleration and Libraries July 2017 Agenda Why Zynq SoCs for Traditional Computer Vision Automated

More information

The Benefits of GPU Compute on ARM Mali GPUs

The Benefits of GPU Compute on ARM Mali GPUs The Benefits of GPU Compute on ARM Mali GPUs Tim Hartley 1 SEMICON Europa 2014 ARM Introduction World leading semiconductor IP Founded in 1990 1060 processor licenses sold to more than 350 companies >

More information

Navigating the Vision API Jungle: Which API Should You Use and Why? Embedded Vision Summit, May 2015

Navigating the Vision API Jungle: Which API Should You Use and Why? Embedded Vision Summit, May 2015 Copyright Khronos Group 2015 - Page 1 Navigating the Vision API Jungle: Which API Should You Use and Why? Embedded Vision Summit, May 2015 Neil Trevett Khronos President NVIDIA Vice President Mobile Ecosystem

More information

Copyright Khronos Group Page 1. Vulkan Overview. June 2015

Copyright Khronos Group Page 1. Vulkan Overview. June 2015 Copyright Khronos Group 2015 - Page 1 Vulkan Overview June 2015 Copyright Khronos Group 2015 - Page 2 Khronos Connects Software to Silicon Open Consortium creating OPEN STANDARD APIs for hardware acceleration

More information

Next Generation Visual Computing

Next Generation Visual Computing Next Generation Visual Computing (Making GPU Computing a Reality with Mali ) Taipei, 18 June 2013 Roberto Mijat ARM Addressing Computational Challenges Trends Growing display sizes and resolutions Increasing

More information

Tesla GPU Computing A Revolution in High Performance Computing

Tesla GPU Computing A Revolution in High Performance Computing Tesla GPU Computing A Revolution in High Performance Computing Gernot Ziegler, Developer Technology (Compute) (Material by Thomas Bradley) Agenda Tesla GPU Computing CUDA Fermi What is GPU Computing? Introduction

More information

TR An Overview of NVIDIA Tegra K1 Architecture. Ang Li, Radu Serban, Dan Negrut

TR An Overview of NVIDIA Tegra K1 Architecture. Ang Li, Radu Serban, Dan Negrut TR-2014-17 An Overview of NVIDIA Tegra K1 Architecture Ang Li, Radu Serban, Dan Negrut November 20, 2014 Abstract This paperwork gives an overview of NVIDIA s Jetson TK1 Development Kit and its Tegra K1

More information

Copyright Khronos Group Page 1

Copyright Khronos Group Page 1 Gaming Market Briefing Overview of APIs GDC March 2016 Neil Trevett Khronos President NVIDIA Vice President Developer Ecosystem ntrevett@nvidia.com @neilt3d Copyright Khronos Group 2016 - Page 1 Copyright

More information

Acceleration Standards for Mobile Augmented Reality

Acceleration Standards for Mobile Augmented Reality Acceleration Standards for Mobile Augmented Reality Neil Trevett Khronos President Vice President Mobile Content, NVIDIA November 2012 Copyright Khronos Group 2012 Page 1 Copyright Khronos Group 2012 Page

More information

Khronos Overview The State of the Art in Open Standards for Visual Computing

Khronos Overview The State of the Art in Open Standards for Visual Computing Khronos Overview The State of the Art in Open Standards for Visual Computing Neil Trevett Khronos President Vice President Mobile Content, NVIDIA Copyright Khronos Group 2013 - Page 1 Copyright Khronos

More information

Deep Learning: Transforming Engineering and Science The MathWorks, Inc.

Deep Learning: Transforming Engineering and Science The MathWorks, Inc. Deep Learning: Transforming Engineering and Science 1 2015 The MathWorks, Inc. DEEP LEARNING: TRANSFORMING ENGINEERING AND SCIENCE A THE NEW RISE ERA OF OF GPU COMPUTING 3 NVIDIA A IS NEW THE WORLD S ERA

More information

Standards for Vision Processing and Neural Networks

Standards for Vision Processing and Neural Networks Copyright Khronos Group 2017 - Page 1 Standards for Vision Processing and Neural Networks Radhakrishna Giduthuri, AMD radha.giduthuri@ieee.org Agenda Why we need a standard? Khronos NNEF Khronos OpenVX

More information

GPGPU, 1st Meeting Mordechai Butrashvily, CEO GASS

GPGPU, 1st Meeting Mordechai Butrashvily, CEO GASS GPGPU, 1st Meeting Mordechai Butrashvily, CEO GASS Agenda Forming a GPGPU WG 1 st meeting Future meetings Activities Forming a GPGPU WG To raise needs and enhance information sharing A platform for knowledge

More information

CS427 Multicore Architecture and Parallel Computing

CS427 Multicore Architecture and Parallel Computing CS427 Multicore Architecture and Parallel Computing Lecture 6 GPU Architecture Li Jiang 2014/10/9 1 GPU Scaling A quiet revolution and potential build-up Calculation: 936 GFLOPS vs. 102 GFLOPS Memory Bandwidth:

More information

CSE 591/392: GPU Programming. Introduction. Klaus Mueller. Computer Science Department Stony Brook University

CSE 591/392: GPU Programming. Introduction. Klaus Mueller. Computer Science Department Stony Brook University CSE 591/392: GPU Programming Introduction Klaus Mueller Computer Science Department Stony Brook University First: A Big Word of Thanks! to the millions of computer game enthusiasts worldwide Who demand

More information

Fusing Sensors into Mobile Operating Systems & Innovative Use Cases

Fusing Sensors into Mobile Operating Systems & Innovative Use Cases Fusing Sensors into Mobile Operating Systems & Innovative Use Cases May 23, 2012 Tristan Joo (tristanjoo@wca.org) Board Director & Co-Chair of Mobile SIG Wireless Communications Alliance, Independent Executive

More information

WebGL, WebCL and Beyond!

WebGL, WebCL and Beyond! Copyright Khronos Group, 2011 - Page 1 WebGL, WebCL and Beyond! Neil Trevett Vice President Mobile Content, NVIDIA President, The Khronos Group Copyright Khronos Group, 2011 - Page 2 Topics in this Session

More information

A176 Cyclone. GPGPU Fanless Small FF RediBuilt Supercomputer. IT and Instrumentation for industry. Aitech I/O

A176 Cyclone. GPGPU Fanless Small FF RediBuilt Supercomputer. IT and Instrumentation for industry. Aitech I/O The A176 Cyclone is the smallest and most powerful Rugged-GPGPU, ideally suited for distributed systems. Its 256 CUDA cores reach 1 TFLOPS, and it consumes less than 17W at full load (8-10W at typical

More information

Image Contrast Adjustment using Nvidia Performance Primitives (NPP) Yang Song

Image Contrast Adjustment using Nvidia Performance Primitives (NPP) Yang Song Image Contrast Adjustment using Nvidia Performance Primitives (NPP) Yang Song Outline NPP Introduction Problem Statement Solution and Hands-on Coding Epilogue What is NPP? A library of image, signal and

More information

Update on Khronos Open Standard APIs for Vision Processing Neil Trevett Khronos President NVIDIA Vice President Mobile Ecosystem

Update on Khronos Open Standard APIs for Vision Processing Neil Trevett Khronos President NVIDIA Vice President Mobile Ecosystem Update on Khronos Open Standard APIs for Vision Processing Neil Trevett Khronos President NVIDIA Vice President Mobile Ecosystem Copyright Khronos Group 2015 - Page 1 Copyright Khronos Group 2015 - Page

More information

Multimedia in Mobile Phones. Architectures and Trends Lund

Multimedia in Mobile Phones. Architectures and Trends Lund Multimedia in Mobile Phones Architectures and Trends Lund 091124 Presentation Henrik Ohlsson Contact: henrik.h.ohlsson@stericsson.com Working with multimedia hardware (graphics and displays) at ST- Ericsson

More information

Efficient Video Processing on Embedded GPU

Efficient Video Processing on Embedded GPU Efficient Video Processing on Embedded GPU Tobias Kammacher Armin Weiss Matthias Frei Institute of Embedded Systems High Performance Multimedia Research Group Zurich University of Applied Sciences (ZHAW)

More information

Multimedia SoC System Solutions

Multimedia SoC System Solutions Multimedia SoC System Solutions Presented By Yashu Gosain & Forrest Picket: System Software & SoC Solutions Marketing Girish Malipeddi: IP Subsystems Marketing Agenda Zynq Ultrascale+ MPSoC and Multimedia

More information

Next Generation OpenGL Neil Trevett Khronos President NVIDIA VP Mobile Copyright Khronos Group Page 1

Next Generation OpenGL Neil Trevett Khronos President NVIDIA VP Mobile Copyright Khronos Group Page 1 Next Generation OpenGL Neil Trevett Khronos President NVIDIA VP Mobile Ecosystem @neilt3d Copyright Khronos Group 2015 - Page 1 Copyright Khronos Group 2015 - Page 2 Khronos Connects Software to Silicon

More information

EGLSTREAMS: INTEROPERABILITY FOR CAMERA, CUDA AND OPENGL. Debalina Bhattacharjee Sharan Ashwathnarayan

EGLSTREAMS: INTEROPERABILITY FOR CAMERA, CUDA AND OPENGL. Debalina Bhattacharjee Sharan Ashwathnarayan 53023 - EGLSTREAMS: INTEROPERABILITY FOR CAMERA, CUDA AND OPENGL Debalina Bhattacharjee Sharan Ashwathnarayan Tegra SOC and typical use-cases Why Interops EGLStream and Its Key Features Agenda Examples

More information

Our Technology Expertise for Software Engineering Services. AceThought Services Your Partner in Innovation

Our Technology Expertise for Software Engineering Services. AceThought Services Your Partner in Innovation Our Technology Expertise for Software Engineering Services High Performance Computing MultiCore CPU AceThought experts will re-design your sequential algorithms or applications to execute in parallel by

More information

General Purpose GPU Computing in Partial Wave Analysis

General Purpose GPU Computing in Partial Wave Analysis JLAB at 12 GeV - INT General Purpose GPU Computing in Partial Wave Analysis Hrayr Matevosyan - NTC, Indiana University November 18/2009 COmputationAL Challenges IN PWA Rapid Increase in Available Data

More information

Copyright Khronos Group Page 1

Copyright Khronos Group Page 1 Open Standards and Open Source Together How Khronos APIs Accelerate Fast and Cool Applications Neil Trevett Khronos President NVIDIA Vice President Mobile Ecosystem Copyright Khronos Group 2015 - Page

More information

Tesla GPU Computing A Revolution in High Performance Computing

Tesla GPU Computing A Revolution in High Performance Computing Tesla GPU Computing A Revolution in High Performance Computing Mark Harris, NVIDIA Agenda Tesla GPU Computing CUDA Fermi What is GPU Computing? Introduction to Tesla CUDA Architecture Programming & Memory

More information

SIGGRAPH Briefing August 2014

SIGGRAPH Briefing August 2014 Copyright Khronos Group 2014 - Page 1 SIGGRAPH Briefing August 2014 Neil Trevett VP Mobile Ecosystem, NVIDIA President, Khronos Copyright Khronos Group 2014 - Page 2 Significant Khronos API Ecosystem Advances

More information

Accelerating Vision Processing

Accelerating Vision Processing Accelerating Vision Processing Neil Trevett Vice President Mobile Ecosystem at NVIDIA President of Khronos and Chair of the OpenCL Working Group SIGGRAPH, July 2016 Copyright Khronos Group 2016 - Page

More information

CSE 591: GPU Programming. Introduction. Entertainment Graphics: Virtual Realism for the Masses. Computer games need to have: Klaus Mueller

CSE 591: GPU Programming. Introduction. Entertainment Graphics: Virtual Realism for the Masses. Computer games need to have: Klaus Mueller Entertainment Graphics: Virtual Realism for the Masses CSE 591: GPU Programming Introduction Computer games need to have: realistic appearance of characters and objects believable and creative shading,

More information

SC24/WG9 Liaison Meeting

SC24/WG9 Liaison Meeting Copyright Khronos Group, 2011 - Page 1 SC24/WG9 Liaison Meeting Seoul, November 2011 Neil Trevett Vice President Mobile Content, NVIDIA President, The Khronos Group Copyright Khronos Group, 2011 - Page

More information

Ecosystem Overview Neil Trevett Khronos President NVIDIA Vice President Developer

Ecosystem Overview Neil Trevett Khronos President NVIDIA Vice President Developer Ecosystem Overview Neil Trevett Khronos President NVIDIA Vice President Developer Ecosystem ntrevett@nvidia.com @neilt3d Copyright Khronos Group 2016 - Page 1 Khronos Mission Software Silicon Khronos is

More information

April 4-7, 2016 Silicon Valley

April 4-7, 2016 Silicon Valley April 4-7, 2016 Silicon Valley TEGRA PLATFORMS GAMING DRONES ROBOTICS IVA AUTOMOTIVE 2 Compile Debug Profile Trace C/C++ NVTX NVIDIA Tools extension Getting Started CodeWorks JetPack Installers IDE Integration

More information

TIOVX TI s OpenVX Implementation

TIOVX TI s OpenVX Implementation TIOVX TI s OpenVX Implementation Aish Dubey Product Marketing, Automotive Processors Embedded Vision Summit, 3 May 2017 1 TI SOC platform heterogeneous cores High level processing Object detection and

More information

Unleashing the benefits of GPU Computing with ARM Mali TM Practical applications and use-cases. Steve Steele, ARM

Unleashing the benefits of GPU Computing with ARM Mali TM Practical applications and use-cases. Steve Steele, ARM Unleashing the benefits of GPU Computing with ARM Mali TM Practical applications and use-cases Steve Steele, ARM 1 Today s Computational Challenges Trends Growing display sizes and resolutions, richer

More information

Emerging Vision Technologies: Enabling a New Era of Intelligent Devices

Emerging Vision Technologies: Enabling a New Era of Intelligent Devices Emerging Vision Technologies: Enabling a New Era of Intelligent Devices Computer vision overview Computer vision is being integrated in our daily lives Acquiring, processing, and understanding visual data

More information

NVIDIA CUDA Libraries

NVIDIA CUDA Libraries NVIDIA CUDA Libraries Ujval Kapasi*, Elif Albuz*, Philippe Vandermersch*, Nathan Whitehead*, Frank Jargstorff* San Jose Convention Center Sept 22, 2010 *NVIDIA NVIDIA CUDA Libraries Applications 3 rd Party

More information

GPGPUs in HPC. VILLE TIMONEN Åbo Akademi University CSC

GPGPUs in HPC. VILLE TIMONEN Åbo Akademi University CSC GPGPUs in HPC VILLE TIMONEN Åbo Akademi University 2.11.2010 @ CSC Content Background How do GPUs pull off higher throughput Typical architecture Current situation & the future GPGPU languages A tale of

More information

NVIDIA DESIGNWORKS Ankit Patel - Prerna Dogra -

NVIDIA DESIGNWORKS Ankit Patel - Prerna Dogra - NVIDIA DESIGNWORKS Ankit Patel - ankitp@nvidia.com Prerna Dogra - pdogra@nvidia.com 1 Autonomous Driving Deep Learning Visual Effects Virtual Desktops Visual Computing is our singular mission Gaming Product

More information

Software Driven Verification at SoC Level. Perspec System Verifier Overview

Software Driven Verification at SoC Level. Perspec System Verifier Overview Software Driven Verification at SoC Level Perspec System Verifier Overview June 2015 IP to SoC hardware/software integration and verification flows Cadence methodology and focus Applications (Basic to

More information

Completing the Multimedia Architecture

Completing the Multimedia Architecture Copyright Khronos Group, 2011 - Page 1 Completing the Multimedia Architecture Erik Noreke Chair of OpenSL ES Working Group Chair of OpenMAX AL Working Group Copyright Khronos Group, 2011 - Page 2 Today

More information

Introduction to the Tegra SoC Family and the ARM Architecture. Kristoffer Robin Stokke, PhD FLIR UAS

Introduction to the Tegra SoC Family and the ARM Architecture. Kristoffer Robin Stokke, PhD FLIR UAS Introduction to the Tegra SoC Family and the ARM Architecture Kristoffer Robin Stokke, PhD FLIR UAS Goals of Lecture To give you something concrete to start on Simple introduction to ARMv8 NEON programming

More information

Open Standards for AR and VR Neil Trevett Khronos President NVIDIA VP Developer January 2018

Open Standards for AR and VR Neil Trevett Khronos President NVIDIA VP Developer January 2018 Copyright Khronos Group 2018 - Page 1 Open Standards for AR and Neil Trevett Khronos President NVIDIA VP Developer Ecosystem ntrevett@nvidia.com @neilt3d January 2018 Khronos Mission E.g. OpenGL ES provides

More information

CUDA on ARM Update. Developing Accelerated Applications on ARM. Bas Aarts and Donald Becker

CUDA on ARM Update. Developing Accelerated Applications on ARM. Bas Aarts and Donald Becker CUDA on ARM Update Developing Accelerated Applications on ARM Bas Aarts and Donald Becker CUDA on ARM: a forward-looking development platform for high performance, energy efficient hybrid computing It

More information

Overview and AR/VR Roadmap

Overview and AR/VR Roadmap Khronos Group Inc. 2018 - Page 1 Overview and AR/ Roadmap Neil Trevett Khronos President NVIDIA VP Developer Ecosystems ntrevett@nvidia.com @neilt3d Khronos Group Inc. 2018 - Page 2 Khronos Connects Software

More information

Profiling and Debugging OpenCL Applications with ARM Development Tools. October 2014

Profiling and Debugging OpenCL Applications with ARM Development Tools. October 2014 Profiling and Debugging OpenCL Applications with ARM Development Tools October 2014 1 Agenda 1. Introduction to GPU Compute 2. ARM Development Solutions 3. Mali GPU Architecture 4. Using ARM DS-5 Streamline

More information

Enable AI on Mobile Devices

Enable AI on Mobile Devices Enable AI on Mobile Devices Scott Wang 王舒翀 Senior Segment Manager Mobile, BSG ARM Tech Forum 2017 14 th June 2017, Shenzhen AI is moving from core to edge Ubiquitous AI Safe and autonomous Mixed reality

More information

Accelerating Realism with the (NVIDIA Scene Graph)

Accelerating Realism with the (NVIDIA Scene Graph) Accelerating Realism with the (NVIDIA Scene Graph) Holger Kunz Manager, Workstation Middleware Development Phillip Miller Director, Workstation Middleware Product Management NVIDIA application acceleration

More information

GPGPU on Mobile Devices

GPGPU on Mobile Devices GPGPU on Mobile Devices Introduction Addressing GPGPU for very mobile devices Tablets Smartphones Introduction Why dedicated GPUs in mobile devices? Gaming Physics simulation for realistic effects 3D-GUI

More information

The Path to Embedded Vision & AI using a Low Power Vision DSP. Yair Siegel, Director of Segment Marketing Hotchips August 2016

The Path to Embedded Vision & AI using a Low Power Vision DSP. Yair Siegel, Director of Segment Marketing Hotchips August 2016 The Path to Embedded Vision & AI using a Low Power Vision DSP Yair Siegel, Director of Segment Marketing Hotchips August 2016 Presentation Outline Introduction The Need for Embedded Vision & AI Vision

More information

HPC with Multicore and GPUs

HPC with Multicore and GPUs HPC with Multicore and GPUs Stan Tomov Electrical Engineering and Computer Science Department University of Tennessee, Knoxville COSC 594 Lecture Notes March 22, 2017 1/20 Outline Introduction - Hardware

More information

CS8803SC Software and Hardware Cooperative Computing GPGPU. Prof. Hyesoon Kim School of Computer Science Georgia Institute of Technology

CS8803SC Software and Hardware Cooperative Computing GPGPU. Prof. Hyesoon Kim School of Computer Science Georgia Institute of Technology CS8803SC Software and Hardware Cooperative Computing GPGPU Prof. Hyesoon Kim School of Computer Science Georgia Institute of Technology Why GPU? A quiet revolution and potential build-up Calculation: 367

More information

NVIDIA AI BRAIN OF SELF DRIVING AND HD MAPPING. September 13, 2016

NVIDIA AI BRAIN OF SELF DRIVING AND HD MAPPING. September 13, 2016 NVIDIA AI BRAIN OF SELF DRIVING AND HD MAPPING September 13, 2016 AI FOR AUTONOMOUS DRIVING MAPPING KALDI LOCALIZATION DRIVENET Training on DGX-1 NVIDIA DGX-1 NVIDIA DRIVE PX 2 Driving with DriveWorks

More information

Adding Advanced Shader Features and Handling Fragmentation

Adding Advanced Shader Features and Handling Fragmentation Copyright Khronos Group, 2010 - Page 1 Adding Advanced Shader Features and Handling Fragmentation How to enable your application on a wide range of devices Imagination Technologies Copyright Khronos Group,

More information

Nvidia Jetson TX2 and its Software Toolset. João Fernandes 2017/2018

Nvidia Jetson TX2 and its Software Toolset. João Fernandes 2017/2018 Nvidia Jetson TX2 and its Software Toolset João Fernandes 2017/2018 In this presentation Nvidia Jetson TX2: Hardware Nvidia Jetson TX2: Software Machine Learning: Neural Networks Convolutional Neural Networks

More information

Embedded Computing without Compromise. Evolution of the Rugged GPGPU Computer Session: SIL7127 Dan Mor PLM -Aitech Systems GTC Israel 2017

Embedded Computing without Compromise. Evolution of the Rugged GPGPU Computer Session: SIL7127 Dan Mor PLM -Aitech Systems GTC Israel 2017 Evolution of the Rugged GPGPU Computer Session: SIL7127 Dan Mor PLM - Systems GTC Israel 2017 Agenda Current GPGPU systems NVIDIA Jetson TX1 and TX2 evaluation Conclusions New Products 2 GPGPU Product

More information

Technology for a better society. hetcomp.com

Technology for a better society. hetcomp.com Technology for a better society hetcomp.com 1 J. Seland, C. Dyken, T. R. Hagen, A. R. Brodtkorb, J. Hjelmervik,E Bjønnes GPU Computing USIT Course Week 16th November 2011 hetcomp.com 2 9:30 10:15 Introduction

More information

Arnon Peleg Visual Computing Products Management

Arnon Peleg Visual Computing Products Management Optimizing Visual Computing Applications on 3 rd Generation Intel Core Processor Family Intel Visual Computing Tools Helping Make Your Game & Media Applications Faster, Faster Arnon Peleg Visual Computing

More information

HEXIWEAR COMPLETE IOT DEVELOPMENT SOLUTION

HEXIWEAR COMPLETE IOT DEVELOPMENT SOLUTION HEXIWEAR COMPLETE IOT DEVELOPMENT SOLUTION NXP SEMICONDUCTORS PUBLIC THE ONLY SUPPLIER TO PROVIDE COMPLETE IoT SOLUTIONS DSPs, MCUs & CPUs Suite of Sensors NFC, BLE, Thread, zigbee, sub-ghz Wireless Interconnects

More information

Exploring System Coherency and Maximizing Performance of Mobile Memory Systems

Exploring System Coherency and Maximizing Performance of Mobile Memory Systems Exploring System Coherency and Maximizing Performance of Mobile Memory Systems Shanghai: William Orme, Strategic Marketing Manager of SSG Beijing & Shenzhen: Mayank Sharma, Product Manager of SSG ARM Tech

More information

Overview. Think Silicon is a privately held company founded in 2007 by the core team of Atmel MMC IC group

Overview. Think Silicon is a privately held company founded in 2007 by the core team of Atmel MMC IC group Nema An OpenGL & OpenCL Embedded Programmable Engine Georgios Keramidas & Iakovos Stamoulis Think Silicon mobile GRAPHICS Overview Think Silicon is a privately held company founded in 2007 by the core

More information

Simplify System Complexity

Simplify System Complexity Simplify System Complexity With the new high-performance CompactRIO controller Fanie Coetzer Field Sales Engineer Northern South Africa 2 3 New control system CompactPCI MMI/Sequencing/Logging FieldPoint

More information

Open Standards for Vision and AI Peter McGuinness NNEF WG Chair CEO, Highwai, Inc May 2018

Open Standards for Vision and AI Peter McGuinness NNEF WG Chair CEO, Highwai, Inc May 2018 Copyright Khronos Group 2018 - Page 1 Open Standards for Vision and AI Peter McGuinness NNEF WG Chair CEO, Highwai, Inc peter.mcguinness@gobrach.com May 2018 Khronos Mission E.g. OpenGL ES provides 3D

More information

GPGPU on ARM. Tom Gall, Gil Pitney, 30 th Oct 2013

GPGPU on ARM. Tom Gall, Gil Pitney, 30 th Oct 2013 GPGPU on ARM Tom Gall, Gil Pitney, 30 th Oct 2013 Session Description This session will discuss the current state of the art of GPGPU technologies on ARM SoC systems. What standards are there? Where are

More information

3D Graphics in Future Mobile Devices. Steve Steele, ARM

3D Graphics in Future Mobile Devices. Steve Steele, ARM 3D Graphics in Future Mobile Devices Steve Steele, ARM Market Trends Mobile Computing Market Growth Volume in millions Mobile Computing Market Trends 1600 Smart Mobile Device Shipments (Smartphones and

More information

Adhocracy Innovation with Imaging technology. Socionext Inc. Hiroyuki Komori July 5th, 2017

Adhocracy Innovation with Imaging technology. Socionext Inc. Hiroyuki Komori July 5th, 2017 Adhocracy Innovation with Imaging technology Socionext Inc. Hiroyuki Komori July 5th, 2017 Video Steaming in the world Video uploading : 65,000 clip/min, 300 hour/min on YouTube Video viewing : billions

More information

Designing GPU-accelerated applications with RTMaps (Real-Time Multisensor Applications) Framework and NVIDIA DriveWorks

Designing GPU-accelerated applications with RTMaps (Real-Time Multisensor Applications) Framework and NVIDIA DriveWorks MUNICH OCT 10-12, 2017 Designing GPU-accelerated applications with RTMaps (Real-Time Multisensor Applications) Framework and NVIDIA DriveWorks Xavier Rouah Lead Software Engineer Brief introduction about

More information

The State of Gaming APIs

The State of Gaming APIs Copyright Khronos Group, 2011 - Page 1 The State of Gaming APIs Neil Trevett Vice President Mobile Content, NVIDIA President, The Khronos Group Copyright Khronos Group, 2011 - Page 2 State of Gaming APIs

More information

Renderscript Accelerated Advanced Image and Video Processing on ARM Mali T-600 GPUs. Lihua Zhang, Ph.D. MulticoreWare Inc.

Renderscript Accelerated Advanced Image and Video Processing on ARM Mali T-600 GPUs. Lihua Zhang, Ph.D. MulticoreWare Inc. Renderscript Accelerated Advanced Image and Video Processing on ARM Mali T-600 GPUs Lihua Zhang, Ph.D. MulticoreWare Inc. lihua@multicorewareinc.com Overview More & more mobile apps are beginning to require

More information

S CUDA on Xavier

S CUDA on Xavier S8868 - CUDA on Xavier Anshuman Bhat CUDA Product Manager Saikat Dasadhikari CUDA Engineering 29 th March 2018 1 CUDA ECOSYSTEM 2018 CUDA DOWNLOADS IN 2017 3,500,000 CUDA REGISTERED DEVELOPERS 800,000

More information

Advances in GPU-based Image Processing and Computer Vision. James Fung

Advances in GPU-based Image Processing and Computer Vision. James Fung Advances in GPU-based Image Processing and Computer Vision James Fung Talk Objectives Introduce GPU hardware and features available for imaging & vision Present new algorithm mappings on the GPU GPU vision

More information

mbed OS Update Sam Grove Technical Lead, mbed OS June 2017 ARM 2017

mbed OS Update Sam Grove Technical Lead, mbed OS June 2017 ARM 2017 mbed OS Update Sam Grove Technical Lead, mbed OS June 2017 ARM mbed: Connecting chip to cloud Device software Device services Third-party cloud services IoT device application mbed Cloud Update IoT cloud

More information

Windowing System on a 3D Pipeline. February 2005

Windowing System on a 3D Pipeline. February 2005 Windowing System on a 3D Pipeline February 2005 Agenda 1.Overview of the 3D pipeline 2.NVIDIA software overview 3.Strengths and challenges with using the 3D pipeline GeForce 6800 220M Transistors April

More information

The rcuda middleware and applications

The rcuda middleware and applications The rcuda middleware and applications Will my application work with rcuda? rcuda currently provides binary compatibility with CUDA 5.0, virtualizing the entire Runtime API except for the graphics functions,

More information

Autonomous Driving Solutions

Autonomous Driving Solutions Autonomous Driving Solutions Oct, 2017 DrivePX2 & DriveWorks Marcus Oh (moh@nvidia.com) Sr. Solution Architect, NVIDIA This work is licensed under a Creative Commons Attribution-Share Alike 4.0 (CC BY-SA

More information

Graphics and Imaging Architectures

Graphics and Imaging Architectures Graphics and Imaging Architectures Kayvon Fatahalian http://www.cs.cmu.edu/afs/cs/academic/class/15869-f11/www/ About Kayvon New faculty, just arrived from Stanford Dissertation: Evolving real-time graphics

More information

NVIDIA GTX200: TeraFLOPS Visual Computing. August 26, 2008 John Tynefield

NVIDIA GTX200: TeraFLOPS Visual Computing. August 26, 2008 John Tynefield NVIDIA GTX200: TeraFLOPS Visual Computing August 26, 2008 John Tynefield 2 Outline Execution Model Architecture Demo 3 Execution Model 4 Software Architecture Applications DX10 OpenGL OpenCL CUDA C Host

More information

GPGPU Applications. for Hydrological and Atmospheric Simulations. and Visualizations on the Web. Ibrahim Demir

GPGPU Applications. for Hydrological and Atmospheric Simulations. and Visualizations on the Web. Ibrahim Demir GPGPU Applications for Hydrological and Atmospheric Simulations and Visualizations on the Web Ibrahim Demir Big Data We are collecting and generating data on a petabyte scale (1Pb = 1,000 Tb = 1M Gb) Data

More information

The Changing Face of Edge Compute

The Changing Face of Edge Compute The Changing Face of Edge Compute 2018 Arm Limited Alvin Yang Nov 2018 Market trends acceleration of technology deployment 26 years 4 years 100 billion chips shipped 100 billion chips shipped 1 Trillion

More information