Vision Acceleration. Launch Briefing October Neil Trevett Vice President Mobile Ecosystem, NVIDIA President, Khronos Group

Similar documents
Open Standard APIs for Augmented Reality

Open Standard APIs for Embedded Vision Processing

Open API Standards for Mobile Graphics, Compute and Vision Processing GTC, March 2014

Khronos and the Mobile Ecosystem

Open Standards for AR and VR Neil Trevett Khronos President NVIDIA VP Developer January 2018

WebGL Meetup GDC Copyright Khronos Group, Page 1

Next Generation OpenGL Neil Trevett Khronos President NVIDIA VP Mobile Copyright Khronos Group Page 1

AR Standards Update Austin, March 2012

Mobile AR Hardware Futures

Open Standards for Vision and AI Peter McGuinness NNEF WG Chair CEO, Highwai, Inc May 2018

Copyright Khronos Group Page 1. Vulkan Overview. June 2015

Copyright Khronos Group Page 1

Copyright Khronos Group Page 1

Khronos Overview The State of the Art in Open Standards for Visual Computing

Acceleration Standards for Mobile Augmented Reality

Update on Khronos Open Standard APIs for Vision Processing Neil Trevett Khronos President NVIDIA Vice President Mobile Ecosystem

The OpenVX Computer Vision and Neural Network Inference

Open Standards for Today s Gaming Industry

Accelerating Vision Processing

Copyright Khronos Group, Page 1. Khronos Overview. Taiwan, February 2012

Navigating the Vision API Jungle: Which API Should You Use and Why? Embedded Vision Summit, May 2015

Overview and AR/VR Roadmap

Press Briefing SIGGRAPH 2015 Neil Trevett Khronos President NVIDIA Vice President Mobile Ecosystem. Copyright Khronos Group Page 1

SIGGRAPH Briefing August 2014

SC24/WG9 Liaison Meeting

Press Briefing SIGGRAPH 2015 Neil Trevett Khronos President NVIDIA Vice President Mobile Ecosystem. Copyright Khronos Group Page 1

Fusing Sensors into Mobile Operating Systems & Innovative Use Cases

Standards for Vision Processing and Neural Networks

INTEGRATING COMPUTER VISION SENSOR INNOVATIONS INTO MOBILE DEVICES. Eli Savransky Principal Architect - CTO Office Mobile BU NVIDIA corp.

IMAGE AND VISION PROCESSING ON TEGRA K1. Elif Albuz

WebGL, WebCL and Beyond!

The Benefits of GPU Compute on ARM Mali GPUs

Khronos Connects Software to Silicon

Enabling a Richer Multimedia Experience with GPU Compute. Roberto Mijat Visual Computing Marketing Manager

Standards for WebVR. Neil Trevett. Khronos President Vice President Mobile Content,

Neil Trevett Vice President Mobile Ecosystem, NVIDIA President, Khronos Group. Copyright Khronos Group Page 1

Advanced Imaging Applications on Smart-phones Convergence of General-purpose computing, Graphics acceleration, and Sensors

Ecosystem Overview Neil Trevett Khronos President NVIDIA Vice President Developer

Silicon Acceleration APIs

Open Standards for Building Virtual and Augmented Realities. Neil Trevett Khronos President NVIDIA VP Developer Ecosystems

GTC 2013 March San Jose, CA The Smartest People. The Best Ideas. The Biggest Opportunities. Opportunities for Participation:

The State of Gaming APIs

OpenCL Press Conference

Neural Network Exchange Format

April 4-7, 2016 Silicon Valley VISIONWORKS A CUDA ACCELERATED COMPUTER VISION LIBRARY S6783. Elif Albuz, April 4, 2016

WebGL, WebCL and OpenCL

Introduction to OpenGL ES 3.0

Emerging Vision Technologies: Enabling a New Era of Intelligent Devices

Copyright Khronos Group Page 1

Copyright Khronos Group Page 1

Copyright Khronos Group, Page 1. OpenCL. GDC, March 2010

Next Generation Visual Computing

Copyright Khronos Group Page 1

Dave Shreiner, ARM March 2009

THE LEADER IN VISUAL COMPUTING

gltf Briefing September 2016 Copyright Khronos Group Page 1

GPGPU Applications. for Hydrological and Atmospheric Simulations. and Visualizations on the Web. Ibrahim Demir

Unleashing the benefits of GPU Computing with ARM Mali TM Practical applications and use-cases. Steve Steele, ARM

Vulkan 1.1 March Copyright Khronos Group Page 1

GTC Interaction Simplified. Gesture Recognition Everywhere: Gesture Solutions on Tegra

OpenCV on Zynq: Accelerating 4k60 Dense Optical Flow and Stereo Vision. Kamran Khan, Product Manager, Software Acceleration and Libraries July 2017

Completing the Multimedia Architecture

Copyright Khronos Group, Page 1

Copyright Khronos Group 2012 Page 1. OpenCL 1.2. August 2012

Bringing it all together: The challenge in delivering a complete graphics system architecture. Chris Porthouse

Graphics Technology Update

Khronos Updates GDC 2017 Neil Trevett Vice President Developer Ecosystem, NVIDIA President,

Developing a Reference Model for Augmented Reality. 5th International AR Standards Community Meeting 19 March 2012

OpenCL Overview. Shanghai March Neil Trevett Vice President Mobile Content, NVIDIA President, The Khronos Group

Multimedia in Mobile Phones. Architectures and Trends Lund

Copyright Khronos Group, Page 1

Copyright Khronos Group Page 1

Profiling and Debugging OpenCL Applications with ARM Development Tools. October 2014

Graphics and Imaging Architectures

Streaming Media. Advanced Audio. Erik Noreke Standardization Consultant Chair, OpenSL ES. Copyright Khronos Group, Page 1

EGLSTREAMS: INTEROPERABILITY FOR CAMERA, CUDA AND OPENGL. Debalina Bhattacharjee Sharan Ashwathnarayan

Prasanna Krishnaswamy Intel Platform Architect. Imaging Systems Design for Mixed Reality Scenarios

Vulkan Launch Webinar 18 th February Copyright Khronos Group Page 1

What's Next in Graphics APIs? SIGGRAPH Asia December 2014 Neil Trevett Khronos President NVIDIA VP Mobile

Kari Pulli Intel. Radhakrishna Giduthuri, AMD. Frank Brill NVIDIA. OpenVX Webinar. June 16, 2016

KHRONOS STANDARDS UPDATE. Neil Trevett, GTC, 26 th March 2018

OpenMAX AL, OpenSL ES

Towards the Consumerization of Smart Sensors

POWERVR MBX & SGX OpenVG Support and Resources

Qualcomm Snapdragon Technologies

Copyright Khronos Group, Page 1

Creating the Embedded Media Processing Ecosystem

Copyright Khronos Group Page 1

Standards Update. Copyright Khronos Group Page 1

More performance options

Embedded Software: Its Growing Influence on the Hardware world

Advantages of MIPI Interfaces in IoT Applications

MIPI Alliance Introduction & MIPI Camera Serial Interface Overview

The Mobile Advantage. Erik Noreke Independent Standardization Consultant Chair, OpenSL ES. Copyright Khronos Group, Page 1

Public Sensing Using Your Mobile Phone for Crowd Sourcing

Movit System G1 WIRELESS MOTION DEVICE SYSTEM

ARM Multimedia IP: working together to drive down system power and bandwidth

IGLOO AND SNOWBALL. Philippe Garnier Ecosystem program

The Path to Embedded Vision & AI using a Low Power Vision DSP. Yair Siegel, Director of Segment Marketing Hotchips August 2016

Ubiquitous and Context Aware Computing: Overview and Systems

Transcription:

Copyright Khronos Group 2014 - Page 1 Vision Acceleration Launch Briefing October 2014 Neil Trevett Vice President Mobile Ecosystem, NVIDIA President, Khronos Group

Copyright Khronos Group 2014 - Page 2 Khronos Connects Software to Silicon Open Consortium creating ROYALTY-FREE, OPEN STANDARD APIs for hardware acceleration Defining the roadmap for low-level silicon interfaces needed on every platform Graphics, compute, rich media, vision, sensor and camera processing Rigorous specifications AND conformance tests for crossvendor portability Acceleration APIs BY the Industry FOR the Industry http://accelerateyourworld.org/ Well over a BILLION people use Khronos APIs Every Day

Khronos Standards 3D Asset Handling - 3D authoring asset interchange - 3D asset transmission format with compression Visual Computing - 3D Graphics - Heterogeneous Parallel Computing Over 100 companies defining royalty-free APIs to connect software to silicon Acceleration in HTML5 Sensor Processing - 3D in browser no Plug-in - Heterogeneous computing for JavaScript - Vision Acceleration - Camera Control - Sensor Fusion Copyright Khronos Group 2014 - Page 3

Copyright Khronos Group 2014 - Page 4 Mobile Vision Acceleration = New Experiences Need for advanced sensors and the acceleration to process them Computational Photography and Videography Face, Body and Gesture Tracking 3D Scene/Object Reconstruction Augmented Reality

Copyright Khronos Group 2014 - Page 5 Visual Computing = Graphics PLUS Vision Vision Processing Imagery Enhanced sensor and vision capability deepens the interaction between real and virtual worlds Data Graphics Processing Real-time GPU Compute Research project on GPU-accelerated laptop High-Quality Reflections, Refractions, and Caustics in Augmented Reality and their Contribution to Visual Coherence P. Kán, H. Kaufmann, Institute of Software Technology and Interactive Systems, Vienna University of Technology, Vienna, Austria https://www.youtube.com/watch?v=i2mewvzzdaa

Copyright Khronos Group 2014 - Page 6 Vision Pipeline Challenges and Opportunities Growing Camera Diversity Capturing color, range and lightfields Diverse Vision Processors Driving for high performance and low power Sensor Proliferation Diverse sensor awareness of the user and surroundings Light / Proximity 2 cameras 3 microphones Touch Camera sensors >20MPix Novel sensor configurations Stereo pairs Plenoptic Arrays Active Structured Light Active TOF Multi-core CPUs Programmable GPUs DSPs and DSP arrays Camera ISPs Dedicated vision IP blocks 19 Position - GPS - WiFi (fingerprint) - Cellular trilateration - NFC/Bluetooth Beacons Accelerometer Magnetometer Gyroscope Pressure / Temp / Humidity Flexible sensor and camera control to generate required image stream Use best processing available for image stream processing with code portability Control/fuse vision data by/with all other sensor data on device

Vision Processing Power Efficiency Depth sensors = significant processing - Generate/use environmental information Wearables will need always-on vision - With smaller thermal limit / battery than phones! GPUs has x10 CPU imaging power efficiency - GPUs architected for efficient pixel handling Traditional cameras have dedicated hardware - ISP = Image Signal Processor on all SOCs today SOCs have space for more transistors - But can t turn on at same time = Dark Silicon Potential for dedicated sensor/vision silicon - Can trigger full CPU/GPU complex But how to program specialized processors? Performance and Functional Portability Power Efficiency X100 X10 X1 Dedicated Hardware Advanced Sensors GPU Compute Wearables Computation Flexibility Multi-core CPU Copyright Khronos Group 2014 - Page 7

Copyright Khronos Group 2014 - Page 8 OpenVX Power Efficient Vision Acceleration Out-of-the-Box vision acceleration framework - Enables low-power, real-time applications - Targeted at mobile and embedded platforms Functional Portability - Tightly defined specification - Full conformance tests Performance portability across diverse HW - Higher-level abstraction hides hardware details - ISPs, Dedicated hardware, DSPs and DSP arrays, GPUs, Multi-core CPUs Enables low-power, always-on acceleration - Can run solely on dedicated vision hardware - Does not require full SOC CPU/GPU complex to be powered on Application Application Application Application Vision Accelerator Vision Accelerator Vision Accelerator Vision Accelerator

Copyright Khronos Group 2014 - Page 9 OpenVX Graphs The Key to Efficiency Vision processing directed graphs for power and performance efficiency - Each Node can be implemented in software or accelerated hardware - Nodes may be fused by the implementation to eliminate memory transfers - Processing can be tiled to keep data entirely in local memory/cache VXU Utility Library for access to single nodes - Easy way to start using OpenVX by calling each node independently EGLStreams can provide data and event interop with other Khronos APIs - BUT use of other Khronos APIs are not mandated Native Camera Control OpenVX Node OpenVX Node OpenVX Node OpenVX Node Downstream Application Processing Example OpenVX Graph

Copyright Khronos Group 2014 - Page 10 OpenVX 1.0 Function Overview Core data structures - Images and Image Pyramids - Processing Graphs, Kernels, Parameters Image Processing - Arithmetic, Logical, and statistical operations - Multichannel Color and BitDepth Extraction and Conversion - 2D Filtering and Morphological operations - Image Resizing and Warping Core Computer Vision - Pyramid computation - Integral Image computation Feature Extraction and Tracking - Histogram Computation and Equalization - Canny Edge Detection - Harris and FAST Corner detection - Sparse Optical Flow Widely used extensions adopted into future versions of the core OpenVX Specification Is Extensible Khronos maintains extension registry OpenVX 1.0 defines framework for creating, managing and executing graphs Focused set of widely used functions that are readily accelerated Implementers can add functions as extensions

Copyright Khronos Group 2014 - Page 11 Example Graph - Stereo Machine Vision OpenVX Graph Camera 1 Stereo Rectify with Remap Compute Depth Map (User Node) Detect and track objects (User Node) Object coordinates Camera 2 Stereo Rectify with Remap Image Pyramid Compute Optical Flow Delay Tiling extension enables user nodes (extensions) to also optimally run in local memory

Copyright Khronos Group 2014 - Page 12 OpenVX and OpenCV are Complementary Governance Conformance Community driven open source with no formal specification No conformance tests for consistency and every vendor implements different subset Formal specification defined and implemented by hardware vendors Full conformance test suite / process creates a reliable acceleration platform Portability APIs can vary depending on processor Hardware abstracted for portability Scope Efficiency Very wide 1000s of imaging and vision functions Multiple camera APIs/interfaces Memory-based architecture Each operation reads and writes memory Tight focus on hardware accelerated functions for mobile vision Use external camera API Graph-based execution Optimizable computation, data transfer Use Case Rapid experimentation Production development & deployment

Copyright Khronos Group 2014 - Page 13 OpenVX Announcement Finalized OpenVX 1.0 specification released October 2014 - www.khronos.org/openvx Full conformance test suite and Adopters Program immediately available - $20K Adopters fee ($15K for members) working group reviews submitted results - Test suite exercises graph framework and functionality of each OpenVX 1.0 node - Approved Conformant implementations can use the OpenVX trademark Khronos working on open source sample implementation of OpenVX 1.0 - Expected release on GitHub by end of 2014

Copyright Khronos Group 2014 - Page 14 Khronos APIs for Vision Processing GPU Compute Shaders (OpenGL 4.X and OpenGL ES 3.1) Pervasively available on almost any mobile device or OS Easy integration into graphics apps no vision/compute API interop needed Program in GLSL not C Limited to acceleration on a single GPU General Purpose Heterogeneous Programming Framework Flexible, low-level access to any devices with OpenCL compiler Single programming and run-time framework for CPUs, GPUs, DSPs, hardware Open standard for any device or OS being used as backed by many languages and frameworks Needs full compiler stack and IEEE precision Out of the Box Vision Framework - Operators and graph framework library Can run some or all modes on dedicated hardware no compiler needed Higher-level abstraction means easier performance portability to diverse hardware Graph optimization opens up possibility of low-power, always-on vision acceleration Fixed set of operators but can be extended It is possible to use OpenCL or GLSL to build OpenVX Nodes on programmable devices!

Copyright Khronos Group 2014 - Page 15 NVIDIA VisionWorks is Integrating OpenVX VisionWorks library contains diverse vision and imaging primitives Will leverage OpenVX for optimized primitive execution Can extend VisionWorks nodes through GPU-accelerated primitives Applications and Middleware Provided with sample library of fully accelerated pipelines Vision Pipeline Samples Object Detection Classifier SLAM VisionWorks Primitives Corner Detection 3 rd Party Pipelines 3 rd Party VisionWorks Framework GPU Libraries CUDA Libraries Tegra K1

Copyright Khronos Group 2014 - Page 16 Khronos APIs for Augmented Reality AR needs not just advanced sensor processing, vision acceleration, computation and rendering - but also for all these subsystems to work efficiently together Audio Rendering MEMS Sensors Sensor Fusion Application on CPUs, GPUs and DSPs Precision timestamps on all sensor samples Vision Processing Advanced Camera Control and stream generation EGLStream - stream data between APIs 3D Rendering and Video Composition On GPU

Copyright Khronos Group 2014 - Page 17 Summary Khronos is building a trio of interoperating APIs for portable / power-efficient vision and sensor processing OpenVX 1.0 specification is now finalized and released - Full conformance tests and Adopters program immediately available - Khronos open source sample implementation by end of 2014 - First commercial implementations already close to shipping Any company is welcome to join Khronos to influence the direction of mobile and embedded vision processing! - $15K annual membership fee for access to all Khronos API working groups - Well-defined IP framework protects your IP and conformant implementations More Information - www.khronos.org - ntrevett@nvidia.com - @neilt3d

Background Material Copyright Khronos Group 2014 - Page 18

Copyright Khronos Group 2014 - Page 19 Need for Camera Control API - OpenKCAM Advanced control of ISP and camera subsystem with cross-platform portability - Generate sophisticated image stream for advanced imaging & vision apps No platform API currently fulfills all developer requirements - Portable access to growing sensor diversity: e.g. depth sensors and sensor arrays - Cross sensor synch: e.g. synch of camera and MEMS sensors - Advanced, high-frequency per-frame burst control of camera/sensor: e.g. ROI - Multiple input, output re-circulating streams with RAW, Bayer or YUV Processing Defines control of Sensor, Color Filter Array Lens, Flash, Focus, Aperture Auto Exposure (AE) Auto White Balance (AWB) Auto Focus (AF) Image Signal Processor (ISP) EGLStreams Image/Vision Applications

Copyright Khronos Group 2014 - Page 20 OpenKCAM is FCAM-based FCAM (2010) Stanford/Nokia, open source Capture stream of camera images with precision control - A pipeline that converts requests into image stream - All parameters packed into the requests - no visible state - Programmer has full control over sensor settings for each frame in stream Control over focus and flash - No hidden daemon running Control ISP - Can access supplemental statistics from ISP if available No global state - State travels with image requests - Every pipeline stage may have different state - Enables fast, deterministic state changes Khronos coordinating with MIPI on camera control and data formats

Sensor Industry Fragmentation Copyright Khronos Group 2014 - Page 21

Copyright Khronos Group 2014 - Page 22 Sensor Data Types Raw sensor data - Acceleration, Magnetic Field, Angular Rates - Pressure, Ambient Light, Proximity, Temperature, Humidity, RGB light, UV light - Heart rate, Blood Oxygen Level, Skin Hydration, Breathalyzer Fused sensor data - Orientation (Quaternion or Euler Angles) - Gravity, Linear Acceleration, Position Contextual awareness - Device Motion: general movement of the device: still, free-fall, - Carry: how the device is being held by a user: in pocket, in hand, - Posture: how the body holding the device is positioned: standing, sitting, step, - Transport: about the environment around the device: in elevator, in car,

Copyright Khronos Group 2014 - Page 23 Low-level Sensor Abstraction API Apps request semantic sensor information StreamInput defines possible requests, e.g. Read Physical or Virtual Sensors e.g. Game Quaternion Context detection e.g. Am I in an elevator? Apps Need Sophisticated Access to Sensor Data Without coding to specific sensor hardware Sensor Discoverability Sensor Code Portability Advanced Sensors Everywhere Multi-axis motion/position, quaternions, context-awareness, gestures, activity monitoring, health and environmental sensors StreamInput processing graph provides optimized sensor data stream High-value, smart sensor fusion middleware can connect to apps in a portable way Apps can gain magical situational awareness