Emerging Vision Technologies: Enabling a New Era of Intelligent Devices

Similar documents
Preparing for Mass Market Virtual Reality: A Mobile Perspective. Qualcomm Technologies, Inc. September 16, 2017

Qualcomm Snapdragon Technologies

Making XR a reality for everyone

The Mobile Future of extended Reality (XR) Hugo Swart Senior Director, Product Management Qualcomm Technologies, Inc.

New Technologies for UAV/UGV

Qualcomm Snapdragon 450 Mobile Platform

Immersion. Tim Leland Vice President, Product Management Qualcomm Technologies,

Perform. Travis Lanier Sr. Director, Product Management Qualcomm Technologies,

Making always-on vision a reality. Dr. Evgeni Gousev Sr. Director, Engineering Qualcomm Technologies, Inc. September 22,

Future Networked Car Geneva Auto Show 2018

Leading the world to 5G

The Future of Mobility. Keith Kressin Senior Vice President, Product Management Qualcomm Technologies,

IoT with 5G Technology

RISC-V: Opportunities and Challenges in SoCs

5G and Automotive Cellular Vehicle-to-Everything (C-V2X) March 2017

Welcome to the 5G age

VR Development Platform

Achieving on Mobile Devices

Ultra-low Power Always-On Computer Vision

Mobile: the foundation of the digital economy

TEGRA K1 AND THE AUTOMOTIVE INDUSTRY. Gernot Ziegler, Timo Stich

Applying Lessons Learned to V2X Communications for China

Making 5G NR a commercial reality

Machine Learning for Selected SI & PI Problems. Timothy Michalka Sr. Director, Engineering Qualcomm Technologies, Inc. 18-Oct-2017

Smartphone-powered future

Snapdragon NPE Overview

Global 5G spectrum update

5 GHz for consumers. Guillaume Lebrun Director 7 th June 2016

Heterogeneous Computing Made Easy:

Mobile technology: A catalyst for change. Kedar Kondap Vice President, Product Management Qualcomm Technologies, Inc.

5G Design and Technology. Durga Malladi SVP Engineering Qualcomm Technologies, Inc. October 19 th, 2016

Creating Affordable and Reliable Autonomous Vehicle Systems

5G Spectrum Access. Wassim Chourbaji. Vice President, Government Affairs and Public Policy EMEA Qualcomm Technologies Inc.

Qualcomm Snapdragon 710 mobile platform

Making Mobile 5G a Commercial Reality. Peter Carson Senior Director Product Marketing Qualcomm Technologies, Inc.

NVIDIA AUTOMOTIVE. Driving Innovation

802.11ax: Meeting the demands of modern networks. Gopi Sirineni, Vice President Qualcomm Technologies, Inc. April 19,

Making 5G NR a reality

Enabling a Richer Multimedia Experience with GPU Compute. Roberto Mijat Visual Computing Marketing Manager

Solid State LiDAR for Ubiquitous 3D Sensing

A New Foundation for the Connected Home. Qualcomm Technologies, Inc. June

Innovative Wireless Technologies for Mobile Broadband

Image processing techniques for driver assistance. Razvan Itu June 2014, Technical University Cluj-Napoca

Date: 13 June Location: Sophia Antipolis. Integrating the SIM. Dr. Adrian Escott. Qualcomm Technologies, Inc.

3D Reconstruction with Tango. Ivan Dryanovski, Google Inc.

Spectrum for 4G and 5G. Qualcomm Technologies, Inc. October, 2016

The Benefits of GPU Compute on ARM Mali GPUs

Prasanna Krishnaswamy Intel Platform Architect. Imaging Systems Design for Mixed Reality Scenarios

Autonomous Driving Solutions

Advanced Driver Assistance: Modular Image Sensor Concept

Hezi Saar, Sr. Staff Product Marketing Manager Synopsys. Powering Imaging Applications with MIPI CSI-2

Japan. December 13, C-V2X Trial in Japan. Qualcomm Technologies Inc.

Arm Technology in Automotive Geely Automotive Shanghai Innovation Center

Stable Vision-Aided Navigation for Large-Area Augmented Reality

W4. Perception & Situation Awareness & Decision making

Towards 5G NR Commercialization

Advanced IP solutions enabling the autonomous driving revolution

November, Qualcomm s 5G vision Qualcomm Technologies, Inc. and/or its affiliates.

Copyright Khronos Group Page 1

Making 5G NR a reality

Mixed-Reality for Intuitive Photo-Realistic 3D-Model Generation

Making an on-device personal assistant a reality

Comprehensive Arm Solutions for Innovative Machine Learning (ML) and Computer Vision (CV) Applications

The promise of higher spectrum bands for 5G. Rasmus Hellberg PhD Senior Director, Technical Marketing Qualcomm Technologies, Inc.

5G NR to high capacity and

Why the Self-Driving Revolution Hinges on one Enabling Technology: LiDAR

Advanced Driver Assistance Systems: A Cost-Effective Implementation of the Forward Collision Warning Module

Sensor Fusion: Potential, Challenges and Applications. Presented by KVH Industries and Geodetics, Inc. December 2016

Qualcomm WiPower Flexible Wireless Charging

Lecture 19: Depth Cameras. Visual Computing Systems CMU , Fall 2013

Servosila Robotic Heads

Artec Leo. A smart professional 3D scanner for a next-generation user experience

Smart Wearables: Enriching The Digital Sixth Sense. Pankaj Kedia Sr. Director & Business Lead, Smart Wearables Segment Qualcomm Technologies, Inc.

WebGL Meetup GDC Copyright Khronos Group, Page 1

Spectrum for 4G and 5G. Qualcomm Technologies, Inc. July, 2017

2016 Seoul DevU Pipeline Cache Object. Bill Licea-Kane Engineer, Senior Staff Qualcomm Technologies, Inc

Ceilbot vision and mapping system

Towards Fully-automated Driving. tue-mps.org. Challenges and Potential Solutions. Dr. Gijs Dubbelman Mobile Perception Systems EE-SPS/VCA

Enable AI on Mobile Devices

Testing Solution for VR apps Presented by Carlos Cárdenas (DEKRA)

Vulkan API 杨瑜, 资深工程师

PMD [vision] Day Vol. 3 Munich, November 18, PMD Cameras for Automotive & Outdoor Applications. ifm electronic gmbh, V.Frey. Dr.

Photoneo's brand new PhoXi 3D Camera is the highest resolution and highest accuracy area based 3D

VIA Mobile360 Surround View

A Modular Software Framework for Eye-Hand Coordination in Humanoid Robots

Building the 5G future Cristiano Amon President, Qualcomm Incorporated

Designing a software framework for automated driving. Dr.-Ing. Sebastian Ohl, 2017 October 12 th

MULTI-MODAL MAPPING. Robotics Day, 31 Mar Frank Mascarich, Shehryar Khattak, Tung Dang

Connected Car. Dr. Sania Irwin. Head of Systems & Applications May 27, Nokia Solutions and Networks 2014 For internal use

CSR102x Bluetooth Smart Product Line Overview

Artec Leo. A smart professional 3D scanner for a next-generation user experience

Artec Leo. A smart professional 3D scanner for a next-generation user experience

NeoNet: Object centric training for image recognition

Vehicle Localization. Hannah Rae Kerner 21 April 2015

Embedded Vision Solutions

3D Scanning. Qixing Huang Feb. 9 th Slide Credit: Yasutaka Furukawa

THE RISE OF LTE ADVANCED PRO

Artec Leo. A smart professional 3D scanner for a next-generation user experience

Positional tracking for VR/AR using Stereo Vision. Edwin AZZAM, CTO Stereolabs

Advanced Imaging Applications on Smart-phones Convergence of General-purpose computing, Graphics acceleration, and Sensors

Transcription:

Emerging Vision Technologies: Enabling a New Era of Intelligent Devices

Computer vision overview

Computer vision is being integrated in our daily lives Acquiring, processing, and understanding visual data in images, videos and the real world What kind of object is this? How far is the building? Who is this? Where are the cars in the scene? What kind of a scene is this? 3

Enabling vision applications for different ecosystems Example use cases Ecosystems Mobile Virtual Reality IP Camera Robotics/ Drones Automotive Use Cases Touch to focus HMD position location Face recognition Obstacle avoidance Pedestrian detection Computer vision features Touch-totrack 6-DOF positional tracking Face detection and recognition Simultaneous localization and mapping Object detection 4

Mobile use case

Touch-to-focus User selects which part of the image the camera should focus on Computer vision identifies key points within the selected region of interest Camera tracks the key points and informs the AF algorithms where to focus even while the region of interest moves 6

Touch-to-track Robust, low power, multiple object tracker Track Detect Learn Lucas-Kanade optical flow tracking Decision Forest and NCC detection Online learning bootstrapping binary classifier Benefits Multiple object tracking up to 4 objects Robust tracking algorithm tightly integrated with 3A and auto-zoom algorithms Low power and thermal object tracking by using hardware accelerated object tracker High performance tracking on 1080p @ 30fps 7

VR use case

VR will be the new paradigm for how we interact with the world Offering unprecedented experiences and unlimited possibilities Experiences in VR Play Learn Communicate Immersive movies and shows Live concerts, sports, and other events Interactive gaming and entertainment Immersive education Training and demos 3D design and art Social interactions Shared personal moments Empathetic storytelling 9

Precise motion tracking of head movements Through 6-DOF positional tracking Yaw Z Pitch Y Roll X 3 degrees of freedom (3-DOF) In which direction I look Detect rotational movement Main benefit: Look around the virtual world from a fixed point 6 degrees of freedom (6-DOF) Where I am and in which direction I look Detect rotational movement and translational movement Main benefit: Move freely in the virtual world and look around corners 10

Achieving precise head motion tracking on the device Visual inertial odometry (VIO) for rapid and accurate 6-DOF pose Monocular camera data Captured from tracking camera image sensor at ~30 fps Accelerometer & gyroscope data Sampled from external sensors at 800 / 1000 Hz Snapdragon VIO subsystem Camera feature processing Inertial data processing Hexagon DSP algorithms Camera and inertial sensor data fusion Continuous localization Accurate, high-rate pose generation & prediction 6-DOF position & orientation (aka 6-DOF pose ) New frame accurately displayed Hexagon is a products of Qualcomm Technologies, Inc. 11

IP camera use case

Face recognition A key vision use case for IP camera Keeping our homes safer Keeping our communities safer Home surveillance Family member Recognition and intruder detection Action camera Face recognition and visual tracking Professional surveillance Traffic/parking monitoring Capturing our important moments 13

Face recognition Using face detection and recognition Detection Recognition Detects faces in real time at low power Involves accurately comparing detected face with a library of known faces Benefits Robust recognition at a distance Real time identification (no cloud needed) Better off-angle recognition Take action by just touching the photos in the image 14

Drone use case

Obstacle avoidance Computer vision allows drones to map and avoid obstacles in their path, making navigation safer Key for many drone applications, including flying cameras, delivery, agriculture and public safety drones 16

Enabling obstacle avoidance for drones and robotics Depth map Obstacle map Trajector y info Depth Cameras Depth from Stereo Obstacle Mapping Path Planning Advanced Flight Control Motors 6DOF pose 6DOF pose 6DOF pose Downward Camera + Inertial Sensors Visual-Inertial Odometry 17

Automotive use case

Vision is enabling ADAS today and autonomous driving in the future Pedestrian detection Surround view system Driver monitoring/ distraction identification Camera 1 Blind spot detection Cross traffic alert Uses robust object detection and tracking Looks for specific patterns Required to work at a distance Rear collision warning Camera 4 Rear seat display Brought in device Inward facing camera Center Stack Camera 2 Pedestrian detection Rear seat display Brought in device Parking assistance Camera 3 Sensor 2 Lidar Radar Lane departure warning Source: Strategy Analytics, Feb. 2016 19

Pedestrian detection Using robust feature detection at a distance VGA (320 x 240) 720p HD (1280x720) 1080p Full HD (1920x1080) 4K (3840x2160) 8K (7680x4320) 60km/h Alert time 0.6s 1.2s 2.1s 4.2s 8.4s Distance from Camera Increase in compute requirements 10m 20m 35m 70m 140m 1X 3X 6.75X 27X 108X 20

Bringing CV to mobile devices is challenging

On-device processing for vision workloads is key Process data closest to the source, complement cloud Efficient use of network bandwidth Security and user privacy Reliability Low latency 22

Processing vision on mobile devices is a key challenge Compute intensive work loads in mobile constrained environment Visual perception workloads Constrained mobile environment Compute intensive Evolving requirements Storage and memory bandwidth limitations Battery powered Thermal efficiency 23

Qualcomm Technologies is tackling mobile vision challenges Example: High-resolution 3D Reconstruction on a mobile processor

How Qualcomm Technologies is solving mobile vision challenges Superior camera support with Spectra ISP Efficient image processing Flexible interfaces for 2D and 3D sensors Powerful Heterogeneous Snapdragon Processors Running the right algorithm on the right processing engine Process compute intensive CV features within power and thermal limits Optimized algorithmic support and availability Access to top tier CV algorithms with OpenCV and FastCV libraries FastCV provides a mobile optimized library for key CV functions Profiling tools to quickly identify performance bottlenecks Qualcomm Snapdragon, Qualcomm Hexagon, and Fluence are products of Qualcomm Technologies, Inc. 25

3D reconstruction block diagram Color + Depth (Structure light depth based generation) 3D mesh generation Scan starts User moves Tracking / alignment Computer vision based initial pose estimation Inertial motion sensor fusion Bundle adjustment Scan finishes User stops Color correction Use cases: 3D printing, social networking, gaming avatars, etc. HD texture generation Live 3D renderer/viewer 26

Spectra TM ISP enables 3D Reconstruction on mobile 1. Great interface support Connects 2D color sensors to Snapdragon via MIPI, enabling color information to be applied to 3D-reconstructed objects Allows various kinds of 3D depth sensors to connect to Snapdragon 2. Camera Synchronization Supports tight hardware and software synchronization of camera frames, facilitating multi-sensor frame alignment Spectra ISP is a product of Qualcomm Technologies, Inc. 27

Powerful heterogeneous Snapdragon processors Enabling high performance at low power and thermal Parallelism Partitioning 3DR algorithms across our heterogeneous engines LPDDR4 Memory Qualcomm Spectra ISP Qualcomm Snapdragon X12 LTE Modem Matching CV algorithms to appropriate processing engine Display Processor (DPU) Qualcomm Hexagon 680 DSP Video Processor (VPU) Achieving more work to be done per clock cycle, power savings, and reduced latency Qualcomm Adreno 530 GPU Qualcomm Kryo CPU Qualcomm Snapdragon is a product of Qualcomm Technologies, Inc. 28

Example: 3D Reconstruction on Snapdragon 820 Using heterogeneous computing framework to do all of this at 15 FPS RGB sensor processing Depth sensor interface LPDDR4 Memory Display Processor (DPU) Qualcomm Spectra ISP Video Processor (VPU) Qualcomm Snapdragon X12 LTE Modem Qualcomm Hexagon 680 DSP Depth extraction from structured light Point cloud rendering Texture mapping Shading Qualcomm Adreno 530 GPU Qualcomm Snapdragon, Spectra ISP, Hexagon, Adreno, Kryo are products of Qualcomm Technologies, Inc. Qualcomm Kryo CPU Pose estimation and tracking Bundle adjustment Visual and inertial sensor data fusion Mesh generation 29

3D Reconstruction stack diagram Apps (Java) 3DR Scanning Application Middleware (C++) Camera 2 API Depth Engine (DSP/HVX) JNI Interface 3D Scanner Engine (CPU/GPU) OpenCV FastCV OpenCV FastCV Drivers (C) Camera HAL OpenCL OpenGL Vulkan Hardware Drivers Hardware Spectra ISP Hexagon DSP Adreno GPU Kyro CPU 30

Ubiquitous deployment of visual intelligence Bee-sized flying cameras Intelligent cameras Adaptive self-driving cars Aerial, 360 virtual reality 31

We enable ubiquitous deployment of visual intelligence 1 Computer vision enables a broad range of applications for different market segments 2 On-device processing is key to ubiquitous adoption of vision in our daily life 3 Qualcomm Snapdragon brings CV to mobile devices at low power and thermal 4 Qualcomm Technologies is bringing our mobile vision to different ecosystems Qualcomm Snapdragon is a product of Qualcomm Technologies, Inc. 32

Thank you Follow us on: For more information, visit us at: www.qualcomm.com & www.qualcomm.com/blog Nothing in these materials is an offer to sell any of the components or devices referenced herein. 2016 Qualcomm Technologies, Inc. and/or its affiliated companies. All Rights Reserved. Qualcomm is a trademark of Qualcomm Incorporated, registered in the United States and other countries. Other products and brand names may be trademarks or registered trademarks of their respective owners. References in this presentation to Qualcomm may mean Qualcomm Incorporated, Qualcomm Technologies, Inc., and/or other subsidiaries or business units within the Qualcomm corporate structure, as applicable. Qualcomm Incorporated includes Qualcomm s licensing business, QTL, and the vast majority of its patent portfolio. Qualcomm Technologies, Inc., a wholly-owned subsidiary of Qualcomm Incorporated, operates, along with its subsidiaries, substantially all of Qualcomm s engineering, research and development functions, and substantially all of its product and services businesses, including its semiconductor business, QCT.