IMAGE AND VISION PROCESSING ON TEGRA K1. Elif Albuz
|
|
- Sharyl Craig
- 6 years ago
- Views:
Transcription
1 IMAGE AND VISION PROCESSING ON TEGRA K1 Elif Albuz
2 IMAGE AND VISION USE CASES Driven by using camera as a sensor Computational Photography and Videography Face, Body and Gesture Tracking 3D Scene/Object Reconstruction Augmented Reality
3 Processing Demands MOBILE VISION COMPUTING Photography Input = 2D Camera Processors = ISP + CPU Product = Static Images Mobile Vision Computing Input = MEMS + Depth Camera Processors = ISP + CPU + GPU Result = Data for advanced user interface and environment modeling Computational Photography Input = MEMS + 2D Camera Processors = ISP + CPU + GPU Result = Enhance Images and Videos Time
4 VISION COMPUTING APP CATEGORIES Vision Computing 3D Reconstruction (constructs 3D geometry) Tracking (constructs positions and motions) Facial Modeling Object Reconstruction Face and gesture tracking Environmental Feature Tracking Body Modeling Scene Reconstruction Body Tracking Indoor/Outdoor Positional Tracking 3D Grid with SFM Ped/car detection & tracking
5 AUGMENTED REALITY HYPER-REALISM Ray-tracing and light-field calculations running today on CUDA laptop PC 50+ Watts Ongoing research to use depth cameras to reconstruct global illumination model in real-time Need on mobile devices at 100x less power = 0.5W High-Quality Reflections, Refractions, and Caustics in Augmented Reality and their Contribution to Visual Coherence P. Kán, H. Kaufmann, Institute of Software Technology and Interactive Systems, Vienna University of Technology, Vienna, Austria
6 SIMULTANEOUS LOCALIZATION AND MAPPING
7 WHAT IS NEEDED? Accelerated image & vision processing Tegra K1: CPU, GPU, ISP Camera flexibility and handling of new sensor types Android HAL V3, V4L Low latency routing of image streams to GPU EGLStreams Integrated handling of various sensors and camera Global Time Stamps Effective image & vision programming frameworks VisionWorks
8 TEGRA K1 PLATFORM Desktop GPGPU features and tools on mobile GPGPU Compute CUDA Tools and Libraries Advanced Graphics
9 THREE GENERATIONS OF REFINEMENT Tesla Fermi Kepler
10 Audio Processor ARM7 28 nm HPM 23x23mm, 0.7mm pitch HS-FCBGA Quad Cortex-A15 4x Cores (1+ GHz) NEON SIMD 2 MB L2 (Shared) ARM Trust Zone Shadow LP C-A15 CPU HD Video Processor 1080p24/30 Video Decode 1080p24/30 Video Encode H.264 MPEG4 VC1 MPEG2 VP8 TEGRA K1: A MAJOR LEAP FORWARD FOR MOBILE & EMBEDDED APPLICATIONS SATA2 x1 USB 2.0 x3 PCIe* G2 x4 + x1 Image Processor 25MP Sensor Support ISP 1080p60 Enhanced JPEG Engine Kepler GeForce GPU w/cuda OpenGL-ES nextgen 192 Stream Processors 2D Graphics/Scaling KEPLER GPU, 192 CORES CUDA USB 3.0* x2 CSI x4 + x4 UART x4 I2C x5 NOR Flash SPI x4 SDIO/MMC x4 DDR3 Ctlr 64b 800+ MHz Display x2 Security Engine HDMI edp/lvds DAP x5 (1 2 S/TDM) 12GB/S BANDWIDTH VIDEO IMAGE COMPOSITOR (VIC)
11 Audio Processor ARM7 CPU 28 nm HPM 23x23mm, 0.7mm pitch HS-FCBGA SATA2 x1 USB 2.0 x Quad Cortex-A15 4x Cores (1+ GHz) NEON SIMD 2 MB L2 (Shared) ARM Trust Zone Shadow LP C-A15 CPU Image Processor HD Video Processor 1080p24/30 Video Decode 1080p24/30 Video Encode H.264 MPEG4 VC1 MPEG2 VP8 Kepler GeForce GPU w/cuda Quad-core A15, NEON Unified memory, access from CPU and GPU PCIe* G2 x4 + x1 25MP Sensor Support ISP 1080p60 Enhanced JPEG Engine OpenGL-ES nextgen 192 Stream Processors 2D Graphics/Scaling low-power Shadow core USB 3.0* x2 UART x4 I2C x5 SPI x4 SDIO/MMC x4 Display x2 HDMI edp/lvds CSI x4 + x4 NOR Flash DDR3 Ctlr 64b 800+ MHz Security Engine DAP x5 (1 2 S/TDM)
12 Audio Processor ARM7 IMAGE PROCESSOR 28 nm HPM 23x23mm, 0.7mm pitch HS-FCBGA SATA2 x1 USB 2.0 x3 Quad Cortex-A15 4x Cores (1+ GHz) NEON SIMD 2 MB L2 (Shared) ARM Trust Zone Shadow LP C-A15 CPU x2 Image Processor HD Video Processor 1080p24/30 Video Decode 1080p24/30 Video Encode H.264 MPEG4 VC1 MPEG2 VP8 Kepler GeForce GPU w/cuda 2 ISPs, independently programmable 25Mp camera support (upto 250Mp) PCIe* G2 x4 + x1 25MP Sensor Support ISP 1080p60 Enhanced JPEG Engine OpenGL-ES nextgen 192 Stream Processors 2D Graphics/Scaling 1.2Gp throughput USB 3.0* x2 CSI x4 + x4 UART x4 I2C x5 NOR Flash SPI x4 SDIO/MMC x4 DDR3 Ctlr 64b 800+ MHz Display x2 Security Engine HDMI edp/lvds DAP x5 (1 2 S/TDM) GPGPU interoperability
13 Audio Processor ARM7 HD DEC/ENC PROCESSOR 28 nm HPM 23x23mm, 0.7mm pitch HS-FCBGA Quad Cortex-A15 4x Cores (1+ GHz) NEON SIMD 2 MB L2 (Shared) ARM Trust Zone Shadow LP C-A15 CPU HD Video Processor 1080p24/30 Video Decode 1080p24/30 Video Encode H.264 MPEG4 VC1 MPEG2 VP8 SATA2 x1 USB 2.0 x3 PCIe* G2 x4 + x1 USB 3.0* x2 CSI x4 + x4 Image Processor 25MP Sensor Support ISP 1080p60 Enhanced JPEG Engine UART x4 I2C x5 NOR Flash SPI x4 SDIO/MMC x4 DDR3 Ctlr 64b 800+ MHz Kepler GeForce GPU w/cuda OpenGL-ES nextgen 192 Stream Processors 2D Graphics/Scaling Display x2 Security Engine HDMI edp/lvds DAP x5 (1 2 S/TDM) VIDEO ENCODE/DECODE 2X 1920X1080@30FPS CUVID/CUVENC VIDEO ENCODE/DECODE INTERFACE MOTION ESTIMATION ONLY MODE
14 VIDEO ENCODER Dedicated hardware accelerator Current frame Ref. frame Encoder Compressed video (h.264,..) Motion Vectors/Track info
15 Audio Processor ARM7 GPU 28 nm HPM 23x23mm, 0.7mm pitch HS-FCBGA Quad Cortex-A15 4x Cores (1+ GHz) NEON SIMD 2 MB L2 (Shared) ARM Trust Zone Shadow LP C-A15 CPU HD Video Processor 1080p24/30 Video Decode 1080p24/30 Video Encode H.264 MPEG4 VC1 MPEG2 VP8 KEPLER Architecture 192 CUDA Cores, SM3.2 ISA Compatible to GeForce, Quadro, Tesla SATA2 x1 USB 2.0 x3 PCIe* G2 x4 + x1 Image Processor 25MP Sensor Support ISP 1080p60 Enhanced JPEG Engine Kepler GeForce GPU w/cuda OpenGL-ES nextgen 192 Stream Processors 2D Graphics/Scaling 64kb L1 Cache and Shared Memory 128kb L2 Cache 128 kb Register File USB 3.0* x2 UART x4 I2C x5 SPI x4 SDIO/MMC x4 Display x2 HDMI edp/lvds CSI x4 + x4 NOR Flash DDR3 Ctlr 64b 800+ MHz Security Engine DAP x5 (1 2 S/TDM)
16 WHAT IS NEEDED? Accelerated image & vision processing Tegra K1: CPU, GPU, ISP Camera flexibility and handling of new sensor types Android HAL V3, V4L, Camera API Low latency routing of image streams to GPU EGLStreams Integrated handling of various sensors and camera Global Time Stamps Effective image & vision programming frameworks VisionWorks
17 CAMERA IMAGE PROCESSING Camera ISP (Image Signal Processor) Little or no programmability Data flows thru compact hardware pipe Scan-line-based - no global memory Best perf/watt ~760 math Ops ~42K vals = 670Kb 300MHz ~250Gops
18 VISUAL SENSOR REVOLUTION Single RGB sensors just the start of mobile visual revolution IR sensors LEAP Motion, eye-trackers Active illumination depth sensors TOF and structured light Multi-sensors: Stereo pairs -> Plenoptic array -> Depth cameras Stereo pair can enable object scaling and enhanced depth extraction Plenoptic Field processing needs FFTs and ray-casting Hybrid visual sensing solutions Different sensors mixed for different distances and lighting conditions Dual Camera LG Electronics Plenoptic Array Pelican imaging Capri Structured Light 3D Camera PrimeSense
19 ANDROID CAMERA HAL V3 Camera HAL v1 focused on simplifying basic camera apps Difficult or impossible to do much else New features require proprietary driver extensions Extensions not portable - restricted growth of third party app ecosystem Camera HAL v3 is a fundamentally different API Flexible primitives for building sophisticated use-cases Interface is clean and easily extensible Apps can have more control, and more responsibility Enables sophisticated camera applications Faster time to market and higher quality
20 KHRONOS CAMERA API Specification available 2015 Also FCAM-Based Will be available for any OS FCAM running on NVIDIA Linux today No global state State travels with image requests Every stage in the pipeline may have different state Enables fast, deterministic state changes Synchronize devices Lens, flash, sound capture, gyro Devices can schedule Actions E.g. to be triggered on exposure change
21 KHRONOS CAMERA API REQUIREMENTS Application control over ISP processing (including 3A) Including multiple, re-entrant ISPs Control multiple sensors with synch and alignment E.g. Stereo pairs, Plenoptic arrays, TOF/structured light depth cameras Enhanced per frame detailed control Format flexibility, Region of Interest (ROI) selection Global timing & synchronization E.g. Between cameras and MEMS sensors Flexible processing/streaming Multiple input and output streams RAW, Bayer or YUV Processing Streaming of rows (not just frames) Enable new camera functionality not available on current platforms and align with future platform directions for easy adoption
22 TEGRA K1 CAMERA DATAFLOW Flexible routing of sensor data to ISPs and unified memory ISP-A Enables sensor processing by any combination of GPUs, CPUs and ISPs Camera One Camera Two Camera Routing Unified Memory CPU CPU CPU CPU Kepler GPGPU ISP-B
23 COMPUTATIONAL PHOTOGRAPHY Camera ISP (Image Signal Processor) Little or no programmability CPU GPU Single processor or Neon SIMD - running fast Makes heavy use of general memory Non-optimal performance and power Programmable and flexible Many way parallelism - run at lower frequency Efficient image caching close to processors BUT cycles frames in and out of memory ~760 math Ops ~42K vals = 670Kb 300MHz ~250Gops
24 WHAT IS NEEDED? Accelerated image & vision processing Tegra K1: CPU, GPU, ISP Camera flexibility and handling of new sensor types Android HAL V3, V4L Low latency routing of image streams to GPU EGLStreams Integrated handling of various sensors and camera Global Time Stamps Effective image & vision programming frameworks VisionWorks
25 EGL 1.5 RELEASED EGL 1.5 brings functionality from multiple extensions into core Increased reliability and portability EGLImages Sharing textures and renderbuffers Context Robustness Defending against malicious code EGLSync objects Improved OpenGL /OpenCL interop Platform extensions Standardized interactions for multiple OS e.g. Android and 64-bit platforms srgb colorspace rendering API Interop EGL provides efficient transfer of data and events between Khronos APIs Applications OS and Display Platforms Application Portability EGL abstracts graphics context management, surface and buffer binding and rendering synchronization
26 WHAT IS NEEDED? Accelerated image & vision processing Tegra K1: CPU, GPU, ISP Camera flexibility and handling of new sensor types Android HAL V3, V4L Low latency routing of image streams to GPU EGLStreams Integrated handling of various sensors and camera Global Time Stamps Effective image & vision programming frameworks VisionWorks
27 HOW MANY SENSORS ARE IN A SMARTPHONE? Light Proximity 2 cameras 3 microphones Touch Position GPS WiFi (fingerprint) Cellular (tri-lateration) NFC, Bluetooth (beacons) Accelerometer Magnetometer Gyroscope Pressure Temperature Humidity 19 27
28 STREAMINPUT SENSOR ABSTRACTION API Apps request semantic sensor information StreamInput defines possible requests, e.g. Read Physical or Virtual Sensors e.g. Game Quaternion Context detection e.g. Am I in an elevator? Apps Need Sophisticated Access to Sensor Data Without coding to specific sensor hardware Sensor Discoverability Sensor Code Portability Advanced Sensors Everywhere Multi-axis motion/position, quaternions, context-awareness, gestures, activity monitoring, health and environmental sensors StreamInput processing graph provides optimized sensor data stream High-value, smart sensor fusion middleware can connect to apps in a portable way Apps can gain magical situational awareness
29 WHAT IS NEEDED? Accelerated image & vision processing Tegra K1: CPU, GPU, ISP Camera flexibility and handling of new sensor types Android HAL V3, V4L Low latency routing of image streams to GPU EGLStreams Integrated handling of various sensors and camera Global Time Stamps Effective image & vision programming frameworks VisionWorks
30 MOBILE & EMBEDDED DEVELOPERS NEED HELP! Control, coordinate and synchronize a diverse array of mobile sensors Handle a diverse selection of emerging depth camera technologies Write maintainable code for a heterogeneous mix of CPUs, GPUs and DSPs Write code that is deployable across multiple devices, platforms and OS Leverage dedicated vision hardware for minimized power Create fluid 60Hz experiences on batterypowered mobile devices
31 DESKTOP TO MOBILE DEVELOPMENT Image & Vision Processing code development starts at desktop PC Tegra K1 enables easy migration through libraries and tools available across platforms
32 TEGRA K1 CUDA DEVELOPMENT CUDA-Aware Editor Automated CPU to GPU code refactoring Semantic highlighting of CUDA code Integrated code samples & docs Nsight Debugger Simultaneously debug of CPU and GPU Inspect variables across CUDA threads Use breakpoints & single-step debugging Nsight Profiler Quickly identifies performance issues Integrated expert system Source line correlation Cross platform development Native memcheck, GDB, nvprof
33 CUDA LIBRARIES VisionWorks NPP OpenCV CUFFT CUBLAS CUDA Math Lib
34 NPP LIBRARY (CUDA) Data exchange & initialization Set, Convert, CopyConstBorder, Copy, Transpose, SwapChannels Arithmetic & Logical Ops Add, Sub, Mul, Div, AbsDiff Threshold & Compare Threshold, Compare Color Conversion JPEG RGB To YCbCr (& vice versa), ColorTwist, LUT_Linear DCTQuantInv/Fwd, QuantizationTable Functions FilterBox, Row, Column, Max, Min, Median, Dilate, Erode, SumWindowColumn/Row Geometry Transforms Mirror, WarpAffine / Back/ Quad, WarpPerspective / Back / Quad, Resize Statistics Mean, StdDev, NormDiff, MinMax, Histogram, SqrIntegral, RectStdDev Computer Vision ApplyHaarClassifier, GraphCuts
35 OPENCV LIBRARY Initially developed by Intel for single-core x86 CPUs Version >900 functions (x the datatypes) OpenCV4Tegra - Accelerated CUDA+NEON+GLSL+TBB multithreading OpenCV Image processing General Image Processing Segmentation Machine Learning, Detection Image Pyramids Transforms Fitting Video, Stereo, and 3D Camera Calibration Features Depth Maps Optical Flow Inpainting Tracking
36 OPENCV-GPU VALUE ADD ON LOGAN Speedup 7 6 Jetson Kepler GPU /Quadcore A15 Public OCV for Mobile/Embedded core filter imgproc objdetect Average speedup for different function categories with Logan GPU compared to public source code.
37 VISIONWORKS MOTIVATION + = Advanced Silicon VisionWorks Simplify vision programming Fully optimized and accelerated Modular and Extensible Widespread vision processing in embedded, mobile and automotive devices and applications
38 VISIONWORKS Power Efficient Computer Vision Powered with CUDA Supported on Tegra K1 Linux and Android ACCELERATING Advanced Driver Assistance Computational Photography Augmented Reality Robotics Deep Learning and more Version 0.10 is available for registered partners! ADAS Advanced Driver Assistance Systems Augmented Reality Computational Photography Robotics
39 VISIONWORKS SOFTWARE STACK Application Code Applications use combination of direct primitives, the OpenVX framework and supplied pipelines NVIDIA provides sample pipelines for common use cases Object Detection Sample Pipelines SfM 3rd Party Pipelines NVIDIA supplied vision primitives using CUDA and Tegra processing resources Classifier VisionWorks Primitives Corner Detection 3 rd Party Primitives Customers and developers can create their own primitives e.g. using CUDA CUDA Framework OpenVX enables power efficient and flexible chaining of primitives Tegra K1
40 OPENVX POWER EFFICIENT VISION ACCELERATION Khronos, open, cross-vendor vision API Focus on mobile and embedded systems Application Foundational API for vision acceleration Useful for middleware or by applications Enables diverse efficient implementations OpenCV OpenCV open source library VisionWorks Sample Pipelines Complementary to OpenCV Which is great for prototyping Open source sample implementation Hardware vendor implementations
41 OPENVX GRAPHS THE KEY TO EFFICIENCY Directed graphs for processing power and efficiency Each Node can be implemented in software or accelerated hardware Nodes may be fused to eliminate memory transfers Processing can be tiled to keep data entirely in local memory/cache EGLStreams route data from camera and to application Can extend with VisionWorks nodes using CUDA Native Camera Control OpenVX Node VisionWorks Node OpenVX Node VisionWorks Node Application Example OpenVX Graph
42 OPENVX AND OPENCV ARE COMPLEMENTARY Governance Community driven open source with no formal specification Defined and implemented by Khronos Portability APIs can vary depending on processor Tegra K1 mobile platforms Scope Efficiency Very wide 1000s of imaging and vision functions Multiple camera APIs/interfaces Memory-based architecture Each operation reads and writes memory Tight focus on hardware accelerated functions for mobile vision Use external camera API Graph-based execution Optimizable computation, data transfer Use Case Rapid experimentation Production development & deployment
43 OPENVX 1.0 FUNCTION OVERVIEW Core data structures Images and Image Pyramids Processing Graphs, Kernels, Parameters Image Processing Arithmetic, Logical, and statistical operations Multichannel Color and BitDepth Extraction and Conversion 2D Filtering and Morphological operations Image Resizing and Warping Core Computer Vision Pyramid computation Integral Image computation Feature Extraction and Tracking Histogram Computation and Equalization Canny Edge Detection Harris and FAST Corner detection Sparse Optical Flow Widely used extensions adopted into future versions of the core OpenVX Specification Evolution OpenVX 1.0 defines framework for creating, managing and executing graphs Focused set of widely used functions that are readily accelerated Implementers can add functions as extensions
44 VISIONWORKS PRIMITIVES JAN 2014 Sobel Convolve Bilateral Filter Integral Image Integral Histogram Corner Harris Corner FAST Image Pyramid Optical Flow PyrLK Optical Flow Farneback Warp Perspective Hough Lines Fast NLM Denoising Stereo Block Matching IME (Iterative Motion Estimation) HOG (Histogram of Oriented Gradients) Soft Cascade Detector Object Tracker TLD Object Tracker SLAM Path Estimator MedianFlow Estimator
45 OPENVX AND CUDA ARE COMPLEMENTARY Use Case Architecture Target Hardware Precision Ease of Use GPGPU Programming Language-based Exposed architected memory model programmer manages memory Full IEEE floating point mandated General-purpose math and other libraries Domain targeted Vision processing Library-based - no separate compiler required Abstracted node and memory model - diverse implementations can be optimized for power and performance Minimal floating point requirements optimized for vision operators Fully implemented vision operators and framework out of the box Use CUDA to build new VisionWorks OpenVX Nodes
46 VISIONWORKS SAMPLE PIPELINES (V0.10) Structure From Motion/SLAM Pedestrian Detection Vehicle detection Object tracking Dense optical flow Active Shape Model Denoising
47 VISIONWORKS LOOKING FORWARD Enable multi-camera applications 3D sensors Conformance with OpenVX once specification finalized
48 TEGRA K1 DEVELOPMENT PLATFORMS Coming to Android K1 & Other Linux Devices soon.. JETSON X3 (TK1 PRO) gige, usb3.0, HDMI, CANBUS running Vibrante Linux AUTOMOTIVE GRADE
49 QUESTIONS?
50 BACKUP
INTEGRATING COMPUTER VISION SENSOR INNOVATIONS INTO MOBILE DEVICES. Eli Savransky Principal Architect - CTO Office Mobile BU NVIDIA corp.
INTEGRATING COMPUTER VISION SENSOR INNOVATIONS INTO MOBILE DEVICES Eli Savransky Principal Architect - CTO Office Mobile BU NVIDIA corp. Computer Vision in Mobile Tegra K1 It s time! AGENDA Use cases categories
More informationVision Acceleration. Launch Briefing October Neil Trevett Vice President Mobile Ecosystem, NVIDIA President, Khronos Group
Copyright Khronos Group 2014 - Page 1 Vision Acceleration Launch Briefing October 2014 Neil Trevett Vice President Mobile Ecosystem, NVIDIA President, Khronos Group Copyright Khronos Group 2014 - Page
More informationTEGRA K1 AND THE AUTOMOTIVE INDUSTRY. Gernot Ziegler, Timo Stich
TEGRA K1 AND THE AUTOMOTIVE INDUSTRY Gernot Ziegler, Timo Stich Previously: Tegra in Automotive Infotainment / Navigation Digital Instrument Cluster Passenger Entertainment TEGRA K1 with Kepler GPU GPU:
More informationOpen API Standards for Mobile Graphics, Compute and Vision Processing GTC, March 2014
Open API Standards for Mobile Graphics, Compute and Vision Processing GTC, March 2014 Neil Trevett Vice President Mobile Ecosystem, NVIDIA President Khronos Copyright Khronos Group 2014 - Page 1 Khronos
More informationOpen Standard APIs for Augmented Reality
Copyright Khronos Group 2014 - Page 1 Open Standard APIs for Augmented Reality Neil Trevett Vice President Mobile Ecosystem, NVIDIA President, Khronos Group Copyright Khronos Group 2014 - Page 2 Khronos
More informationOpen Standard APIs for Embedded Vision Processing
Copyright Khronos Group 2014 - Page 1 Open Standard APIs for Embedded Vision Processing Neil Trevett Vice President Mobile Ecosystem, NVIDIA President, Khronos Group Copyright Khronos Group 2014 - Page
More informationTHE LEADER IN VISUAL COMPUTING
MOBILE EMBEDDED THE LEADER IN VISUAL COMPUTING 2 TAKING OUR VISION TO REALITY HPC DESIGN and VISUALIZATION AUTO GAMING 3 BEST DEVELOPER EXPERIENCE Tools for Fast Development Debug and Performance Tuning
More informationApril 4-7, 2016 Silicon Valley VISIONWORKS A CUDA ACCELERATED COMPUTER VISION LIBRARY S6783. Elif Albuz, April 4, 2016
April 4-7, 2016 Silicon Valley VISIONWORKS A CUDA ACCELERATED COMPUTER VISION LIBRARY S6783 Elif Albuz, April 4, 2016 Motivation Introduction to VisionWorks AGENDA VisionWorks Software Stack VisionWorks
More informationWebGL Meetup GDC Copyright Khronos Group, Page 1
WebGL Meetup GDC 2012 Copyright Khronos Group, 2012 - Page 1 Copyright Khronos Group, 2012 - Page 2 Khronos API Ecosystem Trends Neil Trevett Vice President Mobile Content, NVIDIA President, The Khronos
More informationMobile AR Hardware Futures
Copyright Khronos Group, 2010 - Page 1 Mobile AR Hardware Futures Neil Trevett Vice President Mobile Content, NVIDIA President, The Khronos Group Two Perspectives NVIDIA - Tegra 2 mobile processor Khronos
More informationTEGRA K1 による GPU コンピューティング
TEGRA K1 による GPU コンピューティング COMPUTE WITH TEGRA K1 馬路徹 シニア ソリューション アーキテクト NVIDIA AGENDA Introducing Tegra K1 Tegra K1 Compute Software Capabilities OpenGL GLSL OpenCL CUDA/Unified Memory Google Renderscript
More informationKhronos and the Mobile Ecosystem
Copyright Khronos Group, 2011 - Page 1 Khronos and the Mobile Ecosystem Neil Trevett VP Mobile Content, NVIDIA President, Khronos Copyright Khronos Group, 2011 - Page 2 Topics It s not just about individual
More informationAR Standards Update Austin, March 2012
AR Standards Update Austin, March 2012 Neil Trevett President, The Khronos Group Vice President Mobile Content, NVIDIA Copyright Khronos Group, 2012 - Page 1 Topics Very brief overview of Khronos Update
More informationAdvanced Imaging Applications on Smart-phones Convergence of General-purpose computing, Graphics acceleration, and Sensors
Advanced Imaging Applications on Smart-phones Convergence of General-purpose computing, Graphics acceleration, and Sensors Sriram Sethuraman Technologist & DMTS, Ittiam 1 Overview Imaging on Smart-phones
More informationGTC 2013 March San Jose, CA The Smartest People. The Best Ideas. The Biggest Opportunities. Opportunities for Participation:
GTC 2013 March 18-21 San Jose, CA The Smartest People. The Best Ideas. The Biggest Opportunities. Opportunities for Participation: SPEAK - Showcase your work among the elite of graphics computing - Call
More informationEnabling a Richer Multimedia Experience with GPU Compute. Roberto Mijat Visual Computing Marketing Manager
Enabling a Richer Multimedia Experience with GPU Compute Roberto Mijat Visual Computing Marketing Manager 1 What is GPU Compute Operating System and most application processing continue to reside on the
More informationThe OpenVX Computer Vision and Neural Network Inference
The OpenVX Computer and Neural Network Inference Standard for Portable, Efficient Code Radhakrishna Giduthuri Editor, OpenVX Khronos Group radha.giduthuri@amd.com @RadhaGiduthuri Copyright 2018 Khronos
More informationCopyright Khronos Group, Page 1. Khronos Overview. Taiwan, February 2012
Copyright Khronos Group, 2012 - Page 1 Khronos Overview Taiwan, February 2012 Copyright Khronos Group, 2012 - Page 2 Khronos - Connecting Software to Silicon Creating open, royalty-free API standards -
More informationOpen Standards for Today s Gaming Industry
Copyright Khronos Group 2013 - Page 1 Open Standards for Today s Gaming Industry Erik Noreke VP of Business Development, Khronos Group Gaming Evolution Copyright Khronos Group 2013 - Page 3 DESKTOP MOBILE
More informationOpenCV on Zynq: Accelerating 4k60 Dense Optical Flow and Stereo Vision. Kamran Khan, Product Manager, Software Acceleration and Libraries July 2017
OpenCV on Zynq: Accelerating 4k60 Dense Optical Flow and Stereo Vision Kamran Khan, Product Manager, Software Acceleration and Libraries July 2017 Agenda Why Zynq SoCs for Traditional Computer Vision Automated
More informationThe Benefits of GPU Compute on ARM Mali GPUs
The Benefits of GPU Compute on ARM Mali GPUs Tim Hartley 1 SEMICON Europa 2014 ARM Introduction World leading semiconductor IP Founded in 1990 1060 processor licenses sold to more than 350 companies >
More informationNavigating the Vision API Jungle: Which API Should You Use and Why? Embedded Vision Summit, May 2015
Copyright Khronos Group 2015 - Page 1 Navigating the Vision API Jungle: Which API Should You Use and Why? Embedded Vision Summit, May 2015 Neil Trevett Khronos President NVIDIA Vice President Mobile Ecosystem
More informationCopyright Khronos Group Page 1. Vulkan Overview. June 2015
Copyright Khronos Group 2015 - Page 1 Vulkan Overview June 2015 Copyright Khronos Group 2015 - Page 2 Khronos Connects Software to Silicon Open Consortium creating OPEN STANDARD APIs for hardware acceleration
More informationNext Generation Visual Computing
Next Generation Visual Computing (Making GPU Computing a Reality with Mali ) Taipei, 18 June 2013 Roberto Mijat ARM Addressing Computational Challenges Trends Growing display sizes and resolutions Increasing
More informationTesla GPU Computing A Revolution in High Performance Computing
Tesla GPU Computing A Revolution in High Performance Computing Gernot Ziegler, Developer Technology (Compute) (Material by Thomas Bradley) Agenda Tesla GPU Computing CUDA Fermi What is GPU Computing? Introduction
More informationTR An Overview of NVIDIA Tegra K1 Architecture. Ang Li, Radu Serban, Dan Negrut
TR-2014-17 An Overview of NVIDIA Tegra K1 Architecture Ang Li, Radu Serban, Dan Negrut November 20, 2014 Abstract This paperwork gives an overview of NVIDIA s Jetson TK1 Development Kit and its Tegra K1
More informationCopyright Khronos Group Page 1
Gaming Market Briefing Overview of APIs GDC March 2016 Neil Trevett Khronos President NVIDIA Vice President Developer Ecosystem ntrevett@nvidia.com @neilt3d Copyright Khronos Group 2016 - Page 1 Copyright
More informationAcceleration Standards for Mobile Augmented Reality
Acceleration Standards for Mobile Augmented Reality Neil Trevett Khronos President Vice President Mobile Content, NVIDIA November 2012 Copyright Khronos Group 2012 Page 1 Copyright Khronos Group 2012 Page
More informationKhronos Overview The State of the Art in Open Standards for Visual Computing
Khronos Overview The State of the Art in Open Standards for Visual Computing Neil Trevett Khronos President Vice President Mobile Content, NVIDIA Copyright Khronos Group 2013 - Page 1 Copyright Khronos
More informationDeep Learning: Transforming Engineering and Science The MathWorks, Inc.
Deep Learning: Transforming Engineering and Science 1 2015 The MathWorks, Inc. DEEP LEARNING: TRANSFORMING ENGINEERING AND SCIENCE A THE NEW RISE ERA OF OF GPU COMPUTING 3 NVIDIA A IS NEW THE WORLD S ERA
More informationStandards for Vision Processing and Neural Networks
Copyright Khronos Group 2017 - Page 1 Standards for Vision Processing and Neural Networks Radhakrishna Giduthuri, AMD radha.giduthuri@ieee.org Agenda Why we need a standard? Khronos NNEF Khronos OpenVX
More informationGPGPU, 1st Meeting Mordechai Butrashvily, CEO GASS
GPGPU, 1st Meeting Mordechai Butrashvily, CEO GASS Agenda Forming a GPGPU WG 1 st meeting Future meetings Activities Forming a GPGPU WG To raise needs and enhance information sharing A platform for knowledge
More informationCS427 Multicore Architecture and Parallel Computing
CS427 Multicore Architecture and Parallel Computing Lecture 6 GPU Architecture Li Jiang 2014/10/9 1 GPU Scaling A quiet revolution and potential build-up Calculation: 936 GFLOPS vs. 102 GFLOPS Memory Bandwidth:
More informationCSE 591/392: GPU Programming. Introduction. Klaus Mueller. Computer Science Department Stony Brook University
CSE 591/392: GPU Programming Introduction Klaus Mueller Computer Science Department Stony Brook University First: A Big Word of Thanks! to the millions of computer game enthusiasts worldwide Who demand
More informationFusing Sensors into Mobile Operating Systems & Innovative Use Cases
Fusing Sensors into Mobile Operating Systems & Innovative Use Cases May 23, 2012 Tristan Joo (tristanjoo@wca.org) Board Director & Co-Chair of Mobile SIG Wireless Communications Alliance, Independent Executive
More informationWebGL, WebCL and Beyond!
Copyright Khronos Group, 2011 - Page 1 WebGL, WebCL and Beyond! Neil Trevett Vice President Mobile Content, NVIDIA President, The Khronos Group Copyright Khronos Group, 2011 - Page 2 Topics in this Session
More informationA176 Cyclone. GPGPU Fanless Small FF RediBuilt Supercomputer. IT and Instrumentation for industry. Aitech I/O
The A176 Cyclone is the smallest and most powerful Rugged-GPGPU, ideally suited for distributed systems. Its 256 CUDA cores reach 1 TFLOPS, and it consumes less than 17W at full load (8-10W at typical
More informationImage Contrast Adjustment using Nvidia Performance Primitives (NPP) Yang Song
Image Contrast Adjustment using Nvidia Performance Primitives (NPP) Yang Song Outline NPP Introduction Problem Statement Solution and Hands-on Coding Epilogue What is NPP? A library of image, signal and
More informationUpdate on Khronos Open Standard APIs for Vision Processing Neil Trevett Khronos President NVIDIA Vice President Mobile Ecosystem
Update on Khronos Open Standard APIs for Vision Processing Neil Trevett Khronos President NVIDIA Vice President Mobile Ecosystem Copyright Khronos Group 2015 - Page 1 Copyright Khronos Group 2015 - Page
More informationMultimedia in Mobile Phones. Architectures and Trends Lund
Multimedia in Mobile Phones Architectures and Trends Lund 091124 Presentation Henrik Ohlsson Contact: henrik.h.ohlsson@stericsson.com Working with multimedia hardware (graphics and displays) at ST- Ericsson
More informationEfficient Video Processing on Embedded GPU
Efficient Video Processing on Embedded GPU Tobias Kammacher Armin Weiss Matthias Frei Institute of Embedded Systems High Performance Multimedia Research Group Zurich University of Applied Sciences (ZHAW)
More informationMultimedia SoC System Solutions
Multimedia SoC System Solutions Presented By Yashu Gosain & Forrest Picket: System Software & SoC Solutions Marketing Girish Malipeddi: IP Subsystems Marketing Agenda Zynq Ultrascale+ MPSoC and Multimedia
More informationNext Generation OpenGL Neil Trevett Khronos President NVIDIA VP Mobile Copyright Khronos Group Page 1
Next Generation OpenGL Neil Trevett Khronos President NVIDIA VP Mobile Ecosystem @neilt3d Copyright Khronos Group 2015 - Page 1 Copyright Khronos Group 2015 - Page 2 Khronos Connects Software to Silicon
More informationEGLSTREAMS: INTEROPERABILITY FOR CAMERA, CUDA AND OPENGL. Debalina Bhattacharjee Sharan Ashwathnarayan
53023 - EGLSTREAMS: INTEROPERABILITY FOR CAMERA, CUDA AND OPENGL Debalina Bhattacharjee Sharan Ashwathnarayan Tegra SOC and typical use-cases Why Interops EGLStream and Its Key Features Agenda Examples
More informationOur Technology Expertise for Software Engineering Services. AceThought Services Your Partner in Innovation
Our Technology Expertise for Software Engineering Services High Performance Computing MultiCore CPU AceThought experts will re-design your sequential algorithms or applications to execute in parallel by
More informationGeneral Purpose GPU Computing in Partial Wave Analysis
JLAB at 12 GeV - INT General Purpose GPU Computing in Partial Wave Analysis Hrayr Matevosyan - NTC, Indiana University November 18/2009 COmputationAL Challenges IN PWA Rapid Increase in Available Data
More informationCopyright Khronos Group Page 1
Open Standards and Open Source Together How Khronos APIs Accelerate Fast and Cool Applications Neil Trevett Khronos President NVIDIA Vice President Mobile Ecosystem Copyright Khronos Group 2015 - Page
More informationTesla GPU Computing A Revolution in High Performance Computing
Tesla GPU Computing A Revolution in High Performance Computing Mark Harris, NVIDIA Agenda Tesla GPU Computing CUDA Fermi What is GPU Computing? Introduction to Tesla CUDA Architecture Programming & Memory
More informationSIGGRAPH Briefing August 2014
Copyright Khronos Group 2014 - Page 1 SIGGRAPH Briefing August 2014 Neil Trevett VP Mobile Ecosystem, NVIDIA President, Khronos Copyright Khronos Group 2014 - Page 2 Significant Khronos API Ecosystem Advances
More informationAccelerating Vision Processing
Accelerating Vision Processing Neil Trevett Vice President Mobile Ecosystem at NVIDIA President of Khronos and Chair of the OpenCL Working Group SIGGRAPH, July 2016 Copyright Khronos Group 2016 - Page
More informationCSE 591: GPU Programming. Introduction. Entertainment Graphics: Virtual Realism for the Masses. Computer games need to have: Klaus Mueller
Entertainment Graphics: Virtual Realism for the Masses CSE 591: GPU Programming Introduction Computer games need to have: realistic appearance of characters and objects believable and creative shading,
More informationSC24/WG9 Liaison Meeting
Copyright Khronos Group, 2011 - Page 1 SC24/WG9 Liaison Meeting Seoul, November 2011 Neil Trevett Vice President Mobile Content, NVIDIA President, The Khronos Group Copyright Khronos Group, 2011 - Page
More informationEcosystem Overview Neil Trevett Khronos President NVIDIA Vice President Developer
Ecosystem Overview Neil Trevett Khronos President NVIDIA Vice President Developer Ecosystem ntrevett@nvidia.com @neilt3d Copyright Khronos Group 2016 - Page 1 Khronos Mission Software Silicon Khronos is
More informationApril 4-7, 2016 Silicon Valley
April 4-7, 2016 Silicon Valley TEGRA PLATFORMS GAMING DRONES ROBOTICS IVA AUTOMOTIVE 2 Compile Debug Profile Trace C/C++ NVTX NVIDIA Tools extension Getting Started CodeWorks JetPack Installers IDE Integration
More informationTIOVX TI s OpenVX Implementation
TIOVX TI s OpenVX Implementation Aish Dubey Product Marketing, Automotive Processors Embedded Vision Summit, 3 May 2017 1 TI SOC platform heterogeneous cores High level processing Object detection and
More informationUnleashing the benefits of GPU Computing with ARM Mali TM Practical applications and use-cases. Steve Steele, ARM
Unleashing the benefits of GPU Computing with ARM Mali TM Practical applications and use-cases Steve Steele, ARM 1 Today s Computational Challenges Trends Growing display sizes and resolutions, richer
More informationEmerging Vision Technologies: Enabling a New Era of Intelligent Devices
Emerging Vision Technologies: Enabling a New Era of Intelligent Devices Computer vision overview Computer vision is being integrated in our daily lives Acquiring, processing, and understanding visual data
More informationNVIDIA CUDA Libraries
NVIDIA CUDA Libraries Ujval Kapasi*, Elif Albuz*, Philippe Vandermersch*, Nathan Whitehead*, Frank Jargstorff* San Jose Convention Center Sept 22, 2010 *NVIDIA NVIDIA CUDA Libraries Applications 3 rd Party
More informationGPGPUs in HPC. VILLE TIMONEN Åbo Akademi University CSC
GPGPUs in HPC VILLE TIMONEN Åbo Akademi University 2.11.2010 @ CSC Content Background How do GPUs pull off higher throughput Typical architecture Current situation & the future GPGPU languages A tale of
More informationNVIDIA DESIGNWORKS Ankit Patel - Prerna Dogra -
NVIDIA DESIGNWORKS Ankit Patel - ankitp@nvidia.com Prerna Dogra - pdogra@nvidia.com 1 Autonomous Driving Deep Learning Visual Effects Virtual Desktops Visual Computing is our singular mission Gaming Product
More informationSoftware Driven Verification at SoC Level. Perspec System Verifier Overview
Software Driven Verification at SoC Level Perspec System Verifier Overview June 2015 IP to SoC hardware/software integration and verification flows Cadence methodology and focus Applications (Basic to
More informationCompleting the Multimedia Architecture
Copyright Khronos Group, 2011 - Page 1 Completing the Multimedia Architecture Erik Noreke Chair of OpenSL ES Working Group Chair of OpenMAX AL Working Group Copyright Khronos Group, 2011 - Page 2 Today
More informationIntroduction to the Tegra SoC Family and the ARM Architecture. Kristoffer Robin Stokke, PhD FLIR UAS
Introduction to the Tegra SoC Family and the ARM Architecture Kristoffer Robin Stokke, PhD FLIR UAS Goals of Lecture To give you something concrete to start on Simple introduction to ARMv8 NEON programming
More informationOpen Standards for AR and VR Neil Trevett Khronos President NVIDIA VP Developer January 2018
Copyright Khronos Group 2018 - Page 1 Open Standards for AR and Neil Trevett Khronos President NVIDIA VP Developer Ecosystem ntrevett@nvidia.com @neilt3d January 2018 Khronos Mission E.g. OpenGL ES provides
More informationCUDA on ARM Update. Developing Accelerated Applications on ARM. Bas Aarts and Donald Becker
CUDA on ARM Update Developing Accelerated Applications on ARM Bas Aarts and Donald Becker CUDA on ARM: a forward-looking development platform for high performance, energy efficient hybrid computing It
More informationOverview and AR/VR Roadmap
Khronos Group Inc. 2018 - Page 1 Overview and AR/ Roadmap Neil Trevett Khronos President NVIDIA VP Developer Ecosystems ntrevett@nvidia.com @neilt3d Khronos Group Inc. 2018 - Page 2 Khronos Connects Software
More informationProfiling and Debugging OpenCL Applications with ARM Development Tools. October 2014
Profiling and Debugging OpenCL Applications with ARM Development Tools October 2014 1 Agenda 1. Introduction to GPU Compute 2. ARM Development Solutions 3. Mali GPU Architecture 4. Using ARM DS-5 Streamline
More informationEnable AI on Mobile Devices
Enable AI on Mobile Devices Scott Wang 王舒翀 Senior Segment Manager Mobile, BSG ARM Tech Forum 2017 14 th June 2017, Shenzhen AI is moving from core to edge Ubiquitous AI Safe and autonomous Mixed reality
More informationAccelerating Realism with the (NVIDIA Scene Graph)
Accelerating Realism with the (NVIDIA Scene Graph) Holger Kunz Manager, Workstation Middleware Development Phillip Miller Director, Workstation Middleware Product Management NVIDIA application acceleration
More informationGPGPU on Mobile Devices
GPGPU on Mobile Devices Introduction Addressing GPGPU for very mobile devices Tablets Smartphones Introduction Why dedicated GPUs in mobile devices? Gaming Physics simulation for realistic effects 3D-GUI
More informationThe Path to Embedded Vision & AI using a Low Power Vision DSP. Yair Siegel, Director of Segment Marketing Hotchips August 2016
The Path to Embedded Vision & AI using a Low Power Vision DSP Yair Siegel, Director of Segment Marketing Hotchips August 2016 Presentation Outline Introduction The Need for Embedded Vision & AI Vision
More informationHPC with Multicore and GPUs
HPC with Multicore and GPUs Stan Tomov Electrical Engineering and Computer Science Department University of Tennessee, Knoxville COSC 594 Lecture Notes March 22, 2017 1/20 Outline Introduction - Hardware
More informationCS8803SC Software and Hardware Cooperative Computing GPGPU. Prof. Hyesoon Kim School of Computer Science Georgia Institute of Technology
CS8803SC Software and Hardware Cooperative Computing GPGPU Prof. Hyesoon Kim School of Computer Science Georgia Institute of Technology Why GPU? A quiet revolution and potential build-up Calculation: 367
More informationNVIDIA AI BRAIN OF SELF DRIVING AND HD MAPPING. September 13, 2016
NVIDIA AI BRAIN OF SELF DRIVING AND HD MAPPING September 13, 2016 AI FOR AUTONOMOUS DRIVING MAPPING KALDI LOCALIZATION DRIVENET Training on DGX-1 NVIDIA DGX-1 NVIDIA DRIVE PX 2 Driving with DriveWorks
More informationAdding Advanced Shader Features and Handling Fragmentation
Copyright Khronos Group, 2010 - Page 1 Adding Advanced Shader Features and Handling Fragmentation How to enable your application on a wide range of devices Imagination Technologies Copyright Khronos Group,
More informationNvidia Jetson TX2 and its Software Toolset. João Fernandes 2017/2018
Nvidia Jetson TX2 and its Software Toolset João Fernandes 2017/2018 In this presentation Nvidia Jetson TX2: Hardware Nvidia Jetson TX2: Software Machine Learning: Neural Networks Convolutional Neural Networks
More informationEmbedded Computing without Compromise. Evolution of the Rugged GPGPU Computer Session: SIL7127 Dan Mor PLM -Aitech Systems GTC Israel 2017
Evolution of the Rugged GPGPU Computer Session: SIL7127 Dan Mor PLM - Systems GTC Israel 2017 Agenda Current GPGPU systems NVIDIA Jetson TX1 and TX2 evaluation Conclusions New Products 2 GPGPU Product
More informationTechnology for a better society. hetcomp.com
Technology for a better society hetcomp.com 1 J. Seland, C. Dyken, T. R. Hagen, A. R. Brodtkorb, J. Hjelmervik,E Bjønnes GPU Computing USIT Course Week 16th November 2011 hetcomp.com 2 9:30 10:15 Introduction
More informationArnon Peleg Visual Computing Products Management
Optimizing Visual Computing Applications on 3 rd Generation Intel Core Processor Family Intel Visual Computing Tools Helping Make Your Game & Media Applications Faster, Faster Arnon Peleg Visual Computing
More informationHEXIWEAR COMPLETE IOT DEVELOPMENT SOLUTION
HEXIWEAR COMPLETE IOT DEVELOPMENT SOLUTION NXP SEMICONDUCTORS PUBLIC THE ONLY SUPPLIER TO PROVIDE COMPLETE IoT SOLUTIONS DSPs, MCUs & CPUs Suite of Sensors NFC, BLE, Thread, zigbee, sub-ghz Wireless Interconnects
More informationExploring System Coherency and Maximizing Performance of Mobile Memory Systems
Exploring System Coherency and Maximizing Performance of Mobile Memory Systems Shanghai: William Orme, Strategic Marketing Manager of SSG Beijing & Shenzhen: Mayank Sharma, Product Manager of SSG ARM Tech
More informationOverview. Think Silicon is a privately held company founded in 2007 by the core team of Atmel MMC IC group
Nema An OpenGL & OpenCL Embedded Programmable Engine Georgios Keramidas & Iakovos Stamoulis Think Silicon mobile GRAPHICS Overview Think Silicon is a privately held company founded in 2007 by the core
More informationSimplify System Complexity
Simplify System Complexity With the new high-performance CompactRIO controller Fanie Coetzer Field Sales Engineer Northern South Africa 2 3 New control system CompactPCI MMI/Sequencing/Logging FieldPoint
More informationOpen Standards for Vision and AI Peter McGuinness NNEF WG Chair CEO, Highwai, Inc May 2018
Copyright Khronos Group 2018 - Page 1 Open Standards for Vision and AI Peter McGuinness NNEF WG Chair CEO, Highwai, Inc peter.mcguinness@gobrach.com May 2018 Khronos Mission E.g. OpenGL ES provides 3D
More informationGPGPU on ARM. Tom Gall, Gil Pitney, 30 th Oct 2013
GPGPU on ARM Tom Gall, Gil Pitney, 30 th Oct 2013 Session Description This session will discuss the current state of the art of GPGPU technologies on ARM SoC systems. What standards are there? Where are
More information3D Graphics in Future Mobile Devices. Steve Steele, ARM
3D Graphics in Future Mobile Devices Steve Steele, ARM Market Trends Mobile Computing Market Growth Volume in millions Mobile Computing Market Trends 1600 Smart Mobile Device Shipments (Smartphones and
More informationAdhocracy Innovation with Imaging technology. Socionext Inc. Hiroyuki Komori July 5th, 2017
Adhocracy Innovation with Imaging technology Socionext Inc. Hiroyuki Komori July 5th, 2017 Video Steaming in the world Video uploading : 65,000 clip/min, 300 hour/min on YouTube Video viewing : billions
More informationDesigning GPU-accelerated applications with RTMaps (Real-Time Multisensor Applications) Framework and NVIDIA DriveWorks
MUNICH OCT 10-12, 2017 Designing GPU-accelerated applications with RTMaps (Real-Time Multisensor Applications) Framework and NVIDIA DriveWorks Xavier Rouah Lead Software Engineer Brief introduction about
More informationThe State of Gaming APIs
Copyright Khronos Group, 2011 - Page 1 The State of Gaming APIs Neil Trevett Vice President Mobile Content, NVIDIA President, The Khronos Group Copyright Khronos Group, 2011 - Page 2 State of Gaming APIs
More informationRenderscript Accelerated Advanced Image and Video Processing on ARM Mali T-600 GPUs. Lihua Zhang, Ph.D. MulticoreWare Inc.
Renderscript Accelerated Advanced Image and Video Processing on ARM Mali T-600 GPUs Lihua Zhang, Ph.D. MulticoreWare Inc. lihua@multicorewareinc.com Overview More & more mobile apps are beginning to require
More informationS CUDA on Xavier
S8868 - CUDA on Xavier Anshuman Bhat CUDA Product Manager Saikat Dasadhikari CUDA Engineering 29 th March 2018 1 CUDA ECOSYSTEM 2018 CUDA DOWNLOADS IN 2017 3,500,000 CUDA REGISTERED DEVELOPERS 800,000
More informationAdvances in GPU-based Image Processing and Computer Vision. James Fung
Advances in GPU-based Image Processing and Computer Vision James Fung Talk Objectives Introduce GPU hardware and features available for imaging & vision Present new algorithm mappings on the GPU GPU vision
More informationmbed OS Update Sam Grove Technical Lead, mbed OS June 2017 ARM 2017
mbed OS Update Sam Grove Technical Lead, mbed OS June 2017 ARM mbed: Connecting chip to cloud Device software Device services Third-party cloud services IoT device application mbed Cloud Update IoT cloud
More informationWindowing System on a 3D Pipeline. February 2005
Windowing System on a 3D Pipeline February 2005 Agenda 1.Overview of the 3D pipeline 2.NVIDIA software overview 3.Strengths and challenges with using the 3D pipeline GeForce 6800 220M Transistors April
More informationThe rcuda middleware and applications
The rcuda middleware and applications Will my application work with rcuda? rcuda currently provides binary compatibility with CUDA 5.0, virtualizing the entire Runtime API except for the graphics functions,
More informationAutonomous Driving Solutions
Autonomous Driving Solutions Oct, 2017 DrivePX2 & DriveWorks Marcus Oh (moh@nvidia.com) Sr. Solution Architect, NVIDIA This work is licensed under a Creative Commons Attribution-Share Alike 4.0 (CC BY-SA
More informationGraphics and Imaging Architectures
Graphics and Imaging Architectures Kayvon Fatahalian http://www.cs.cmu.edu/afs/cs/academic/class/15869-f11/www/ About Kayvon New faculty, just arrived from Stanford Dissertation: Evolving real-time graphics
More informationNVIDIA GTX200: TeraFLOPS Visual Computing. August 26, 2008 John Tynefield
NVIDIA GTX200: TeraFLOPS Visual Computing August 26, 2008 John Tynefield 2 Outline Execution Model Architecture Demo 3 Execution Model 4 Software Architecture Applications DX10 OpenGL OpenCL CUDA C Host
More informationGPGPU Applications. for Hydrological and Atmospheric Simulations. and Visualizations on the Web. Ibrahim Demir
GPGPU Applications for Hydrological and Atmospheric Simulations and Visualizations on the Web Ibrahim Demir Big Data We are collecting and generating data on a petabyte scale (1Pb = 1,000 Tb = 1M Gb) Data
More informationThe Changing Face of Edge Compute
The Changing Face of Edge Compute 2018 Arm Limited Alvin Yang Nov 2018 Market trends acceleration of technology deployment 26 years 4 years 100 billion chips shipped 100 billion chips shipped 1 Trillion
More information