Next Generation Visual Computing
Transcription
1 Next Generation Visual Computing (Making GPU Computing a Reality with Mali). Taipei, 18 June 2013. Roberto Mijat, ARM
2 Addressing Computational Challenges. Trends: growing display sizes and resolutions; increasing computational power and novel applications; persistent user expectation of an improved experience. Limitations: limited and restricted energy and thermal budgets; in mobile, processing power is greatly outgrowing battery capacity; traditional scaling solutions are not sustainable. Necessities: increase the computational efficiency of processing platforms; make use of heterogeneous and parallel computing; leverage new technologies such as GPU Compute.
3 Complementary Compute Architectures. Note: characteristics of generic CPUs and GPUs.
4 Heterogeneous Computing. CPU: operating system and most application processing. GPU: programmable through C-like languages and APIs; cost effective, efficient, great floating-point performance. (Diagram: control logic, ALUs, caches, RAM.) GPU used as a computational accelerator or companion processor: 2D/3D graphics; advanced image processing; accelerate/complement ISP functionality; offload video codec blocks; accelerate physics computation.
5 Benefits of GPU Computing. Performance: faster computation; offload and acceleration of non-graphical applications. Energy efficiency: free up CPU resources by offloading to the GPU; better load balance across system resources; increased system efficiency by using the best processor for the job. Cost reduction: reduced cost through hardware consolidation and software flexibility; simpler interface to parallel programming through modern APIs. Improved user experience: remove computational barriers; enable new use cases and applications.
6 Adoption of Mobile GPU Compute. OpenCL Full Profile Khronos-conformant GPUs in mobile SoCs; GPU Compute capable devices start shipping; OEMs and SiPs evaluating leading GPU Compute solutions; gradual roll-out of GPU Compute APIs in mobile/embedded platforms; Android RenderScript computation first enabled on GPU.
7 Adoption of Mobile GPU Compute. First public demonstrations of GPU Compute; mobile benchmarks; ISVs and OEMs start porting/optimizing libraries and key use-case functionality using GPU Compute; Computational Photography and Advanced Imaging GPU acceleration; codec vendors develop GPU Compute enabled HEVC decoders; exploration by mainstream developers.
8 Adoption of Mobile GPU Compute. Mainstream support for GPU Computing in mobile and embedded; GPU Compute widely available and utilized by developers/libraries; introduction of GPUs implementing HSA features and full system coherency; hardware consolidation and software cost reduction through migration of selected ISP/DSP functionality to the GPU; new use cases, innovation.
9 OPENCL
10 OpenCL Overview. OpenCL is: a framework to enable general-purpose parallel computing; a computing language portable across heterogeneous processing platforms; an API to define and control the platforms; a royalty-free open standard, interoperable with existing APIs. OpenCL enables easier, better programming of heterogeneous parallel compute systems, and unleashes the general-purpose computational power of GPUs needed by emerging workloads. OpenCL and the OpenCL logo are trademarks of Apple Inc.
11 OpenCL Programming Model. Application: optimize performance-critical code. Program: the kernel is executed over each element of the N-dimensional index space (NDRange). Kernel: OpenCL kernel or native kernel. Runtime compiler: can use static compilation; binaries are cached. Kernel object: can be built to target any supported device. Execute command. Work-item: an instance of a kernel executing on a point in the index space. Work-group: a collection of work-items.
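The index-space terms on this slide can be illustrated with a small model. The sketch below is plain Python and does not call any OpenCL runtime; the function names (`enumerate_ndrange`, `run_kernel`, `scale_kernel`) and the sizes are hypothetical, chosen only to show how a 2-D NDRange splits into work-groups and work-items. It assumes the global size is a multiple of the work-group size in each dimension.

```python
# Pure-Python model of an OpenCL-style 2-D index space (NDRange).
# Nothing here talks to a GPU; all names are illustrative assumptions.
from itertools import product


def enumerate_ndrange(global_size, local_size):
    """Yield (global_id, group_id, local_id) for every work-item."""
    if any(g % l for g, l in zip(global_size, local_size)):
        raise ValueError("global size must be a multiple of local size")
    for gid in product(*(range(g) for g in global_size)):
        group_id = tuple(g // l for g, l in zip(gid, local_size))
        local_id = tuple(g % l for g, l in zip(gid, local_size))
        yield gid, group_id, local_id


def run_kernel(kernel, global_size, local_size, *args):
    """Call `kernel` once per work-item.

    The loop is sequential; a real device would run work-items in parallel.
    """
    for gid, _group_id, _local_id in enumerate_ndrange(global_size, local_size):
        kernel(gid, *args)


def scale_kernel(gid, src, dst, factor):
    """Hypothetical kernel body: one output element per work-item."""
    x, y = gid
    dst[y][x] = src[y][x] * factor


width, height = 8, 4
local = (4, 2)  # work-group size in each dimension
src = [[x + y * width for x in range(width)] for y in range(height)]
dst = [[0] * width for _ in range(height)]
run_kernel(scale_kernel, (width, height), local, src, dst, 2)

# The 8x4 index space with 4x2 work-groups contains 2 * 2 work-groups.
groups = {group for _gid, group, _lid in enumerate_ndrange((width, height), local)}
print(len(groups), "work-groups,", width * height, "work-items")
```

Each work-item's global ID equals its group ID times the work-group size plus its local ID, which is the relationship the model's integer division and remainder encode.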
12 The ARM OpenCL Implementation. Implements the latest version of the standard; implements Full Profile and supports 64-bit; optimized for interoperability with the existing Mali software stack; optimized for interoperability between CPU and GPU; architected for Cache Coherent Interconnect support; extensible design.
13 With Full Profile you know what you get. Full Profile defines the baseline set of features for OpenCL. Embedded Profile defines a subset of the specification: designed to enable OpenCL on less capable devices; makes a large set of features optional, restricting developers; reduces the precision of floating-point maths.
Key feature: Embedded Profile / Full Profile
FP32 precision: Relaxed / IEEE-754
Built-in atomic operations: Optional / Supported
64-bit integer: Optional / Supported
Online compiler: Optional / Supported
3D image writes: Optional / Supported
Linear interpolation for floating point images: Optional / Supported
Size of buffers and memory: Limited / Supported
Image data type requirements: Reduced / Supported
14 RENDERSCRIPT
15 Introduction to RenderScript. Compute framework and API for Android, officially introduced in Honeycomb. Cross-platform control-slave architecture, with runtime compilation. Its graphics engine component has been deprecated since Jelly Bean. Complements existing APIs by adding: a compute API for parallel processing, similar to OpenCL; a scripting language based on C99 that supports vector data types. Designed for portability, performance and usability: on-device JIT compilation and dynamic thread launch; native code optimization to maximize the performance of critical algorithms. Mali-T604 is the first GPU to support RenderScript.
16 How RenderScript works. (Diagram: Java app and reflected layer; RenderScript script compiled by llvm-rs-cc to portable bitcode; online compilation on the device by libbcc to machine code, alongside the Dalvik JIT; librs; executable running on the ARM Compute System: Cortex CPU + Mali GPU + AMBA 4.)
17 DESIGNED FOR GPU COMPUTE
18 Mali-T600: Designed for GPU Compute. Comprehensive support for general-purpose data types: 8/16/32/64-bit signed/unsigned integer; FP16, FP32, FP64; 2, 3, 4, 8 and 16 wide vectors; 2D/3D images. Floating-point precision and performance: full IEEE compliance; 100s of GFLOPs performance for non-graphical workloads; sustainable and proven performance for real-life workloads.
19 Mali-T600: Designed for GPU Compute. Hardware acceleration: most common mathematical functions implemented in hardware; >70% coverage within newest industry APIs; most operations compute in one cycle. Optimal memory throughput and latency: optimized for stream and generic load/store operations; tight integration with the system using the latest AMBA interfaces; leverages new Cache Coherent Interconnect technologies. Task management implemented in hardware: optimal automatic distribution of compute workloads; optimal dynamic power management; efficient use of processing resources.
20 GPU Compute on Mali: here today! Passed Khronos conformance: the only OpenCL 1.1 Full Profile implementation on Linux and Android outside of the console and desktop space. Proven in silicon: Samsung Exynos 5 Dual, implements Full Profile; OpenCL and RenderScript DDKs available now. Mali-T600 shipping in real products: Google Chromebook, Google Nexus 10, InSignal Arndale Community Board. API exposed for developers: RenderScript on Android for Nexus 10.
21 USE CASES. Examples of the benefits of GPU Compute from the real world.
22 Example use cases for GPU Computing. Mobile: computational photography; physics in games; moving and still image real-time stabilization; information extraction (object detection, classification and tracking); imaging (correction, improvement, consolidation); content and context understanding; HDR; augmented reality. DTV/STB: 2D to 3D conversion; super resolution; pre- and post-processing; camera-based UI; transcoding; information extraction and superimposition. Automotive: lane detection; smart headlight; road sign recognition; night vision; object classification; pedestrian, vehicle and collision detection; dynamic cruise control. 100s of GFLOPs of efficient processing power: improve existing use cases, enable next-generation use cases.
23 Advanced Image Processing. RenderScript is the official heterogeneous compute API for Android. Since Android 4.2 (Jelly Bean) it has been enabled to target the GPU. Complex image filters can be greatly accelerated by GPU Compute.
Filter: speed-up [1]
MotionBlur: 3.5x
Cloud: 4.2x
Labyrinth: 3.8x
TitleReflection: 7.3x
WhirlPinch: 3.6x
Wave: 7.0x
Bicubic: 15.4x
Image size: 2560x1920.
[1] Acceleration compares RenderScript compiled on device (LLVM) on dual-core Cortex-A15 and Mali-T604 on a stock Google Nexus 10.
24 Video Processing APK. Proprietary transcoding/processing pipeline; image filters implemented using RenderScript; optimized for ARM + Mali-T600 GPU Compute.
Filter: FPS (GPU+CPU vs CPU only), speed-up
Deshake (720p): 28 vs 8 FPS, 3.5x
Upscaling (720p to 1080p): 20 vs 3 FPS, 6.7x
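The speed-up column on this slide is the ratio of the two frame rates. As a small worked check, assuming the figures are frames per second for the same clip and settings (the slide does not state the measurement conditions), the ratios can be reproduced as follows; `speed_up` is a hypothetical helper name.

```python
# Reproduce the speed-up column from the frame rates quoted on the slide.
# Assumption: both numbers in each pair are FPS measured on the same input.

def speed_up(accelerated_fps, baseline_fps):
    """Ratio of accelerated throughput to baseline throughput."""
    if baseline_fps <= 0:
        raise ValueError("baseline FPS must be positive")
    return accelerated_fps / baseline_fps


# (GPU+CPU FPS, CPU-only FPS) as listed on the slide
results = {
    "Deshake (720p)": (28, 8),
    "Upscaling (720p to 1080p)": (20, 3),
}

for name, (gpu_cpu_fps, cpu_only_fps) in results.items():
    ratio = speed_up(gpu_cpu_fps, cpu_only_fps)
    frame_time_ms = 1000.0 / gpu_cpu_fps  # time budget per frame with GPU+CPU
    print(f"{name}: {ratio:.1f}x speed-up, {frame_time_ms:.1f} ms per frame")
```

Rounded to one decimal place, the ratios match the 3.5x and 6.7x quoted on the slide.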
25 GPU Compute accelerated superscaling. Accelerated using RenderScript on Google Nexus 10 (Mali-T604).
26 Next Generation Multimedia Codecs. High Efficiency Video Coding (HEVC): latest video compression standard, ratified by ITU in Jan 2013; improved video quality and double the data compression of H.264; can support up to 8K UHD. ARM is collaborating with multiple codec vendors: ensuring the widest availability of HEVC across multiple ARM platforms; enabling HEVC early, in software, through NEON and GPU Compute; flexibility of software solutions is critical as HEVC rolls out.
27 Why GPU Compute for HEVC. High-resolution HEVC decoding maximises CPU load. GPUs are traditionally idle during video playback. GPU architecture suits acceleration of parallel codec blocks. Offloading computation to the GPU frees up the CPU to perform other (system) tasks. Combining CPU (NEON) and GPU Compute enables the most efficient HEVC decode. Mali GPUs are well suited for video acceleration, with significant power/performance benefits (Ittiam Systems).
28 Physics (Cloth Simulation)
29 ISP Pipeline Offload to GPU (OpenCL). Entire ISP pipeline offloaded to the GPU using OpenCL. More flexibility: sensor and camera module vendors can invest in optimized, portable software libraries instead of a hardware ISP; SoC implementers can reduce BoM by offloading ISP blocks to the GPU. Mali-T604 demo was previewed at MWC13. (Diagram: raw data from HDR sensor; OpenCL stages: noise reduction, HDR reconstruction, tone mapping, colour conversion, de-noising, gamma correction; rendering with OpenGL ES.)
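The stage chain on this slide can be sketched in a few lines to show why it maps well onto a data-parallel device: most stages touch each pixel independently or read a small neighbourhood. The code below is a minimal single-channel illustration in plain Python, not the demo's implementation. The specific operators are assumptions picked for brevity: a 3-tap mean filter for noise reduction, a Reinhard-style curve for tone mapping, and a gamma of 2.2.

```python
# Minimal single-channel sketch of a software ISP stage chain.
# All operators and constants are illustrative assumptions.

def box_denoise(row):
    """3-tap mean filter along a row, clamping at the edges."""
    n = len(row)
    return [(row[max(i - 1, 0)] + row[i] + row[min(i + 1, n - 1)]) / 3.0
            for i in range(n)]


def tone_map(value):
    """Reinhard-style operator: compresses [0, inf) into [0, 1)."""
    return value / (1.0 + value)


def gamma_correct(value, gamma=2.2):
    """Encode a linear value in [0, 1] with a power-law gamma."""
    return value ** (1.0 / gamma)


def process(image):
    """Run every row through denoise, tone mapping, gamma, 8-bit quantization."""
    output = []
    for row in image:
        row = box_denoise(row)
        row = [gamma_correct(tone_map(v)) for v in row]
        output.append([min(255, int(v * 255 + 0.5)) for v in row])
    return output


# Hypothetical linear HDR sensor values (not taken from the slide)
hdr = [
    [0.0, 0.5, 4.0, 16.0],
    [1.0, 1.0, 1.0, 1.0],
]
ldr = process(hdr)
print(ldr)
```

In a GPU Compute version, the body applied to each pixel would typically become the kernel and the image dimensions the index space, with one work-item per output pixel.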
30 Gesture User Interfaces. eyeSight's gesture recognition technology, using GPU Compute on ARM's Mali-T600, offers unique capabilities: reduction of overall power consumption; reduction of load on the CPU; robust recognition in challenging lighting conditions; enhanced user experience; higher FPS for more gesture capabilities and features.
31 Computer Vision Based Applications. Computer vision entails the acquisition, processing, analysis and understanding of sensor data (images), in order to derive information that enables decisions to be made. In this example: consistent 6x speed-up; ~5x better energy efficiency. Face detection study on Mali-T604 based silicon. (Chart: energy used per unit of work; lower is better.)
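The two ratios on this slide are linked by energy = power x time. As a rough worked example, assuming both figures describe the same face-detection workload and that the energy figure covers the same components in both runs (the slide does not say), the implied average power of the accelerated run relative to the baseline is speed-up divided by energy gain. The helper names below are hypothetical.

```python
# Relate the quoted speed-up and energy-efficiency gain via energy = power * time.
# Assumption: both ratios refer to the same unit of work on the same platform.

def energy(power_watts, time_seconds):
    """Energy in joules for a constant average power over a duration."""
    return power_watts * time_seconds


def implied_power_ratio(speed_up, energy_gain):
    """Average power of the accelerated run divided by the baseline power.

    E_acc / E_base = 1 / energy_gain and t_acc / t_base = 1 / speed_up,
    so P_acc / P_base = (E_acc / E_base) / (t_acc / t_base) = speed_up / energy_gain.
    """
    if speed_up <= 0 or energy_gain <= 0:
        raise ValueError("ratios must be positive")
    return speed_up / energy_gain


ratio = implied_power_ratio(6.0, 5.0)

# Cross-check with a hypothetical baseline of 1 W sustained for 6 s.
baseline_energy = energy(1.0, 6.0)
accelerated_energy = energy(1.0 * ratio, 6.0 / 6.0)
print(f"implied power ratio: {ratio:.2f}")
print(f"energy gain: {baseline_energy / accelerated_energy:.2f}x")
```

Under these assumptions the accelerated run draws somewhat more average power but finishes much sooner, which is how a higher-power configuration can still use less energy per unit of work.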
32 Conclusions. Improve energy efficiency through heterogeneous computing: use the best processor for the task; balance workload across system resources; offload heavy parallel computation to the GPU. Bring the benefits of GPU Compute to key use cases: Computational Photography and Advanced Imaging; next generation of multimedia codecs; Computer Vision applications. The Mali Ecosystem is making GPU Compute a reality.
Unleashing the benefits of GPU Computing with ARM Mali TM Practical applications and use-cases. Steve Steele, ARM
Unleashing the benefits of GPU Computing with ARM Mali TM Practical applications and use-cases Steve Steele, ARM 1 Today s Computational Challenges Trends Growing display sizes and resolutions, richer
More informationEnabling a Richer Multimedia Experience with GPU Compute. Roberto Mijat Visual Computing Marketing Manager
Enabling a Richer Multimedia Experience with GPU Compute Roberto Mijat Visual Computing Marketing Manager 1 What is GPU Compute Operating System and most application processing continue to reside on the
More informationTake GPU Processing Power Beyond Graphics with Mali GPU Computing
Take GPU Processing Power Beyond Graphics with Mali GPU Computing Roberto Mijat Visual Computing Marketing Manager August 2012 Introduction Modern processor and SoC architectures endorse parallelism as
More informationThe Benefits of GPU Compute on ARM Mali GPUs
The Benefits of GPU Compute on ARM Mali GPUs Tim Hartley 1 SEMICON Europa 2014 ARM Introduction World leading semiconductor IP Founded in 1990 1060 processor licenses sold to more than 350 companies >
More informationIntegrating CPU and GPU, The ARM Methodology. Edvard Sørgård, Senior Principal Graphics Architect, ARM Ian Rickards, Senior Product Manager, ARM
Integrating CPU and GPU, The ARM Methodology Edvard Sørgård, Senior Principal Graphics Architect, ARM Ian Rickards, Senior Product Manager, ARM The ARM Business Model Global leader in the development of
More informationRenderscript Accelerated Advanced Image and Video Processing on ARM Mali T-600 GPUs. Lihua Zhang, Ph.D. MulticoreWare Inc.
Renderscript Accelerated Advanced Image and Video Processing on ARM Mali T-600 GPUs Lihua Zhang, Ph.D. MulticoreWare Inc. lihua@multicorewareinc.com Overview More & more mobile apps are beginning to require
More informationProfiling and Debugging OpenCL Applications with ARM Development Tools. October 2014
Profiling and Debugging OpenCL Applications with ARM Development Tools October 2014 1 Agenda 1. Introduction to GPU Compute 2. ARM Development Solutions 3. Mali GPU Architecture 4. Using ARM DS-5 Streamline
More informationExploring System Coherency and Maximizing Performance of Mobile Memory Systems
Exploring System Coherency and Maximizing Performance of Mobile Memory Systems Shanghai: William Orme, Strategic Marketing Manager of SSG Beijing & Shenzhen: Mayank Sharma, Product Manager of SSG ARM Tech
More informationSIGGRAPH Briefing August 2014
Copyright Khronos Group 2014 - Page 1 SIGGRAPH Briefing August 2014 Neil Trevett VP Mobile Ecosystem, NVIDIA President, Khronos Copyright Khronos Group 2014 - Page 2 Significant Khronos API Ecosystem Advances
More informationNext Generation OpenGL Neil Trevett Khronos President NVIDIA VP Mobile Copyright Khronos Group Page 1
Next Generation OpenGL Neil Trevett Khronos President NVIDIA VP Mobile Ecosystem @neilt3d Copyright Khronos Group 2015 - Page 1 Copyright Khronos Group 2015 - Page 2 Khronos Connects Software to Silicon
More information3D Graphics in Future Mobile Devices. Steve Steele, ARM
3D Graphics in Future Mobile Devices Steve Steele, ARM Market Trends Mobile Computing Market Growth Volume in millions Mobile Computing Market Trends 1600 Smart Mobile Device Shipments (Smartphones and
More informationCopyright Khronos Group Page 1. Vulkan Overview. June 2015
Copyright Khronos Group 2015 - Page 1 Vulkan Overview June 2015 Copyright Khronos Group 2015 - Page 2 Khronos Connects Software to Silicon Open Consortium creating OPEN STANDARD APIs for hardware acceleration
More informationBuilding High Performance, Power Efficient Cortex and Mali systems with ARM CoreLink. Robert Kaye
Building High Performance, Power Efficient Cortex and Mali systems with ARM CoreLink Robert Kaye 1 Agenda Once upon a time ARM designed systems Compute trends Bringing it all together with CoreLink 400
More informationGPGPU on ARM. Tom Gall, Gil Pitney, 30 th Oct 2013
GPGPU on ARM Tom Gall, Gil Pitney, 30 th Oct 2013 Session Description This session will discuss the current state of the art of GPGPU technologies on ARM SoC systems. What standards are there? Where are
More informationHETEROGENEOUS SYSTEM ARCHITECTURE: PLATFORM FOR THE FUTURE
HETEROGENEOUS SYSTEM ARCHITECTURE: PLATFORM FOR THE FUTURE Haibo Xie, Ph.D. Chief HSA Evangelist AMD China OUTLINE: The Challenges with Computing Today Introducing Heterogeneous System Architecture (HSA)
More informationWebGL Meetup GDC Copyright Khronos Group, Page 1
WebGL Meetup GDC 2012 Copyright Khronos Group, 2012 - Page 1 Copyright Khronos Group, 2012 - Page 2 Khronos API Ecosystem Trends Neil Trevett Vice President Mobile Content, NVIDIA President, The Khronos
More informationBifrost - The GPU architecture for next five billion
Bifrost - The GPU architecture for next five billion Hessed Choi Senior FAE / ARM ARM Tech Forum June 28 th, 2016 Vulkan 2 ARM 2016 What is Vulkan? A 3D graphics API for the next twenty years Logical successor
More informationCopyright Khronos Group, Page 1. Khronos Overview. Taiwan, February 2012
Copyright Khronos Group, 2012 - Page 1 Khronos Overview Taiwan, February 2012 Copyright Khronos Group, 2012 - Page 2 Khronos - Connecting Software to Silicon Creating open, royalty-free API standards -
More informationMobile Graphics Ecosystem. Tom Olson OpenGL ES working group chair
OpenGL ES in the Mobile Graphics Ecosystem Tom Olson OpenGL ES working group chair Director, Graphics Research, ARM Ltd 1 Outline Why Mobile Graphics? OpenGL ES Overview Getting Started with OpenGL ES
More informationMobile AR Hardware Futures
Copyright Khronos Group, 2010 - Page 1 Mobile AR Hardware Futures Neil Trevett Vice President Mobile Content, NVIDIA President, The Khronos Group Two Perspectives NVIDIA - Tegra 2 mobile processor Khronos
More informationProfiling and Debugging Games on Mobile Platforms
Profiling and Debugging Games on Mobile Platforms Lorenzo Dal Col Senior Software Engineer, Graphics Tools Gamelab 2013, Barcelona 26 th June 2013 Agenda Introduction to Performance Analysis with ARM DS-5
More informationPress Briefing SIGGRAPH 2015 Neil Trevett Khronos President NVIDIA Vice President Mobile Ecosystem. Copyright Khronos Group Page 1
Press Briefing SIGGRAPH 2015 Neil Trevett Khronos President NVIDIA Vice President Mobile Ecosystem Copyright Khronos Group 2015 - Page 1 Khronos Connects Software to Silicon Open Consortium creating ROYALTY-FREE,
More informationBringing it all together: The challenge in delivering a complete graphics system architecture. Chris Porthouse
Bringing it all together: The challenge in delivering a complete graphics system architecture Chris Porthouse System Integration & the role of standards Content Ecosystem Java Execution Environment Native
More informationVulkan 1.1 March Copyright Khronos Group Page 1
Vulkan 1.1 March 2018 Copyright Khronos Group 2018 - Page 1 Vulkan 1.1 Launch and Ongoing Momentum Strengthening the Ecosystem Improved developer tools (SDK, validation/debug layers) More rigorous conformance
More informationAR Standards Update Austin, March 2012
AR Standards Update Austin, March 2012 Neil Trevett President, The Khronos Group Vice President Mobile Content, NVIDIA Copyright Khronos Group, 2012 - Page 1 Topics Very brief overview of Khronos Update
More informationMultimedia in Mobile Phones. Architectures and Trends Lund
Multimedia in Mobile Phones Architectures and Trends Lund 091124 Presentation Henrik Ohlsson Contact: henrik.h.ohlsson@stericsson.com Working with multimedia hardware (graphics and displays) at ST- Ericsson
More informationPress Briefing SIGGRAPH 2015 Neil Trevett Khronos President NVIDIA Vice President Mobile Ecosystem. Copyright Khronos Group Page 1
Press Briefing SIGGRAPH 2015 Neil Trevett Khronos President NVIDIA Vice President Mobile Ecosystem Copyright Khronos Group 2015 - Page 1 Khronos Connects Software to Silicon Open Consortium creating ROYALTY-FREE,
More informationKhronos and the Mobile Ecosystem
Copyright Khronos Group, 2011 - Page 1 Khronos and the Mobile Ecosystem Neil Trevett VP Mobile Content, NVIDIA President, Khronos Copyright Khronos Group, 2011 - Page 2 Topics It s not just about individual
More informationCLICK TO EDIT MASTER TITLE STYLE. Click to edit Master text styles. Second level Third level Fourth level Fifth level
CLICK TO EDIT MASTER TITLE STYLE Second level THE HETEROGENEOUS SYSTEM ARCHITECTURE ITS (NOT) ALL ABOUT THE GPU PAUL BLINZER, FELLOW, HSA SYSTEM SOFTWARE, AMD SYSTEM ARCHITECTURE WORKGROUP CHAIR, HSA FOUNDATION
More informationMali Developer Resources. Kevin Ho ARM Taiwan FAE
Mali Developer Resources Kevin Ho ARM Taiwan FAE ARM Mali Developer Tools Software Development SDKs for OpenGL ES & OpenCL OpenGL ES Emulators Shader Development Studio Shader Library Asset Creation Texture
More informationCopyright Khronos Group Page 1
Open Standards and Open Source Together How Khronos APIs Accelerate Fast and Cool Applications Neil Trevett Khronos President NVIDIA Vice President Mobile Ecosystem Copyright Khronos Group 2015 - Page
More informationNavigating the Vision API Jungle: Which API Should You Use and Why? Embedded Vision Summit, May 2015
Copyright Khronos Group 2015 - Page 1 Navigating the Vision API Jungle: Which API Should You Use and Why? Embedded Vision Summit, May 2015 Neil Trevett Khronos President NVIDIA Vice President Mobile Ecosystem
More informationOpen Standards for Vision and AI Peter McGuinness NNEF WG Chair CEO, Highwai, Inc May 2018
Copyright Khronos Group 2018 - Page 1 Open Standards for Vision and AI Peter McGuinness NNEF WG Chair CEO, Highwai, Inc peter.mcguinness@gobrach.com May 2018 Khronos Mission E.g. OpenGL ES provides 3D
More informationKhronos Connects Software to Silicon
Press Pre-Briefing GDC 2015 Neil Trevett Khronos President NVIDIA Vice President Mobile Ecosystem All Materials Embargoed Until Tuesday 3 rd March, 12:01AM Pacific Time Copyright Khronos Group 2015 - Page
More informationARM Mobile GPU Compute Accelerates UX Differentiation
ARM Mobile GPU Compute Accelerates UX Differentiation GPU Compute Enables Engaging Mobile User Experiences (UX) Executive Summary Users continue to demand more from their mobile devices and many mobile
More informationThe Mobile Advantage. Erik Noreke Independent Standardization Consultant Chair, OpenSL ES. Copyright Khronos Group, Page 1
& The Mobile Advantage Erik Noreke Independent Standardization Consultant Chair, OpenSL ES Copyright Khronos Group, 2010 - Page 1 Focused on Creating an Open Market Open Standards Open Markets - Developed
More informationModern Processor Architectures. L25: Modern Compiler Design
Modern Processor Architectures L25: Modern Compiler Design The 1960s - 1970s Instructions took multiple cycles Only one instruction in flight at once Optimisation meant minimising the number of instructions
More informationARM Multimedia IP: working together to drive down system power and bandwidth
ARM Multimedia IP: working together to drive down system power and bandwidth Speaker: Robert Kong ARM China FAE Author: Sean Ellis ARM Architect 1 Agenda System power overview Bandwidth, bandwidth, bandwidth!
More informationCopyright Khronos Group Page 1
Gaming Market Briefing Overview of APIs GDC March 2016 Neil Trevett Khronos President NVIDIA Vice President Developer Ecosystem ntrevett@nvidia.com @neilt3d Copyright Khronos Group 2016 - Page 1 Copyright
More informationAltera SDK for OpenCL
Altera SDK for OpenCL A novel SDK that opens up the world of FPGAs to today s developers Altera Technology Roadshow 2013 Today s News Altera today announces its SDK for OpenCL Altera Joins Khronos Group
More informationOpen Standard APIs for Augmented Reality
Copyright Khronos Group 2014 - Page 1 Open Standard APIs for Augmented Reality Neil Trevett Vice President Mobile Ecosystem, NVIDIA President, Khronos Group Copyright Khronos Group 2014 - Page 2 Khronos
More informationOur Technology Expertise for Software Engineering Services. AceThought Services Your Partner in Innovation
Our Technology Expertise for Software Engineering Services High Performance Computing MultiCore CPU AceThought experts will re-design your sequential algorithms or applications to execute in parallel by
More informationTHE PROGRAMMER S GUIDE TO THE APU GALAXY. Phil Rogers, Corporate Fellow AMD
THE PROGRAMMER S GUIDE TO THE APU GALAXY Phil Rogers, Corporate Fellow AMD THE OPPORTUNITY WE ARE SEIZING Make the unprecedented processing capability of the APU as accessible to programmers as the CPU
More informationBuilding Ultra-Low Power Wearable SoCs
Building Ultra-Low Power Wearable SoCs 1 Wearable noun An item that can be worn adjective Easy to wear, suitable for wearing 2 Wearable Opportunity: Fastest Growing Market Segment Projected Growth from
More informationExpanding Opportunities in Clamshell Devices. Laurence Bryant VP Strategic Marketing
Expanding Opportunities in Clamshell Devices Laurence Bryant VP Strategic Marketing 1 PC Mobile Ecosystem Scaling The Richness Of Small Screen Experiences The smartphone and tablet ecosystem is shaping
More informationMore performance options
More performance options OpenCL, streaming media, and native coding options with INDE April 8, 2014 2014, Intel Corporation. All rights reserved. Intel, the Intel logo, Intel Inside, Intel Xeon, and Intel
More informationSYSTEMS ON CHIP (SOC) FOR EMBEDDED APPLICATIONS
SYSTEMS ON CHIP (SOC) FOR EMBEDDED APPLICATIONS Embedded System System Set of components needed to perform a function Hardware + software +. Embedded Main function not computing Usually not autonomous
More informationOpen API Standards for Mobile Graphics, Compute and Vision Processing GTC, March 2014
Open API Standards for Mobile Graphics, Compute and Vision Processing GTC, March 2014 Neil Trevett Vice President Mobile Ecosystem, NVIDIA President Khronos Copyright Khronos Group 2014 - Page 1 Khronos
More informationEnable AI on Mobile Devices
Enable AI on Mobile Devices Scott Wang 王舒翀 Senior Segment Manager Mobile, BSG ARM Tech Forum 2017 14 th June 2017, Shenzhen AI is moving from core to edge Ubiquitous AI Safe and autonomous Mixed reality
More informationThe OpenVX Computer Vision and Neural Network Inference
The OpenVX Computer and Neural Network Inference Standard for Portable, Efficient Code Radhakrishna Giduthuri Editor, OpenVX Khronos Group radha.giduthuri@amd.com @RadhaGiduthuri Copyright 2018 Khronos
More informationGTC Interaction Simplified. Gesture Recognition Everywhere: Gesture Solutions on Tegra
GTC 2013 Interaction Simplified Gesture Recognition Everywhere: Gesture Solutions on Tegra eyesight at a Glance Touch-free technology providing an enhanced user experience. Easy and intuitive control
More informationThe Bifrost GPU architecture and the ARM Mali-G71 GPU
The Bifrost GPU architecture and the ARM Mali-G71 GPU Jem Davies ARM Fellow and VP of Technology Hot Chips 28 Aug 2016 Introduction to ARM Soft IP ARM licenses Soft IP cores (amongst other things) to our
More informationHardware Accelerated Graphics for High Performance JavaFX Mobile Applications
Hardware Accelerated Graphics for High Performance JavaFX Mobile Applications Pavel Petroshenko, Sun Microsystems Jan Valenta, Sun Microsystems Jerry Evans, Sun Microsystems Goal of this Session Demonstrate
More informationOpen Standards for Building Virtual and Augmented Realities. Neil Trevett Khronos President NVIDIA VP Developer Ecosystems
Open Standards for Building Virtual and Augmented Realities Neil Trevett Khronos President NVIDIA VP Developer Ecosystems Khronos Mission Asian Members Software Silicon Khronos is an International Industry
More informationOverview. Technology Details. D/AVE NX Preliminary Product Brief
Overview D/AVE NX is the latest and most powerful addition to the D/AVE family of rendering cores. It is the first IP to bring full OpenGL ES 2.0/3.1 rendering to the FPGA and SoC world. Targeted for graphics
More informationModern Processor Architectures (A compiler writer s perspective) L25: Modern Compiler Design
Modern Processor Architectures (A compiler writer s perspective) L25: Modern Compiler Design The 1960s - 1970s Instructions took multiple cycles Only one instruction in flight at once Optimisation meant
More informationOverview and AR/VR Roadmap
Khronos Group Inc. 2018 - Page 1 Overview and AR/ Roadmap Neil Trevett Khronos President NVIDIA VP Developer Ecosystems ntrevett@nvidia.com @neilt3d Khronos Group Inc. 2018 - Page 2 Khronos Connects Software
More informationHeterogeneous Computing
Heterogeneous Computing Featured Speaker Ben Sander Senior Fellow Advanced Micro Devices (AMD) DR. DOBB S: GPU AND CPU PROGRAMMING WITH HETEROGENEOUS SYSTEM ARCHITECTURE Ben Sander AMD Senior Fellow APU:
More informationHSA Foundation! Advanced Topics on Heterogeneous System Architectures. Politecnico di Milano! Seminar Room (Bld 20)! 15 December, 2017!
Advanced Topics on Heterogeneous System Architectures HSA Foundation! Politecnico di Milano! Seminar Room (Bld 20)! 15 December, 2017! Antonio R. Miele! Marco D. Santambrogio! Politecnico di Milano! 2
More informationMali GPU acceleration of HEVC and VP9 Decoder
Mali GPU acceleration of HEVC and VP9 Decoder 2 Web Video continues to grow!!! Video accounted for 50% of the mobile traffic in 2012 - Citrix ByteMobile's 4Q 2012 Analytics Report. Globally, IP video traffic
More informationHSA foundation! Advanced Topics on Heterogeneous System Architectures. Politecnico di Milano! Seminar Room A. Alario! 23 November, 2015!
Advanced Topics on Heterogeneous System Architectures HSA foundation! Politecnico di Milano! Seminar Room A. Alario! 23 November, 2015! Antonio R. Miele! Marco D. Santambrogio! Politecnico di Milano! 2
More informationThe LPGPU2 Project. Ben Juurlink, TU Berlin
The LPGPU2 Ben Juurlink, TU Berlin LPGPU2 Consortium LPGPU2 = Low-Power Parallel Processing on GPUs 2 LPGPU2 Objectives 1. To improve the power efficiency of compute and graphics LPGPU2 applications running
More informationOpenCL Press Conference
Copyright Khronos Group, 2011 - Page 1 OpenCL Press Conference Tokyo, November 2011 Neil Trevett Vice President Mobile Content, NVIDIA President, The Khronos Group Copyright Khronos Group, 2011 - Page
More informationAdvanced Imaging Applications on Smart-phones Convergence of General-purpose computing, Graphics acceleration, and Sensors
Advanced Imaging Applications on Smart-phones Convergence of General-purpose computing, Graphics acceleration, and Sensors Sriram Sethuraman Technologist & DMTS, Ittiam 1 Overview Imaging on Smart-phones
More informationDave Shreiner, ARM March 2009
4 th Annual Dave Shreiner, ARM March 2009 Copyright Khronos Group, 2009 - Page 1 Motivation - What s OpenGL ES, and what can it do for me? Overview - Lingo decoder - Overview of the OpenGL ES Pipeline
More informationUpdate on Khronos Open Standard APIs for Vision Processing Neil Trevett Khronos President NVIDIA Vice President Mobile Ecosystem
Update on Khronos Open Standard APIs for Vision Processing Neil Trevett Khronos President NVIDIA Vice President Mobile Ecosystem Copyright Khronos Group 2015 - Page 1 Copyright Khronos Group 2015 - Page
More informationStandards for Vision Processing and Neural Networks
Copyright Khronos Group 2017 - Page 1 Standards for Vision Processing and Neural Networks Radhakrishna Giduthuri, AMD radha.giduthuri@ieee.org Agenda Why we need a standard? Khronos NNEF Khronos OpenVX
More informationUnleash the DSP performance of Arm Cortex processors
Unleash the DSP performance of Arm Cortex processors Arm Tech Symposia 2017 Lionel Belnet Senior Product Manager Agenda Unleash the DSP performance of Cortex processors 1 Introducing Arm Cortex technology
More informationARM processors driving automotive innovation
ARM processors driving automotive innovation Chris Turner Director of advanced technology marketing, CPU group ARM tech forums, Seoul and Taipei June/July 2016 The ultimate intelligent connected device
More informationCopyright Khronos Group 2012 Page 1. OpenCL 1.2. August 2012
Copyright Khronos Group 2012 Page 1 OpenCL 1.2 August 2012 Copyright Khronos Group 2012 Page 2 Khronos - Connecting Software to Silicon Khronos defines open, royalty-free standards to access graphics,
More informationOpen Standards for Today s Gaming Industry
Copyright Khronos Group 2013 - Page 1 Open Standards for Today s Gaming Industry Erik Noreke VP of Business Development, Khronos Group Gaming Evolution Copyright Khronos Group 2013 - Page 3 DESKTOP MOBILE
More informationAccelerating Vision Processing
Accelerating Vision Processing Neil Trevett Vice President Mobile Ecosystem at NVIDIA President of Khronos and Chair of the OpenCL Working Group SIGGRAPH, July 2016 Copyright Khronos Group 2016 - Page
OpenMAX AL, OpenSL ES
Copyright Khronos Group, 2011 - Page 1 OpenMAX AL, OpenSL ES Native Multimedia in Android Erik Noreke Chair of OpenMAX AL and OpenSL ES Working Groups Copyright Khronos Group, 2011 - Page 2 Why Create
IBM Power Systems: Open innovation to put data to work Dexter Henderson Vice President IBM Power Systems
IBM Power Systems: Open innovation to put data to work Dexter Henderson Vice President IBM Power Systems 2014 IBM Corporation Powerful Forces are Changing the Way Business Gets Done Data growing exponentially
Higher compression efficiency, exceptional image quality, faster encoding time and lower costs
Table of Content Introduction... 2 RealMedia HD fast facts... 2 Key benefits of RealMedia HD... 2 1. RealMedia HD... 4 1.1 The evolution of video codecs... 4 1.2 RealMedia Variable Bitrate (RMVB) global
THE LEADER IN VISUAL COMPUTING
MOBILE EMBEDDED THE LEADER IN VISUAL COMPUTING 2 TAKING OUR VISION TO REALITY HPC DESIGN and VISUALIZATION AUTO GAMING 3 BEST DEVELOPER EXPERIENCE Tools for Fast Development Debug and Performance Tuning
MAPPING VIDEO CODECS TO HETEROGENEOUS ARCHITECTURES. Mauricio Alvarez-Mesa Technische Universität Berlin - Spin Digital MULTIPROG 2015
MAPPING VIDEO CODECS TO HETEROGENEOUS ARCHITECTURES Mauricio Alvarez-Mesa Technische Universität Berlin - Spin Digital MULTIPROG 2015 Video Codecs 70% of internet traffic will be video in 2018 [CISCO] Video
.976 REAL CINEMA
PRO 4K Dune HD Pro 4K is a premium compact 4Kp60 HDR media player and Smart TV box with Hi-End video quality, HD audio support, BD3D support, unique hybrid Linux + Android 7.1 software, automatic conversion
Developing the Bifrost GPU architecture for mainstream graphics
Developing the Bifrost GPU architecture for mainstream graphics Anand Patel Senior Product Manager, Media Processing Group ARM Tech Symposia India December 7th 2016 Graphics processing drivers Virtual
The State of Gaming APIs
Copyright Khronos Group, 2011 - Page 1 The State of Gaming APIs Neil Trevett Vice President Mobile Content, NVIDIA President, The Khronos Group Copyright Khronos Group, 2011 - Page 2 State of Gaming APIs
PowerVR GPU IP from Wearables to Servers. Kristof Beets Director of Business Development May 2015
PowerVR GPU IP from Wearables to Servers Kristof Beets Director of Business Development May 2015 www.imgtec.com Expanding embedded GPU market opportunities Huge range of market opportunities equates to
9TH GENERATION INTEL CORE DESKTOP PROCESSORS
PRODUCT BRIEF 9TH GENERATION INTEL CORE DESKTOP PROCESSORS The Most Powerful Generation of Intel Core Processors Introducing the NEW 9th Gen Intel Core desktop processors - the most powerful generation
mbed OS Update Sam Grove Technical Lead, mbed OS June 2017 ARM 2017
mbed OS Update Sam Grove Technical Lead, mbed OS June 2017 ARM mbed: Connecting chip to cloud Device software Device services Third-party cloud services IoT device application mbed Cloud Update IoT cloud
High Quality Real Time Image Processing Framework on Mobile Platforms using Tegra K1. Eyal Hirsch
High Quality Real Time Image Processing Framework on Mobile Platforms using Tegra K1 Eyal Hirsch Established in 2009 and headquartered in Israel SagivTech Snapshot Core domain expertise: GPU Computing
Copyright Khronos Group, Page 1. OpenCL. GDC, March 2010
Copyright Khronos Group, 2011 - Page 1 OpenCL GDC, March 2010 Authoring and accessibility Application Acceleration System Integration Copyright Khronos Group, 2011 - Page 2 Khronos Family of Standards
World's most advanced data center accelerator for PCIe-based servers
NVIDIA TESLA P100 GPU ACCELERATOR World's most advanced data center accelerator for PCIe-based servers HPC data centers need to support the ever-growing demands of scientists and researchers while staying
Simplify System Complexity
Simplify System Complexity With the new high-performance CompactRIO controller Fanie Coetzer Field Sales Engineer Northern South Africa 2 3 New control system CompactPCI MMI/Sequencing/Logging FieldPoint
Higher Level Programming Abstractions for FPGAs using OpenCL
Higher Level Programming Abstractions for FPGAs using OpenCL Desh Singh Supervising Principal Engineer Altera Corporation Toronto Technology Center Technology scaling favors programmability CPUs
Handheld Devices. Kari Pulli. Research Fellow, Nokia Research Center Palo Alto. Material from Jyrki Leskelä, Jarmo Nikula, Mika Salmela
OpenCL in Handheld Devices Kari Pulli Research Fellow, Nokia Research Center Palo Alto Material from Jyrki Leskelä, Jarmo Nikula, Mika Salmela 1 OpenCL 1.0 Embedded Profile Enables OpenCL on mobile and
The Changing Face of Edge Compute
The Changing Face of Edge Compute 2018 Arm Limited Alvin Yang Nov 2018 Market trends acceleration of technology deployment 26 years 4 years 100 billion chips shipped 100 billion chips shipped 1 Trillion
Next Generation Enterprise Solutions from ARM
Next Generation Enterprise Solutions from ARM Ian Forsyth Director Product Marketing Enterprise and Infrastructure Applications Processor Product Line Ian.forsyth@arm.com 1 Enterprise Trends IT is the
ARM instruction sets and CPUs for wide-ranging applications
ARM instruction sets and CPUs for wide-ranging applications Chris Turner Director, CPU technology marketing ARM Tech Forum Taipei July 4th 2017 ARM computing is everywhere #1 shipping GPU in the world
Shrinath Shanbhag Senior Software Engineer Microsoft Corporation
Accelerating GPU inferencing with DirectML and DirectX 12 Shrinath Shanbhag Senior Software Engineer Microsoft Corporation Machine Learning Machine learning has become immensely popular over the last decade
Streaming Media. Advanced Audio. Erik Noreke Standardization Consultant Chair, OpenSL ES. Copyright Khronos Group, Page 1
Streaming Media Advanced Audio Erik Noreke Standardization Consultant Chair, OpenSL ES Copyright Khronos Group, 2010 - Page 1 Today's Consumer Requirements Rich media applications and UI - Consumer decisions
HTML5 Evolution and Development. Matt Spencer UI & Browser Marketing Manager
HTML5 Evolution and Development Matt Spencer UI & Browser Marketing Manager 1 HTML5 Ratified. finally! After 7 years of development, the HTML5 specification was ratified on 28th October 14
OMAP Android Integration
OMAP Android Integration July 1st 2011 David Gottardo, Android Architecture at Texas Instruments France Email: d-gottardo@ti.com
Beyond Hardware IP An overview of Arm development solutions
Beyond Hardware IP An overview of Arm development solutions 2018 Arm Limited Arm Technical Symposia 2018 Advanced first design cost (US$ million) IC design complexity and cost aren't slowing down 542.2
Acceleration Standards for Mobile Augmented Reality
Acceleration Standards for Mobile Augmented Reality Neil Trevett Khronos President Vice President Mobile Content, NVIDIA November 2012 Copyright Khronos Group 2012 Page 1
Simplify System Complexity
1 2 Simplify System Complexity With the new high-performance CompactRIO controller Arun Veeramani Senior Program Manager National Instruments NI CompactRIO The World's Only Software Designed Controller
Growth outside Cell Phone Applications
ARM Introduction Growth outside Cell Phone Applications ~1B units shipped into non-mobile applications Embedded segment now accounts for 13% of ARM shipments Automotive, microcontroller and smartcards