PowerVR Graphics - Latest Developments and Future Plans
|
|
- Paula Bailey
- 5 years ago
- Views:
Transcription
1 PowerVR Graphics - Latest Developments and Future Plans Latest Developments and Future Plans
2 A brief introduction Joe Davis Lead Developer Support Engineer, PowerVR Graphics With Imagination s PowerVR Developer Technology team for ~6 years PowerVR Developer Technology SDK, tools, documentation and developer support/relations (e.g. this session ) 2
3 Company overview
4 About Imagination Multimedia, processors, communications and cloud IP Driving IP innovation with unrivalled portfolio Recognised leader in graphics, GPU compute and video IP #3 design IP company world-wide* Ensigma Communications Processors MIPS General Processors SoC fabric PowerVR Graphics & GPU Compute Processors PowerVR Video Processors PowerVR Vision Processors 4 * source: Gartner
5 About Imagination Our IP plus our partners know-how combine to drive and disrupt Smart Security Wearables Gaming & VR/AR Advanced Automotive Wearables Retail ehealth Smart homes 5
6 About Imagination Business model Licensees OEMs and ODMs Consumers 6
7 About Imagination Our licensees and partners drive our business 7
8 PowerVR Rogue Hardware
9 PowerVR Rogue Recap Tile-based deferred renderer Building on technology proven over 5 previous generations Formally announced at CES 2012 USC - Universal Shading Cluster New scalar SIMD shader core General purpose compute is a first class citizen in the core while not forgetting what makes a shader core great for graphics 9
10 TBDR Tile-based Tile-based Split each render up into small tiles (32x32 for the most part) Bin geometry after vertex shading into those tiles Tile-based rasterisation and pixel shading Keep all data access for pixel shading on chip 10
11 TBDR Deferred Deferred rasterisation Don t actually get the GPU to do any pixel shading straight away HW support for fully deferred rasterisation and then pixel shading Rasterisation is pixel accurate 11
12 TBDR Bandwidth savings Bandwidth savings across all phases of rendering Only fetch the geometry needed for the tile Only process the visible pixels in the tile Efficient processing Maximize available computational resources Do the best the hardware can with bandwidth 12
13 TBDR Power savings Maximizing core efficiency Lighting up the USC less often is always going to be a saving Minimizing bandwidth Texturing less is a fantastic way to save power Geometry fetch and binning is often more than 10% of per-frame bandwidth Saves bandwidth for other parts of your render 13
14 Rogue USC Architectural Building Block Unified Shading Cluster Basic building block of the Rogue architecture Laid out in pairs, with a shared TPU 1, 0.5 and 0.25 USC designs are special Different balance in the design Tend to find their way into non-gaming applications 14
15 Rogue USC Shader Architecture 16-wide in hardware 32-wide branch granularity We run half a task/warp per clock Scalar SIMD Optimized ALU pipeline Mix of F32, F16, integer, floating point specials, logic ops 15
16 Rogue USC Pipeline datapaths Configurable in the IP core F16 paths were sometimes optional, thankfully not any more F16 paths performance increased significantly after the first generation Performance in your shader F32 paths are dual FMAD F16 paths can do different things per cycle depending on shader ISA is available for you to interrogate though, with disassembling compilers 16
17 Rogue USC Scalar Scalar ALUs Hard to understate what a benefit this is Seems obvious to do, right? Vector architectures are just hard to program well Scalar isn t a free lunch Makes performance a lot more predictable for you 17
18 Rogue USC Programmable output registers The pixel output registers in the ISA are read/write One per pixel Width depends on IP core We expose it programmatically with Pixel Local Storage Worked closely with ARM (thanks, Jan-Harald!) 18
19 Evolution Health Warning: Really Bad Diagrams
20 Rogue Evolution Architecture has changed quite a bit over time Rogue in 2010 still mostly looks like a Rogue today Significant evolutionary changes across the architecture Lots of it driven by developers before the IP is baked Lots of it driven by also analysing your stuff anyway 20
21 PowerVR Series6XT Rogue Host CPU Bus Host CPU Interface Vertex Data Master Control and Register Bus Unified Shading Cluster Array USC0 Texture Unit USC1 Tiling Co-Processor Pixel Data Master Coarse Grain Scheduler ASTC* PVRTC Pixel Co-Processor Compute Data Master USCn-1 Texture Unit USCn 2D Core (TLA) System Memory Bus System Memory Interface Core Mgmt Unit Multi-level Memory Cache Unit (MCU) System Memory Bus * Extra low power GFLOPS Supports both LDR and HDR ASTC formats
22 Pipeline Pipeline Pipeline Pipeline Pipeline Pipeline Pipeline Pipeline Pipeline FLOP ALU core (FP32) FLOP PowerVR Series6XT Unified Shading Cluster Array PowerVR Series6XT USC USC Pipeline FLOP ALU core (FP32) FLOP USC FLOP FLOP ALU core (FP16) ALU core (FP16) FLOP FLOP FLOP FLOP ALU core (FP16) ALU core (FP16) FLOP FLOP Special function FLOP 16 pipelines 8 clusters
23 Series6 to Series6XT Lots of lessons learned Improved scheduler Streamlined ISA Improved compute task efficiency Completely new F16 datapath Improved front-end for sustained geometry performance ASTC 23
24 PowerVR Series7XT Host CPU Bus Host CPU Interface Vertex Data Master Control and Register Bus Unified Shading Cluster Array USC Texture Unit USC Tiling Co-Processor Pixel Data Master Coarse Grain Scheduler ASTC LDR + HDR PVRTC Pixel Co-Processor Compute Data Master USC Texture Unit USC Tessellation Co-Processor System Memory Bus System Memory Interface Core Mgmt Unit Multi-level Memory Cache Unit (MCU) System Memory Bus 2D Core (TLA) Extra low power GFLOPS
25 Pipeline Pipeline Pipeline Pipeline Pipeline Pipeline Pipeline Pipeline Pipeline PowerVR Series7XT Unified Shading Cluster Array PowerVR Series7XT USC USC USC FLOP ALU core (FP32) FLOP Pipeline FLOP ALU core (FP32) FLOP FLOP ALU core (FP16) FLOP FLOP ALU core (FP16) FLOP + Special function FLOP FLOP ALU core (FP16) FLOP FLOP ALU core (FP16) FLOP FLOP 16 pipelines ALU core (FP64) FLOP 2-16 clusters Optional
26 Series6XT to Series7XT Adding features and smoothing off rough edges Changed how the architecture scales Improved USC Streamlined ISA Features Hardware tessellation DX11-compliant USC (precision mainly) FP64 26
27 Into the future Exciting changes being worked on across the architecture USC Front-end Scaling Stuff you want! You can help We love feedback about the architecture and how it could best fit what you re doing Don t be shy 27
28 PowerVR Wizard Ray Tracing Update
29 What is Ray Tracing? Ray tracing is the ability for the shader program for one object to be aware of the geometry of other objects. 29
30 PowerVR Architecture PowerVR Series 6XT Host CPU Bus Host CPU Interface Vertex Data Master Pixel Data Master Compute Data Master Coarse Grain Scheduler Control and Register Bus USC Unified Shading Cluster Array USC Shared Texture Unit USC USC Shared Texture Unit System Memory Interface Core Management Unit Multi-level Memory Cache Unit (MCU) System Memory Bus Tiling Coprocessor Pixel Coprocessor 2D Core (TLA)
31 PowerVR Graphics Wizard Architecture PowerVR GR6500 Host CPU Bus Host CPU Interface Vertex Data Master Pixel Data Master Compute Data Master Ray Data Master Coarse Grain Scheduler Control and Register Bus USC Unified Shading Cluster Array USC Shared Texture Unit USC USC Shared Texture Unit System Memory Interface Core Management Unit Multi-level Memory Cache Unit (MCU) System Memory Bus PowerVR Ray Tracing Unit (RTU) Ray Intersection Processor Coherency Engine Scene Hierarchy Generator Tiling Coprocessor Pixel Coprocessor Frame Accumulator Cache 2D Core (TLA)
32 3 Unique Features of Wizard Fixed-function Ray-Box and Ray-Triangle testers Coherence-Driven Task-Forming and Scheduling Streaming Scene Hierarchy Generator 32
33 Fixed-Function Ray-Box and Ray-Triangle Testers 44x Less Area for Box Testing
34 Coherence-Gathering The Coherency Engine lets us process all these rays at the same time
35 Streaming Scene Hierarchy Generator
36 What is Ray Tracing? Ray tracing is the ability for the shader program for one object to be aware of the geometry of other objects. 36
37 Just a few use cases Hybrid Shadows, Reflections, etc. Augmented Reality Production- Order-Independent Quality Renders Transparency Ambient Occlusion Asset creation / compression Global Illumination Physics & Collision Detection Virtual Reality Lens correction, Ultra-low latency rendering, Lenticular Displays A.I. & Line of Sight Calculations Rapid photoquality output 37
38 GRays/s Ray Tracing Requirements Sustained Ray Throughput at 1080p, 60fps Physics / AI / etc. Technique vs Ray throughput In-Engine Lightmap baking Hybrid, Reflections Hybrid, Soft Shadows, 1 light Dynamic AO Interactive GI, (Light Probes) Lens Effects, e.g. DOF, AA, etc. Fully ray traced game 38
39 PowerVR developer tools
40 PowerVR Tools Asset Optimization Development Debugging and Profiling + PVRGeoPOD PVRTexTool + PVRVFrame PVRShaderEditor PVRShaman + PVRTune PVRTrace PVRMonitor 40
41 PowerVR Tools Release schedule PowerVR Tools release process Minor revision roughly every 6 months Recent/upcoming releases 3.5 SDK (April 2015) 4.0 SDK (due September 2015) 41
42 PVRTrace What is PVRTrace? OpenGL ES API tracer OpenGL ES 1.x, 2.0 and 3.x recording libraries GUI for analysis Features Inspect, analyse and playback captured data 42
43 PVRTrace New render state & data inspectors 43
44 PVRTune What is PVRTune? PowerVR graphics core performance analyser GUI for analysis On-device server Features Real-time performance data 44
45 PVRTune Real-time GPU profiler New counters GPU clock speed, triangles culled, Hidden Surface Removal efficiency, SLC memory reads/writes and more GUI changes Simplified setup and navigation Graphics and Compute modes Tree view for counters (Overview, Tiler, Renderer etc.) 45
46 PVRShaderEditor Shader editor & offline profiler (with disassembly!) 46
47 Rogue graphics driver
48 Rogue graphics driver Release schedule DDK (Driver Development Kit) release process Reference driver source code released to PowerVR IP licensees Minor revision roughly every 6 months Top-tier customers engage early. Drivers in products shortly after official DDK release 48
49 Rogue graphics driver 1.4 DDK Release date Q (release 1) Q (release 2) OpenGL ES: Key features (release 1) OpenGL ES 3.1 Compute shaders, shader storage buffer objects, draw indirect and more OpenGL ES: Key features (release 2) Android Lollipop support 49
50 Rogue graphics driver 1.5 DDK Release date Q2/Q OpenGL ES: Key features Android Extension Pack (AEP) ASTC, blend equation advanced, GPU shader model 5 and more srgb PVRTC Pixel local storage 128/256 bits per-pixel on-chip 50
51 Rogue graphics driver 1.6 DDK Release date Q OpenGL ES: Key features Bicubic texture filtering Shader group vote Polygon offset clamp Pixel local storage 2 Simultaneously write to pixel local storage and a framebuffer attachment 51
52 52
53 Vulkan About What is Vulkan? New open standard API developed by the Khronos group Designed for high-efficiency access to graphics and compute on modern GPUs Key features Minimizes driver overhead and enables multi-threaded GPU command preparation Designed for mobile, desktop, console and embedded platforms Designed for all GPUs - tile based GPUs are first-class citizens! SPIR-V binary intermediate language for shaders 53
54 Vulkan PowerVR driver status PowerVR Vulkan driver Driver development on-going Working with key partners on initial content bring up More details at SIGGRAPH 2015 Khronos BoF: Vulkan, OpenGL, OpenGL ES - 5:30 PM - 7:30 PM 54
55 PowerVR Graphics Future roadmaps What drives our roadmaps? Market analysis Customer feedback Developer feedback 55
56 Upcoming events idc-uk Imagination Developers Connection 2015 UK 1 st October, SOHO Hotel, London UK Register here: Agenda A full developer day including optimization tips, how to use ray tracing with raster graphics and more Also includes guest talks from Google and Digital Legends 56
57 Questions?
58
Enabling immersive gaming experiences Intro to Ray Tracing
Enabling immersive gaming experiences Intro to Ray Tracing Overview What is Ray Tracing? Why Ray Tracing? PowerVR Wizard Architecture Example Content Unity Hybrid Rendering Demonstration 3 What is Ray
More informationEnhancing Traditional Rasterization Graphics with Ray Tracing. October 2015
Enhancing Traditional Rasterization Graphics with Ray Tracing October 2015 James Rumble Developer Technology Engineer, PowerVR Graphics Overview Ray Tracing Fundamentals PowerVR Ray Tracing Pipeline Using
More informationPowerVR Hardware. Architecture Overview for Developers
Public Imagination Technologies PowerVR Hardware Public. This publication contains proprietary information which is subject to change without notice and is supplied 'as is' without warranty of any kind.
More informationTools To Get Great Graphics Performance
PowerVR SDK Overview Tools To Get Great Graphics Performance August 2012 www.imgtec.com Imagination Technologies PowerVR Graphics The Embedded Graphics IP Experts Imagination Technologies - multi-media
More informationEnhancing Traditional Rasterization Graphics with Ray Tracing. March 2015
Enhancing Traditional Rasterization Graphics with Ray Tracing March 2015 Introductions James Rumble Developer Technology Engineer Ray Tracing Support Justin DeCell Software Design Engineer Ray Tracing
More informationPowerVR GPU IP from Wearables to Servers. Kristof Beets Director of Business Development May 2015
PowerVR GPU IP from Wearables to Servers Kristof Beets Director of Business Development May 2015 www.imgtec.com Expanding embedded GPU market opportunities Huge range of market opportunities equates to
More informationThe PowerVR Insider SDK. PowerVR Developer Technology
The PowerVR Insider SDK PowerVR Developer Technology Nov 2012 First, An Introduction! Who Am I? Who? Guillem Vinals Developer Technology Engineer, PowerVR Graphics What? An introduction to our PowerVR
More informationVulkan: Architecture positive How Vulkan maps to PowerVR GPUs Kevin sun Lead Developer Support Engineer, APAC PowerVR Graphics.
Vulkan: Architecture positive How Vulkan maps to PowerVR GPUs Kevin sun Lead Developer Support Engineer, APAC PowerVR Graphics www.imgtec.com Introduction Who am I? Kevin Sun Working at Imagination Technologies
More informationUsing the PowerVR SDK to Optimize your Renderer
Using the PowerVR SDK to Optimize your Renderer March, 2013 www.imgtec.com Imagination Technologies p1 First, an Introduction www.imgtec.com Imagination Technologies p2 Who are we? Who? Joe Davis, Developer
More informationPowerVR Series5. Architecture Guide for Developers
Public Imagination Technologies PowerVR Series5 Public. This publication contains proprietary information which is subject to change without notice and is supplied 'as is' without warranty of any kind.
More informationThe Bifrost GPU architecture and the ARM Mali-G71 GPU
The Bifrost GPU architecture and the ARM Mali-G71 GPU Jem Davies ARM Fellow and VP of Technology Hot Chips 28 Aug 2016 Introduction to ARM Soft IP ARM licenses Soft IP cores (amongst other things) to our
More informationNext Generation OpenGL Neil Trevett Khronos President NVIDIA VP Mobile Copyright Khronos Group Page 1
Next Generation OpenGL Neil Trevett Khronos President NVIDIA VP Mobile Ecosystem @neilt3d Copyright Khronos Group 2015 - Page 1 Copyright Khronos Group 2015 - Page 2 Khronos Connects Software to Silicon
More informationPowerVR Performance Recommendations. The Golden Rules
PowerVR Performance Recommendations Public. This publication contains proprietary information which is subject to change without notice and is supplied 'as is' without warranty of any kind. Redistribution
More informationPowerVR: Getting Great Graphics Performance with the PowerVR Insider SDK. PowerVR Developer Technology
PowerVR: Getting Great Graphics Performance with the PowerVR Insider SDK PowerVR Developer Technology Company Overview Leading silicon, software & cloud IP supplier Graphics, video, comms, processor, cloud
More informationBifrost - The GPU architecture for next five billion
Bifrost - The GPU architecture for next five billion Hessed Choi Senior FAE / ARM ARM Tech Forum June 28 th, 2016 Vulkan 2 ARM 2016 What is Vulkan? A 3D graphics API for the next twenty years Logical successor
More informationMali Developer Resources. Kevin Ho ARM Taiwan FAE
Mali Developer Resources Kevin Ho ARM Taiwan FAE ARM Mali Developer Tools Software Development SDKs for OpenGL ES & OpenCL OpenGL ES Emulators Shader Development Studio Shader Library Asset Creation Texture
More informationLPGPU Workshop on Power-Efficient GPU and Many-core Computing (PEGPUM 2014)
A practitioner s view of challenges faced with power and performance on mobile GPU Prashant Sharma Samsung R&D Institute UK LPGPU Workshop on Power-Efficient GPU and Many-core Computing (PEGPUM 2014) SERI
More informationWorking with Metal Overview
Graphics and Games #WWDC14 Working with Metal Overview Session 603 Jeremy Sandmel GPU Software 2014 Apple Inc. All rights reserved. Redistribution or public display not permitted without written permission
More informationProfiling and Debugging Games on Mobile Platforms
Profiling and Debugging Games on Mobile Platforms Lorenzo Dal Col Senior Software Engineer, Graphics Tools Gamelab 2013, Barcelona 26 th June 2013 Agenda Introduction to Performance Analysis with ARM DS-5
More informationPOWERVR MBX & SGX OpenVG Support and Resources
POWERVR MBX & SGX OpenVG Support and Resources Kristof Beets 3 rd Party Relations Manager - Imagination Technologies kristof.beets@imgtec.com Copyright Khronos Group, 2006 - Page 1 Copyright Khronos Group,
More informationCourse Recap + 3D Graphics on Mobile GPUs
Lecture 18: Course Recap + 3D Graphics on Mobile GPUs Interactive Computer Graphics Q. What is a big concern in mobile computing? A. Power Two reasons to save power Run at higher performance for a fixed
More informationIntroduction to PowerVR Ray Tracing Tuesday 18th March, GDC. James A. McCombe
Introduction to PowerVR Tracing Tuesday 18th March, 2014 @ GDC James A. McCombe What are we launching today? Host CPU Interface Vertex Data Master Control and Register Bus Unified Shading Cluster Array
More informationPowerVR Performance Recommendations. The Golden Rules
PowerVR Performance Recommendations Copyright Imagination Technologies Limited. All Rights Reserved. This publication contains proprietary information which is subject to change without notice and is supplied
More informationOptimizing and Profiling Unity Games for Mobile Platforms. Angelo Theodorou Senior Software Engineer, MPG Gamelab 2014, 25 th -27 th June
Optimizing and Profiling Unity Games for Mobile Platforms Angelo Theodorou Senior Software Engineer, MPG Gamelab 2014, 25 th -27 th June 1 Agenda Introduction ARM and the presenter Preliminary knowledge
More informationParallel Programming on Larrabee. Tim Foley Intel Corp
Parallel Programming on Larrabee Tim Foley Intel Corp Motivation This morning we talked about abstractions A mental model for GPU architectures Parallel programming models Particular tools and APIs This
More informationSIGGRAPH Briefing August 2014
Copyright Khronos Group 2014 - Page 1 SIGGRAPH Briefing August 2014 Neil Trevett VP Mobile Ecosystem, NVIDIA President, Khronos Copyright Khronos Group 2014 - Page 2 Significant Khronos API Ecosystem Advances
More informationArchitectures. Michael Doggett Department of Computer Science Lund University 2009 Tomas Akenine-Möller and Michael Doggett 1
Architectures Michael Doggett Department of Computer Science Lund University 2009 Tomas Akenine-Möller and Michael Doggett 1 Overview of today s lecture The idea is to cover some of the existing graphics
More informationGraphics Architectures and OpenCL. Michael Doggett Department of Computer Science Lund university
Graphics Architectures and OpenCL Michael Doggett Department of Computer Science Lund university Overview Parallelism Radeon 5870 Tiled Graphics Architectures Important when Memory and Bandwidth limited
More informationCopyright Khronos Group Page 1
Gaming Market Briefing Overview of APIs GDC March 2016 Neil Trevett Khronos President NVIDIA Vice President Developer Ecosystem ntrevett@nvidia.com @neilt3d Copyright Khronos Group 2016 - Page 1 Copyright
More informationPress Briefing SIGGRAPH 2015 Neil Trevett Khronos President NVIDIA Vice President Mobile Ecosystem. Copyright Khronos Group Page 1
Press Briefing SIGGRAPH 2015 Neil Trevett Khronos President NVIDIA Vice President Mobile Ecosystem Copyright Khronos Group 2015 - Page 1 Khronos Connects Software to Silicon Open Consortium creating ROYALTY-FREE,
More information! Readings! ! Room-level, on-chip! vs.!
1! 2! Suggested Readings!! Readings!! H&P: Chapter 7 especially 7.1-7.8!! (Over next 2 weeks)!! Introduction to Parallel Computing!! https://computing.llnl.gov/tutorials/parallel_comp/!! POSIX Threads
More informationBringing AAA graphics to mobile platforms. Niklas Smedberg Senior Engine Programmer, Epic Games
Bringing AAA graphics to mobile platforms Niklas Smedberg Senior Engine Programmer, Epic Games Who Am I A.k.a. Smedis Platform team at Epic Games Unreal Engine 15 years in the industry 30 years of programming
More informationCS427 Multicore Architecture and Parallel Computing
CS427 Multicore Architecture and Parallel Computing Lecture 6 GPU Architecture Li Jiang 2014/10/9 1 GPU Scaling A quiet revolution and potential build-up Calculation: 936 GFLOPS vs. 102 GFLOPS Memory Bandwidth:
More informationChallenges for GPU Architecture. Michael Doggett Graphics Architecture Group April 2, 2008
Michael Doggett Graphics Architecture Group April 2, 2008 Graphics Processing Unit Architecture CPUs vsgpus AMD s ATI RADEON 2900 Programming Brook+, CAL, ShaderAnalyzer Architecture Challenges Accelerated
More informationEECS 487: Interactive Computer Graphics
EECS 487: Interactive Computer Graphics Lecture 21: Overview of Low-level Graphics API Metal, Direct3D 12, Vulkan Console Games Why do games look and perform so much better on consoles than on PCs with
More informationMali-400 MP: A Scalable GPU for Mobile Devices Tom Olson
Mali-400 MP: A Scalable GPU for Mobile Devices Tom Olson Director, Graphics Research, ARM Outline ARM and Mobile Graphics Design Constraints for Mobile GPUs Mali Architecture Overview Multicore Scaling
More informationReal-Time Rendering (Echtzeitgraphik) Michael Wimmer
Real-Time Rendering (Echtzeitgraphik) Michael Wimmer wimmer@cg.tuwien.ac.at Walking down the graphics pipeline Application Geometry Rasterizer What for? Understanding the rendering pipeline is the key
More informationPractical Techniques for Ray Tracing in Games. Gareth Morgan (Imagination Technologies) Aras Pranckevičius (Unity Technologies) March, 2014
Practical Techniques for Ray Tracing in Games Gareth Morgan (Imagination Technologies) Aras Pranckevičius (Unity Technologies) March, 2014 What Ray Tracing is not! Myth: Ray Tracing is only for photorealistic
More informationPowerVR SDK & Tools. February 2017
PowerVR SDK & Tools February 2017 www.imgtec.com PowerVR SDK Features PowerVR Framework Cross-platform programming kit Supports PowerVR assets Source code provided Allows easy porting of native graphics
More informationLecture 25: Board Notes: Threads and GPUs
Lecture 25: Board Notes: Threads and GPUs Announcements: - Reminder: HW 7 due today - Reminder: Submit project idea via (plain text) email by 11/24 Recap: - Slide 4: Lecture 23: Introduction to Parallel
More informationAchieving Console Quality Games on Mobile
Achieving Console Quality Games on Mobile Peter Harris, Senior Principal Engineer, ARM Unai Landa, CTO, Digital Legends Jon Kirkham, Staff Engineer, ARM GDC 2017 Agenda Premium smartphone in 2017 ARM Cortex
More informationPowerVR Performance Recommendations The Golden Rules. October 2015
PowerVR Performance Recommendations The Golden Rules October 2015 Paul Ly Developer Technology Engineer, PowerVR Graphics Understanding Your Bottlenecks Based on our experience 3 The Golden Rules 1. The
More informationOverview. Technology Details. D/AVE NX Preliminary Product Brief
Overview D/AVE NX is the latest and most powerful addition to the D/AVE family of rendering cores. It is the first IP to bring full OpenGL ES 2.0/3.1 rendering to the FPGA and SoC world. Targeted for graphics
More informationCopyright Khronos Group Page 1. Vulkan Overview. June 2015
Copyright Khronos Group 2015 - Page 1 Vulkan Overview June 2015 Copyright Khronos Group 2015 - Page 2 Khronos Connects Software to Silicon Open Consortium creating OPEN STANDARD APIs for hardware acceleration
More informationLecture 9: Deferred Shading. Visual Computing Systems CMU , Fall 2013
Lecture 9: Deferred Shading Visual Computing Systems The course so far The real-time graphics pipeline abstraction Principle graphics abstractions Algorithms and modern high performance implementations
More informationReal-Time Reyes: Programmable Pipelines and Research Challenges. Anjul Patney University of California, Davis
Real-Time Reyes: Programmable Pipelines and Research Challenges Anjul Patney University of California, Davis Real-Time Reyes-Style Adaptive Surface Subdivision Anjul Patney and John D. Owens SIGGRAPH Asia
More informationMobile HW and Bandwidth
Your logo on white Mobile HW and Bandwidth Andrew Gruber Qualcomm Technologies, Inc. Agenda and Goals Describe the Power and Bandwidth challenges facing Mobile Graphics Describe some of the Power Saving
More informationPOWERVR MBX. Technology Overview
POWERVR MBX Technology Overview Copyright 2009, Imagination Technologies Ltd. All Rights Reserved. This publication contains proprietary information which is subject to change without notice and is supplied
More informationParallelizing Graphics Pipeline Execution (+ Basics of Characterizing a Rendering Workload)
Lecture 2: Parallelizing Graphics Pipeline Execution (+ Basics of Characterizing a Rendering Workload) Visual Computing Systems Today Finishing up from last time Brief discussion of graphics workload metrics
More informationMention driver developers in the room. Because of time this will be fairly high level, feel free to come talk to us afterwards
1 Introduce Mark, Michael Poll: Who is a software developer or works for a software company? Who s in management? Who knows what the OpenGL ARB standards body is? Mention driver developers in the room.
More informationParallelizing Graphics Pipeline Execution (+ Basics of Characterizing a Rendering Workload)
Lecture 2: Parallelizing Graphics Pipeline Execution (+ Basics of Characterizing a Rendering Workload) Visual Computing Systems Analyzing a 3D Graphics Workload Where is most of the work done? Memory Vertex
More informationGraphics Hardware, Graphics APIs, and Computation on GPUs. Mark Segal
Graphics Hardware, Graphics APIs, and Computation on GPUs Mark Segal Overview Graphics Pipeline Graphics Hardware Graphics APIs ATI s low-level interface for computation on GPUs 2 Graphics Hardware High
More informationOptimisation. CS7GV3 Real-time Rendering
Optimisation CS7GV3 Real-time Rendering Introduction Talk about lower-level optimization Higher-level optimization is better algorithms Example: not using a spatial data structure vs. using one After that
More informationOptimizing DirectX Graphics. Richard Huddy European Developer Relations Manager
Optimizing DirectX Graphics Richard Huddy European Developer Relations Manager Some early observations Bear in mind that graphics performance problems are both commoner and rarer than you d think The most
More informationGraphics Processing Unit Architecture (GPU Arch)
Graphics Processing Unit Architecture (GPU Arch) With a focus on NVIDIA GeForce 6800 GPU 1 What is a GPU From Wikipedia : A specialized processor efficient at manipulating and displaying computer graphics
More informationPress Briefing SIGGRAPH 2015 Neil Trevett Khronos President NVIDIA Vice President Mobile Ecosystem. Copyright Khronos Group Page 1
Press Briefing SIGGRAPH 2015 Neil Trevett Khronos President NVIDIA Vice President Mobile Ecosystem Copyright Khronos Group 2015 - Page 1 Khronos Connects Software to Silicon Open Consortium creating ROYALTY-FREE,
More informationVulkan Multipass mobile deferred done right
Vulkan Multipass mobile deferred done right Hans-Kristian Arntzen Marius Bjørge Khronos 5 / 25 / 2017 Content What is multipass? What multipass allows... A driver to do versus MRT Developers to do Transient
More informationNext-Generation Graphics on Larrabee. Tim Foley Intel Corp
Next-Generation Graphics on Larrabee Tim Foley Intel Corp Motivation The killer app for GPGPU is graphics We ve seen Abstract models for parallel programming How those models map efficiently to Larrabee
More informationPowerVR. Performance Recommendations
PowerVR Public. This publication contains proprietary information which is subject to change without notice and is supplied 'as is' without warranty of any kind. Redistribution of this document is permitted
More informationBuilding scalable 3D applications. Ville Miettinen Hybrid Graphics
Building scalable 3D applications Ville Miettinen Hybrid Graphics What s going to happen... (1/2) Mass market: 3D apps will become a huge success on low-end and mid-tier cell phones Retro-gaming New game
More informationUnreal Engine 4: Mobile Graphics on ARM CPU and GPU Architecture
Unreal Engine 4: Mobile Graphics on ARM CPU and GPU Architecture Ray Hwang, Segment Marketing Manager, ARM Niklas Smedberg, Senior Engine Programmer, Epic Games Hessed Choi, Senior Field Applications Engineer,
More informationOptimizing for DirectX Graphics. Richard Huddy European Developer Relations Manager
Optimizing for DirectX Graphics Richard Huddy European Developer Relations Manager Also on today from ATI... Start & End Time: 12:00pm 1:00pm Title: Precomputed Radiance Transfer and Spherical Harmonic
More informationMultimedia in Mobile Phones. Architectures and Trends Lund
Multimedia in Mobile Phones Architectures and Trends Lund 091124 Presentation Henrik Ohlsson Contact: henrik.h.ohlsson@stericsson.com Working with multimedia hardware (graphics and displays) at ST- Ericsson
More informationSqueezing Performance out of your Game with ATI Developer Performance Tools and Optimization Techniques
Squeezing Performance out of your Game with ATI Developer Performance Tools and Optimization Techniques Jonathan Zarge, Team Lead Performance Tools Richard Huddy, European Developer Relations Manager ATI
More informationUnleashing the benefits of GPU Computing with ARM Mali TM Practical applications and use-cases. Steve Steele, ARM
Unleashing the benefits of GPU Computing with ARM Mali TM Practical applications and use-cases Steve Steele, ARM 1 Today s Computational Challenges Trends Growing display sizes and resolutions, richer
More informationWindowing System on a 3D Pipeline. February 2005
Windowing System on a 3D Pipeline February 2005 Agenda 1.Overview of the 3D pipeline 2.NVIDIA software overview 3.Strengths and challenges with using the 3D pipeline GeForce 6800 220M Transistors April
More informationReal - Time Rendering. Pipeline optimization. Michal Červeňanský Juraj Starinský
Real - Time Rendering Pipeline optimization Michal Červeňanský Juraj Starinský Motivation Resolution 1600x1200, at 60 fps Hw power not enough Acceleration is still necessary 3.3.2010 2 Overview Application
More informationDEFERRED RENDERING STEFAN MÜLLER ARISONA, ETH ZURICH SMA/
DEFERRED RENDERING STEFAN MÜLLER ARISONA, ETH ZURICH SMA/2013-11-04 DEFERRED RENDERING? CONTENTS 1. The traditional approach: Forward rendering 2. Deferred rendering (DR) overview 3. Example uses of DR:
More informationGeForce4. John Montrym Henry Moreton
GeForce4 John Montrym Henry Moreton 1 Architectural Drivers Programmability Parallelism Memory bandwidth 2 Recent History: GeForce 1&2 First integrated geometry engine & 4 pixels/clk Fixed-function transform,
More informationStreaming Massive Environments From Zero to 200MPH
FORZA MOTORSPORT From Zero to 200MPH Chris Tector (Software Architect Turn 10 Studios) Turn 10 Internal studio at Microsoft Game Studios - we make Forza Motorsport Around 70 full time staff 2 Why am I
More informationThreading Hardware in G80
ing Hardware in G80 1 Sources Slides by ECE 498 AL : Programming Massively Parallel Processors : Wen-Mei Hwu John Nickolls, NVIDIA 2 3D 3D API: API: OpenGL OpenGL or or Direct3D Direct3D GPU Command &
More informationIntroduction to Parallel Programming Models
Introduction to Parallel Programming Models Tim Foley Stanford University Beyond Programmable Shading 1 Overview Introduce three kinds of parallelism Used in visual computing Targeting throughput architectures
More informationScheduling the Graphics Pipeline on a GPU
Lecture 20: Scheduling the Graphics Pipeline on a GPU Visual Computing Systems Today Real-time 3D graphics workload metrics Scheduling the graphics pipeline on a modern GPU Quick aside: tessellation Triangle
More informationDeferred Rendering Due: Wednesday November 15 at 10pm
CMSC 23700 Autumn 2017 Introduction to Computer Graphics Project 4 November 2, 2017 Deferred Rendering Due: Wednesday November 15 at 10pm 1 Summary This assignment uses the same application architecture
More informationECE 574 Cluster Computing Lecture 16
ECE 574 Cluster Computing Lecture 16 Vince Weaver http://web.eece.maine.edu/~vweaver vincent.weaver@maine.edu 26 March 2019 Announcements HW#7 posted HW#6 and HW#5 returned Don t forget project topics
More informationEnabling a Richer Multimedia Experience with GPU Compute. Roberto Mijat Visual Computing Marketing Manager
Enabling a Richer Multimedia Experience with GPU Compute Roberto Mijat Visual Computing Marketing Manager 1 What is GPU Compute Operating System and most application processing continue to reside on the
More informationThe NVIDIA GeForce 8800 GPU
The NVIDIA GeForce 8800 GPU August 2007 Erik Lindholm / Stuart Oberman Outline GeForce 8800 Architecture Overview Streaming Processor Array Streaming Multiprocessor Texture ROP: Raster Operation Pipeline
More informationCSE 591: GPU Programming. Introduction. Entertainment Graphics: Virtual Realism for the Masses. Computer games need to have: Klaus Mueller
Entertainment Graphics: Virtual Realism for the Masses CSE 591: GPU Programming Introduction Computer games need to have: realistic appearance of characters and objects believable and creative shading,
More informationRendering. Converting a 3D scene to a 2D image. Camera. Light. Rendering. View Plane
Rendering Pipeline Rendering Converting a 3D scene to a 2D image Rendering Light Camera 3D Model View Plane Rendering Converting a 3D scene to a 2D image Basic rendering tasks: Modeling: creating the world
More information3D Graphics in Future Mobile Devices. Steve Steele, ARM
3D Graphics in Future Mobile Devices Steve Steele, ARM Market Trends Mobile Computing Market Growth Volume in millions Mobile Computing Market Trends 1600 Smart Mobile Device Shipments (Smartphones and
More informationGUERRILLA DEVELOP CONFERENCE JULY 07 BRIGHTON
Deferred Rendering in Killzone 2 Michal Valient Senior Programmer, Guerrilla Talk Outline Forward & Deferred Rendering Overview G-Buffer Layout Shader Creation Deferred Rendering in Detail Rendering Passes
More informationToday s Agenda. DirectX 9 Features Sim Dietrich, nvidia - Multisample antialising Jason Mitchell, ATI - Shader models and coding tips
Today s Agenda DirectX 9 Features Sim Dietrich, nvidia - Multisample antialising Jason Mitchell, ATI - Shader models and coding tips Optimization for DirectX 9 Graphics Mike Burrows, Microsoft - Performance
More informationSaving the Planet Designing Low-Power, Low-Bandwidth GPUs
Saving the Planet Designing Low-Power, Low-Bandwidth GPUs Alan Tsai Business Development Manager ARM Saving the Planet? Really? Photo courtesy of NASA. 2 Mobile GPU design is all about power It s not about
More informationIntroduction to Modern GPU Hardware
The following content are extracted from the material in the references on last page. If any wrong citation or reference missing, please contact ldvan@cs.nctu.edu.tw. I will correct the error asap. This
More informationVulkan API 杨瑜, 资深工程师
Vulkan API 杨瑜, 资深工程师 Vulkan Overview (1/3) Some History ~2011 became apparent that the API is getting in the way - Console Developers programmed GPUs To-the-Metal 2012 Khronos started work on GLCommon
More informationRadeon ProRender and Radeon Rays in a Gaming Rendering Workflow. Takahiro Harada, AMD 2017/3
Radeon ProRender and Radeon Rays in a Gaming Rendering Workflow Takahiro Harada, AMD 2017/3 Agenda Introduction Radeon ProRender & Radeon Rays Radeon Rays Unity + Radeon Rays Integration to real time applications
More informationSpring 2010 Prof. Hyesoon Kim. AMD presentations from Richard Huddy and Michael Doggett
Spring 2010 Prof. Hyesoon Kim AMD presentations from Richard Huddy and Michael Doggett Radeon 2900 2600 2400 Stream Processors 320 120 40 SIMDs 4 3 2 Pipelines 16 8 4 Texture Units 16 8 4 Render Backens
More informationGPU Computation Strategies & Tricks. Ian Buck NVIDIA
GPU Computation Strategies & Tricks Ian Buck NVIDIA Recent Trends 2 Compute is Cheap parallelism to keep 100s of ALUs per chip busy shading is highly parallel millions of fragments per frame 0.5mm 64-bit
More informationAdding Advanced Shader Features and Handling Fragmentation
Copyright Khronos Group, 2010 - Page 1 Adding Advanced Shader Features and Handling Fragmentation How to enable your application on a wide range of devices Imagination Technologies Copyright Khronos Group,
More informationIntroduction to Multicore architecture. Tao Zhang Oct. 21, 2010
Introduction to Multicore architecture Tao Zhang Oct. 21, 2010 Overview Part1: General multicore architecture Part2: GPU architecture Part1: General Multicore architecture Uniprocessor Performance (ECint)
More informationOpenGL Status - November 2013 G-Truc Creation
OpenGL Status - November 2013 G-Truc Creation Vendor NVIDIA AMD Intel Windows Apple Release date 02/10/2013 08/11/2013 30/08/2013 22/10/2013 Drivers version 331.10 beta 13.11 beta 9.2 10.18.10.3325 MacOS
More informationInside VR on Mobile. Sam Martin Graphics Architect GDC 2016
Inside VR on Mobile Sam Martin Graphics Architect GDC 2016 VR Today Emerging technology Main mobile VR ecosystems Google Cardboard Samsung GearVR In this talk: Latency Multiple views Performance tuning
More informationOpenGL ES 2.0 : Start Developing Now. Dan Ginsburg Advanced Micro Devices, Inc.
OpenGL ES 2.0 : Start Developing Now Dan Ginsburg Advanced Micro Devices, Inc. Agenda OpenGL ES 2.0 Brief Overview Tools OpenGL ES 2.0 Emulator RenderMonkey w/ OES 2.0 Support OpenGL ES 2.0 3D Engine Case
More informationShadows. COMP 575/770 Spring 2013
Shadows COMP 575/770 Spring 2013 Shadows in Ray Tracing Shadows are important for realism Basic idea: figure out whether a point on an object is illuminated by a light source Easy for ray tracers Just
More informationIntroduction to OpenGL ES 3.0
Introduction to OpenGL ES 3.0 Eisaku Ohbuchi Digital Media Professionals Inc. 2012 Digital Media Professionals Inc. All rights reserved. 12/Sep/2012 Page 1 Agenda DMP overview (quick!) OpenGL ES 3.0 update
More informationSpring 2009 Prof. Hyesoon Kim
Spring 2009 Prof. Hyesoon Kim Benchmarking is critical to make a design decision and measuring performance Performance evaluations: Design decisions Earlier time : analytical based evaluations From 90
More informationIntroduction to CUDA Algoritmi e Calcolo Parallelo. Daniele Loiacono
Introduction to CUDA Algoritmi e Calcolo Parallelo References q This set of slides is mainly based on: " CUDA Technical Training, Dr. Antonino Tumeo, Pacific Northwest National Laboratory " Slide of Applied
More informationWhiz-Bang Graphics and Media Performance for Java Platform, Micro Edition (JavaME)
Whiz-Bang Graphics and Media Performance for Java Platform, Micro Edition (JavaME) Pavel Petroshenko, Sun Microsystems, Inc. Ashmi Bhanushali, NVIDIA Corporation Jerry Evans, Sun Microsystems, Inc. Nandini
More informationCS452/552; EE465/505. Clipping & Scan Conversion
CS452/552; EE465/505 Clipping & Scan Conversion 3-31 15 Outline! From Geometry to Pixels: Overview Clipping (continued) Scan conversion Read: Angel, Chapter 8, 8.1-8.9 Project#1 due: this week Lab4 due:
More informationMulticore and MIPS: Creating the next generation of SoCs. Jim Whittaker EVP MIPS Business Unit
Multicore and MIPS: Creating the next generation of SoCs Jim Whittaker EVP MIPS Business Unit www.imgtec.com Many new opportunities Wearables Home wireless for everything Automation & Robotics ADAS and
More information