Fast Stereoscopic Rendering on Mobile Ray Tracing GPU for Virtual Reality Applications
- Tyler Hutchinson
Transcription
1 Fast Stereoscopic Rendering on Mobile Ray Tracing GPU for Virtual Reality Applications SAMSUNG Advanced Institute of Technology Won-Jong Lee, Seok Joong Hwang, Youngsam Shin, Jeong-Joon Yoo, Soojung Ryu
2 What is ray tracing? Ray tracing is a technique for generating an image by tracing the path of light through pixels in an image plane and simulating the effects of its encounters with virtual objects. It naturally represents global effects such as shadows, reflection, and refraction. Image source: Wikipedia
3 Ray Tracing and Mobile Ray tracing provides a potential rendering technique for future mobile applications that require photorealistic graphics
4 Ray tracing for VR Stereo Iray VR (NVIDIA, 2016) Foveated rendering (AMD, 2014) VRWorks Audio (NVIDIA, 2016)
5 Background: Ray Tracing Early desktop CPU/GPU (`00~09) - Packet tracing [Gunther `07][Overbeck `08][Benthin `09] HW specialization (`02~06) - SaarCOR [Schmittler `02], RPU, D-RPU [Woop `05, `06] - Not commercialized Modern GPUs and MICs (`10~) - OptiX [Parker `10], Embree [Wald `14] - For professional graphics Mobile GPU and H/W revisit (`13~present) - SGRT [Lee `13, `15], GR6500 [McCombe `14], RayCore [Nah `14] - Targeted for real-time applications (Game, UX, AR/VR)
6 Background: Ray Tracing Performance & Power Consumption - Future GFX applications (e.g. VR, high-quality 3D games) will require higher resolutions (4K and above). - 3 Grays/s for a full RT game engine, 100~250 Mrays/s for hybrid rendering. Ray tracing requirements (1080p, 60fps), Techniques vs Ray throughput. Reference: PowerVR Graphics Keynote, Imagination Developer Connection
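To put the throughput figures on this slide in perspective, the short calculation below converts them into an average number of rays per pixel per frame at 1080p and 60 fps. It is only a back-of-the-envelope sketch: the assumption of one primary ray per pixel and the helper name `rays_per_pixel` are illustrative and do not come from the slides.

```python
# Ray budget at 1080p / 60 fps, using the throughput figures quoted on the slide.
# Assumption (not from the slides): one primary ray per pixel per frame.
WIDTH, HEIGHT, FPS = 1920, 1080, 60

primary_rays_per_second = WIDTH * HEIGHT * FPS


def rays_per_pixel(throughput_rays_per_second: float) -> float:
    """Average number of rays available per pixel per frame at a given throughput."""
    return throughput_rays_per_second / primary_rays_per_second


print(f"primary rays/s at 1080p60: {primary_rays_per_second / 1e6:.1f} M")
print(f"full RT engine (3 Grays/s): {rays_per_pixel(3e9):.1f} rays/pixel/frame")
print(f"hybrid (100 Mrays/s): {rays_per_pixel(100e6):.2f} rays/pixel/frame")
print(f"hybrid (250 Mrays/s): {rays_per_pixel(250e6):.2f} rays/pixel/frame")
```

Under these assumptions, a full ray-traced engine has a budget of roughly two dozen rays per pixel, while hybrid rendering has about one or two.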
7 Motivation: stereoscopic reprojection Project the left eye's ray hit points onto the right eye [Adelson et al. 1992] 2). Four-way pixel classification [Badt et al. 1988] 1): - Good, Missed, Overlapped - Bad: projected but may be occluded. Bad pixels are detected by examining reprojection indices. [Figure: GOOD and BAD cases, left (L) and right (R) eyes] 1) Badt, "Two algorithms taking advantage of temporal coherence in ray tracing", The Visual Computer, 1988. 2) Adelson and Hodges, "Visible Surface Ray-Tracing of Stereoscopic Images", Southeast Regional Conf., 1992.
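The reprojection and the four pixel classes can be illustrated on a single scanline. The sketch below assumes a parallel-axis stereo pair, so a point at depth z seen at left-eye column x lands at right-eye column x - f*b/z (f is the focal length in pixels, b the eye separation). The rule used for "bad" pixels is a simplified heuristic chosen for illustration: a right-eye pixel is flagged when it lies strictly between the landing columns of two adjacent left-eye pixels yet was filled by some other sample, i.e. the reprojection indices are out of order. It is not claimed to be the exact test of the cited papers or of the SGRT implementation, and the function and parameter names are hypothetical.

```python
from typing import List, Optional


def classify_right_scanline(depths: List[float], focal_px: float, baseline: float) -> List[str]:
    """Reproject one left-eye scanline and label every right-eye pixel.

    depths[x] is the depth of the hit point seen at left-eye column x.
    Returns one of "good", "missed", "overlapped", "bad" per right-eye column.
    """
    width = len(depths)

    # Right-eye column hit by each left-eye pixel (None if it falls off-screen).
    target: List[Optional[int]] = []
    for x_left, z in enumerate(depths):
        x_right = round(x_left - focal_px * baseline / z)
        target.append(x_right if 0 <= x_right < width else None)

    # Collect which left-eye pixels landed on each right-eye pixel.
    owners: List[List[int]] = [[] for _ in range(width)]
    for x_left, x_right in enumerate(target):
        if x_right is not None:
            owners[x_right].append(x_left)

    labels = []
    for own in owners:
        if not own:
            labels.append("missed")       # nothing projected here: must be traced
        elif len(own) == 1:
            labels.append("good")         # exactly one sample: candidate for reuse
        else:
            labels.append("overlapped")   # several samples: keep the nearest one

    # Heuristic "bad" test: adjacent left pixels that land more than one column
    # apart open a gap; any sample already sitting in that gap may be occluded.
    for x_left in range(width - 1):
        a, b = target[x_left], target[x_left + 1]
        if a is None or b is None:
            continue
        for x_right in range(a + 1, b):
            if labels[x_right] != "missed":
                labels[x_right] = "bad"
    return labels


# A far background (z = 10) with a thin near object (z = 2) at left column 3.
print(classify_right_scanline([10, 10, 10, 2, 10, 10, 10, 10], focal_px=6.0, baseline=1.0))
```

In this toy scanline the near object shifts further than the background, which produces one overlapped pixel, one pixel flagged as bad next to it, and missed pixels where the right eye sees around the object or past the image edge.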
8 Goal of this paper Efficiently map the reprojection algorithm onto existing ray tracing GPUs. Figure: stereoscopic ray-traced rendering with the reprojection method. Except for the yellow pixels (which indicate bad pixels), most of the pixels (91.54%) in the right image can be reused from the results of the left image.
9 Proposed Framework
10 Target platform: a mobile ray tracing GPU (SGRT) A mobile GPU based on ray tracing, which combines the advantages of programmable DSP cores and dedicated hardware - T&I Units: fast, compact H/W to accelerate traversal & intersection - SRP: programmable shader core that supports flexible shading and ray generation High-performance features: dual AABB test unit [Lee et al. 2014], reorder buffer [Lee et al. 2015], hybrid number representation [Hwang et al. 2015] [Block diagram labels: Host CPUs (Core #1 to #4), Host System BUS, Host DRAM; AXI System BUS, External DRAM; SGRT cores; T&I Unit with Traversal Units and an Intersection Unit, Cache(L1) per unit and Cache(L2); SRP with VLIW Engine, Coarse Grained Reconfigurable Array, Internal SRAM, I-Cache, C-Mem, Texture Unit with Cache(L1)]
11 SGRT: T&I (Traversal and Intersection) Units Specialized H/W for BVH tree traversal and intersection operations for ray tracing. Features: MIMD parallel architecture [Lee et al. 2013], 2-AABB traversal unit [Lee et al. 2014], latency hiding [Shin et al. 2015], and hybrid number representation [Hwang et al. 2015]. The T&I unit and the shader core in the GPU are connected via direct interfaces.
12 SGRT: SRP (Samsung Reconfigurable Processor) A flexible architecture template [Lee et al. 2011]. Instructions for arithmetic, special functions, and texturing are implemented. The VLIW engine is useful for general-purpose computations (functions, control flow). The CGRA makes full use of the software pipelining technique for loop acceleration. [Diagram labels: Instruction, DATA, VLIW with central RF (register file), CGA as a grid of FUs with local RFs; program loops (for ( ) { Loop }) alternating between control processing and data processing]
16 Overview: stereoscopic reprojection rendering on SGRT This is the first demonstration of stereoscopic rendering utilizing a mobile ray tracing GPU for VR applications. SGRT efficiently supports flexible real-time ray tracing by combining the advantages of hardware and software. Thus, new software kernels (reprojection, validation, and reuse) were easily added. The H/W accelerator (T&I unit) performs fast ray traversal. Tile-based ray tracing - By conducting ray tracing on a per-tile basis, the G-buffer fits into the internal memory, which allows the reprojection and reuse kernels to run from the on-chip internal SRAM without accessing the external DRAM (the validation kernel is the exception).
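The benefit of working per tile can be seen by comparing the G-buffer footprint of a whole frame with that of one tile. The figures below are a sketch under stated assumptions: the attribute list (color, normal, texcoord, position) and the 2048 x 1024 resolution appear in the slides, but the per-attribute byte sizes and the 32 x 32 tile size are hypothetical values picked for illustration.

```python
# Hypothetical per-pixel G-buffer layout; the slides only name the attributes.
BYTES_PER_PIXEL = {
    "color": 4,      # assumed RGBA8
    "normal": 12,    # assumed 3 x float32
    "texcoord": 8,   # assumed 2 x float32
    "position": 12,  # assumed 3 x float32
}


def gbuffer_bytes(width: int, height: int) -> int:
    """Total G-buffer size in bytes for a width x height pixel region."""
    return width * height * sum(BYTES_PER_PIXEL.values())


FRAME_W, FRAME_H = 2048, 1024   # resolution used in the evaluation
TILE_W, TILE_H = 32, 32         # hypothetical tile size

full_frame = gbuffer_bytes(FRAME_W, FRAME_H)
one_tile = gbuffer_bytes(TILE_W, TILE_H)
tile_count = (FRAME_W // TILE_W) * (FRAME_H // TILE_H)

print(f"full-frame G-buffer: {full_frame / 2**20:.0f} MiB")
print(f"per-tile G-buffer:   {one_tile / 2**10:.0f} KiB ({tile_count} tiles per frame)")
```

With this assumed layout, the full-frame G-buffer is tens of mebibytes, far beyond typical on-chip SRAM, while a single tile needs only tens of kibibytes.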
17 Processing Detail (left image) [Diagram, repeated across slides 17 to 24 as animation builds. Shader Core (SRP): Ray Generation, Shading, Re-projection. Ray tracing H/W (T&I Unit): Traversal & Intersection. Buffers: Rays, Hit Points, Span G-Buffer (Color, Normal, Texcoord, Position). Memories: Internal Mem., DRAM.]
25 Processing Detail (right image) [Diagram, repeated across slides 25 to 30 as animation builds. Shader Core (SRP): Validation Test, Bad Pixel? (Yes / No), Ray Generation, Re-use, Shading. Ray tracing H/W (T&I Unit): Traversal & Intersection. Buffers: Span G-Buffer (a row is prefetched by DMA), Rays, Hit Points, Tile Color-Buffer (updating tile colors), Frame Buffer. Memories: Internal Mem., DRAM. Bad pixels are drawn in red.]
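The right-image flow in the diagram (validation test, then either re-use or ray generation followed by traversal and shading) can be summarized as a per-tile loop. The sketch below is a minimal software illustration under assumptions: it treats a "Yes" on the bad-pixel test as the branch that generates a new ray, also traces pixels that received no reprojected sample, and uses a caller-supplied `trace_and_shade` callback as a stand-in for the T&I unit plus shading. None of these names come from the slides.

```python
from typing import Callable, List, Optional, Tuple

Color = Tuple[float, float, float]


def shade_right_tile(reprojected: List[Optional[Color]],
                     is_bad: List[bool],
                     trace_and_shade: Callable[[int], Color]) -> Tuple[List[Color], int]:
    """Fill one right-eye tile, reusing left-eye results where they are valid.

    reprojected[i] is the color carried over from the left eye, or None if
    no sample landed on pixel i. is_bad[i] is the result of the validation test.
    Returns the tile colors and the number of rays that had to be traced.
    """
    tile_colors: List[Color] = []
    rays_traced = 0
    for pixel, (color, bad) in enumerate(zip(reprojected, is_bad)):
        if color is None or bad:
            # No usable sample: generate a ray and trace it as usual.
            tile_colors.append(trace_and_shade(pixel))
            rays_traced += 1
        else:
            # Valid sample: reuse the left-eye result without tracing.
            tile_colors.append(color)
    return tile_colors, rays_traced


grey = (0.5, 0.5, 0.5)
colors, traced = shade_right_tile(
    reprojected=[(1.0, 0.0, 0.0), None, (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)],
    is_bad=[False, False, True, False],
    trace_and_shade=lambda pixel: grey,
)
print(colors, traced)
```

Only the pixels that fail validation or were never covered reach the tracing path, which is where the savings in traversal work come from.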
31 Evaluation
32 Experimental Setup Performance and energy simulation model: cycle-accurate simulators for SGRT integrated with an energy model. The energy and power model follows [Lee et al. 2015], a custom model based on a database of per-component power values from Synopsys PrimeTime PX [SYNOPSYS 2016] with the SAMSUNG 14nm LPP process technology [Samsung 2016]. The configuration of the T&I unit is the same as in [Lee et al. 2014]: 4 TRV + 1 IST units, 500MHz
33 Experimental Setup Test applications: we used six datasets (Figure 1): Teapot (15K triangles), Chess (42K), BMW (55K), Chemical Lab. (98K), Music box (106K), and Provence (600K). Test scenes were all rendered at 2048 x 1024 resolution with enough secondary ray effects. We compared against the standard reference: ray tracing without reprojection on the same hardware platform.
34 Reused pixels The results of the stereoscopic rendering for six test scenes*. The pixels marked in yellow in the right image indicate bad pixels. Most of the right-image pixels (91.54% on average) could be reused from the left image, as shown in the figure. * The barrel distortion correction filter was intentionally not applied to these rendered scenes, so as to focus on the reprojection effect.
35 Relative Performance Overall, the reprojected renderer achieved up to 1.64 times better performance than the reference platform, because it substantially reduces the computing cost of the T&I unit. In terms of absolute performance, we obtained 131.3, 14.4, 18.2, 8.6, 28.9 and 44.5 fps for the test scenes, respectively. [Chart: relative performance (x), Standard vs. Reprojected, for Teapot, Chess, Musicbox, BMW, Chemical Lab., Provence]
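A rough upper bound helps interpret the 1.64x figure. If a fraction r of the right-eye pixels is reused at no cost, the stereo frame costs 1 + (1 - r) eye-renders instead of 2. The sketch below evaluates that bound for the reported average reuse rate. It is an idealized estimate under assumptions that do not come from the slides: equal cost per pixel, and zero cost for the reprojection, validation, and reuse kernels. Note also that 91.54% is an average over scenes, while 1.64x is the best case.

```python
def ideal_stereo_speedup(reuse_fraction: float) -> float:
    """Upper bound on stereo speedup when reused right-eye pixels cost nothing."""
    standard_cost = 2.0                               # left + right, both fully traced
    reprojected_cost = 1.0 + (1.0 - reuse_fraction)   # left + non-reused part of right
    return standard_cost / reprojected_cost


bound = ideal_stereo_speedup(0.9154)
print(f"ideal speedup at 91.54% reuse: {bound:.2f}x (reported: up to 1.64x)")
```

The gap between this idealized bound and the measured speedup is consistent with the extra kernels and memory traffic not being free, although the slides do not break that overhead down.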
36 Relative Performance Regarding energy consumption, our implementation reduced it by up to 20%, because it cuts the workload in the hardware. [Chart: relative energy (%), Standard vs. Reprojected, for Teapot, Chess, Musicbox, BMW, Chemical Lab., Provence]
37 Conclusion
38 Summary In this work, we present a solution to realize ray tracing based stereoscopic rendering utilizing a mobile ray tracing GPU. With the combination of reprojection and tile-based ray tracing, our approach could be a versatile solution for future VR applications, as it achieves up to 1.64 times better performance and 20% better energy efficiency compared with the state-of-the-art solution. Future work: apply more adaptive rendering, such as foveated rendering.
39 Thank you!
Enhancing Traditional Rasterization Graphics with Ray Tracing. October 2015
Enhancing Traditional Rasterization Graphics with Ray Tracing October 2015 James Rumble Developer Technology Engineer, PowerVR Graphics Overview Ray Tracing Fundamentals PowerVR Ray Tracing Pipeline Using
More informationRay Tracing. Computer Graphics CMU /15-662, Fall 2016
Ray Tracing Computer Graphics CMU 15-462/15-662, Fall 2016 Primitive-partitioning vs. space-partitioning acceleration structures Primitive partitioning (bounding volume hierarchy): partitions node s primitives
More informationPart IV. Review of hardware-trends for real-time ray tracing
Part IV Review of hardware-trends for real-time ray tracing Hardware Trends For Real-time Ray Tracing Philipp Slusallek Saarland University, Germany Large Model Visualization at Boeing CATIA Model of Boeing
More informationEnabling immersive gaming experiences Intro to Ray Tracing
Enabling immersive gaming experiences Intro to Ray Tracing Overview What is Ray Tracing? Why Ray Tracing? PowerVR Wizard Architecture Example Content Unity Hybrid Rendering Demonstration 3 What is Ray
More informationHardware-driven visibility culling
Hardware-driven visibility culling I. Introduction 20073114 김정현 The goal of the 3D graphics is to generate a realistic and accurate 3D image. To achieve this, it needs to process not only large amount
More informationEnhancing Traditional Rasterization Graphics with Ray Tracing. March 2015
Enhancing Traditional Rasterization Graphics with Ray Tracing March 2015 Introductions James Rumble Developer Technology Engineer Ray Tracing Support Justin DeCell Software Design Engineer Ray Tracing
More informationCourse Recap + 3D Graphics on Mobile GPUs
Lecture 18: Course Recap + 3D Graphics on Mobile GPUs Interactive Computer Graphics Q. What is a big concern in mobile computing? A. Power Two reasons to save power Run at higher performance for a fixed
More informationReal-Time Graphics Architecture. Kurt Akeley Pat Hanrahan. Ray Tracing.
Real-Time Graphics Architecture Kurt Akeley Pat Hanrahan http://www.graphics.stanford.edu/courses/cs448a-01-fall Ray Tracing with Tim Purcell 1 Topics Why ray tracing? Interactive ray tracing on multicomputers
More informationReview for Ray-tracing Algorithm and Hardware
Review for Ray-tracing Algorithm and Hardware Reporter: 邱敬捷博士候選人 Lan-Da Van ( 范倫達 ), Ph. D. Department of Computer Science National Chiao Tung University Taiwan, R.O.C. Summer, 2017 1 2017/7/26 Outline
More informationIntroduction to PowerVR Ray Tracing Tuesday 18th March, GDC. James A. McCombe
Introduction to PowerVR Tracing Tuesday 18th March, 2014 @ GDC James A. McCombe What are we launching today? Host CPU Interface Vertex Data Master Control and Register Bus Unified Shading Cluster Array
More informationRay Tracing with Multi-Core/Shared Memory Systems. Abe Stephens
Ray Tracing with Multi-Core/Shared Memory Systems Abe Stephens Real-time Interactive Massive Model Visualization Tutorial EuroGraphics 2006. Vienna Austria. Monday September 4, 2006 http://www.sci.utah.edu/~abe/massive06/
More informationCOMP 4801 Final Year Project. Ray Tracing for Computer Graphics. Final Project Report FYP Runjing Liu. Advised by. Dr. L.Y.
COMP 4801 Final Year Project Ray Tracing for Computer Graphics Final Project Report FYP 15014 by Runjing Liu Advised by Dr. L.Y. Wei 1 Abstract The goal of this project was to use ray tracing in a rendering
More informationReal-Time Ray Tracing Using Nvidia Optix Holger Ludvigsen & Anne C. Elster 2010
1 Real-Time Ray Tracing Using Nvidia Optix Holger Ludvigsen & Anne C. Elster 2010 Presentation by Henrik H. Knutsen for TDT24, fall 2012 Om du ønsker, kan du sette inn navn, tittel på foredraget, o.l.
More informationRendering. Converting a 3D scene to a 2D image. Camera. Light. Rendering. View Plane
Rendering Pipeline Rendering Converting a 3D scene to a 2D image Rendering Light Camera 3D Model View Plane Rendering Converting a 3D scene to a 2D image Basic rendering tasks: Modeling: creating the world
More informationCS427 Multicore Architecture and Parallel Computing
CS427 Multicore Architecture and Parallel Computing Lecture 6 GPU Architecture Li Jiang 2014/10/9 1 GPU Scaling A quiet revolution and potential build-up Calculation: 936 GFLOPS vs. 102 GFLOPS Memory Bandwidth:
More informationA Bandwidth Effective Rendering Scheme for 3D Texture-based Volume Visualization on GPU
for 3D Texture-based Volume Visualization on GPU Won-Jong Lee, Tack-Don Han Media System Laboratory (http://msl.yonsei.ac.k) Dept. of Computer Science, Yonsei University, Seoul, Korea Contents Background
More informationPowerVR Hardware. Architecture Overview for Developers
Public Imagination Technologies PowerVR Hardware Public. This publication contains proprietary information which is subject to change without notice and is supplied 'as is' without warranty of any kind.
More informationHigh-Performance Ray Tracing
Lecture 16: High-Performance Ray Tracing Computer Graphics CMU 15-462/15-662, Fall 2015 Ray tracing is a mechanism for answering visibility queries v(x 1,x 2 ) = 1 if x 1 is visible from x 2, 0 otherwise
More informationComputer Architecture
Computer Architecture Slide Sets WS 2013/2014 Prof. Dr. Uwe Brinkschulte M.Sc. Benjamin Betting Part 10 Thread and Task Level Parallelism Computer Architecture Part 10 page 1 of 36 Prof. Dr. Uwe Brinkschulte,
More informationEmbree Ray Tracing Kernels: Overview and New Features
Embree Ray Tracing Kernels: Overview and New Features Attila Áfra, Ingo Wald, Carsten Benthin, Sven Woop Intel Corporation Intel, the Intel logo, Intel Xeon Phi, Intel Xeon Processor are trademarks of
More informationArchitectures. Michael Doggett Department of Computer Science Lund University 2009 Tomas Akenine-Möller and Michael Doggett 1
Architectures Michael Doggett Department of Computer Science Lund University 2009 Tomas Akenine-Möller and Michael Doggett 1 Overview of today s lecture The idea is to cover some of the existing graphics
More informationSpecialized Acceleration Structures for Ray-Tracing. Warren Hunt
Specialized Acceleration Structures for Ray-Tracing Warren Hunt Bill Mark Forward: Flavor of Research Build is cheap (especially with scan, lazy and build from hierarchy) Grid build and BVH refit are really
More informationReal Time Ray Tracing
Real Time Ray Tracing Programação 3D para Simulação de Jogos Vasco Costa Ray tracing? Why? How? P3DSJ Real Time Ray Tracing Vasco Costa 2 Real time ray tracing : example Source: NVIDIA P3DSJ Real Time
More informationConemarching in VR. Johannes Saam Mariano Merchante FRAMESTORE. Developing a Fractal experience at 90 FPS. / Framestore
Conemarching in VR Developing a Fractal experience at 90 FPS Johannes Saam Mariano Merchante FRAMESTORE / Framestore THE CONCEPT THE CONCEPT FRACTALS AND COLLISIONS THE CONCEPT RAYMARCHING AND VR FRACTALS
More informationA 50Mvertices/s Graphics Processor with Fixed-Point Programmable Vertex Shader for Mobile Applications
A 50Mvertices/s Graphics Processor with Fixed-Point Programmable Vertex Shader for Mobile Applications Ju-Ho Sohn, Jeong-Ho Woo, Min-Wuk Lee, Hye-Jung Kim, Ramchan Woo, Hoi-Jun Yoo Semiconductor System
More informationLecture 6: Texture. Kayvon Fatahalian CMU : Graphics and Imaging Architectures (Fall 2011)
Lecture 6: Texture Kayvon Fatahalian CMU 15-869: Graphics and Imaging Architectures (Fall 2011) Today: texturing! Texture filtering - Texture access is not just a 2D array lookup ;-) Memory-system implications
More informationPANEL MMEDIA Challenges in Multimedia
WWW.IARIA.ORG PANEL MMEDIA Challenges in Multimedia Prof. Dr. Petre DINI Concordia University, Canada China Space Agency Center, China IARIA Organization petre@iaria.org Petre DINI 1 Panel Moderator Petre
More informationDNNBuilder: an Automated Tool for Building High-Performance DNN Hardware Accelerators for FPGAs
IBM Research AI Systems Day DNNBuilder: an Automated Tool for Building High-Performance DNN Hardware Accelerators for FPGAs Xiaofan Zhang 1, Junsong Wang 2, Chao Zhu 2, Yonghua Lin 2, Jinjun Xiong 3, Wen-mei
More information6.837 Introduction to Computer Graphics Final Exam Tuesday, December 20, :05-12pm Two hand-written sheet of notes (4 pages) allowed 1 SSD [ /17]
6.837 Introduction to Computer Graphics Final Exam Tuesday, December 20, 2011 9:05-12pm Two hand-written sheet of notes (4 pages) allowed NAME: 1 / 17 2 / 12 3 / 35 4 / 8 5 / 18 Total / 90 1 SSD [ /17]
More informationFast BVH Construction on GPUs
Fast BVH Construction on GPUs Published in EUROGRAGHICS, (2009) C. Lauterbach, M. Garland, S. Sengupta, D. Luebke, D. Manocha University of North Carolina at Chapel Hill NVIDIA University of California
More informationRendering: Reality. Eye acts as pinhole camera. Photons from light hit objects
Basic Ray Tracing Rendering: Reality Eye acts as pinhole camera Photons from light hit objects Rendering: Reality Eye acts as pinhole camera Photons from light hit objects Rendering: Reality Eye acts as
More informationAN ACCELERATION OF FPGA-BASED RAY TRACER
AN ACCELERATION OF FPGA-BASED RAY TRACER Raisa Malcheva, PhD Mohammad Yunis, MA Donetsk National Technical University, Ukraine Abstract The Hardware implementations of the Ray Tracing algorithm are analyzed.
More information! Readings! ! Room-level, on-chip! vs.!
1! 2! Suggested Readings!! Readings!! H&P: Chapter 7 especially 7.1-7.8!! (Over next 2 weeks)!! Introduction to Parallel Computing!! https://computing.llnl.gov/tutorials/parallel_comp/!! POSIX Threads
More informationImproving Memory Space Efficiency of Kd-tree for Real-time Ray Tracing Byeongjun Choi, Byungjoon Chang, Insung Ihm
Improving Memory Space Efficiency of Kd-tree for Real-time Ray Tracing Byeongjun Choi, Byungjoon Chang, Insung Ihm Department of Computer Science and Engineering Sogang University, Korea Improving Memory
More informationHardware Accelerated Volume Visualization. Leonid I. Dimitrov & Milos Sramek GMI Austrian Academy of Sciences
Hardware Accelerated Volume Visualization Leonid I. Dimitrov & Milos Sramek GMI Austrian Academy of Sciences A Real-Time VR System Real-Time: 25-30 frames per second 4D visualization: real time input of
More informationEvaluation and Improvement of GPU Ray Tracing with a Thread Migration Technique
Evaluation and Improvement of GPU Ray Tracing with a Thread Migration Technique Xingxing Zhu and Yangdong Deng Institute of Microelectronics, Tsinghua University, Beijing, China Email: zhuxingxing0107@163.com,
More informationLecture 6: Texturing Part II: Texture Compression and GPU Latency Hiding Mechanisms. Visual Computing Systems CMU , Fall 2014
Lecture 6: Texturing Part II: Texture Compression and GPU Latency Hiding Mechanisms Visual Computing Systems Review: mechanisms to reduce aliasing in the graphics pipeline When sampling visibility?! -
More informationRAY TRACING FROM A DATA MOVEMENT PERSPECTIVE
RAY TRACING FROM A DATA MOVEMENT PERSPECTIVE by Daniel Kopta A dissertation submitted to the faculty of The University of Utah in partial fulfillment of the requirements for the degree of Doctor of Philosophy
More informationGraphics Processing Unit Architecture (GPU Arch)
Graphics Processing Unit Architecture (GPU Arch) With a focus on NVIDIA GeForce 6800 GPU 1 What is a GPU From Wikipedia : A specialized processor efficient at manipulating and displaying computer graphics
More informationReal-time ray tracing
Lecture 10: Real-time ray tracing (and opportunities for hardware acceleration) Visual Computing Systems Recent push towards real-time ray tracing Image credit: NVIDIA (this ray traced image can be rendered
More informationChallenges for GPU Architecture. Michael Doggett Graphics Architecture Group April 2, 2008
Michael Doggett Graphics Architecture Group April 2, 2008 Graphics Processing Unit Architecture CPUs vsgpus AMD s ATI RADEON 2900 Programming Brook+, CAL, ShaderAnalyzer Architecture Challenges Accelerated
More informationThreading Hardware in G80
ing Hardware in G80 1 Sources Slides by ECE 498 AL : Programming Massively Parallel Processors : Wen-Mei Hwu John Nickolls, NVIDIA 2 3D 3D API: API: OpenGL OpenGL or or Direct3D Direct3D GPU Command &
More informationAccelerating Realism with the (NVIDIA Scene Graph)
Accelerating Realism with the (NVIDIA Scene Graph) Holger Kunz Manager, Workstation Middleware Development Phillip Miller Director, Workstation Middleware Product Management NVIDIA application acceleration
More informationMassive Model Visualization using Real-time Ray Tracing
Massive Model Visualization using Real-time Ray Tracing Eurographics 2006 Tutorial: Real-time Interactive Massive Model Visualization Andreas Dietrich Philipp Slusallek Saarland University & intrace GmbH
More informationMultimedia in Mobile Phones. Architectures and Trends Lund
Multimedia in Mobile Phones Architectures and Trends Lund 091124 Presentation Henrik Ohlsson Contact: henrik.h.ohlsson@stericsson.com Working with multimedia hardware (graphics and displays) at ST- Ericsson
More informationB-KD Trees for Hardware Accelerated Ray Tracing of Dynamic Scenes
B-KD rees for Hardware Accelerated Ray racing of Dynamic Scenes Sven Woop Gerd Marmitt Philipp Slusallek Saarland University, Germany Outline Previous Work B-KD ree as new Spatial Index Structure DynR
More informationBenchmark 1.a Investigate and Understand Designated Lab Techniques The student will investigate and understand designated lab techniques.
I. Course Title Parallel Computing 2 II. Course Description Students study parallel programming and visualization in a variety of contexts with an emphasis on underlying and experimental technologies.
More informationCS8803SC Software and Hardware Cooperative Computing GPGPU. Prof. Hyesoon Kim School of Computer Science Georgia Institute of Technology
CS8803SC Software and Hardware Cooperative Computing GPGPU Prof. Hyesoon Kim School of Computer Science Georgia Institute of Technology Why GPU? A quiet revolution and potential build-up Calculation: 367
More informationTEAPOT: A Toolset for Evaluating Performance, Power and Image Quality on Mobile Graphics Systems
International Conference on Supercomputing June 2013 TEAPOT: A Toolset for Evaluating Performance, Power and Image Quality on Mobile Graphics Systems Joan-Manuel Parcerisa Polychronis Xekalakis Computer
More informationGPGPU on Mobile Devices
GPGPU on Mobile Devices Introduction Addressing GPGPU for very mobile devices Tablets Smartphones Introduction Why dedicated GPUs in mobile devices? Gaming Physics simulation for realistic effects 3D-GUI
More informationASYNCHRONOUS SHADERS WHITE PAPER 0
ASYNCHRONOUS SHADERS WHITE PAPER 0 INTRODUCTION GPU technology is constantly evolving to deliver more performance with lower cost and lower power consumption. Transistor scaling and Moore s Law have helped
More informationParallelizing Graphics Pipeline Execution (+ Basics of Characterizing a Rendering Workload)
Lecture 2: Parallelizing Graphics Pipeline Execution (+ Basics of Characterizing a Rendering Workload) Visual Computing Systems Analyzing a 3D Graphics Workload Where is most of the work done? Memory Vertex
More informationDual Streaming for Hardware-Accelerated Ray Tracing
Dual Streaming for Hardware-Accelerated Ray Tracing Konstantin Shkurko Ian Mallett Tim Grant Cem Yuksel Daniel Kopta Erik Brunvand Streaming Processor Main Memory Other Data Hit Records Scene Data Ray
More informationProgrammable Shaders for Deformation Rendering
Programmable Shaders for Deformation Rendering Carlos D. Correa, Deborah Silver Rutgers, The State University of New Jersey Motivation We present a different way of obtaining mesh deformation. Not a modeling,
More informationPowerVR Series5. Architecture Guide for Developers
Public Imagination Technologies PowerVR Series5 Public. This publication contains proprietary information which is subject to change without notice and is supplied 'as is' without warranty of any kind.
More informationAcceleration Data Structures
CT4510: Computer Graphics Acceleration Data Structures BOCHANG MOON Ray Tracing Procedure for Ray Tracing: For each pixel Generate a primary ray (with depth 0) While (depth < d) { Find the closest intersection
More informationGeForce4. John Montrym Henry Moreton
GeForce4 John Montrym Henry Moreton 1 Architectural Drivers Programmability Parallelism Memory bandwidth 2 Recent History: GeForce 1&2 First integrated geometry engine & 4 pixels/clk Fixed-function transform,
More informationA Mobile Accelerator Architecture for Ray Tracing
A Mobile Accelerator Architecture for Ray Tracing Josef Spjut, Daniel Kopta, Erik Brunvand and Al Davis sjosef, dkopta, elb, ald utah.edu 1 State of the Art in Mobile Graphics 2 Mobile SoC 3 Mobile
More informationComparing Memory Systems for Chip Multiprocessors
Comparing Memory Systems for Chip Multiprocessors Jacob Leverich Hideho Arakida, Alex Solomatnikov, Amin Firoozshahian, Mark Horowitz, Christos Kozyrakis Computer Systems Laboratory Stanford University
More informationA SXGA 3D Display Processor with Reduced Rendering Data and Enhanced Precision. Seok-Hoon Kim MVLSI Lab., KAIST
A SXGA 3D Display Processor with Reduced Rendering Data and Enhanced Precision Seok-Hoon Kim MVLSI Lab., KAIST Contents Background Motivation 3D Graphics + 3D Display Previous Works Conventional 3D Image
More informationCSE 591: GPU Programming. Introduction. Entertainment Graphics: Virtual Realism for the Masses. Computer games need to have: Klaus Mueller
Entertainment Graphics: Virtual Realism for the Masses CSE 591: GPU Programming Introduction Computer games need to have: realistic appearance of characters and objects believable and creative shading,
More informationMulti Bounding Volume Hierarchies for Ray Tracing Pipelines
Tampere University of Technology Multi Bounding Volume Hierarchies for Ray Tracing Pipelines Citation Viitanen, T., Koskela, M., Jääskeläinen, P., & Takala, J. (2016). Multi Bounding Volume Hierarchies
More informationCSE 591/392: GPU Programming. Introduction. Klaus Mueller. Computer Science Department Stony Brook University
CSE 591/392: GPU Programming Introduction Klaus Mueller Computer Science Department Stony Brook University First: A Big Word of Thanks! to the millions of computer game enthusiasts worldwide Who demand
More informationComputer Graphics (CS 543) Lecture 13b Ray Tracing (Part 1) Prof Emmanuel Agu. Computer Science Dept. Worcester Polytechnic Institute (WPI)
Computer Graphics (CS 543) Lecture 13b Ray Tracing (Part 1) Prof Emmanuel Agu Computer Science Dept. Worcester Polytechnic Institute (WPI) Raytracing Global illumination-based rendering method Simulates
More informationGPU Ray Tracing at the Desktop and in the Cloud. Phillip Miller, NVIDIA Ludwig von Reiche, mental images
GPU Ray Tracing at the Desktop and in the Cloud Phillip Miller, NVIDIA Ludwig von Reiche, mental images Ray Tracing has always had an appeal Ray Tracing Prediction The future of interactive graphics is
More informationInteractive Stable Ray Tracing
Interactive Stable Ray Tracing Alessandro Dal Corso 1,2, Marco Salvi 1, Craig Kolb 1, Jeppe Revall Frisvad 2, Aaron Lefohn 1, David Luebke 1 1 NVIDIA, 2 Technical University of Denmark Comparison with
More informationINFOGR Computer Graphics. J. Bikker - April-July Lecture 11: Acceleration. Welcome!
INFOGR Computer Graphics J. Bikker - April-July 2015 - Lecture 11: Acceleration Welcome! Today s Agenda: High-speed Ray Tracing Acceleration Structures The Bounding Volume Hierarchy BVH Construction BVH
More informationRow Tracing with Hierarchical Occlusion Maps
Row Tracing with Hierarchical Occlusion Maps Ravi P. Kammaje, Benjamin Mora August 9, 2008 Page 2 Row Tracing with Hierarchical Occlusion Maps Outline August 9, 2008 Introduction Related Work Row Tracing
Shadows. COMP 575/770 Spring 2013
Shadows COMP 575/770 Spring 2013 Shadows in Ray Tracing Shadows are important for realism Basic idea: figure out whether a point on an object is illuminated by a light source Easy for ray tracers Just
INFOMAGR Advanced Graphics. Jacco Bikker - February - April Welcome!
INFOMAGR Advanced Graphics Jacco Bikker - February - April 2016 Welcome! $I(x, x') = g(x, x') \left[ \epsilon(x, x') + \int_S \rho(x, x', x'') \, I(x', x'') \, dx'' \right]$ Today's Agenda: Introduction Ray Distributions The Top-level BVH Real-time Ray Tracing
HOW LEADING-EDGE COMPUTING TECHNOLOGIES ARE HELPING REIMAGINE CITIES OF THE FUTURE. Andrew Rink, AEC Industry Marketing GTC China - November 22, 2018
HOW LEADING-EDGE COMPUTING TECHNOLOGIES ARE HELPING REIMAGINE CITIES OF THE FUTURE Andrew Rink, AEC Industry Marketing GTC China - November 22, 2018 COMPUTING TECHNOLOGY TRENDS IN AEC GPU-Accelerated Workflows
Performance Analysis and Culling Algorithms
Performance Analysis and Culling Algorithms Michael Doggett Department of Computer Science Lund University 2009 Tomas Akenine-Möller and Michael Doggett 1 Assignment 2 Sign up for Pluto labs on the web
NVIDIA Case Studies:
NVIDIA Case Studies: OptiX & Image Space Photon Mapping David Luebke NVIDIA Research Beyond Programmable Shading 0 How Far Beyond? The continuum Beyond Programmable Shading Just programmable shading: DX,
EE382N (20): Computer Architecture - Parallelism and Locality Spring 2015 Lecture 09 GPUs (II) Mattan Erez. The University of Texas at Austin
EE382N (20): Computer Architecture - Parallelism and Locality Spring 2015 Lecture 09 GPUs (II) Mattan Erez The University of Texas at Austin 1 Recap 2 Streaming model 1. Use many slimmed down cores to run in parallel
Introduction to Multicore architecture. Tao Zhang Oct. 21, 2010
Introduction to Multicore architecture Tao Zhang Oct. 21, 2010 Overview Part1: General multicore architecture Part2: GPU architecture Part1: General Multicore architecture Uniprocessor Performance (ECint)
Parallelizing Graphics Pipeline Execution (+ Basics of Characterizing a Rendering Workload)
Lecture 2: Parallelizing Graphics Pipeline Execution (+ Basics of Characterizing a Rendering Workload) Visual Computing Systems Today Finishing up from last time Brief discussion of graphics workload metrics
Intro to Ray-Tracing & Ray-Surface Acceleration
Lecture 12 & 13: Intro to Ray-Tracing & Ray-Surface Acceleration Computer Graphics and Imaging UC Berkeley Course Roadmap Rasterization Pipeline Core Concepts Sampling Antialiasing Transforms Geometric
Lecture 4 - Real-time Ray Tracing
INFOMAGR Advanced Graphics Jacco Bikker - November 2017 - February 2018 Lecture 4 - Real-time Ray Tracing Welcome! $I(x, x') = g(x, x') \left[ \epsilon(x, x') + \int_S \rho(x, x', x'') \, I(x', x'') \, dx'' \right]$ Today's Agenda: Introduction Ray Distributions
This Unit: Putting It All Together. CIS 371 Computer Organization and Design. Sources. What is Computer Architecture?
This Unit: Putting It All Together CIS 371 Computer Organization and Design Unit 15: Putting It All Together: Anatomy of the XBox 360 Game Console Application OS Compiler Firmware CPU I/O Memory Digital
FRUSTUM-TRACED RASTER SHADOWS: REVISITING IRREGULAR Z-BUFFERS
FRUSTUM-TRACED RASTER SHADOWS: REVISITING IRREGULAR Z-BUFFERS Chris Wyman, Rama Hoetzlein, Aaron Lefohn 2015 Symposium on Interactive 3D Graphics & Games CONTRIBUTIONS Full scene, fully dynamic alias-free
NVIDIA DESIGNWORKS Ankit Patel - Prerna Dogra -
NVIDIA DESIGNWORKS Ankit Patel - ankitp@nvidia.com Prerna Dogra - pdogra@nvidia.com 1 Autonomous Driving Deep Learning Visual Effects Virtual Desktops Visual Computing is our singular mission Gaming Product
Ray Casting of Trimmed NURBS Surfaces on the GPU
Ray Casting of Trimmed NURBS Surfaces on the GPU Hans-Friedrich Pabst Jan P. Springer André Schollmeyer Robert Lenhardt Christian Lessig Bernd Fröhlich Bauhaus University Weimar Faculty of Media Virtual
POWERVR MBX. Technology Overview
POWERVR MBX Technology Overview Copyright 2009, Imagination Technologies Ltd. All Rights Reserved. This publication contains proprietary information which is subject to change without notice and is supplied
PantaRay: Fast Ray-traced Occlusion Caching of Massive Scenes J. Pantaleoni, L. Fascione, M. Hill, T. Aila
PantaRay: Fast Ray-traced Occlusion Caching of Massive Scenes J. Pantaleoni, L. Fascione, M. Hill, T. Aila Agenda Introduction Motivation Basics PantaRay Accelerating structure generation Massively parallel
Interactive Ray Tracing: Higher Memory Coherence
Interactive Ray Tracing: Higher Memory Coherence http://gamma.cs.unc.edu/rt Dinesh Manocha (UNC Chapel Hill) Sung-Eui Yoon (Lawrence Livermore Labs) Interactive Ray Tracing Ray tracing is naturally sub-linear
INFOMAGR Advanced Graphics. Jacco Bikker - February - April Welcome!
INFOMAGR Advanced Graphics Jacco Bikker - February - April 2016 Welcome! $I(x, x') = g(x, x') \left[ \epsilon(x, x') + \int_S \rho(x, x', x'') \, I(x', x'') \, dx'' \right]$ Today's Agenda: Introduction : GPU Ray Tracing Practical Perspective Advanced Graphics
Real-Time Shadows. Last Time? Today. Why are Shadows Important? Shadows as a Depth Cue. For Intuition about Scene Lighting
Last Time? Real-Time Shadows Today Why are Shadows Important? Shadows & Soft Shadows in Ray Tracing Planar Shadows Projective Texture Shadows Shadow Maps Shadow Volumes Why are Shadows Important? Depth
For Intuition about Scene Lighting. Today. Limitations of Planar Shadows. Cast Shadows on Planar Surfaces. Shadow/View Duality.
Last Time Modeling Transformations Illumination (Shading) Real-Time Shadows Viewing Transformation (Perspective / Orthographic) Clipping Projection (to Screen Space) Graphics Pipeline Clipping Rasterization
SoC for Car Navigation Systems with a 53.3 GOPS Image Recognition Engine
Session 5D : Designer's Forum : State-of-the-art SoCs 5D-4 SoC for Car Navigation Systems with a 53.3 GOPS Image Recognition Engine Jan. 20. 2010 Hiroyuki Hamasaki*, Yasuhiko Hoshi*, Atsushi Nakamura*,
Windowing System on a 3D Pipeline. February 2005
Windowing System on a 3D Pipeline February 2005 Agenda 1.Overview of the 3D pipeline 2.NVIDIA software overview 3.Strengths and challenges with using the 3D pipeline GeForce 6800 220M Transistors April
Graphics Hardware, Graphics APIs, and Computation on GPUs. Mark Segal
Graphics Hardware, Graphics APIs, and Computation on GPUs Mark Segal Overview Graphics Pipeline Graphics Hardware Graphics APIs ATI's low-level interface for computation on GPUs 2 Graphics Hardware High
Structure. Woo-Chan Park, Kil-Whan Lee, Seung-Gi Lee, Moon-Hee Choi, Won-Jong Lee, Cheol-Ho Jeong, Byung-Uck Kim, Woo-Nam Jung,
A High Performance 3D Graphics Rasterizer with Effective Memory Structure Woo-Chan Park, Kil-Whan Lee, Seung-Gi Lee, Moon-Hee Choi, Won-Jong Lee, Cheol-Ho Jeong, Byung-Uck Kim, Woo-Nam Jung, Il-San Kim,
Building scalable 3D applications. Ville Miettinen Hybrid Graphics
Building scalable 3D applications Ville Miettinen Hybrid Graphics What's going to happen... (1/2) Mass market: 3D apps will become a huge success on low-end and mid-tier cell phones Retro-gaming New game
Graphics Architectures and OpenCL. Michael Doggett Department of Computer Science Lund university
Graphics Architectures and OpenCL Michael Doggett Department of Computer Science Lund university Overview Parallelism Radeon 5870 Tiled Graphics Architectures Important when Memory and Bandwidth limited
Ray tracing. Computer Graphics COMP 770 (236) Spring Instructor: Brandon Lloyd 3/19/07 1
Ray tracing Computer Graphics COMP 770 (236) Spring 2007 Instructor: Brandon Lloyd 3/19/07 1 From last time Hidden surface removal Painter's algorithm Clipping algorithms Area subdivision BSP trees Z-Buffer
Parallel Computing: Parallel Architectures Jin, Hai
Parallel Computing: Parallel Architectures Jin, Hai School of Computer Science and Technology Huazhong University of Science and Technology Peripherals Computer Central Processing Unit Main Memory Computer
Sung-Eui Yoon ( 윤성의 )
CS380: Computer Graphics Ray Tracing Sung-Eui Yoon ( 윤성의 ) Course URL: http://sglab.kaist.ac.kr/~sungeui/cg/ Class Objectives Understand overall algorithm of recursive ray tracing Ray generations Intersection
Interpolation using scanline algorithm
Interpolation using scanline algorithm Idea: Exploit knowledge about already computed color values. Traverse projected triangle top-down using scanline. Compute start and end color value of each pixel
Lecture 25: Board Notes: Threads and GPUs
Lecture 25: Board Notes: Threads and GPUs Announcements: - Reminder: HW 7 due today - Reminder: Submit project idea via (plain text) email by 11/24 Recap: - Slide 4: Lecture 23: Introduction to Parallel