Beyond Programmable Shading Course ACM SIGGRAPH 2010 Bending the Graphics Pipeline

1 Beyond Programmable Shading Course ACM SIGGRAPH 2010 Bending the Graphics Pipeline Johan Andersson DICE

2 Overview
- Give a taste of a few rendering techniques we are using & experimenting with, and how they interact, or would like to interact, with the graphics pipeline
  - Tile-based Deferred Shading
  - Morphological Antialiasing
  - Analytical Ambient Occlusion
Beyond Programmable Shading, SIGGRAPH

3 TILE-BASED DEFERRED SHADING

4 Tile-based deferred shading
- Tile-based culling & lighting
  - Cull lights per screen-space tile
  - Lighting kernel runs per tile
  - Minimizes bandwidth/setup cost
- DX11: GPU compute shader
  - Covered in the course last year [Andersson09]
- PS3: SPU jobs
  - GPU renders gbuffer
  - SPU does light culling & full lighting evaluation for each pixel
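
As a rough illustration of culling lights per screen-space tile, the sketch below bins lights into tiles with a 2D circle-versus-rectangle test. The function name, the (cx, cy, radius) light representation and the circle test are assumptions made for illustration. The kernel described in the talk runs as a compute shader or SPU job and is not reproduced here. The 32x16 default is the tile size quoted for the SPU version.

```python
def cull_lights_per_tile(lights, width, height, tile_w=32, tile_h=16):
    """Return {(tx, ty): [light indices]} for lights overlapping each tile.

    lights: list of (cx, cy, radius) in pixels, a hypothetical screen-space
    bounding circle per light, used here only for illustration.
    """
    tiles_x = (width + tile_w - 1) // tile_w
    tiles_y = (height + tile_h - 1) // tile_h
    bins = {(tx, ty): [] for ty in range(tiles_y) for tx in range(tiles_x)}
    for index, (cx, cy, radius) in enumerate(lights):
        for (tx, ty), tile_lights in bins.items():
            # Closest point of the tile rectangle to the circle centre.
            x0, y0 = tx * tile_w, ty * tile_h
            nx = min(max(cx, x0), x0 + tile_w)
            ny = min(max(cy, y0), y0 + tile_h)
            if (nx - cx) ** 2 + (ny - cy) ** 2 <= radius ** 2:
                tile_lights.append(index)
    return bins
```

A lighting kernel would then loop only over the indices stored for its own tile instead of over every light in the scene.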

5 Multiple deferred lighting models
- Standard Phong
- Metallic
- Skin
- Translucent

6 Working with tiles
- Tile culling optimizations
  - Cull lights & shadows with tile normal cone
  - Detect tile specular=0
  - Detect tile lighting model
- Tile lighting kernel permutations
  - Specular on/off
  - Lighting models
  - More in the future

7 SPU-based Deferred Shading
- Ported DX11 compute shader to SPU job
  - Offloads PS3 GPU
  - SPU processing in parallel with GPU rendering
  - 32x16 pixel tiles
- Explicit SoA vectorization instead of implicit
  - C/C++ on SPU, HLSL on GPU
  - Not a problem for such a relatively small kernel
  - But not an ideal data-parallel programming model

8 SPU vs GPU architecture
- 6 execution contexts vs 1+ million (each pixel)
- Explicit SIMD vs implicit SIMD
- C/C++ vs HLSL
- Explicit async DMA vs implicit latency hiding
- What can we learn?

9 Issues & challenges going forward
- More lighting models
  - SIMD & branching efficiency
- Transparent decal surfaces & volumes
  - Fixed function blending doesn't work well with deferred
- Higher-quality antialiasing

10 Flexible lighting models
- Want both more & more flexible models:
  - Custom gbuffer layout per material
  - Quality & performance tradeoffs
- Examples:
  - Hair / anisotropic materials
    - Requires more lighting model parameters in gbuffer
  - Foliage
    - Massive overdraw with alpha-tested simple shaders, few parameters
    - Write to as simple a gbuffer as possible to reduce ROP/bandwidth bottleneck
  - Skin
    - Sub-surface scattering approximation

11 The SIMD efficiency problem
- Lighting models through dynamic branches
- GPU shader model can be problematic:
  - Increased register pressure = overall slower shader
  - Requires good screen-space SIMD coherency for performance win
- Potential solutions:
  - Reshuffle pixels to improve coherency?
    - Within each tile, sort pixels by model, compute lighting & then scatter back
  - GRAMPS-style queuing? [Sugerman09]
    - Attractive & powerful high-level programming model
- Alpha-tested foliage has far from ideal coherency
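
The "sort pixels by model, compute lighting, scatter back" idea can be sketched as below. This is a minimal scalar illustration and not the talk's implementation. The function name and the `kernels` table are hypothetical stand-ins for per-model lighting code paths.

```python
def shade_tile_sorted(model_ids, inputs, kernels):
    """Shade one tile by grouping pixels per lighting model.

    model_ids: per-pixel lighting-model id
    inputs:    per-pixel shading input (any value)
    kernels:   {model id: function(input) -> output}, hypothetical stand-ins
               for the lighting-model code paths
    """
    # Gather: stable sort of pixel indices so equal models become contiguous.
    order = sorted(range(len(model_ids)), key=lambda i: model_ids[i])
    outputs = [None] * len(model_ids)
    for pixel in order:
        # Consecutive iterations run the same kernel; on SIMD hardware this
        # grouping is what would keep neighbouring lanes on one code path.
        result = kernels[model_ids[pixel]](inputs[pixel])
        # Scatter: write the result back to the pixel's original slot.
        outputs[pixel] = result
    return order, outputs
```

The sort and scatter are extra work per tile, so the approach only pays off when the coherency gain outweighs that cost.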

12 Decals & deferred shading
- Decals blend selectively against gbuffer
  - Include:
    - Diffuse albedo (gbuffer1.rgb)
    - Normal (gbuffer0.rgb)
  - Want to include (but can't in single pass):
    - Specular albedo (gbuffer1.a)
    - Specular smoothness (gbuffer0.a)
  - Exclude:
    - Material id (can't blend)
    - Object lighting (inherit from below surface)
- Fixed function blending doesn't work well
  - Pixel shader can't write out both alpha & blend factor!
  - Consoles don't have blend mode per MRT
  - Linear blend doesn't work for all components
- See "Destruction Masking in Frostbite 2 using Volume Distance Fields" [Kihl10] for more details about the decal use case

13 Need programmable blending
- Benefits:
  - Write out gbuffer alpha channels independently of blend factor
  - Treat channels & targets however you see fit
  - Non-linear blending & renormalizing blends
  - Can do overlapping dependent blending
    - Read current normal, add bumps relative to it, write out
- What approach?
  - LRB-style pixel shader framebuffer read/modify/write [Lalonde09]
    - Ideal general solution for developers
    - How to hide synchronization latency? Implicit / explicit?
  - Blend shader
    - Yet another stage in a fixed pipeline
    - No R/M/W, not ideal
  - More?
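
A renormalizing blend of the kind listed above ("read current normal, add bumps relative to it, write out") might look like the sketch below. The function and its parameters are hypothetical. It only shows why a fixed linear blend is not enough: the result has to be renormalized after mixing.

```python
import math

def blend_normal(dst_normal, decal_bump, weight):
    """Read-modify-write style blend: perturb the stored normal, renormalize.

    dst_normal: unit normal currently in the gbuffer (x, y, z)
    decal_bump: offset added relative to the stored normal (hypothetical input)
    weight:     blend factor in [0, 1], independent of any alpha written out
    """
    mixed = [n + weight * b for n, b in zip(dst_normal, decal_bump)]
    length = math.sqrt(sum(c * c for c in mixed))
    if length == 0.0:
        return tuple(dst_normal)  # degenerate blend: keep the stored normal
    return tuple(c / length for c in mixed)
```

With fixed-function blending the division by `length` cannot be expressed, which is one reason the slide asks for programmable blending.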

14 The deferred shading + MSAA problem
- Huge storage & bandwidth requirements with deferred
  - 1920 x 1080 x 5 x 4 x 4 = 165 MB
  - Doesn't scale! Adding 1 bit of precision = 2x more memory
- 4x MSAA is not enough
  - Esp. for thin geometry in the distance
  - Prohibitive performance and bandwidth in general with deferred shading
  - But don't miss Andrew Lauritzen's talk later in the course: "Deferred Rendering for Current and Future Rendering Pipelines"
- There are alternatives to MSAA...
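
The storage figure on this slide can be reproduced with a few lines of arithmetic. Reading the factors as 5 render targets, 4 bytes per sample and 4 MSAA samples is an assumption; the slide gives only the product.

```python
def gbuffer_bytes(width, height, targets, bytes_per_sample, msaa_samples):
    """Total storage for a multisampled gbuffer, in bytes."""
    return width * height * targets * bytes_per_sample * msaa_samples

total = gbuffer_bytes(1920, 1080, targets=5, bytes_per_sample=4, msaa_samples=4)
print(total)  # 165888000 bytes, i.e. roughly 165 MB in decimal megabytes
```

Doubling the sample count doubles the total, which is why the cost does not scale to higher MSAA levels.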

15 MLAA
- Morphological Antialiasing
  - Post-effect antialiasing
  - Introduced in [Reshetov09]
- Implementations:
  - Intel CPU reference implementation [Reshetov09]
  - Sony PS3 SPU implementation [Perthuis10]
  - GPU compute? [Biri10]

16 MLAA workings
[Figure from [Reshetov09]]

17 MLAA comparisons (PS3)
[Images: No AA, MLAA]

18 MLAA takeaways
- Awesome AA for still pictures
- Moving pictures good, but:
  - No sub-pixel information = edges snap to pixels
  - Doesn't solve aliasing on fine detail geometry
  - Overall still a very good benefit!
- Focus/exclude effect based on framebuffer alpha & thresholds
  - Unique requirements per game/app
  - Not good to use on some UI, mark in alpha (or apply before)
- Variable post-effect, trade perf vs quality!
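
To make the "thresholds" and "mark in alpha" points concrete, the sketch below shows only the first stage of an MLAA-style filter: flagging discontinuities between neighbouring pixels, with an exclusion mask. The function name, the luma input and the default threshold are assumptions. The later pattern classification and blending stages described in [Reshetov09] are omitted.

```python
def edge_mask(luma, exclude, threshold=0.1):
    """Mark pixels whose right or bottom neighbour differs by more than threshold.

    luma:    2D list of brightness values
    exclude: 2D list of booleans, True where the effect is masked out
             (for example UI pixels flagged through framebuffer alpha)
    """
    height, width = len(luma), len(luma[0])
    mask = [[False] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            if exclude[y][x]:
                continue  # masked pixels never start an edge
            right = x + 1 < width and abs(luma[y][x] - luma[y][x + 1]) > threshold
            below = y + 1 < height and abs(luma[y][x] - luma[y + 1][x]) > threshold
            mask[y][x] = right or below
    return mask
```

Raising the threshold processes fewer edges, which is one way to trade quality for performance.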

19 MLAA future (PC)
- GPU compute shader implementation
- Combine with MSAA & sub-pixel samples
  - Simple MSAA box filter downsampling is a big waste
  - Sort of similar to "A Directionally Adaptive Edge Anti-Aliasing Filter" [Yang09]
  - A must to reduce the edge snapping of pure MLAA
  - Not fully clear how it should work (sample distribution)

20 AMBIENT OCCLUSION

21 Current dynamic AO
- Horizon-based Ambient Occlusion
  - See [Bavoil08] for complete details
- Based on screen-space depth-buffer (SSAO)
  - Very high quality sampling
  - But only screen-space info is a big limitation
  - Creates false occlusion artifacts
- Render in half-res for improved performance
  - Bilateral upsampling + gaussian blur
  - Can also do dual-resolution to reduce artifacts
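
Bilateral upsampling of a half-resolution AO buffer can be sketched as follows. Each full-resolution pixel averages nearby half-resolution AO samples, weighted by how close their depth is to its own. The inverse-depth-difference weight, the 2x2 footprint and all names are assumptions for illustration, not the filter used in Frostbite.

```python
def bilateral_upsample(ao_half, depth_half, depth_full, eps=1e-3):
    """Upsample half-res AO to full res, weighting by depth similarity."""
    h_full, w_full = len(depth_full), len(depth_full[0])
    h_half, w_half = len(ao_half), len(ao_half[0])
    out = [[0.0] * w_full for _ in range(h_full)]
    for y in range(h_full):
        for x in range(w_full):
            hx, hy = x // 2, y // 2
            total = weight_sum = 0.0
            # 2x2 half-res neighbourhood, clamped at the borders.
            for sy in (hy, min(hy + 1, h_half - 1)):
                for sx in (hx, min(hx + 1, w_half - 1)):
                    w = 1.0 / (eps + abs(depth_full[y][x] - depth_half[sy][sx]))
                    total += w * ao_half[sy][sx]
                    weight_sum += w
            out[y][x] = total / weight_sum
    return out
```

Samples across a depth discontinuity get a very small weight, so occlusion does not bleed over object silhouettes when the buffer is enlarged.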

22 Horizon-based Ambient Occlusion
- False occlusion halo from thin geometry

23 HBAO limitations
- False halo occlusion artifacts around small geometry
  - Such as: fences & poles
  - Extra visible when moving the camera
- Very noisy sampling for detailed zbuffers
  - Common with alpha-tested foliage
  - Difficult sampling problem

24 Analytical Ambient Occlusion

25 HBAO vs AAO

26 Analytical Ambient Occlusion
- Using Ambient Occlusion Volumes [McGuire10]
- Experimental implementation in Frostbite 2
  - With some good help from Morgan McGuire and Louis Bavoil
- Geometry-based technique
  - Not screen-space! Say what?

27 AOV idea
1. Extrude prism for each triangle (GS)
   - Extrusion distance is where occlusion=0
2. Rasterize primitives in prism
   - With depth-test enabled, near depth clip disabled
   - Finds visible points inside volume
   - Need to handle case with camera inside volume
3. Accumulate analytical occlusion contribution for visible pixels (PS)
   - Uses pixel normal & depth values from gbuffer
   - Subtractive blend
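
Steps 1 and 3 can be sketched on the CPU as below. This is a simplified illustration under stated assumptions. The prism is built by offsetting the triangle along its face normal by the extrusion distance, whereas the published technique [McGuire10] may shape the volume differently. The analytical occlusion term itself is not reproduced, so `contribution` is treated as a given number. Both function names are hypothetical.

```python
def extrude_prism(v0, v1, v2, distance):
    """Return the six corners of a prism extruded along the triangle normal."""
    e1 = [b - a for a, b in zip(v0, v1)]
    e2 = [b - a for a, b in zip(v0, v2)]
    # Face normal from the cross product of the two edges.
    n = [e1[1] * e2[2] - e1[2] * e2[1],
         e1[2] * e2[0] - e1[0] * e2[2],
         e1[0] * e2[1] - e1[1] * e2[0]]
    length = sum(c * c for c in n) ** 0.5
    if length == 0.0:
        raise ValueError("degenerate triangle")
    offset = [distance * c / length for c in n]
    top = [tuple(p + o for p, o in zip(v, offset)) for v in (v0, v1, v2)]
    return [tuple(v0), tuple(v1), tuple(v2)] + top

def subtract_occlusion(visibility, contribution):
    """Subtractive blend: remove one volume's occlusion, clamped at zero."""
    return max(0.0, visibility - contribution)
```

Because every overlapping volume subtracts independently, several nearby triangles can remove more visibility than is physically plausible, which matches the overdarkening noted later in the talk.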

28 HBAO

29 HBAO AOV

30 AOV in practice
- Render geometry again in separate AO pass
  - Uses depth & normal buffer from deferred rendering
  - Half-res or lower with bilateral upsampling
  - Culling should consider extrusion distance
- Separate paths for dynamic & rigid objects
  - Can pre-compute rigid extruded AOV & reduce overdraw
- Doesn't work with alpha-tested surfaces
  - Simulate with per-surface or per-triangle approx. coverage factor

31 Overdarkening (extra occlusion)

32 Varying overdraw with AO distance
[Images at AO distances 0.1 m, 0.2 m, 0.5 m]

33 AOV pros & cons
- Pros:
  - Very high quality, close to raytracing ground truth
  - Noise free (when full res)
  - Perfectly stable with view changes
  - Supports arbitrary dynamic polygon soups
- Cons:
  - Requires massive fillrate
  - Geometry cost
  - Overdarkening, may require content tweaks

34 AOV future optimizations
- Reduce the massive overdraw
  - Cull / restrict prisms that only extend out to empty air?
  - Clamp screen-space prism size
    - Not correct, but practical tradeoff. HBAO does this
- More optimal prism geometry
  - GS is limited to triangle strip output
  - Precompute using quads for rigid objects
- Geometry LOD / mix with higher-order geometry representations
- Also see AO volume texture & analytical capsule techniques [Hill10]

35 AOV takeaways
- Major improvement in visual quality compared to SSAO
- Interesting use of geometry & rasterization pipelines
  - Builds on existing HW-, SW- & content pipelines
  - Quite simple brute force drop-in (but not as simple as SSAO)
- Siggraph interactive framerates today, but lots of potential:
  - Performance highly dependent on occlusion distance
  - Optimizations / less brute force?
  - Use for high-end / reference / precompute / beauty shots initially

36 Conclusions
- New graphics pipeline usages are opened up with improved HW performance
  - Often not efficient to do with pure compute
  - Continue to give us more performance & bandwidth!
- We need to continue to break down some fixed graphics pipeline barriers

37 Acknowledgments
- Morgan McGuire
- Louis Bavoil
- David Luebke
- Andrew Lauritzen
- Robert Kihl
- Christina Coffin
- SCEE

38 Questions? blog: For more DICE talks:

39 References
[Andersson09] Johan Andersson, "Parallel Graphics in Frostbite - Current & Future", Beyond Programmable Shading Course, Siggraph
[Lalonde09] Paul Lalonde, "Innovating in a Software Graphics Pipeline", Beyond Programmable Shading Course, Siggraph
[Reshetov09] Alexander Reshetov, "Morphological Antialiasing"
[Yang09] Jason C. Yang et al., "A Directionally Adaptive Edge Anti-Aliasing Filter", High Performance Graphics 2009
[McGuire10] Morgan McGuire, "Ambient Occlusion Volumes", High Performance Graphics 2010
[Biri10] Venceslas Biri et al., "Practical morphological antialiasing on the GPU", Siggraph 2010
[Bavoil08] Louis Bavoil & Miguel Sainz, "Image-Space Horizon-Based Ambient Occlusion", Siggraph 2008
[Hill10] Stephen Hill, "Rendering with Conviction", Game Developers Conference 2010
[Kihl10] Robert Kihl, "Destruction Masking in Frostbite 2 using Volume Distance Fields", Advances in Real-Time Rendering in 3D Graphics and Games, Siggraph 2010
[Sugerman09] Jeremy Sugerman et al., "GRAMPS: A Programming Model for Graphics Pipelines", ACM Transactions on Graphics, January 2009
[Perthuis10] Cedric Perthuis, "MLAA in God of War 3" (PS3 registered developers only)

GUERRILLA DEVELOP CONFERENCE JULY 07 BRIGHTON

GUERRILLA DEVELOP CONFERENCE JULY 07 BRIGHTON Deferred Rendering in Killzone 2 Michal Valient Senior Programmer, Guerrilla Talk Outline Forward & Deferred Rendering Overview G-Buffer Layout Shader Creation Deferred Rendering in Detail Rendering Passes

More information

Lecture 9: Deferred Shading. Visual Computing Systems CMU , Fall 2013

Lecture 9: Deferred Shading. Visual Computing Systems CMU , Fall 2013 Lecture 9: Deferred Shading Visual Computing Systems The course so far The real-time graphics pipeline abstraction Principle graphics abstractions Algorithms and modern high performance implementations

More information

5 MAJOR CHALLENGES IN REAL-TIME RENDERING

5 MAJOR CHALLENGES IN REAL-TIME RENDERING Beyond Programmable Shading Course ACM SIGGRAPH 2012 5 MAJOR CHALLENGES IN REAL-TIME RENDERING Johan Andersson, DICE Beyond Programmable Shading, SIGGRAPH 2012 1 Overview What are the major challenges

More information

AGGREGATE G-BUFFER ANTI-ALIASING

AGGREGATE G-BUFFER ANTI-ALIASING AGGREGATE G-BUFFER ANTI-ALIASING Cyril Crassin 1, Morgan McGuire 1,2, Kayvon Fatahalian 3, Aaron Lefohn 1 1 NVIDIA 2 Williams College 3 Carnegie Mellon University Motivation Pixel The Mummy [ Universal

More information

Dominic Filion, Senior Engineer Blizzard Entertainment. Rob McNaughton, Lead Technical Artist Blizzard Entertainment

Dominic Filion, Senior Engineer Blizzard Entertainment. Rob McNaughton, Lead Technical Artist Blizzard Entertainment Dominic Filion, Senior Engineer Blizzard Entertainment Rob McNaughton, Lead Technical Artist Blizzard Entertainment Screen-space techniques Deferred rendering Screen-space ambient occlusion Depth of Field

More information

Applications of Explicit Early-Z Z Culling. Jason Mitchell ATI Research

Applications of Explicit Early-Z Z Culling. Jason Mitchell ATI Research Applications of Explicit Early-Z Z Culling Jason Mitchell ATI Research Outline Architecture Hardware depth culling Applications Volume Ray Casting Skin Shading Fluid Flow Deferred Shading Early-Z In past

More information

Next-Generation Graphics on Larrabee. Tim Foley Intel Corp

Next-Generation Graphics on Larrabee. Tim Foley Intel Corp Next-Generation Graphics on Larrabee Tim Foley Intel Corp Motivation The killer app for GPGPU is graphics We ve seen Abstract models for parallel programming How those models map efficiently to Larrabee

More information

Real-Time Hair Rendering on the GPU NVIDIA

Real-Time Hair Rendering on the GPU NVIDIA Real-Time Hair Rendering on the GPU Sarah Tariq NVIDIA Motivation Academia and the movie industry have been simulating and rendering impressive and realistic hair for a long time We have demonstrated realistic

More information

A Trip Down The (2011) Rasterization Pipeline

A Trip Down The (2011) Rasterization Pipeline A Trip Down The (2011) Rasterization Pipeline Aaron Lefohn - Intel / University of Washington Mike Houston AMD / Stanford 1 This talk Overview of the real-time rendering pipeline available in ~2011 corresponding

More information

8/5/2012. Introduction. Transparency. Anti-Aliasing. Applications. Conclusions. Introduction

8/5/2012. Introduction. Transparency. Anti-Aliasing. Applications. Conclusions. Introduction Introduction Transparency effects and applications Anti-Aliasing impact in the final image Why combine Transparency with Anti-Aliasing? Marilena Maule João Comba Rafael Torchelsen Rui Bastos UFRGS UFRGS

More information

DEFERRED RENDERING STEFAN MÜLLER ARISONA, ETH ZURICH SMA/

DEFERRED RENDERING STEFAN MÜLLER ARISONA, ETH ZURICH SMA/ DEFERRED RENDERING STEFAN MÜLLER ARISONA, ETH ZURICH SMA/2013-11-04 DEFERRED RENDERING? CONTENTS 1. The traditional approach: Forward rendering 2. Deferred rendering (DR) overview 3. Example uses of DR:

More information

Advanced Ambient Occlusion Methods for Modern Games

Advanced Ambient Occlusion Methods for Modern Games Advanced Ambient Occlusion Methods for Modern Games Andrei Tatarinov, Senior Developer Technology Engineer Alexey Panteleev, Senior Developer Technology Engineer Outline What is AO and why is it SS? Is

More information

Tiled shading: light culling reaching the speed of light. Dmitry Zhdan Developer Technology Engineer, NVIDIA

Tiled shading: light culling reaching the speed of light. Dmitry Zhdan Developer Technology Engineer, NVIDIA Tiled shading: light culling reaching the speed of light Dmitry Zhdan Developer Technology Engineer, NVIDIA Agenda Über Goal Classic deferred vs tiled shading How to improve culling in tiled shading? New

More information

Deus Ex is in the Details

Deus Ex is in the Details Deus Ex is in the Details Augmenting the PC graphics of Deus Ex: Human Revolution using DirectX 11 technology Matthijs De Smedt Graphics Programmer, Nixxes Software Overview Introduction DirectX 11 implementation

More information

Real-Time Hair Simulation and Rendering on the GPU. Louis Bavoil

Real-Time Hair Simulation and Rendering on the GPU. Louis Bavoil Real-Time Hair Simulation and Rendering on the GPU Sarah Tariq Louis Bavoil Results 166 simulated strands 0.99 Million triangles Stationary: 64 fps Moving: 41 fps 8800GTX, 1920x1200, 8XMSAA Results 166

More information

High-Quality Surface Splatting on Today s GPUs

High-Quality Surface Splatting on Today s GPUs High-Quality Surface Splatting on Today s GPUs M. Botsch, A. Hornung, M. Zwicker, L. Kobbelt Presented by Julian Yu-Chung Chen CS594 GPU Programming 2006-03-30 Outline Point Based Rendering Surface splatting

More information

Efficient and Scalable Shading for Many Lights

Efficient and Scalable Shading for Many Lights Efficient and Scalable Shading for Many Lights 1. GPU Overview 2. Shading recap 3. Forward Shading 4. Deferred Shading 5. Tiled Deferred Shading 6. And more! First GPU Shaders Unified Shaders CUDA OpenCL

More information

The Vegetation of Horizon Zero Dawn. Gilbert Sanders Principal Artist, Guerrilla Games

The Vegetation of Horizon Zero Dawn. Gilbert Sanders Principal Artist, Guerrilla Games The Vegetation of Horizon Zero Dawn Gilbert Sanders Principal Artist, Guerrilla Games Welcome Topics Simulation Shading Creation Shadow Casting Summary Introduction Our Renderer Artist Node-Based Shader

More information

POWERVR MBX. Technology Overview

POWERVR MBX. Technology Overview POWERVR MBX Technology Overview Copyright 2009, Imagination Technologies Ltd. All Rights Reserved. This publication contains proprietary information which is subject to change without notice and is supplied

More information

CS354R: Computer Game Technology

CS354R: Computer Game Technology CS354R: Computer Game Technology Real-Time Global Illumination Fall 2018 Global Illumination Mirror s Edge (2008) 2 What is Global Illumination? Scene recreates feel of physically-based lighting models

More information

Object Space Lighting. Dan Baker Founder, Oxide Games

Object Space Lighting. Dan Baker Founder, Oxide Games Object Space Lighting Dan Baker Founder, Oxide Games Ashes of the Singularity Nitrous Engine/Oxide Games New studio founded from industry vets Firaxis, Zenimax, Stardock Ground up, custom engine Several

More information

HIGH-QUALITY RASTERIZATION CHRIS WYMAN SENIOR RESEARCH SCIENTIST, NVIDIA

HIGH-QUALITY RASTERIZATION CHRIS WYMAN SENIOR RESEARCH SCIENTIST, NVIDIA HIGH-QUALITY RASTERIZATION CHRIS WYMAN SENIOR RESEARCH SCIENTIST, NVIDIA NEW RASTER METHODS USING MAXWELL Accumulative Anti-Aliasing (ACAA) A simple improvement on forward MSAA using less memory and bandwidth

More information

PowerVR Hardware. Architecture Overview for Developers

PowerVR Hardware. Architecture Overview for Developers Public Imagination Technologies PowerVR Hardware Public. This publication contains proprietary information which is subject to change without notice and is supplied 'as is' without warranty of any kind.

More information

Real - Time Rendering. Pipeline optimization. Michal Červeňanský Juraj Starinský

Real - Time Rendering. Pipeline optimization. Michal Červeňanský Juraj Starinský Real - Time Rendering Pipeline optimization Michal Červeňanský Juraj Starinský Motivation Resolution 1600x1200, at 60 fps Hw power not enough Acceleration is still necessary 3.3.2010 2 Overview Application

More information

NVIDIA Case Studies:

NVIDIA Case Studies: NVIDIA Case Studies: OptiX & Image Space Photon Mapping David Luebke NVIDIA Research Beyond Programmable Shading 0 How Far Beyond? The continuum Beyond Programmable Shading Just programmable shading: DX,

More information

Optimizing DirectX Graphics. Richard Huddy European Developer Relations Manager

Optimizing DirectX Graphics. Richard Huddy European Developer Relations Manager Optimizing DirectX Graphics Richard Huddy European Developer Relations Manager Some early observations Bear in mind that graphics performance problems are both commoner and rarer than you d think The most

More information

Screen Space Ambient Occlusion. Daniel Kvarfordt & Benjamin Lillandt

Screen Space Ambient Occlusion. Daniel Kvarfordt & Benjamin Lillandt Screen Space Ambient Occlusion Daniel Kvarfordt & Benjamin Lillandt Ambient light Same from all directions. Lambertian shading doesn't show form well. Need shadows to see form. Global illumination can

More information

Hello, Iam Cedric Perthuis, I am from Sony Santa Monica and I am here with Tobias Berghoff

Hello, Iam Cedric Perthuis, I am from Sony Santa Monica and I am here with Tobias Berghoff 1 Hello, Iam Cedric Perthuis, I am from Sony Santa Monica and I am here with Tobias Berghoff from the SCE advance technology group to talk to you about MLAA on PS3. Tobias will first present the Playstation

More information

Michal Valient Lead Tech Guerrilla Games

Michal Valient Lead Tech Guerrilla Games Michal Valient Lead Tech Guerrilla Games Intro Guerrilla is based in Amsterdam and we re part of Sony since 2005 We re working on two titles Unannounced new IP Killzone: Shadow Fall The new Killzone is

More information

Screen Space Ambient Occlusion TSBK03: Advanced Game Programming

Screen Space Ambient Occlusion TSBK03: Advanced Game Programming Screen Space Ambient Occlusion TSBK03: Advanced Game Programming August Nam-Ki Ek, Oscar Johnson and Ramin Assadi March 5, 2015 This project report discusses our approach of implementing Screen Space Ambient

More information

The Rasterization Pipeline

The Rasterization Pipeline Lecture 5: The Rasterization Pipeline (and its implementation on GPUs) Computer Graphics CMU 15-462/15-662, Fall 2015 What you know how to do (at this point in the course) y y z x (w, h) z x Position objects

More information

Ultimate Graphics Performance for DirectX 10 Hardware

Ultimate Graphics Performance for DirectX 10 Hardware Ultimate Graphics Performance for DirectX 10 Hardware Nicolas Thibieroz European Developer Relations AMD Graphics Products Group nicolas.thibieroz@amd.com V1.01 Generic API Usage DX10 designed for performance

More information

Chapter 10 Computation Culling with Explicit Early-Z and Dynamic Flow Control

Chapter 10 Computation Culling with Explicit Early-Z and Dynamic Flow Control Chapter 10 Computation Culling with Explicit Early-Z and Dynamic Flow Control Pedro V. Sander ATI Research John R. Isidoro ATI Research Jason L. Mitchell ATI Research Introduction In last year s course,

More information

Optimizing and Profiling Unity Games for Mobile Platforms. Angelo Theodorou Senior Software Engineer, MPG Gamelab 2014, 25 th -27 th June

Optimizing and Profiling Unity Games for Mobile Platforms. Angelo Theodorou Senior Software Engineer, MPG Gamelab 2014, 25 th -27 th June Optimizing and Profiling Unity Games for Mobile Platforms Angelo Theodorou Senior Software Engineer, MPG Gamelab 2014, 25 th -27 th June 1 Agenda Introduction ARM and the presenter Preliminary knowledge

More information

MSAA- Based Coarse Shading

MSAA- Based Coarse Shading MSAA- Based Coarse Shading for Power- Efficient Rendering on High Pixel- Density Displays Pavlos Mavridis Georgios Papaioannou Department of Informatics, Athens University of Economics & Business Motivation

More information

Building scalable 3D applications. Ville Miettinen Hybrid Graphics

Building scalable 3D applications. Ville Miettinen Hybrid Graphics Building scalable 3D applications Ville Miettinen Hybrid Graphics What s going to happen... (1/2) Mass market: 3D apps will become a huge success on low-end and mid-tier cell phones Retro-gaming New game

More information

Enhancing Traditional Rasterization Graphics with Ray Tracing. October 2015

Enhancing Traditional Rasterization Graphics with Ray Tracing. October 2015 Enhancing Traditional Rasterization Graphics with Ray Tracing October 2015 James Rumble Developer Technology Engineer, PowerVR Graphics Overview Ray Tracing Fundamentals PowerVR Ray Tracing Pipeline Using

More information

Advanced Computer Graphics CS 563: Screen Space GI Techniques: Real Time

Advanced Computer Graphics CS 563: Screen Space GI Techniques: Real Time Advanced Computer Graphics CS 563: Screen Space GI Techniques: Real Time William DiSanto Computer Science Dept. Worcester Polytechnic Institute (WPI) Overview Deferred Shading Ambient Occlusion Screen

More information

Mattan Erez. The University of Texas at Austin

Mattan Erez. The University of Texas at Austin EE382V (17325): Principles in Computer Architecture Parallelism and Locality Fall 2007 Lecture 11 The Graphics Processing Unit Mattan Erez The University of Texas at Austin Outline What is a GPU? Why should

More information

GCN Performance Tweets AMD Developer Relations

GCN Performance Tweets AMD Developer Relations AMD Developer Relations Overview This document lists all GCN ( Graphics Core Next ) performance tweets that were released on Twitter during the first few months of 2013. Each performance tweet in this

More information

Rendering Grass with Instancing in DirectX* 10

Rendering Grass with Instancing in DirectX* 10 Rendering Grass with Instancing in DirectX* 10 By Anu Kalra Because of the geometric complexity, rendering realistic grass in real-time is difficult, especially on consumer graphics hardware. This article

More information

Beyond Programmable Shading Course, ACM SIGGRAPH 2011

Beyond Programmable Shading Course, ACM SIGGRAPH 2011 1/66 Road to Real-Time Order-Independent Transparency Marco Salvi 2/66 Talk Outline Motivation Compositing Equation Recursive Solvers Visibility Based Solvers State of the Art and Future Work Q&A 3/66

More information

RSX Best Practices. Mark Cerny, Cerny Games David Simpson, Naughty Dog Jon Olick, Naughty Dog

RSX Best Practices. Mark Cerny, Cerny Games David Simpson, Naughty Dog Jon Olick, Naughty Dog RSX Best Practices Mark Cerny, Cerny Games David Simpson, Naughty Dog Jon Olick, Naughty Dog RSX Best Practices About libgcm Using the SPUs with the RSX Brief overview of GCM Replay December 7 th, 2004

More information

A bit more Deferred - CryEngine 3. Triangle Game Conference 2009 Martin Mittring Lead Graphics Programmer

A bit more Deferred - CryEngine 3. Triangle Game Conference 2009 Martin Mittring Lead Graphics Programmer A bit more Deferred - CryEngine 3 Triangle Game Conference 2009 Martin Mittring Lead Graphics Programmer Crytek Main office: Germany Frankfurt More studios: Kiev, Budapest, Sofia, Nottingham, Seoul English

More information

Pipeline Operations. CS 4620 Lecture 10

Pipeline Operations. CS 4620 Lecture 10 Pipeline Operations CS 4620 Lecture 10 2008 Steve Marschner 1 Hidden surface elimination Goal is to figure out which color to make the pixels based on what s in front of what. Hidden surface elimination

More information

COMPUTING SCIENCE. Real-Time Ambient Occlusion on the Playstation3. Dominic Goulding, Richard Smith, Lee Clark, Gary Ushaw and Graham Morgan

COMPUTING SCIENCE. Real-Time Ambient Occlusion on the Playstation3. Dominic Goulding, Richard Smith, Lee Clark, Gary Ushaw and Graham Morgan COMPUTING SCIENCE Real-Time Ambient Occlusion on the Playstation3 Dominic Goulding, Richard Smith, Lee Clark, Gary Ushaw and Graham Morgan TECHNICAL REPORT SERIES No. CS-TR-1319 March 2012 TECHNICAL REPORT

More information

Real-Time Reyes: Programmable Pipelines and Research Challenges. Anjul Patney University of California, Davis

Real-Time Reyes: Programmable Pipelines and Research Challenges. Anjul Patney University of California, Davis Real-Time Reyes: Programmable Pipelines and Research Challenges Anjul Patney University of California, Davis Real-Time Reyes-Style Adaptive Surface Subdivision Anjul Patney and John D. Owens SIGGRAPH Asia

More information

Rendering Algorithms: Real-time indirect illumination. Spring 2010 Matthias Zwicker

Rendering Algorithms: Real-time indirect illumination. Spring 2010 Matthias Zwicker Rendering Algorithms: Real-time indirect illumination Spring 2010 Matthias Zwicker Today Real-time indirect illumination Ray tracing vs. Rasterization Screen space techniques Visibility & shadows Instant

More information

Graphics Processing Unit Architecture (GPU Arch)

Graphics Processing Unit Architecture (GPU Arch) Graphics Processing Unit Architecture (GPU Arch) With a focus on NVIDIA GeForce 6800 GPU 1 What is a GPU From Wikipedia : A specialized processor efficient at manipulating and displaying computer graphics

More information

Enhancing Traditional Rasterization Graphics with Ray Tracing. March 2015

Enhancing Traditional Rasterization Graphics with Ray Tracing. March 2015 Enhancing Traditional Rasterization Graphics with Ray Tracing March 2015 Introductions James Rumble Developer Technology Engineer Ray Tracing Support Justin DeCell Software Design Engineer Ray Tracing

More information

The Graphics Pipeline

The Graphics Pipeline The Graphics Pipeline Ray Tracing: Why Slow? Basic ray tracing: 1 ray/pixel Ray Tracing: Why Slow? Basic ray tracing: 1 ray/pixel But you really want shadows, reflections, global illumination, antialiasing

More information

Pipeline Operations. CS 4620 Lecture Steve Marschner. Cornell CS4620 Spring 2018 Lecture 11

Pipeline Operations. CS 4620 Lecture Steve Marschner. Cornell CS4620 Spring 2018 Lecture 11 Pipeline Operations CS 4620 Lecture 11 1 Pipeline you are here APPLICATION COMMAND STREAM 3D transformations; shading VERTEX PROCESSING TRANSFORMED GEOMETRY conversion of primitives to pixels RASTERIZATION

More information

Optimizing for DirectX Graphics. Richard Huddy European Developer Relations Manager
