High Quality Direct3D 10.0 & 10.1 Accelerated Techniques. Jon Story, AMD Holger Gruen, AMD
2 High Quality Direct3D 10.0 & 10.1 Accelerated Techniques. Jon Story, AMD; Holger Gruen, AMD
3 Agenda
» High Definition Ambient Occlusion
» High Quality Shadow Filtering
4 High Definition Ambient Occlusion (HDAO)
5 Conventional AO: 1
» Executed as a fullscreen post-processing pass
» Compute camera space position from screen XY and depth
» Offset to a group of 3D positions arranged in a sphere
» Transform to post-perspective space and test against depth
» Occlusion Factor ~= Failure Rate
[Figure: example failure rates, Occlusion = 1/2 and Occlusion = 3/4]
6 Conventional AO: 2
» Usually requires many depth samples to achieve acceptable results
» Maths overhead per sample is high
  - Transform to post-perspective space
  - Depth testing
  - Result attenuation
» Filtering pass almost essential to smooth out dithering and banding artifacts
7 The Aim of HDAO
» Deliver a believable AO look
» Achieve affordable performance on today's HW
» Avoid need for filtering pass(es)
  - Keep performance higher
  - No additional render targets required
» Easy to incorporate
  - No normals required
  - Account for normals with ease if available
8 How does HDAO Fit into the Rendering Pipeline?
» Render scene as usual
» Optionally render normals as part of an MRT setup
» Use depth (and normals) buffer as input(s) to AO shader
» Finally combine AO with original scene
9 How does HDAO Work?
» Still a fullscreen post-processing pass
» Sample the central pixel of interest, and record its camera Z value
» Sample a pair of pixels, mirrored through the central pixel, and record their camera Z values
» If both twins fall within a specified radius, and are closer to the camera than the central pixel, then we have detected a valley
» Detects valleys in camera Z space
» Also detects occlusion between objects that are close to each other
» Not _only_ creases
[Figure labels: Valley = false, Valley = true]
10 Valley Detection
» The simplest implementation would be a 3x3 pattern
» Start by sampling the central depth pixel, and converting to camera Z
» Perform a valley test for each twin pair of (mirrored) samples
» Valley test produces binary result
» Occlusion Factor = Valley Test
Credit to Holger Gruen for this idea
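The valley test on this slide can be sketched on the CPU. The following is a minimal Python reference, not the shader from the talk: `valley_test` and `occlusion_3x3` are hypothetical names, the accept/reject radii mirror the `g_fhdaoacceptradius` / `g_fhdaorejectradius` constants used later in the deck, and averaging over the four twin pairs is an assumption made for illustration.

```python
def valley_test(center_z, z_a, z_b, reject_radius, accept_radius):
    """Binary valley test for one mirrored twin pair of camera-Z samples.

    A sample counts when it is closer to the camera than the centre pixel by
    more than accept_radius and by less than reject_radius. The pair is a
    valley only when both twins count.
    """
    def counts(z):
        diff = center_z - z
        return accept_radius < diff < reject_radius

    return 1.0 if counts(z_a) and counts(z_b) else 0.0


def occlusion_3x3(camera_z, px, py, reject_radius, accept_radius):
    """Occlusion estimate for pixel (px, py) from a 3x3 pattern.

    camera_z is a 2D list indexed [y][x] holding camera-space Z values.
    The 8 neighbours form 4 twin pairs mirrored through the centre pixel.
    Averaging the 4 binary results is an illustrative choice.
    """
    pairs = [((-1, -1), (1, 1)), ((0, -1), (0, 1)),
             ((1, -1), (-1, 1)), ((-1, 0), (1, 0))]
    center = camera_z[py][px]
    hits = 0.0
    for (ax, ay), (bx, by) in pairs:
        hits += valley_test(center,
                            camera_z[py + ay][px + ax],
                            camera_z[py + by][px + bx],
                            reject_radius, accept_radius)
    return hits / len(pairs)
```

A flat depth neighbourhood yields no occlusion, while a centre pixel that sits behind all of its neighbours is fully occluded under these assumptions.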
11 How does Gather Work?
» Gather fetches 4 point sampled texels in a single instruction
» Similar to Fetch4 exposed on DX9 ATI GPUs
» Available on all Direct3D 10.1 & 11 hardware
» There are known issues in previous versions of the HLSL compiler
  - Must use March 2009 SDK or later
» Used on single channel formats (restriction gone for DX11)
  - Depth buffer
  - Shadow maps
    f4depth = DepthTex.Gather( SamPoint, f2coord );
[Diagram: texel footprint of the returned components. Top row: W at ( 0, 0 ), Z at ( 1, 0 ). Bottom row: X at ( 0, 1 ), Y at ( 1, 1 ).]
12 Direct3D 10.0 Version of Gather
» Be sure to get the integer offsets correct for the 4 samples
    // Direct3D 10.1
    f4ret = Tex.Gather( g_samplepoint, f2texcoord );
    // Direct3D 10.0
    f4ret.x = Tex.SampleLevel( g_samplepoint, f2texcoord, 0, int2( 0, 1 ) ).x;
    f4ret.y = Tex.SampleLevel( g_samplepoint, f2texcoord, 0, int2( 1, 1 ) ).x;
    f4ret.z = Tex.SampleLevel( g_samplepoint, f2texcoord, 0, int2( 1, 0 ) ).x;
    f4ret.w = Tex.SampleLevel( g_samplepoint, f2texcoord, 0, int2( 0, 0 ) ).x;
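The component order matters when emulating Gather with four point samples. Below is a small Python sketch of that emulation on a 2D list. It is a simplification: `point_sample` and `gather` are hypothetical helper names, coordinates are integer texel indices rather than normalized texture coordinates, and addressing is clamped.

```python
def point_sample(texture, x, y):
    """Point-sample a single-channel texture stored as texture[y][x], clamped."""
    height = len(texture)
    width = len(texture[0])
    cx = min(max(x, 0), width - 1)
    cy = min(max(y, 0), height - 1)
    return texture[cy][cx]


def gather(texture, x, y):
    """Emulate Gather with four point samples.

    Returns the tuple (x, y, z, w) using the offsets from the slide:
    x = (0, 1), y = (1, 1), z = (1, 0), w = (0, 0).
    """
    offsets = [(0, 1), (1, 1), (1, 0), (0, 0)]
    return tuple(point_sample(texture, x + ox, y + oy) for ox, oy in offsets)
```

For a 2x2 texture `[[0, 1], [2, 3]]`, `gather(texture, 0, 0)` returns the bottom row first and then the top row in reverse, which is the W Z / X Y layout described for Gather.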
13 Gather Pattern
» It would be ideal to accelerate depth sampling using Gather
» Each Gather should tessellate around center pixel
» Each Gather needs to have a mirrored twin
» Simply add additional rings of Gathers
» To perfectly mirror samples, only requires a swizzle: xyzw with zwxy
[Diagram: rings of W X Y Z Gather footprints tiled around the center pixel]
14 Natural Vectorization
    for( igather=0; igather<num_gathers; igather++ )
    {
        // Gather mirrored twin depth samples (and convert to camera Z)
        f4sampledz[0] = DepthTex.Gather( SamPoint, f2texcoord );
        f4sampledz[0] = -g_fqtimesznear / ( f4sampledz[0] - g_fq );
        f4sampledz[1] = DepthTex.Gather( SamPoint, f2mirrortexcoord );
        f4sampledz[1] = -g_fqtimesznear / ( f4sampledz[1] - g_fq );
        // Detect valleys
        // First twin
        f4diff = fcenterz.xxxx - f4sampledz[0];
        f4compare[0] = ( f4diff < g_fhdaorejectradius.xxxx ) ? ( 1.0f ) : ( 0.0f );
        f4compare[0] *= ( f4diff > g_fhdaoacceptradius.xxxx ) ? ( 1.0f ) : ( 0.0f );
        // Mirrored twin
        f4diff = fcenterz.xxxx - f4sampledz[1];
        f4compare[1] = ( f4diff < g_fhdaorejectradius.xxxx ) ? ( 1.0f ) : ( 0.0f );
        f4compare[1] *= ( f4diff > g_fhdaoacceptradius.xxxx ) ? ( 1.0f ) : ( 0.0f );
        // Accumulate occlusion factor
        f4occlusion.xyzw += ( g_f4ringweight[igather].xyzw * f4compare[0].xyzw * f4compare[1].zwxy );
    }
» We Gather 4 samples at once
» We efficiently convert 4 depth samples to camera space
» We perform the valley detection logic on 4 valleys at once
» Finally we weight and store the occlusion factor of 4 valleys at a time
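Two steps of the loop above are easy to check on the CPU: the depth to camera Z conversion and the mirrored `zwxy` swizzle. The Python sketch below is a reference under stated assumptions, not the shipped shader. It assumes `g_fq` is the usual Direct3D projection term Q = zFar / (zFar - zNear) and `g_fqtimesznear` is Q * zNear; the function names are hypothetical.

```python
def depth_to_camera_z(depth, z_near, z_far):
    """Invert a Direct3D-style perspective depth: z = -Q * zNear / (depth - Q)."""
    q = z_far / (z_far - z_near)  # assumed meaning of g_fq
    return -q * z_near / (depth - q)


def camera_z_to_depth(z, z_near, z_far):
    """Forward mapping used only to test the inverse: depth = Q * (z - zNear) / z."""
    q = z_far / (z_far - z_near)
    return q * (z - z_near) / z


def mirror_swizzle(v):
    """Reorder (x, y, z, w) to (z, w, x, y), pairing each sample with its mirrored twin."""
    x, y, z, w = v
    return (z, w, x, y)


def accumulate_ring(center_z, first, mirrored, ring_weight, reject_radius, accept_radius):
    """Evaluate 4 valley tests at once for one Gather and its mirrored twin.

    first and mirrored are 4-tuples of camera-Z values; ring_weight is a
    4-tuple of weights. Returns the 4 weighted occlusion contributions.
    """
    def compare(samples):
        return tuple(1.0 if accept_radius < center_z - s < reject_radius else 0.0
                     for s in samples)

    cmp_first = compare(first)
    cmp_mirror = mirror_swizzle(compare(mirrored))
    return tuple(w * a * b for w, a, b in zip(ring_weight, cmp_first, cmp_mirror))
```

Because the mirrored Gather covers the diametrically opposite footprint, component x of the first Gather lines up with component z of the twin, which is why a single swizzle is enough to pair the samples.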
15 [Screenshots: HDAO Off; HDAO On (depth only); HDAO Buffer (depth only)] 40 Gathers, no filtering needed. HDAO Code Sample
16 Bringing in Camera Space Normals
» HDAO easily accounts for normals
» Scale Z component of camera space normal by desired amount
» Add scaled normal to camera Z value
» Run valley detection code as before
    // Offset by scaled normal
    f4cameraz += ( f4normalz * g_fnormalscale );
17 [Screenshots: HDAO On and HDAO Buffer, depth only and depth & normals]
- 80 Gathers (could use a lot less)
- No filtering needed
18 Early Rejection Test
» Low density meshes can produce undesirable occlusion
  - Terrain
  - Sky dome
» For smooth normals, can compute angle between them
» Or compute direction vectors from full camera space positions
» Calculate the angle of the valley
» Reject if too shallow
» Dramatically increase performance
[Figure labels: Pass, Fail]
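One way to read the "angle of the valley" is as the angle at the central pixel between the direction vectors to the two mirrored samples. The Python sketch below illustrates that reading only. The function names and the `max_angle_rad` threshold are hypothetical, and the slide does not state which threshold or formulation the shader uses.

```python
import math


def valley_angle(center, sample_a, sample_b):
    """Angle in radians at center between the vectors to the two twin samples.

    All three arguments are camera-space positions given as (x, y, z) tuples.
    A flat surface gives an angle of pi; a sharper valley gives a smaller angle.
    """
    va = [sample_a[i] - center[i] for i in range(3)]
    vb = [sample_b[i] - center[i] for i in range(3)]
    dot = sum(a * b for a, b in zip(va, vb))
    len_a = math.sqrt(sum(a * a for a in va))
    len_b = math.sqrt(sum(b * b for b in vb))
    cos_angle = max(-1.0, min(1.0, dot / (len_a * len_b)))
    return math.acos(cos_angle)


def is_too_shallow(center, sample_a, sample_b, max_angle_rad):
    """Reject the valley when it opens wider than the chosen threshold."""
    return valley_angle(center, sample_a, sample_b) > max_angle_rad
```

With a threshold somewhat below pi, the gently curved surfaces of low density meshes would be rejected before they contribute occlusion, which matches the motivation given on the slide.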
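19 Performance
» HDAO (depth only): timings in ms for Direct3D 10.0 and Direct3D 10.1 [values lost in transcription]
» HDAO (depth and normals): timings in ms for Direct3D 10.0 and Direct3D 10.1 [values lost in transcription]
[Chart: FPS, HDAO 10.0 vs 10.1 (1280x1024). Series: HDAO Off, HDAO Depth, HDAO Depth + Normals. Bars: Direct3D 10.0, Direct3D 10.1]
Test system: Phenom 2.3GHz, HD4870X2, 2 GB RAM, Windows Vista 32 (SP1)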
20 Tom Clancy's HAWX. Publisher: Ubisoft. Developer: Ubisoft Romania. HDAO is only applied to the terrain and buildings (not the aeroplane)
21 Stormrise. Publisher: SEGA. Developer: The Creative Assembly Australia
22 BattleForge. Publisher: EA. Developer: EA Phenomic
23 Future Work
» Looking into a compute shader accelerated version
  - Solid sampling pattern lends itself well to Thread Local Storage
» Account for strong light sources
  - AO for many scenes is not low frequency
» Real valley tracing...
24 High Quality Shadow Filtering
25 How Direct3D 10.1 helps filtering for single channel textures
» Direct3D 10.0: NxN point samples if you need all data points, e.g. 4x4 = 16
» Direct3D 10.1: (N/2)x(N/2) Gather operations get all data, e.g. 2x2 = 4
[Diagram: w z / x y Gather footprints]
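The fetch counts on this slide can be checked with a small CPU model. The Python sketch below reads an N x N block of a single-channel texture with (N/2) x (N/2) emulated Gathers. `gather` and `fetch_block_with_gathers` are hypothetical names, and integer texel coordinates stand in for real texture coordinates.

```python
def gather(texture, x, y):
    """Emulated Gather at integer texel (x, y) of texture[y][x].

    Returns (x, y, z, w) = texels at offsets (0, 1), (1, 1), (1, 0), (0, 0).
    """
    return (texture[y + 1][x], texture[y + 1][x + 1],
            texture[y][x + 1], texture[y][x])


def fetch_block_with_gathers(texture, x0, y0, n):
    """Read an n x n block (n even) starting at (x0, y0).

    Returns (block, gather_count) where block[y][x] holds the fetched texels.
    """
    block = [[None] * n for _ in range(n)]
    gather_count = 0
    for gy in range(0, n, 2):
        for gx in range(0, n, 2):
            gx_, gy_, gz_, gw_ = gather(texture, x0 + gx, y0 + gy)
            block[gy + 1][gx] = gx_      # x component: offset (0, 1)
            block[gy + 1][gx + 1] = gy_  # y component: offset (1, 1)
            block[gy][gx + 1] = gz_      # z component: offset (1, 0)
            block[gy][gx] = gw_          # w component: offset (0, 0)
            gather_count += 1
    return block, gather_count
```

For a 4x4 block this issues 4 Gathers where point sampling would need 16 fetches, matching the 2x2 = 4 versus 4x4 = 16 figures on the slide.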
26 Why revisit conventional shadow filtering? - 1
» There are advanced techniques for smooth shadows
» The most prominent are
  - VSMs, layered VSMs, CSMs, ESMs, ACDF SMs
  - Can be combined with SATs for arbitrary smoothness
» But these methods bring other problems
  - The renderer gets more complex
  - May need to work around specific artifacts
» Use only if necessary
27 Why revisit conventional shadow filtering? - 2
» Advanced methods come at a cost
  - More RTs at a high memory cost
  - Costly postprocessing operations
  - Non optimal RT formats
» Is an advanced technique needed?
  - Depth buffer based deferred shadowing does not depend on depth complexity
  - Big conventional shadow filters not that expensive
28 Surprising insights about uniform shadow filtering 1
Let's filter a 4x4 visibility sample block
» Direct3D 10.1: 4 Gather operations plus some ALU => (N/2)x(N/2) Gather ops for NxN
» Direct3D 10.0: 9 PCF samples plus some ALU, right?
29 Surprising insights about uniform shadow filtering 1
Let's filter a 4x4 visibility sample block
» Direct3D 10.1: 4 Gather operations plus some ALU => (N/2)x(N/2) Gather ops for NxN
» Direct3D 10.0: 9 PCF samples plus some ALU, right? NO!
30 Surprising insights about uniform shadow filtering 1
Let's filter a 4x4 visibility sample block
» Direct3D 10.1: 4 Gather operations plus some ALU => (N/2)x(N/2) Gather() samples for NxN
» Direct3D 10.0: 4 shifted PCF samples plus a post weight factor is enough => (N/2)x(N/2) PCF samples for NxN
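31 Surprising insights about uniform shadow filtering 2
Let's look at only 1 row of 4 visibility samples $v_0 \dots v_3$, filtered by 3 PCF samples with bilinear fraction $x$:
$$\big((1-x)\,v_0 + x\,v_1\big) + \big((1-x)\,v_1 + x\,v_2\big) + \big((1-x)\,v_2 + x\,v_3\big)$$
Simplifies to
$$\big((1-x)\,v_0 + v_1\big) + \big(v_2 + x\,v_3\big)$$
Each bracket is one shifted PCF sample times a post weight.
Credit for this idea goes to: Sergey Nenakhov at Funcom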
32 Surprising insights about uniform shadow filtering 3
» Only (N/2)x(N/2) PCF samples necessary for a uniform filter, instead of (N-1)x(N-1)
  - Cheaper than commonly assumed
  - 8x8 with only 16 PCF samples
  - Not only for shadow filtering
» Same texture op count as Direct3D 10.1
» Why bother with Direct3D 10.1?
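The claim that 4 shifted PCF samples reproduce a uniform filter over a 4x4 visibility block can be verified numerically. The Python sketch below is a CPU model, assuming that a hardware PCF sample is a bilinear blend of four binary visibility results. The per-axis weights (2 - f) and (1 + f) and the shifted fractions 1/(2 - f) and f/(1 + f) follow from regrouping the row sum; all function names are hypothetical.

```python
def lerp(a, b, t):
    return a + (b - a) * t


def pcf(vis, x0, y0, fx, fy):
    """Model of one PCF sample: bilinear blend of the 2x2 visibility quad at (x0, y0)."""
    top = lerp(vis[y0][x0], vis[y0][x0 + 1], fx)
    bottom = lerp(vis[y0 + 1][x0], vis[y0 + 1][x0 + 1], fx)
    return lerp(top, bottom, fy)


def uniform_9_samples(vis, fx, fy):
    """Reference: average of the 3x3 PCF samples covering a 4x4 visibility block."""
    return sum(pcf(vis, x, y, fx, fy) for y in range(3) for x in range(3)) / 9.0


def uniform_4_shifted_samples(vis, fx, fy):
    """Same result from 2x2 PCF samples with shifted fractions and post weights."""
    def axis(f):
        # (start texel, post weight, shifted bilinear fraction) for the two pairs
        return [(0, 2.0 - f, 1.0 / (2.0 - f)), (2, 1.0 + f, f / (1.0 + f))]

    total = 0.0
    for y0, wy, ty in axis(fy):
        for x0, wx, tx in axis(fx):
            total += wx * wy * pcf(vis, x0, y0, tx, ty)
    return total / 9.0
```

The two functions agree for any sub-texel fraction in [0, 1], so the 9-sample version does 2.25 times the texture work for the same result under this model.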
33 From DICE's Frostbite Engine: Uniform shadow filtering
34 From DICE's Frostbite Engine: Gaussian shadow filtering
35 Disadvantages of uniform shadow filtering
» Uniform filtering blurs away too many details
» Gaussian filtering preserves more details
36 Advanced Direct3D 10.1 shadow filtering 1
» Use a unique weight per PCF sample
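38 Advanced Direct3D 10.1 shadow filtering 1
» Use a unique weight per PCF sample
$$\text{shadow} = \sum_{y}\sum_{x} w_{x,y}\,\mathrm{pcf}_{x,y}$$
with the sums running over the $(N-1)\times(N-1)$ PCF samples of the block.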
39 Advanced Direct3D 10.1 shadow filtering 2
» Direct3D 10.1 needs (N/2)x(N/2) Gather() samples
» A (N/2)x(N/2) PCF samples solution is no longer possible for unique weights
  - Filter weights are not symmetric
  - Equation system not solvable
» It is possible to get below NxN PCF ops for Direct3D 10.0 though
40 Advanced Direct3D 10.1 shadow filtering 3
Let's filter a 4x4 visibility sample block using unique weights
» Direct3D 10.1: 4 Gather() operations plus some ALU => (N/2)x(N/2) Gather samples for NxN
» Direct3D 10.0: 9 PCF samples plus some ALU, right?
41 Advanced Direct3D 10.1 shadow filtering 3
Let's filter a 4x4 visibility sample block using unique weights
» Direct3D 10.1: 4 Gather() samples plus some ALU => (N/2)x(N/2) Gather() samples for NxN
» Direct3D 10.0: 9 PCF samples plus some ALU, right? NO!
42 Advanced Direct3D 10.1 shadow filtering 3
Let's filter a 4x4 visibility sample block using unique weights
» Direct3D 10.1: 4 Gather() samples plus some ALU => (N/2)x(N/2) Gather() samples for NxN
» Direct3D 10.0: 6 shifted PCF samples plus post weight factors is enough => (N/2)x(N-1) PCF samples for NxN
43-45 Advanced Direct3D 10.1 shadow filtering 4
[Equation slides, three build steps: per-row terms labeled left, center, and right for one row of N weighted visibility samples. The formulas themselves are not recoverable from this transcription.]
46 Advanced Direct3D 10.1 shadow filtering 5
» Direct3D 10.0
  - needs (N/2)x(N-1) PCF samples for Gaussian shadows, not (N-1)x(N-1)!
  - can do one row with (N/2) samples with shifted x texture coords
  - y texture coord stays untouched
» Stats of an optimized shader for 8x8
  - Direct3D 10.1 shader roughly twice as fast as the Direct3D 10.0 version
  - Direct3D 10.1 shader as fast as the optimized uniform (N/2)x(N/2) filter under Direct3D 10.0
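The per-row reduction for unique weights can also be checked on the CPU. The Python sketch below is one possible derivation, not necessarily the one on the original equation slides: it collects the coefficient each visibility sample receives from the N-1 weighted PCF samples of a row, then regroups neighbouring samples into N/2 bilinear fetches with a shifted x fraction and a post weight. It assumes non-negative weights so the shifted fraction stays in [0, 1]; all names are hypothetical.

```python
def lerp(a, b, t):
    return a + (b - a) * t


def weighted_row_reference(v, w, fx):
    """N-1 PCF samples along one row of N visibility samples, one weight each."""
    return sum(w[i] * lerp(v[i], v[i + 1], fx) for i in range(len(w)))


def weighted_row_shifted(v, w, fx):
    """Same sum from N/2 bilinear fetches with shifted fractions and post weights."""
    n = len(v)  # assumed even, with len(w) == n - 1
    coeff = [0.0] * n
    for i, wi in enumerate(w):
        coeff[i] += wi * (1.0 - fx)   # share of PCF sample i going to v[i]
        coeff[i + 1] += wi * fx       # share of PCF sample i going to v[i + 1]
    total = 0.0
    for k in range(0, n, 2):
        post_weight = coeff[k] + coeff[k + 1]
        if post_weight > 0.0:
            shifted_fraction = coeff[k + 1] / post_weight
            total += post_weight * lerp(v[k], v[k + 1], shifted_fraction)
    return total


def shifted_sample_count(n):
    """PCF samples for an n x n block when every row uses n/2 shifted samples."""
    return (n // 2) * (n - 1)
```

Because the y coordinate is left untouched, this row trick is repeated for each of the N-1 PCF rows, giving 6 samples for a 4x4 block as stated in the deck.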
47 From DICE's Frostbite Engine: Standard 2x2 shadow filtering
48 From DICE's Frostbite Engine: 5x5 Gaussian filtering
49 Tom Clancy's HAWX. Publisher: Ubisoft. Developer: Ubisoft Romania. Normal Quality Blurred VSM
50 Tom Clancy's HAWX. Publisher: Ubisoft. Developer: Ubisoft Romania. Gaussian Shadows
51 Stormrise. Publisher: SEGA. Developer: The Creative Assembly Australia. Normal Shadow Quality
52 Stormrise. Publisher: SEGA. Developer: The Creative Assembly Australia. Gaussian Shadows
53 Summary: 1
» HDAO adds enormous depth to the scene, at an affordable cost
» Using Direct3D 10.1 gather4 instruction greatly accelerates performance
» Growing number of game developers using the effect
» Mail if you would like to know more...
54 Summary: 2
» Conventional high quality shadow filtering is surprisingly fast
  - Even under Direct3D 10.0/9
» Direct3D 10.1 delivers the best performance
  - No reason not to use Gaussian shadows!
» Direct3D 11 supports Gather()!
» Mail if you want the shaders or the derivations for (N/2)x(N/2) PCF sample shadows
55 Questions? Please fill in the feedback forms...
Radeon ProRender and Radeon Rays in a Gaming Rendering Workflow Takahiro Harada, AMD 2017/3 Agenda Introduction Radeon ProRender & Radeon Rays Radeon Rays Unity + Radeon Rays Integration to real time applications
More informationNVIDIA Parallel Nsight. Jeff Kiel
NVIDIA Parallel Nsight Jeff Kiel Agenda: NVIDIA Parallel Nsight Programmable GPU Development Presenting Parallel Nsight Demo Questions/Feedback Programmable GPU Development More programmability = more
More informationPhysically Based Shading in Unity. Aras Pranckevičius Rendering Dude
Physically Based Shading in Unity Aras Pranckevičius Rendering Dude Outline New built-in shaders in Unity 5 What, how and why And all related things Shaders in Unity 4.x A lot of good things are available
More informationMattan Erez. The University of Texas at Austin
EE382V: Principles in Computer Architecture Parallelism and Locality Fall 2008 Lecture 10 The Graphics Processing Unit Mattan Erez The University of Texas at Austin Outline What is a GPU? Why should we
More informationAcknowledgement: Images and many slides from presentations by Mark J. Kilgard and other Nvidia folks, from slides on developer.nvidia.
Shadows Acknowledgement: Images and many slides from presentations by Mark J. Kilgard and other Nvidia folks, from slides on developer.nvidia.com Practical & Robust Stenciled Shadow Volumes for Hardware-Accelerated
More informationReal-Time Reyes: Programmable Pipelines and Research Challenges. Anjul Patney University of California, Davis
Real-Time Reyes: Programmable Pipelines and Research Challenges Anjul Patney University of California, Davis Real-Time Reyes-Style Adaptive Surface Subdivision Anjul Patney and John D. Owens SIGGRAPH Asia
More informationOpenGL ES 2.0 : Start Developing Now. Dan Ginsburg Advanced Micro Devices, Inc.
OpenGL ES 2.0 : Start Developing Now Dan Ginsburg Advanced Micro Devices, Inc. Agenda OpenGL ES 2.0 Brief Overview Tools OpenGL ES 2.0 Emulator RenderMonkey w/ OES 2.0 Support OpenGL ES 2.0 3D Engine Case
More informationWarping, Morphing and Mosaics
Computational Photograph and Video: Warping, Morphing and Mosaics Prof. Marc Pollefes Dr. Gabriel Brostow Toda s schedule Last week s recap Warping Morphing Mosaics Toda s schedule Last week s recap Warping
More informationMany rendering scenarios, such as battle scenes or urban environments, require rendering of large numbers of autonomous characters.
1 2 Many rendering scenarios, such as battle scenes or urban environments, require rendering of large numbers of autonomous characters. Crowd rendering in large environments presents a number of challenges,
More informationPowerVR Performance Recommendations. The Golden Rules
PowerVR Performance Recommendations Copyright Imagination Technologies Limited. All Rights Reserved. This publication contains proprietary information which is subject to change without notice and is supplied
More informationA bit more Deferred - CryEngine 3. Triangle Game Conference 2009 Martin Mittring Lead Graphics Programmer
A bit more Deferred - CryEngine 3 Triangle Game Conference 2009 Martin Mittring Lead Graphics Programmer Crytek Main office: Germany Frankfurt More studios: Kiev, Budapest, Sofia, Nottingham, Seoul English
More informationVAO++: Practical Volumetric Ambient Occlusion for Games
VAO++: Practical Volumetric Ambient Occlusion for Games Jakub Bokšanský, Adam Pospíšil (Project Wilberforce) Jiří Bittner (CTU in Prague) EGSR 19.6.2017 Motivation Focus on performance (highly optimized
More informationRendering. Converting a 3D scene to a 2D image. Camera. Light. Rendering. View Plane
Rendering Pipeline Rendering Converting a 3D scene to a 2D image Rendering Light Camera 3D Model View Plane Rendering Converting a 3D scene to a 2D image Basic rendering tasks: Modeling: creating the world
More informationAnnouncements. Introduction to Cameras. The Key to Axis Angle Rotation. Axis-Angle Form (review) Axis Angle (4 steps) Mechanics of Axis Angle
Ross Beerige Bruce Draper Introuction to Cameras September th 25 Announcements PA ue eek from Tuesa Q: hat i I mean b robust I/O? Hanle arious numbers of erte/face features Check for count matches Goo
More informationOUTPUT PRIMITIVES. CEng 477 Introduction to Computer Graphics METU, 2007
OUTPUT PRIMITIVES CEng 477 Introduction to Computer Graphics METU, 007 Recap: The basic forward projection pipeline: MCS Model Model Modeling Transformations M M 3D World Scene Viewing Transformations
More informationGlobal Illumination CS334. Daniel G. Aliaga Department of Computer Science Purdue University
Global Illumination CS334 Daniel G. Aliaga Department of Computer Science Purdue University Recall: Lighting and Shading Light sources Point light Models an omnidirectional light source (e.g., a bulb)
More informationProgramming Graphics Hardware
Tutorial 5 Programming Graphics Hardware Randy Fernando, Mark Harris, Matthias Wloka, Cyril Zeller Overview of the Tutorial: Morning 8:30 9:30 10:15 10:45 Introduction to the Hardware Graphics Pipeline
More informationOptimal Shaders Using High-Level Languages
Optimal Shaders Using High-Level Languages The Good, The Bad, The Ugly All high level languages provide significant power and flexibility that: Make writing shaders easy Make writing slow shaders easy
More informationSpring 2010 Prof. Hyesoon Kim. AMD presentations from Richard Huddy and Michael Doggett
Spring 2010 Prof. Hyesoon Kim AMD presentations from Richard Huddy and Michael Doggett Radeon 2900 2600 2400 Stream Processors 320 120 40 SIMDs 4 3 2 Pipelines 16 8 4 Texture Units 16 8 4 Render Backens
More informationDeferred rendering using Compute shaders
Deferred rendering using Compute shaders A comparative study using shader model 4.0 and 5.0 Benjamin Golba 1 P a g e This thesis is submitted to the Department of Interaction and System Design at Blekinge
More informationAdvanced Shading and Texturing
Real-Time Graphics Architecture Kurt Akeley Pat Hanrahan http://www.graphics.stanford.edu/courses/cs448a-01-fall Advanced Shading and Texturing 1 Topics Features Bump mapping Environment mapping Shadow
More informationBringing AAA graphics to mobile platforms. Niklas Smedberg Senior Engine Programmer, Epic Games
Bringing AAA graphics to mobile platforms Niklas Smedberg Senior Engine Programmer, Epic Games Who Am I A.k.a. Smedis Platform team at Epic Games Unreal Engine 15 years in the industry 30 years of programming
More informationCOMPUTING SCIENCE. Real-Time Ambient Occlusion on the Playstation3. Dominic Goulding, Richard Smith, Lee Clark, Gary Ushaw and Graham Morgan
COMPUTING SCIENCE Real-Time Ambient Occlusion on the Playstation3 Dominic Goulding, Richard Smith, Lee Clark, Gary Ushaw and Graham Morgan TECHNICAL REPORT SERIES No. CS-TR-1319 March 2012 TECHNICAL REPORT
More information