AMD RADEON GFX & DIRECTX 12 GRAPHICS CORE NEXT BETTER PREPARED FOR DIRECTX 12 ROBERT HALLOCK AMD TECHNICAL MARKETING APPROVED FOR ALL AUDIENCES
|
|
- Reynard Poole
- 6 years ago
- Views:
Transcription
1 GRAPHICS CORE NEXT BETTER PREPARED FOR DIRECTX 12 APPROVED FOR ALL AUDIENCES AMD RADEON GFX & DIRECTX 12 ROBERT HALLOCK AMD TECHNICAL MARKETING
2 GAME DEVS & ASYNC SHADERS DAN BAKER, PARTNER, OXIDE GAMES I ve heard of developers getting +30% GPU performance by using Async [Shading]. [ ]it could end up being pretty disruptive in a year or so as these GCN built and optimized engines start coming to the PC. 2 DIRECTX 12 & AMD RADEON GRAPHICS AUGUST, 2015 APPROVED FOR ALL AUDIENCES
3 ASYNC SHADERS WHAT ARE THEY? HIGHER FPS Breaks complex graphics workloads into many parallel tasks Idle GPU resources do work instead of waiting their turn Parallel workloads = jobs finish faster = more performance Increasing granularity of the multi threading improves performance GREAT FOR VR More parallelism = lower latency = more responsive VR IMAGE QUALITY GPU PERFORMANCE Opens up GPU resources for free graphical effects that don t compromise performance 3 DIRECTX 12 & AMD RADEON GRAPHICS AUGUST, 2015 APPROVED FOR ALL AUDIENCES
4 THE DIRECTX 11 GRAPHICS BOTTLENECK CREATE SHADOWS UPLOAD DATA FROM PROCESSOR TIME COMPRESS TEXTURES GENERATE REFLECTIONS ( ENVIRONMENT MAPPING ) BUFFER LIGHT/MATERIAL INTERACTIONS UPLOAD 2 nd LIGHTING PASS PHYSICS SIM POST PROCESS COMPUTE, & MEMORY USE DIFFERENT GPU RESOURCES SO WHY MUST THESE JOBS WAIT TO RUN IN ORDER? 4 DIRECTX 12 & AMD RADEON GRAPHICS AUGUST, 2015 APPROVED FOR ALL AUDIENCES * Chart for illustrative purposes only.
5 ASYNCHRONOUS SHADING IN DIRECTX 12 CREATE SHADOWS BUFFER LIGHT/MATERIAL INTERACTIONS GENERATE REFLECTIONS ( ENVIRONMENT MAPPING ) SECOND LIGHTING PASS POST PROCESS UPLOAD DATA FROM PROCESSOR UPLOAD DATA FROM PROCESSOR RENDER TIME SAVED LATENCY FPS COMPRESS TEXTURES COMPRESS TEXTURES PHYSICS SIMULATION TIME THEY RE PARALLEL IN DIRECTX 12 HIGHER FPS RESPONSIVE VR LOW INPUT LAG 5 DIRECTX 12 & AMD RADEON GRAPHICS AUGUST, 2015 APPROVED FOR ALL AUDIENCES * Chart for illustrative purposes only.
6 GAME DEVS ON ASYNC SHADERS With async shaders, we can fill parts of the GPU that [would] otherwise be forced to sit idle. It's one of those features we wish we had on every GPU. DAN BAKER, PARTNER, OXIDE GAMES [Async shading] can be used to add more game features without affecting performance or to increase efficiency by performing existing work in parallel with rendering. JOHN KLOETZLI, PRINCIPAL GFX DEV, FIRAXIS GAMES On The Tomorrow Children, the flexibility offered by moving the bulk of our rendering work to compute was just too great not to take advantage of. [ ] [Async shading] really made a lot of things possible that would have been incredibly difficult otherwise. JAYMIN KESSLER, SENIOR PROGRAMMER, Q GAMES 6 DIRECTX 12 & AMD RADEON GRAPHICS AUGUST, 2015 APPROVED FOR ALL AUDIENCES
7 PRESS ON ASYNC SHADING It turns out that NVIDIA's Maxwell architecture has an Achilles' heel after all, which tilts the scales in favor of competing AMD Graphics CoreNext architecture better prepared for DirectX 12. TechPowerUP Oxide s developer also revealed that NVIDIA s Maxwell does not support natively Async Compute, and that NVIDIA asked Oxide to disable it for its graphics cards. DSOGaming [ ]AMDcardshavebeenbuilttotake advantage of DirectX 12 s features, mainly thanks to Async Compute which should become more prevalent in the next couple of years. WCCFtech 7 DIRECTX 12 & AMD RADEON GRAPHICS AUGUST, 2015 APPROVED FOR ALL AUDIENCES
8 WHAT ABOUT NVIDIA MAXWELL? Curiously, [NVIDIA s] driver reported [async shading] was functional but attempting to use it was an unmitigated disaster. Ashes of the Singularity Development Team AFAIK, Maxwell doesn t support Async Compute, at least not natively. We disabled it at the request of Nvidia, as it was much slower to try to use it then to not. 8 DIRECTX 12 & AMD RADEON GRAPHICS AUGUST, 2015 APPROVED FOR ALL AUDIENCES games dev replies ashes singularity controversy/
9 ASYNCHRONOUS COMPUTE ENGINES GRAPHICS CORE NEXT: HARDWARE FOR ASYNC SHADING These are Asynchronous Compute Engines Every GCN based graphics chip has several ACEs are hardware accelerators for asynchronous shading in DX12 PC games Only GCN has support for asynchronous shading AMD Radeon R9 390X Graphics Card 9 DIRECTX 12 & AMD RADEON GRAPHICS AUGUST, 2015 APPROVED FOR ALL AUDIENCES
10 DIRECTX 12 GAMES & HARDWARE GAMES AND HARDWARE 10 DIRECTX 12 & AMD RADEON GRAPHICS AUGUST, 2015 APPROVED FOR ALL AUDIENCES
11 OPTIMIZED FOR GRAPHICS CORE NEXT Built with the Dawn Engine Supports DirectX 12 Features TressFX Hair 3.0 More features TBA Watch the trailer Visit the website 11 DIRECTX 12 & AMD RADEON GRAPHICS AUGUST, 2015 APPROVED FOR ALL AUDIENCES
12 OPTIMIZED FOR GRAPHICS CORE NEXT Built with the Nitrous Engine DirectX 12 Capabilities Multi threaded Command Buffers Increases multi core CPU utilization Async Shaders Increases GPU utilization Explicit Frame Management Reduces frame latency Custom Memory Manager Eliminates microstutter See the gameplay Visit the website Read the blog 12 DIRECTX 12 & AMD RADEON GRAPHICS AUGUST, 2015 APPROVED FOR ALL AUDIENCES
13 ASHES OF THE SINGULARITY ULTRAHD 4K, MEDIUM PRESET, DIRECTX 12 AMD Radeon R9 Fury X GTX 980 Ti AMD Radeon R9 390X GTX 980 AMD Radeon R9 390 GTX Average FPS 13 DIRECTX 12 & AMD RADEON GRAPHICS AUGUST, 2015 APPROVED FOR ALL AUDIENCES SYSTEM CONFIG: Intel Core i7 5960X Gigabyte X99 UD4 16GBG DDR Windows 10 x64 AMD Catalyst Beta 3 NVIDIA WHQL
14 ASHES OF THE SINGULARITY 1440p, MEDIUM & HIGH PRESETS, DIRECTX 12 AMD Radeon R9 Fury X GTX 980 Ti AMD Radeon R9 390X GTX 980 AMD Radeon R9 390 GTX Average FPS HIGH MEDIUM 14 DIRECTX 12 & AMD RADEON GRAPHICS AUGUST, 2015 APPROVED FOR ALL AUDIENCES SYSTEM CONFIG: Intel Core i7 5960X Gigabyte X99 UD4 16GBG DDR Windows 10 x64 AMD Catalyst Beta 3 NVIDIA WHQL
15 API Overhead Feature Test Supports DirectX 12, Mantle & DirectX 11 Evaluates a GPU s geometric throughput ( scene detail ) at >30 FPS Multi threaded command buffers See the trailer Visit the website Read the technical guide 15 DIRECTX 12 & AMD RADEON GRAPHICS AUGUST, 2015 APPROVED FOR ALL AUDIENCES
16 3DMARK API OVERHEAD FEATURE TEST FULL DGPU STACK PERFORMANCE GEOMETRIC SCENE DETAIL AT 1080P AMD Radeon R9 Fury X GTX 980 Ti AMD Radeon R9 Fury (Asus) AMD Radeon R9 390X GTX 980 AMD Radeon R9 390 GTX 970 AMD Radeon R9 380 GTX 960 AMD Radeon R GB GTX 750 Ti AMD Radeon R7 360 GTX Mil. 4.0 Mil. 7.0 Mil Mil Mil Mil Mil. >30 FPS 16 DIRECTX 12 & AMD RADEON GRAPHICS AUGUST, 2015 APPROVED FOR ALL AUDIENCES Core i7 5960X, Gigabyte X99 UD4, 16GB DDR4 2666, Windows 10 x64 (Build 10240), AMD Catalyst driver Beta 19, ForceWare DirectX 12 multi threaded. 1080p resolution.
17 ADDITIONAL CONTENT Material Headline Source Video AMD Simplified: Async Shading YouTube Article Article AMD s DirectX 12 Advantage Explained GCN Architecture More Friendly to Parallelism Than Maxwell Lack of Async Compute on Maxwell Makes AMD GCN Better Prepared for DX12 Community NV GPUs do not support DX12 Asynchronous Compute/Shaders Reddit Community Maxwell Can t do VR well? An issue of latency. Reddit Article Oxide Games Claims NVIDIA GPUs Do Not Support DirectX 12 Asynchronous Compute DSOGaming TechPowerUP Game Debate Community GTX 980, GTX 970, GTX 980 Ti may not be fully DX 12 supported Steam Community Article AMD Dives Deep on Asynchronous Shading AnandTech Article DirectX12, LiquidVR may breathe fresh life into AMD GPUs thanks to asynchronous shading ExtremeTech Article Oxide Games Dev Replies on Ashes of the Singularity Controversy WCCFTech 17 DIRECTX 12 & AMD RADEON GRAPHICS AUGUST, 2015 APPROVED FOR ALL AUDIENCES
ASYNCHRONOUS SHADERS WHITE PAPER 0
ASYNCHRONOUS SHADERS WHITE PAPER 0 INTRODUCTION GPU technology is constantly evolving to deliver more performance with lower cost and lower power consumption. Transistor scaling and Moore s Law have helped
More informationRadeon Software. Terry Makedon. Director of Software Strategy and UX
Cautionary statement This presentation contains forward-looking statements concerning Advanced Micro Devices, Inc. ( AMD ), which are made pursuant to the Safe Harbor provisions of the Private Securities
More informationForza Horizon 4 Benchmark Guide
Forza Horizon 4 Benchmark Guide Copyright 2018 Playground Games Limited. The Playground Games name and logo, the Forza Horizon 4 name and logo and the Forza Horizon 4 insignia are trademarks of Playground
More informationAMD Graphics Team Last Updated April 29, 2013 APPROVED FOR PUBLIC DISTRIBUTION. 1 3DMark Overview April 2013 Approved for public distribution
AMD Graphics Team Last Updated April 29, 2013 APPROVED FOR PUBLIC DISTRIBUTION 1 3DMark Overview April 2013 Approved for public distribution 2 3DMark Overview April 2013 Approved for public distribution
More informationLIQUIDVR TODAY AND TOMORROW GUENNADI RIGUER, SOFTWARE ARCHITECT
LIQUIDVR TODAY AND TOMORROW GUENNADI RIGUER, SOFTWARE ARCHITECT Bootstrapping the industry for better VR experience Complimentary to HMD SDKs It s all about giving developers the tools they want! AMD LIQUIDVR
More informationEECS 487: Interactive Computer Graphics
EECS 487: Interactive Computer Graphics Lecture 21: Overview of Low-level Graphics API Metal, Direct3D 12, Vulkan Console Games Why do games look and perform so much better on consoles than on PCs with
More informationAMD Graphics Team Last Updated February 11, 2013 APPROVED FOR PUBLIC DISTRIBUTION. 1 3DMark Overview February 2013 Approved for public distribution
AMD Graphics Team Last Updated February 11, 2013 APPROVED FOR PUBLIC DISTRIBUTION 1 3DMark Overview February 2013 Approved for public distribution 2 3DMark Overview February 2013 Approved for public distribution
More informationTable of Contents 2-4
Setting Up TS 2018 with a single nvidia card, using nvidia Control Panel (NVCP) PLUS (optional) nvidia Inspector (NVI). Single Standard and GSync Monitor settings. Setting up DSR in TS 2018 This is a guide
More informationShrinath Shanbhag Senior Software Engineer Microsoft Corporation
Accelerating GPU inferencing with DirectML and DirectX 12 Shrinath Shanbhag Senior Software Engineer Microsoft Corporation Machine Learning Machine learning has become immensely popular over the last decade
More informationPC BUILDING PRESENTED BY
PC BUILDING PRESENTED BY WHAT IS A PC General purpose Personal Computer for individual usage Macintosh 1984 WHAT IS A PC General purpose Personal Computer for individual usage IBM Personal Computer XT
More informationGPU ARCHITECTURE Chris Schultz, June 2017
GPU ARCHITECTURE Chris Schultz, June 2017 MISC All of the opinions expressed in this presentation are my own and do not reflect any held by NVIDIA 2 OUTLINE CPU versus GPU Why are they different? CUDA
More informationLow-Overhead Rendering with Direct3D. Evan Hart Principal Engineer - NVIDIA
Low-Overhead Rendering with Direct3D Evan Hart Principal Engineer - NVIDIA Ground Rules No DX9 Need to move fast Big topic in 30 minutes Assuming experienced audience Everything is a tradeoff These are
More informationInstall Error Code 43 Nvidia Geforce 8500 Gt >>>CLICK HERE<<<
Install Error Code 43 Nvidia Geforce 8500 Gt Showed a code 43 error. site and downloaded the latest version of NVidia GeForce 8500 and restarted. A GFX TEAM member is running with a 8500GT? hi, i would
More informationCSE 591: GPU Programming. Introduction. Entertainment Graphics: Virtual Realism for the Masses. Computer games need to have: Klaus Mueller
Entertainment Graphics: Virtual Realism for the Masses CSE 591: GPU Programming Introduction Computer games need to have: realistic appearance of characters and objects believable and creative shading,
More informationPacketShader: A GPU-Accelerated Software Router
PacketShader: A GPU-Accelerated Software Router Sangjin Han In collaboration with: Keon Jang, KyoungSoo Park, Sue Moon Advanced Networking Lab, CS, KAIST Networked and Distributed Computing Systems Lab,
More informationA Case Study in Optimizing GNU Radio s ATSC Flowgraph
A Case Study in Optimizing GNU Radio s ATSC Flowgraph Presented by Greg Scallon and Kirby Cartwright GNU Radio Conference 2017 Thursday, September 14 th 10am ATSC FLOWGRAPH LOADING 3% 99% 76% 36% 10% 33%
More informationWindowing System on a 3D Pipeline. February 2005
Windowing System on a 3D Pipeline February 2005 Agenda 1.Overview of the 3D pipeline 2.NVIDIA software overview 3.Strengths and challenges with using the 3D pipeline GeForce 6800 220M Transistors April
More informationSAPPHIRE R7 260X 2GB GDDR5 OC BATLELFIELD 4 EDITION
SAPPHIRE R7 260X 2GB GDDR5 OC BATLELFIELD 4 EDITION Specification Display Support Output GPU Video Memory Dimension Software Accessory 4 x Maximum Display Monitor(s) support 1 x HDMI (with 3D) 1 x DisplayPort
More informationXfx Amd Radeon Hd 7970 Dual Fan Black Edition Review
Xfx Amd Radeon Hd 7970 Dual Fan Black Edition Review XFX Radeon R9 285 Black Edition OC Review Since AMD announced their GPU 14 R9 series video cards, AIB partners have Two 89mm dual axial fans and five
More information2 x Maximum Display Monitor(s) support MHz Core Clock 28 nm Chip 384 x Stream Processors. 145(L)X95(W)X26(H) mm Size. 1.
Model 11215-01-20G SAPPHIRE R7 250 2GB DDR3 WITH BOOST Specification Display Support Output GPU Video Memory Dimension Software 2 x Maximum Display Monitor(s) support 1 x D-Sub(VGA) 1 x HDMI (with 3D)
More informationObject Space Lighting. Dan Baker Founder, Oxide Games
Object Space Lighting Dan Baker Founder, Oxide Games Ashes of the Singularity Nitrous Engine/Oxide Games New studio founded from industry vets Firaxis, Zenimax, Stardock Ground up, custom engine Several
More informationThe Road to the AMD. Fiji GPU. Featuring Die Stacking and HBM Technology 1 THE ROAD TO THE AMD FIJI GPU ECTC 2016 MAY 2015
The Road to the AMD Fiji GPU Featuring Die Stacking and HBM Technology 1 THE ROAD TO THE AMD FIJI GPU ECTC 2016 MAY 2015 Fiji Chip DETAILED LOOK 4GB High-Bandwidth Memory 4096-bit wide interface 512 GB/s
More informationAn Incremental Rendering VM. Georg Haaser, Harald Steinlechner Stefan Maierhofer, and Robert F. Tobler VRVis Research Center Vienna, Austria
, Harald Steinlechner Stefan Maierhofer, and Robert F. Tobler VRVis Research Center Vienna, Austria Motivation Rendering huge scenes with dynamism Editor applications / strategy games Everything is changeable
More informationArchitectures. Michael Doggett Department of Computer Science Lund University 2009 Tomas Akenine-Möller and Michael Doggett 1
Architectures Michael Doggett Department of Computer Science Lund University 2009 Tomas Akenine-Möller and Michael Doggett 1 Overview of today s lecture The idea is to cover some of the existing graphics
More informationGPU ARCHITECTURE Chris Schultz, June 2017
Chris Schultz, June 2017 MISC All of the opinions expressed in this presentation are my own and do not reflect any held by NVIDIA 2 OUTLINE Problems Solved Over Time versus Why are they different? Complex
More informationAMD Brings New Value to Radeon
AMD Brings New Value to Radeon New form factors, memory technologies, driver capabilities, & DirectX 12 support Summary The PC gaming graphics industry today has very few players but many problems. Today
More informationAccelerating Cloud Graphics
Accelerating Cloud Graphics Franck DIARD, Ph. D. SW Architect Distinguished Engineer, NVIDIA Agenda 30 minute talk 10 minute demo 10 minute Q&A GeForce GRID Lower Latency Higher Density Higher Quality
More informationOptimizing and Profiling Unity Games for Mobile Platforms. Angelo Theodorou Senior Software Engineer, MPG Gamelab 2014, 25 th -27 th June
Optimizing and Profiling Unity Games for Mobile Platforms Angelo Theodorou Senior Software Engineer, MPG Gamelab 2014, 25 th -27 th June 1 Agenda Introduction ARM and the presenter Preliminary knowledge
More informationGraphics Performance Optimisation. John Spitzer Director of European Developer Technology
Graphics Performance Optimisation John Spitzer Director of European Developer Technology Overview Understand the stages of the graphics pipeline Cherchez la bottleneck Once found, either eliminate or balance
More informationOptimizing DirectX Graphics. Richard Huddy European Developer Relations Manager
Optimizing DirectX Graphics Richard Huddy European Developer Relations Manager Some early observations Bear in mind that graphics performance problems are both commoner and rarer than you d think The most
More informationExplicit Multi GPU Programming with DirectX 12. Juha Sjöholm Developer Technology Engineer NVIDIA
Explicit Multi GPU Programming with DirectX 12 Juha Sjöholm Developer Technology Engineer NVIDIA Agenda What is explicit Multi GPU API Introduction Engine Requirements Frame Pipelining Case Study Problem
More informationGPU > CPU. FOR HIGH PERFORMANCE COMPUTING PRESENTATION BY - SADIQ PASHA CHETHANA DILIP
GPU > CPU. FOR HIGH PERFORMANCE COMPUTING PRESENTATION BY - SADIQ PASHA CHETHANA DILIP INTRODUCTION or With the exponential increase in computational power of todays hardware, the complexity of the problem
More informationAti opengl 1.4 windows 7 32bit. Ati opengl 1.4 windows 7 32bit.zip
Ati opengl 1.4 windows 7 32bit Ati opengl 1.4 windows 7 32bit.zip texture compression; OpenGL 1.4 - Depth textures. Release date: September 7Windows 7 opengl 32bit ati drivers Support OpenGL VR SDK 1.4
More informationD3D12 & Vulkan Done Right. Gareth Thomas Developer Technology Engineer, AMD
D3D12 & Vulkan Done Right Gareth Thomas Developer Technology Engineer, AMD Agenda Barriers Copy Queue Resources Pipeline Shaders What is *not* in this talk Async compute Check out Async Compute: Deep Dive
More informationAdministrivia. HW0 scores, HW1 peer-review assignments out. If you re having Cython trouble with HW2, let us know.
Administrivia HW0 scores, HW1 peer-review assignments out. HW2 out, due Nov. 2. If you re having Cython trouble with HW2, let us know. Review on Wednesday: Post questions on Piazza Introduction to GPUs
More informationCSE 591/392: GPU Programming. Introduction. Klaus Mueller. Computer Science Department Stony Brook University
CSE 591/392: GPU Programming Introduction Klaus Mueller Computer Science Department Stony Brook University First: A Big Word of Thanks! to the millions of computer game enthusiasts worldwide Who demand
More informationIntel Optane Memory and Intel SSD 545s combine to offer NVMe-class storage performance. November 24, 2017 Version 1.0
Intel Optane Memory and Intel SSD 545s combine to offer NVMe-class storage performance November 24, 2017 Version 1.0 A Complex Landscape of Storage Options Consumers and enthusiasts that want to build
More informationState-Of-The-Art Standards
State-Of-The-Art Standards XMP Generally speaking, modern BIOS designs make it easy to load XMP configurations. Before Intel introduced XMP (Extreme Memory Profile) in late 2007, overclocking memory was
More informationOptimizing for DirectX Graphics. Richard Huddy European Developer Relations Manager
Optimizing for DirectX Graphics Richard Huddy European Developer Relations Manager Also on today from ATI... Start & End Time: 12:00pm 1:00pm Title: Precomputed Radiance Transfer and Spherical Harmonic
More information* ENDNOTES: RVM-26 AND RZG-01.
2 * ENDNOTES: RVM-26 AND RZG-01. 3 4 5 6 7 *SEE ENDNOTES GD-126 ** RESULTS MAY VARY. SEE ENDNOTES RZP-31 8 * SEE ENDNOTES: RZP-31 ** SEE ENDNOTES: GD-126 *** AMD DEFINES PREMIUM PROCESSOR COOLING AS A
More informationCOMP 4801 Final Year Project. Ray Tracing for Computer Graphics. Final Project Report FYP Runjing Liu. Advised by. Dr. L.Y.
COMP 4801 Final Year Project Ray Tracing for Computer Graphics Final Project Report FYP 15014 by Runjing Liu Advised by Dr. L.Y. Wei 1 Abstract The goal of this project was to use ray tracing in a rendering
More informationGPGPU introduction and network applications. PacketShaders, SSLShader
GPGPU introduction and network applications PacketShaders, SSLShader Agenda GPGPU Introduction Computer graphics background GPGPUs past, present and future PacketShader A GPU-Accelerated Software Router
More informationReal-Time Rendering Architectures
Real-Time Rendering Architectures Mike Houston, AMD Part 1: throughput processing Three key concepts behind how modern GPU processing cores run code Knowing these concepts will help you: 1. Understand
More informationChallenges for GPU Architecture. Michael Doggett Graphics Architecture Group April 2, 2008
Michael Doggett Graphics Architecture Group April 2, 2008 Graphics Processing Unit Architecture CPUs vsgpus AMD s ATI RADEON 2900 Programming Brook+, CAL, ShaderAnalyzer Architecture Challenges Accelerated
More information! Readings! ! Room-level, on-chip! vs.!
1! 2! Suggested Readings!! Readings!! H&P: Chapter 7 especially 7.1-7.8!! (Over next 2 weeks)!! Introduction to Parallel Computing!! https://computing.llnl.gov/tutorials/parallel_comp/!! POSIX Threads
More informationBeyond Programmable Shading 2012
Beyond Programmable Shading Course ACM SIGGRAPH 2012 Beyond Programmable Shading 2012 Aaron Lefohn Intel Mike Houston AMD Welcome 5 th consecutive year of Beyond Programmable Shading SIGGRAPH course This
More informationOpenACC Course. Office Hour #2 Q&A
OpenACC Course Office Hour #2 Q&A Q1: How many threads does each GPU core have? A: GPU cores execute arithmetic instructions. Each core can execute one single precision floating point instruction per cycle
More informationNext Generation OpenGL Neil Trevett Khronos President NVIDIA VP Mobile Copyright Khronos Group Page 1
Next Generation OpenGL Neil Trevett Khronos President NVIDIA VP Mobile Ecosystem @neilt3d Copyright Khronos Group 2015 - Page 1 Copyright Khronos Group 2015 - Page 2 Khronos Connects Software to Silicon
More informationBuilding NVLink for Developers
Building NVLink for Developers Unleashing programmatic, architectural and performance capabilities for accelerated computing Why NVLink TM? Simpler, Better and Faster Simplified Programming No specialized
More informationHeadline in Arial Bold 30pt. Visualisation using the Grid Jeff Adie Principal Systems Engineer, SAPK July 2008
Headline in Arial Bold 30pt Visualisation using the Grid Jeff Adie Principal Systems Engineer, SAPK July 2008 Agenda Visualisation Today User Trends Technology Trends Grid Viz Nodes Software Ecosystem
More informationFaster2 Video Transcode in Handbrake* vs Intel Core i7-7700k Processor
MEGATASK TO THE EXTREME 2.3X Faster2 Video Transcode in Handbrake* vs i7-7700k 2.4X Faster1 4K Video Editing Adobe* Premiere* Pro CC vs Intel Core i7-7700k Edit 4K video, render 3D effects, and compose
More informationParallel Computer Architecture and Programming Final Project
Muhammad Hilman Beyri (mbeyri), Zixu Ding (zixud) Parallel Computer Architecture and Programming Final Project Summary We have developed a distributed interactive ray tracing application in OpenMP and
More informationLucid Virtu Installation Guide
Lucid Virtu Installation Guide 1 Introduction Lucid VIRTU solution is designed for Intel Sandy Bridge platform with Intel Processor Graphics enabled. VIRTU dynamically assigns tasks to best available graphics
More informationGigabyte Nvidia Gtx 260 Manual READ ONLINE
Gigabyte Nvidia Gtx 260 Manual READ ONLINE Gigabyte GeForce GTX 260 Super Overclock - I had a bit of deja vu when Gigabyte contacted me a few weeks ago to take their latest GeForce GTX 260 for a spin.
More informationPractical Development for Vulkan. Dan Ginsburg, Valve Baldur Karlsson, Unity Dean Sekulic, Croteam
Practical Development for Vulkan Dan Ginsburg, Valve Baldur Karlsson, Unity Dean Sekulic, Croteam Session Overview Vulkan Status Update, Dan Ginsburg Vulkan Care and Feeding, Dean Sekulic Debugging with
More informationVIRTU Universal MVP Installation Guide
VIRTU Universal MVP Installation Guide 1 1. Introduction VIRTU Universal MVP includes the base features of Virtu Universal technology, which virtualizes integrated GPU and discrete GPU for best of breed
More informationNVIDIA nfinitefx Engine: Programmable Pixel Shaders
NVIDIA nfinitefx Engine: Programmable Pixel Shaders The NVIDIA nfinitefx Engine: The NVIDIA nfinitefx TM engine gives developers the ability to program a virtually infinite number of special effects and
More informationTUNING CUDA APPLICATIONS FOR MAXWELL
TUNING CUDA APPLICATIONS FOR MAXWELL DA-07173-001_v6.5 August 2014 Application Note TABLE OF CONTENTS Chapter 1. Maxwell Tuning Guide... 1 1.1. NVIDIA Maxwell Compute Architecture... 1 1.2. CUDA Best Practices...2
More informationSAPPHIRE TOXIC R9 280X 3GB GDDR5
SAPPHIRE TOXIC R9 280X 3GB GDDR5 Specification Display Support Output GPU Video Memory Dimension Software Accessory 5 x Maximum Display Monitor(s) support 1 x HDMI (with 3D) 2 x Mini-DisplayPort 1 x Single-Link
More informationSIGGRAPH Briefing August 2014
Copyright Khronos Group 2014 - Page 1 SIGGRAPH Briefing August 2014 Neil Trevett VP Mobile Ecosystem, NVIDIA President, Khronos Copyright Khronos Group 2014 - Page 2 Significant Khronos API Ecosystem Advances
More informationReal-World Applications of Computer Arithmetic
1 Commercial Applications Real-World Applications of Computer Arithmetic Stuart Oberman General purpose microprocessors with high performance FPUs AMD Athlon Intel P4 Intel Itanium Application specific
More informationMultimedia in Mobile Phones. Architectures and Trends Lund
Multimedia in Mobile Phones Architectures and Trends Lund 091124 Presentation Henrik Ohlsson Contact: henrik.h.ohlsson@stericsson.com Working with multimedia hardware (graphics and displays) at ST- Ericsson
More informationGPU Architecture and Function. Michael Foster and Ian Frasch
GPU Architecture and Function Michael Foster and Ian Frasch Overview What is a GPU? How is a GPU different from a CPU? The graphics pipeline History of the GPU GPU architecture Optimizations GPU performance
More informationNVIDIA Parallel Nsight. Jeff Kiel
NVIDIA Parallel Nsight Jeff Kiel Agenda: NVIDIA Parallel Nsight Programmable GPU Development Presenting Parallel Nsight Demo Questions/Feedback Programmable GPU Development More programmability = more
More informationAnalyze and Optimize Windows* Game Applications Using Intel INDE Graphics Performance Analyzers (GPA)
Analyze and Optimize Windows* Game Applications Using Intel INDE Graphics Performance Analyzers (GPA) Intel INDE Graphics Performance Analyzers (GPA) are powerful, agile tools enabling game developers
More informationThis Unit: Putting It All Together. CIS 371 Computer Organization and Design. What is Computer Architecture? Sources
This Unit: Putting It All Together CIS 371 Computer Organization and Design Unit 15: Putting It All Together: Anatomy of the XBox 360 Game Console Application OS Compiler Firmware CPU I/O Memory Digital
More informationNVIDIA s Compute Unified Device Architecture (CUDA)
NVIDIA s Compute Unified Device Architecture (CUDA) Mike Bailey mjb@cs.oregonstate.edu Reaching the Promised Land NVIDIA GPUs CUDA Knights Corner Speed Intel CPUs General Programmability 1 History of GPU
More informationNVIDIA s Compute Unified Device Architecture (CUDA)
NVIDIA s Compute Unified Device Architecture (CUDA) Mike Bailey mjb@cs.oregonstate.edu Reaching the Promised Land NVIDIA GPUs CUDA Knights Corner Speed Intel CPUs General Programmability History of GPU
More informationCUDA (Compute Unified Device Architecture)
CUDA (Compute Unified Device Architecture) Mike Bailey History of GPU Performance vs. CPU Performance GFLOPS Source: NVIDIA G80 = GeForce 8800 GTX G71 = GeForce 7900 GTX G70 = GeForce 7800 GTX NV40 = GeForce
More informationCollecting OpenCL*-related Metrics with Intel Graphics Performance Analyzers
Collecting OpenCL*-related Metrics with Intel Graphics Performance Analyzers Collecting Important OpenCL*-related Metrics with Intel GPA System Analyzer Introduction Intel SDK for OpenCL* Applications
More informationA SIMD-efficient 14 Instruction Shader Program for High-Throughput Microtriangle Rasterization
A SIMD-efficient 14 Instruction Shader Program for High-Throughput Microtriangle Rasterization Jordi Roca Victor Moya Carlos Gonzalez Vicente Escandell Albert Murciego Agustin Fernandez, Computer Architecture
More informationLucid Virtu Installation Guide
Lucid Virtu Installation Guide 1 Introduction Lucid VIRTU solution is designed for Intel Sandy Bridge platform with Intel Processor Graphics enabled. VIRTU dynamically assigns tasks to best available graphics
More informationCS8803SC Software and Hardware Cooperative Computing GPGPU. Prof. Hyesoon Kim School of Computer Science Georgia Institute of Technology
CS8803SC Software and Hardware Cooperative Computing GPGPU Prof. Hyesoon Kim School of Computer Science Georgia Institute of Technology Why GPU? A quiet revolution and potential build-up Calculation: 367
More informationGraphics Pass-through with VT-d
Graphics Pass-through with VT-d Nov-19-2009 Weidong Han Ben Lin Xen Summit Asia 2009 Agenda Graphics Virtualization Introduction Graphics Pass-through with VT-d Performance Conclusion 2 Requirements on
More informationLecture 7: The Programmable GPU Core. Kayvon Fatahalian CMU : Graphics and Imaging Architectures (Fall 2011)
Lecture 7: The Programmable GPU Core Kayvon Fatahalian CMU 15-869: Graphics and Imaging Architectures (Fall 2011) Today A brief history of GPU programmability Throughput processing core 101 A detailed
More informationSAPPHIRE DUAL-X R9 270X 2GB GDDR5 OC WITH BOOST
SAPPHIRE DUAL-X R9 270X 2GB GDDR5 OC WITH BOOST Specification Display Support Output GPU Video Memory Dimension Software Accessory 3 x Maximum Display Monitor(s) support 1 x HDMI (with 3D) 1 x DisplayPort
More informationUltimate Graphics Performance for DirectX 10 Hardware
Ultimate Graphics Performance for DirectX 10 Hardware Nicolas Thibieroz European Developer Relations AMD Graphics Products Group nicolas.thibieroz@amd.com V1.01 Generic API Usage DX10 designed for performance
More informationSPIN: Seamless Operating System Integration of Peer-to-Peer DMA Between SSDs and GPUs. Shai Bergman Tanya Brokhman Tzachi Cohen Mark Silberstein
: Seamless Operating System Integration of Peer-to-Peer DMA Between SSDs and s Shai Bergman Tanya Brokhman Tzachi Cohen Mark Silberstein What do we do? Enable efficient file I/O for s Why? Support diverse
More informationGraphics Architectures and OpenCL. Michael Doggett Department of Computer Science Lund university
Graphics Architectures and OpenCL Michael Doggett Department of Computer Science Lund university Overview Parallelism Radeon 5870 Tiled Graphics Architectures Important when Memory and Bandwidth limited
More informationLecture 6: Texture. Kayvon Fatahalian CMU : Graphics and Imaging Architectures (Fall 2011)
Lecture 6: Texture Kayvon Fatahalian CMU 15-869: Graphics and Imaging Architectures (Fall 2011) Today: texturing! Texture filtering - Texture access is not just a 2D array lookup ;-) Memory-system implications
More informationVulkan Launch Webinar 18 th February Copyright Khronos Group Page 1
Vulkan Launch Webinar 18 th February 2016 Copyright Khronos Group 2016 - Page 1 Copyright Khronos Group 2016 - Page 2 The Vulkan Launch Webinar Is About to Start! Kathleen Mattson - Webinar MC, Khronos
More informationSoft Particles. Tristan Lorach
Soft Particles Tristan Lorach tlorach@nvidia.com January 2007 Document Change History Version Date Responsible Reason for Change 1 01/17/07 Tristan Lorach Initial release January 2007 ii Abstract Before:
More informationTechnical Report. GLSL Pseudo-Instancing
Technical Report GLSL Pseudo-Instancing Abstract GLSL Pseudo-Instancing This whitepaper and corresponding SDK sample demonstrate a technique to speed up the rendering of instanced geometry with GLSL. The
More informationDominic Filion, Senior Engineer Blizzard Entertainment. Rob McNaughton, Lead Technical Artist Blizzard Entertainment
Dominic Filion, Senior Engineer Blizzard Entertainment Rob McNaughton, Lead Technical Artist Blizzard Entertainment Screen-space techniques Deferred rendering Screen-space ambient occlusion Depth of Field
More informationFast Stereoscopic Rendering on Mobile Ray Tracing GPU for Virtual Reality Applications
Fast Stereoscopic Rendering on Mobile Ray Tracing GPU for Virtual Reality Applications SAMSUNG Advanced Institute of Technology Won-Jong Lee, Seok Joong Hwang, Youngsam Shin, Jeong-Joon Yoo, Soojung Ryu
More informationCSCI-GA Graphics Processing Units (GPUs): Architecture and Programming Lecture 2: Hardware Perspective of GPUs
CSCI-GA.3033-004 Graphics Processing Units (GPUs): Architecture and Programming Lecture 2: Hardware Perspective of GPUs Mohamed Zahran (aka Z) mzahran@cs.nyu.edu http://www.mzahran.com History of GPUs
More informationSLICING THE WORKLOAD MULTI-GPU OPENGL RENDERING APPROACHES
SLICING THE WORKLOAD MULTI-GPU OPENGL RENDERING APPROACHES INGO ESSER NVIDIA DEVTECH PROVIZ OVERVIEW Motivation Tools of the trade Multi-GPU driver functions Multi-GPU programming functions Multi threaded
More informationAnatomy of AMD s TeraScale Graphics Engine
Anatomy of AMD s TeraScale Graphics Engine Mike Houston Design Goals Focus on Efficiency f(perf/watt, Perf/$) Scale up processing power and AA performance Target >2x previous generation Enhance stream
More informationCS GPU and GPGPU Programming Lecture 8+9: GPU Architecture 7+8. Markus Hadwiger, KAUST
CS 380 - GPU and GPGPU Programming Lecture 8+9: GPU Architecture 7+8 Markus Hadwiger, KAUST Reading Assignment #5 (until March 12) Read (required): Programming Massively Parallel Processors book, Chapter
More informationPower Efficiency for Software Algorithms running on Graphics Processors. Björn Johnsson Per Ganestam Michael Doggett Tomas Akenine-Möller
1 Power Efficiency for Software Algorithms running on Graphics Processors Björn Johnsson Per Ganestam Michael Doggett Tomas Akenine-Möller Overview 2 Motivation Goal Project Applications Methodology Results
More informationToday s Agenda. DirectX 9 Features Sim Dietrich, nvidia - Multisample antialising Jason Mitchell, ATI - Shader models and coding tips
Today s Agenda DirectX 9 Features Sim Dietrich, nvidia - Multisample antialising Jason Mitchell, ATI - Shader models and coding tips Optimization for DirectX 9 Graphics Mike Burrows, Microsoft - Performance
More informationParallel Programming on Larrabee. Tim Foley Intel Corp
Parallel Programming on Larrabee Tim Foley Intel Corp Motivation This morning we talked about abstractions A mental model for GPU architectures Parallel programming models Particular tools and APIs This
More informationECE 574 Cluster Computing Lecture 17
ECE 574 Cluster Computing Lecture 17 Vince Weaver http://web.eece.maine.edu/~vweaver vincent.weaver@maine.edu 28 March 2019 HW#8 (CUDA) posted. Project topics due. Announcements 1 CUDA installing On Linux
More informationThis Unit: Putting It All Together. CIS 501 Computer Architecture. What is Computer Architecture? Sources
This Unit: Putting It All Together CIS 501 Computer Architecture Unit 12: Putting It All Together: Anatomy of the XBox 360 Game Console Application OS Compiler Firmware CPU I/O Memory Digital Circuits
More informationDuksu Kim. Professional Experience Senior researcher, KISTI High performance visualization
Duksu Kim Assistant professor, KORATEHC Education Ph.D. Computer Science, KAIST Parallel Proximity Computation on Heterogeneous Computing Systems for Graphics Applications Professional Experience Senior
More informationInteractive Light Mapping with PowerVR Ray Tracing
Interactive Light Mapping with PowerVR Ray Tracing Jens Fursund Justin DeCell Light Map Basics A light map is a texture that stores lighting for objects in the scene 3 Generation of light maps for GI Charting
More informationOn-the-fly Vertex Reuse for Massively-Parallel Software Geometry Processing
2018 On-the-fly for Massively-Parallel Software Geometry Processing Bernhard Kerbl Wolfgang Tatzgern Elena Ivanchenko Dieter Schmalstieg Markus Steinberger 5 4 3 4 2 5 6 7 6 3 1 2 0 1 0, 0,1,7, 7,1,2,
More informationCS4230 Parallel Programming. Lecture 3: Introduction to Parallel Architectures 8/28/12. Homework 1: Parallel Programming Basics
CS4230 Parallel Programming Lecture 3: Introduction to Parallel Architectures Mary Hall August 28, 2012 Homework 1: Parallel Programming Basics Due before class, Thursday, August 30 Turn in electronically
More informationD3D12 & Vulkan: Lessons learned. Dr. Matthäus G. Chajdas Developer Technology Engineer, AMD
D3D12 & Vulkan: Lessons learned Dr. Matthäus G. Chajdas Developer Technology Engineer, AMD D3D12 What s new? DXIL DXGI & UWP updates Root Signature 1.1 Shader cache GPU validation PIX D3D12 / DXIL DXBC
More informationCourse Recap + 3D Graphics on Mobile GPUs
Lecture 18: Course Recap + 3D Graphics on Mobile GPUs Interactive Computer Graphics Q. What is a big concern in mobile computing? A. Power Two reasons to save power Run at higher performance for a fixed
More information