SIMULATOR AMD RESEARCH JUNE 14, 2015

Similar documents
Panel Discussion: The Future of I/O From a CPU Architecture Perspective

OPENCL TM APPLICATION ANALYSIS AND OPTIMIZATION MADE EASY WITH AMD APP PROFILER AND KERNELANALYZER

AMD CORPORATE TEMPLATE AMD Radeon Open Compute Platform Felix Kuehling

AMD ACCELERATING TECHNOLOGIES FOR EXASCALE COMPUTING FELLOW 3 OCTOBER 2016

INTRODUCTION TO OPENCL TM A Beginner s Tutorial. Udeepta Bordoloi AMD

HETEROGENEOUS SYSTEM ARCHITECTURE: PLATFORM FOR THE FUTURE

KVM CPU MODEL IN SYSCALL EMULATION MODE ALEXANDRU DUTU, JOHN SLICE JUNE 14, 2015

ROCm: An open platform for GPU computing exploration

AMD IOMMU VERSION 2 How KVM will use it. Jörg Rödel August 16th, 2011

AMD APU and Processor Comparisons. AMD Client Desktop Feb 2013 AMD

ADVANCED RENDERING EFFECTS USING OPENCL TM AND APU Session Olivier Zegdoun AMD Sr. Software Engineer


INTERFERENCE FROM GPU SYSTEM SERVICE REQUESTS

The Rise of Open Programming Frameworks. JC BARATAULT IWOCL May 2015

Understanding GPGPU Vector Register File Usage

AMD Graphics Team Last Updated February 11, 2013 APPROVED FOR PUBLIC DISTRIBUTION. 1 3DMark Overview February 2013 Approved for public distribution

Sequential Consistency for Heterogeneous-Race-Free

Heterogeneous Computing

Generic System Calls for GPUs

AMD RYZEN PROCESSOR WITH RADEON VEGA GRAPHICS CORPORATE BRAND GUIDELINES

HyperTransport Technology

SOLUTION TO SHADER RECOMPILES IN RADEONSI SEPTEMBER 2015

Run Anywhere. The Hardware Platform Perspective. Ben Pollan, AMD Java Labs October 28, 2008

FLASH MEMORY SUMMIT Adoption of Caching & Hybrid Solutions

AMD Graphics Team Last Updated April 29, 2013 APPROVED FOR PUBLIC DISTRIBUTION. 1 3DMark Overview April 2013 Approved for public distribution

Use cases. Faces tagging in photo and video, enabling: sharing media editing automatic media mashuping entertaining Augmented reality Games

THE PROGRAMMER S GUIDE TO THE APU GALAXY. Phil Rogers, Corporate Fellow AMD

RegMutex: Inter-Warp GPU Register Time-Sharing

Multi-core processors are here, but how do you resolve data bottlenecks in native code?

EFFICIENT SPARSE MATRIX-VECTOR MULTIPLICATION ON GPUS USING THE CSR STORAGE FORMAT

MEASURING AND MODELING ON-CHIP INTERCONNECT POWER ON REAL HARDWARE

BIOMEDICAL DATA ANALYSIS ON HETEROGENEOUS PLATFORM. Dong Ping Zhang Heterogeneous System Architecture AMD

EXPLOITING ACCELERATOR-BASED HPC FOR ARMY APPLICATIONS

FUSION PROCESSORS AND HPC

CAUTIONARY STATEMENT 1 AMD NEXT HORIZON NOVEMBER 6, 2018

HSA foundation! Advanced Topics on Heterogeneous System Architectures. Politecnico di Milano! Seminar Room A. Alario! 23 November, 2015!

Automatic Intra-Application Load Balancing for Heterogeneous Systems

GPGPU COMPUTE ON AMD. Udeepta Bordoloi April 6, 2011

viewdle! - machine vision experts

HPG 2011 HIGH PERFORMANCE GRAPHICS HOT 3D

CAUTIONARY STATEMENT This presentation contains forward-looking statements concerning Advanced Micro Devices, Inc. (AMD) including, but not limited to

HSA Foundation! Advanced Topics on Heterogeneous System Architectures. Politecnico di Milano! Seminar Room (Bld 20)! 15 December, 2017!

MIGRATION OF LEGACY APPLICATIONS TO HETEROGENEOUS ARCHITECTURES Francois Bodin, CTO, CAPS Entreprise. June 2011

CAUTIONARY STATEMENT This presentation contains forward-looking statements concerning Advanced Micro Devices, Inc. (AMD) including, but not limited to

clarmor: A DYNAMIC BUFFER OVERFLOW DETECTOR FOR OPENCL KERNELS CHRIS ERB, JOE GREATHOUSE, MAY 16, 2018

HIGHLY PARALLEL COMPUTING IN PHYSICS-BASED RENDERING OpenCL Raytracing Based. Thibaut PRADOS OPTIS Real-Time & Virtual Reality Manager

AMD Radeon ProRender plug-in for Unreal Engine. Installation Guide

3D Numerical Analysis of Two-Phase Immersion Cooling for Electronic Components

1 Presentation Title Month ##, 2012

AMD EPYC CORPORATE BRAND GUIDELINES

Fusion Enabled Image Processing

ACCELERATING MATRIX PROCESSING WITH GPUs. Nicholas Malaya, Shuai Che, Joseph Greathouse, Rene van Oostrum, and Michael Schulte AMD Research

AMD SEV Update Linux Security Summit David Kaplan, Security Architect

Accelerating Applications. the art of maximum performance computing James Spooner Maxeler VP of Acceleration

NEXT-GENERATION MATRIX 3D IMMERSIVE USER INTERFACE [ M3D-IUI ] H Raghavendra Swamy AMD Senior Software Engineer

AMD S X86 OPEN64 COMPILER. Michael Lai AMD

1 HiPEAC January, 2012 Public TASKS, FUTURES AND ASYNCHRONOUS PROGRAMMING

DR. LISA SU

AMD AIB Partner Guidelines. Version February, 2015

STREAMING VIDEO DATA INTO 3D APPLICATIONS Session Christopher Mayer AMD Sr. Software Engineer

AMD HD3D Technology. Setup Guide. 1 AMD HD3D TECHNOLOGY: Setup Guide

PROTECTING VM REGISTER STATE WITH AMD SEV-ES DAVID KAPLAN LSS 2017

CLICK TO EDIT MASTER TITLE STYLE. Click to edit Master text styles. Second level Third level Fourth level Fifth level

Designing Natural Interfaces

Maximizing Six-Core AMD Opteron Processor Performance with RHEL

D3D12 & Vulkan: Lessons learned. Dr. Matthäus G. Chajdas Developer Technology Engineer, AMD

AMD RYZEN CORPORATE BRAND GUIDELINES

Gestural and Cinematic Interfaces - DX11. David Brebner Unlimited Realities CTO

OpenCL Implementation Of A Heterogeneous Computing System For Real-time Rendering And Dynamic Updating Of Dense 3-d Volumetric Data

Pattern-based analytics to estimate and track yield risk of designs down to 7nm

Vulkan (including Vulkan Fast Paths)

The Road to the AMD. Fiji GPU. Featuring Die Stacking and HBM Technology 1 THE ROAD TO THE AMD FIJI GPU ECTC 2016 MAY 2015

HPCA 18. Reliability-aware Data Placement for Heterogeneous memory Architecture

The mobile computing evolution. The Griffin architecture. Memory enhancements. Power management. Thermal management

Desktop Telepresence Arrived! Sudha Valluru ViVu CEO

LIQUIDVR TODAY AND TOMORROW GUENNADI RIGUER, SOFTWARE ARCHITECT

SCALING DGEMM TO MULTIPLE CAYMAN GPUS AND INTERLAGOS MANY-CORE CPUS FOR HPL

MULTIMEDIA PROCESSING Real-time H.264 video enhancement by using AMD APP SDK

Cilk Plus: Multicore extensions for C and C++

Heterogeneous SoCs. May 28, 2014 COMPUTER SYSTEM COLLOQUIUM 1

PCCC WORKSHOP:AMD の最新製品戦略とプラットフォームソリューション FEBRUARY 19 TH 2016 HIDETOSHI IWASA, FAE MANAGER AMD JAPAN

Graphics Hardware 2008

AMD Radeon ProRender plug-in for Universal Scene Description. Installation Guide

AMD Security and Server innovation

Changing your Driver Options with Radeon Pro Settings. Quick Start User Guide v3.0

Introducing NVDIMM-X: Designed to be the World s Fastest NAND-Based SSD Architecture and a Platform for the Next Generation of New Media SSDs

Driver Options in AMD Radeon Pro Settings. User Guide

Fan Control in AMD Radeon Pro Settings. User Guide. This document is a quick user guide on how to configure GPU fan speed in AMD Radeon Pro Settings.

Resource Saving: Latest Innovation in Optimized Cloud Infrastructure

Changing your Driver Options with Radeon Pro Settings. Quick Start User Guide v2.1

CAUTIONARY STATEMENT 1 EPYC PROCESSOR ONE YEAR ANNIVERSARY JUNE 2018

Solid State Graphics (SSG) SDK Setup and Raw Video Player Guide

THE HETEROGENEOUS SYSTEM ARCHITECTURE IT S BEYOND THE GPU

User Manual. Nvidia Jetson Series Carrier board Aetina ACE-N622

Device Pack. Network Video Management System Standard Edition. Release Note. Software Version: 9.5a Sony Corporation

INTRODUCING RYZEN MARCH

Compiling for HSA accelerators with GCC

Microsoft Windows 2016 Mellanox 100GbE NIC Tuning Guide

Device Pack. Network Video Management System Standard Edition. Release Note. Software Version: Sony Corporation

Anatomy of AMD s TeraScale Graphics Engine

Transcription:

AMD'S gem5apu SIMULATOR AMD RESEARCH JUNE 14, 2015

OVERVIEW Introducing AMD s gem5 APU Simulator Extends gem5 with a GPU timing model Supports Heterogeneous System Architecture in SE mode Includes several Ruby-based APU memory systems This talk Heterogeneous System Architecture Execution flow Release schedule Future plans 2 AMD'S gem5apu SIMULATOR JUNE 20, 2015 PUBLIC

WHAT IS HSA? HETEROGENEOUS SYSTEM ARCHITECTURE PARALLEL WORKLOADS huma and hq SERIAL WORKLOADS Processor design that makes it easy to harnessthe entire computing power of an APU for faster and more power-efficient devices, including personal computers, tablets, smartphones, and cloud servers APU ACCELERATED PROCESSING UNIT 3 AMD'S gem5apu SIMULATOR JUNE 20, 2015 PUBLIC

HSA BUILDING BLOCKS http://hsafoundation.com http://github.com/hsafoundation HSA Hardware Building Blocks Shared Virtual Memory Single address space Coherent Pageable Fast access from all components Can share pointers Architected User-Level Queues Signals Context Switching Platform Atomics Defined Memory Model Open- Source HSA Platform System Arch Specification Industry standard, architected requirements for how devices share memory and communicate with each other HSA Software Building Blocks HSAIL Portable, parallel, compiler IR Instruction definition HSA Runtime Create queues Allocate memory Device discovery Open- Source Open- Source HSA Programmer s Reference Manual HSA System Runtime Specification Open- Source Multiple high level compilers CLANG/LLVM/HSAIL C++, OpenMP, OpenACC, Python, OpenCL, etc Industry standard compiler IR and runtime to enable existing programming languages to target the GPU 4 AMD'S gem5apu SIMULATOR JUNE 20, 2015 PUBLIC

APU SIMULATION SUPPORT HSA Hardware Building Blocks Shared Virtual Memory Single address space Coherent Pageable Fast access from all components Can share pointers Architected User-Level Queues Signals Context Switching Platform Atomics Defined Memory Model HSA Software Building Blocks HSAIL Portable, parallel, compiler IR Instruction definition HSA Runtime (OpenCL TM Runtime) Create queues Allocate memory Device discovery Multiple high-level compilers CLANG/LLVM/HSAIL C++, OpenMP, OpenACC, Python, OpenCL, etc 5 AMD'S gem5apu SIMULATOR JUNE 20, 2015 PUBLIC Legend Included in first release Work-in-progress / may be released Longer term work

APU SIMULATION FLOW GPGPU flow based on gem5 w/ AMD added components Supports HSAIL/BRIG kernels System emulation simulation No OS or device driver Execution-driven evaluation GPU directly executes HSAIL CPU executes x86 Details GCN GPU model* Multiple GPU cache hierarchies Write-through Read-for-ownership [*] AMD Graphics Core Next (GCN) Architecture, White Paper. http://www.amd.com/documents/gcn_architecture_whitepaper.pdf. June 2012. GCC x86 host binary Emulated OpenCL driver ops. CPU OpenCL benchmarks Ruby HSA LLVM HSAIL compiler https://github.com/hsafoundation/ HSAIL-HLC-Stable HSAIL kernel Brig file Brig binary parser X86 X86 CPU CPU X86 X86 CPU CPU X86 X86 CPU CPU X86 CUCPU GPU Cache Hierarchy 6 AMD'S gem5apu SIMULATOR JUNE 20, 2015 PUBLIC

RELEASE SCHEDULE First patches posted for review on May 11 Posting all patches by the end of October We appreciate the active discussion and reviews We made many, many modifications based on reviewer feedback Please keep the cost/benefit analysis in mind when asking for changes 7 AMD'S gem5apu SIMULATOR JUNE 20, 2015 PUBLIC

Questions?

DISCLAIMER & ATTRIBUTION The information presented in this document is for informational purposes only and may contain technical inaccuracies, omissions and typographical errors. The information contained herein is subject to change and may be rendered inaccurate for many reasons, including but not limitedto product and roadmap changes, component and motherboard version changes, new model and/or product releases, product differences between differing manufacturers, software changes, BIOS flashes, firmware upgrades, or the like. AMD assumes no obligation to update or otherwise correct or revise this information. However, AMD reserves the right to revise this information and to make changes from time to time to the content hereof without obligation of AMD to notify any person of such revisions or changes. AMD MAKES NO REPRESENTATIONS OR WARRANTIES WITH RESPECT TO THE CONTENTS HEREOF AND ASSUMES NO RESPONSIBILITY FOR ANY INACCURACIES, ERRORS OR OMISSIONS THAT MAY APPEAR IN THIS INFORMATION. AMD SPECIFICALLY DISCLAIMS ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR ANY PARTICULAR PURPOSE. IN NO EVENT WILL AMDBE LIABLE TO ANY PERSON FOR ANY DIRECT, INDIRECT, SPECIAL OR OTHER CONSEQUENTIAL DAMAGES ARISING FROM THE USE OF ANY INFORMATION CONTAINED HEREIN, EVEN IF AMD IS EXPRESSLY ADVISED OF THE POSSIBILITY OF SUCH DAMAGES. ATTRIBUTION 2015 Advanced Micro Devices, Inc. All rights reserved. AMD, the AMD Arrow logoand combinations thereof are trademarks of Advanced Micro Devices, Inc. in the United States and/or other jurisdictions. SPEC is a registered trademark of the Standard Performance EvaluationCorporation (SPEC). OpenCLis a trademark of Apple Inc. used by permission by Khronos. Other names are for informational purposes only and may be trademarks of their respective owners. 9 AMD'S gem5apu SIMULATOR JUNE 20, 2015 PUBLIC

10 AMD'S gem5apu SIMULATOR JUNE 20, 2015 PUBLIC