AMD CORPORATE TEMPLATE AMD Radeon Open Compute Platform Felix Kuehling

Similar documents
ROCm: An open platform for GPU computing exploration

AMD ACCELERATING TECHNOLOGIES FOR EXASCALE COMPUTING FELLOW 3 OCTOBER 2016

OPENCL TM APPLICATION ANALYSIS AND OPTIMIZATION MADE EASY WITH AMD APP PROFILER AND KERNELANALYZER

SIMULATOR AMD RESEARCH JUNE 14, 2015

CAUTIONARY STATEMENT 1 AMD NEXT HORIZON NOVEMBER 6, 2018

INTRODUCTION TO OPENCL TM A Beginner s Tutorial. Udeepta Bordoloi AMD

CAUTIONARY STATEMENT This presentation contains forward-looking statements concerning Advanced Micro Devices, Inc. (AMD) including, but not limited to

The Rise of Open Programming Frameworks. JC BARATAULT IWOCL May 2015

Panel Discussion: The Future of I/O From a CPU Architecture Perspective

AMD Graphics Team Last Updated February 11, 2013 APPROVED FOR PUBLIC DISTRIBUTION. 1 3DMark Overview February 2013 Approved for public distribution

AMD IOMMU VERSION 2 How KVM will use it. Jörg Rödel August 16th, 2011

AMD RYZEN PROCESSOR WITH RADEON VEGA GRAPHICS CORPORATE BRAND GUIDELINES

HETEROGENEOUS SYSTEM ARCHITECTURE: PLATFORM FOR THE FUTURE

KVM CPU MODEL IN SYSCALL EMULATION MODE ALEXANDRU DUTU, JOHN SLICE JUNE 14, 2015

AMD APU and Processor Comparisons. AMD Client Desktop Feb 2013 AMD


AMD Graphics Team Last Updated April 29, 2013 APPROVED FOR PUBLIC DISTRIBUTION. 1 3DMark Overview April 2013 Approved for public distribution

CAUTIONARY STATEMENT This presentation contains forward-looking statements concerning Advanced Micro Devices, Inc. (AMD) including, but not limited to

INTERFERENCE FROM GPU SYSTEM SERVICE REQUESTS

ADVANCED RENDERING EFFECTS USING OPENCL TM AND APU Session Olivier Zegdoun AMD Sr. Software Engineer

clarmor: A DYNAMIC BUFFER OVERFLOW DETECTOR FOR OPENCL KERNELS CHRIS ERB, JOE GREATHOUSE, MAY 16, 2018

Understanding GPGPU Vector Register File Usage

Automatic Intra-Application Load Balancing for Heterogeneous Systems

ACCELERATING MATRIX PROCESSING WITH GPUs. Nicholas Malaya, Shuai Che, Joseph Greathouse, Rene van Oostrum, and Michael Schulte AMD Research

Use cases. Faces tagging in photo and video, enabling: sharing media editing automatic media mashuping entertaining Augmented reality Games

HyperTransport Technology

SOLUTION TO SHADER RECOMPILES IN RADEONSI SEPTEMBER 2015

Generic System Calls for GPUs

AMD Radeon ProRender plug-in for Unreal Engine. Installation Guide

MEASURING AND MODELING ON-CHIP INTERCONNECT POWER ON REAL HARDWARE

EFFICIENT SPARSE MATRIX-VECTOR MULTIPLICATION ON GPUS USING THE CSR STORAGE FORMAT

EXPLOITING ACCELERATOR-BASED HPC FOR ARMY APPLICATIONS

FLASH MEMORY SUMMIT Adoption of Caching & Hybrid Solutions

THE PROGRAMMER S GUIDE TO THE APU GALAXY. Phil Rogers, Corporate Fellow AMD

AMD SEV Update Linux Security Summit David Kaplan, Security Architect

Heterogeneous Computing

viewdle! - machine vision experts

BIOMEDICAL DATA ANALYSIS ON HETEROGENEOUS PLATFORM. Dong Ping Zhang Heterogeneous System Architecture AMD

Driver Options in AMD Radeon Pro Settings. User Guide

Sequential Consistency for Heterogeneous-Race-Free

HIGHLY PARALLEL COMPUTING IN PHYSICS-BASED RENDERING OpenCL Raytracing Based. Thibaut PRADOS OPTIS Real-Time & Virtual Reality Manager

GPGPU COMPUTE ON AMD. Udeepta Bordoloi April 6, 2011

Changing your Driver Options with Radeon Pro Settings. Quick Start User Guide v3.0

The Road to the AMD. Fiji GPU. Featuring Die Stacking and HBM Technology 1 THE ROAD TO THE AMD FIJI GPU ECTC 2016 MAY 2015

AMD HD3D Technology. Setup Guide. 1 AMD HD3D TECHNOLOGY: Setup Guide

AMD AIB Partner Guidelines. Version February, 2015

RegMutex: Inter-Warp GPU Register Time-Sharing

HPG 2011 HIGH PERFORMANCE GRAPHICS HOT 3D

DR. LISA SU

D3D12 & Vulkan: Lessons learned. Dr. Matthäus G. Chajdas Developer Technology Engineer, AMD

The mobile computing evolution. The Griffin architecture. Memory enhancements. Power management. Thermal management

Fusion Enabled Image Processing

MIGRATION OF LEGACY APPLICATIONS TO HETEROGENEOUS ARCHITECTURES Francois Bodin, CTO, CAPS Entreprise. June 2011

Designing Natural Interfaces

3D Numerical Analysis of Two-Phase Immersion Cooling for Electronic Components

Run Anywhere. The Hardware Platform Perspective. Ben Pollan, AMD Java Labs October 28, 2008

Pattern-based analytics to estimate and track yield risk of designs down to 7nm

Vulkan (including Vulkan Fast Paths)

NEXT-GENERATION MATRIX 3D IMMERSIVE USER INTERFACE [ M3D-IUI ] H Raghavendra Swamy AMD Senior Software Engineer

Desktop Telepresence Arrived! Sudha Valluru ViVu CEO

Changing your Driver Options with Radeon Pro Settings. Quick Start User Guide v2.1

AMD RYZEN CORPORATE BRAND GUIDELINES

Solid State Graphics (SSG) SDK Setup and Raw Video Player Guide

1 HiPEAC January, 2012 Public TASKS, FUTURES AND ASYNCHRONOUS PROGRAMMING

Accelerating Applications. the art of maximum performance computing James Spooner Maxeler VP of Acceleration

AMD EPYC CORPORATE BRAND GUIDELINES

Multi-core processors are here, but how do you resolve data bottlenecks in native code?

PCCC WORKSHOP:AMD の最新製品戦略とプラットフォームソリューション FEBRUARY 19 TH 2016 HIDETOSHI IWASA, FAE MANAGER AMD JAPAN

AMD Radeon ProRender plug-in for Universal Scene Description. Installation Guide

STREAMING VIDEO DATA INTO 3D APPLICATIONS Session Christopher Mayer AMD Sr. Software Engineer

PROTECTING VM REGISTER STATE WITH AMD SEV-ES DAVID KAPLAN LSS 2017

Fan Control in AMD Radeon Pro Settings. User Guide. This document is a quick user guide on how to configure GPU fan speed in AMD Radeon Pro Settings.

Gestural and Cinematic Interfaces - DX11. David Brebner Unlimited Realities CTO

SCALING DGEMM TO MULTIPLE CAYMAN GPUS AND INTERLAGOS MANY-CORE CPUS FOR HPL

Cilk Plus: Multicore extensions for C and C++

Maximizing Six-Core AMD Opteron Processor Performance with RHEL

AMD s Radeon Next Generation GPU Architecture. Vega10

LIQUIDVR TODAY AND TOMORROW GUENNADI RIGUER, SOFTWARE ARCHITECT

AMD Security and Server innovation

amdgpu Graphics Stack Documentation

AMD S X86 OPEN64 COMPILER. Michael Lai AMD

MULTIMEDIA PROCESSING Real-time H.264 video enhancement by using AMD APP SDK

FUSION PROCESSORS AND HPC

OpenCL Implementation Of A Heterogeneous Computing System For Real-time Rendering And Dynamic Updating Of Dense 3-d Volumetric Data

Microsoft Windows 2016 Mellanox 100GbE NIC Tuning Guide

HPCA 18. Reliability-aware Data Placement for Heterogeneous memory Architecture

1 Presentation Title Month ##, 2012

INTRODUCING RYZEN MARCH

Anatomy of AMD s TeraScale Graphics Engine

User Manual. Nvidia Jetson Series Carrier board Aetina ACE-N622

AMD S ROCM SOFTWARE HELPS ACCELERATE HPC & DEEP LEARNING A HETEROGENEOUS COMPUTE ACCELERATION PLATFORM DESIGNED FOR THE FUTURE

Graphics Hardware 2008

オープンソ プンソース技術者のための AMD 最新テクノロジーアップデート 日本 AMD 株式会社 マーケティング ビジネス開発本部 エンタープライズプロダクトマーケティング部 山野 洋幸

AMD 780G. Niles Burbank AMD. an x86 chipset with advanced integrated GPU. Hot Chips 2008

CAUTIONARY STATEMENT 1 EPYC PROCESSOR ONE YEAR ANNIVERSARY JUNE 2018

Resource Saving: Latest Innovation in Optimized Cloud Infrastructure

Radeon Pro Software: Radeon Pro ReLive. User Guide v3.0

GPU LIBRARY ADVISOR. DA _v8.0 September Application Note

MxGPU Setup Guide with VMware

NVIDIA COLLECTIVE COMMUNICATION LIBRARY (NCCL)

Transcription:

AMD Radeon Open Compute Platform Felix Kuehling

ROCM PLATFORM ON LINUX Compiler Front End AMDGPU Driver Enabled with ROCm GCN Assembly Device LLVM Compiler (GCN) LLVM Opt Passes GCN Target Host LLVM Compiler LLVM Opt Passes CPU ISA Target User Space Kernel Space LanguageRuntime API ROCr System Runtime API ROCm Thunk API ROCm Kernel Driver (KFD) CPU Code GPU Code AMDGPU Kernel Driver DRM TTM

KFD FEATURES AND HW SUPPORT Upstreaming progress: 4.17: Fiji, Polaris 4.18: Vega10 4.19: Raven Ridge 4.20: Vega20 Compatible with ROCm 1.9 User mode queues Automatic memory residency management Multi-GPU memory mapping Shared Virtual Memory Device enumeration and properties (Topology) Asynchronous events Not yet upstream: Peer2peer memory mapping DMABuf import / export (GFX interop, IPC) GDB support (RDMA)

ROCm PROGRAMMING MODEL OPTIONS HIP Convert CUDA to portable C++ Single-source Host+Kernel C++ Kernel Language C Runtime (CUDA-like) Platforms: AMD GPU, NVIDIA (same perf as native CUDA) When to use it? Port existing CUDA code Developers familiar with CUDA New project that needs portability to AMD and NVIDIA HCC True single-source C++ accelerator language Single-source Host+Kernel C++ Kernel Language C++ Runtime Platforms: AMD GPU When to use it? New projects where true C++ language preferred Use features from latest ISO C++ standards OpenCL Khronos Industry Standard accelerator language Split Host/Kernel C99-based Kernel Language C Runtime Platforms: CPU, GPU, FPGA When to use it? Port existing OpenCL code New project that needs portability to CPU, GPU, FPGA

ROCM ML SOFTWARE STACK Applications Machine Learning App Frameworks Caffe Caffe2 TensorFlow PyTorch Keras MxNet Middleware & Libraries Programming Models ROCm MIOpen BLAS, FFT, RNG RCCL Eigen HCC HIP OpenCL TM Python ROCm Platform Italics = Under Dev

LINKS Online Documentation: https://rocmdocumentation.readthedocs.io/en/latest/ ROCm 1.9 release notes: https://rocmdocumentation.readthedocs.io/en/latest/current_r elease_notes/current-release-notes.html GitHub: https://github.com/radeonopencompute/rocm

DISCLAIMER & ATTRIBUTION The information presented in this document is for informational purposes only and may contain technical inaccuracies, omissions and typographical errors. The information contained herein is subject to change and may be rendered inaccurate for many reasons, including but not limited to product and roadmap changes, component and motherboard version changes, new model and/or product releases, product differences between differing manufacturers, software changes, BIOS flashes, firmware upgrades, or the like. AMD assumes no obligation to update or otherwise correct or revise this information. However, AMD reserves the right to revise this information and to make changes from time to time to the content hereof without obligation of AMD to notify any person of such revisions or changes. AMD MAKES NO REPRESENTATIONS OR WARRANTIES WITH RESPECT TO THE CONTENTS HEREOF AND ASSUMES NO RESPONSIBILITY FOR ANY INACCURACIES, ERRORS OR OMISSIONS THAT MAY APPEAR IN THIS INFORMATION. AMD SPECIFICALLY DISCLAIMS ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR ANY PARTICULAR PURPOSE. IN NO EVENT WILL AMD BE LIABLE TO ANY PERSON FOR ANY DIRECT, INDIRECT, SPECIAL OR OTHER CONSEQUENTIAL DAMAGES ARISING FROM THE USE OF ANY INFORMATION CONTAINED HEREIN, EVEN IF AMD IS EXPRESSLY ADVISED OF THE POSSIBILITY OF SUCH DAMAGES. ATTRIBUTION 2018 Advanced Micro Devices, Inc. All rights reserved. AMD, the AMD Arrow logo, FirePro and combinations thereof are trademarks of Advanced Micro Devices, Inc. in the United States and/or other jurisdictions. ARM is a registered trademark of ARM Limited in the UK and other countries. PCIe is a registered trademarks of PCI-SIG Corporation. OpenCL and the OpenCL logo are trademarks of Apple, Inc. and used by permission of Khronos. OpenVX is a trademark of Khronos Group, Inc. Other names are for informational purposes only and may be trademarks of their respective owners. Use of third party marks / names is for informational purposes only and no endorsement of or by AMD is intended or implied.