Panel Discussion: The Future of I/O From a CPU Architecture Perspective

Similar documents
SIMULATOR AMD RESEARCH JUNE 14, 2015

AMD ACCELERATING TECHNOLOGIES FOR EXASCALE COMPUTING FELLOW 3 OCTOBER 2016

INTRODUCTION TO OPENCL TM A Beginner s Tutorial. Udeepta Bordoloi AMD


AMD IOMMU VERSION 2 How KVM will use it. Jörg Rödel August 16th, 2011

OPENCL TM APPLICATION ANALYSIS AND OPTIMIZATION MADE EASY WITH AMD APP PROFILER AND KERNELANALYZER

AMD RYZEN PROCESSOR WITH RADEON VEGA GRAPHICS CORPORATE BRAND GUIDELINES

HETEROGENEOUS SYSTEM ARCHITECTURE: PLATFORM FOR THE FUTURE

AMD CORPORATE TEMPLATE AMD Radeon Open Compute Platform Felix Kuehling

ROCm: An open platform for GPU computing exploration

AMD Graphics Team Last Updated February 11, 2013 APPROVED FOR PUBLIC DISTRIBUTION. 1 3DMark Overview February 2013 Approved for public distribution

AMD APU and Processor Comparisons. AMD Client Desktop Feb 2013 AMD

AMD Graphics Team Last Updated April 29, 2013 APPROVED FOR PUBLIC DISTRIBUTION. 1 3DMark Overview April 2013 Approved for public distribution

KVM CPU MODEL IN SYSCALL EMULATION MODE ALEXANDRU DUTU, JOHN SLICE JUNE 14, 2015

INTERFERENCE FROM GPU SYSTEM SERVICE REQUESTS

CLICK TO EDIT MASTER TITLE STYLE. Click to edit Master text styles. Second level Third level Fourth level Fifth level

Use cases. Faces tagging in photo and video, enabling: sharing media editing automatic media mashuping entertaining Augmented reality Games

HyperTransport Technology

THE PROGRAMMER S GUIDE TO THE APU GALAXY. Phil Rogers, Corporate Fellow AMD

EFFICIENT SPARSE MATRIX-VECTOR MULTIPLICATION ON GPUS USING THE CSR STORAGE FORMAT

CAUTIONARY STATEMENT This presentation contains forward-looking statements concerning Advanced Micro Devices, Inc. (AMD) including, but not limited to

MEASURING AND MODELING ON-CHIP INTERCONNECT POWER ON REAL HARDWARE

Heterogeneous Computing

Generic System Calls for GPUs

Understanding GPGPU Vector Register File Usage

EXPLOITING ACCELERATOR-BASED HPC FOR ARMY APPLICATIONS

AMD EPYC CORPORATE BRAND GUIDELINES

3D Numerical Analysis of Two-Phase Immersion Cooling for Electronic Components

ADVANCED RENDERING EFFECTS USING OPENCL TM AND APU Session Olivier Zegdoun AMD Sr. Software Engineer

CAUTIONARY STATEMENT This presentation contains forward-looking statements concerning Advanced Micro Devices, Inc. (AMD) including, but not limited to

viewdle! - machine vision experts

Sequential Consistency for Heterogeneous-Race-Free

FLASH MEMORY SUMMIT Adoption of Caching & Hybrid Solutions

AMD SEV Update Linux Security Summit David Kaplan, Security Architect

FUSION PROCESSORS AND HPC

BIOMEDICAL DATA ANALYSIS ON HETEROGENEOUS PLATFORM. Dong Ping Zhang Heterogeneous System Architecture AMD

Run Anywhere. The Hardware Platform Perspective. Ben Pollan, AMD Java Labs October 28, 2008

AMD HD3D Technology. Setup Guide. 1 AMD HD3D TECHNOLOGY: Setup Guide

SOLUTION TO SHADER RECOMPILES IN RADEONSI SEPTEMBER 2015

AMD RYZEN CORPORATE BRAND GUIDELINES

RegMutex: Inter-Warp GPU Register Time-Sharing

Desktop Telepresence Arrived! Sudha Valluru ViVu CEO

Fusion Enabled Image Processing

Automatic Intra-Application Load Balancing for Heterogeneous Systems

Multi-core processors are here, but how do you resolve data bottlenecks in native code?

LIQUIDVR TODAY AND TOMORROW GUENNADI RIGUER, SOFTWARE ARCHITECT

AMD AIB Partner Guidelines. Version February, 2015

Maximizing Six-Core AMD Opteron Processor Performance with RHEL

The mobile computing evolution. The Griffin architecture. Memory enhancements. Power management. Thermal management

AMD Radeon ProRender plug-in for Unreal Engine. Installation Guide

Designing Natural Interfaces

Accelerating Applications. the art of maximum performance computing James Spooner Maxeler VP of Acceleration

D3D12 & Vulkan: Lessons learned. Dr. Matthäus G. Chajdas Developer Technology Engineer, AMD

ACCELERATING MATRIX PROCESSING WITH GPUs. Nicholas Malaya, Shuai Che, Joseph Greathouse, Rene van Oostrum, and Michael Schulte AMD Research

GPGPU COMPUTE ON AMD. Udeepta Bordoloi April 6, 2011

Gestural and Cinematic Interfaces - DX11. David Brebner Unlimited Realities CTO

NEXT-GENERATION MATRIX 3D IMMERSIVE USER INTERFACE [ M3D-IUI ] H Raghavendra Swamy AMD Senior Software Engineer

STREAMING VIDEO DATA INTO 3D APPLICATIONS Session Christopher Mayer AMD Sr. Software Engineer

1 HiPEAC January, 2012 Public TASKS, FUTURES AND ASYNCHRONOUS PROGRAMMING

CAUTIONARY STATEMENT 1 AMD NEXT HORIZON NOVEMBER 6, 2018

The Rise of Open Programming Frameworks. JC BARATAULT IWOCL May 2015

The Road to the AMD. Fiji GPU. Featuring Die Stacking and HBM Technology 1 THE ROAD TO THE AMD FIJI GPU ECTC 2016 MAY 2015

HPG 2011 HIGH PERFORMANCE GRAPHICS HOT 3D

Pattern-based analytics to estimate and track yield risk of designs down to 7nm

DR. LISA SU

clarmor: A DYNAMIC BUFFER OVERFLOW DETECTOR FOR OPENCL KERNELS CHRIS ERB, JOE GREATHOUSE, MAY 16, 2018

HIGHLY PARALLEL COMPUTING IN PHYSICS-BASED RENDERING OpenCL Raytracing Based. Thibaut PRADOS OPTIS Real-Time & Virtual Reality Manager

Vulkan (including Vulkan Fast Paths)

AMD S X86 OPEN64 COMPILER. Michael Lai AMD

AMD Security and Server innovation

HPCA 18. Reliability-aware Data Placement for Heterogeneous memory Architecture

PROTECTING VM REGISTER STATE WITH AMD SEV-ES DAVID KAPLAN LSS 2017

MIGRATION OF LEGACY APPLICATIONS TO HETEROGENEOUS ARCHITECTURES Francois Bodin, CTO, CAPS Entreprise. June 2011

THE HETEROGENEOUS SYSTEM ARCHITECTURE IT S BEYOND THE GPU

OpenCL Implementation Of A Heterogeneous Computing System For Real-time Rendering And Dynamic Updating Of Dense 3-d Volumetric Data

Graphics Hardware 2008

Changing your Driver Options with Radeon Pro Settings. Quick Start User Guide v3.0

SCALING DGEMM TO MULTIPLE CAYMAN GPUS AND INTERLAGOS MANY-CORE CPUS FOR HPL

HSA Foundation! Advanced Topics on Heterogeneous System Architectures. Politecnico di Milano! Seminar Room (Bld 20)! 15 December, 2017!

Driver Options in AMD Radeon Pro Settings. User Guide

HSA foundation! Advanced Topics on Heterogeneous System Architectures. Politecnico di Milano! Seminar Room A. Alario! 23 November, 2015!

Fan Control in AMD Radeon Pro Settings. User Guide. This document is a quick user guide on how to configure GPU fan speed in AMD Radeon Pro Settings.

Device Pack. Network Video Management System Standard Edition. Release Note. Software Version: Sony Corporation

Changing your Driver Options with Radeon Pro Settings. Quick Start User Guide v2.1

1 Presentation Title Month ##, 2012

Cilk Plus: Multicore extensions for C and C++

User Manual. Nvidia Jetson Series Carrier board Aetina ACE-N622

MULTIMEDIA PROCESSING Real-time H.264 video enhancement by using AMD APP SDK

Introducing NVDIMM-X: Designed to be the World s Fastest NAND-Based SSD Architecture and a Platform for the Next Generation of New Media SSDs

PCCC WORKSHOP:AMD の最新製品戦略とプラットフォームソリューション FEBRUARY 19 TH 2016 HIDETOSHI IWASA, FAE MANAGER AMD JAPAN

INTRODUCING RYZEN MARCH

Microsoft Windows 2016 Mellanox 100GbE NIC Tuning Guide

NUMA Topology for AMD EPYC Naples Family Processors

AMD 780G. Niles Burbank AMD. an x86 chipset with advanced integrated GPU. Hot Chips 2008

Resource Saving: Latest Innovation in Optimized Cloud Infrastructure

VTR-2000 Evaluation and Product Development Platform. Instruction Sheet SOC Technologies Inc.

Device Pack. Network Video Management System Enterprise Edition. Release Note. Software Version: Sony Corporation

Device Pack. Network Video Management System Standard Edition. Release Note. Software Version: 9.5a Sony Corporation

オープンソ プンソース技術者のための AMD 最新テクノロジーアップデート 日本 AMD 株式会社 マーケティング ビジネス開発本部 エンタープライズプロダクトマーケティング部 山野 洋幸

Encoder Core. API Specification. Revision: SOC Technologies Inc.

Transcription:

Panel Discussion: The Future of I/O From a CPU Architecture Perspective Brad Benton AMD, Inc. #OFADevWorkshop

Issues Move to Exascale involves more parallel processing across more processing elements GPUs, FPGAs, Specialized accelerators, Processing In Memory (PIM), etc. These elements frequently do not live in the same coherent memory domain Results in much data movement to bring data to the needed processing element March 30 April 2, 2014 #OFADevWorkshop 2

Heterogeneous System Architecture (HSA) HSA Foundation Not-for-Profit Industry Consortium of SOC and SOC IP vendors, OEMs, academia, OSVs and ISVs defining a consistent heterogeneous platform architecture to make it dramatically easier to program heterogeneous parallel devices Develops open, royalty-free industry specifications and APIs for heterogeneous computing http://www.hsafoundation.com/ Goal: Bring Accelerators (and other devices) forward as first class citizens Unified address space Operate in pageable system memory Full memory coherence User mode dispatch/scheduling March 30 April 2, 2014 #OFADevWorkshop 3

HSA Foundation Membership 50+ Members Multiple categories of membership Founding Members AMD ARM Imagination MediaTek Qualcomm Samsung Texas Instruments March 30 April 2, 2014 #OFADevWorkshop 4

Application-level access to HSA devices Application C C C C C C Application talks directly to hardware No system call No kernel driver involved Hardware scheduling Low dispatch latency Application B B B B HSA Device Standardized binary packet format Architected Queueing Language (AQL) A A Application A A March 30 April 2, 2014 #OFADevWorkshop 5

HSA is Designed to go Beyond the GPU CPU Audio Processor Video Hardware Encode Decode Engines Shared Memory, Coherency, User Mode Queues GPU Fixed Function Accelerator DSP Image Signal Processing March 30 April 2, 2014 #OFADevWorkshop 6

HSA and I/O Devices Okay HSA looks interesting from the viewpoint of accelerators, how is this related to I/O? I/O devices can participate, as well! The HSA and its programming model are designed to allow any I/O devices capable of page fault handling to participate in a shared virtual address space. March 30 April 2, 2014 #OFADevWorkshop 7

HSA is Designed to go Beyond the GPU CPU Audio Processor Video Hardware Encode Decode Engines Shared Memory, Coherency, User Mode Queues GPU Fixed Function Accelerator DSP Image Signal Processing March 30 April 2, 2014 #OFADevWorkshop 8

HSA is Designed to go Beyond the GPU CPU Audio Processor Video Hardware I/O Devices Shared Memory, Coherency, User Mode Queues GPU Fixed Function Accelerator DSP Image Signal Processing March 30 April 2, 2014 #OFADevWorkshop 9

HSA CPU/GPU Queueing CPU GPU March 30 April 2, 2014 #OFADevWorkshop 10

HSA CPU/GPU Queueing + NIC Queueing CPU GPU NIC March 30 April 2, 2014 #OFADevWorkshop 11

HSA IOMMU/Device Interactions March 30 April 2, 2014 #OFADevWorkshop 12

HSA and I/O Devices Implications for OFA Software stack Unified memory addressing via system-wide accessible page tables Support for no pinning, on-demand paging options Aligns with requirements from MPI and PGAS input to OFIWG to simplify memory registration March 30 April 2, 2014 #OFADevWorkshop 13

Disclaimer & Attribution The information presented in this document is for informational purposes only and may contain technical inaccuracies, omissions and typographical errors. The information contained herein is subject to change and may be rendered inaccurate for many reasons, including but not limited to product and roadmap changes, component and motherboard version changes, new model and/or product releases, product differences between differing manufacturers, software changes, BIOS flashes, firmware upgrades, or the like. AMD assumes no obligation to update or otherwise correct or revise this information. However, AMD reserves the right to revise this information and to make changes from time to time to the content hereof without obligation of AMD to notify any person of such revisions or changes. AMD MAKES NO REPRESENTATIONS OR WARRANTIES WITH RESPECT TO THE CONTENTS HEREOF AND ASSUMES NO RESPONSIBILITY FOR ANY INACCURACIES, ERRORS OR OMISSIONS THAT MAY APPEAR IN THIS INFORMATION. AMD SPECIFICALLY DISCLAIMS ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR ANY PARTICULAR PURPOSE. IN NO EVENT WILL AMD BE LIABLE TO ANY PERSON FOR ANY DIRECT, INDIRECT, SPECIAL OR OTHER CONSEQUENTIAL DAMAGES ARISING FROM THE USE OF ANY INFORMATION CONTAINED HEREIN, EVEN IF AMD IS EXPRESSLY ADVISED OF THE POSSIBILITY OF SUCH DAMAGES. ATTRIBUTION 2014 Advanced Micro Devices, Inc. All rights reserved. AMD, the AMD Arrow logo and combinations thereof are trademarks of Advanced Micro Devices, Inc. in the United States and/or other jurisdictions. SPEC is a registered trademark of the Standard Performance Evaluation Corporation (SPEC). Other names are for informational purposes only and may be trademarks of their respective owners.

Thank You #OFADevWorkshop