Future of Supercomputing: The Computational Element

Size: px
Start display at page:

Download "Future of Supercomputing: The Computational Element"

Transcription

1 C O N F I D E N T I A L Future of Supercomputing: The Computational Element Tommy Toles tommy.toles@amd.com

2 Agenda HPC The New World Order Whole > Sum-of-Parts? Key Challenges A Look Ahead 2

3 All Worldwide Servers Compared To HPC Revenue, Units & Processors All Servers Worldwide 2003 to to CAGR CAGR Total Factory Revenue ($B) $46,149 $49,146 $51,268 $52, % 1.9% Units Shipped (same as nodes) 5,278,222 6,307,484 7,050,099 7,472, % 6.0% Processor Dies Shipped 8,662,823 10,134,624 11,712,766 12,779, % 9.1% Source: IDC 2007 HPC Technical Servers Worldwide 2003 to to CAGR CAGR HPC Server Revenue ($B) $5,698 $7,393 $9,208 $10, % 8.9% Adjusted Revenues (To match enter $5,128 $6,654 $8,287 $9, % 8.9% Node Units Shipped 411, ,510 1,215,735 1,419, % 16.7% Processor Elements Shipped 1,002,905 1,657,827 2,681,079 3,351, % 25.0% Source: IDC 2007 HPC As A Ratio Of All Servers Revenue ($B) 12.3% 15.0% 18.0% 19.2% Adjusted Revenues (Apples-to-apple 11.1% 13.5% 16.2% 17.3% Units Shipped (Nodes) 7.8% 11.6% 17.2% 19.0% Processors Shipped 11.6% 16.4% 22.6% 26.1% Source: IDC 2007

4 Revenue percentage by CPU type Total HPC Revenue by Processor Type Source: IDC 100% 90% 80% 70% 60% 50% 40% 30% 20% 10% 0% x86-64 x86-32 VECTOR RISC EPIC X86 bases system took 63% share in 2006

5 Revenue percentage by O/S Total HPC Revenue by OS Source: IDC 100% 80% 60% 40% Linux Unix W/NT. Other 20% 0% Linux systems accounted for 66% of the total revenue in 2006

6 Historical Performance Metrics Performance Time AMD VSP Custo 6

7 Innovations New Metric: Performance per Watt Performance Increase Performance within Power Envelope Time AMD VSP Custo 7

8 Power The final frontier The combined total of data centers in California for 2004 were estimated to require ~300MW of energy. That s equivalent to ~5000 barrels of oil a day! SOURCE: California Energy Commission 9/27/

9 Feeding the Beast (Watts) More Back-up Power Generator Requirements $$$ More High-Voltage switch Equipment Requirements $$$ More High-Capacity CRAC units (air-conditioners) $$$ More UPS equipment requirements $$$ Lower Density/ Unusable Floor Space $$$ Data Center Expansion $$$ 9/27/

10 IT Knows it has a Problem Power Consumption/Cooling Issues Tracked By Company: 71% How Are Companies Addressing These Issues Power Consumption 12% Increased Amount Of Power Supplied To Data Center 44% Cooling 21% Stopped Buying More Servers And/Or Consolidated Existing Equipment 26% Both Equally Important Suspect That Both are Issues but do not Track at this Time 12% 38% Implemented "Cool Aisle/Hot Aisle Layout Increased Size Of Data Center Other 25% 23% 16% Neither Presents An Issue at this Time 17% None Of The Above 15% Base: 1,177 IT Decision Makers November 2005 Strategy Group/Ziff-Davis Average of 18% of total rack space wasted due to power and cooling issues 9/27/

11 Next-Generation Power Comparison Watts From: Memory Northbridge CPU In 2006 Next-Generation AMD Opteron Defined A New Standard In Performance-Per-Watt With Energy-Efficient DDR2 Memory and Improved AMD PowerNow! Capabilities 35W 83W 22W 83W 22W 83W 22W 35W In mid-2007 We Plan to Offer Quad-Core AMD Opteron in the Same DDR2-based Platforms at the Same Power Efficiency 190W 290W Dual-Core 180W 266W 190W Quad-Core Rev F Xeon Dempsey Xeon Wood- Crest Xeon Clover- Town Quad-Core 11 Wattage based on 2P systems with 8 DIMMs at max CPU wattage; Wattage for Dempsey, Woodcrest and Clovertown is estimated based on currently publicly available values (see, eg: and is subject to change. The examples contained herein are intended for informational purposes only. Other factors will affect real-world power consumption.

12 At the Wall Power Comparison 12

13 AMD Power Efficiency Innovation Independent Dynamic Core Technology AMD CoolCore Technology Same Power And Thermal Envelopes As Dual-Core! Dual Dynamic Power Management Low-Power DDR2 Memory 13

14 Computational Requirements 14 Source: Andy Bechtolsheim, IEEE Grid & Cluster Conference, 2007

15 What Makes a Great CPU even better? Frequency Lift Instruction Set Enhancement Increasing Cores Core Capabilities Memory Access (Capacity, Bandwidth & Latency) CPU-CPU connectivity IO Connectivity (Data access) Software Scaling 15

16 Legacy Northbridge Server Architecture FSB Sharing: Memory CPU-CPU I/O Memory access delayed by passing through NB DDR Server Processor North Bridge (MCH) Server Processor PCI-X Bridge 2 nd Processor competes For FSB B/W I/O & memory compete for FSB B/W PCI-X B/W bottlenecks: link B/W < I/O device B/W IDE, FDC, USB, Etc. South Bridge PCI AMD VSP Custo 16

17 AMD Opteron Direct Connect Server Architecture DDR AMD Opteron Processor AMD Opteron Processor DDR Reduced Bus Contention Separate Mem. And I/O Paths Dedicated Memory Path Reduced Memory Latency Better Scalability AMD-8131 PCI-X Bridge HyperTransport bus has ample bandwidth for I/O devices PCI-X IDE, FDC, USB, Etc. AMD-8111 I/O Hub PCI Fewer chips needed for basic server AMD VSP Custo 17

18 HyperTransport Technology Buses Enable Glueless Expansion for up to 8-way Servers AMD Opteron Platform HyperTransport Technology Buses for Glueless I/O or CPU Expansion Maximum of Four Processors per Memory Controller Hub Historic MP Server FSB Bus Bandwidth Shared Across All Four Processors DDR 144-bit AMD Opteron AMD Opteron DDR 144-bit Limited Memory Bandwidth Shared by All Memory Processor Processor Processor Processor DDR 144-bit Separate Memory and I/O Paths Eliminates Most Bus Contention PCI-X Other I/O AMD Opteron PCI-X Bridge * Other Bridge IDE, USB, LPC, Etc. AMD Opteron PCI-X Bridge I/O Hub** Fewer Chips Needed for 4-way Server (Reduces Cost) DDR 144-bit HyperTransport Link Has Ample Bandwidth For I/O Devices DDR 144-bit DDR 144-bit DDR 144-bit DDR 144-bit Key Memory Traffic I/O Traffic IPC Traffic Memory Address Buffer 4 Memory Address Buffer Memory Address Buffer Memory Address Buffer IDE, FDC, USB, Etc. Memory Ctlr Hub (MCH) 1 I/O Hub 3 9-Chip Chipset Needed for 4-way Server I/O & Memory Share FSB PCI-X Bridge 2 PCI-X Bridge PCI-X Bridge PCI PCI-X PCI-X PCI-X Bandwidth Bottlenecks: Link Bandwidth < I/O Bridge Bandwidth Scalable memory and I/O bandwidth Up to 8 processors without glue logic Each processor adds more memory Each processor adds additional HyperTransport buses for more PCI-X and other I/O bridges Fewer chips required AMD VSP Custo *AMD-8131 HyperTransport PCI-X Tunnel **AMD-8111 HyperTransport I/O Hub System scalability limited by Northbridge Maximum of 4 processors o Processors compete for FSB bandwidth Memory size and bandwidth are limited Maximum of 3 PCI-X bridges Many more chips required 1 ServerWorks CMIC HE Memory Controller Hub (MCH) 2 ServerWorks CIOB-X 64-bit PCI/PCI-X Controller Hub 3 ServerWorks CSB5 I/O Controller Hub 4 ServerWorks REMC Memory Address Buffer 18

19 AMD Performance Innovation AMD Wide Floating-Point Accelerator AMD Memory Optimizer Technology ~150% 100% Comprehensive Performance Enhancements! Dual-Core Quad-Core Rapid Virtualization Indexing AMD Balanced Smart Cache 19

20 Dual-Core to Quad-Core Uplift Dual-Core AMD Opteron TM 2200 Series vs. Quad-Core AMD Opteron Model Socket Performance Scaling VMmark >124% >124% SPECint_rate % 57% 59% SPECfp_rate2006 Stream memory bandwidth 23% 49% 57% 49% 54% Average Performance Increase SPECompMbase % % = Dual-Core AMD Opteron Processor Performance SPEC and the benchmark name SPECint, SPECfp and SPECOMPM are registered trademarks of the Standard Performance Evaluation Corporation. Benchmark results stated above for Dual-Core AMD Opteron processor Model 2222 reflect results published on as of Sep 9, The comparison presented above is based on results for Quad- Core AMD Opteron processor Model 2350 under submission to SPEC as of Sep 9, For the latest results visit and Stream and VMmark results based on internal measurements at AMD performance labs. 20

21 Quad-Core vs. Dual-Die AMD s design will be a TRUE quad-core processor without compromising performance, power or heat Intel may rush a dual die architecture to market in order to claim first to market, only to change the design to true quad-core later - more churn and increased customer TCO Native Quad-Core Design Optimum performance Same power & thermal envelopes as dual-core Dual Die (Dual Cavity) Can hinder performance (FSB design) Publicly known thermal design power ranges higher than dual-core products 21

22 Native Quad-Core Benefit: Faster Data Sharing Situation: Core 1 needs data in Core 3 cache How Does it Get There? Native Quad-Core AMD Opteron Core 1 L2 System Request Queue Hyper Transport Crossbar L3 Core 2 Core 3 Core L2 L2 L2 Memory Controller Quad-Core Clovertown Core 1 Core 2 Core 3 Core L2 L2 Front-Side Bus Front-Side Bus Memory Controller Northbridge 1. Core 1 probes Core 3 cache, data is copied directly back to Core 1 This happens at processor frequency Result: Improved Quad-Core Performance 1. Core 1 sends a request to the memory controller, which probes Core 3 cache 2. Core 3 sends data back to the memory controller, which forwards it to Core 1 This happens at front-side bus frequency Result: Reduced Quad-Core Performance 22

23 Performance-Per-Watt Leadership Quad-Core AMD Opteron Processor Model 2350 (75 Watt ) vs. Intel Xeon 5345 (80 Watt, without Additional Watts of Memory Controller and FBDIMM) Fluent Fluent (sedan_4m) 67% SPECompMBase2001 SPECompM2001 SPECfp_rate_base2006 SPECfp_rate2006 Both on GCC gcc SPECfp_rate2006 SPECfp_rate2006 PGI Intel compiler vs. PGI compiler 30% 27% 36% LSDyna LSDyna 3 Vehicle 3 Collision Collison SPECint_rate_base2006 SPECint_rate2006 GCC Both on gcc 12% 9% 26% Average Performance Increase SPECint_rate2006 SPECint_rate2006 PGI Intel compiler vs. PGI compiler -5% % = Intel Xeon 5345 SPEC and the benchmark name SPECint, SPECfp and SPECOMPM are registered trademarks of the Standard Performance Evaluation Corporation. Competitive benchmark results stated above reflect results published on as of Sep 9, The comparison presented above is based on results for Quad-Core AMD Opteron processor Model 2350 and Xeon 5345 (specint_rate2006 gcc and SPECompM2001 base) under submission to SPEC as of Sep 9, For the latest results visit Fluent and LSDyna result based on internal measurements at AMD performance labs. 23

24 Interconnect Memory Power CPU AMD Server Platform Roadmap Single Core Dual Core Quad Core Mainstream: 95W Blades/Rack Dense: up to 68W Maximum Performance: 120W (Special Edition) Registered DDR-2 DDR-3 Registered DDR-1 HT 2.6 GHz HyperTransport - 1 GHz

25 Investment Protection: Stable Platform Progression Long-term success for partners and end-customers 45nm Octal-core nm Dual Core 65nm Quad-core 45nm Quad-core 3 rd Generation Platform 2 nd Generation Platform 130nm Single Core 90nm Dual Core 1 st Generation Platform Stable platforms deliver better long-term value and logical transitions for partners and customers 25

26 Platforms in Market Today 2003 vs. IBM eserver X4600 X2200 X4500 X4200/4100 SC 1435 PowerEdge 2970 PowerEdge 6950 DL585 DL385 DL365 DL145 BL465c X2100 BL685c U20 & U40 WS xw st Generation AMD Opteron Blade X6220 Blade X8420 X3455 X3655 X3755 LS21 LS41 E-9422R E-9522R E-9722R E-9222T XT3 XT4 X630 S2 AMD Validated Solutions BladeFrame ES and EX G

27 Customers are Responding! AMD Opteron Server Market Share WW total 11.9% 16.0% WW x % 17.1% WW x86 2-way 13.6% 18.2% WW x86 4-way 28.2% 40.1% US x % 27.4% US x86 2-way 21.2% 27.9% US x86 4-way 36.0% 56.2% Source: Gartner, IDC, end of year results 27

28 PERF. PERF. 64-bit Single Core POWER/PERF. 64-bit Homogeneous Multi-CPU DIVERSITY Platform Co-Processing Heterogeneous CPU+xPU The Heterogeneous Processing Imperative Dual-Core Opteron AMD64 HD, DRM 486 3D, digital media Java, XML, web services 16-bit Single Core 32-bit Single Core , GUI, PowerPoint, web browsers Spreadsheets, word-processing s 2000s 2010s By the end of the decade, homogenous multi-core becomes increasingly inadequate 28

29 Continuum of Solutions Fusion" "Stream" general purpose GPU "Torrenza" HTX Accelerator PCI-E PCIe Accelerator Add-in Chipset Accelerator Chipset AMD Processor Accelerator Opteron Socket Socket compatible accelerator Accelerator CPU Package level integration (MCM) C P U NB Accelerator Silicon level integration Accelerated Processors Fusion AMD s code name for: Accelerated Processors (integrated acceleration) Torrenza AMD s code name for: slot or socket based acceleration Slot or Socket Acceleration Stream Specific example of a GPGPU accelerator under Torrenza 29

30 Torrenza: Accelerated Computing Today Direct Connect Accelerators in sockets or slots deliver superior performance without bridge chips 100s of GFlops to solve complex math Application specific optimization Familiar programming interfaces speed time to implementation Partners: Altera Celoxica DRC Lattice Xilinx XtremeData Stream ASICs FPGA Application Libraries Compilers Hardware Interfaces Partners: CTM Celoxica OpenFPGA Peakstream Rapidmind Partners: Bay Microsystems Commex NetLogic Qlogic RMI Tarari Woven IB XML iscsi 10Gb E Search Storage Security Torrenza Scale Up Virtualization Partners: 3Leaf Systems Liquid Computing Mannheim Newisys Panta Systems Specialized Direct Connect devices for highthroughput, low-latency processing cht and HT provide peer level interfaces to build systems from commodity building blocks 30 March 2007 AMD Commercial

31 How Do We Get There? Silicon Budgets Carbon Collaboration Facilities Algorithms 31

32 AMD Collaboration Resources Green Grid Developer Pages Torrenza Forum (accelerators) Home/Torrenza.aspx Lightweight Profiling for increased parallellism HyperTransport interconnect 32

33 My Prediction Texas A&M 38 OSU 13 Gig Em Aggies! 33

34 Disclaimer and Trademark Attribution DISCLAIMER The information contained herein is subject to change and may be rendered inaccurate for many reasons, including, but not limited to product and roadmap changes, component and motherboard version changes, new model and/or product releases, product differences between differing manufacturers, software changes, BIOS flashes, firmware upgrades, or the like. AMD assumes no obligation to update or otherwise correct or revise this information. However, AMD reserves the right to revise this information and to make changes from time to time to the content hereof without obligation of AMD to notify any person of such revisions or changes. AMD MAKES NO REPRESENTATIONS OR WARRANTIES WITH RESPECT TO THE CONTENTS HEREOF AND ASSUMES NO RESPONSIBILITY FOR ANY INACCURACIES, ERRORS OR OMISSIONS THAT MAY APPEAR IN THIS INFORMATION. AMD SPECIFICALLY DISCLAIMS ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR ANY PARTICULAR PURPOSE. IN NO EVENT WILL AMD BE LIABLE TO ANY PERSON FOR ANY DIRECT, INDIRECT, SPECIAL OR OTHER CONSEQUENTIAL DAMAGES ARISING FROM THE USE OF ANY INFORMATION CONTAINED HEREIN OR FOR THE PERFORMANCE OR OPERATION OF ANY PERSON, INCLUDING, WITHOUT LIMITATION, ANY LOST PROFITS, BUSINESS INTERRUPTION, DAMAGE TO OR DESTRUCTION OF PROPERTY, OR LOSS OF PROGRAMS OR OTHER DATA, EVEN IF AMD IS EXPRESSLY ADVISED OF THE POSSIBILITY OF SUCH DAMAGES Advanced Micro Devices, Inc. AMD, the AMD Arrow logo, AMD Opteron, and combinations thereof, are trademarks of Advanced Micro Devices, Inc. Windows is a registered trademark of Microsoft Corporation in the U.S. and/or other countries. Linux is a registered trademark of Linus Torvalds. WebBench and NetBench are trademarks of Ziff Davis Publishing Holdings Inc., an affiliate of Veritest Inc. Other product and company names are for informational purposes only and may be trademarks of their respective companies. 34

The Future of Computing: AMD Vision

The Future of Computing: AMD Vision The Future of Computing: AMD Vision Tommy Toles AMD Business Development Executive thomas.toles@amd.com 512-327-5389 Agenda Celebrating Momentum Years of Leadership & Innovation Current Opportunity To

More information

HyperTransport Technology

HyperTransport Technology HyperTransport Technology in 2009 and Beyond Mike Uhler VP, Accelerated Computing, AMD President, HyperTransport Consortium February 11, 2009 Agenda AMD Roadmap Update Torrenza, Fusion, Stream Computing

More information

Maximizing Six-Core AMD Opteron Processor Performance with RHEL

Maximizing Six-Core AMD Opteron Processor Performance with RHEL Maximizing Six-Core AMD Opteron Processor Performance with RHEL Bhavna Sarathy Red Hat Technical Lead, AMD Sanjay Rao Senior Software Engineer, Red Hat Sept 4, 2009 1 Agenda Six-Core AMD Opteron processor

More information

The mobile computing evolution. The Griffin architecture. Memory enhancements. Power management. Thermal management

The mobile computing evolution. The Griffin architecture. Memory enhancements. Power management. Thermal management Next-Generation Mobile Computing: Balancing Performance and Power Efficiency HOT CHIPS 19 Jonathan Owen, AMD Agenda The mobile computing evolution The Griffin architecture Memory enhancements Power management

More information

Quad-core Press Briefing First Quarter Update

Quad-core Press Briefing First Quarter Update Quad-core Press Briefing First Quarter Update AMD Worldwide Server/Workstation Marketing C O N F I D E N T I A L Outstanding Dual-core Performance Toady Average of scores places AMD ahead by 2% Average

More information

CAUTIONARY STATEMENT This presentation contains forward-looking statements concerning Advanced Micro Devices, Inc. (AMD) including, but not limited to

CAUTIONARY STATEMENT This presentation contains forward-looking statements concerning Advanced Micro Devices, Inc. (AMD) including, but not limited to CAUTIONARY STATEMENT This presentation contains forward-looking statements concerning Advanced Micro Devices, Inc. (AMD) including, but not limited to AMD s strategy and focus, expected datacenter total

More information

The AMD64 Technology for Server and Workstation. Dr. Ulrich Knechtel Enterprise Program Manager EMEA

The AMD64 Technology for Server and Workstation. Dr. Ulrich Knechtel Enterprise Program Manager EMEA The AMD64 Technology for Server and Workstation Dr. Ulrich Knechtel Enterprise Program Manager EMEA Agenda Direct Connect Architecture AMD Opteron TM Processor Roadmap Competition OEM support The AMD64

More information

Integrating FPGAs in High Performance Computing A System, Architecture, and Implementation Perspective

Integrating FPGAs in High Performance Computing A System, Architecture, and Implementation Perspective Integrating FPGAs in High Performance Computing A System, Architecture, and Implementation Perspective Nathan Woods XtremeData FPGA 2007 Outline Background Problem Statement Possible Solutions Description

More information

AMD WW HPC 07. Francesco Torricelli WW HPC Manager

AMD WW HPC 07. Francesco Torricelli WW HPC Manager AMD WW HPC 07 Francesco Torricelli WW HPC Manager Agenda Quad-Core AMD Opteron TM Long Term Roadmap AMD HPC WW 2 2007 Agenda Quad-Core AMD Opteron TM Architecture Benchmarks Tools Long Term Roadmap AMD

More information

INTRODUCTION TO OPENCL TM A Beginner s Tutorial. Udeepta Bordoloi AMD

INTRODUCTION TO OPENCL TM A Beginner s Tutorial. Udeepta Bordoloi AMD INTRODUCTION TO OPENCL TM A Beginner s Tutorial Udeepta Bordoloi AMD IT S A HETEROGENEOUS WORLD Heterogeneous computing The new normal CPU Many CPU s 2, 4, 8, Very many GPU processing elements 100 s Different

More information

HETEROGENEOUS SYSTEM ARCHITECTURE: PLATFORM FOR THE FUTURE

HETEROGENEOUS SYSTEM ARCHITECTURE: PLATFORM FOR THE FUTURE HETEROGENEOUS SYSTEM ARCHITECTURE: PLATFORM FOR THE FUTURE Haibo Xie, Ph.D. Chief HSA Evangelist AMD China OUTLINE: The Challenges with Computing Today Introducing Heterogeneous System Architecture (HSA)

More information

Six-Core AMD Opteron Processor

Six-Core AMD Opteron Processor What s you should know about the Six-Core AMD Opteron Processor (Codenamed Istanbul ) Six-Core AMD Opteron Processor Versatility Six-Core Opteron processors offer an optimal mix of performance, energy

More information

AMD ACCELERATING TECHNOLOGIES FOR EXASCALE COMPUTING FELLOW 3 OCTOBER 2016

AMD ACCELERATING TECHNOLOGIES FOR EXASCALE COMPUTING FELLOW 3 OCTOBER 2016 AMD ACCELERATING TECHNOLOGIES FOR EXASCALE COMPUTING BILL.BRANTLEY@AMD.COM, FELLOW 3 OCTOBER 2016 AMD S VISION FOR EXASCALE COMPUTING EMBRACING HETEROGENEITY CHAMPIONING OPEN SOLUTIONS ENABLING LEADERSHIP

More information

EPYC VIDEO CUG 2018 MAY 2018

EPYC VIDEO CUG 2018 MAY 2018 AMD UPDATE CUG 2018 EPYC VIDEO CRAY AND AMD PAST SUCCESS IN HPC AMD IN TOP500 LIST 2002 TO 2011 2011 - AMD IN FASTEST MACHINES IN 11 COUNTRIES ZEN A FRESH APPROACH Designed from the Ground up for Optimal

More information

Newest generation of HP ProLiant DL380 takes #1 position overall on Oracle E-Business Suite Small Model Benchmark

Newest generation of HP ProLiant DL380 takes #1 position overall on Oracle E-Business Suite Small Model Benchmark Newest generation of HP ProLiant DL380 takes #1 position overall on Oracle E-Business Suite Small Model Benchmark ProLiant DL380 G6 uses latest Intel Xeon X5570 technology for ultimate performance HP Leadership

More information

Run Anywhere. The Hardware Platform Perspective. Ben Pollan, AMD Java Labs October 28, 2008

Run Anywhere. The Hardware Platform Perspective. Ben Pollan, AMD Java Labs October 28, 2008 Run Anywhere The Hardware Platform Perspective Ben Pollan, AMD Java Labs October 28, 2008 Agenda Java Labs Introduction Community Collaboration Performance Optimization Recommendations Leveraging the Latest

More information

DR. LISA SU

DR. LISA SU CAUTIONARY STATEMENT This presentation contains forward-looking statements concerning Advanced Micro Devices, Inc. (AMD) including, but not limited to AMD s strategy and focus, expected datacenter total

More information

CAUTIONARY STATEMENT This presentation contains forward-looking statements concerning Advanced Micro Devices, Inc. (AMD) including, but not limited to the features, functionality, availability, timing,

More information

AMD APU and Processor Comparisons. AMD Client Desktop Feb 2013 AMD

AMD APU and Processor Comparisons. AMD Client Desktop Feb 2013 AMD AMD APU and Processor Comparisons AMD Client Desktop Feb 2013 AMD SUMMARY 3DMark released Feb 4, 2013 Contains DirectX 9, DirectX 10, and DirectX 11 tests AMD s current product stack features DirectX 11

More information

FUSION PROCESSORS AND HPC

FUSION PROCESSORS AND HPC FUSION PROCESSORS AND HPC Chuck Moore AMD Corporate Fellow & Technology Group CTO June 14, 2011 Fusion Processors and HPC Today: Multi-socket x86 CMPs + optional dgpu + high BW memory Fusion APUs (SPFP)

More information

ABySS Performance Benchmark and Profiling. May 2010

ABySS Performance Benchmark and Profiling. May 2010 ABySS Performance Benchmark and Profiling May 2010 Note The following research was performed under the HPC Advisory Council activities Participating vendors: AMD, Dell, Mellanox Compute resource - HPC

More information

SIMULATOR AMD RESEARCH JUNE 14, 2015

SIMULATOR AMD RESEARCH JUNE 14, 2015 AMD'S gem5apu SIMULATOR AMD RESEARCH JUNE 14, 2015 OVERVIEW Introducing AMD s gem5 APU Simulator Extends gem5 with a GPU timing model Supports Heterogeneous System Architecture in SE mode Includes several

More information

The Rise of Open Programming Frameworks. JC BARATAULT IWOCL May 2015

The Rise of Open Programming Frameworks. JC BARATAULT IWOCL May 2015 The Rise of Open Programming Frameworks JC BARATAULT IWOCL May 2015 1,000+ OpenCL projects SourceForge GitHub Google Code BitBucket 2 TUM.3D Virtual Wind Tunnel 10K C++ lines of code, 30 GPU kernels CUDA

More information

AMD Server/Workstation Customer Presentation

AMD Server/Workstation Customer Presentation AMD Server/Workstation Customer Presentation Systemhaus Maitschke Inh. Gerald Maitschke Tel. +49 89 94004804 Fax. +49 89 71034015 Mobil. +49 171 3357041 gerald@maitschke.de www.maitschke.de Agenda AMD

More information

The Road to the AMD. Fiji GPU. Featuring Die Stacking and HBM Technology 1 THE ROAD TO THE AMD FIJI GPU ECTC 2016 MAY 2015

The Road to the AMD. Fiji GPU. Featuring Die Stacking and HBM Technology 1 THE ROAD TO THE AMD FIJI GPU ECTC 2016 MAY 2015 The Road to the AMD Fiji GPU Featuring Die Stacking and HBM Technology 1 THE ROAD TO THE AMD FIJI GPU ECTC 2016 MAY 2015 Fiji Chip DETAILED LOOK 4GB High-Bandwidth Memory 4096-bit wide interface 512 GB/s

More information

CAUTIONARY STATEMENT This presentation contains forward-looking statements concerning Advanced Micro Devices, Inc. (AMD) including, but not limited to

CAUTIONARY STATEMENT This presentation contains forward-looking statements concerning Advanced Micro Devices, Inc. (AMD) including, but not limited to CAUTIONARY STATEMENT This presentation contains forward-looking statements concerning Advanced Micro Devices, Inc. (AMD) including, but not limited to AMD s positioning in the datacenter market; expected

More information

Performance and Energy Efficiency of the 14 th Generation Dell PowerEdge Servers

Performance and Energy Efficiency of the 14 th Generation Dell PowerEdge Servers Performance and Energy Efficiency of the 14 th Generation Dell PowerEdge Servers This white paper details the performance improvements of Dell PowerEdge servers with the Intel Xeon Processor Scalable CPU

More information

OPENCL TM APPLICATION ANALYSIS AND OPTIMIZATION MADE EASY WITH AMD APP PROFILER AND KERNELANALYZER

OPENCL TM APPLICATION ANALYSIS AND OPTIMIZATION MADE EASY WITH AMD APP PROFILER AND KERNELANALYZER OPENCL TM APPLICATION ANALYSIS AND OPTIMIZATION MADE EASY WITH AMD APP PROFILER AND KERNELANALYZER Budirijanto Purnomo AMD Technical Lead, GPU Compute Tools PRESENTATION OVERVIEW Motivation AMD APP Profiler

More information

THE PROGRAMMER S GUIDE TO THE APU GALAXY. Phil Rogers, Corporate Fellow AMD

THE PROGRAMMER S GUIDE TO THE APU GALAXY. Phil Rogers, Corporate Fellow AMD THE PROGRAMMER S GUIDE TO THE APU GALAXY Phil Rogers, Corporate Fellow AMD THE OPPORTUNITY WE ARE SEIZING Make the unprecedented processing capability of the APU as accessible to programmers as the CPU

More information

AMD CORPORATE TEMPLATE AMD Radeon Open Compute Platform Felix Kuehling

AMD CORPORATE TEMPLATE AMD Radeon Open Compute Platform Felix Kuehling AMD Radeon Open Compute Platform Felix Kuehling ROCM PLATFORM ON LINUX Compiler Front End AMDGPU Driver Enabled with ROCm GCN Assembly Device LLVM Compiler (GCN) LLVM Opt Passes GCN Target Host LLVM Compiler

More information

ROCm: An open platform for GPU computing exploration

ROCm: An open platform for GPU computing exploration UCX-ROCm: ROCm Integration into UCX {Khaled Hamidouche, Brad Benton}@AMD Research ROCm: An open platform for GPU computing exploration 1 JUNE, 2018 ISC ROCm Software Platform An Open Source foundation

More information

SCALING DGEMM TO MULTIPLE CAYMAN GPUS AND INTERLAGOS MANY-CORE CPUS FOR HPL

SCALING DGEMM TO MULTIPLE CAYMAN GPUS AND INTERLAGOS MANY-CORE CPUS FOR HPL SCALING DGEMM TO MULTIPLE CAYMAN GPUS AND INTERLAGOS MANY-CORE CPUS FOR HPL Matthias Bach and David Rohr Frankfurt Institute for Advanced Studies Goethe University of Frankfurt I: INTRODUCTION 3 Scaling

More information

The Next Revolution in Computer Systems Architecture

The Next Revolution in Computer Systems Architecture The Next Revolution in Computer Systems Architecture Richard Oehler Corporate Fellow Office of the CTO University of Mannheim 2/08/07 Computer Systems Architecture Not just the Processor Chip It s all

More information

HPCA 18. Reliability-aware Data Placement for Heterogeneous memory Architecture

HPCA 18. Reliability-aware Data Placement for Heterogeneous memory Architecture HPCA 18 Reliability-aware Data Placement for Heterogeneous memory Architecture Manish Gupta Ψ, Vilas Sridharan*, David Roberts*, Andreas Prodromou Ψ, Ashish Venkat Ψ, Dean Tullsen Ψ, Rajesh Gupta Ψ Ψ *

More information

FLASH MEMORY SUMMIT Adoption of Caching & Hybrid Solutions

FLASH MEMORY SUMMIT Adoption of Caching & Hybrid Solutions FLASH MEMORY SUMMIT 2011 Adoption of Caching & Hybrid Solutions Market Overview 2009 Flash production reached parity with all other existing solid state memories in terms of bites. 2010 Overall flash production

More information

AMD Graphics Team Last Updated February 11, 2013 APPROVED FOR PUBLIC DISTRIBUTION. 1 3DMark Overview February 2013 Approved for public distribution

AMD Graphics Team Last Updated February 11, 2013 APPROVED FOR PUBLIC DISTRIBUTION. 1 3DMark Overview February 2013 Approved for public distribution AMD Graphics Team Last Updated February 11, 2013 APPROVED FOR PUBLIC DISTRIBUTION 1 3DMark Overview February 2013 Approved for public distribution 2 3DMark Overview February 2013 Approved for public distribution

More information

AMD 780G. Niles Burbank AMD. an x86 chipset with advanced integrated GPU. Hot Chips 2008

AMD 780G. Niles Burbank AMD. an x86 chipset with advanced integrated GPU. Hot Chips 2008 AMD 780G an x86 chipset with advanced integrated GPU Hot Chips 2008 Niles Burbank AMD Agenda Evolving PC expectations AMD 780G Overview Design Challenges Video Playback Support Display Capabilities Power

More information

AMD Graphics Team Last Updated April 29, 2013 APPROVED FOR PUBLIC DISTRIBUTION. 1 3DMark Overview April 2013 Approved for public distribution

AMD Graphics Team Last Updated April 29, 2013 APPROVED FOR PUBLIC DISTRIBUTION. 1 3DMark Overview April 2013 Approved for public distribution AMD Graphics Team Last Updated April 29, 2013 APPROVED FOR PUBLIC DISTRIBUTION 1 3DMark Overview April 2013 Approved for public distribution 2 3DMark Overview April 2013 Approved for public distribution

More information

Panel Discussion: The Future of I/O From a CPU Architecture Perspective

Panel Discussion: The Future of I/O From a CPU Architecture Perspective Panel Discussion: The Future of I/O From a CPU Architecture Perspective Brad Benton AMD, Inc. #OFADevWorkshop Issues Move to Exascale involves more parallel processing across more processing elements GPUs,

More information

Quad-Core Intel Xeon Processor 5400 Series

Quad-Core Intel Xeon Processor 5400 Series Platform Brief Intel Xeon Processor Technology Quad-Core Intel Xeon Processor 5400 Series Boost Performance and Energy Efficiency with New 45nm Multi-core Intel Xeon Processors Now featuring Intel s second-generation

More information

EFFICIENT SPARSE MATRIX-VECTOR MULTIPLICATION ON GPUS USING THE CSR STORAGE FORMAT

EFFICIENT SPARSE MATRIX-VECTOR MULTIPLICATION ON GPUS USING THE CSR STORAGE FORMAT EFFICIENT SPARSE MATRIX-VECTOR MULTIPLICATION ON GPUS USING THE CSR STORAGE FORMAT JOSEPH L. GREATHOUSE, MAYANK DAGA AMD RESEARCH 11/20/2014 THIS TALK IN ONE SLIDE Demonstrate how to save space and time

More information

INTERFERENCE FROM GPU SYSTEM SERVICE REQUESTS

INTERFERENCE FROM GPU SYSTEM SERVICE REQUESTS INTERFERENCE FROM GPU SYSTEM SERVICE REQUESTS ARKAPRAVA BASU, JOSEPH L. GREATHOUSE, GURU VENKATARAMANI, JÁN VESELÝ AMD RESEARCH, ADVANCED MICRO DEVICES, INC. MODERN SYSTEMS ARE POWERED BY HETEROGENEITY

More information

Understanding GPGPU Vector Register File Usage

Understanding GPGPU Vector Register File Usage Understanding GPGPU Vector Register File Usage Mark Wyse AMD Research, Advanced Micro Devices, Inc. Paul G. Allen School of Computer Science & Engineering, University of Washington AGENDA GPU Architecture

More information

Resource Saving: Latest Innovation in Optimized Cloud Infrastructure

Resource Saving: Latest Innovation in Optimized Cloud Infrastructure Resource Saving: Latest Innovation in Optimized Cloud Infrastructure CloudFest 2018 Presented by Martin Galle, Director FAE We Keep ITSupermicro Green 2018 Cloud Computing Development Technology Evolution

More information

CAUTIONARY STATEMENT 1 AMD NEXT HORIZON NOVEMBER 6, 2018

CAUTIONARY STATEMENT 1 AMD NEXT HORIZON NOVEMBER 6, 2018 CAUTIONARY STATEMENT This presentation contains forward-looking statements concerning Advanced Micro Devices, Inc. (AMD) including, but not limited to AMD s positioning in the datacenter market; expected

More information

3D Numerical Analysis of Two-Phase Immersion Cooling for Electronic Components

3D Numerical Analysis of Two-Phase Immersion Cooling for Electronic Components 3D Numerical Analysis of Two-Phase Immersion Cooling for Electronic Components Xudong An, Manish Arora, Wei Huang, William C. Brantley, Joseph L. Greathouse AMD Research Advanced Micro Devices, Inc. MOTIVATION

More information

Nehalem Hochleistungsrechnen für reale Anwendungen

Nehalem Hochleistungsrechnen für reale Anwendungen Nehalem Hochleistungsrechnen für reale Anwendungen T-Systems HPCN Workshop DLR Braunschweig May 14-15, 2009 Hans-Joachim Plum Intel GmbH 1 Performance tests and ratings are measured using specific computer

More information

IBM System x servers. Innovation comes standard

IBM System x servers. Innovation comes standard IBM System x servers Innovation comes standard IBM System x servers Highlights Build a cost-effective, flexible IT environment with IBM X-Architecture technology. Achieve maximum performance per watt with

More information

ADVANCED RENDERING EFFECTS USING OPENCL TM AND APU Session Olivier Zegdoun AMD Sr. Software Engineer

ADVANCED RENDERING EFFECTS USING OPENCL TM AND APU Session Olivier Zegdoun AMD Sr. Software Engineer ADVANCED RENDERING EFFECTS USING OPENCL TM AND APU Session 2117 Olivier Zegdoun AMD Sr. Software Engineer CONTENTS Rendering Effects Before Fusion: single discrete GPU case Before Fusion: multiple discrete

More information

Intel Enterprise Processors Technology

Intel Enterprise Processors Technology Enterprise Processors Technology Kosuke Hirano Enterprise Platforms Group March 20, 2002 1 Agenda Architecture in Enterprise Xeon Processor MP Next Generation Itanium Processor Interconnect Technology

More information

Agenda. Sun s x Sun s x86 Strategy. 2. Sun s x86 Product Portfolio. 3. Virtualization < 1 >

Agenda. Sun s x Sun s x86 Strategy. 2. Sun s x86 Product Portfolio. 3. Virtualization < 1 > Agenda Sun s x86 1. Sun s x86 Strategy 2. Sun s x86 Product Portfolio 3. Virtualization < 1 > 1. SUN s x86 Strategy Customer Challenges Power and cooling constraints are very real issues Energy costs are

More information

Anatomy of AMD s TeraScale Graphics Engine

Anatomy of AMD s TeraScale Graphics Engine Anatomy of AMD s TeraScale Graphics Engine Mike Houston Design Goals Focus on Efficiency f(perf/watt, Perf/$) Scale up processing power and AA performance Target >2x previous generation Enhance stream

More information

IBM System x servers. Innovation comes standard

IBM System x servers. Innovation comes standard IBM System x servers Innovation comes standard IBM System x servers Highlights Build a cost-effective, flexible IT environment with IBM X-Architecture technology. Achieve maximum performance per watt with

More information

AMD IOMMU VERSION 2 How KVM will use it. Jörg Rödel August 16th, 2011

AMD IOMMU VERSION 2 How KVM will use it. Jörg Rödel August 16th, 2011 AMD IOMMU VERSION 2 How KVM will use it Jörg Rödel August 16th, 2011 AMD IOMMU VERSION 2 WHAT S NEW? 2 AMD IOMMU Version 2 Support in KVM August 16th, 2011 Public NEW FEATURES - OVERVIEW Two-level page

More information

Driving Leading Edge Microprocessor Technology

Driving Leading Edge Microprocessor Technology Driving Leading Edge Microprocessor Technology Dr. Hans Deppe Corporate Vice President & General Manager AMD in Dresden AMD Overview A leading global supplier of innovative semiconductor solutions for

More information

IBM System x family brochure

IBM System x family brochure IBM Systems and Technology Group System x IBM System x family brochure IBM System x rack and tower servers 2 IBM System x family brochure IBM System x servers Highlights IBM System x and BladeCenter servers

More information

Scheduling Strategies for HPC as a Service (HPCaaS) for Bio-Science Applications

Scheduling Strategies for HPC as a Service (HPCaaS) for Bio-Science Applications Scheduling Strategies for HPC as a Service (HPCaaS) for Bio-Science Applications Sep 2009 Gilad Shainer, Tong Liu (Mellanox); Jeffrey Layton (Dell); Joshua Mora (AMD) High Performance Interconnects for

More information

MEASURING AND MODELING ON-CHIP INTERCONNECT POWER ON REAL HARDWARE

MEASURING AND MODELING ON-CHIP INTERCONNECT POWER ON REAL HARDWARE MEASURING AND MODELING ON-CHIP INTERCONNECT POWER ON REAL HARDWARE VIGNESH ADHINARAYANAN, INDRANI PAUL, JOSEPH L. GREATHOUSE, WEI HUANG, ASHUTOSH PATTNAIK, WU-CHUN FENG POWER AND ENERGY ARE FIRST-CLASS

More information

オープンソ プンソース技術者のための AMD 最新テクノロジーアップデート 日本 AMD 株式会社 マーケティング ビジネス開発本部 エンタープライズプロダクトマーケティング部 山野 洋幸

オープンソ プンソース技術者のための AMD 最新テクノロジーアップデート 日本 AMD 株式会社 マーケティング ビジネス開発本部 エンタープライズプロダクトマーケティング部 山野 洋幸 AMD AMD CPU 2 Happy 6 th Birthday AMD Opteron Processor 3 6コア Istanbul : 完全な進捗状況 Executing months ahead of schedule In collaboration with GLOBALFOUNDRIES: first tapeout to production World s only six-core

More information

Multi-core processors are here, but how do you resolve data bottlenecks in native code?

Multi-core processors are here, but how do you resolve data bottlenecks in native code? Multi-core processors are here, but how do you resolve data bottlenecks in native code? hint: it s all about locality Michael Wall October, 2008 part I of II: System memory 2 PDC 2008 October 2008 Session

More information

MM5 Modeling System Performance Research and Profiling. March 2009

MM5 Modeling System Performance Research and Profiling. March 2009 MM5 Modeling System Performance Research and Profiling March 2009 Note The following research was performed under the HPC Advisory Council activities AMD, Dell, Mellanox HPC Advisory Council Cluster Center

More information

FUSION1200 Scalable x86 SMP System

FUSION1200 Scalable x86 SMP System FUSION1200 Scalable x86 SMP System Introduction Life Sciences Departmental System Manufacturing (CAE) Departmental System Competitive Analysis: IBM x3950 Competitive Analysis: SUN x4600 / SUN x4600 M2

More information

Quad-Core Intel Xeon Processor-based 3200/3210 Chipset Server Platforms

Quad-Core Intel Xeon Processor-based 3200/3210 Chipset Server Platforms Quad-Core Intel Xeon Processor-based 3200/3210 Chipset Server Platforms Entry-level server platforms with outstanding dependability, performance, and value Trust your company to Intel's proven server technology

More information

Sun Fire X4170 M2 Server Frequently Asked Questions

Sun Fire X4170 M2 Server Frequently Asked Questions Overview Faced with ever increasing computing needs and budget constraints, companies today want to set up infrastructures that offer optimal value, can easily be re-purposed, and have reduced complexity.

More information

HP ProLiant DL580 G5. HP ProLiant BL680c G5. IBM p570 POWER6. Fujitsu Siemens PRIMERGY RX600 S4. Egenera BladeFrame PB400003R.

HP ProLiant DL580 G5. HP ProLiant BL680c G5. IBM p570 POWER6. Fujitsu Siemens PRIMERGY RX600 S4. Egenera BladeFrame PB400003R. HP ProLiant DL58 G5 earns #1 overall four-processor performance; ProLiant BL68c takes #2 four-processor performance on Windows in two-tier SAP Sales and Distribution Standard Application Benchmark HP leadership

More information

C-Series Servers Meat and Tators

C-Series Servers Meat and Tators C-Series Servers Meat and Tators Jimmy Ray Purser Chief Geek, TechWiseTV 2006 Cisco Systems, Inc. All rights reserved. 1 When it comes to Servers 2006 Cisco Systems, Inc. All rights reserved. 2 Unified

More information

viewdle! - machine vision experts

viewdle! - machine vision experts viewdle! - machine vision experts topic using algorithmic metadata creation and heterogeneous computing to build the personal content management system of the future Page 2 Page 3 video of basic recognition

More information

HP ProLiant BL35p Server Blade

HP ProLiant BL35p Server Blade Data sheet The new HP ProLiant BL35p two-way Server Blade delivers uncompromising manageability, maximum compute density and breakthrough power efficiencies to the high-performance data centre. The ProLiant

More information

Performance of the AMD Opteron LS21 for IBM BladeCenter

Performance of the AMD Opteron LS21 for IBM BladeCenter August 26 Performance Analysis Performance of the AMD Opteron LS21 for IBM BladeCenter Douglas M. Pase and Matthew A. Eckl IBM Systems and Technology Group Page 2 Abstract In this paper we examine the

More information

Linux Networx HPC Strategy and Roadmap

Linux Networx HPC Strategy and Roadmap Linux Networx HPC Strategy and Roadmap Eric Pitcher October 2006 Agenda Business Update Technology Trends Linux Networx Drivers Hardware Roadmap Software Highlights Linux Networx Overview Founded in 1989,

More information

Broadcast-Quality, High-Density HEVC Encoding with AMD EPYC Processors

Broadcast-Quality, High-Density HEVC Encoding with AMD EPYC Processors Solution Brief December, 2018 2018 Broadcast-Quality, High-Density HEVC Encoding with AMD EPYC Processors HIGHLIGHTS o The AMD EPYC SoC brings a new balance to the datacenter. Utilizing an x86-architecture,

More information

Use cases. Faces tagging in photo and video, enabling: sharing media editing automatic media mashuping entertaining Augmented reality Games

Use cases. Faces tagging in photo and video, enabling: sharing media editing automatic media mashuping entertaining Augmented reality Games Viewdle Inc. 1 Use cases Faces tagging in photo and video, enabling: sharing media editing automatic media mashuping entertaining Augmented reality Games 2 Why OpenCL matter? OpenCL is going to bring such

More information

IBM System x3455 AMD Opteron SMP 1 U server features Xcelerated Memory Technology to meet the needs of HPC environments

IBM System x3455 AMD Opteron SMP 1 U server features Xcelerated Memory Technology to meet the needs of HPC environments IBM Europe Announcement ZG07-0492, dated July 17, 2007 IBM System x3455 AMD Opteron SMP 1 U server features Xcelerated Memory Technology to meet the needs of HPC environments Key prerequisites...2 Description...3

More information

Mellanox Technologies Maximize Cluster Performance and Productivity. Gilad Shainer, October, 2007

Mellanox Technologies Maximize Cluster Performance and Productivity. Gilad Shainer, October, 2007 Mellanox Technologies Maximize Cluster Performance and Productivity Gilad Shainer, shainer@mellanox.com October, 27 Mellanox Technologies Hardware OEMs Servers And Blades Applications End-Users Enterprise

More information

Intelligent servers- Lower TCO, Rapid ROI and More Performance

Intelligent servers- Lower TCO, Rapid ROI and More Performance Intelligent servers- Lower TCO, Rapid ROI and More Performance Neil Lin Solutions Specialist Enterprise Solutions Sales Intel Xeon Processor 5500 Series: Transforming Computing Intelligent Platform World

More information

INCREASING DENSITY AND SIMPLIFYING SETUP WITH INTEL PROCESSOR-POWERED DELL POWEREDGE FX2 ARCHITECTURE

INCREASING DENSITY AND SIMPLIFYING SETUP WITH INTEL PROCESSOR-POWERED DELL POWEREDGE FX2 ARCHITECTURE INCREASING DENSITY AND SIMPLIFYING SETUP WITH INTEL PROCESSOR-POWERED DELL POWEREDGE FX2 ARCHITECTURE As your business grows, it faces the challenge of deploying, operating, powering, and maintaining an

More information

Introducing NVDIMM-X: Designed to be the World s Fastest NAND-Based SSD Architecture and a Platform for the Next Generation of New Media SSDs

Introducing NVDIMM-X: Designed to be the World s Fastest NAND-Based SSD Architecture and a Platform for the Next Generation of New Media SSDs , Inc. Introducing NVDIMM-X: Designed to be the World s Fastest NAND-Based SSD Architecture and a Platform for the Next Generation of New Media SSDs Doug Finke Director of Product Marketing September 2016

More information

Accelerating HPC. (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing

Accelerating HPC. (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing Accelerating HPC (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing SAAHPC, Knoxville, July 13, 2010 Legal Disclaimer Intel may make changes to specifications and product

More information

Automatic Intra-Application Load Balancing for Heterogeneous Systems

Automatic Intra-Application Load Balancing for Heterogeneous Systems Automatic Intra-Application Load Balancing for Heterogeneous Systems Michael Boyer, Shuai Che, and Kevin Skadron Department of Computer Science University of Virginia Jayanth Gummaraju and Nuwan Jayasena

More information

Performance of Mellanox ConnectX Adapter on Multi-core Architectures Using InfiniBand. Abstract

Performance of Mellanox ConnectX Adapter on Multi-core Architectures Using InfiniBand. Abstract Performance of Mellanox ConnectX Adapter on Multi-core Architectures Using InfiniBand Abstract...1 Introduction...2 Overview of ConnectX Architecture...2 Performance Results...3 Acknowledgments...7 For

More information

Exchange Server 2007 Performance Comparison of the Dell PowerEdge 2950 and HP Proliant DL385 G2 Servers

Exchange Server 2007 Performance Comparison of the Dell PowerEdge 2950 and HP Proliant DL385 G2 Servers Exchange Server 2007 Performance Comparison of the Dell PowerEdge 2950 and HP Proliant DL385 G2 Servers By Todd Muirhead Dell Enterprise Technology Center Dell Enterprise Technology Center dell.com/techcenter

More information

Application Performance on Dual Processor Cluster Nodes

Application Performance on Dual Processor Cluster Nodes Application Performance on Dual Processor Cluster Nodes by Kent Milfeld milfeld@tacc.utexas.edu edu Avijit Purkayastha, Kent Milfeld, Chona Guiang, Jay Boisseau TEXAS ADVANCED COMPUTING CENTER Thanks Newisys

More information

RegMutex: Inter-Warp GPU Register Time-Sharing

RegMutex: Inter-Warp GPU Register Time-Sharing RegMutex: Inter-Warp GPU Register Time-Sharing Farzad Khorasani* Hodjat Asghari Esfeden Amin Farmahini-Farahani Nuwan Jayasena Vivek Sarkar *farkhor@gatech.edu The 45 th International Symposium on Computer

More information

Intel Workstation Platforms

Intel Workstation Platforms Product Brief Intel Workstation Platforms Intel Workstation Platforms For intelligent performance, automated energy efficiency, and flexible expandability How quickly can you change complex data into actionable

More information

IBM System x family brochure

IBM System x family brochure IBM Systems and Technology System x IBM System x family brochure IBM System x rack and tower servers 2 IBM System x family brochure IBM System x servers Highlights IBM System x and BladeCenter servers

More information

Agenda. Quad-Core AMD Opteron TM. Long Term Roadmap AMD HPC WW. Worldwide High Performance Computing

Agenda. Quad-Core AMD Opteron TM. Long Term Roadmap AMD HPC WW. Worldwide High Performance Computing AMD WW HPC 07 Agenda Quad-Core AMD Opteron TM Long Term Roadmap AMD HPC WW 2 2007 Worldwide High Performance Computing Agenda Quad-Core AMD Opteron TM Architecture Benchmarks Tools Long Term Roadmap AMD

More information

WHITE PAPER FUJITSU PRIMERGY SERVERS PERFORMANCE REPORT PRIMERGY BX924 S2

WHITE PAPER FUJITSU PRIMERGY SERVERS PERFORMANCE REPORT PRIMERGY BX924 S2 WHITE PAPER PERFORMANCE REPORT PRIMERGY BX924 S2 WHITE PAPER FUJITSU PRIMERGY SERVERS PERFORMANCE REPORT PRIMERGY BX924 S2 This document contains a summary of the benchmarks executed for the PRIMERGY BX924

More information

Robert Jamieson. Robs Techie PP Everything in this presentation is at your own risk!

Robert Jamieson. Robs Techie PP Everything in this presentation is at your own risk! Robert Jamieson Robs Techie PP Everything in this presentation is at your own risk! PC s Today Basic Setup Hardware pointers PCI Express How will it effect you Basic Machine Setup Set the swap space Min

More information

Virtual Leverage: Server Consolidation in Open Source Environments. Margaret Lewis Commercial Software Strategist AMD

Virtual Leverage: Server Consolidation in Open Source Environments. Margaret Lewis Commercial Software Strategist AMD Virtual Leverage: Server Consolidation in Open Source Environments Margaret Lewis Commercial Software Strategist AMD What Is Virtualization? Abstraction of Hardware Components Virtual Memory Virtual Volume

More information

IBM Power Systems HPC Cluster

IBM Power Systems HPC Cluster IBM Power Systems HPC Cluster Highlights Complete and fully Integrated HPC cluster for demanding workloads Modular and Extensible: match components & configurations to meet demands Integrated: racked &

More information

CAUTIONARY STATEMENT 1 EPYC PROCESSOR ONE YEAR ANNIVERSARY JUNE 2018

CAUTIONARY STATEMENT 1 EPYC PROCESSOR ONE YEAR ANNIVERSARY JUNE 2018 CAUTIONARY STATEMENT This presentation contains forward-looking statements concerning Advanced Micro Devices, Inc. (AMD) including, but not limited to, the features, functionality, availability, timing,

More information

Scaling to Petaflop. Ola Torudbakken Distinguished Engineer. Sun Microsystems, Inc

Scaling to Petaflop. Ola Torudbakken Distinguished Engineer. Sun Microsystems, Inc Scaling to Petaflop Ola Torudbakken Distinguished Engineer Sun Microsystems, Inc HPC Market growth is strong CAGR increased from 9.2% (2006) to 15.5% (2007) Market in 2007 doubled from 2003 (Source: IDC

More information

Linux User Group of Davis. Marc J. Miller Strategic Alliance Manager, AMD October 7, 2003

Linux User Group of Davis. Marc J. Miller Strategic Alliance Manager, AMD October 7, 2003 Linux User Group of Davis Marc J. Miller Strategic Alliance Manager, AMD October 7, 2003 x86 in High Performance Computing - The Six System Challenges #6: Watt density: x86 is the most widely installed

More information

Intel Enterprise Solutions

Intel Enterprise Solutions Intel Enterprise Solutions Catalin Morosanu Business Development Manager High Performance Computing catalin.morosanu@intel.com Intel s figures 2003/Q104 Revenue 2003: $ 31 billion first Quarter 2004: $

More information

Key Considerations for Improving Performance And Virtualization in Microsoft SQL Server Environments

Key Considerations for Improving Performance And Virtualization in Microsoft SQL Server Environments Key Considerations for Improving Performance And Virtualization in Microsoft SQL Server Environments Table of Contents Maximizing Performance in SQL Server Environments............... 4 Focusing on Hardware...........................................

More information

Key results at a glance:

Key results at a glance: HP ProLiant BL680c G5 server blade takes world record for excellent performance for four-processor server on three-tier SAP SD Standard Application Benchmark with Microsoft Windows 2008. The HP Difference

More information

From Dual Core to Multi-Core, is everybody ready?

From Dual Core to Multi-Core, is everybody ready? From Dual Core to Multi-Core, is everybody ready? Dr Ben Bennett Director, Intel HPC September 12 th 2005, PPAM Legal Disclaimer LEGAL INFORMATION: THIS DOCUMENT AND RELATED MATERIALS AND INFORMATION ARE

More information

VIA ProSavageDDR KM266 Chipset

VIA ProSavageDDR KM266 Chipset VIA ProSavageDDR KM266 Chipset High Performance Integrated DDR platform for the AMD Athlon XP Page 1 The VIA ProSavageDDR KM266: High Performance Integrated DDR platform for the AMD Athlon XP processor

More information

Meet the Increased Demands on Your Infrastructure with Dell and Intel. ServerWatchTM Executive Brief

Meet the Increased Demands on Your Infrastructure with Dell and Intel. ServerWatchTM Executive Brief Meet the Increased Demands on Your Infrastructure with Dell and Intel ServerWatchTM Executive Brief a QuinStreet Excutive Brief. 2012 Doing more with less is the mantra that sums up much of the past decade,

More information

Accelerating Applications. the art of maximum performance computing James Spooner Maxeler VP of Acceleration

Accelerating Applications. the art of maximum performance computing James Spooner Maxeler VP of Acceleration Accelerating Applications the art of maximum performance computing James Spooner Maxeler VP of Acceleration Introduction The Process The Tools Case Studies Summary What do we mean by acceleration? How

More information