Analyzing Performance and Power of Applications on GPUs with Dell 12G Platforms. Dr. Jeffrey Layton Enterprise Technologist HPC

Size: px
Start display at page:

Download "Analyzing Performance and Power of Applications on GPUs with Dell 12G Platforms. Dr. Jeffrey Layton Enterprise Technologist HPC"

Transcription

1 Analyzing Performance and Power of Applications on GPUs with Dell 12G Platforms Dr. Jeffrey Layton Enterprise Technologist HPC

2 Why GPUs? GPUs have very high peak compute capability! 6-9X CPU Challenges How feed enough data? Need to port applications! Example: Tesla M2090 GPU Cores 512 Memory 6 GB Memory BW GB/s Peak Performance Single Precision 1331 GFLOPs Double Precision 665 GFLOPs Tesla M2090 GPU 2

3 How can they be used? M610x Inside the server Limited Space, few can fit Limited Power, few can run Difficult to replace Outside the server Pros: Flexibility, Multiple GPUs GPUs can be shared Multiple Host Servers Cons: Oversubscription may limit performance Host GPU C410x 3

4 The Problem: Best Design Parameters are Unknown How many GPUs per server is ideal for my application? How much bandwidth do I need per GPU for a typical users? How does performance scale with increasing number of nodes? How does performance scale with increasing number of GPUs/node? What problem size is most suitable for GPU computing? What is the impact on power consumption and performance/watt? Etc. Etc. Even if you know some of your design parameters, They may change due to improved GPUs, CPUs, GPU drivers, Software/Algorithm redesign etc. 4

5 GPU Enabled Product Portfolio

6 Overview GPU enabled products throughput the portfolio Learn on a laptop Develop/Test on workstations Production on servers 6

7 Laptops Learn GPU programming Buisness laptop: E6520 Nvidia NVS 4200M XPS 15: 48 CUDA cores 512MB memory Nvidia Geforce GT540M 96 CUDA cores 2GB memory 7

8 Workstations Develop/Tune Applications Portable Workstation M display (1920 x 1080) Up to 16GB memory Intel quad-core i7 (Ivy Bridge coming soon) Up to 3 hard drives Quadro 3000M 240 CUDA cores 2GB GDDR5 memory Quadro 5010M 96 CUDA cores 4GB GDDR5 memory T7500: Tower Case Dual-socket Intel Westmere (X56-- processors) Up to 192GB memory (12 DIMM slots) Two PCIe Gen2 x16 slots Five internal SATA or SAS drives RAID cards available GigE on-board, optional 10GigE cards 8

9 Rackable Workstation Develop/Tune R5500: 2U rackable workstation Dual-socket Westmere 12 DIMM slots (up to 192GB) Up to 5 SATA or 6 SAS drives (2.5 ) Two PCIe Gen2 x16 slots Tesla C2070 is an example 9

10 Dell M610x blade Half-height blade (5U) 2S Westmere 12 DIMM slots (192GB memory) Mezz card for QDR IB, 10GigE One double-wide GPU per blade 10

11 Dell PowerEdge C410x Power & Flexibility Basically, Room and board for 16 GPUs Theoretical Max. of 16.5 TFLOPs Connects up to 8 hosts Connects up to 16 PCIe Gen-2 devices (GPGPUs) to hosts High density, 3U chassis. Flexibility to selecting number of GPGPUs Individually serviceable modules N W Power supplies (3+1) N+1 92mm Cooling fans (7+1) PCIe switches 8 PEX PEX

12 Dell PowerEdge C410x Sixteen (16) x16 Gen-2 Modules - PCIe Gen-2 x16 compliant - Independently serviceable LED and On/Off GPU card Power connector for GPGPU card Board-to-board connector for X16 Gen 2 PCIe signals and power Dell Research Computing 1

13 Learn more about Dell PEC C410x How can you dynamically allocate GPUs to host nodes using the Dell PEC C410x? Learn more at session S0309 (Thursday 10:30 Room K) Dynamically Allocating GPGPU to Host Nodes (servers) 13

14 Dell PEC C6100: 4 2S in one chassis Four 2-Socket Nodes in 2U Intel Westmere-EP Each Node: 12 DIMMs each 2 GigE (Intel) 1 Daughter Card (PCIe x8) QDR IB or 10GigE One PCIe x16 (half-length, half-height) Optional SAS controller (in-place of IB) Chassis Design: Hot Plug, Individual Nodes Up to 12 x 3.5 drives (3 per node) Up to 24 x 2.5 drives (6 per node) N+1 Power supplies (1100W or 1400W) NVIDIA HIC certified 14

15 Dell PEC C6145: Two AMD 4S in 2U Two 4-Socket Nodes in 2U 4S AMD Opteron 6200 series Each Node: 32 x DDR3 RDIMMs 2 x GbE Intel 1 x8 Gen II (custom mezzanine slot) QDR IB or 10GigE 3 x16 Gen II (low-profile, half height/half-length) Chassis Design: Hot Plug, Individual Nodes 24 x 2.5 or 12 x 3.5 HDD Redundant Power supplies (1100W or 1400W) Embedded x16 HIC and slots for additional HICs 15

16 Dell PEC C6220 Four 2S Sandy Bridge in 2U Four 2-Socket Nodes in 2U Intel Sandy Bridge-EP Each Node: 16 DIMMs each 2 GigE (Intel) 1 Daughter Card (PCIe Gen 3 x8) FDR IB or QDR IB or 10GigE One PCIe G3 x16 (half-length, half-height) Two node version has two PCIe G3 x16 slots Optional SAS controller (in-place of IB) Chassis Design: Hot Plug, Individual Nodes Up to 12 x 3.5 drives or Up to 24 x 2.5 drives N+1 Power supplies (1100W or 1400W) NVIDIA HIC certified 16

17 Dell Power R720 First Standard Server with internal GPUs Two-socket Intel Sandy Bridge-EP 24 DIMM slots (up to 768GB) Dell Select Network Adapters: 4x GigE 2x10GigE + 2xGigE Intel or Broadcom 7 PCIe Gen 3 slots: Up to two internal GPUS (passive) PCIe G3 x8 slot for network adapter (e.g. FDR IB) Up to 4 front-access, hot-swap, PCIe drives Up to 16 drives HIC certified to work with Dell C410x 17

18 Performance and Power Measurements

19 Engineering Performing benchmarks/tests of various GPU applications: Different host nodes C6100, C6145 are called Dell 11G C6220 and R720 are called Dell 12G Different number of GPUs Internal and external GPUs Measured performance AND power during tests Goal is to understand how applications scale: Number and type of GPUs Host node configuration Develop best practices for GPU configurations 19

20 Applications HPL NAMD XFDTD 3D Oil Reservoir Simulation ANSYS Mechanical 20

21 Thanks!!!!! Dr. Saeed Iqbal Shawn Gao Onur Celebioglu Mark Fernandez, Glen Otero Nvidia Mass Fatica, Stan Posey, Peter Lillian, Bob Cravella, Travis Wells, et al 21

22 Dell 11G

23 Normalized Performance (GFLOPS) Nromalized Power (W) Normalized GFLOPS/W Dell PowerEdge C C410x HPL Normalized Results % 64.91% 56.33% 39.93% 18.18% % 64.91% 56.33% 39.93% 18.18% % 64.91% 56.33% 39.93% 18.18% CPU Only CPU + 1 CPU + 2 CPU + 4 CPU + 8 CPU Only CPU + 1 CPU + 2 CPU + 4 CPU + 8 CPU Only CPU + 1 CPU + 2 CPU + 4 CPU + 8 One HIC from host node to C410x Recommended no more than 2 GPUs per HIC 23 1-node PE C6100, Dual X5650@2.67GHz, 48GB, 1333MHz Memory; C410x has Nvidia s

24 Normalized Performance (GFLOPS) Normalized GFLOPS/W Dell PowerEdge C C410x HPL Normalized Results % % 45.11% 19.74% 32.02% 12.03% 1 HIC 2 HIC % 45.11% 35.85% 46.36% 19.74% 32.02% 12.03% 20.11% 1 HIC 2 HIC CPU Only CPU + 1 CPU + 2 CPU + 4 M207 CPU + 8 CPU Only CPU + 1 CPU + 2 CPU + 4 M207 CPU + 8 One or Two HICs from host node to C410x Recommend no more than 2 GPUs per HIC (1 per HIC is better) 24 1-node PE C6145, Four 6132HE@2.2GHz, 128GB, 1333MHz Memory; C410x has Nvidia s

25 Normalized Performance (day/ns) Normalized Power (W) Normalized Perf (1/days/ns)/W Dell PowerEdge C C410x NAMD Normalized Results CPU Only CPU + 1 CPU + 2 CPU + 4 CPU + 8 CPU Only CPU + 1 CPU + 2 CPU + 4 CPU + 8 CPU Only CPU + 1 CPU + 2 CPU + 4 CPU + 8 STMV data set (1M atoms) Two GPUs is best, 4 GPUs is also good (Perf/W). 1 HIC is good 25 1-node PE C6100, Dual X5650@2.67GHz, 48GB, 1333MHz Memory; C410x has Nvidia s

26 Normalized Performance (1/time) Dell PowerEdge C C410x XFDTD Normalized Performance GPUs are best 2-4 GPUs per x16 HIC CPU Only CPU + 1 CPU + 2 CPU + 4 CPU + 6 CPU Node PE C6100, Dual X5670@2.93GHz, 48GB, 1333MHz Memory; C410x has 16 GPUs

27 Normalized Performance (time) Normalized Power (W) Normalized Perf/W (Time*W) Dell PowerEdge C C410x 3D Oil Reservoir Simulation (Elastic Model) Normalized Results GPUs 2 GPUs 4 GPUs CPU CPU CPU CPU 8.28M/2 GPUS 16.57M/2 GPUs 33.15M/4 GPUs 8 GPUs 66.34M/8 GPUs Problem Size/Number of GPUs GPUS 2 GPUS 4 GPUs CPU CPU CPU CPU 8.28M/2 GPUS 16.57M/2 GPUs 33.15M/4 GPUs 8 GPUs 66.34M/8 GPUs Problem Size/Number of GPUs GPUs CPU CPU CPU CPU 2 GPUs 2 GPUs 8.28M/2 GPUS 16.57M/2 GPUs 33.15M/4 GPUs 8 GPUs 66.34M/8 GPUs Problem Size/Number of GPUs Multiple data sets Smaller data sets: 2 GPUs. Larger data sets: 4-8 GPUs (per x16 HIC) 27 Node PE C6100, Dual X5670@2.93GHz, 48GB, 1333MHz Memory; C410x has 16 GPUs

28 Dell 12G

29 Normalized Performance (GFLOPS) Normalized Power (W) Normalized GFLOPS/W Dell PowerEdge R720 HPL-Normalized Results % 60.2% % 60.2% % 67.8% 60.2% % % R720 CPU only R M2090 R M2090 R720 CPU only R M2090 R M2090 R720 CPU only R M2090 Internal GPUs 2 GPUs seems like a good configuration (individual x16 slot) R M R720, Dual Intel E5-2660@2.2GHz, 64GB, 1333MHz Memory (8x 8GB); two internal M2090 GPUs

30 GFLOPs/Watts Compare GFLOPS/watt (Cap 225W) Comparisons ( C6100 & R720) C6100 ( 2.93GHz ) R720 ( 2.7GHz ) R720 ( 2.2GHz ) M2090 GPUs with 225W power capping

31 Normalized Performance (GFLOPS) Normalized Power (W) Normalized GFLOPS/W Dell PowerEdge C C410x HPL - Normalized Results % % 58.7% 34.4% % 58.7% % 67.2% 58.7% 34.4% % % CPU only CPU+1 M2090 CPU+2 M2090 CPU+4 M2090 CPU only CPU+1 M2090 CPU+2 M2090 CPU+4 M2090 CPU only CPU+1 M2090 CPU+2 M2090 CPU+4 M2090 External GPUs with single HIC Two GPUs seems to be sweet spot (two GPUs with 1 x16 HIC) 31

32 Normalized Performance (GFLOPS) Normalized Performance (GFLOPS) Comparison of Internal vs. External HPL - Normalized Performance Not quite apples-to-apples (PCIe G2 to G3) % % 34.4% % % % % R720 CPU only R M2090 R M2090 CPU only CPU+1 M2090 CPU+2 M2090 CPU+4 M

33 Normalized GFLOPS/W Normalized GFLOPS/W Comparison of Internal vs. External HPL - Normalized GFLOPS/W Not quite apples-to-apples (PCIe G2 to G3) % 67.8% 60.2% % 67.2% 58.7% 34.4% R720 CPU only R M2090 R M2090 CPU only CPU+1 M2090 CPU+2 M2090 CPU+4 M2090 Internal GPUs are more effective (perf/w) but external are still very good 33

34 Normalized Performance (Time) Normalized Perf/W - (1/time)/W Dell PowerEdge R720 ANSYS Mechanical - Normalized Results 1 core Core 1 Core + 1 M Core 1 Core + 1 M V14cg-1V14sp-1V14sp-2V14sp-3V14sp-4V14sp-5V14sp-6 Massive speedup with GPU (up to 3.6x) but not all cases Perf/W (efficiency) can be quite good (2 times better) 34

35 Normalized Performance (Time) Normalized Perf/W - (1/time)/W Dell PowerEdge R720 ANSYS Mechanical - Normalized Results 2 cores Cores 2 Cores + 1 M Cores 2 Cores + 1 M Less speedup than 1 core case Perf/W (efficiency) is still very good 35

36 Normalized Perforamnce (Time) Normalized Perf/W - (1/T)/W Dell PowerEdge R720 ANSYS Mechanical - Normalized Results 4 cores Cores 4 Cores + 1 M Cores 4 Cores + 1 M V14cg-1V14sp-1V14sp-2V14sp-3V14sp-4V14sp-5V14sp-6 V14cg-1V14sp-1V14sp-2V14sp-3V14sp-4V14sp-5V14sp-6 Less speedup than 1 and 2 core cases 36

37 Normalized Performance (Time) Normalized Perf/W - (1/T)/W Dell PowerEdge R720 ANSYS Mechanical - Normalized Results 8 cores Cores 8 Cores + 1 M Cores + 2 M Cores 8 Cores + 1 M Added 2 GPU tests (not much impact on performance) Efficiency is not good except for 2 cases (less than 1) 37

38 Normalized Performance (Speedup) Normalized Perf/W - (1/T)/W Dell PowerEdge R720 ANSYS Mechanical - Normalized Results 16 cores Cores 16 Cores + 1 M Cores + 2 M Cores 16 Cores + 1 M2090 Very little speed improvement Efficiency with GPUs is worse than CPUs only 38

39 Normalized Performance (1/T) Normalized Perf/W - (1/T)/W Dell PowerEdge R720 ANSYS Mechanical Trends Choose last 3 cases (best usage of GPUs) V14sp V14sp V14sp-5 V14sp-6 CPU Only V14sp-5 V14sp-6 CPU Only Core 2 Cores 4 Cores 8 Cores 16 Cores 1 Core 2 Cores 4 Cores 8 Cores 16 Cores 39

40 ANSYS Mechanical observations As the number of cores increased: The GPU speedup decreases Efficiency decreases (at 16 cores it s not good) Cross-over point is 8 or 16 cores (8 is a good rule of thumb) Recommended configuration: Small CPU core count (no more than 4) with 1 GPU With 2 GPUs in node, you can run 2 cases at the same time (uses 8 cores) Performance varies with case (solver) 40

41 Summary Lots of options for GPU configurations which one is best? How do you define best? Performance? Power efficiency? Both? Answers vary (depend upon application) You don t have to have a dedicated x16 slot for each GPU for good performance and good efficiency Many applications shown here illustrate this Dell C410x allows GPU Direct for up to 8 GPUs Other systems do not allow this 41

42 Thanks! Questions?

System Design of Kepler Based HPC Solutions. Saeed Iqbal, Shawn Gao and Kevin Tubbs HPC Global Solutions Engineering.

System Design of Kepler Based HPC Solutions. Saeed Iqbal, Shawn Gao and Kevin Tubbs HPC Global Solutions Engineering. System Design of Kepler Based HPC Solutions Saeed Iqbal, Shawn Gao and Kevin Tubbs HPC Global Solutions Engineering. Introduction The System Level View K20 GPU is a powerful parallel processor! K20 has

More information

Dell Solution for High Density GPU Infrastructure

Dell Solution for High Density GPU Infrastructure Dell Solution for High Density GPU Infrastructure 李信乾 (Clayton Li) 產品技術顧問 HPC@DELL Key partnerships & programs Customer inputs & new ideas Collaboration Innovation Core & new technologies Critical adoption

More information

Architecting High Performance Computing Systems for Fault Tolerance and Reliability

Architecting High Performance Computing Systems for Fault Tolerance and Reliability Architecting High Performance Computing Systems for Fault Tolerance and Reliability Blake T. Gonzales HPC Computer Scientist Dell Advanced Systems Group blake_gonzales@dell.com Agenda HPC Fault Tolerance

More information

Accelerating high-performance computing with hybrid platforms

Accelerating high-performance computing with hybrid platforms Accelerating high-performance computing with hybrid platforms October 2010 Dell THIS WHITE PAPER IS FOR INFORMATIONAL PURPOSES ONLY, AND MAY CONTAIN TYPOGRAPHICAL ERRORS AND TECHNICAL INACCURACIES. THE

More information

Game-changing Extreme GPU computing with The Dell PowerEdge C4130

Game-changing Extreme GPU computing with The Dell PowerEdge C4130 Game-changing Extreme GPU computing with The Dell PowerEdge C4130 A Dell Technical White Paper This white paper describes the system architecture and performance characterization of the PowerEdge C4130.

More information

LAMMPS-KOKKOS Performance Benchmark and Profiling. September 2015

LAMMPS-KOKKOS Performance Benchmark and Profiling. September 2015 LAMMPS-KOKKOS Performance Benchmark and Profiling September 2015 2 Note The following research was performed under the HPC Advisory Council activities Participating vendors: Intel, Dell, Mellanox, NVIDIA

More information

GROMACS (GPU) Performance Benchmark and Profiling. February 2016

GROMACS (GPU) Performance Benchmark and Profiling. February 2016 GROMACS (GPU) Performance Benchmark and Profiling February 2016 2 Note The following research was performed under the HPC Advisory Council activities Participating vendors: Dell, Mellanox, NVIDIA Compute

More information

NAMD Performance Benchmark and Profiling. January 2015

NAMD Performance Benchmark and Profiling. January 2015 NAMD Performance Benchmark and Profiling January 2015 2 Note The following research was performed under the HPC Advisory Council activities Participating vendors: Intel, Dell, Mellanox Compute resource

More information

Dell PowerEdge Servers Portfolio Guide

Dell PowerEdge Servers Portfolio Guide Dell PowerEdge Servers Portfolio Guide Dell PowerEdge Servers Purpose-Built for Reliability Virtualization-Enabled for an Efficient Infrastructure Intelligent, Connected Systems Managment With Dell you

More information

LS-DYNA Performance Benchmark and Profiling. April 2015

LS-DYNA Performance Benchmark and Profiling. April 2015 LS-DYNA Performance Benchmark and Profiling April 2015 2 Note The following research was performed under the HPC Advisory Council activities Participating vendors: Intel, Dell, Mellanox Compute resource

More information

Sugon TC6600 blade server

Sugon TC6600 blade server Sugon TC6600 blade server The converged-architecture blade server The TC6600 is a new generation, multi-node and high density blade server with shared power, cooling, networking and management infrastructure

More information

HPC Hardware Overview

HPC Hardware Overview HPC Hardware Overview John Lockman III April 19, 2013 Texas Advanced Computing Center The University of Texas at Austin Outline Lonestar Dell blade-based system InfiniBand ( QDR) Intel Processors Longhorn

More information

Performance Analysis of HPC Applications on Several Dell PowerEdge 12 th Generation Servers

Performance Analysis of HPC Applications on Several Dell PowerEdge 12 th Generation Servers Performance Analysis of HPC Applications on Several Dell PowerEdge 12 th Generation Servers This Dell technical white paper evaluates and provides recommendations for the performance of several HPC applications

More information

PART-I (B) (TECHNICAL SPECIFICATIONS & COMPLIANCE SHEET) Supply and installation of High Performance Computing System

PART-I (B) (TECHNICAL SPECIFICATIONS & COMPLIANCE SHEET) Supply and installation of High Performance Computing System INSTITUTE FOR PLASMA RESEARCH (An Autonomous Institute of Department of Atomic Energy, Government of India) Near Indira Bridge; Bhat; Gandhinagar-382428; India PART-I (B) (TECHNICAL SPECIFICATIONS & COMPLIANCE

More information

The Cray CX1 puts massive power and flexibility right where you need it in your workgroup

The Cray CX1 puts massive power and flexibility right where you need it in your workgroup The Cray CX1 puts massive power and flexibility right where you need it in your workgroup Up to 96 cores of Intel 5600 compute power 3D visualization Up to 32TB of storage GPU acceleration Small footprint

More information

ANSYS Fluent 14 Performance Benchmark and Profiling. October 2012

ANSYS Fluent 14 Performance Benchmark and Profiling. October 2012 ANSYS Fluent 14 Performance Benchmark and Profiling October 2012 Note The following research was performed under the HPC Advisory Council activities Special thanks for: HP, Mellanox For more information

More information

CPMD Performance Benchmark and Profiling. February 2014

CPMD Performance Benchmark and Profiling. February 2014 CPMD Performance Benchmark and Profiling February 2014 Note The following research was performed under the HPC Advisory Council activities Special thanks for: HP, Mellanox For more information on the supporting

More information

GPU Clusters for High- Performance Computing Jeremy Enos Innovative Systems Laboratory

GPU Clusters for High- Performance Computing Jeremy Enos Innovative Systems Laboratory GPU Clusters for High- Performance Computing Jeremy Enos Innovative Systems Laboratory National Center for Supercomputing Applications University of Illinois at Urbana-Champaign Presentation Outline NVIDIA

More information

STAR-CCM+ Performance Benchmark and Profiling. July 2014

STAR-CCM+ Performance Benchmark and Profiling. July 2014 STAR-CCM+ Performance Benchmark and Profiling July 2014 Note The following research was performed under the HPC Advisory Council activities Participating vendors: CD-adapco, Intel, Dell, Mellanox Compute

More information

Intel Xeon E v4, Optional Operating System, 8GB Memory, 2TB SAS H330 Hard Drive and a 3 Year Warranty

Intel Xeon E v4, Optional Operating System, 8GB Memory, 2TB SAS H330 Hard Drive and a 3 Year Warranty pe_r730_1356_a Datasheet Check its price: Click Here Overview adapts to virtually any workload with a scalable server featuring an optimal mix of memory, storage, processing and GPUs. This model is the

More information

FUJITSU Server PRIMERGY CX400 M4 Workload-specific power in a modular form factor. 0 Copyright 2018 FUJITSU LIMITED

FUJITSU Server PRIMERGY CX400 M4 Workload-specific power in a modular form factor. 0 Copyright 2018 FUJITSU LIMITED FUJITSU Server PRIMERGY CX400 M4 Workload-specific power in a modular form factor 0 Copyright 2018 FUJITSU LIMITED FUJITSU Server PRIMERGY CX400 M4 Workload-specific power in a compact and modular form

More information

Stan Posey, CAE Industry Development NVIDIA, Santa Clara, CA, USA

Stan Posey, CAE Industry Development NVIDIA, Santa Clara, CA, USA Stan Posey, CAE Industry Development NVIDIA, Santa Clara, CA, USA NVIDIA and HPC Evolution of GPUs Public, based in Santa Clara, CA ~$4B revenue ~5,500 employees Founded in 1999 with primary business in

More information

NAMD GPU Performance Benchmark. March 2011

NAMD GPU Performance Benchmark. March 2011 NAMD GPU Performance Benchmark March 2011 Note The following research was performed under the HPC Advisory Council activities Participating vendors: Dell, Intel, Mellanox Compute resource - HPC Advisory

More information

DELL POWEREDGE SERVERS

DELL POWEREDGE SERVERS DELL POWEREDGE SERVERS PRODUCT GUIDE INTRODUCING THE LATEST GENERATION OF POWEREDGE SERVERS User-Inspired Award-Winning Design Cost-Cutting Energy Smart Technology Exclusive Industry-Only Management Integrated

More information

HP GTC Presentation May 2012

HP GTC Presentation May 2012 HP GTC Presentation May 2012 Today s Agenda: HP s Purpose-Built SL Server Line Desktop GPU Computing Revolution with HP s Z Workstations Hyperscale the new frontier for HPC New HPC customer requirements

More information

CST STUDIO SUITE R Supported GPU Hardware

CST STUDIO SUITE R Supported GPU Hardware CST STUDIO SUITE R 2017 Supported GPU Hardware 1 Supported Hardware CST STUDIO SUITE currently supports up to 8 GPU devices in a single host system, meaning each number of GPU devices between 1 and 8 is

More information

University at Buffalo Center for Computational Research

University at Buffalo Center for Computational Research University at Buffalo Center for Computational Research The following is a short and long description of CCR Facilities for use in proposals, reports, and presentations. If desired, a letter of support

More information

High Performance Computing with Accelerators

High Performance Computing with Accelerators High Performance Computing with Accelerators Volodymyr Kindratenko Innovative Systems Laboratory @ NCSA Institute for Advanced Computing Applications and Technologies (IACAT) National Center for Supercomputing

More information

Exactly as much as you need.

Exactly as much as you need. Exactly as much as you need. Get IT All with PRIMERGY RX300 S6 & PRIMERGY RX200 S6 1 Copyright 2011 FUJITSU Agenda 1. Get IT All: The Offer 2. Dynamic Infrastructures 3. PRIMERGY Portfolio Overview 4.

More information

n N c CIni.o ewsrg.au

n N c CIni.o ewsrg.au @NCInews NCI and Raijin National Computational Infrastructure 2 Our Partners General purpose, highly parallel processors High FLOPs/watt and FLOPs/$ Unit of execution Kernel Separate memory subsystem GPGPU

More information

About 2CRSI. OCtoPus Solution. Technical Specifications. OCtoPus servers. OCtoPus. OCP Solution by 2CRSI.

About 2CRSI. OCtoPus Solution. Technical Specifications. OCtoPus servers. OCtoPus. OCP Solution by 2CRSI. About 2CRSI OCtoPus Solution Technical Specifications OCtoPus servers OCtoPus OCP Solution by 2CRSI 1 About 2CRSI 3 OCtoPus Solution 4 Technical Specifications OCtoPus Rack Unique server design 6 7 OCtoPus

More information

Dell PowerEdge R720xd with PERC H710P: A Balanced Configuration for Microsoft Exchange 2010 Solutions

Dell PowerEdge R720xd with PERC H710P: A Balanced Configuration for Microsoft Exchange 2010 Solutions Dell PowerEdge R720xd with PERC H710P: A Balanced Configuration for Microsoft Exchange 2010 Solutions A comparative analysis with PowerEdge R510 and PERC H700 Global Solutions Engineering Dell Product

More information

John Fragalla TACC 'RANGER' INFINIBAND ARCHITECTURE WITH SUN TECHNOLOGY. Presenter s Name Title and Division Sun Microsystems

John Fragalla TACC 'RANGER' INFINIBAND ARCHITECTURE WITH SUN TECHNOLOGY. Presenter s Name Title and Division Sun Microsystems TACC 'RANGER' INFINIBAND ARCHITECTURE WITH SUN TECHNOLOGY SUBTITLE WITH TWO LINES OF TEXT IF NECESSARY John Fragalla Presenter s Name Title and Division Sun Microsystems Principle Engineer High Performance

More information

Ultimate performance and security in a 2U Form factor.

Ultimate performance and security in a 2U Form factor. Ultimate performance and security in a 2U Form factor. PRECISION 7920 RACK Powerful performance Power through the most complex, demanding applications more quickly with a new generation of dual-socket

More information

GW2000h w/gw175h/q F1 specifications

GW2000h w/gw175h/q F1 specifications Product overview The Gateway GW2000h w/ GW175h/q F1 maximizes computing power and thermal control with up to four hot-pluggable nodes in a space-saving 2U form factor. Offering first-class performance,

More information

HostEngine 5URP24 Computer User Guide

HostEngine 5URP24 Computer User Guide HostEngine 5URP24 Computer User Guide Front and Rear View HostEngine 5URP24 (HE5URP24) computer features Intel Xeon Scalable (Skylake FCLGA3647 socket) Series dual processors with the Intel C621 chipset.

More information

The Why and How of Developing All-Flash Storage Server

The Why and How of Developing All-Flash Storage Server The Why and How of Developing All-Flash Storage Server June 2016 Jungsoo Kim Manager, SK Telecom Agenda Why we care about All-Flash Storage Transforming to 5G Network Open HW & SW Projects @ SKT Our approaches

More information

Data Sheet Fujitsu Server PRIMERGY CX250 S2 Dual Socket Server Node

Data Sheet Fujitsu Server PRIMERGY CX250 S2 Dual Socket Server Node Data Sheet Fujitsu Server PRIMERGY CX250 S2 Dual Socket Server Node Data Sheet Fujitsu Server PRIMERGY CX250 S2 Dual Socket Server Node Datasheet for Red Hat certification Standard server node for PRIMERGY

More information

Dell PowerEdge server portfolio: platforms and solutions for enterprise applications

Dell PowerEdge server portfolio: platforms and solutions for enterprise applications Dell PowerEdge server portfolio: platforms and solutions for enterprise applications Dell PowerEdge server portfolio: platforms and solutions for enterprise applications Next-generation PowerEdge technologies

More information

Memory Selection Guidelines for High Performance Computing with Dell PowerEdge 11G Servers

Memory Selection Guidelines for High Performance Computing with Dell PowerEdge 11G Servers Memory Selection Guidelines for High Performance Computing with Dell PowerEdge 11G Servers A Dell Technical White Paper By Garima Kochhar and Jacob Liberman High Performance Computing Engineering Dell

More information

Intel Xeon E v4, Windows Server 2016 Standard, 16GB Memory, 1TB SAS Hard Drive and a 3 Year Warranty

Intel Xeon E v4, Windows Server 2016 Standard, 16GB Memory, 1TB SAS Hard Drive and a 3 Year Warranty pe_r430_11598_b Datasheet Check its price: Click Here Overview delivers peak 2-socket performance for HPC, web tech and infrastructure scale-out. R430 provides Intel Xeon processor E5-2600 v4 product family

More information

Intel Select Solutions for Professional Visualization with Advantech Servers & Appliances

Intel Select Solutions for Professional Visualization with Advantech Servers & Appliances Solution Brief Intel Select Solution for Professional Visualization Intel Xeon Processor Scalable Family Powered by Intel Rendering Framework Intel Select Solutions for Professional Visualization with

More information

Agenda. Sun s x Sun s x86 Strategy. 2. Sun s x86 Product Portfolio. 3. Virtualization < 1 >

Agenda. Sun s x Sun s x86 Strategy. 2. Sun s x86 Product Portfolio. 3. Virtualization < 1 > Agenda Sun s x86 1. Sun s x86 Strategy 2. Sun s x86 Product Portfolio 3. Virtualization < 1 > 1. SUN s x86 Strategy Customer Challenges Power and cooling constraints are very real issues Energy costs are

More information

SU Dual and Quad-Core Xeon UP Server

SU Dual and Quad-Core Xeon UP Server SU4-1300 Dual and Quad-Core Xeon UP Server www.eslim.co.kr Dual and Quad-Core Server Computing Leader!! ESLIM KOREA INC. 1. Overview eslim SU4-1300 The ideal entry-level server Intel Xeon processor 3000/3200

More information

Part Number Unit Descriptions

Part Number Unit Descriptions Part Number Unit Descriptions 2582B2A System x3100m4 Simple Swap (SATA) Xeon 4C E3-1220v2 69W 3.1GHz/1600MHz/8MB Form factor Tower (can be a 4U rack form factor using the optional Tower-to-Rack Conversion

More information

High Performance Computing

High Performance Computing 21 High Performance Computing High Performance Computing Systems 21-2 HPC-1420-ISSE Robust 1U Intel Quad Core Xeon Server with Innovative Cable-less Design 21-3 HPC-2820-ISSE 2U Intel Quad Core Xeon Server

More information

Accelerating HPC. (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing

Accelerating HPC. (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing Accelerating HPC (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing SAAHPC, Knoxville, July 13, 2010 Legal Disclaimer Intel may make changes to specifications and product

More information

MILC Performance Benchmark and Profiling. April 2013

MILC Performance Benchmark and Profiling. April 2013 MILC Performance Benchmark and Profiling April 2013 Note The following research was performed under the HPC Advisory Council activities Special thanks for: HP, Mellanox For more information on the supporting

More information

DESERT STORM WS-TS700

DESERT STORM WS-TS700 Description: The Desert Storm WS-TS700 is designed for use as a high performance workstation, support for graphics cards and MIO audio cards, optimized audio performance, BIOS flashback and Q-code logger.

More information

Altos T310 F3 Specifications

Altos T310 F3 Specifications Product overview The Altos T310 F3 delivers proactive management tools matched by best priceperformance technology ideal for SMB and branch office operations. This singlesocket tower server features an

More information

Dell EMC PowerEdge server portfolio: platforms and solutions

Dell EMC PowerEdge server portfolio: platforms and solutions Dell EMC PowerEdge server portfolio: platforms and solutions Servers are the bedrock of the modern appliance. With consistent, scalable and industry-leading design, Dell EMC servers can help you tackle

More information

About 2CRSI. OCtoPus Solution. Technical Specifications. OCtoPus. OCP Solution by 2CRSI.

About 2CRSI. OCtoPus Solution. Technical Specifications. OCtoPus. OCP Solution by 2CRSI. About 2CRSI OCtoPus Solution Technical Specifications OCtoPus OCtoPus OCP Solution by 2CRSI 1 Remark: All specifications and photos are subject to change whitout notice. 2 About 2CRSI 5 OCtoPus Solution

More information

Fujitsu PRIMERGY Servers Portfolio

Fujitsu PRIMERGY Servers Portfolio Fujitsu Servers Portfolio Dynamic Infrastructures for workgroup, datacenter and cloud computing shaping tomorrow with you Higher IT efficiency and reduced total cost of ownership Fujitsu Micro and Tower

More information

HUAWEI Tecal X6000 High-Density Server

HUAWEI Tecal X6000 High-Density Server HUAWEI Tecal X6000 High-Density Server Professional Trusted Future-oriented HUAWEI TECHNOLOGIES CO., LTD. HUAWEI Tecal X6000 High-Density Server (X6000) High computing density The X6000 is 2U high and

More information

Essentials. Expected Discontinuance Q2'15 Limited 3-year Warranty Yes Extended Warranty Available

Essentials. Expected Discontinuance Q2'15 Limited 3-year Warranty Yes Extended Warranty Available M&A, Inc. Essentials Status Launched Expected Discontinuance Q2'15 Limited 3-year Warranty Extended Warranty Available for Purchase (Select Countries) On-Site Repair Available for Purchase (Select Countries)

More information

OCTOPUS Performance Benchmark and Profiling. June 2015

OCTOPUS Performance Benchmark and Profiling. June 2015 OCTOPUS Performance Benchmark and Profiling June 2015 2 Note The following research was performed under the HPC Advisory Council activities Special thanks for: HP, Mellanox For more information on the

More information

WiRack19 - Computing Server. Wiwynn SV324G2. Highlights. Specification.

WiRack19 - Computing Server. Wiwynn SV324G2. Highlights. Specification. WiRack19 - Computing Server Wiwynn SV324G2 Inherits benefits from hyper-scale deployed OCP Leopard MB Front serviceable for quick configuration and deployment Tool-less, hot-swappable redundancy for easy

More information

Performance Optimizations via Connect-IB and Dynamically Connected Transport Service for Maximum Performance on LS-DYNA

Performance Optimizations via Connect-IB and Dynamically Connected Transport Service for Maximum Performance on LS-DYNA Performance Optimizations via Connect-IB and Dynamically Connected Transport Service for Maximum Performance on LS-DYNA Pak Lui, Gilad Shainer, Brian Klaff Mellanox Technologies Abstract From concept to

More information

Ultimate performance and security in a 2U Form factor.

Ultimate performance and security in a 2U Form factor. Ultimate performance and security in a 2U Form factor. PRECISION 7920 XL RACK When Stability Matters When you choose an OEM XL product, you get the stability, visibility and longevity you need from the

More information

TFLOP Performance for ANSYS Mechanical

TFLOP Performance for ANSYS Mechanical TFLOP Performance for ANSYS Mechanical Dr. Herbert Güttler Engineering GmbH Holunderweg 8 89182 Bernstadt www.microconsult-engineering.de Engineering H. Güttler 19.06.2013 Seite 1 May 2009, Ansys12, 512

More information

Optimal BIOS settings for HPC with Dell PowerEdge 12 th generation servers

Optimal BIOS settings for HPC with Dell PowerEdge 12 th generation servers Optimal BIOS settings for HPC with Dell PowerEdge 12 th generation servers This Dell technical white paper analyses the various BIOS options available in Dell PowerEdge 12 th generation servers and provides

More information

Altos R320 F3 Specifications. Product overview. Product views. Internal view

Altos R320 F3 Specifications. Product overview. Product views. Internal view Product overview The Altos R320 F3 single-socket 1U rack server delivers great performance and enterprise-level scalability in a space-saving design. Proactive management utilities effectively handle SMB

More information

IBM System x family brochure

IBM System x family brochure IBM Systems and Technology Group System x IBM System x family brochure IBM System x rack and tower servers 2 IBM System x family brochure IBM System x servers Highlights IBM System x and BladeCenter servers

More information

SERVER TECHNOLOGY H. Server & Workstation Motherboards Server Barebones & Accessories

SERVER TECHNOLOGY H. Server & Workstation Motherboards Server Barebones & Accessories SERVER TECHNOLOGY 2018 2H Server & Workstation Motherboards Server Barebones & Accessories MOTHERBOARD We put our three decades of know-how in motherboard design at the service of cutting-edge server motherboards.

More information

Avid Configuration Guidelines Dell 3620 Workstation Tower & 3420 Workstation SFF Single Quad Core CPU Qualified for Software Only

Avid Configuration Guidelines Dell 3620 Workstation Tower & 3420 Workstation SFF Single Quad Core CPU Qualified for Software Only Avid Configuration Guidelines Dell 3620 Workstation Tower & 3420 Workstation SFF Single Quad Core CPU Qualified for Software Only Page 1 of 12 1.) Dell 3620 Tower and 3420 SFF [Small Form Factor] AVID

More information

The Dell Precision T3620 tower as a Smart Client leveraging GPU hardware acceleration

The Dell Precision T3620 tower as a Smart Client leveraging GPU hardware acceleration The Dell Precision T3620 tower as a Smart Client leveraging GPU hardware acceleration Dell IP Video Platform Design and Calibration Lab June 2018 H17415 Reference Architecture Dell EMC Solutions Copyright

More information

Suggested use: infrastructure applications, collaboration/ , web, and virtualized desktops in a workgroup or distributed environments.

Suggested use: infrastructure applications, collaboration/ , web, and virtualized desktops in a workgroup or distributed environments. The IBM System x3500 M4 server provides outstanding performance for your business-critical applications. Its energy-efficient design supports more cores, memory, and data capacity in a scalable Tower or

More information

Microsoft SQL Server in a VMware Environment on Dell PowerEdge R810 Servers and Dell EqualLogic Storage

Microsoft SQL Server in a VMware Environment on Dell PowerEdge R810 Servers and Dell EqualLogic Storage Microsoft SQL Server in a VMware Environment on Dell PowerEdge R810 Servers and Dell EqualLogic Storage A Dell Technical White Paper Dell Database Engineering Solutions Anthony Fernandez April 2010 THIS

More information

GPUs and Emerging Architectures

GPUs and Emerging Architectures GPUs and Emerging Architectures Mike Giles mike.giles@maths.ox.ac.uk Mathematical Institute, Oxford University e-infrastructure South Consortium Oxford e-research Centre Emerging Architectures p. 1 CPUs

More information

15 Jun 2012 Kevin Chang

15 Jun 2012 Kevin Chang TYAN/MiTAC 4U MICRO SERVER Product Overview 15 Jun 2012 Kevin Chang FM65 System Enclosure 19 rack mount 4U enclosure (H176mm x W440mm x D650mm) Support either (MFG option) Regular AC w/ redundancy DC 12V

More information

Fujitsu VDI / vgpu Virtualization

Fujitsu VDI / vgpu Virtualization Fujitsu VDI / vgpu Virtualization Antti Sirkiä Service Partner Manager, Certified Trainer Fujitsu, Product Business Unit Why Virtualization / Graphics Virtualization? :: GRAPHICS VIRTUALIZATION :: Multiple

More information

IBM eserver xseries. BladeCenter. Arie Berkovitch eserver Territory Manager IBM Corporation

IBM eserver xseries. BladeCenter. Arie Berkovitch eserver Territory Manager IBM Corporation BladeCenter Arie Berkovitch eserver Territory Manager 2006 IBM Corporation IBM BladeCenter What is a Blade A server on a card each Blade has its own: processor networking memory optional storage etc. IBM

More information

RECENT TRENDS IN GPU ARCHITECTURES. Perspectives of GPU computing in Science, 26 th Sept 2016

RECENT TRENDS IN GPU ARCHITECTURES. Perspectives of GPU computing in Science, 26 th Sept 2016 RECENT TRENDS IN GPU ARCHITECTURES Perspectives of GPU computing in Science, 26 th Sept 2016 NVIDIA THE AI COMPUTING COMPANY GPU Computing Computer Graphics Artificial Intelligence 2 NVIDIA POWERS WORLD

More information

MegaGauss (MGs) Cluster Design Overview

MegaGauss (MGs) Cluster Design Overview MegaGauss (MGs) Cluster Design Overview NVIDIA Tesla (Fermi) S2070 Modules Based Solution Version 6 (Apr 27, 2010) Alexander S. Zaytsev p. 1 of 15: "Title" Front view: planar

More information

HPE Scalable Storage with Intel Enterprise Edition for Lustre*

HPE Scalable Storage with Intel Enterprise Edition for Lustre* HPE Scalable Storage with Intel Enterprise Edition for Lustre* HPE Scalable Storage with Intel Enterprise Edition For Lustre* High Performance Storage Solution Meets Demanding I/O requirements Performance

More information

All-Flash Storage System

All-Flash Storage System All-Flash Storage System June 2016 Jungsoo Kim Manager, SK Telecom Agenda SKT Storage Solution R&D Introduction Our approaches in developing storage system AF-Media details Computing Board Storage Module

More information

NVIDIA GPU Computing Séminaire Calcul Hybride Aristote 25 Mars 2010

NVIDIA GPU Computing Séminaire Calcul Hybride Aristote 25 Mars 2010 NVIDIA GPU Computing 2010 Séminaire Calcul Hybride Aristote 25 Mars 2010 NVIDIA GPU Computing 2010 Tesla 3 rd generation Full OEM coverage Ecosystem focus Value Propositions per segments Card System Module

More information

Representation of the interested Bidders / vendors. Form no. T2 (TECHNICAL MINIMUM SPECIFICATIONS)

Representation of the interested Bidders / vendors. Form no. T2 (TECHNICAL MINIMUM SPECIFICATIONS) Sr. no. Clause no./page No. Item & Specification in the tender Bidder / Vendor s representation Response to the Bidders Page No.12 1 Chassis: 5U Rack Mountable or Higher Please consider Minimum 2U Rack

More information

Broadberry. Artificial Intelligence Server for Fraud. Date: Q Application: Artificial Intelligence

Broadberry. Artificial Intelligence Server for Fraud. Date: Q Application: Artificial Intelligence TM Artificial Intelligence Server for Fraud Date: Q2 2017 Application: Artificial Intelligence Tags: Artificial intelligence, GPU, GTX 1080 TI HM Revenue & Customs The UK s tax, payments and customs authority

More information

Dell PowerEdge server portfolio: platforms and solutions for enterprise applications

Dell PowerEdge server portfolio: platforms and solutions for enterprise applications Dell PowerEdge server portfolio: platforms and solutions for enterprise applications Dell PowerEdge server portfolio: platforms and solutions for enterprise applications Next-generation PowerEdge server

More information

HostEngine 4U Host Computer User Guide

HostEngine 4U Host Computer User Guide HostEngine 4U Host Computer User Guide HostEngine 4U computer features Intel Xeon E5-2600v4 (Broadwell) Series dual-processors with the Intel C612 chipset. HostEngine 4U provides four PCI Express (PCIe)

More information

Power Systems AC922 Overview. Chris Mann IBM Distinguished Engineer Chief System Architect, Power HPC Systems December 11, 2017

Power Systems AC922 Overview. Chris Mann IBM Distinguished Engineer Chief System Architect, Power HPC Systems December 11, 2017 Power Systems AC922 Overview Chris Mann IBM Distinguished Engineer Chief System Architect, Power HPC Systems December 11, 2017 IBM POWER HPC Platform Strategy High-performance computer and high-performance

More information

LS-DYNA Performance Benchmark and Profiling. October 2017

LS-DYNA Performance Benchmark and Profiling. October 2017 LS-DYNA Performance Benchmark and Profiling October 2017 2 Note The following research was performed under the HPC Advisory Council activities Participating vendors: LSTC, Huawei, Mellanox Compute resource

More information

Fujitsu Enterprise Product & Solution Facts

Fujitsu Enterprise Product & Solution Facts Fujitsu Enterprise Product & Solution Facts Servers PRIMERGY, SPARC Enterprise, PRIMEQUEST, BS2000/OSD Mainframes Storage ETERNUS for Flexible Data Management and Efficient Data Protection Solutions SAP,

More information

IBM System x3850 M2 servers feature hypervisor capability

IBM System x3850 M2 servers feature hypervisor capability IBM Europe Announcement ZG08-0161, dated March 25, 2008 IBM System x3850 M2 servers feature hypervisor capability Key prerequisites...2 Description...3 Product positioning... 7 Reference information...

More information

eslim SV Dual and Quad-Core Xeon Server Dual and Quad-Core Server Computing Leader!! ESLIM KOREA INC.

eslim SV Dual and Quad-Core Xeon Server  Dual and Quad-Core Server Computing Leader!! ESLIM KOREA INC. eslim SV7-2186 Dual and Quad-Core Xeon Server www.eslim.co.kr Dual and Quad-Core Server Computing Leader!! ESLIM KOREA INC. 1. Overview eslim SV7-2186 Server Dual and Quad-Core Intel Xeon Processors 4

More information

Maximize automotive simulation productivity with ANSYS HPC and NVIDIA GPUs

Maximize automotive simulation productivity with ANSYS HPC and NVIDIA GPUs Presented at the 2014 ANSYS Regional Conference- Detroit, June 5, 2014 Maximize automotive simulation productivity with ANSYS HPC and NVIDIA GPUs Bhushan Desam, Ph.D. NVIDIA Corporation 1 NVIDIA Enterprise

More information

FEMAP/NX NASTRAN PERFORMANCE TUNING

FEMAP/NX NASTRAN PERFORMANCE TUNING FEMAP/NX NASTRAN PERFORMANCE TUNING Chris Teague - Saratech (949) 481-3267 www.saratechinc.com NX Nastran Hardware Performance History Running Nastran in 1984: Cray Y-MP, 32 Bits! (X-MP was only 24 Bits)

More information

NLVMUG 16 maart Display protocols in Horizon

NLVMUG 16 maart Display protocols in Horizon NLVMUG 16 maart 2017 Display protocols in Horizon NLVMUG 16 maart 2017 Display protocols in Horizon Topics Introduction Display protocols - Basics PCoIP vs Blast Extreme Optimizing Monitoring Future Recap

More information

ANSYS Improvements to Engineering Productivity with HPC and GPU-Accelerated Simulation

ANSYS Improvements to Engineering Productivity with HPC and GPU-Accelerated Simulation ANSYS Improvements to Engineering Productivity with HPC and GPU-Accelerated Simulation Ray Browell nvidia Technology Theater SC12 1 2012 ANSYS, Inc. nvidia Technology Theater SC12 HPC Revolution Recent

More information

HostEngine 3U Host Computer User Guide

HostEngine 3U Host Computer User Guide HostEngine 3U computer features Intel Xeon 3.2GHz or lower-speed single- or dual-processor(s) with the Intel C602 chipset. HostEngine 3U provides four PCI Express (PCIe) Gen 3.0 x16 expansion slots. Each

More information

IBM System x family brochure

IBM System x family brochure IBM Systems and Technology System x IBM System x family brochure IBM System x rack and tower servers 2 IBM System x family brochure IBM System x servers Highlights IBM System x and BladeCenter servers

More information

Inspur AI Computing Platform

Inspur AI Computing Platform Inspur Server Inspur AI Computing Platform 3 Server NF5280M4 (2CPU + 3 ) 4 Server NF5280M5 (2 CPU + 4 ) Node (2U 4 Only) 8 Server NF5288M5 (2 CPU + 8 ) 16 Server SR BOX (16 P40 Only) Server target market

More information

Huawei Enterprise A Better Way. Huawei FusionServer X6800 Competitiveness Analysis

Huawei Enterprise A Better Way. Huawei FusionServer X6800 Competitiveness Analysis Huawei Enterprise A Better Way Huawei FusionServer X6800 Competitiveness Analysis Contents 1 Positioning and Selling Click Points to add Title 2 How to Beat Click to add Title 3 How to Defend 2 X6800:

More information

Pedraforca: a First ARM + GPU Cluster for HPC

Pedraforca: a First ARM + GPU Cluster for HPC www.bsc.es Pedraforca: a First ARM + GPU Cluster for HPC Nikola Puzovic, Alex Ramirez We ve hit the power wall ALL computers are limited by power consumption Energy-efficient approaches Multi-core Fujitsu

More information

LAMMPSCUDA GPU Performance. April 2011

LAMMPSCUDA GPU Performance. April 2011 LAMMPSCUDA GPU Performance April 2011 Note The following research was performed under the HPC Advisory Council activities Participating vendors: Dell, Intel, Mellanox Compute resource - HPC Advisory Council

More information

An Open, Standards Based Approach to Building Supercomputers

An Open, Standards Based Approach to Building Supercomputers An Open, Standards Based Approach to Building Supercomputers HPC Advisory Council March 21-23, 2011 Lugano, Switzerland Reza Rooholamini Development Approach Deliver ROBUST, RELIABLE, and SCALABLE solutions

More information

HUAWEI Tecal X8000 High-Density Rack Server

HUAWEI Tecal X8000 High-Density Rack Server HUAWEI Tecal X8000 High-Density Rack Server Professional Trusted Future-oriented HUAWEI TECHNOLOGIES CO., LTD. HUAWEI Tecal X8000 High-Density Rack Server (X8000) High density and innovative architecture

More information

GPU for HPC. October 2010

GPU for HPC. October 2010 GPU for HPC Simone Melchionna Jonas Latt Francis Lapique October 2010 EPFL/ EDMX EPFL/EDMX EPFL/DIT simone.melchionna@epfl.ch jonas.latt@epfl.ch francis.lapique@epfl.ch 1 Moore s law: in the old days,

More information

The Rise of Open Programming Frameworks. JC BARATAULT IWOCL May 2015

The Rise of Open Programming Frameworks. JC BARATAULT IWOCL May 2015 The Rise of Open Programming Frameworks JC BARATAULT IWOCL May 2015 1,000+ OpenCL projects SourceForge GitHub Google Code BitBucket 2 TUM.3D Virtual Wind Tunnel 10K C++ lines of code, 30 GPU kernels CUDA

More information