Analyzing Performance and Power of Applications on GPUs with Dell 12G Platforms. Dr. Jeffrey Layton Enterprise Technologist HPC
|
|
- Madlyn Bishop
- 5 years ago
- Views:
Transcription
1 Analyzing Performance and Power of Applications on GPUs with Dell 12G Platforms Dr. Jeffrey Layton Enterprise Technologist HPC
2 Why GPUs? GPUs have very high peak compute capability! 6-9X CPU Challenges How feed enough data? Need to port applications! Example: Tesla M2090 GPU Cores 512 Memory 6 GB Memory BW GB/s Peak Performance Single Precision 1331 GFLOPs Double Precision 665 GFLOPs Tesla M2090 GPU 2
3 How can they be used? M610x Inside the server Limited Space, few can fit Limited Power, few can run Difficult to replace Outside the server Pros: Flexibility, Multiple GPUs GPUs can be shared Multiple Host Servers Cons: Oversubscription may limit performance Host GPU C410x 3
4 The Problem: Best Design Parameters are Unknown How many GPUs per server is ideal for my application? How much bandwidth do I need per GPU for a typical users? How does performance scale with increasing number of nodes? How does performance scale with increasing number of GPUs/node? What problem size is most suitable for GPU computing? What is the impact on power consumption and performance/watt? Etc. Etc. Even if you know some of your design parameters, They may change due to improved GPUs, CPUs, GPU drivers, Software/Algorithm redesign etc. 4
5 GPU Enabled Product Portfolio
6 Overview GPU enabled products throughput the portfolio Learn on a laptop Develop/Test on workstations Production on servers 6
7 Laptops Learn GPU programming Buisness laptop: E6520 Nvidia NVS 4200M XPS 15: 48 CUDA cores 512MB memory Nvidia Geforce GT540M 96 CUDA cores 2GB memory 7
8 Workstations Develop/Tune Applications Portable Workstation M display (1920 x 1080) Up to 16GB memory Intel quad-core i7 (Ivy Bridge coming soon) Up to 3 hard drives Quadro 3000M 240 CUDA cores 2GB GDDR5 memory Quadro 5010M 96 CUDA cores 4GB GDDR5 memory T7500: Tower Case Dual-socket Intel Westmere (X56-- processors) Up to 192GB memory (12 DIMM slots) Two PCIe Gen2 x16 slots Five internal SATA or SAS drives RAID cards available GigE on-board, optional 10GigE cards 8
9 Rackable Workstation Develop/Tune R5500: 2U rackable workstation Dual-socket Westmere 12 DIMM slots (up to 192GB) Up to 5 SATA or 6 SAS drives (2.5 ) Two PCIe Gen2 x16 slots Tesla C2070 is an example 9
10 Dell M610x blade Half-height blade (5U) 2S Westmere 12 DIMM slots (192GB memory) Mezz card for QDR IB, 10GigE One double-wide GPU per blade 10
11 Dell PowerEdge C410x Power & Flexibility Basically, Room and board for 16 GPUs Theoretical Max. of 16.5 TFLOPs Connects up to 8 hosts Connects up to 16 PCIe Gen-2 devices (GPGPUs) to hosts High density, 3U chassis. Flexibility to selecting number of GPGPUs Individually serviceable modules N W Power supplies (3+1) N+1 92mm Cooling fans (7+1) PCIe switches 8 PEX PEX
12 Dell PowerEdge C410x Sixteen (16) x16 Gen-2 Modules - PCIe Gen-2 x16 compliant - Independently serviceable LED and On/Off GPU card Power connector for GPGPU card Board-to-board connector for X16 Gen 2 PCIe signals and power Dell Research Computing 1
13 Learn more about Dell PEC C410x How can you dynamically allocate GPUs to host nodes using the Dell PEC C410x? Learn more at session S0309 (Thursday 10:30 Room K) Dynamically Allocating GPGPU to Host Nodes (servers) 13
14 Dell PEC C6100: 4 2S in one chassis Four 2-Socket Nodes in 2U Intel Westmere-EP Each Node: 12 DIMMs each 2 GigE (Intel) 1 Daughter Card (PCIe x8) QDR IB or 10GigE One PCIe x16 (half-length, half-height) Optional SAS controller (in-place of IB) Chassis Design: Hot Plug, Individual Nodes Up to 12 x 3.5 drives (3 per node) Up to 24 x 2.5 drives (6 per node) N+1 Power supplies (1100W or 1400W) NVIDIA HIC certified 14
15 Dell PEC C6145: Two AMD 4S in 2U Two 4-Socket Nodes in 2U 4S AMD Opteron 6200 series Each Node: 32 x DDR3 RDIMMs 2 x GbE Intel 1 x8 Gen II (custom mezzanine slot) QDR IB or 10GigE 3 x16 Gen II (low-profile, half height/half-length) Chassis Design: Hot Plug, Individual Nodes 24 x 2.5 or 12 x 3.5 HDD Redundant Power supplies (1100W or 1400W) Embedded x16 HIC and slots for additional HICs 15
16 Dell PEC C6220 Four 2S Sandy Bridge in 2U Four 2-Socket Nodes in 2U Intel Sandy Bridge-EP Each Node: 16 DIMMs each 2 GigE (Intel) 1 Daughter Card (PCIe Gen 3 x8) FDR IB or QDR IB or 10GigE One PCIe G3 x16 (half-length, half-height) Two node version has two PCIe G3 x16 slots Optional SAS controller (in-place of IB) Chassis Design: Hot Plug, Individual Nodes Up to 12 x 3.5 drives or Up to 24 x 2.5 drives N+1 Power supplies (1100W or 1400W) NVIDIA HIC certified 16
17 Dell Power R720 First Standard Server with internal GPUs Two-socket Intel Sandy Bridge-EP 24 DIMM slots (up to 768GB) Dell Select Network Adapters: 4x GigE 2x10GigE + 2xGigE Intel or Broadcom 7 PCIe Gen 3 slots: Up to two internal GPUS (passive) PCIe G3 x8 slot for network adapter (e.g. FDR IB) Up to 4 front-access, hot-swap, PCIe drives Up to 16 drives HIC certified to work with Dell C410x 17
18 Performance and Power Measurements
19 Engineering Performing benchmarks/tests of various GPU applications: Different host nodes C6100, C6145 are called Dell 11G C6220 and R720 are called Dell 12G Different number of GPUs Internal and external GPUs Measured performance AND power during tests Goal is to understand how applications scale: Number and type of GPUs Host node configuration Develop best practices for GPU configurations 19
20 Applications HPL NAMD XFDTD 3D Oil Reservoir Simulation ANSYS Mechanical 20
21 Thanks!!!!! Dr. Saeed Iqbal Shawn Gao Onur Celebioglu Mark Fernandez, Glen Otero Nvidia Mass Fatica, Stan Posey, Peter Lillian, Bob Cravella, Travis Wells, et al 21
22 Dell 11G
23 Normalized Performance (GFLOPS) Nromalized Power (W) Normalized GFLOPS/W Dell PowerEdge C C410x HPL Normalized Results % 64.91% 56.33% 39.93% 18.18% % 64.91% 56.33% 39.93% 18.18% % 64.91% 56.33% 39.93% 18.18% CPU Only CPU + 1 CPU + 2 CPU + 4 CPU + 8 CPU Only CPU + 1 CPU + 2 CPU + 4 CPU + 8 CPU Only CPU + 1 CPU + 2 CPU + 4 CPU + 8 One HIC from host node to C410x Recommended no more than 2 GPUs per HIC 23 1-node PE C6100, Dual X5650@2.67GHz, 48GB, 1333MHz Memory; C410x has Nvidia s
24 Normalized Performance (GFLOPS) Normalized GFLOPS/W Dell PowerEdge C C410x HPL Normalized Results % % 45.11% 19.74% 32.02% 12.03% 1 HIC 2 HIC % 45.11% 35.85% 46.36% 19.74% 32.02% 12.03% 20.11% 1 HIC 2 HIC CPU Only CPU + 1 CPU + 2 CPU + 4 M207 CPU + 8 CPU Only CPU + 1 CPU + 2 CPU + 4 M207 CPU + 8 One or Two HICs from host node to C410x Recommend no more than 2 GPUs per HIC (1 per HIC is better) 24 1-node PE C6145, Four 6132HE@2.2GHz, 128GB, 1333MHz Memory; C410x has Nvidia s
25 Normalized Performance (day/ns) Normalized Power (W) Normalized Perf (1/days/ns)/W Dell PowerEdge C C410x NAMD Normalized Results CPU Only CPU + 1 CPU + 2 CPU + 4 CPU + 8 CPU Only CPU + 1 CPU + 2 CPU + 4 CPU + 8 CPU Only CPU + 1 CPU + 2 CPU + 4 CPU + 8 STMV data set (1M atoms) Two GPUs is best, 4 GPUs is also good (Perf/W). 1 HIC is good 25 1-node PE C6100, Dual X5650@2.67GHz, 48GB, 1333MHz Memory; C410x has Nvidia s
26 Normalized Performance (1/time) Dell PowerEdge C C410x XFDTD Normalized Performance GPUs are best 2-4 GPUs per x16 HIC CPU Only CPU + 1 CPU + 2 CPU + 4 CPU + 6 CPU Node PE C6100, Dual X5670@2.93GHz, 48GB, 1333MHz Memory; C410x has 16 GPUs
27 Normalized Performance (time) Normalized Power (W) Normalized Perf/W (Time*W) Dell PowerEdge C C410x 3D Oil Reservoir Simulation (Elastic Model) Normalized Results GPUs 2 GPUs 4 GPUs CPU CPU CPU CPU 8.28M/2 GPUS 16.57M/2 GPUs 33.15M/4 GPUs 8 GPUs 66.34M/8 GPUs Problem Size/Number of GPUs GPUS 2 GPUS 4 GPUs CPU CPU CPU CPU 8.28M/2 GPUS 16.57M/2 GPUs 33.15M/4 GPUs 8 GPUs 66.34M/8 GPUs Problem Size/Number of GPUs GPUs CPU CPU CPU CPU 2 GPUs 2 GPUs 8.28M/2 GPUS 16.57M/2 GPUs 33.15M/4 GPUs 8 GPUs 66.34M/8 GPUs Problem Size/Number of GPUs Multiple data sets Smaller data sets: 2 GPUs. Larger data sets: 4-8 GPUs (per x16 HIC) 27 Node PE C6100, Dual X5670@2.93GHz, 48GB, 1333MHz Memory; C410x has 16 GPUs
28 Dell 12G
29 Normalized Performance (GFLOPS) Normalized Power (W) Normalized GFLOPS/W Dell PowerEdge R720 HPL-Normalized Results % 60.2% % 60.2% % 67.8% 60.2% % % R720 CPU only R M2090 R M2090 R720 CPU only R M2090 R M2090 R720 CPU only R M2090 Internal GPUs 2 GPUs seems like a good configuration (individual x16 slot) R M R720, Dual Intel E5-2660@2.2GHz, 64GB, 1333MHz Memory (8x 8GB); two internal M2090 GPUs
30 GFLOPs/Watts Compare GFLOPS/watt (Cap 225W) Comparisons ( C6100 & R720) C6100 ( 2.93GHz ) R720 ( 2.7GHz ) R720 ( 2.2GHz ) M2090 GPUs with 225W power capping
31 Normalized Performance (GFLOPS) Normalized Power (W) Normalized GFLOPS/W Dell PowerEdge C C410x HPL - Normalized Results % % 58.7% 34.4% % 58.7% % 67.2% 58.7% 34.4% % % CPU only CPU+1 M2090 CPU+2 M2090 CPU+4 M2090 CPU only CPU+1 M2090 CPU+2 M2090 CPU+4 M2090 CPU only CPU+1 M2090 CPU+2 M2090 CPU+4 M2090 External GPUs with single HIC Two GPUs seems to be sweet spot (two GPUs with 1 x16 HIC) 31
32 Normalized Performance (GFLOPS) Normalized Performance (GFLOPS) Comparison of Internal vs. External HPL - Normalized Performance Not quite apples-to-apples (PCIe G2 to G3) % % 34.4% % % % % R720 CPU only R M2090 R M2090 CPU only CPU+1 M2090 CPU+2 M2090 CPU+4 M
33 Normalized GFLOPS/W Normalized GFLOPS/W Comparison of Internal vs. External HPL - Normalized GFLOPS/W Not quite apples-to-apples (PCIe G2 to G3) % 67.8% 60.2% % 67.2% 58.7% 34.4% R720 CPU only R M2090 R M2090 CPU only CPU+1 M2090 CPU+2 M2090 CPU+4 M2090 Internal GPUs are more effective (perf/w) but external are still very good 33
34 Normalized Performance (Time) Normalized Perf/W - (1/time)/W Dell PowerEdge R720 ANSYS Mechanical - Normalized Results 1 core Core 1 Core + 1 M Core 1 Core + 1 M V14cg-1V14sp-1V14sp-2V14sp-3V14sp-4V14sp-5V14sp-6 Massive speedup with GPU (up to 3.6x) but not all cases Perf/W (efficiency) can be quite good (2 times better) 34
35 Normalized Performance (Time) Normalized Perf/W - (1/time)/W Dell PowerEdge R720 ANSYS Mechanical - Normalized Results 2 cores Cores 2 Cores + 1 M Cores 2 Cores + 1 M Less speedup than 1 core case Perf/W (efficiency) is still very good 35
36 Normalized Perforamnce (Time) Normalized Perf/W - (1/T)/W Dell PowerEdge R720 ANSYS Mechanical - Normalized Results 4 cores Cores 4 Cores + 1 M Cores 4 Cores + 1 M V14cg-1V14sp-1V14sp-2V14sp-3V14sp-4V14sp-5V14sp-6 V14cg-1V14sp-1V14sp-2V14sp-3V14sp-4V14sp-5V14sp-6 Less speedup than 1 and 2 core cases 36
37 Normalized Performance (Time) Normalized Perf/W - (1/T)/W Dell PowerEdge R720 ANSYS Mechanical - Normalized Results 8 cores Cores 8 Cores + 1 M Cores + 2 M Cores 8 Cores + 1 M Added 2 GPU tests (not much impact on performance) Efficiency is not good except for 2 cases (less than 1) 37
38 Normalized Performance (Speedup) Normalized Perf/W - (1/T)/W Dell PowerEdge R720 ANSYS Mechanical - Normalized Results 16 cores Cores 16 Cores + 1 M Cores + 2 M Cores 16 Cores + 1 M2090 Very little speed improvement Efficiency with GPUs is worse than CPUs only 38
39 Normalized Performance (1/T) Normalized Perf/W - (1/T)/W Dell PowerEdge R720 ANSYS Mechanical Trends Choose last 3 cases (best usage of GPUs) V14sp V14sp V14sp-5 V14sp-6 CPU Only V14sp-5 V14sp-6 CPU Only Core 2 Cores 4 Cores 8 Cores 16 Cores 1 Core 2 Cores 4 Cores 8 Cores 16 Cores 39
40 ANSYS Mechanical observations As the number of cores increased: The GPU speedup decreases Efficiency decreases (at 16 cores it s not good) Cross-over point is 8 or 16 cores (8 is a good rule of thumb) Recommended configuration: Small CPU core count (no more than 4) with 1 GPU With 2 GPUs in node, you can run 2 cases at the same time (uses 8 cores) Performance varies with case (solver) 40
41 Summary Lots of options for GPU configurations which one is best? How do you define best? Performance? Power efficiency? Both? Answers vary (depend upon application) You don t have to have a dedicated x16 slot for each GPU for good performance and good efficiency Many applications shown here illustrate this Dell C410x allows GPU Direct for up to 8 GPUs Other systems do not allow this 41
42 Thanks! Questions?
System Design of Kepler Based HPC Solutions. Saeed Iqbal, Shawn Gao and Kevin Tubbs HPC Global Solutions Engineering.
System Design of Kepler Based HPC Solutions Saeed Iqbal, Shawn Gao and Kevin Tubbs HPC Global Solutions Engineering. Introduction The System Level View K20 GPU is a powerful parallel processor! K20 has
More informationDell Solution for High Density GPU Infrastructure
Dell Solution for High Density GPU Infrastructure 李信乾 (Clayton Li) 產品技術顧問 HPC@DELL Key partnerships & programs Customer inputs & new ideas Collaboration Innovation Core & new technologies Critical adoption
More informationArchitecting High Performance Computing Systems for Fault Tolerance and Reliability
Architecting High Performance Computing Systems for Fault Tolerance and Reliability Blake T. Gonzales HPC Computer Scientist Dell Advanced Systems Group blake_gonzales@dell.com Agenda HPC Fault Tolerance
More informationAccelerating high-performance computing with hybrid platforms
Accelerating high-performance computing with hybrid platforms October 2010 Dell THIS WHITE PAPER IS FOR INFORMATIONAL PURPOSES ONLY, AND MAY CONTAIN TYPOGRAPHICAL ERRORS AND TECHNICAL INACCURACIES. THE
More informationGame-changing Extreme GPU computing with The Dell PowerEdge C4130
Game-changing Extreme GPU computing with The Dell PowerEdge C4130 A Dell Technical White Paper This white paper describes the system architecture and performance characterization of the PowerEdge C4130.
More informationLAMMPS-KOKKOS Performance Benchmark and Profiling. September 2015
LAMMPS-KOKKOS Performance Benchmark and Profiling September 2015 2 Note The following research was performed under the HPC Advisory Council activities Participating vendors: Intel, Dell, Mellanox, NVIDIA
More informationGROMACS (GPU) Performance Benchmark and Profiling. February 2016
GROMACS (GPU) Performance Benchmark and Profiling February 2016 2 Note The following research was performed under the HPC Advisory Council activities Participating vendors: Dell, Mellanox, NVIDIA Compute
More informationNAMD Performance Benchmark and Profiling. January 2015
NAMD Performance Benchmark and Profiling January 2015 2 Note The following research was performed under the HPC Advisory Council activities Participating vendors: Intel, Dell, Mellanox Compute resource
More informationDell PowerEdge Servers Portfolio Guide
Dell PowerEdge Servers Portfolio Guide Dell PowerEdge Servers Purpose-Built for Reliability Virtualization-Enabled for an Efficient Infrastructure Intelligent, Connected Systems Managment With Dell you
More informationLS-DYNA Performance Benchmark and Profiling. April 2015
LS-DYNA Performance Benchmark and Profiling April 2015 2 Note The following research was performed under the HPC Advisory Council activities Participating vendors: Intel, Dell, Mellanox Compute resource
More informationSugon TC6600 blade server
Sugon TC6600 blade server The converged-architecture blade server The TC6600 is a new generation, multi-node and high density blade server with shared power, cooling, networking and management infrastructure
More informationHPC Hardware Overview
HPC Hardware Overview John Lockman III April 19, 2013 Texas Advanced Computing Center The University of Texas at Austin Outline Lonestar Dell blade-based system InfiniBand ( QDR) Intel Processors Longhorn
More informationPerformance Analysis of HPC Applications on Several Dell PowerEdge 12 th Generation Servers
Performance Analysis of HPC Applications on Several Dell PowerEdge 12 th Generation Servers This Dell technical white paper evaluates and provides recommendations for the performance of several HPC applications
More informationPART-I (B) (TECHNICAL SPECIFICATIONS & COMPLIANCE SHEET) Supply and installation of High Performance Computing System
INSTITUTE FOR PLASMA RESEARCH (An Autonomous Institute of Department of Atomic Energy, Government of India) Near Indira Bridge; Bhat; Gandhinagar-382428; India PART-I (B) (TECHNICAL SPECIFICATIONS & COMPLIANCE
More informationThe Cray CX1 puts massive power and flexibility right where you need it in your workgroup
The Cray CX1 puts massive power and flexibility right where you need it in your workgroup Up to 96 cores of Intel 5600 compute power 3D visualization Up to 32TB of storage GPU acceleration Small footprint
More informationANSYS Fluent 14 Performance Benchmark and Profiling. October 2012
ANSYS Fluent 14 Performance Benchmark and Profiling October 2012 Note The following research was performed under the HPC Advisory Council activities Special thanks for: HP, Mellanox For more information
More informationCPMD Performance Benchmark and Profiling. February 2014
CPMD Performance Benchmark and Profiling February 2014 Note The following research was performed under the HPC Advisory Council activities Special thanks for: HP, Mellanox For more information on the supporting
More informationGPU Clusters for High- Performance Computing Jeremy Enos Innovative Systems Laboratory
GPU Clusters for High- Performance Computing Jeremy Enos Innovative Systems Laboratory National Center for Supercomputing Applications University of Illinois at Urbana-Champaign Presentation Outline NVIDIA
More informationSTAR-CCM+ Performance Benchmark and Profiling. July 2014
STAR-CCM+ Performance Benchmark and Profiling July 2014 Note The following research was performed under the HPC Advisory Council activities Participating vendors: CD-adapco, Intel, Dell, Mellanox Compute
More informationIntel Xeon E v4, Optional Operating System, 8GB Memory, 2TB SAS H330 Hard Drive and a 3 Year Warranty
pe_r730_1356_a Datasheet Check its price: Click Here Overview adapts to virtually any workload with a scalable server featuring an optimal mix of memory, storage, processing and GPUs. This model is the
More informationFUJITSU Server PRIMERGY CX400 M4 Workload-specific power in a modular form factor. 0 Copyright 2018 FUJITSU LIMITED
FUJITSU Server PRIMERGY CX400 M4 Workload-specific power in a modular form factor 0 Copyright 2018 FUJITSU LIMITED FUJITSU Server PRIMERGY CX400 M4 Workload-specific power in a compact and modular form
More informationStan Posey, CAE Industry Development NVIDIA, Santa Clara, CA, USA
Stan Posey, CAE Industry Development NVIDIA, Santa Clara, CA, USA NVIDIA and HPC Evolution of GPUs Public, based in Santa Clara, CA ~$4B revenue ~5,500 employees Founded in 1999 with primary business in
More informationNAMD GPU Performance Benchmark. March 2011
NAMD GPU Performance Benchmark March 2011 Note The following research was performed under the HPC Advisory Council activities Participating vendors: Dell, Intel, Mellanox Compute resource - HPC Advisory
More informationDELL POWEREDGE SERVERS
DELL POWEREDGE SERVERS PRODUCT GUIDE INTRODUCING THE LATEST GENERATION OF POWEREDGE SERVERS User-Inspired Award-Winning Design Cost-Cutting Energy Smart Technology Exclusive Industry-Only Management Integrated
More informationHP GTC Presentation May 2012
HP GTC Presentation May 2012 Today s Agenda: HP s Purpose-Built SL Server Line Desktop GPU Computing Revolution with HP s Z Workstations Hyperscale the new frontier for HPC New HPC customer requirements
More informationCST STUDIO SUITE R Supported GPU Hardware
CST STUDIO SUITE R 2017 Supported GPU Hardware 1 Supported Hardware CST STUDIO SUITE currently supports up to 8 GPU devices in a single host system, meaning each number of GPU devices between 1 and 8 is
More informationUniversity at Buffalo Center for Computational Research
University at Buffalo Center for Computational Research The following is a short and long description of CCR Facilities for use in proposals, reports, and presentations. If desired, a letter of support
More informationHigh Performance Computing with Accelerators
High Performance Computing with Accelerators Volodymyr Kindratenko Innovative Systems Laboratory @ NCSA Institute for Advanced Computing Applications and Technologies (IACAT) National Center for Supercomputing
More informationExactly as much as you need.
Exactly as much as you need. Get IT All with PRIMERGY RX300 S6 & PRIMERGY RX200 S6 1 Copyright 2011 FUJITSU Agenda 1. Get IT All: The Offer 2. Dynamic Infrastructures 3. PRIMERGY Portfolio Overview 4.
More informationn N c CIni.o ewsrg.au
@NCInews NCI and Raijin National Computational Infrastructure 2 Our Partners General purpose, highly parallel processors High FLOPs/watt and FLOPs/$ Unit of execution Kernel Separate memory subsystem GPGPU
More informationAbout 2CRSI. OCtoPus Solution. Technical Specifications. OCtoPus servers. OCtoPus. OCP Solution by 2CRSI.
About 2CRSI OCtoPus Solution Technical Specifications OCtoPus servers OCtoPus OCP Solution by 2CRSI 1 About 2CRSI 3 OCtoPus Solution 4 Technical Specifications OCtoPus Rack Unique server design 6 7 OCtoPus
More informationDell PowerEdge R720xd with PERC H710P: A Balanced Configuration for Microsoft Exchange 2010 Solutions
Dell PowerEdge R720xd with PERC H710P: A Balanced Configuration for Microsoft Exchange 2010 Solutions A comparative analysis with PowerEdge R510 and PERC H700 Global Solutions Engineering Dell Product
More informationJohn Fragalla TACC 'RANGER' INFINIBAND ARCHITECTURE WITH SUN TECHNOLOGY. Presenter s Name Title and Division Sun Microsystems
TACC 'RANGER' INFINIBAND ARCHITECTURE WITH SUN TECHNOLOGY SUBTITLE WITH TWO LINES OF TEXT IF NECESSARY John Fragalla Presenter s Name Title and Division Sun Microsystems Principle Engineer High Performance
More informationUltimate performance and security in a 2U Form factor.
Ultimate performance and security in a 2U Form factor. PRECISION 7920 RACK Powerful performance Power through the most complex, demanding applications more quickly with a new generation of dual-socket
More informationGW2000h w/gw175h/q F1 specifications
Product overview The Gateway GW2000h w/ GW175h/q F1 maximizes computing power and thermal control with up to four hot-pluggable nodes in a space-saving 2U form factor. Offering first-class performance,
More informationHostEngine 5URP24 Computer User Guide
HostEngine 5URP24 Computer User Guide Front and Rear View HostEngine 5URP24 (HE5URP24) computer features Intel Xeon Scalable (Skylake FCLGA3647 socket) Series dual processors with the Intel C621 chipset.
More informationThe Why and How of Developing All-Flash Storage Server
The Why and How of Developing All-Flash Storage Server June 2016 Jungsoo Kim Manager, SK Telecom Agenda Why we care about All-Flash Storage Transforming to 5G Network Open HW & SW Projects @ SKT Our approaches
More informationData Sheet Fujitsu Server PRIMERGY CX250 S2 Dual Socket Server Node
Data Sheet Fujitsu Server PRIMERGY CX250 S2 Dual Socket Server Node Data Sheet Fujitsu Server PRIMERGY CX250 S2 Dual Socket Server Node Datasheet for Red Hat certification Standard server node for PRIMERGY
More informationDell PowerEdge server portfolio: platforms and solutions for enterprise applications
Dell PowerEdge server portfolio: platforms and solutions for enterprise applications Dell PowerEdge server portfolio: platforms and solutions for enterprise applications Next-generation PowerEdge technologies
More informationMemory Selection Guidelines for High Performance Computing with Dell PowerEdge 11G Servers
Memory Selection Guidelines for High Performance Computing with Dell PowerEdge 11G Servers A Dell Technical White Paper By Garima Kochhar and Jacob Liberman High Performance Computing Engineering Dell
More informationIntel Xeon E v4, Windows Server 2016 Standard, 16GB Memory, 1TB SAS Hard Drive and a 3 Year Warranty
pe_r430_11598_b Datasheet Check its price: Click Here Overview delivers peak 2-socket performance for HPC, web tech and infrastructure scale-out. R430 provides Intel Xeon processor E5-2600 v4 product family
More informationIntel Select Solutions for Professional Visualization with Advantech Servers & Appliances
Solution Brief Intel Select Solution for Professional Visualization Intel Xeon Processor Scalable Family Powered by Intel Rendering Framework Intel Select Solutions for Professional Visualization with
More informationAgenda. Sun s x Sun s x86 Strategy. 2. Sun s x86 Product Portfolio. 3. Virtualization < 1 >
Agenda Sun s x86 1. Sun s x86 Strategy 2. Sun s x86 Product Portfolio 3. Virtualization < 1 > 1. SUN s x86 Strategy Customer Challenges Power and cooling constraints are very real issues Energy costs are
More informationSU Dual and Quad-Core Xeon UP Server
SU4-1300 Dual and Quad-Core Xeon UP Server www.eslim.co.kr Dual and Quad-Core Server Computing Leader!! ESLIM KOREA INC. 1. Overview eslim SU4-1300 The ideal entry-level server Intel Xeon processor 3000/3200
More informationPart Number Unit Descriptions
Part Number Unit Descriptions 2582B2A System x3100m4 Simple Swap (SATA) Xeon 4C E3-1220v2 69W 3.1GHz/1600MHz/8MB Form factor Tower (can be a 4U rack form factor using the optional Tower-to-Rack Conversion
More informationHigh Performance Computing
21 High Performance Computing High Performance Computing Systems 21-2 HPC-1420-ISSE Robust 1U Intel Quad Core Xeon Server with Innovative Cable-less Design 21-3 HPC-2820-ISSE 2U Intel Quad Core Xeon Server
More informationAccelerating HPC. (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing
Accelerating HPC (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing SAAHPC, Knoxville, July 13, 2010 Legal Disclaimer Intel may make changes to specifications and product
More informationMILC Performance Benchmark and Profiling. April 2013
MILC Performance Benchmark and Profiling April 2013 Note The following research was performed under the HPC Advisory Council activities Special thanks for: HP, Mellanox For more information on the supporting
More informationDESERT STORM WS-TS700
Description: The Desert Storm WS-TS700 is designed for use as a high performance workstation, support for graphics cards and MIO audio cards, optimized audio performance, BIOS flashback and Q-code logger.
More informationAltos T310 F3 Specifications
Product overview The Altos T310 F3 delivers proactive management tools matched by best priceperformance technology ideal for SMB and branch office operations. This singlesocket tower server features an
More informationDell EMC PowerEdge server portfolio: platforms and solutions
Dell EMC PowerEdge server portfolio: platforms and solutions Servers are the bedrock of the modern appliance. With consistent, scalable and industry-leading design, Dell EMC servers can help you tackle
More informationAbout 2CRSI. OCtoPus Solution. Technical Specifications. OCtoPus. OCP Solution by 2CRSI.
About 2CRSI OCtoPus Solution Technical Specifications OCtoPus OCtoPus OCP Solution by 2CRSI 1 Remark: All specifications and photos are subject to change whitout notice. 2 About 2CRSI 5 OCtoPus Solution
More informationFujitsu PRIMERGY Servers Portfolio
Fujitsu Servers Portfolio Dynamic Infrastructures for workgroup, datacenter and cloud computing shaping tomorrow with you Higher IT efficiency and reduced total cost of ownership Fujitsu Micro and Tower
More informationHUAWEI Tecal X6000 High-Density Server
HUAWEI Tecal X6000 High-Density Server Professional Trusted Future-oriented HUAWEI TECHNOLOGIES CO., LTD. HUAWEI Tecal X6000 High-Density Server (X6000) High computing density The X6000 is 2U high and
More informationEssentials. Expected Discontinuance Q2'15 Limited 3-year Warranty Yes Extended Warranty Available
M&A, Inc. Essentials Status Launched Expected Discontinuance Q2'15 Limited 3-year Warranty Extended Warranty Available for Purchase (Select Countries) On-Site Repair Available for Purchase (Select Countries)
More informationOCTOPUS Performance Benchmark and Profiling. June 2015
OCTOPUS Performance Benchmark and Profiling June 2015 2 Note The following research was performed under the HPC Advisory Council activities Special thanks for: HP, Mellanox For more information on the
More informationWiRack19 - Computing Server. Wiwynn SV324G2. Highlights. Specification.
WiRack19 - Computing Server Wiwynn SV324G2 Inherits benefits from hyper-scale deployed OCP Leopard MB Front serviceable for quick configuration and deployment Tool-less, hot-swappable redundancy for easy
More informationPerformance Optimizations via Connect-IB and Dynamically Connected Transport Service for Maximum Performance on LS-DYNA
Performance Optimizations via Connect-IB and Dynamically Connected Transport Service for Maximum Performance on LS-DYNA Pak Lui, Gilad Shainer, Brian Klaff Mellanox Technologies Abstract From concept to
More informationUltimate performance and security in a 2U Form factor.
Ultimate performance and security in a 2U Form factor. PRECISION 7920 XL RACK When Stability Matters When you choose an OEM XL product, you get the stability, visibility and longevity you need from the
More informationTFLOP Performance for ANSYS Mechanical
TFLOP Performance for ANSYS Mechanical Dr. Herbert Güttler Engineering GmbH Holunderweg 8 89182 Bernstadt www.microconsult-engineering.de Engineering H. Güttler 19.06.2013 Seite 1 May 2009, Ansys12, 512
More informationOptimal BIOS settings for HPC with Dell PowerEdge 12 th generation servers
Optimal BIOS settings for HPC with Dell PowerEdge 12 th generation servers This Dell technical white paper analyses the various BIOS options available in Dell PowerEdge 12 th generation servers and provides
More informationAltos R320 F3 Specifications. Product overview. Product views. Internal view
Product overview The Altos R320 F3 single-socket 1U rack server delivers great performance and enterprise-level scalability in a space-saving design. Proactive management utilities effectively handle SMB
More informationIBM System x family brochure
IBM Systems and Technology Group System x IBM System x family brochure IBM System x rack and tower servers 2 IBM System x family brochure IBM System x servers Highlights IBM System x and BladeCenter servers
More informationSERVER TECHNOLOGY H. Server & Workstation Motherboards Server Barebones & Accessories
SERVER TECHNOLOGY 2018 2H Server & Workstation Motherboards Server Barebones & Accessories MOTHERBOARD We put our three decades of know-how in motherboard design at the service of cutting-edge server motherboards.
More informationAvid Configuration Guidelines Dell 3620 Workstation Tower & 3420 Workstation SFF Single Quad Core CPU Qualified for Software Only
Avid Configuration Guidelines Dell 3620 Workstation Tower & 3420 Workstation SFF Single Quad Core CPU Qualified for Software Only Page 1 of 12 1.) Dell 3620 Tower and 3420 SFF [Small Form Factor] AVID
More informationThe Dell Precision T3620 tower as a Smart Client leveraging GPU hardware acceleration
The Dell Precision T3620 tower as a Smart Client leveraging GPU hardware acceleration Dell IP Video Platform Design and Calibration Lab June 2018 H17415 Reference Architecture Dell EMC Solutions Copyright
More informationSuggested use: infrastructure applications, collaboration/ , web, and virtualized desktops in a workgroup or distributed environments.
The IBM System x3500 M4 server provides outstanding performance for your business-critical applications. Its energy-efficient design supports more cores, memory, and data capacity in a scalable Tower or
More informationMicrosoft SQL Server in a VMware Environment on Dell PowerEdge R810 Servers and Dell EqualLogic Storage
Microsoft SQL Server in a VMware Environment on Dell PowerEdge R810 Servers and Dell EqualLogic Storage A Dell Technical White Paper Dell Database Engineering Solutions Anthony Fernandez April 2010 THIS
More informationGPUs and Emerging Architectures
GPUs and Emerging Architectures Mike Giles mike.giles@maths.ox.ac.uk Mathematical Institute, Oxford University e-infrastructure South Consortium Oxford e-research Centre Emerging Architectures p. 1 CPUs
More information15 Jun 2012 Kevin Chang
TYAN/MiTAC 4U MICRO SERVER Product Overview 15 Jun 2012 Kevin Chang FM65 System Enclosure 19 rack mount 4U enclosure (H176mm x W440mm x D650mm) Support either (MFG option) Regular AC w/ redundancy DC 12V
More informationFujitsu VDI / vgpu Virtualization
Fujitsu VDI / vgpu Virtualization Antti Sirkiä Service Partner Manager, Certified Trainer Fujitsu, Product Business Unit Why Virtualization / Graphics Virtualization? :: GRAPHICS VIRTUALIZATION :: Multiple
More informationIBM eserver xseries. BladeCenter. Arie Berkovitch eserver Territory Manager IBM Corporation
BladeCenter Arie Berkovitch eserver Territory Manager 2006 IBM Corporation IBM BladeCenter What is a Blade A server on a card each Blade has its own: processor networking memory optional storage etc. IBM
More informationRECENT TRENDS IN GPU ARCHITECTURES. Perspectives of GPU computing in Science, 26 th Sept 2016
RECENT TRENDS IN GPU ARCHITECTURES Perspectives of GPU computing in Science, 26 th Sept 2016 NVIDIA THE AI COMPUTING COMPANY GPU Computing Computer Graphics Artificial Intelligence 2 NVIDIA POWERS WORLD
More informationMegaGauss (MGs) Cluster Design Overview
MegaGauss (MGs) Cluster Design Overview NVIDIA Tesla (Fermi) S2070 Modules Based Solution Version 6 (Apr 27, 2010) Alexander S. Zaytsev p. 1 of 15: "Title" Front view: planar
More informationHPE Scalable Storage with Intel Enterprise Edition for Lustre*
HPE Scalable Storage with Intel Enterprise Edition for Lustre* HPE Scalable Storage with Intel Enterprise Edition For Lustre* High Performance Storage Solution Meets Demanding I/O requirements Performance
More informationAll-Flash Storage System
All-Flash Storage System June 2016 Jungsoo Kim Manager, SK Telecom Agenda SKT Storage Solution R&D Introduction Our approaches in developing storage system AF-Media details Computing Board Storage Module
More informationNVIDIA GPU Computing Séminaire Calcul Hybride Aristote 25 Mars 2010
NVIDIA GPU Computing 2010 Séminaire Calcul Hybride Aristote 25 Mars 2010 NVIDIA GPU Computing 2010 Tesla 3 rd generation Full OEM coverage Ecosystem focus Value Propositions per segments Card System Module
More informationRepresentation of the interested Bidders / vendors. Form no. T2 (TECHNICAL MINIMUM SPECIFICATIONS)
Sr. no. Clause no./page No. Item & Specification in the tender Bidder / Vendor s representation Response to the Bidders Page No.12 1 Chassis: 5U Rack Mountable or Higher Please consider Minimum 2U Rack
More informationBroadberry. Artificial Intelligence Server for Fraud. Date: Q Application: Artificial Intelligence
TM Artificial Intelligence Server for Fraud Date: Q2 2017 Application: Artificial Intelligence Tags: Artificial intelligence, GPU, GTX 1080 TI HM Revenue & Customs The UK s tax, payments and customs authority
More informationDell PowerEdge server portfolio: platforms and solutions for enterprise applications
Dell PowerEdge server portfolio: platforms and solutions for enterprise applications Dell PowerEdge server portfolio: platforms and solutions for enterprise applications Next-generation PowerEdge server
More informationHostEngine 4U Host Computer User Guide
HostEngine 4U Host Computer User Guide HostEngine 4U computer features Intel Xeon E5-2600v4 (Broadwell) Series dual-processors with the Intel C612 chipset. HostEngine 4U provides four PCI Express (PCIe)
More informationPower Systems AC922 Overview. Chris Mann IBM Distinguished Engineer Chief System Architect, Power HPC Systems December 11, 2017
Power Systems AC922 Overview Chris Mann IBM Distinguished Engineer Chief System Architect, Power HPC Systems December 11, 2017 IBM POWER HPC Platform Strategy High-performance computer and high-performance
More informationLS-DYNA Performance Benchmark and Profiling. October 2017
LS-DYNA Performance Benchmark and Profiling October 2017 2 Note The following research was performed under the HPC Advisory Council activities Participating vendors: LSTC, Huawei, Mellanox Compute resource
More informationFujitsu Enterprise Product & Solution Facts
Fujitsu Enterprise Product & Solution Facts Servers PRIMERGY, SPARC Enterprise, PRIMEQUEST, BS2000/OSD Mainframes Storage ETERNUS for Flexible Data Management and Efficient Data Protection Solutions SAP,
More informationIBM System x3850 M2 servers feature hypervisor capability
IBM Europe Announcement ZG08-0161, dated March 25, 2008 IBM System x3850 M2 servers feature hypervisor capability Key prerequisites...2 Description...3 Product positioning... 7 Reference information...
More informationeslim SV Dual and Quad-Core Xeon Server Dual and Quad-Core Server Computing Leader!! ESLIM KOREA INC.
eslim SV7-2186 Dual and Quad-Core Xeon Server www.eslim.co.kr Dual and Quad-Core Server Computing Leader!! ESLIM KOREA INC. 1. Overview eslim SV7-2186 Server Dual and Quad-Core Intel Xeon Processors 4
More informationMaximize automotive simulation productivity with ANSYS HPC and NVIDIA GPUs
Presented at the 2014 ANSYS Regional Conference- Detroit, June 5, 2014 Maximize automotive simulation productivity with ANSYS HPC and NVIDIA GPUs Bhushan Desam, Ph.D. NVIDIA Corporation 1 NVIDIA Enterprise
More informationFEMAP/NX NASTRAN PERFORMANCE TUNING
FEMAP/NX NASTRAN PERFORMANCE TUNING Chris Teague - Saratech (949) 481-3267 www.saratechinc.com NX Nastran Hardware Performance History Running Nastran in 1984: Cray Y-MP, 32 Bits! (X-MP was only 24 Bits)
More informationNLVMUG 16 maart Display protocols in Horizon
NLVMUG 16 maart 2017 Display protocols in Horizon NLVMUG 16 maart 2017 Display protocols in Horizon Topics Introduction Display protocols - Basics PCoIP vs Blast Extreme Optimizing Monitoring Future Recap
More informationANSYS Improvements to Engineering Productivity with HPC and GPU-Accelerated Simulation
ANSYS Improvements to Engineering Productivity with HPC and GPU-Accelerated Simulation Ray Browell nvidia Technology Theater SC12 1 2012 ANSYS, Inc. nvidia Technology Theater SC12 HPC Revolution Recent
More informationHostEngine 3U Host Computer User Guide
HostEngine 3U computer features Intel Xeon 3.2GHz or lower-speed single- or dual-processor(s) with the Intel C602 chipset. HostEngine 3U provides four PCI Express (PCIe) Gen 3.0 x16 expansion slots. Each
More informationIBM System x family brochure
IBM Systems and Technology System x IBM System x family brochure IBM System x rack and tower servers 2 IBM System x family brochure IBM System x servers Highlights IBM System x and BladeCenter servers
More informationInspur AI Computing Platform
Inspur Server Inspur AI Computing Platform 3 Server NF5280M4 (2CPU + 3 ) 4 Server NF5280M5 (2 CPU + 4 ) Node (2U 4 Only) 8 Server NF5288M5 (2 CPU + 8 ) 16 Server SR BOX (16 P40 Only) Server target market
More informationHuawei Enterprise A Better Way. Huawei FusionServer X6800 Competitiveness Analysis
Huawei Enterprise A Better Way Huawei FusionServer X6800 Competitiveness Analysis Contents 1 Positioning and Selling Click Points to add Title 2 How to Beat Click to add Title 3 How to Defend 2 X6800:
More informationPedraforca: a First ARM + GPU Cluster for HPC
www.bsc.es Pedraforca: a First ARM + GPU Cluster for HPC Nikola Puzovic, Alex Ramirez We ve hit the power wall ALL computers are limited by power consumption Energy-efficient approaches Multi-core Fujitsu
More informationLAMMPSCUDA GPU Performance. April 2011
LAMMPSCUDA GPU Performance April 2011 Note The following research was performed under the HPC Advisory Council activities Participating vendors: Dell, Intel, Mellanox Compute resource - HPC Advisory Council
More informationAn Open, Standards Based Approach to Building Supercomputers
An Open, Standards Based Approach to Building Supercomputers HPC Advisory Council March 21-23, 2011 Lugano, Switzerland Reza Rooholamini Development Approach Deliver ROBUST, RELIABLE, and SCALABLE solutions
More informationHUAWEI Tecal X8000 High-Density Rack Server
HUAWEI Tecal X8000 High-Density Rack Server Professional Trusted Future-oriented HUAWEI TECHNOLOGIES CO., LTD. HUAWEI Tecal X8000 High-Density Rack Server (X8000) High density and innovative architecture
More informationGPU for HPC. October 2010
GPU for HPC Simone Melchionna Jonas Latt Francis Lapique October 2010 EPFL/ EDMX EPFL/EDMX EPFL/DIT simone.melchionna@epfl.ch jonas.latt@epfl.ch francis.lapique@epfl.ch 1 Moore s law: in the old days,
More informationThe Rise of Open Programming Frameworks. JC BARATAULT IWOCL May 2015
The Rise of Open Programming Frameworks JC BARATAULT IWOCL May 2015 1,000+ OpenCL projects SourceForge GitHub Google Code BitBucket 2 TUM.3D Virtual Wind Tunnel 10K C++ lines of code, 30 GPU kernels CUDA
More information