Performance and Energy Efficiency of the 14 th Generation Dell PowerEdge Servers

Similar documents
Performance and power efficiency of Dell PowerEdge servers with E v2

Intergenerational Energy Efficiency of Dell EMC PowerEdge Servers

WHITE PAPER FUJITSU PRIMERGY SERVERS PERFORMANCE REPORT PRIMERGY BX924 S2

Meet the Increased Demands on Your Infrastructure with Dell and Intel. ServerWatchTM Executive Brief

WHITE PAPER FUJITSU PRIMERGY SERVERS PERFORMANCE REPORT PRIMERGY BX920 S2

Accelerating storage performance in the PowerEdge FX2 converged architecture modular chassis

Dell EMC PowerEdge Server System Profile Performance Comparison

Dell PowerEdge R910 SQL OLTP Virtualization Study Measuring Performance and Power Improvements of New Intel Xeon E7 Processors and Low-Voltage Memory

WHITE PAPER FUJITSU PRIMERGY SERVERS PERFORMANCE REPORT PRIMERGY RX100 S7

FAST FORWARD TO YOUR <NEXT> CREATION

Intelligent servers- Lower TCO, Rapid ROI and More Performance

Consolidating OLTP Workloads on Dell PowerEdge R th generation Servers

WHITE PAPER FUJITSU PRIMERGY SERVERS PERFORMANCE REPORT PRIMERGY RX600 S6

Four-Socket Server Consolidation Using SQL Server 2008

A Comparative Study of Microsoft Exchange 2010 on Dell PowerEdge R720xd with Exchange 2007 on Dell PowerEdge R510

Performance Report PRIMERGY RX100 S6

vstart 50 VMware vsphere Solution Specification

Microsoft SQL Server in a VMware Environment on Dell PowerEdge R810 Servers and Dell EqualLogic Storage

EPYC VIDEO CUG 2018 MAY 2018

Performance Report PRIMERGY TX120 S2

IBM Power Systems solution for SugarCRM

WHITE PAPER FUJITSU PRIMERGY SERVERS PERFORMANCE REPORT PRIMERGY RX100 S7P

Microsoft SQL Server 2012 Fast Track Reference Architecture Using PowerEdge R720 and Compellent SC8000

WHITE PAPER FUJITSU PRIMERGY SERVERS PERFORMANCE REPORT PRIMERGY TX120 S3P

Comparing the Performance and Power of Dell VRTX and HP c3000

White Paper Fujitsu PRIMERGY Servers Performance Report PRIMERGY RX200 S8

Dell PowerEdge 11 th Generation Servers: R810, R910, and M910 Memory Guidance

Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Intel Xeon Processor E7 v2 Family-Based Platforms

PowerEdge 3250 Features and Performance Report

Competitive Power Savings with VMware Consolidation on the Dell PowerEdge 2950

Dell PowerEdge R720xd with PERC H710P: A Balanced Configuration for Microsoft Exchange 2010 Solutions

SAP SD Benchmark with DB2 and Red Hat Enterprise Linux 5 on IBM System x3850 M2

Microsoft SQL Server 2012 Fast Track Reference Configuration Using PowerEdge R720 and EqualLogic PS6110XV Arrays

Dell OpenManage Cluster Configurator on Dell PowerEdge VRTX

Fast-track Hybrid IT Transformation with Intel Data Center Blocks for Cloud

Dell Server Migration Utility (SMU)

Quad-Core Intel Xeon Processor-based 3200/3210 Chipset Server Platforms

HP ProLiant delivers #1 overall TPC-C price/performance result with the ML350 G6

Dell IT Assistant Migration To OpenManage Essentials

Fujitsu M10 Standard Benchmark Results Rev 3.6

HP ProLiant BladeSystem Gen9 vs Gen8 and G7 Server Blades on Data Warehouse Workloads

for Power Energy and

Dell Microsoft Business Intelligence and Data Warehousing Reference Configuration Performance Results Phase III

Using Dell Repository Manager to Find Newer Updates from the Dell Online Repository

April 2 nd, Bob Burroughs Director, HPC Solution Sales

HP on CUTTING EDGE with ProLiant BL460c G6 server blade

Dell Microsoft Reference Configuration Performance Results

Dell Guide to Server Benchmarks

InfoBrief. Dell 2-Node Cluster Achieves Unprecedented Result with Three-tier SAP SD Parallel Standard Application Benchmark on Linux

Impact of Dell FlexMem Bridge on Microsoft SQL Server Database Performance

Dell EMC SAP HANA Appliance Backup and Restore Performance with Dell EMC Data Domain

2 to 4 Intel Xeon Processor E v3 Family CPUs. Up to 12 SFF Disk Drives for Appliance Model. Up to 6 TB of Main Memory (with GB LRDIMMs)

Secure Login for SAP Single Sign-On Sizing Guide

Accelerating HPC. (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing

Newest generation of HP ProLiant DL380 takes #1 position overall on Oracle E-Business Suite Small Model Benchmark

Dell, Intel, and Red Hat

DELL EMC POWER EDGE R940 MAKES DE NOVO ASSEMBLY EASIER

Making the Cloud Work for You Introducing the Intel Xeon Processor E5 Family

5 Myths in Systems Performance

Dell Reference Configuration for Large Oracle Database Deployments on Dell EqualLogic Storage

Nehalem Hochleistungsrechnen für reale Anwendungen

Cost and Performance benefits of Dell Compellent Automated Tiered Storage for Oracle OLAP Workloads

SAP RDS 에최적화된 IBM Hardware IBM System x3850 X5 and x3690 X5: Workload Optimized Solution for SAP

unleashed the future Intel Xeon Scalable Processors for High Performance Computing Alexey Belogortsev Field Application Engineer

NVMe Over Fabrics: Scaling Up With The Storage Performance Development Kit

WHITE PAPER FUJITSU PRIMERGY SERVERS PERFORMANCE REPORT PRIMERGY RX350 S7

A Performance Characterization of Microsoft SQL Server 2005 Virtual Machines on Dell PowerEdge Servers Running VMware ESX Server 3.

Accelerating Microsoft SQL Server 2016 Performance With Dell EMC PowerEdge R740

TPC-E testing of Microsoft SQL Server 2016 on Dell EMC PowerEdge R830 Server and Dell EMC SC9000 Storage

Reduce Costs & Increase Oracle Database OLTP Workload Service Levels:

Performance Report PRIMERGY RX300 S5

Memory Population Guidelines for AMD EPYC Processors

Key Considerations for Improving Performance And Virtualization in Microsoft SQL Server Environments

Lenovo Enterprise Portfolio

HP ProLiant DL580 G5. HP ProLiant BL680c G5. IBM p570 POWER6. Fujitsu Siemens PRIMERGY RX600 S4. Egenera BladeFrame PB400003R.

Goro Watanabe. Bill King. OOW 2013 The Best Platform for Big Data and Oracle Database 12c. EVP Fujitsu R&D Center North America

Performance Comparisons of Dell PowerEdge Servers with SQL Server 2000 Service Pack 4 Enterprise Product Group (EPG)

Extremely Fast Distributed Storage for Cloud Service Providers

DataON and Intel Select Hyper-Converged Infrastructure (HCI) Maximizes IOPS Performance for Windows Server Software-Defined Storage

TECHNICAL OVERVIEW ACCELERATED COMPUTING AND THE DEMOCRATIZATION OF SUPERCOMPUTING

HPE ProLiant DL580 Gen10 Server

Maximizing Six-Core AMD Opteron Processor Performance with RHEL

Exchange Server 2007 Performance Comparison of the Dell PowerEdge 2950 and HP Proliant DL385 G2 Servers

DELL Reference Configuration Microsoft SQL Server 2008 Fast Track Data Warehouse

Intel and SAP Realising the Value of your Data

Munara Tolubaeva Technical Consulting Engineer. 3D XPoint is a trademark of Intel Corporation in the U.S. and/or other countries.

SAS Enterprise Miner Performance on IBM System p 570. Jan, Hsian-Fen Tsao Brian Porter Harry Seifert. IBM Corporation

Upgrade to Microsoft SQL Server 2016 with Dell EMC Infrastructure

Key results at a glance:

DELL EMC READY BUNDLE FOR VIRTUALIZATION WITH VMWARE AND FIBRE CHANNEL INFRASTRUCTURE

Performance Report PRIMERGY RX600 S4

Lawson M3 7.1 Large User Scaling on System i

Performance Report PRIMERGY BX620 S5

Dell EMC Ready Bundle for HPC Digital Manufacturing Dassault Systѐmes Simulia Abaqus Performance

IBM Power Systems Performance Report. POWER9, POWER8 and POWER7 Results

Milestone Solution Partner IT Infrastructure Components Certification Report

Deploy a High-Performance Database Solution: Cisco UCS B420 M4 Blade Server with Fusion iomemory PX600 Using Oracle Database 12c

HPE ProLiant ML350 Gen P 16GB-R E208i-a 8SFF 1x800W RPS Solution Server (P04674-S01)

Dell PowerEdge R920 System Powers High Performing SQL Server Databases and Consolidates Databases

Dell EMC Ready Bundle for HPC Digital Manufacturing ANSYS Performance

Transcription:

Performance and Energy Efficiency of the 14 th Generation Dell PowerEdge Servers This white paper details the performance improvements of Dell PowerEdge servers with the Intel Xeon Processor Scalable CPU family. Solutions Performance Analysis Dell Global Solutions Engineering

This document is for informational purposes only and may contain typographical errors and technical inaccuracies. The content is provided as is, without express or implied warranties of any kind. 2017 Dell Inc. All rights reserved. Dell and its affiliates cannot be responsible for errors or omissions in typography or photography. Dell, the Dell logo, and PowerEdge are trademarks of Dell Inc. Intel and Xeon are registered trademarks of Intel Corporation in the U.S. and other countries. Microsoft, Windows, and Windows SQL Server are either trademarks or registered trademarks of Microsoft Corporation in the United States and/or other countries. Other trademarks and trade names may be used in this document to refer to either the entities claiming the marks and names or their products. Dell disclaims proprietary interest in the marks and names of others. Members of Dell s Solutions Performance Analysis team contributing to this whitepaper included: Diego Esteves, Waseem Raja, Don Russell, Bruce Wagner, Robert Woolweaver and Juergen Zimmermann. Special thanks to Michael Hellen of Intel s Joint Innovation Center for his significant contribution. July 2017 Version 1.0 ii

Contents Executive summary... 5 Introduction... 5 Key findings... 5 Performance with Skylake-SP... 5 Arithmetic performance... 6 SPEC CPU2006 integer tests... 6 SPEC CPU2006 floating point tests... 7 High Performance Computing (HPC) performance tests... 8 Memory subsystem performance... 9 Energy Efficiency...10 Business Transaction Performance...11 SPECjbb2015...11 SAP-SD 2-Tier, Linux / Sybase...12 Summary...13 Appendix A Test configurations...14 Appendix B 14G PowerEdge Floating Point Operations per Second...15 Appendix C 14G PowerEdge System Memory Bandwidth...16 Appendix D- SPEC CPU 2006 throughput metric results across Xeon Processor Scalable Family...17 iii

Tables Table 1 Benchmark configurations...14 Figures Figure 1 Performance improvement running SPECint_rate_base2006... 6 Figure 2 Performance improvement running SPECfp_rate_base2006... 7 Figure 3 Performance improvement running Linpack... 8 Figure 4 Performance improvement running STREAM... 9 Figure 5 Energy Efficiency improvement running SPECpower_ssj2008...10 Figure 6 Performance improvement running SPECjbb2015...11 Figure 7 Performance improvement running SAP SD 2-Tier, Linux / Sybase...12 Figure 8 Linpack results for the Xeon Processor Scalable family...15 Figure 9 Stream results for the Xeon Processor Scalable family...16 Figure 10 SPECcpu2006 integer results for the Xeon Processor Scalable family...17 Figure 11 SPECcpu2006 floating point results for the Xeon Processor Scalable family...17 iv

Executive summary Introduction Dell's 14 th generation PowerEdge servers are now available with the Intel Xeon Processor Scalable family, code named Skylake-SP. These new Xeon processors feature up to 28 cores, 38.5 MB of last level caching and (6) 2666 MT/s DDR4 memory channels. In order to show customers the performance and energy efficiency uplifts possible from the new PowerEdge products, Dell s Solutions Performance Analysis team performed a series of benchmarks and compared the results to those previously obtained from PowerEdge 13G servers equipped with both the Xeon E5-2600 v4 (Broadwell-EP) and Xeon E5-2600 v3 (Haswell-EP) CPU families. Based on these results, PowerEdge 14G servers with new Skylake-SP family processors performed up to 102% better as compared to their direct predecessor 13G servers equipped with Broadwell-EP family processors. Key findings Performance with Skylake-SP PowerEdge 14G servers with two Xeon Platinum 8180 processors delivered up to 50% higher throughput using the comprehensive SPECcpu2006 integer suite. PowerEdge 14G servers with two Xeon Platinum 8180 processors delivered up to 64% higher throughput using the comprehensive SPECcpu2006 floating point suite. PowerEdge 14G servers with two Xeon Platinum 8180 processors produced up to 102% higher floating-point operations per second using the popular Linpack high performance computing metric. PowerEdge 14G servers with two Xeon Platinum 8180 processors and 2 DIMMS per available channel produced up to 69% higher sustainable memory bandwidth according to the STREAM benchmark. The PowerEdge 14G 4 socket server with four Xeon Platinum 8180 processors demonstrated up to 78% higher energy efficiency than its 13G 4 socket predecessor. PowerEdge 14G servers with two Xeon Platinum 8180 processors achieved up to a 44% higher score on the SAP-SD two-tier business transaction benchmark. 5

Arithmetic performance SPEC CPU2006 integer tests The widely referenced SPEC CPU2006 benchmark is described on SPEC.org as: CPU2006 is SPEC's computer server industry standard benchmark suite, stressing a system's processor, memory subsystem and compiler. SPEC designed CPU2006 to provide a comparative measure of compute-intensive performance across the widest practical range of hardware based upon the aggregate score of 12 integer and 17 floating point real-world applications. The integer portion of the benchmark is particularly good at measuring a server s ability to run general business applications. In figure 1, we see 14G s Xeon Platinum 8180 achieve a 50% improvement 1 in SPECint_rate over the best 13G result. Figure 1 Two socket Platform Performance improvement running SPECint_rate_base2006 See the Appendix D for comparative SPECint_rate_base2006 results across the full Skylake-SP CPU stack. 1 SPEC and SPECcpu are registered trademarks of Standard Performance Evaluation Corporation. The performance described is based upon results posted at http://www.spec.org as of July 11, 2017. 6

SPEC CPU2006 floating point tests The throughput or rate of a machine carrying out floating point arithmetic is important to those working today s biggest problems in science and engineering. In figure 2, we see 14G s Xeon Platinum 8180 achieve a 64% improvement 2 in SPECfp_rate over the best 13G result. Figure 2 Two socket Platform Performance improvement running SPECfp_rate_base2006 See the Appendix D for comparative SPECfp_rate_base2006 results across the full Skylake-SP CPU stack 2 SPEC and SPECcpu are registered trademarks of Standard Performance Evaluation Corporation. The performance described is based upon results posted at http://www.spec.org as of July 11, 2017. 7

High Performance Computing (HPC) performance tests The widely-available LINPACK benchmark is the standard for illustrating a system s heavy math floating point processing power needed for simulating natural phenomena, analyzing structures and machine learning. Thanks primarily to Skylake-SP s higher core count, new AVX-512 vector engines, improvements in IPC (instructions per clock cycle) and 2666 MT/s DDR4 memory support, the Xeon Platinum 8180 delivered an amazing 102% performance improvement over the Xeon E5-2699 v4, as seen in the following figure. Figure 3 Performance improvement running Linpack See the Appendix B for Linpack results across the full Skylake-SP CPU family 8

Memory subsystem performance Cloud and in-memory database applications benefit from greater memory bandwidth. Dell 14G PowerEdge servers with select SKL- SP processor models support up to (48) DDR4 DIMMs running at a speed of 2666 MT/s. This is up from the 2400 MT/s limit of 13G PowerEdge and is also the case when every possible DIMM socket is populated. In figure 4, the server industry-standard STREAM benchmark illustrates a 69% improvement in total system memory bandwidth performance over the best Broadwell result thanks to Skylake s faster memory interface and the addition of two additional channels (6 total). Figure 4 Performance improvement running STREAM See the Appendix C for STREAM results across the full SKL-SP CPU family 9

Energy Efficiency SPECpower_ssj2008 is an industry standard benchmark created by the Standard Performance Evaluation Corporation (SPEC ) to measure a server s power and performance across its full range of utilization levels from 100% to idle. As figure 5 shows, the Dell PowerEdge R940 with four of the new Xeon-SP 8180 processors with their higher core counts and new power management features is capable of 78% higher performance per watt than was possible on the R930 3. Figure 5 Four Socket Platform Energy Efficiency improvement running SPECpower_ssj2008 3 Required SPEC disclosure information: R940/8180 scores: (11,546,769 ssj_ops and 946W) @ 100% target load and 11,667 overall ssj_ops/watt vs. R930/E7-8880v3: (5,968,583 ssj_ops and 815W) @ 100% and 6,542 overall ssj_ops/watt. Comparison based on results by Dell Labs July 2017. SPEC and the benchmark name SPECpower_ssj are registered trademarks 10

Business Transaction Performance SPECjbb2015 According to the SPEC website: This benchmark models a Java-based business application for a worldwide supermarket company with an IT infrastructure that handles a mix of point-of-sale requests, online purchases and data-mining operations. It exercises the latest data formats (XML), communication using compression and messaging with security in a virtualized cloud computing environment. As figure 6 shows, a Dell PowerEdge R740 with a pair of the new Xeon-SP 8180 processors and accompanying 2666M memory provides 41% more Java operations per second than the R730 last refreshed in May 2016 4 Figure 6 Two Socket Platform Performance improvement running SPECjbb2015 4 SPEC and SPECjbb are registered trademarks of Standard Performance Evaluation Corporation. The performance described is based upon results posted at http://www.spec.org as of July 11, 2017. 11

SAP-SD 2-Tier, Linux / Sybase The (Sales and Distribution) benchmark is described on the SAP web site as: The Sales and Distribution (SD) Benchmark covers a sell-from-stock scenario, which includes the creation of a customer order with five line items and the corresponding delivery with subsequent goods movement and invoicing. The SAP-SD Two-Tier benchmark s primary metric is the Number of Benchmark Users. As figure 7 shows, the PowerEdge R740 with Xeon Platinum 8180 achieved an acknowledged world record result being 44% higher 5 than from the PowerEdge R730 with Xeon E5-2699A v4, itself a world record 6 Figure 7 Performance improvement running SAP SD 2-Tier, Linux / Sybase 5 Results of the Dell PowerEdge R740 on the two-tier SAP SD standard application benchmark: 32,085 SAP SD benchmark users with the SAP enhancement package 5 for SAP ERP 6.0, Red Hat Enterprise Linux 7.3, and Sybase ASE 16.0, 2 x Intel Xeon Platinum 8180 processors (56 cores, 112 threads), 768 GB main memory. Certification number 2017017. http://www.sap.com/benchmark 6 Results of the Dell PowerEdge R730 on the two-tier SAP SD standard application benchmark: 22,222 SAP SD benchmark users with the SAP enhancement package 5 for SAP ERP 6.0, Red Hat Enterprise Linux 7.2, and Sybase ASE 16, 2 x Intel Xeon E5-2699A v4 processors (44 cores, 88 threads), 512 GB main memory. Certification number 2016050. 12

Summary PowerEdge 14G servers with the new Skylake-SP processor models, can provide even more performance from all representative scientific and business transaction workloads. Thanks to both the CPU family s additional compute resources and Dell s Energy Smart implementation; this additional performance comes with the same or even less electricity required or waste heat generated making possible capacity growth, infrastructure reduction and lower total cost of ownership over the life of the product. The Xeon Processor Scalable family is available for purchase on all PowerEdge 14G servers. The enhanced performance of the 14G server lineup continues the PowerEdge tradition of delivering the maximum performance today's datacenter administrators demand. 13

Appendix A Test configurations Table 1 Benchmark configurations Benchmark Processor quantity Skylake-SP4 family processor DIMM quantity DIMM specifications SPECint_rate_base2006 2 Xeon Platinum 8180 24 16 GB dual rank 2666 MT/s registered DIMMs SPECfp_rate_base2006 2 Xeon Platinum 8180 24 16 GB dual rank 2666 MT/s registered DIMMs Linpack 2 Xeon Platinum 8180 24 16 GB dual rank 2666 MT/s registered DIMMs STREAM 2 Xeon Platinum 8180 24 16 GB dual rank 2666 MT/s registered DIMMs SPECpower_ssj2008 4 Xeon Platinum 8180 24 16 GB dual rank 2666 MT/s registered DIMMs SPECjbb2015 2 Xeon Platinum 8180 24 16 GB dual rank 2666 MT/s registered DIMMs SAP SD Two-Tier, Linux 2 Xeon Platinum 8180 24 32 GB dual rank 2666 MT/s registered DIMMs 14

Appendix B 14G PowerEdge Floating Point Operations per Second Figure 8 Linpack results for the Skylake-SP family in a 2 socket platform 15

Appendix C 14G PowerEdge System Memory Bandwidth Figure 9 Stream results for the Skylake-SP family in a 2 socket platform 16

Appendix D- SPEC CPU 2006 throughput metric results across Xeon Processor Scalable Family 7 Figure 10 Integer Workloads Figure 11 Floating Point Workloads 7 SPEC and SPECcpu are registered trademarks of Standard Performance Evaluation Corporation. The performance described is based upon results posted at http://www.spec.org as of July 11, 2017. 17