Dell EMC NUMA Configuration for AMD EPYC (Naples) Processors

Similar documents
PowerEdge NUMA Configurations with AMD EPYC Processors

NUMA Topology for AMD EPYC Naples Family Processors

Handling the Firmware Update Dependency by using Lifecycle Controller in the 14th generation Dell EMC PowerEdge servers

Dell EMC Ready Solution for VMware vcloud NFV 3.0 OpenStack Edition Platform

Dell EMC Ready Solution for VMware vcloud NFV 3.0 OpenStack Edition Platform

Dell PowerEdge 11 th Generation Servers: R810, R910, and M910 Memory Guidance

Best Practices for Deploying a Mixed 1Gb/10Gb Ethernet SAN using Dell Storage PS Series Arrays

Dell EMC SC Series Storage with SAS Front-end Support for VMware vsphere

Changing default password of root user for idrac9 by using Dell EMC License Manager

EPYC VIDEO CUG 2018 MAY 2018

Dell EMC Storage Manager Scalability Solutions

Dell EMC Ready Bundle for HPC Digital Manufacturing Dassault Systѐmes Simulia Abaqus Performance

Changing unique password of root user for idrac9 by using Dell EMC License Manager

Memory Population Guidelines for AMD EPYC Processors

Microsoft Windows 2016 Mellanox 100GbE NIC Tuning Guide

Boot Mode Considerations: BIOS vs. UEFI

Dell EMC Ready Solution for VMware vcloud NFV 3.0 OpenStack Edition Platform

Understanding idrac Quick Sync 2

Linux Network Tuning Guide for AMD EPYC Processor Based Servers

Boot and Network Configuration Deployment using Server Template with Dell EMC OpenManage Essentials (OME)

Linux Network Tuning Guide for AMD EPYC Processor Based Servers

Dell EMC Ready Bundle for HPC Digital Manufacturing ANSYS Performance

CAUTIONARY STATEMENT This presentation contains forward-looking statements concerning Advanced Micro Devices, Inc. (AMD) including, but not limited to

TPC-E testing of Microsoft SQL Server 2016 on Dell EMC PowerEdge R830 Server and Dell EMC SC9000 Storage

Side-by-side comparison of the features of Dell EMC idrac9 and idrac8

Accelerating Microsoft SQL Server 2016 Performance With Dell EMC PowerEdge R740

Upgrade to Microsoft SQL Server 2016 with Dell EMC Infrastructure

DELL EMC READY BUNDLE FOR VIRTUALIZATION WITH VMWARE AND FIBRE CHANNEL INFRASTRUCTURE

NVMe Performance Testing and Optimization Application Note

DELL Reference Configuration Microsoft SQL Server 2008 Fast Track Data Warehouse

OpenStack and Hadoop. Achieving near bare-metal performance for big data workloads running in a private cloud ABSTRACT

Dell SC Series Snapshots and SQL Server Backups Comparison

Dell PowerEdge R720xd with PERC H710P: A Balanced Configuration for Microsoft Exchange 2010 Solutions

Dell PowerVault Network Attached Storage (NAS) Systems Running Windows Storage Server 2012 Troubleshooting Guide

Dell EMC PowerEdge R6415

PowerEdge FX2 - Upgrading from 10GbE Pass-through Modules to FN410S I/O Modules

Dell EMC PowerEdge Systems SUSE Linux Enterprise Server 15. Installation Instructions and Important Information

DELL EMC READY BUNDLE FOR VIRTUALIZATION WITH VMWARE AND ISCSI INFRASTRUCTURE

Dell EMC Networking S4148-ON and S4128-ON

Dell EMC Networking S4148-ON and S4128-ON

Dell EMC PowerEdge R7415

HyperTransport Technology

Huawei CloudEngine 6850

VMware Infrastructure Update 1 for Dell PowerEdge Systems. Deployment Guide. support.dell.com

AMD EPYC Processors Showcase High Performance for Network Function Virtualization (NFV)

Atos ARM solutions for HPC

Quick Boot on Dell EMC PowerEdge servers

Exchange Server 2007 Performance Comparison of the Dell PowerEdge 2950 and HP Proliant DL385 G2 Servers

iscsi Boot from SAN with Dell PS Series

VMware Infrastructure Update 1 for Dell PowerEdge Systems. Deployment Guide. support.dell.com

Scale out a 13th Generation XC Series Cluster Using 14th Generation XC Series Appliance

Hyper-converged storage for Oracle RAC based on NVMe SSDs and standard x86 servers

Accelerating Microsoft SQL Server Performance With NVDIMM-N on Dell EMC PowerEdge R740

SUN CUSTOMER READY HPC CLUSTER: REFERENCE CONFIGURATIONS WITH SUN FIRE X4100, X4200, AND X4600 SERVERS Jeff Lu, Systems Group Sun BluePrints OnLine

Dell EMC PowerEdge R740xd as a Dedicated Milestone Server, Using Nvidia GPU Hardware Acceleration

Huawei CloudEngine 6850

Dell EMC PowerEdge and BlueData

VMware Infrastructure 3.5 for Dell PowerEdge Systems. Deployment Guide. support.dell.com

Altair OptiStruct 13.0 Performance Benchmark and Profiling. May 2015

Avid Configuration Guidelines Dell R7920 Rack workstation Dual 8 to 28 Core CPU System

Dell PowerEdge VRTX Storage Subsystem Compatibility Matrix

Dell EMC SC Series and Active Directory Integration

The Dell Precision T3620 tower as a Smart Client leveraging GPU hardware acceleration

Open storage architecture for private Oracle database clouds

SAP HANA on Dell EMC XC Appliances and XC Core Systems Best Practices Guide

POWEREDGE RACK SERVERS

Authors. Punita. Rajeswari Ayyaswamy. Lokesh Thutaram. Sanjeev Dambal. September, 2017

Cisco UCS M5 Blade and Rack Servers

Dell EMC Networking S6010-ON

DELL EMC READY BUNDLE FOR VIRTUALIZATION WITH VMWARE VSAN INFRASTRUCTURE

Maximizing Six-Core AMD Opteron Processor Performance with RHEL

VMware Infrastructure 3.5 Update 2 for Dell PowerEdge Systems. Deployment Guide. support.dell.com

OpenManage Essentials Managing Firmware and Driver Compliance by using Multiple Baselines

Family 15h Models 10h-1Fh AMD Athlon Processor Product Data Sheet

Dell EMC NX3340 and NX3240 Network Attached Storage System using Windows Storage Server Troubleshooting Guide

Dell EMC PowerEdge R7425

GLOBAL [Change] Input Product Model. Products Support News & Events Where to Buy FOX Club About FOXCONN. Home Products Motherboards Socket AM3 A74GA

Avid Configuration Guidelines Dell R7920 Rack workstation Dual 8 to 28 Core CPU System

SGI UV 300RL for Oracle Database In-Memory

Dell Reference Configuration for Large Oracle Database Deployments on Dell EqualLogic Storage


Dell PowerEdge M620 Systems (For Dell PowerEdge VRTX Enclosure) Owner's Manual

DR. LISA SU

Application Note. A Performance Comparison of Dialogic DM3 Media Boards for PCI Express and PCI

Reference Architecture Microsoft Exchange 2013 on Dell PowerEdge R730xd 2500 Mailboxes

VMAX3 AND VMAX ALL FLASH WITH CLOUDARRAY

Avid Configuration Guidelines HP Z640 Dual 8-Core / Dual 10-Core / Dual 12-Core CPU Workstation

The PowerEdge M830 blade server

Virtualized SQL Server Performance and Scaling on Dell EMC XC Series Web-Scale Hyper-converged Appliances Powered by Nutanix Software

Dell EMC PowerEdge Installation, Management and Diagnostics

(Please refer "CPU Support List" for more information.) (Please refer "Memory Support List" for more information.)

Dell EMC OpenManage Deployment Toolkit for

VIA ProSavageDDR KM266 Chipset

Customer Replacement Procedure

Technical Specifications

MD711-SU Installation Guide

Six-Core AMD Opteron Processor

Dell PowerEdge R920 System Powers High Performing SQL Server Databases and Consolidates Databases

Cisco Nexus 5648Q. Switch Configuration Guide for Dell EMC SC Series SANs

Server-Related Policy Configuration

Transcription:

Dell EMC NUMA Configuration for AMD EPYC (Naples) Processors Dell Engineering February 2018 A Dell EMC Deployment and Configuration Guide

Revisions Date February 2018 Description Initial release The information in this publication is provided as is. Dell Inc. makes no representations or warranties of any kind with respect to the information in this publication, and specifically disclaims implied warranties of merchantability or fitness for a particular purpose. Use, copying, and distribution of any software described in this publication requires an applicable software license. Copyright 2018 Dell Inc. or its subsidiaries. All Rights Reserved. Dell, EMC, and other trademarks are trademarks of Dell Inc. or its subsidiaries. Other trademarks may be the property of their respective owners. Published in the USA [4/16/2018] [Deployment and Configuration Guide] Dell believes the information in this document is accurate as of its publication date. The information is subject to change without notice. 2 Dell EMC NUMA Configuration for AMD EPYC (Naples) Processors version 1.0

Table of contents Revisions... 2 Executive summary... 4 1 AMD EPYC Architecture... 5 1.1 Zeppelin Die Layout... 5 1.2 Memory Interleaving... 6 1.2.1 Memory Interleaving Rules... 6 1.2.2 NUMA Domains per Memory Interleave Option... 6 2 Performance Tuning... 8 2.1 Memory DIMM Population Guidelines... 8 2.2 PCIe Configuration Guidelines... 9 3 BIOS Setup... 10 4 Platform Specific NUMA/Die Domain Details... 11 5 Technical support and resources... 13 5.1 Dell... 13 5.2 AMD... 13 3 Dell EMC NUMA Configuration for AMD EPYC (Naples) Processors version 1.0

Executive summary With the introduction of AMD s EPYC (Naples) x86 Server CPUs featuring four Zeppelin dies per package there is a need to clarify how AMD s new silicon design establishes Non-Uniform Memory Access (NUMA) domains across dies and sockets. The goal of this Dell EMC Deployment and Configuration Guide is demonstrate how Dell EMC Servers leverages AMD s EPYC CPUs to configured NUMA domains for optimal performance by using Dell EMC BIOS Settings. 4 Dell EMC NUMA Configuration for AMD EPYC (Naples) Processors version 1.0

1 AMD EPYC Architecture 1.1 Zeppelin Die Layout AMD EPYC is a Multi-Chip Module (MCM) processor and per silicon package there are four Zeppelin SOCs/dies leveraged from AMD Ryzen. Each of the four dies have direct Infinity Fabric connections to each of the other dies as well as a possible socket-to-socket interconnect. This design allows, at most, four NUMA nodes per socket or eight NUMA nodes in a dual sockets system AMD EPYC processor s four dies each have two Unified Memory Controllers (UMC), that each control one DDR channel with two DIMMs per channel, along with one controller for IO, as shown in Figure 1 below: IO IO Channel 0 UMC 0 UMC 0 Channel 2 DIE 0 DIE 1 Channel 1 UMC 1 UMC 1 Channel 3 Channel 4 UMC 0 UMC 0 Channel 6 DIE 2 DIE 3 Channel 5 UMC 1 UMC 1 Channel 7 IO IO Figure 1 Zeppelin Die Layout 5 Dell EMC NUMA Configuration for AMD EPYC (Naples) Processors version 1.0

1.2 Memory Interleaving The Memory Interleave feature for AMD EPYC processors is what controls how may NUMA domains are generated. AMD EPYC processors support 4 memory interleaving options. Each option becomes available based on system configuration. Socket Interleaving (2 processor configurations) Die Interleaving Channel Interleaving Memory Interleaving disabled 1.2.1 Memory Interleaving Rules The following are the rules for each memory interleave option: The system can socket interleave, but only if all channels in the entire system have the same amount of memory. Die interleaving must be enabled as well. The system can die interleave, but only if all channels on the socket have the same amount of memory. Channel interleaving must be enabled as well. The system can channel interleave as long as both channels have at least one DIMM. The channels do not have to be symmetrical. This is the default configuration. No interleave at all, where each channel is stacked on top of the previous channel. However, it should be noted that probe filter performance may be affected if there is one UMC with less memory than the other UMC on the same die. 1.2.2 NUMA Domains per Memory Interleave Option AMD s new silicon architecture adds nuances on how to configure platforms for NUMA. The focus of AMDs scheme to NUMA lies within its quad-die layout and its potential to have four NUMA domains. Socket Interleaving is the only memory interleave option meant for inter-socket memory interleaving, and is only available with 2-processor configurations. In this configuration memory across both sockets will be seen as a single memory domain producing a non-numa configuration. Die Interleaving is the intra-socket memory interleave option that creates one NUMA domain for all the four dies on a socket. In a 2-processor configuration this will produce two NUMA domains, one domain pertaining to each socket providing customers with the first option for NUMA configuration. In a 1-processor configuration die interleaving will be the maximum option for memory interleaving, and will produce one memory domain thus producing a non-numa configuration. Channel Interleaving is the intra-die memory interleave option and is the default setting for Dell EMC platforms. With channel interleaving the memory behind each UMC will be interleaved and seen as 1 NUMA domain per die. This will generate four NUMA domains per socket. Memory Interleave disabled - When memory interleave is disable 4 NUMA nodes will be seen as in the case for channel interleaving but the memory will not be interleaved yet stacked next to one another. 6 Dell EMC NUMA Configuration for AMD EPYC (Naples) Processors version 1.0

NUMA Domain Count per Memory Interleave Option Number of Processors Socket Interleave NUMA Domains Die Interleave NUMA Domains Channel Interleave NUMA Domains 2 1 2 8 1 NA 1 4 7 Dell EMC NUMA Configuration for AMD EPYC (Naples) Processors version 1.0

2 Performance Tuning For best performance from AMD EPYC processors, it is recommended that each die have one DIMM populated on each channel. This allows all IO behind each die to access memory, with optimal latency. 2.1 Memory DIMM Population Guidelines Populate empty channels, with the same type/capacity of DIMMs, before populating 2 DIMMs on a given channel Recommendations for best performance: o 1 DIMM per channel dedicates full memory bandwidth o Populating 2 DIMMs per channel will increase capacity but will lower the clock speed, resulting in lower memory bandwidth. There is a dependency between memory speed and the bandwidth of the Infinity Fabric Memory Bus Speed to Infinity Fabric Bud Speed Memory Bus Speed Infinity Fabric Speed Infinity Fabric Speed (Die to Die within CPU socket) (Socket-to-Socket) 2666 MT/s 5.3 GT/s 10.6 GT/s 2400 MT/s 4.8 GT/s 9.6 GT/s 2133 MT/s 4.2 GT/s 8.5 GT/s 1866 MT/s 3.7 GT/s 7.4 GT/s Minimum recommended: o At least 1 DIMM is per die in the system for a total of 4 DIMM per CPU On Dell EMC platforms populate DIMM 1 first. (white slots in Figure 4, below) A 2 socket system (2 CPUs are populated) will need equivalent memory configurations on both CPUs for optimal performance. 8 Dell EMC NUMA Configuration for AMD EPYC (Naples) Processors version 1.0

D1 D0 D1 D0 D1 D0 D1 D0 CPU D0 D1 D0 D1 D0 D1 D0 D1 DIMM Layout 2.2 PCIe Configuration Guidelines When PCIe cards are populated into particular slots with NUMA-unaware application/software, make sure to have memory DIMMs populated in the corresponding NUMA-node mapping as local memory. Mappings can be found in Section 4 Platform Specific NUMA/Die Domain Details Considering also pinning the interrupts to local CPUs to get maximum performance. For instructions on how to tune network cards for better performance on AMD EPYC processors, go to the following links and download provided documentation: o https://support.amd.com/techdocs/56224.pdf o https://developer.amd.com/resources/epyc-resources/epyc-white-papers/ 9 Dell EMC NUMA Configuration for AMD EPYC (Naples) Processors version 1.0

3 BIOS Setup The Memory Interleaving setting controls whether the system is configured for Socket, Die, Channel interleaving. In System Setup (F2 prompt during system boot), enter System BIOS > Memory Settings and navigate to Memory Interleaving to choose the memory interleave for desired configuration. This option is also available in system management consoles such as RACADM. 10 Dell EMC NUMA Configuration for AMD EPYC (Naples) Processors version 1.0

4 Platform Specific NUMA/Die Domain Details The following matrices shows how CPU die, memory and PCIe slots are physically grouped to each NUMA domain for Dell EMC EPYC based platforms, PowerEdge R6415, R7415, and R7425. PowerEdge R6415 CPU Die Memory Slots Slot1 2 A7, A8, A15, A16 2 PCIe Slot / Device NUMA Domain Slot2 0 A3, A4, A11, A12 0 Slot3 2 A7, A8, A15, A16 2 Embedded LOM 2 A7, A8, A15, A16 2 Mini PERC 3 A5, A6, A13, A14 3 PowerEdge R7415 (Sys config without rear side disk) PCIe Slot / Device CPU Die Memory Slots NUMA Domain Slot1 2 A7, A8, A15, A16 2 Slot2 1 A1, A2, A9, A10 1 Slot3 0 A3, A4, A11, A12 0 Slot4 2 A7, A8, A15, A16 2 Slot5 3 A5, A6, A13, A14 3 Embedded LOM 2 A7, A8, A15, A16 2 Mini PERC 3 A5, A6, A13, A14 3 PowerEdge R7415 (Sys config. with rear side disk) PCIe Slot / Device CPU Die Memory Slots NUMA Domain Slot1 2 A7, A8, A15, A16 2 Slot2 0 A3, A4, A11, A12 0 Slot3 2 A7, A8, A15, A16 2 Embedded LOM 2 A7, A8, A15, A16 2 Mini PERC 3 A5, A6, A13, A14 3 11 Dell EMC NUMA Configuration for AMD EPYC (Naples) Processors version 1.0

PowerEdge R7425 PCIe Riser PCIe Slot / Device CPU SKT CPU Die Memory Slots NUMA Domain 1A 1 1 2 A7, A8, A15, A16 2 3 1 3 A5, A6, A13, A14 3 1 1 2 A7, A8, A15, A16 2 1D 2 1 3 A5, A6, A13, A14 3 3 1 3 A5, A6, A13, A14 3 1E 1 1 2 A7, A8, A15, A16 2 2 1 3 A5, A6, A13, A14 3 2A 4 2 2 B7, B8, B15, B16 6 6 2 3 B5, B6, B13, B14 7 2C 4 2 2 B7, B8, B15, B16 6 4 1 1 A1, A2, A9, A10 1 2D 5 2 2 B7, B8, B15, B16 6 6 2 3 B5, B6, B13, B14 7 3A 7 2 3 B5, B6, B13, B14 7 8 2 0 B3, B4, B11, B12 4 3B 7 2 1 B1, B2, B9, B10 5 8 2 0 B3, B4, B11, B12 4 12 Dell EMC NUMA Configuration for AMD EPYC (Naples) Processors version 1.0

5 Technical support and resources 5.1 Dell Dell.com/support is focused on meeting customer needs with proven services and support. Dell TechCenter is an online technical community where IT professionals have access to numerous resources for Dell EMC software, hardware and services. Storage Solutions Technical Documents on Dell TechCenter provide expertise that helps to ensure customer success on Dell EMC Storage platforms. 5.2 AMD https://community.amd.com/community/server-gurus EPYC Server Community Forum https://developer.amd.com/resources/epyc-resources/epyc-tuning-guides/ Linux Network Tuning Guide for AMD EPYC Processor Based Servers 13 Dell EMC NUMA Configuration for AMD EPYC (Naples) Processors version 1.0