Copyright 2017 Intel Corporation

Similar documents
Dr Christopher Dahnken. SSG DRD EMEA Datacenter

Hubert Nueckel Principal Engineer, Intel. Doug Nelson Technical Lead, Intel. September 2017

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme

Intel Xeon Scalable Processor for HPC 나승구이사

Ultimate Workstation Performance

Density Optimized System Enabling Next-Gen Performance

FAST FORWARD TO YOUR <NEXT> CREATION

DataON and Intel Select Hyper-Converged Infrastructure (HCI) Maximizes IOPS Performance for Windows Server Software-Defined Storage

Philippe Thierry Sr Staff Engineer Intel Corp.

Flexible General-Purpose Server Board in a Standard Form Factor

FUJITSU Server PRIMERGY CX400 M4 Workload-specific power in a modular form factor. 0 Copyright 2018 FUJITSU LIMITED

EPYC VIDEO CUG 2018 MAY 2018

unleashed the future Intel Xeon Scalable Processors for High Performance Computing Alexey Belogortsev Field Application Engineer

OCP-T Spec Open Pod Update

Mike Greenfield, Intel MultiCore 7 Workshop September 27 and 28, 2017 National Center for Atmospheric Research in Boulder, Colorado

NVMe Takes It All, SCSI Has To Fall. Brave New Storage World. Lugano April Alexander Ruebensaal

Cisco HyperFlex HX220c M4 Node

ECE 571 Advanced Microprocessor-Based Design Lecture 18

A Universal Terabit Network Dataplane

Cisco HyperFlex HX220c M4 and HX220c M4 All Flash Nodes

Cisco HyperFlex HX220c M4 and HX220c M4 All Flash Nodes

Accelerating HPC. (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing

Intel Server S2600BP Product Family Configuration Guide

Full Featured with Maximum Flexibility for Expansion

THE STORAGE PERFORMANCE DEVELOPMENT KIT AND NVME-OF

Technical Overview of the Lenovo X6 Servers

Intel Select Solution for NFVI with Advantech Servers & Appliances

Intel HPC Technologies Outlook

Intel Select Solutions for Professional Visualization with Advantech Servers & Appliances

QCT Rackgo X Yosemite V2 2018/10/11

Service Edge Virtualization - Hardware Considerations for Optimum Performance

LS-DYNA Performance on Intel Scalable Solutions

Intel Server Board S2600WF Product Family

HostEngine 5URP24 Computer User Guide

Samuli Toivola Lead HW Architect Nokia

Andreas Schneider. Markus Leberecht. Senior Cloud Solution Architect, Intel Deutschland. Distribution Sales Manager, Intel Deutschland

Bringing High Density Embedded Computing To Life

SERVER. Samuli Toivola Lead HW Architect Nokia

Intel Xeon Scalable Processor: The Foundation of Data Centre Innovation. Intel Software Developer Conference London, 2017

Intel Server Board S2600WF Product Family

Essentials. Expected Discontinuance Q2'15 Limited 3-year Warranty Yes Extended Warranty Available

Broadberry. Artificial Intelligence Server for Fraud. Date: Q Application: Artificial Intelligence

Intel Architecture for HPC

Intel Server Boards. The Quality You Want. The Confidence INTEL SERVER BOARDS. Intel Product Quick Reference Matrix Q4 2018

Appendix A Technology and Terms

RAISE YOUR PERFORMANCE TO NEW HEIGHTS

Introduction: Modern computer architecture. The stored program computer and its inherent bottlenecks Multi- and manycore chips and nodes

Energy Efficient Servers

Out-of-band (OOB) Management of Storage Software through Baseboard Management Controller Piotr Wysocki, Kapil Karkra Intel

Intel Architecture for Software Developers

Server-Grade Performance Computer-on-Modules:

High-Density NVMe Systems

Innovating and Integrating for Communications and Storage

Give Your PC. Intel Core Desktop Processors. Unleash your creativity with 8th Generation Intel Core desktop processors and the Intel H370 chipset.

Essential Performance and Advanced Security

Cisco HyperFlex HX220c Edge M5

Open edge server. Revision 0.1. Author: Samuli Toivola, Nokia 28/09/ / 25

New Levels of Performance and Interconnection for Datacenter Applications

Intel Data Center Blocks (Intel DCB)

Fast forward. To your <next>

VRM-Express-HL. Overview:

Results from TSUBAME3.0 A 47 AI- PFLOPS System for HPC & AI Convergence

Cisco Ultra Packet Core High Performance AND Features. Aeneas Dodd-Noble, Principal Engineer Daniel Walton, Director of Engineering October 18, 2018

SU Dual and Quad-Core Xeon UP Server

CSU 0201 Compute Sled Unit

Intel s Architecture for NFV

Advanced Parallel Programming I

Huawei FusionServer. V5 Rack Server HUAWEI TECHNOLOGIES CO., LTD.

Atos ARM solutions for HPC

Innovation Makes Computing Simple

Pactron FPGA Accelerated Computing Solutions

Intel Data Center Blocks (Intel DCB)

Introducing Sandy Bridge

Maximizing heterogeneous system performance with ARM interconnect and CCIX

n N c CIni.o ewsrg.au

CSU 0111 Compute Sled Unit

HPE ProLiant ML350 Gen P 16GB-R E208i-a 8SFF 1x800W RPS Solution Server (P04674-S01)

The Why and How of Developing All-Flash Storage Server

Unlock Next Level Performance

The Intel Z390 Chipset and 9 and 8 Generation Intel Core Desktop Processors

Sugon TC6600 blade server

Colin Cunningham, Intel Kumaran Siva, Intel Sandeep Mahajan, Oracle 03-Oct :45 p.m. - 5:30 p.m. Moscone West - Room 3020

Innovate. Integrate. Innovate. Integrate.

HPE ProLiant ML350 Gen10 Server

Erkenntnisse aus aktuellen Performance- Messungen mit LS-DYNA

INTELLIGENCE AT THE EDGE -HIGH PERFORMANCE EMBEDDED COMPUTING TRENDS (HPEC)

LS-DYNA Performance Benchmark and Profiling. October 2017

Future of datacenter STORAGE. Carol Wilder, Niels Reimers,

PRIMERGY BX2580 M2. System configurator and order-information guide. PRIMERGY Server. Contents. Instructions Configuration diagram Configurator

33% 148% 2. at 4 years. Silo d applications & data pockets. Slow Deployment of new services. Security exploits growing. Network bottlenecks

Dr. Jean-Laurent PHILIPPE, PhD EMEA HPC Technical Sales Specialist. With Dell Amsterdam, October 27, 2016

Accelerating Real-Time Big Data. Breaking the limitations of captive NVMe storage

Intel Select Solution for ucpe

Intel data center blocks (Intel dcb)

Benchmarking Software Data Planes Intel Xeon Skylake vs. Broadwell 1. Maciek Konstantynowicz

SGI UV for SAP HANA. Scale-up, Single-node Architecture Enables Real-time Operations at Extreme Scale and Lower TCO

Achieve Optimal Network Throughput on the Cisco UCS S3260 Storage Server

All-Flash Storage System

Xu Wang Hardware Engineer Facebook, Inc.

WiRack19 - Computing Server. Wiwynn SV324G2. Highlights. Specification.

Transcription:

Agenda Intel Xeon Scalable Platform Overview Architectural Enhancements 2

Platform Overview 3x16 PCIe* Gen3 2 or 3 Intel UPI 3x16 PCIe Gen3 Capabilities Details 10GbE Skylake-SP CPU OPA DMI Intel C620 Series Intel ME QAT IE 4x10GbE NIC TPM High Speed IO GPIO 2666 1x 100Gb OPA Fabric SPI USB3 PCIe3 SATA3 espi/lpc Firmware Skylake-SP CPU OPA BMC 1x 100Gb OPA Fabric CPU VRs OPA VRs Mem VRs BMC: Baseboard Management Controller PCH: Intel Platform Controller Hub IE: Innovation Engine Firmware Intel OPA: Intel Omni-Path Architecture Intel QAT: Intel QuickAssist Technology ME: Manageability Engine NIC: Network Interface Controller VMD: Volume Management Device NTB: Non-Transparent Bridge Socket Scalability CPU TDP Chipset Networking Compression and Crypto Acceleration Storage Security Manageability Socket P 2S, 4S, 8S, and >8S (with node controller support) 70W 205W Intel C620 Series (code name Lewisburg PCH) Intel Omni-Path Fabric (integrated w/ CPU + discrete) 4x10GbE (integrated w/ chipset) 100G/40G/25G discrete options Intel QuickAssist Technology option in chipset to support 100Gb/s comp/decomp/crypto 100K RSA2K public key CPU integrated QuickData Technology, VMD, and NTB Intel Optane SSD, Intel 3D-NAND NVMe* & SATA SSD CPU Instruction Set enhancements (MBE, PPK, MPX) Manageability Engine with multiple secure boot options Intel Platform Trust Technology Intel Key Protection Technology Intel Node Manager Intel Datacenter Manager Innovation Engine (IE) 3

Platform Topologies 2S Configurations General Purpose/Storage/ HPC/Comms SP 4S Configurations Enterprise and Cloud 8S Configuration Enterprise Intel UPI DMI * * 3x16 PCIe* 1x100G Intel OP Fabric x4 3x16 PCIe 1x100G Intel OP Fabric (2S-2UPI & 2S-3UPI shown) DMI 3x16 PCIe (4S-2UPI & 4S-3UPI shown) Intel Xeon Scalable Platform supports configurations ranging from 2S-2UPI to 8S DMI 3x16 PCIe 4

5

Overview Skylake core microarchitecture, with data center specific enhancements Intel AVX-512 with 32 DP Flops per core Data center optimized cache hierarchy 1MB L2 per core, non-inclusive L3 New mesh interconnect architecture Enhanced memory subsystem Modular IO subsystem with integrated devices New Intel Ultra Path Interconnect (Intel UPI Intel Speed Shift Technology Security & Virtualization enhancements Optional Integrated Intel Omni-Path Fabric (Intel OPA) Features Intel Xeon Processor E5-2600 v4 Intel Xeon Scalable Processor (Skylake-SP) s Per Socket Up to 22 Up to 28 6 Channels 2 or 3 UPI Threads Per Socket Up to 44 threads Up to 56 threads UPI Last-level Cache (LLC) Up to 55 MB Up to 38.5 MB (non-inclusive) QPI/UPI Speed (GT/s) 2x QPI channels @ 9.6 GT/s Up to 3x UPI @ 10.4 GT/s UPI PCIe* Lanes/ Controllers/Speed(GT/s) Memory Population 40 / 10 / PCIe 3.0 (2.5, 5, 8 GT/s) 48 / 12 / PCIe 3.0 (2.5, 5, 8 GT/s) 4 channels of up to 3 RDIMMs, LRDIMMs, or 3DS LRDIMMs 6 channels of up to 2 RDIMMs, LRDIMMs, or 3DS LRDIMMs Max Memory Speed Up to 2400 Up to 2666 48 Lanes PCIe 3.0 Shared L3 Omni Path HFI UPI Omni Path DMI3 TDP (W) 55W-145W 70W-205W 6

Key Instruction Set Architecture Enhancements Compute Virtualization Security Intel AVX-512 2x compute density per core for vector operations Cache Management Instructions CLFLUSHOPT Lower latency cache line flush CLWB Cache line writeback to memory without invalidation Improved Time Stamp Counter Virtualization Reduces overhead on VMs migrating across processors running at different base frequency Page Protection Keys (PPK) Extends paging architecture to provide a page-granular, thread-private user-level memory protection Mode Based Execution (MBE) Protects against malicious kernel updates in a virtualized system 7

Intel Advanced Vector Extensions 512 (Intel AVX-512) 512-bit wide vectors 32 operand registers 8 64b mask registers Embedded broadcast Embedded rounding Microarchitecture Instruction Set SP FLOPs / cycle DP FLOPs / cycle Skylake Intel AVX-512 & FMA 64 32 Haswell / Broadwell Intel AVX2 & FMA 32 16 Sandybridge Intel AVX (256b) 16 8 Nehalem SSE (128b) 8 4 Intel AVX-512 Instruction Types Intel AVX-512-F AVX-512-VL AVX-512-BW AVX-512-DQ AVX-512-CD AVX-512 Foundation Instructions Vector Length Orthogonality: ability to operate on sub-512 vector sizes 512-bit Byte/Word support Additional D/Q/SP/DP instructions (converts, transcendental support, etc.) Conflict Detect: used in vectorizing loops with potential address conflicts Intel AVX-512 doubles the number of FLOPs per cycle 8

Non-Inclusive v. Inclusive L3 Inclusive L3 (prior architectures) L2 256kB 2 3 Non-Inclusive L3 (Skylake architecture) L2 1MB 2 3 1. Memory reads fill directly to the L2, no longer to both the L2 and L3 2. When a L2 line needs to be removed, both modified and unmodified lines are written back 3. Data shared across cores are copied into the L3 for servicing future L2 misses 2.5 MB L3 1.375 MB L3 Cache hierarchy architected and optimized for data center use cases: Virtualized use-cases get larger private L2 cache free from interference 1 Memory 1 Memory Multithreaded workloads can operate on larger data per thread (due to increased L2 size) and reduce uncore activity 9