Intel Xeon Scalable Processor for HPC 나승구이사

Size: px
Start display at page:

Download "Intel Xeon Scalable Processor for HPC 나승구이사"

Transcription

1 Intel Xeon Scalable Processor for HPC 나승구이사

2 Growing Challenges in HPC System Bottlenecks The Walls Divergent Workloads Machine learning hpc Big Data visualization Barriers to Extending Usage Optimizing For Cloud Memory I/O Storage Energy Efficient Performance Space Resiliency Unoptimized Software Resources Split Among Modeling and Simulation Big Data Analytics Machine Learning Visualization Democratization at Every Scale Cloud Access Exploration of New Parallel Programming Models 2

3 Intel Scalable System Framework Modeling & Simulation HPC Data Analytics Machine Learning Visualization Many Workloads one Framework A Flexible Framework for Today & Tomorrow Enabling Breakthrough System Performance 3

4 Unified Intel Xeon Scalable Platform Intel Xeon Processor E7 Targeted at mission-critical applications that value a scale-up system with leadership memory 18 cores capacity and advanced RAS E7 v3 Brickland Platform E7 v4 Intel Xeon Scalable Platform (codename Purley) Skylake CPUs Intel Xeon PLATINUM Intel Xeon Processor E5 Targeted at a wide variety of applications that value a balanced system with leadership performance/watt/$ Grantley-EP Platform E5 v3 E v4 (4S) E5 v3 E v4 Intel Xeon GOLD Intel Xeon SILVER Intel Xeon BRONZE 4

5 Overview Skylake core microarchitecture, with data center specific enhancements Intel AVX-512 with 32 DP Flops per core Data center optimized cache hierarchy 1MB L2 per core, non-inclusive L3 New mesh interconnect architecture Enhanced memory subsystem Modular IO subsystem with integrated devices New Intel Ultra Path Interconnect (Intel UPI Intel Speed Shift Technology Security & Virtualization enhancements Optional Integrated Intel Omni-Path Fabric (Intel OPA) Features Intel Xeon Processor E v4 Intel Xeon Scalable Processor (Skylake-SP) s Per Socket Up to 22 Up to 28 6 Channels DDR4 DDR4 2 or 3 UPI Threads Per Socket Up to 44 threads Up to 56 threads DDR4 UPI Last-level (LLC) Up to 55 MB Up to 38.5 MB (non-inclusive) QPI/UPI Speed (GT/s) 2x QPI 9.6 GT/s Up to 3x 10.4 GT/s DDR4 DDR4 UPI PCIe* Lanes/ Controllers/Speed(GT/s) Memory Population 40 / 10 / PCIe 3.0 (2.5, 5, 8 GT/s) 48 / 12 / PCIe 3.0 (2.5, 5, 8 GT/s) 4 channels of up to 3 RDIMMs, LRDIMMs, or 3DS LRDIMMs 6 channels of up to 2 RDIMMs, LRDIMMs, or 3DS LRDIMMs Max Memory Speed Up to 2400 Up to 2666 DDR4 DDR4 48 Lanes PCIe 3.0 Shared L3 Omni Path HFI UPI Omni Path DMI3 TDP (W) 55W-145W 70W-205W 5

6 VEC INT Microarchitecture Enhancements Front End Load Buffer Store Buffer Port 0 Port 1 ALU Shift JMP 2 FMA ALU Shift DIV ALU LEA MUL FMA ALU Shift Reorder Buffer Port 5 ALU LEA FMA ALU Shuffle Data Center 32KB L1 I$ Pre decode Inst Q Branch Prediction Unit Port 6 ALU Shift JMP 1 Load Data 2 Load Data 3 Scheduler Port 4 Store Data 1MB L2$ Decoders μop Allocate/Rename/Retire Broadwell uarch Skylake uarch Out-of-order Window In-flight Loads + Stores Scheduler Entries Registers Integer + FP Allocation Queue 56 64/thread L1D BW (B/Cyc) Load + Store L2 Unified TLB Larger and improved branch predictor, higher throughput decoder, larger window Improved scheduler and execution engine, improved throughput and latency of divide/sqrt More load/store bandwidth, deeper load/store buffers, improved prefetcher Data center specific enhancements Intel AVX-512 with 2 FMAs per core, larger 1MB L2 per core Port 2 Load/STA Memory Control Fill Buffers 5 6 Port 3 Load/STA Fill Buffers Port 7 STA 32KB L1 D$ μop Queue In order OOO Memory Data center-specific enhancements to the core K+2M: K+2M: G: 16 6

7 Intel Advanced Vector Extensions 512 (Intel AVX-512) 512-bit wide vectors 32 operand registers 8 64b mask registers Embedded broadcast Embedded rounding Microarchitecture Instruction Set SP FLOPs / cycle DP FLOPs / cycle Skylake Intel AVX-512 & FMA Haswell / Broadwell Intel AVX2 & FMA Sandybridge Intel AVX (256b) 16 8 Nehalem SSE (128b) 8 4 Intel AVX-512 Instruction Types Intel AVX-512-F AVX-512-VL AVX-512-BW AVX-512-DQ AVX-512-CD AVX-512 Foundation Instructions Vector Length Orthogonality: ability to operate on sub-512 vector sizes 512-bit Byte/Word support Additional D/Q/SP/DP instructions (converts, transcendental support, etc.) Conflict Detect: used in vectorizing loops with potential address conflicts Intel AVX-512 doubles the number of FLOPs per cycle 7

8 New Mesh Interconnect Architecture Xeon E7 v4 24-core die Skylake-SP 28-core die QPI QPI Link Link R3QPI QPI Agent PCI-E PCI-E PCI-E PCI-E X4 X16 X16 X8 (ESI) UBox PCU R2PCI CB DMA IIO IOAPIC 2x UPI x 20 PCIe* * x16 PCIe x16 DMI x 4 CBDMA On Pkg PCIe x16 1x UPI x20 PCIe x16 /QPII C LLC LLC 2.5MB 2.5MB /QPII C /QPII C LLC LLC 2.5MB 2.5MB /QPII C SKX SKX SKX SKX SKX SKX /QPII /QPII /QPII /QPII C C C C LLC LLC 2.5MB 2.5MB LLC LLC 2.5MB 2.5MB LLC LLC 2.5MB 2.5MB LLC LLC 2.5MB 2.5MB /QPII /QPII /QPII /QPII C C C C /QPII /QPII /QPII /QPII C C C C LLC LLC 2.5MB 2.5MB LLC LLC 2.5MB 2.5MB LLC LLC 2.5MB 2.5MB LLC LLC 2.5MB 2.5MB /QPII /QPII /QPII /QPII C C C C DDR4 MC DDR4 DDR4 SKX SKX SKX SKX SKX SKX SKX SKX SKX MC DDR4 DDR4 DDR4 SKX /QPII C LLC LLC 2.5MB 2.5MB /QPII C /QPII C LLC LLC 2.5MB 2.5MB /QPII C SKX SKX SKX SKX SKX SKX SKX SKX SKX SKX SKX SKX DDR Home Agent Mem Ctlr DDR DDR Home Agent Mem Ctlr DDR CHA Caching and Home Agent ; SF Snoop Filter; LLC Last Level ; SKX Skylake-SP ; UPI Intel UltraPath Interconnect Dual-ring in Broadwell Server (Intel Xeon Processor E5/E7) v. Mesh in Skylake-SP 9

9 Re-architected L2 & L3 Hierarchy Previous Architectures Shared L3 2.5MB/core (inclusive) Skylake-SP Architecture Shared L MB/core (non-inclusive) L2 (256KB private) L2 (256KB private) L2 (256KB private) L2 (1MB private) L2 (1MB private) L2 (1MB private) On-chip cache balance shifted from shared-distributed (prior architectures) to private-local (Skylake architecture): Shared-distributed shared-distributed L3 is primary cache Private-local private L2 becomes primary cache with shared L3 used as overflow cache Shared L3 changed from inclusive to non-inclusive: Inclusive (prior architectures) L3 has copies of all lines in L2 Non-inclusive (Skylake architecture) lines in L2 may not exist in L3 10

10 RELATIVE IO BANDWIDTH IO Performance IO bandwidth change over Intel Xeon processor E5-2699v4 >50% aggregate IO bandwidth improvement in line with memory bandwidth increase for a balanced system performance M E M O R Y B W ( R E A D ) A G G. L O C A L P C I E R D W R ( L A R G E P K T ) D M A M E M - M E M D M A M E M - IO D M I Source as of June 2017: Intel internal measurements on platform with Xeon Platinum 8180, Turbo enabled, UPI=10.4, SNC1, 6x32GB DDR4-2400/2666 per CPU, 1 DPC, and platform with E v4, Turbo enabled, 4x32GB DDR4-2400, RHEL 7.0. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more complete information visit 11

11 CONNECTOR CONNECTOR Skylake-SP with Integrated Fabric Single on-package Storm Lake Host Fabric Interface (HFI) port Fabric component interfaces to SKL using 16 PCIe* lanes Fabric PCIe* lanes are incremental to the existing 48 PCIe* lanes available on SKL-F Single cable from SKL-F package connector to QSFP module at rear IO panel Same socket for SKL-SP and SKL-F processors Intel Xeon Scalable Processor-based platforms can be designed to support both processors Skylake-F Internal Faceplate-to- Processor (IPF) Cable Platform designs will require an expanded keep-out zone and additional board components to accommodate both processors F QSFP module QSPF connector Internal Faceplate Transition (IFT) Connector and Cage 12

12 Efficiency (Rmax/Rpeak) June 2017 Top500 Analysis Intel OPA 22% more total entries than EDR IB* 31 Intel OPA EDR IB* Intel OPA 30% more top 100 entries than EDR IB* Top 10 Top 15 Top 50 Top 100 Intel OPA EDR IB* 1 Intel OPA 7.6% higher average efficiency at scale than EDR IB* 81.9% 74.3% Share of 100Gb Flops 67.1PF OPA vs 38.7PF EDR EDR IB* 67.1 Intel OPA 0 Average efficiency on Intel Xeon processors *Other names and brands may be claimed as the property of others. June 2017 list 13

13 Why OPA has been winning in HPC 2 HPC Performance Major Buying Criteria In most cases, Intel OPA has equal to or better HPC performance (latency, bandwidth, and applications) Price / Performance Next Evaluation/Buying Point 1 Leadership price/performance at any scale from entry to large supercomputers Build a better cluster more FLOPS and storage OPA Features Key Differentiators Builds on key technologies from QLogic and Cray needed to support ever-increasing HPC cluster sizes & workloads Features: CPU/Fabric Integration, HPC Optimization and Enhanced Fabric Functionality 1 Configuration assumes a 750-node cluster, and number of switch chips required is based on a full bisectional bandwidth (FBB) Fat-Tree configuration. Intel OPA uses one fully-populated 768-port director switch, and Mellanox EDR solution uses a combination of 648-port director switches and 36-port edge switches. Mellanox component pricing from with prices as of April 4, Compute node pricing based on Dell PowerEdge R730 server from with prices as of April 4, Intel OPA pricing based on pricing from as of April 4, 2017 * Other names and brands may be claimed as property of others. 2 See Performance backup slide for configuration information, 14

14 15 Intel OPA s Impact across Many Segments 100 s of Accounts Deployed Supercomputers Artificial Intelligence Traditional HPC HPC Cloud Enterprise R&D LRZ

15 Relative Performance Intel(R) OPA vs EDR IB* HIGHER is Better Application Performance - Intel Xeon Platinum 8170 processors Intel Omni-Path Architecture (Intel OPA) vs EDR InfiniBand* 15% 10% 5% 0% -5% -10% -15% Chemistr y 0% 2% 8% Best performance using either Intel MPI or Open MPI for both fabrics 1 Physics/Bio -1% 0%0%2% 4% 5% 8% 9%10% Material Sciences 13% Computational Molecular Dynamics -2% -1%-1% 0% 1%1% 3% 3% 8%8% 8% Computational Geodynamics 0% -7% -6% Computational Fluid Dynamics 0%0%1% 2% -2%-2% 7% 5% Finite Element Analysis 11% -6% Electromagnetics Climate & Weather13% Graphics -3% 5% 6% 9% 4% 7% 4% 1. Not every application was tested with both Intel MPI and Open MPI. See configuration slide for necessary details. 2. VASP benchmark comparison at 8 nodes only. Some workloads use different rank counts. See configuration slide for detail. 16 nodes / 52 ranks per node 2 *see the configuration slides for complete configuration information for select workloads 16

16

17 1.63x Average Gains on High Performance Compute Apps Higher is better Earth Systems Models Manufacturing Life Sciences FSI Broadwell (E v4) WRF* HOMME* LSTC LS-DYNA Explicit* INTES PERMAS* V16 MILC* GROMACS* VASP* NAMD* LAMMPS* Amber GB* Binomial Black-Scholes Monte Carlo option pricing European options Intel Xeon Gold 6148 processor Results have been estimated based on internal Intel analysis and are provided for informational purposes only. Geomean of Weather Research Forecasting - Conus 12Km, HOMME, LSTCLS-DYNA Explicit, INTES PERMAS V16, MILC, GROMACS water 1.5M_pme, VASPSi256, NAMDstmv, LAMMPS, Amber GB Nucleosome, Binomial option pricing, Black-Scholes, Monte Carlo European options. Any difference in system hardware or software design or configuration may affect actual performance. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more information go to Configurations: see page 54,55 Benefits of Intel Xeon processor Scalable family Intel AVX-512 and Higher IPC/Per performance Balanced IO and memory Predictable latency with mesh/memory architecture improvements Additional options to scale performance Intel Optane SSD DC P4800X Series for fast storage Intel Omni-Path Fabric to scale cluster performance

18 Latency, Bandwidth, and Message Rate - Intel MPI Benchmarks Intel Xeon Processor E5-2697A v4 & Intel Xeon Platinum 8170 processors Intel Omni-Path Architecture (Intel OPA) Metric Intel Xeon E5-2697A v4 CPU GHz, 16c Intel Xeon Platinum 8170 CPU GHz, 26c Latency (one-way, 1 switch, 8B) [ns] ; PingPong Bandwidth (1 rank per node, 1 port, uni-dir, 1MB) [GB/s] ; Uniband Bandwidth (1 rank per node, 1 port, bi-dir, 1MB) [GB/s] ; Biband Message Rate (max ranks per node, uni-dir, 8B) [Mmps] ; Uniband % Message Rate (max ranks per node, bi-dir, 8B) [Mmps] ; Biband % Last updated: April 2017 Dual socket servers with one Intel OPA Edge switch hop. Intel Turbo Boost Technology enabled, Intel Hyper-Threading Technology enabled. Intel MPI Benchmarks 4.1. Intel OPA: Open MPI hfi as packaged with IFS Benchmark processes pinned to the cores on the socket that is local to the Intel OP Host Fabric Interface (HFI) before using the remote socket. 1. Intel Xeon E5-2697A v4 processor, 2.60 GHz 16 cores,2133 MHz DDR4 memory. 32 ranks per node for message rate tests. RHEL* 7.2., el7.x86_64 kernel. BIOS settings: IOU non-posted prefetch disabled. Snoop timer for posted prefetch=9. Early snoop disabled. Cluster on Die disabled. 2. Intel Xeon Platinum 8170 processor, 2.10 GHz 26 cores, 64 GB 2666 MHz DDR4 memory per node. 52 ranks per node for message rate tests RHEL* 7.3, el7.x86_64 kernel. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more complete information visit Copyright 2017, Intel Corporation.*Other names and brands may be claimed as the property of others. 21

19 GFlop/sec GFlop/sec Intel Xeon Platinum Processor 8180 vs. Intel Xeon Processor E5-2699v SGEMM Performance SGEMM SKX SGEMM BDX DGEMM Performance DGEMM SKX DGEMM BDX M=N=K (Matrix Sizes) M=N=K (Matrix Sizes) DGEMM SGEMM Average Up to 2.26 Up to Kx20Kx20K Up to 2.25 Up to 2.32 Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more complete information visit: Source: Intel measured as of June Configuration details on slide

20 Normalized Performance Increased SPEC MPI 2007 performance with the 2S Intel Xeon Gold Performance metric is a geomean over 13 apps. Up to 24% faster 2S Intel Xeon processor E v3 2S Intel Xeon processor E v4 Intel Xeon Gold 6148 processor Workload: medium. Up Up to to YY% 2.4X faster faster Up to 71% faster SPEC MPI 2007 Application / Workload Description: SPEC MPI* 2007 benchmark suite is for evaluating MPI-parallel, floating point, compute intensive performance across a wide range of cluster and SMP hardware Workload can be run on either single node and cluster (this is a single node comparison) Key Takeaway: SPEC gives users the most objective and representative benchmark suite for measuring and comparing high-performance computer systems. Performance Factors: Intel AVX-512 contributes up to 18% performance boost per component. Memory bandwidth improvements speedup application performance up to 71% in geomean. Memory bandwidth contributes to performance increase. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more complete information visit *Other names and brands may be claimed as the property of others. 1.Testing conducted on ISV* software comparing Intel Xeon Gold 6148 processor to 2S Intel Xeon Processor E v3 and to 2S Intel Xeon Processor E v4. Testing done by Intel. For complete testing configuration details, see slide

21 The Urgency of Code Modernization Continued scientific progress increasingly depends on advances in simulation and data analysis, which will only be possible through code modernization Sudip Dosanjh, Director at NERSC Source: 24

22 2017 Intel Corporation. All rights reserved. Intel and the Intel logo are trademarks of Intel Corporation or its subsidiaries in the U.S. and/or other countries. *Other names and brands may be claimed as the property of others. For more complete information about compiler optimizations, see our Optimization Notice.

23 Notices and Disclaimers Intel technologies features and benefits depend on system configuration and may require enabled hardware, software or service activation. Learn more at intel.com, or from the OEM or retailer. No computer system can be absolutely secure. Tests document performance of components on a particular test, in specific systems. Differences in hardware, software, or configuration will affect actual performance. Consult other sources of information to evaluate performance as you consider your purchase. For more complete information about performance and benchmark results, visit Cost reduction scenarios described are intended as examples of how a given Intel-based product, in the specified circumstances and configurations, may affect future costs and provide cost savings. Circumstances will vary. Intel does not guarantee any costs or cost reduction. This document contains information on products, services and/or processes in development. All information provided here is subject to change without notice. Contact your Intel representative to obtain the latest forecast, schedule, specifications and roadmaps. No license (express or implied, by estoppel or otherwise) to any intellectual property rights is granted by this document. Statements in this document that refer to Intel s plans and expectations for the quarter, the year, and the future, are forward-looking statements that involve a number of risks and uncertainties. A detailed discussion of the factors that could affect Intel s results and plans is included in Intel s SEC filings, including the annual report on Form 10-K. Intel does not control or audit third-party benchmark data or the web sites referenced in this document. You should visit the referenced web site and confirm whether referenced data are accurate. Intel, the Intel logo and others are trademarks of Intel Corporation in the U.S. and/or other countries. *Other names and brands may be claimed as the property of others Intel Corporation. 26

24 Optimization Notice Intel's compilers may or may not optimize to the same degree for non-intel microprocessors for optimizations that are not unique to Intel microprocessors. These optimizations include SSE2, SSE3, and SSE3 instruction sets and other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors. Certain optimizations not specific to Intel microarchitecture are reserved for Intel microprocessors. Please refer to the applicable product User and Reference Guides for more information regarding the specific instruction sets covered by this notice. Notice revision #

unleashed the future Intel Xeon Scalable Processors for High Performance Computing Alexey Belogortsev Field Application Engineer

unleashed the future Intel Xeon Scalable Processors for High Performance Computing Alexey Belogortsev Field Application Engineer the future unleashed Alexey Belogortsev Field Application Engineer Intel Xeon Scalable Processors for High Performance Computing Growing Challenges in System Architecture The Walls System Bottlenecks Divergent

More information

Hubert Nueckel Principal Engineer, Intel. Doug Nelson Technical Lead, Intel. September 2017

Hubert Nueckel Principal Engineer, Intel. Doug Nelson Technical Lead, Intel. September 2017 Hubert Nueckel Principal Engineer, Intel Doug Nelson Technical Lead, Intel September 2017 Legal Disclaimer Intel technologies features and benefits depend on system configuration and may require enabled

More information

Dr Christopher Dahnken. SSG DRD EMEA Datacenter

Dr Christopher Dahnken. SSG DRD EMEA Datacenter Dr Christopher Dahnken SSG DRD EMEA Datacenter Legal Disclaimer & Optimization Notice INFORMATION IN THIS DOCUMENT IS PROVIDED AS IS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL

More information

Copyright 2017 Intel Corporation

Copyright 2017 Intel Corporation Agenda Intel Xeon Scalable Platform Overview Architectural Enhancements 2 Platform Overview 3x16 PCIe* Gen3 2 or 3 Intel UPI 3x16 PCIe Gen3 Capabilities Details 10GbE Skylake-SP CPU OPA DMI Intel C620

More information

FAST FORWARD TO YOUR <NEXT> CREATION

FAST FORWARD TO YOUR <NEXT> CREATION FAST FORWARD TO YOUR CREATION THE ULTIMATE PROFESSIONAL WORKSTATIONS POWERED BY INTEL XEON PROCESSORS 7 SEPTEMBER 2017 WHAT S NEW INTRODUCING THE NEW INTEL XEON SCALABLE PROCESSOR BREAKTHROUGH PERFORMANCE

More information

April 2 nd, Bob Burroughs Director, HPC Solution Sales

April 2 nd, Bob Burroughs Director, HPC Solution Sales April 2 nd, 2019 Bob Burroughs Director, HPC Solution Sales Today - Introducing 2 nd Generation Intel Xeon Scalable Processors how Intel Speeds HPC performance Work Time System Peak Efficiency Software

More information

Ultimate Workstation Performance

Ultimate Workstation Performance Product brief & COMPARISON GUIDE Intel Scalable Processors Intel W Processors Ultimate Workstation Performance Intel Scalable Processors and Intel W Processors for Professional Workstations Optimized to

More information

LS-DYNA Performance on Intel Scalable Solutions

LS-DYNA Performance on Intel Scalable Solutions LS-DYNA Performance on Intel Scalable Solutions Nick Meng, Michael Strassmaier, James Erwin, Intel nick.meng@intel.com, michael.j.strassmaier@intel.com, james.erwin@intel.com Jason Wang, LSTC jason@lstc.com

More information

Mike Greenfield, Intel MultiCore 7 Workshop September 27 and 28, 2017 National Center for Atmospheric Research in Boulder, Colorado

Mike Greenfield, Intel MultiCore 7 Workshop September 27 and 28, 2017 National Center for Atmospheric Research in Boulder, Colorado Mike Greenfield, Intel Multi 7 Workshop September 27 and 28, 2017 National Center for Atmospheric Research in ulder, Colorado * Legal Disclaimers Intel technologies may require enabled hardware, specific

More information

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme FUT3056BU VMware vsphere Scales on the Amazing Next-Gen Intel Xeon Architecture VMworld 2017 Content: Not for publication Tom Adelmeyer, Richard A. Brunner, Principal Engineer, Intel Principal Engineer,

More information

Munara Tolubaeva Technical Consulting Engineer. 3D XPoint is a trademark of Intel Corporation in the U.S. and/or other countries.

Munara Tolubaeva Technical Consulting Engineer. 3D XPoint is a trademark of Intel Corporation in the U.S. and/or other countries. Munara Tolubaeva Technical Consulting Engineer 3D XPoint is a trademark of Intel Corporation in the U.S. and/or other countries. notices and disclaimers Intel technologies features and benefits depend

More information

Fast-track Hybrid IT Transformation with Intel Data Center Blocks for Cloud

Fast-track Hybrid IT Transformation with Intel Data Center Blocks for Cloud Fast-track Hybrid IT Transformation with Intel Data Center Blocks for Cloud Kyle Corrigan, Cloud Product Line Manager, Intel Server Products Group Wagner Diaz, Product Marketing Engineer, Intel Data Center

More information

NVMe Over Fabrics: Scaling Up With The Storage Performance Development Kit

NVMe Over Fabrics: Scaling Up With The Storage Performance Development Kit NVMe Over Fabrics: Scaling Up With The Storage Performance Development Kit Ben Walker Data Center Group Intel Corporation 2018 Storage Developer Conference. Intel Corporation. All Rights Reserved. 1 Notices

More information

Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Intel Xeon Processor E7 v2 Family-Based Platforms

Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Intel Xeon Processor E7 v2 Family-Based Platforms Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Family-Based Platforms Executive Summary Complex simulations of structural and systems performance, such as car crash simulations,

More information

Ravindra Babu Ganapathi

Ravindra Babu Ganapathi 14 th ANNUAL WORKSHOP 2018 INTEL OMNI-PATH ARCHITECTURE AND NVIDIA GPU SUPPORT Ravindra Babu Ganapathi Intel Corporation [ April, 2018 ] Intel MPI Open MPI MVAPICH2 IBM Platform MPI SHMEM Intel MPI Open

More information

Intel Architecture 2S Server Tioga Pass Performance and Power Optimization

Intel Architecture 2S Server Tioga Pass Performance and Power Optimization Intel Architecture 2S Server Tioga Pass Performance and Power Optimization Terry Trausch/Platform Architect/Intel Inc. Whitney Zhao/HW Engineer/Facebook Inc. Agenda Tioga Pass Feature Overview Intel Xeon

More information

IFS RAPS14 benchmark on 2 nd generation Intel Xeon Phi processor

IFS RAPS14 benchmark on 2 nd generation Intel Xeon Phi processor IFS RAPS14 benchmark on 2 nd generation Intel Xeon Phi processor D.Sc. Mikko Byckling 17th Workshop on High Performance Computing in Meteorology October 24 th 2016, Reading, UK Legal Disclaimer & Optimization

More information

Accelerating Insights In the Technical Computing Transformation

Accelerating Insights In the Technical Computing Transformation Accelerating Insights In the Technical Computing Transformation Dr. Rajeeb Hazra Vice President, Data Center Group General Manager, Technical Computing Group June 2014 TOP500 Highlights Intel Xeon Phi

More information

EPYC VIDEO CUG 2018 MAY 2018

EPYC VIDEO CUG 2018 MAY 2018 AMD UPDATE CUG 2018 EPYC VIDEO CRAY AND AMD PAST SUCCESS IN HPC AMD IN TOP500 LIST 2002 TO 2011 2011 - AMD IN FASTEST MACHINES IN 11 COUNTRIES ZEN A FRESH APPROACH Designed from the Ground up for Optimal

More information

Intel HPC Portfolio September Emiliano Politano Technical Account Manager

Intel HPC Portfolio September Emiliano Politano Technical Account Manager Intel HPC Portfolio September 2014 Emiliano Politano Technical Account Manager Legal Disclaimers Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors.

More information

Bei Wang, Dmitry Prohorov and Carlos Rosales

Bei Wang, Dmitry Prohorov and Carlos Rosales Bei Wang, Dmitry Prohorov and Carlos Rosales Aspects of Application Performance What are the Aspects of Performance Intel Hardware Features Omni-Path Architecture MCDRAM 3D XPoint Many-core Xeon Phi AVX-512

More information

Accelerating HPC. (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing

Accelerating HPC. (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing Accelerating HPC (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing SAAHPC, Knoxville, July 13, 2010 Legal Disclaimer Intel may make changes to specifications and product

More information

Enabling Performance-per-Watt Gains in High-Performance Cluster Computing

Enabling Performance-per-Watt Gains in High-Performance Cluster Computing WHITE PAPER Appro Xtreme-X Supercomputer with the Intel Xeon Processor E5-2600 Product Family Enabling Performance-per-Watt Gains in High-Performance Cluster Computing Appro Xtreme-X Supercomputer with

More information

Intel SSD Data center evolution

Intel SSD Data center evolution Intel SSD Data center evolution March 2018 1 Intel Technology Innovations Fill the Memory and Storage Gap Performance and Capacity for Every Need Intel 3D NAND Technology Lower cost & higher density Intel

More information

Andreas Schneider. Markus Leberecht. Senior Cloud Solution Architect, Intel Deutschland. Distribution Sales Manager, Intel Deutschland

Andreas Schneider. Markus Leberecht. Senior Cloud Solution Architect, Intel Deutschland. Distribution Sales Manager, Intel Deutschland Markus Leberecht Senior Cloud Solution Architect, Intel Deutschland Andreas Schneider Distribution Sales Manager, Intel Deutschland Legal Disclaimers 2016 Intel Corporation. Intel, the Intel logo, Xeon

More information

Technologies and application performance. Marc Mendez-Bermond HPC Solutions Expert - Dell Technologies September 2017

Technologies and application performance. Marc Mendez-Bermond HPC Solutions Expert - Dell Technologies September 2017 Technologies and application performance Marc Mendez-Bermond HPC Solutions Expert - Dell Technologies September 2017 The landscape is changing We are no longer in the general purpose era the argument of

More information

Ravindra Babu Ganapathi Product Owner/ Technical Lead Omni Path Libraries, Intel Corp. Sayantan Sur Senior Software Engineer, Intel Corp.

Ravindra Babu Ganapathi Product Owner/ Technical Lead Omni Path Libraries, Intel Corp. Sayantan Sur Senior Software Engineer, Intel Corp. Ravindra Babu Ganapathi Product Owner/ Technical Lead Omni Path Libraries, Intel Corp. Sayantan Sur Senior Software Engineer, Intel Corp. Legal All information provided here is subject to change without

More information

LS-DYNA Performance Benchmark and Profiling. October 2017

LS-DYNA Performance Benchmark and Profiling. October 2017 LS-DYNA Performance Benchmark and Profiling October 2017 2 Note The following research was performed under the HPC Advisory Council activities Participating vendors: LSTC, Huawei, Mellanox Compute resource

More information

Intel Architecture for Software Developers

Intel Architecture for Software Developers Intel Architecture for Software Developers 1 Agenda Introduction Processor Architecture Basics Intel Architecture Intel Core and Intel Xeon Intel Atom Intel Xeon Phi Coprocessor Use Cases for Software

More information

Building an Open Memory-Centric Computing Architecture using Intel Optane Frank Ober Efstathios Efstathiou Oracle Open World 2017 October 3, 2017

Building an Open Memory-Centric Computing Architecture using Intel Optane Frank Ober Efstathios Efstathiou Oracle Open World 2017 October 3, 2017 Building an Memory-Centric Computing Architecture using Intel Optane Frank Ober Efstathios Efstathiou Oracle World 2017 October 3, 2017 Agenda The legal stuff Why Memory Centric Computing? Overview of

More information

THE STORAGE PERFORMANCE DEVELOPMENT KIT AND NVME-OF

THE STORAGE PERFORMANCE DEVELOPMENT KIT AND NVME-OF 14th ANNUAL WORKSHOP 2018 THE STORAGE PERFORMANCE DEVELOPMENT KIT AND NVME-OF Paul Luse Intel Corporation Apr 2018 AGENDA Storage Performance Development Kit What is SPDK? The SPDK Community Why are so

More information

Intel Xeon Processor E v3 Family

Intel Xeon Processor E v3 Family Intel Xeon Processor E5-2600 v3 Family October 2014 Document Number: 331309-001US All information provided here is subject to change without notice. Contact your Intel representative to obtain the latest

More information

Hardware and Software Co-Optimization for Best Cloud Experience

Hardware and Software Co-Optimization for Best Cloud Experience Hardware and Software Co-Optimization for Best Cloud Experience Khun Ban (Intel), Troy Wallins (Intel) October 25-29 2015 1 Cloud Computing Growth Connected Devices + Apps + New Services + New Service

More information

Intel Omni-Path Architecture

Intel Omni-Path Architecture Intel Omni-Path Architecture Product Overview Andrew Russell, HPC Solution Architect May 2016 Legal Disclaimer Copyright 2016 Intel Corporation. All rights reserved. Intel Xeon, Intel Xeon Phi, and Intel

More information

Performance and Energy Efficiency of the 14 th Generation Dell PowerEdge Servers

Performance and Energy Efficiency of the 14 th Generation Dell PowerEdge Servers Performance and Energy Efficiency of the 14 th Generation Dell PowerEdge Servers This white paper details the performance improvements of Dell PowerEdge servers with the Intel Xeon Processor Scalable CPU

More information

Dr. Jean-Laurent PHILIPPE, PhD EMEA HPC Technical Sales Specialist. With Dell Amsterdam, October 27, 2016

Dr. Jean-Laurent PHILIPPE, PhD EMEA HPC Technical Sales Specialist. With Dell Amsterdam, October 27, 2016 Dr. Jean-Laurent PHILIPPE, PhD EMEA HPC Technical Sales Specialist With Dell Amsterdam, October 27, 2016 Legal Disclaimers Intel technologies features and benefits depend on system configuration and may

More information

Implementing Storage in Intel Omni-Path Architecture Fabrics

Implementing Storage in Intel Omni-Path Architecture Fabrics white paper Implementing in Intel Omni-Path Architecture Fabrics Rev 2 A rich ecosystem of storage solutions supports Intel Omni- Path Executive Overview The Intel Omni-Path Architecture (Intel OPA) is

More information

Trends in systems and how to get efficient performance

Trends in systems and how to get efficient performance Trends in systems and how to get efficient performance Martin Hilgeman HPC Consultant martin.hilgeman@dell.com The landscape is changing We are no longer in the general purpose era the argument of tuning

More information

SPDK China Summit Ziye Yang. Senior Software Engineer. Network Platforms Group, Intel Corporation

SPDK China Summit Ziye Yang. Senior Software Engineer. Network Platforms Group, Intel Corporation SPDK China Summit 2018 Ziye Yang Senior Software Engineer Network Platforms Group, Intel Corporation Agenda SPDK programming framework Accelerated NVMe-oF via SPDK Conclusion 2 Agenda SPDK programming

More information

The Effect of In-Network Computing-Capable Interconnects on the Scalability of CAE Simulations

The Effect of In-Network Computing-Capable Interconnects on the Scalability of CAE Simulations The Effect of In-Network Computing-Capable Interconnects on the Scalability of CAE Simulations Ophir Maor HPC Advisory Council ophir@hpcadvisorycouncil.com The HPC-AI Advisory Council World-wide HPC non-profit

More information

SOLUTIONS BRIEF: Transformation of Modern Healthcare

SOLUTIONS BRIEF: Transformation of Modern Healthcare SOLUTIONS BRIEF: Transformation of Modern Healthcare Healthcare & The Intel Xeon Scalable Processor Intel is committed to bringing the best of our manufacturing, design and partner networks to enable our

More information

Intel. Rack Scale Design: A Deeper Perspective on Software Manageability for the Open Compute Project Community. Mohan J. Kumar Intel Fellow

Intel. Rack Scale Design: A Deeper Perspective on Software Manageability for the Open Compute Project Community. Mohan J. Kumar Intel Fellow Intel Rack Scale Design: A Deeper Perspective on Software Manageability for the Open Compute Project Community Mohan J. Kumar Intel Fellow Agenda Rack Scale Design (RSD) Overview Manageability for RSD

More information

Introducing Sandy Bridge

Introducing Sandy Bridge Introducing Sandy Bridge Bob Valentine Senior Principal Engineer 1 Sandy Bridge - Intel Next Generation Microarchitecture Sandy Bridge: Overview Integrates CPU, Graphics, MC, PCI Express* On Single Chip

More information

Intel HPC Technologies Outlook

Intel HPC Technologies Outlook Intel HPC Technologies Outlook Andrey Semin Principal Engineer, HPC Technology Manager, EMEA October 19 th, 2015 ZKI Tagung AK Supercomputing Munich, Germany Legal Disclaimers INFORMATION IN THIS DOCUMENT

More information

Intel Many Integrated Core (MIC) Architecture

Intel Many Integrated Core (MIC) Architecture Intel Many Integrated Core (MIC) Architecture Karl Solchenbach Director European Exascale Labs BMW2011, November 3, 2011 1 Notice and Disclaimers Notice: This document contains information on products

More information

Density Optimized System Enabling Next-Gen Performance

Density Optimized System Enabling Next-Gen Performance Product brief High Performance Computing (HPC) and Hyper-Converged Infrastructure (HCI) Intel Server Board S2600BP Product Family Featuring the Intel Xeon Processor Scalable Family Density Optimized System

More information

Accelerating NVMe-oF* for VMs with the Storage Performance Development Kit

Accelerating NVMe-oF* for VMs with the Storage Performance Development Kit Accelerating NVMe-oF* for VMs with the Storage Performance Development Kit Jim Harris Principal Software Engineer Intel Data Center Group Santa Clara, CA August 2017 1 Notices and Disclaimers Intel technologies

More information

Intel Architecture for HPC

Intel Architecture for HPC Intel Architecture for HPC Georg Zitzlsberger georg.zitzlsberger@vsb.cz 1st of March 2018 Agenda Salomon Architectures Intel R Xeon R processors v3 (Haswell) Intel R Xeon Phi TM coprocessor (KNC) Ohter

More information

Philippe Thierry Sr Staff Engineer Intel Corp.

Philippe Thierry Sr Staff Engineer Intel Corp. HPC@Intel Philippe Thierry Sr Staff Engineer Intel Corp. IBM, April 8, 2009 1 Agenda CPU update: roadmap, micro-μ and performance Solid State Disk Impact What s next Q & A Tick Tock Model Perenity market

More information

Innovation Accelerating Mission Critical Infrastructure

Innovation Accelerating Mission Critical Infrastructure Innovation Accelerating Mission Critical Infrastructure Legal Disclaimers INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE,

More information

Performance Analysis of LS-DYNA in Huawei HPC Environment

Performance Analysis of LS-DYNA in Huawei HPC Environment Performance Analysis of LS-DYNA in Huawei HPC Environment Pak Lui, Zhanxian Chen, Xiangxu Fu, Yaoguo Hu, Jingsong Huang Huawei Technologies Abstract LS-DYNA is a general-purpose finite element analysis

More information

Intel Cluster Toolkit Compiler Edition 3.2 for Linux* or Windows HPC Server 2008*

Intel Cluster Toolkit Compiler Edition 3.2 for Linux* or Windows HPC Server 2008* Intel Cluster Toolkit Compiler Edition. for Linux* or Windows HPC Server 8* Product Overview High-performance scaling to thousands of processors. Performance leadership Intel software development products

More information

INTEL HPC DEVELOPER CONFERENCE FUEL YOUR INSIGHT

INTEL HPC DEVELOPER CONFERENCE FUEL YOUR INSIGHT INTEL HPC DEVELOPER CONFERENCE FUEL YOUR INSIGHT INTEL HPC DEVELOPER CONFERENCE FUEL YOUR INSIGHT UPDATE ON OPENSWR: A SCALABLE HIGH- PERFORMANCE SOFTWARE RASTERIZER FOR SCIVIS Jefferson Amstutz Intel

More information

LS-DYNA Performance Benchmark and Profiling. October 2017

LS-DYNA Performance Benchmark and Profiling. October 2017 LS-DYNA Performance Benchmark and Profiling October 2017 2 Note The following research was performed under the HPC Advisory Council activities Participating vendors: LSTC, Huawei, Mellanox Compute resource

More information

ENVISION TECHNOLOGY CONFERENCE. Functional intel (ia) BLA PARTHAS, INTEL PLATFORM ARCHITECT

ENVISION TECHNOLOGY CONFERENCE. Functional intel (ia) BLA PARTHAS, INTEL PLATFORM ARCHITECT ENVISION TECHNOLOGY CONFERENCE Functional Safety @ intel (ia) BLA PARTHAS, INTEL PLATFORM ARCHITECT Legal Notices & Disclaimers This document contains information on products, services and/or processes

More information

Essential Performance and Advanced Security

Essential Performance and Advanced Security Product brief Intel Xeon E-2100 Processor Essential Performance and Advanced Security for Entry Server, Secure Cloud, and Entry Workstation Solutions Performance and Security, Intelligently Designed for

More information

High Performance Computing The Essential Tool for a Knowledge Economy

High Performance Computing The Essential Tool for a Knowledge Economy High Performance Computing The Essential Tool for a Knowledge Economy Rajeeb Hazra Vice President & General Manager Technical Computing Group Datacenter & Connected Systems Group July 22 nd 2013 1 What

More information

OPENSHMEM AND OFI: BETTER TOGETHER

OPENSHMEM AND OFI: BETTER TOGETHER 4th ANNUAL WORKSHOP 208 OPENSHMEM AND OFI: BETTER TOGETHER James Dinan, David Ozog, and Kayla Seager Intel Corporation [ April, 208 ] NOTICES AND DISCLAIMERS Intel technologies features and benefits depend

More information

Re-Architecting Cloud Storage with Intel 3D XPoint Technology and Intel 3D NAND SSDs

Re-Architecting Cloud Storage with Intel 3D XPoint Technology and Intel 3D NAND SSDs Re-Architecting Cloud Storage with Intel 3D XPoint Technology and Intel 3D NAND SSDs Jack Zhang yuan.zhang@intel.com, Cloud & Enterprise Storage Architect Santa Clara, CA 1 Agenda Memory Storage Hierarchy

More information

Advanced Parallel Programming I

Advanced Parallel Programming I Advanced Parallel Programming I Alexander Leutgeb, RISC Software GmbH RISC Software GmbH Johannes Kepler University Linz 2016 22.09.2016 1 Levels of Parallelism RISC Software GmbH Johannes Kepler University

More information

Outline. Motivation Parallel k-means Clustering Intel Computing Architectures Baseline Performance Performance Optimizations Future Trends

Outline. Motivation Parallel k-means Clustering Intel Computing Architectures Baseline Performance Performance Optimizations Future Trends Collaborators: Richard T. Mills, Argonne National Laboratory Sarat Sreepathi, Oak Ridge National Laboratory Forrest M. Hoffman, Oak Ridge National Laboratory Jitendra Kumar, Oak Ridge National Laboratory

More information

IXPUG 16. Dmitry Durnov, Intel MPI team

IXPUG 16. Dmitry Durnov, Intel MPI team IXPUG 16 Dmitry Durnov, Intel MPI team Agenda - Intel MPI 2017 Beta U1 product availability - New features overview - Competitive results - Useful links - Q/A 2 Intel MPI 2017 Beta U1 is available! Key

More information

POWER YOUR CREATIVITY WITH THE INTEL CORE X-SERIES PROCESSOR FAMILY

POWER YOUR CREATIVITY WITH THE INTEL CORE X-SERIES PROCESSOR FAMILY Product Brief POWER YOUR CREATIVITY WITH THE INTEL CORE X-SERIES PROCESSOR FAMILY The Ultimate Creator PC Platform Made to create, the latest X-series processor family is powered by up to 18 cores and

More information

Dell EMC Ready Bundle for HPC Digital Manufacturing Dassault Systѐmes Simulia Abaqus Performance

Dell EMC Ready Bundle for HPC Digital Manufacturing Dassault Systѐmes Simulia Abaqus Performance Dell EMC Ready Bundle for HPC Digital Manufacturing Dassault Systѐmes Simulia Abaqus Performance This Dell EMC technical white paper discusses performance benchmarking results and analysis for Simulia

More information

Making the Cloud Work for You Introducing the Intel Xeon Processor E5 Family

Making the Cloud Work for You Introducing the Intel Xeon Processor E5 Family Making the Cloud Work for You Introducing the Intel Xeon Processor E5 Family Legal Information Today s presentations contain forward-looking statements. All statements made that are not historical facts

More information

Vector Engine Processor of SX-Aurora TSUBASA

Vector Engine Processor of SX-Aurora TSUBASA Vector Engine Processor of SX-Aurora TSUBASA Shintaro Momose, Ph.D., NEC Deutschland GmbH 9 th October, 2018 WSSP 1 NEC Corporation 2018 Contents 1) Introduction 2) VE Processor Architecture 3) Performance

More information

LS-DYNA Performance Benchmark and Profiling. April 2015

LS-DYNA Performance Benchmark and Profiling. April 2015 LS-DYNA Performance Benchmark and Profiling April 2015 2 Note The following research was performed under the HPC Advisory Council activities Participating vendors: Intel, Dell, Mellanox Compute resource

More information

Future of Compute is Big Data. Mark Seager Intel Fellow, CTO for HPC Ecosystem Intel Corporation

Future of Compute is Big Data. Mark Seager Intel Fellow, CTO for HPC Ecosystem Intel Corporation Future of Compute is Big Data Mark Seager Intel Fellow, CTO for HPC Ecosystem Intel Corporation Traditional HPC is scientific simulation First ever Nobel prize for HPC takes the experiment into cyberspace

More information

Changpeng Liu. Cloud Storage Software Engineer. Intel Data Center Group

Changpeng Liu. Cloud Storage Software Engineer. Intel Data Center Group Changpeng Liu Cloud Storage Software Engineer Intel Data Center Group Notices & Disclaimers Intel technologies features and benefits depend on system configuration and may require enabled hardware, software

More information

Technology Insight: Intel s Next Generation Microarchitecture Code Name Skylake

Technology Insight: Intel s Next Generation Microarchitecture Code Name Skylake Technology Insight: Intel s Next Generation Microarchitecture Code Name Skylake Julius Mandelblat, Senior Principal Engineer, Intel SPCS001 AGENDA Introduction and Overview Core Microarchitecture Interconnect

More information

H.J. Lu, Sunil K Pandey. Intel. November, 2018

H.J. Lu, Sunil K Pandey. Intel. November, 2018 H.J. Lu, Sunil K Pandey Intel November, 2018 Issues with Run-time Library on IA Memory, string and math functions in today s glibc are optimized for today s Intel processors: AVX/AVX2/AVX512 FMA It takes

More information

Performance Evaluation of NWChem Ab-Initio Molecular Dynamics (AIMD) Simulations on the Intel Xeon Phi Processor

Performance Evaluation of NWChem Ab-Initio Molecular Dynamics (AIMD) Simulations on the Intel Xeon Phi Processor * Some names and brands may be claimed as the property of others. Performance Evaluation of NWChem Ab-Initio Molecular Dynamics (AIMD) Simulations on the Intel Xeon Phi Processor E.J. Bylaska 1, M. Jacquelin

More information

Single-Points of Performance

Single-Points of Performance Single-Points of Performance Mellanox Technologies Inc. 29 Stender Way, Santa Clara, CA 9554 Tel: 48-97-34 Fax: 48-97-343 http://www.mellanox.com High-performance computations are rapidly becoming a critical

More information

Small File I/O Performance in Lustre. Mikhail Pershin, Joe Gmitter Intel HPDD April 2018

Small File I/O Performance in Lustre. Mikhail Pershin, Joe Gmitter Intel HPDD April 2018 Small File I/O Performance in Lustre Mikhail Pershin, Joe Gmitter Intel HPDD April 2018 Overview Small File I/O Concerns Data on MDT (DoM) Feature Overview DoM Use Cases DoM Performance Results Small File

More information

Achieving 2.5X 1 Higher Performance for the Taboola TensorFlow* Serving Application through Targeted Software Optimization

Achieving 2.5X 1 Higher Performance for the Taboola TensorFlow* Serving Application through Targeted Software Optimization white paper Internet Discovery Artificial Intelligence (AI) Achieving.X Higher Performance for the Taboola TensorFlow* Serving Application through Targeted Software Optimization As one of the world s preeminent

More information

Jim Pappas Director of Technology Initiatives, Intel Vice-Chair, Storage Networking Industry Association (SNIA) December 07, 2018

Jim Pappas Director of Technology Initiatives, Intel Vice-Chair, Storage Networking Industry Association (SNIA) December 07, 2018 Jim Pappas Director of Technology Initiatives, Intel Vice-Chair, Storage Networking Industry Association (SNIA) December 07, 2018 jim@intel.com 1 How did this Effort Start? Memristor MRAM Carbon Nanotube

More information

investor meeting SANTA CLARA Diane Bryant Senior Vice President & General Manager Data Center Group

investor meeting SANTA CLARA Diane Bryant Senior Vice President & General Manager Data Center Group investor meeting 2 0 1 5 SANTA CLARA Diane Bryant Senior Vice President & General Manager Data Center Group Key Messages Fundamental growth drivers remain strong Adoption of cloud computing growing and

More information

Performance Optimizations via Connect-IB and Dynamically Connected Transport Service for Maximum Performance on LS-DYNA

Performance Optimizations via Connect-IB and Dynamically Connected Transport Service for Maximum Performance on LS-DYNA Performance Optimizations via Connect-IB and Dynamically Connected Transport Service for Maximum Performance on LS-DYNA Pak Lui, Gilad Shainer, Brian Klaff Mellanox Technologies Abstract From concept to

More information

EARLY EVALUATION OF THE CRAY XC40 SYSTEM THETA

EARLY EVALUATION OF THE CRAY XC40 SYSTEM THETA EARLY EVALUATION OF THE CRAY XC40 SYSTEM THETA SUDHEER CHUNDURI, SCOTT PARKER, KEVIN HARMS, VITALI MOROZOV, CHRIS KNIGHT, KALYAN KUMARAN Performance Engineering Group Argonne Leadership Computing Facility

More information

Kevin O Leary, Intel Technical Consulting Engineer

Kevin O Leary, Intel Technical Consulting Engineer Kevin O Leary, Intel Technical Consulting Engineer Moore s Law Is Going Strong Hardware performance continues to grow exponentially We think we can continue Moore's Law for at least another 10 years."

More information

Modern CPU Architectures

Modern CPU Architectures Modern CPU Architectures Alexander Leutgeb, RISC Software GmbH RISC Software GmbH Johannes Kepler University Linz 2014 16.04.2014 1 Motivation for Parallelism I CPU History RISC Software GmbH Johannes

More information

Intel Advisor XE Future Release Threading Design & Prototyping Vectorization Assistant

Intel Advisor XE Future Release Threading Design & Prototyping Vectorization Assistant Intel Advisor XE Future Release Threading Design & Prototyping Vectorization Assistant Parallel is the Path Forward Intel Xeon and Intel Xeon Phi Product Families are both going parallel Intel Xeon processor

More information

Intel s Architecture for NFV

Intel s Architecture for NFV Intel s Architecture for NFV Evolution from specialized technology to mainstream programming Net Futures 2015 Network applications Legal Disclaimer INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION

More information

Intel Enterprise Processors Technology

Intel Enterprise Processors Technology Enterprise Processors Technology Kosuke Hirano Enterprise Platforms Group March 20, 2002 1 Agenda Architecture in Enterprise Xeon Processor MP Next Generation Itanium Processor Interconnect Technology

More information

Desktop 4th Generation Intel Core, Intel Pentium, and Intel Celeron Processor Families and Intel Xeon Processor E3-1268L v3

Desktop 4th Generation Intel Core, Intel Pentium, and Intel Celeron Processor Families and Intel Xeon Processor E3-1268L v3 Desktop 4th Generation Intel Core, Intel Pentium, and Intel Celeron Processor Families and Intel Xeon Processor E3-1268L v3 Addendum May 2014 Document Number: 329174-004US Introduction INFORMATION IN THIS

More information

6th Generation Intel Core Processor Series

6th Generation Intel Core Processor Series 6th Generation Intel Core Processor Series Application Power Guidelines Addendum Supporting the 6th Generation Intel Core Processor Series Based on the S-Processor Lines August 2015 Document Number: 332854-001US

More information

Intel Math Kernel Library (Intel MKL) BLAS. Victor Kostin Intel MKL Dense Solvers team manager

Intel Math Kernel Library (Intel MKL) BLAS. Victor Kostin Intel MKL Dense Solvers team manager Intel Math Kernel Library (Intel MKL) BLAS Victor Kostin Intel MKL Dense Solvers team manager Intel MKL BLAS/Sparse BLAS Original ( dense ) BLAS available from www.netlib.org Additionally Intel MKL provides

More information

Extremely Fast Distributed Storage for Cloud Service Providers

Extremely Fast Distributed Storage for Cloud Service Providers Solution brief Intel Storage Builders StorPool Storage Intel SSD DC S3510 Series Intel Xeon Processor E3 and E5 Families Intel Ethernet Converged Network Adapter X710 Family Extremely Fast Distributed

More information

Aim High. Intel Technical Update Teratec 07 Symposium. June 20, Stephen R. Wheat, Ph.D. Director, HPC Digital Enterprise Group

Aim High. Intel Technical Update Teratec 07 Symposium. June 20, Stephen R. Wheat, Ph.D. Director, HPC Digital Enterprise Group Aim High Intel Technical Update Teratec 07 Symposium June 20, 2007 Stephen R. Wheat, Ph.D. Director, HPC Digital Enterprise Group Risk Factors Today s s presentations contain forward-looking statements.

More information

Changpeng Liu. Senior Storage Software Engineer. Intel Data Center Group

Changpeng Liu. Senior Storage Software Engineer. Intel Data Center Group Changpeng Liu Senior Storage Software Engineer Intel Data Center Group Legal Notices and Disclaimers Intel technologies features and benefits depend on system configuration and may require enabled hardware,

More information

Introduction: Modern computer architecture. The stored program computer and its inherent bottlenecks Multi- and manycore chips and nodes

Introduction: Modern computer architecture. The stored program computer and its inherent bottlenecks Multi- and manycore chips and nodes Introduction: Modern computer architecture The stored program computer and its inherent bottlenecks Multi- and manycore chips and nodes Motivation: Multi-Cores where and why Introduction: Moore s law Intel

More information

Future of datacenter STORAGE. Carol Wilder, Niels Reimers,

Future of datacenter STORAGE. Carol Wilder, Niels Reimers, Future of datacenter STORAGE Carol Wilder, carol.a.wilder@intel.com Niels Reimers, niels.reimers@intel.com Legal Notices/disclaimer Intel technologies features and benefits depend on system configuration

More information

A U G U S T 8, S A N T A C L A R A, C A

A U G U S T 8, S A N T A C L A R A, C A A U G U S T 8, 2 0 1 8 S A N T A C L A R A, C A Data-Centric Innovation Summit LISA SPELMAN VICE PRESIDENT & GENERAL MANAGER INTEL XEON PRODUCTS AND DATA CENTER MARKETING Increased integration and optimization

More information

Efficient Parallel Programming on Xeon Phi for Exascale

Efficient Parallel Programming on Xeon Phi for Exascale Efficient Parallel Programming on Xeon Phi for Exascale Eric Petit, Intel IPAG, Seminar at MDLS, Saclay, 29th November 2016 Legal Disclaimers Intel technologies features and benefits depend on system configuration

More information

Performance Analysis of HPC Applications on Several Dell PowerEdge 12 th Generation Servers

Performance Analysis of HPC Applications on Several Dell PowerEdge 12 th Generation Servers Performance Analysis of HPC Applications on Several Dell PowerEdge 12 th Generation Servers This Dell technical white paper evaluates and provides recommendations for the performance of several HPC applications

More information

Meet the Increased Demands on Your Infrastructure with Dell and Intel. ServerWatchTM Executive Brief

Meet the Increased Demands on Your Infrastructure with Dell and Intel. ServerWatchTM Executive Brief Meet the Increased Demands on Your Infrastructure with Dell and Intel ServerWatchTM Executive Brief a QuinStreet Excutive Brief. 2012 Doing more with less is the mantra that sums up much of the past decade,

More information

DEVITO AUTOMATED HIGH-PERFORMANCE FINITE DIFFERENCES FOR GEOPHYSICAL EXPLORATION

DEVITO AUTOMATED HIGH-PERFORMANCE FINITE DIFFERENCES FOR GEOPHYSICAL EXPLORATION DEVITO AUTOMATED HIGH-PERFORMANCE FINITE DIFFERENCES FOR GEOPHYSICAL EXPLORATION F. Luporini 1, C. Yount 4, M. Louboutin 3, N. Kukreja 1, P. Witte 2, M. Lange 5, P. Kelly 1, F. Herrmann 3, G.Gorman 1 1Imperial

More information

Visualizing and Finding Optimization Opportunities with Intel Advisor Roofline feature

Visualizing and Finding Optimization Opportunities with Intel Advisor Roofline feature Visualizing and Finding Optimization Opportunities with Intel Advisor Roofline feature Intel Software Developer Conference Frankfurt, 2017 Klaus-Dieter Oertel, Intel Agenda Intel Advisor for vectorization

More information

Interconnect Your Future

Interconnect Your Future Interconnect Your Future Paving the Path to Exascale November 2017 Mellanox Accelerates Leading HPC and AI Systems Summit CORAL System Sierra CORAL System Fastest Supercomputer in Japan Fastest Supercomputer

More information

Intel Select Solutions for Professional Visualization with Advantech Servers & Appliances

Intel Select Solutions for Professional Visualization with Advantech Servers & Appliances Solution Brief Intel Select Solution for Professional Visualization Intel Xeon Processor Scalable Family Powered by Intel Rendering Framework Intel Select Solutions for Professional Visualization with

More information