Akhilesh Kumar, Sailesh Kottapalli, Ian M Steiner, Bob Valentine, Israel Hirsh, Geetha Vedaraman, Lily P Looi, Mohamed Arafa, Andy Rudoff, Sreenivas

Size: px
Start display at page:

Download "Akhilesh Kumar, Sailesh Kottapalli, Ian M Steiner, Bob Valentine, Israel Hirsh, Geetha Vedaraman, Lily P Looi, Mohamed Arafa, Andy Rudoff, Sreenivas"

Transcription

1 Akhilesh Kumar, Sailesh Kottapalli, Ian M Steiner, Bob Valentine, Israel Hirsh, Geetha Vedaraman, Lily P Looi, Mohamed Arafa, Andy Rudoff, Sreenivas Mandava, Bahaa Fahim, Sujal A Vora Intel Corporation, 2018

2 Notices and Disclaimers This document contains information on products, services and/or processes in development. All information provided here is subject to change without notice. Contact your Intel representative to obtain the latest forecast, schedule, specifications and roadmaps. Intel technologies features and benefits depend on system configuration and may require enabled hardware, software or service activation. Learn more at intel.com, or from the OEM or retailer. No computer system can be absolutely secure. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more complete information visit Optimization Notice: Intel's compilers may or may not optimize to the same degree for non-intel microprocessors for optimizations that are not unique to Intel microprocessors. These optimizations include SSE2, SSE3, and SSSE3 instruction sets and other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors. Certain optimizations not specific to Intel microarchitecture are reserved for Intel microprocessors. Please refer to the applicable product User and Reference Guides for more information regarding the specific instruction sets covered by this notice. Tests document performance of components on a particular test, in specific systems. Differences in hardware, software, or configuration will affect actual performance. Consult other sources of information to evaluate performance as you consider your purchase. For more complete information about performance and benchmark results, visit Results have been estimated or simulated using internal Intel analysis or architecture simulation or modeling, and provided to you for informational purposes. Any differences in your system hardware, software or configuration may affect your actual performance. Cost reduction scenarios described are intended as examples of how a given Intel-based product, in the specified circumstances and configurations, may affect future costs and provide cost savings. Circumstances will vary. Intel does not guarantee any costs or cost reduction. Intel does not control or audit third-party benchmark data or the web sites referenced in this document. You should visit the referenced web site and confirm whether referenced data are accurate. Intel, the Intel logo, Intel Optane and Xeon are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the united states and other countries. * Other names and brands may be claimed as the property of others Intel Corporation. 2

3 Outline Intel Xeon Scalable Processor Roadmap Focus Areas for Cascade Lake-SP Instruction Enhancement for AI/Deep Learning Inference Intel Optane DC Persistent Memory Side Channel Mitigations Wrap up 3

4 First Generation Intel Xeon Scalable Processor Introduced in July 2017 Skylake-SP core microarchitecture with data center specific enhancements Intel AVX-512 with 32 DP flops per cycle per core Data center optimized cache hierarchy 1MB L2 per core, non-inclusive L3 New Intel Mesh architecture Enhanced 6 channel memory subsystem 48 lanes of PCIe Gen3 with integrated DMA, NTB, and VMD devices New Intel Ultra Path Interconnect (Intel UPI) 6 Channels DDR4 DDR4 DDR4 DDR4 Core Core Core Core 2 or 3 UPI UPI Features Cores and Threads Per CPU Last-level Cache (LLC) QPI/UPI Speed (GT/s) Intel Xeon Scalable Processor Up to 28 cores and 56 threads Up to 38.5 MB (non-inclusive) Up to 3x 10.4 GT/s DDR4 DDR4 DDR4 Core Core Shared L3 UPI UPI PCIe* Lanes/ Controllers Memory Population Up to 48 / 12 / PCIe 3.0 (2.5, 5, 8 GT/s) Up to 6 channels of up to 2 RDIMMs, LRDIMMs, or 3DS LRDIMMs 48 Lanes PCIe* 3.0 DMI3 Max Memory Speed Up to Foundation for Accelerating Data Center Innovations 4

5 Next Step in the Intel Xeon Scalable Processor Cascade Lake CPU is designed to be compatible with first-gen Intel Xeon Scalable platform Same core count, cache size, and I/O speeds as first-gen Process tuning, frequency push, targeted performance improvements Architectural improvements through targeted instruction set enhancements New platform capabilities with support for Intel Optane DC persistent memory Hardware enhancements for protection against side-channel methods Grantley Platform Intel Microarchitecture Codenamed Haswell Haswell 22nm Features Cores and Threads Last-level Cache UPI Speed (GT/s) PCIe* 3.0 Lanes Memory Speed Broadwell 14nm Purley Platform Intel Microarchitecture Codenamed Skylake New Microarchitecture Skylake- SP 14nm New Microarchitecture Cascade Lake CPU Cascade Lake-SP 14nm Up to 28 Cores and 56 Threads Up to 38.5 MB (non-inclusive) Up to 3x 10.4 GT/s Up to 48 lanes with 12 controllers Up to 6 up to 2666 MHz 5

6 6

7 Images/Second AI/Deep Learning Software Optimizations on first generation Intel Xeon Scalable Processor Higher is Better 5.4X(1) In 10 months since Intel Xeon Scalable Processor launch July 11, 2017 July 11, 2017 December 14, 2017 January 19, 2018 April 3, 2018 May 8, 2018 May 8, 2018 FP32 FP32 FP32 FP32 Int8 Int8 Int8 BS = 64 BS = 64 BS = 64 BS = 64 BS = 64 BS = 64 BS = 64, 4 instances 2S Intel Xeon Processor 2699 V4 Inference Throughput on Intel Caffe ResNet50 2S Intel Xeon Scalable Platinum 8180 Processor 2S Intel Xeon Scalable Platinum 8180 Processor 2S Intel Xeon Scalable Platinum 8180 Processor 2S Intel Xeon Scalable Platinum 8180 Processor 2S Intel Xeon Scalable Platinum 8180 Processor 2S Intel Xeon Scalable Platinum 8180 Processor (1) Up to 5.4X performance improvement with software optimizations on Caffe Resnet-50 in 10 months with 2 socket Intel Xeon Scalable Processor, Configuration Details 1, 2. Performance measurements were obtained prior to implementation of recent software patches and firmware updates intended to address exploits referred to as "Spectre" and "Meltdown." Implementation of these updates may make these results inapplicable to your device or system. Performance measurements were obtained prior to implementation of recent software patches and firmware updates intended to address exploits referred to as "Spectre" and "Meltdown." Implementation of these updates may make these results inapplicable to your device or system. Optimization Notice: Intel's compilers may or may not optimize to the same degree for non-intel microprocessors for optimizations that are not unique to Intel microprocessors. These optimizations include SSE2, SSE3, and SSSE3 instruction sets and other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors. Certain optimizations not specific to Intel microarchitecture are reserved for Intel microprocessors. Please refer to the applicable product User and Reference Guides for more information regarding the specific instruction sets covered by this notice. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more complete information visit: Source: Intel measured as of April

8 Neural Machine Translation Software Optimization on first generation Intel Xeon Scalable Processor Up to 14X(1) higher inference performance Configuration Details 3 4 Performance measurements were obtained prior to implementation of recent software patches and firmware updates intended to address exploits referred to as "Spectre" and "Meltdown." Implementation of these updates may make these results inapplicable to your device or system. Optimization Notice: Intel's compilers may or may not optimize to the same degree for non-intel microprocessors for optimizations that are not unique to Intel microprocessors. These optimizations include SSE2, SSE3, and SSSE3 instruction sets and other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors. Certain optimizations not specific to Intel microarchitecture are reserved for Intel microprocessors. Please refer to the applicable product User and Reference Guides for more information regarding the specific instruction sets covered by this notice. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more complete information visit: Source: Intel measured as of May

9 Cascade Lake Vector Neural Network Instructions Vector Neural Network Instruction (VNNI) on Cascade Lake accelerates Deep Learning and AI inference workloads VNNI : A new set of Intel Advanced Vector Extension (Intel AVX-512) instructions 8-bit (int8) new instruction (VPDPBUSD) Fuses 3 instructions in inner convolution loop using int8 data type 16-bit (int16) new instruction (VPDPWSSD) Fuses 2 instructions in inner convolution loop using int16 data type 9

10 AI/DL Inference Enhancements on INT16 with VNNI Xeon Scalable AVX-512 INPUT INT16 INPUT INT16 vpmaddwd OUTPUT INT32 CONSTANT INT32 vpaddd OUTPUT INT32 Current AVX-512 instructions to perform INT16 convolutions: vpmaddwd, vpaddd New instructions for accelerating AI on Intel Xeon Scalable processors using int16 data INPUT INT16 INPUT INT16 Cascade Lake SP VNNI CONSTANT INT32 vpdpwssd OUTPUT INT32 VNNI instruction to accelerate INT16 convolutions: vpdpwssd 10

11 AI/DL Inference Enhancements on INT8 with VNNI Xeon Scalable AVX-512 INPUT INT8 INPUT INT8 vpmaddubsw OUTPUT INT16 CONSTANT INT16 vpmaddwd OUTPUT INT32 CONSTANT INT32 vpaddd OUTPUT INT32 Current AVX-512 instructions to perform INT8 convolutions: vpmaddubsw, vpmaddwd, vpaddd New instructions for accelerating AI on Intel Xeon Scalable processors using int8 data INPUT INT8 INPUT INT8 Cascade Lake SP VNNI CONSTANT INT32 vpdpbusd OUTPUT INT32 VNNI instruction to accelerate INT8 convolutions: vpdpbusd 11

12 Elements Processed/Cycle VNNI Per Core Throughput Vector Elements Processed per Cycle on Different Data Types FP32 Int16 Int8 FP32 VNNI Int16 VNNI Int8 vfmadd231ps vpmaddwd, vpaddd vpmaddubsw, vpmaddwd, vpaddd vfmadd231ps vpdpwssd vpdpbusd Input 32bit / Output 32bit Input 16bit / Output 32bit Input 8bit / Output 32bit Input 32bit / Output 32bit Input 16bit / Output 32bit Input 8bit / Output 32bit Intel Xeon Scalable Processor Codenamed Skylake FUTURE Intel Xeon Scalable Processor Codenamed CascadeLake Performance measurements were obtained prior to implementation of recent software patches and firmware updates intended to address exploits referred to as "Spectre" and "Meltdown." Implementation of these updates may make these results inapplicable to your device or system. Optimization Notice: Intel's compilers may or may not optimize to the same degree for non-intel microprocessors for optimizations that are not unique to Intel microprocessors. These optimizations include SSE2, SSE3, and SSSE3 instruction sets and other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors. Certain optimizations not specific to Intel microarchitecture are reserved for Intel microprocessors. Please refer to the applicable product User and Reference Guides for more information regarding the specific instruction sets covered by this notice. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more complete information visit: Source: Intel measured as of May

13 I n f e r e n c e T h r o u g h p u t ( i m a g e s / s e c ) Inference Throughput with VNNI I n t e l o p t i m i z a t i o n f o r C a f f e R e s N e t - 50 Estimated Throughput on Cascade Lake with VNNI 5.4X i n t 8 o p t i m i z a t i o n s 2.8X F r a m e w o r k o p t i m i z a t i o n s Framework & Library support 1.0 F P 3 2 M K L - DNN Jul 17 Jan 18 Aug 18 Intel Xeon Scalable Processor 13

14 14

15 Growing Gap Between Memory Hierarchy Limitations to traditional architecture impede unified data management MEMORY DRAM HOT TIER Cost prohibitive for data intensive applications STORAGE SSD WARM TIER HDD/TAPE COLD TIER Performance impedes data intensive applications Media capability limits usage to cold tier 15

16 Growing Gap Between Memory Hierarchy Limitations to traditional architecture impede unified data management MEMORY STORAGE DRAM HOT TIER SSD WARM TIER HDD/TAPE COLD TIER Cost prohibitive for data intensive applications Latency * : ~1000x Bandwidth * : ~0.1x Capacity/$ * : ~40x Performance impedes data intensive applications Media capability limits usage to cold tier * Actual performance and price may vary 16

17 Intel Innovations Address These Gaps MEMORY Improving memory capacity Improving SSD performance DRAM HOT TIER STORAGE Delivering efficient and scalable storage SSD WARM TIER Intel 3D Nand SSD HDD/TAPE COLD TIER 17

18 Big and Affordable Memory 128, 256, 512GB High Performance Storage DDR4 Pin Compatible Direct Load/Store Access Hardware Encryption Native Persistence High Reliability Supported on future Intel Xeon Scalable Processors (Cascade Lake)

19 Intel Optane DC Persistent Memory Hardware Interface Intel Xeon Cascade Lake IMC IMC Intel Optane DC Persistent Memory Module DDR4 electrical and physical interface with proprietary protocol extensions Memory channel can be shared between DDR4 and Intel Optane DC persistent memory modules Enables systems to support greater than 3TB of system memory per CPU socket Cache line size accesses Idle latency close to DDR4 DIMMs Performance measurements were obtained prior to implementation of recent software patches and firmware updates intended to address exploits referred to as "Spectre" and "Meltdown." Implementation of these updates may make these results inapplicable to your device or system. Optimization Notice: Intel's compilers may or may not optimize to the same degree for non-intel microprocessors for optimizations that are not unique to Intel microprocessors. These optimizations include SSE2, SSE3, and SSSE3 instruction sets and other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors. Certain optimizations not specific to Intel microarchitecture are reserved for Intel microprocessors. Please refer to the applicable product User and Reference Guides for more information regarding the specific instruction sets covered by this notice. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more complete information visit: Source: Intel measured as of May

20 WPQ CPU CACHES Hardware Interface: Persistence Domain MOV Core L1 L1 L2 L3 CLWB + fence -or- CLFLUSHOPT + fence -or- CLFLUSH -or- NT stores + fence -or- WBINVD (kernel only) Custom Power fail protected domain indicated by ACPI property: CPU Cache Hierarchy WPQ ADR -or- WPQ Flush (kernel only) Minimum Required Power fail protected domain: Memory subsystem DIMM 20

21 The SNIA NVM Programming Model Mgmt. storage file memory Management UI Management Library Application Standard Raw Device Access Application Standard File API File System Generic NVDIMM Driver Standard File API Application Load/ Store pmem-aware File System DAX MMU Mappings USER SPACE KERNEL SPACE For more information software.intel.com/pmem 21

22 The Persistent Memory Development Kit - pmdk PMDK is a collection of libraries Developers pull only what they need Low level programming support Transaction APIs Fully validated Performance tuned Open source & product neutral software.intel.com/pmem Application Standard File API pmem-aware File System Persistent Memory PMDK Libraries MMU Mappings USER SPACE KERNEL SPACE

23 Usage Example: High Performance Storage DRAM NAND NVMe SSD 9X More read transactions (Ops/sec) 11X More users Per system Vs. comparable server system with dram and nand nvme drives when using Apache* Cassandra-4.0 Results have been estimated based on tests conducted on pre-production systems, and provided to you for informational purposes. Any differences in your system hardware, software or configuration may affect your actual performance. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more information go to 23

24 Average Expected Latency Usage Example: Data Replication with Persistent Memory over Fabric initiator Average 4KB Write I/O Round Trip Time Comparison NVMe+NAND SSD vs. PMoF PCIe & NVMe Protocol Network/Fabric NVM Media Software FABRIC Target Replication to NAND Replication using PMoF Results have been estimated or simulated using internal Intel analysis or architecture simulation or modeling, and provided to you for informational purposes. Any differences in your system hardware, software or configuration may affect your actual performance. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more information go to *Three 9s and five 9s availability assumes bi-weekly maintenance restarts. 24

25 25

26 Cascade Lake Mitigations for Side-Channel Methods Cascade Lake implements hardware mitigations against targeted side-channel methods Variant Side-Channel Method Mitigation on Cascade Lake Variant 1 Bounds Check Bypass OS/VMM Variant 2 Branch Target Injection Hardware + OS/VMM Variant 3 Rogue Data Cache Load Hardware Variant 3a Rogue System Register Read Firmware Variant 4 Speculative Store Bypass Firmware + OS/VMM or runtime L1 Terminal Fault Hardware Cascade Lake SP expected to provide higher performance over software mitigations available for existing products For additional information related to security updates and side channel methods on Intel products, please visit

27

28 Future Intel Xeon Scalable Processor (Codename: Cascade Lake-SP) Software Libraries and Optimizations AI/DL Enhancement through VNNI Side-Channel Analysis Mitigations Process Tuning, Frequency Boost, Targeted Performance Improvements Intel Xeon Scalable Platform Further Accelerating Data Center Innovations 28

29

30 Config Details for Skylake Inference throughput (March 2018) Config 1 Framework caffe caffe caffe caffe caffe caffe caffe caffe neon neon branch: master branch: master branch: master branch: master branch: master branch: master branch: master branch: master branch: (HEAD branch: master version: f6d01efbe93f 70726ea3796a4b89c a6341 version: f6d01efbe9 3f70726ea3796a4b 89c612365a6341 version: f6d01efbe93 version: f6d01efbe93 version: f6d01efbe93 f70726ea3796a4b89 f70726ea3796a4b89 f70726ea3796a4b89 c612365a6341 c612365a6341 c612365a6341 version: f6d01efbe93f7 0726ea3796a4b89c a6341 version: f6d01efbe93f7 0726ea3796a4b89c a6341 version: f6d01efbe9 3f70726ea3796a4b 89c612365a6341 version: f43cfa2e26f 9c84b0f42fcda50b6 b version: f43cfa2e26f 9c84b0f42fcda50b6 b Platform SKX_8180 SKX_8180 SKX_8180 SKX_8180 SKX_8180 SKX_8180 SKX_8180 SKX_8180 SKX_8180 SKX_8180 Sockets Processor Intel(R) Xeon(R) Intel(R) Xeon(R) Intel(R) Xeon(R) Intel(R) Xeon(R) Intel(R) Xeon(R) Intel(R) Xeon(R) Intel(R) Xeon(R) Intel(R) Xeon(R) Intel(R) Xeon(R) Intel(R) Xeon(R) Platinum 8180 CPU Platinum 8180 CPU Platinum 8180 CPU Platinum 8180 CPU Platinum 8180 Platinum 8180 CPU Platinum 8180 CPU Platinum 8180 CPU Platinum 8180 Platinum 2.50GHz / 2.50GHz / 2.50GHz / 2.50GHz / GHz / GHz / 28 cores@ 2.50GHz / 28 cores@ 2.50GHz / 28 cores 2.50GHz / 28 cores 2.50GHz / 28 cores cores cores cores cores BIOS SE5C620.86B SE5C620.86B SE5C620.86B SE5C620.86B SE5C620.86B SE5C620.86B SE5C620.86B SE5C620.86B SE5C620.86B SE5C620.86B Enabled Cores Slots Total Memory GB GB GB GB GB GB GB GB GB GB Memory Configuration 2666 MHz 2666 MHz 2666 MHz Memory Comments Micron Micron Micron Micron Micron Micron Micron Micron Micron Micron Disks OS sda 744.1GB,sdb 1.5TB,sdc RS3WC080 HDD 5.5TB CentOS Linux Core sda 744.1GB,sdb 1.5TB,sdc 5.5TB CentOS Linux Core sda 744.1GB,sdb 1.5TB,sdc RS3WC080 HDD 5.5TB CentOS Linux Core sda 744.1GB,sdb 1.5TB,sdc RS3WC080 HDD 5.5TB CentOS Linux Core sda 744.1GB,sdb 1.5TB,sdc RS3WC080 HDD 5.5TB CentOS Linux Core sda 744.1GB,sdb 1.5TB,sdc RS3WC080 HDD 5.5TB sda 744.1GB,sdb 1.5TB,sdc RS3WC080 HDD 5.5TB CentOS Linux CentOS Linux Core Core sda 744.1GB,sdb 1.5TB,sdc 5.5TB CentOS Linux Core sda 744.1GB,sdb 1.5TB,sdc 5.5TB sda 744.1GB,sdb 1.5TB,sdc 5.5TB Ubuntu trusty Ubuntu trusty Hyper Threading ON ON ON ON ON ON ON ON ON ON Turbo ON ON ON ON ON ON ON ON ON ON Topology resnet_50_v1 resnet_50_v1 vgg16 vgg16 vgg16 inception_v3 inception_v3 inception_v3 resnet_50_v2 resnet_50_v2 Batchsize Dataset NoDataLayer NoDataLayer NoDataLayer NoDataLayer NoDataLayer NoDataLayer NoDataLayer NoDataLayer NoDataLayer NoDataLayer MKLDNN MKLDNN MKLDNN MKLDNN MKLDNN MKLDNN MKLDNN MKLDNN MKL2017 MKL2017 Engine version: ae00102be50 6ed0fe2099c6557df2 aa88ad57ec1 version: ae00102be5 06ed0fe2099c6557 df2aa88ad57ec1 version: ae00102be5 version: ae00102be5 version: ae00102be5 06ed0fe2099c6557d 06ed0fe2099c6557d 06ed0fe2099c6557d f2aa88ad57ec1 f2aa88ad57ec1 f2aa88ad57ec1 version: ae00102be506 ed0fe2099c6557df2aa 88ad57ec1 version: ae00102be506 ed0fe2099c6557df2aa 88ad57ec1 version: ae00102be5 06ed0fe2099c6557 df2aa88ad57ec1 version: mklml_lnx_2 version: mklml_lnx_ IP Kernel version generic generic generic generic generic el7.x86_ el7.x86_ el7.x86_ el7.x86_ el7.x86_64 30

31 Config Details for Skylake Inference throughput (April 2018) Config 2 tensorflow tensorflow tensorflow tensorflow tensorflow tensorflow tensorflow tensorflow tensorflow mxnet mxnet mxnet mxnet mxnet mxnet branch: mast branch: master branch: master branch: master branch: master branch: master branch: master branch: master branch: master branch: master branch: master branch: master branch: master branch: master branch: master er Framework version: fdb5 version: 024aecf41 version: 024aecf41 version: fdb5664 version: fdb56649 version: fbbc080d version: 9a0d0 version: fdb566 version: 024aecf4149 version: 024aecf4149 version: 024aecf414 version: 024aecf414 version: 024aecf414 version: 024aecf41 version: 024aecf c 4941e11eb643e e11eb643e c8173a c8173a dbc23eef4a cbc c e11eb643e29ceed 41e11eb643e29ceed 941e11eb643e29cee 941e11eb643e29cee 941e11eb643e29cee 4941e11eb643e e11eb643e a50826 ceed3e1c47a115a 9ceed3e1c47a acec8b1111d 26acec8b1111d9c cea8 3ffadf3537b10 a50826acec8b 3e1c47a115ad 3e1c47a115ad d3e1c47a115ad d3e1c47a115ad d3e1c47a115ad ceed3e1c47a115ad ceed3e1c47a115ad acec8b1111d d ad 9cd6f d6f 59b 03bb0006c4 1111d9cd6f 9cd6f Sockets Processor BIOS Intel(R) Xeon(R) Platinum GHz / 28 cores SE5C620.86B Intel(R) Xeon(R) Platinum GHz / 28 cores SE5C620.86B Intel(R) Xeon(R) Platinum GHz / 28 cores Intel(R) Xeon(R) Platinum GHz / 28 cores Intel(R) Xeon(R) Platinum GHz / 28 cores Intel(R) Xeon(R) Platinum GHz / 28 cores Intel(R) Xeon(R) Platinum GHz / 28 cores Intel(R) Xeon(R) Platinum GHz / 28 cores Intel(R) Xeon(R) Platinum GHz / 28 cores Intel(R) Xeon(R) Platinum GHz / 28 cores Intel(R) Xeon(R) Platinum GHz / 28 cores Intel(R) Xeon(R) Platinum GHz / 28 cores Intel(R) Xeon(R) Intel(R) Xeon(R) Platinum 8180 Platinum GHz / GHz / 28 cores cores Intel(R) Xeon(R) Platinum GHz / 28 cores SE5C620.86B SE5C620.86B.00.0 SE5C620.86B.00.0 SE5C620.86B.00.0 SE5C620.86B.00.0 SE5C620.86B.00. SE5C620.86B.00.0 SE5C620.86B.00.0 SE5C620.86B.0 SE5C620.86B.0 SE5C620.86B SE5C620.86B SE5C620.86B Enabled Cores Slots Total Memory GB GB GB GB GB GB GB GB GB GB GB GB GB GB GB Memory Configuration 12slots / 32 GB 12slots / 32 GB / / Memory Comments Micron Micron Micron Micron Micron Micron Micron Micron Micron Micron Micron Micron Micron Micron Micron Disks OS sda 744.1GB,sdb 1.5TB,sdc RS3WC080 HDD 5.5TB CentOS Linux Core sda 744.1GB,sdb 1.5TB,sdc RS3WC080 HDD 5.5TB CentOS Linux Core sda 744.1GB,sdb 1.5TB,sdc 5.5TB CentOS Linux Core sda 744.1GB,sdb 1.5TB,sdc 5.5TB CentOS Linux Core sda 744.1GB,sdb 1.5TB,sdc 5.5TB CentOS Linux Core sda RS3WC080 HDD 744.1GB,sdb 1.5TB,sdc 5.5TB CentOS Linux Core sda RS3WC080 HDD 744.1GB,sdb 1.5TB,sdc 5.5TB CentOS Linux Core sda RS3WC080 HDD 744.1GB,sdb 1.5TB,sdc 5.5TB CentOS Linux Core sda RS3WC080 HDD 744.1GB,sdb 1.5TB,sdc 5.5TB CentOS Linux Core sda RS3WC080 HDD 744.1GB,sdb 1.5TB,sdc 5.5TB Ubuntu xenial sda RS3WC080 HDD 744.1GB,sdb 1.5TB,sdc 5.5TB Ubuntu xenial sda RS3WC080 HDD 744.1GB,sdb 1.5TB,sdc 5.5TB Ubuntu xenial sda RS3WC080 HDD 744.1GB,sdb RS3WC080 HDD 1.5TB,sdc RS3WC080 HDD 5.5TB sda RS3WC080 HDD 744.1GB,sdb RS3WC080 HDD 1.5TB,sdc RS3WC080 HDD 5.5TB 12slots / 32 GB / 2666 MHz sda RS3WC080 HDD 744.1GB,sdb RS3WC080 HDD 1.5TB,sdc RS3WC080 HDD 5.5TB Ubuntu Ubuntu Ubuntuxenial xenial xenial Hyper Threading ON ON ON ON ON ON ON ON ON ON ON ON ON ON ON Turbo ON ON ON ON ON ON ON ON ON ON ON ON ON ON ON Topology resnet_50_v1 resnet_50_v1 resnet_50_v1 vgg16 vgg16 vgg16 inception_v3 inception_v3 inception_v3 resnet_50_v2 vgg16 vgg16 inception_v3 inception_v3 inception_v3 Batchsize ,64, , Instances/ Streams on 2 sockets Dataset NoDataLayer NoDataLayer NoDataLayer NoDataLayer NoDataLayer NoDataLayer NoDataLayer NoDataLayer NoDataLayer Imagenet Imagenet NoDataLayer Imagenet NoDataLayer Imagenet MKLDNN MKLDNN MKLDNN MKLDNN MKLDNN MKLDNN MKLDNN MKLDNN MKLDNN MKLDNN MKLDNN MKLDNN MKLDNN MKLDNN MKLDNN version: f521 version: e0bfcaa7f version: f5218ff4f version: f5218ff4f version: 283c4a8c version: f5218f version: f5218f version: e0bfcaa7fcb2 version: e0bfcaa7fcb2 version: e0bfcaa7fcb version: e0bfcaa7fcb version: e0bfcaa7fcb version: e0bfcaa7fc version: e0bfcaa7fc version: e0bfcaa7fc 8ff4fd2d16d1 Engine b1e1558f5f c b1e1558f5f c 1db807a729 1db807a729 Kernel version el7.x86_ generic 2b1e1558f5f c1db807a729 2b1e1558f5f c1db807a generic generic 2b1e1558f5f c1db807a generic b2b1e1558f5f c1db807a729 b2b1e1558f5f c1db807a generic generic cb2b1e1558f5f06 d2d16d13aada2e d2d16d13aada2e 24b4e1dea05fbc7 f4fd2d16d13aa f4fd2d16d13aa b2b1e1558f5f067 3aada2e632a 76933c1db807a7 632afd18f2514fe 632afd18f2514fe a7375bab da2e632afd18f da2e632afd18f 6933c1db807a e3 e3 ad1 2514fee3 2514fee generic generic fd18f2514fee el7.x86_ el7.x86_ el7.x86_ el7.x8 6_ el7.x8 6_ el7.x 86_64 31

32 Configuration details of Amazon EC2 C5.18xlarge 1 node systems Config 3 Benchmark Segment Benchmark type Benchmark Metric Framework Topology # of Nodes 1 Platform Sockets Processor BIOS Enabled Cores Platform Slots Total Memory Memory Configuration SSD OS Network Configurations HT Turbo Computer Type AI/ML Inference Sentence/Sec Official mxnet GNMT(sockeye) Amazon EC2 C5.18xlarge instance 2S Intel Xeon Platinum 8124M 3.00GHz (Skylake) N/A 18 cores / socket N/A N/A 144GB N/A EBS Optimized 200GB, Provisioned IOPS SSD Red Hat 7.2 (HVM) Amazon Elastic Network Adapter (ENA) Up to 10 Gbps of aggregate network bandwidth Installed Enhanced Networking with ENA on Centos Placed the all instances in the same placement ON ON Server 32

33 Configuration details of Amazon EC2 C5.18xlarge 1 node systems Config 4 Framework Version Topology Version mxnet mkldnn : f6649e329b23a1efdc40aaa25260d47b4195 GNMT: Batch size GNMT: Dataset, version GNMT: WMT 2017 ( MKLDNN MKL F5218ff4fd2d16d13aada2e632afd18f2514fee3 Version: parallel_studio_xe_2018_update1 update1_cluster_edition_online.tgz Compiler g++: gcc:

34 Configuration Details for Inference Throughput with VNNI Config 5 1x inference throughput improvement in July 2017: Tested by Intel as of July 11 th 2017: Platform: 2S Intel Xeon Platinum GHz (28 cores), HT disabled, turbo disabled, scaling governor set to performance via intel_pstate driver, 384GB DDR ECC RAM. CentOS Linux release (Core), Linux kernel el7.x86_64. SSD: Intel SSD DC S3700 Series (800GB, 2.5in SATA 6Gb/s, 25nm, MLC).Performance measured with: Environment variables: KMP_AFFINITY='granularity=fine, compact, OMP_NUM_THREADS=56, CPU Freq set with cpupower frequency-set -d 2.5G -u 3.8G -g performance. Caffe: ( revision f96b759f71b f690af267158b82b150b5c. Inference measured with caffe time --forward_only command, training measured with caffe time command. For ConvNet topologies, dummy dataset was used. For other topologies, data was stored on local storage and cached in memory before training. Topology specs from (ResNet-50),and (ConvNet benchmarks; files were updated to use newer Caffe prototxt format but are functionally equivalent). Intel C++ compiler ver , Intel MKL small libraries version Caffe run with numactl -l. 2.8x inference throughput improvement in January 2018: Tested by Intel as of Jan 19 th 2018 Processor :2 socket Intel(R) Xeon(R) Platinum GHz / 28 cores HT ON, Turbo ON Total Memory GB ( ). CentOS Linux Core, SSD sda 744.1GB,sdb 1.5TB,sdc 5.5TB, Deep Learning Framework Intel Optimization for caffe version:f6d01efbe93f70726ea3796a4b89c612365a6341 Topology::resnet_50_v1 BIOS:SE5C620.86B MKLDNN: version: ae00102be506ed0fe2099c6557df2aa88ad57ec1 NoDataLayer.. Datatype:FP32 Batchsize=64 Measured: imgs/sec vs Tested by Intel as of July 11 th 2017: Platform: 2S Intel Xeon Platinum GHz (28 cores), HT disabled, turbo disabled, scaling governor set to performance via intel_pstate driver, 384GB DDR ECC RAM. CentOS Linux release (Core), Linux kernel el7.x86_64. SSD: Intel SSD DC S3700 Series (800GB, 2.5in SATA 6Gb/s, 25nm, MLC).Performance measured with: Environment variables: KMP_AFFINITY='granularity=fine, compact, OMP_NUM_THREADS=56, CPU Freq set with cpupower frequency-set -d 2.5G -u 3.8G -g performance. Caffe: ( revision f96b759f71b f690af267158b82b150b5c. Inference measured with caffe time --forward_only command, training measured with caffe time command. For ConvNet topologies, dummy dataset was used. For other topologies, data was stored on local storage and cached in memory before training. Topology specs from (ResNet- 50),and (ConvNet benchmarks; files were updated to use newer Caffe prototxt format but are functionally equivalent). Intel C++ compiler ver , Intel MKL small libraries version Caffe run with numactl -l. 34

35 Configuration Details for Inference Throughput with VNNI Config 5 5.4x inference throughput improvement in August 2018: Tested by Intel as of measured July 26 th 2018 :2 socket Intel(R) Xeon(R) Platinum GHz / 28 cores HT ON, Turbo ON Total Memory GB ( ). CentOS Linux Core, kernel: el7.x86_64, SSD sda 744.1GB,sdb 1.5TB,sdc 5.5TB, Deep Learning Framework Intel Optimization for caffe version:a3d5b022fe026e9092fc7abc7654b1162ab9940d Topology::resnet_50_v1 BIOS:SE5C620.86B MKLDNN: version:464c268e544bae26f9b85a2acb9122c766a4c396 instances: 2 instances : NoDatasocket:2 (Results on Intel Xeon Scalable Processor were measured running multiple instances of the framework. Methodology described herelayer. Datatype: INT8 Batchsize=64 Measured: imgs/sec vs Tested by Intel as of July 11 th 2017:2S Intel Xeon Platinum GHz (28 cores), HT disabled, turbo disabled, scaling governor set to performance via intel_pstate driver, 384GB DDR ECC RAM. CentOS Linux release (Core), Linux kernel el7.x86_64. SSD: Intel SSD DC S3700 Series (800GB, 2.5in SATA 6Gb/s, 25nm, MLC).Performance measured with: Environment variables: KMP_AFFINITY='granularity=fine, compact, OMP_NUM_THREADS=56, CPU Freq set with cpupower frequency-set -d 2.5G -u 3.8G -g performance. Caffe: ( revision f96b759f71b f690af267158b82b150b5c. Inference measured with caffe time --forward_only command, training measured with caffe time command. For ConvNet topologies, dummy dataset was used. For other topologies, data was stored on local storage and cached in memory before training. Topology specs from (ResNet-50). Intel C++ compiler ver , Intel MKL small libraries version Caffe run with numactl -l. 11X inference thoughput improvement with CascadeLake: Future Intel Xeon Scalable processor (codename Cascade Lake) results have been estimated or simulated using internal Intel analysis or architecture simulation or modeling, and provided to you for informational purposes. Any differences in your system hardware, software or configuration may affect your actual performance vs Tested by Intel as of July 11 th 2017: 2S Intel Xeon Platinum GHz (28 cores), HT disabled, turbo disabled, scaling governor set to performance via intel_pstate driver, 384GB DDR ECC RAM. CentOS Linux release (Core), Linux kernel el7.x86_64. SSD: Intel SSD DC S3700 Series (800GB, 2.5in SATA 6Gb/s, 25nm, MLC).Performance measured with: Environment variables: KMP_AFFINITY='granularity=fine, compact, OMP_NUM_THREADS=56, CPU Freq set with cpupower frequency-set -d 2.5G -u 3.8G -g performance. Caffe: ( revision f96b759f71b f690af267158b82b150b5c. Inference measured with caffe time --forward_only command, training measured with caffe time command. For ConvNet topologies, dummy dataset was used. For other topologies, data was stored on local storage and cached in memory before training. Topology specs from (ResNet-50),. Intel C++ compiler ver , Intel MKL small libraries version Caffe run with numactl -l. 35

36

Data-Centric Innovation Summit NAVEEN RAO CORPORATE VICE PRESIDENT & GENERAL MANAGER ARTIFICIAL INTELLIGENCE PRODUCTS GROUP

Data-Centric Innovation Summit NAVEEN RAO CORPORATE VICE PRESIDENT & GENERAL MANAGER ARTIFICIAL INTELLIGENCE PRODUCTS GROUP Data-Centric Innovation Summit NAVEEN RAO CORPORATE VICE PRESIDENT & GENERAL MANAGER ARTIFICIAL INTELLIGENCE PRODUCTS GROUP Data center logic silicon Tam ~30% cagr Ai is exploding $8-10B Emerging as a

More information

Move ǀ store ǀ process

Move ǀ store ǀ process Move ǀ store ǀ process EXECUTIVE VICE PRESIDENT & GENERAL MANAGER DATA CENTER GROUP OVER of the World s data WAS CREATED IN THE LAST LESS THAN Has Been Analyzed PROLIFERATION OF GROWTH OF CLOUDIFICATION

More information

Fast forward. To your <next>

Fast forward. To your <next> Fast forward To your Navin Shenoy EXECUTIVE VICE PRESIDENT GENERAL MANAGER, DATA CENTER GROUP CLOUD ECONOMICS INTELLIGENT DATA PRACTICES NETWORK TRANSFORMATION Intel Xeon Scalable Platform The

More information

April 2 nd, Bob Burroughs Director, HPC Solution Sales

April 2 nd, Bob Burroughs Director, HPC Solution Sales April 2 nd, 2019 Bob Burroughs Director, HPC Solution Sales Today - Introducing 2 nd Generation Intel Xeon Scalable Processors how Intel Speeds HPC performance Work Time System Peak Efficiency Software

More information

Hubert Nueckel Principal Engineer, Intel. Doug Nelson Technical Lead, Intel. September 2017

Hubert Nueckel Principal Engineer, Intel. Doug Nelson Technical Lead, Intel. September 2017 Hubert Nueckel Principal Engineer, Intel Doug Nelson Technical Lead, Intel September 2017 Legal Disclaimer Intel technologies features and benefits depend on system configuration and may require enabled

More information

Ultimate Workstation Performance

Ultimate Workstation Performance Product brief & COMPARISON GUIDE Intel Scalable Processors Intel W Processors Ultimate Workstation Performance Intel Scalable Processors and Intel W Processors for Professional Workstations Optimized to

More information

FAST FORWARD TO YOUR <NEXT> CREATION

FAST FORWARD TO YOUR <NEXT> CREATION FAST FORWARD TO YOUR CREATION THE ULTIMATE PROFESSIONAL WORKSTATIONS POWERED BY INTEL XEON PROCESSORS 7 SEPTEMBER 2017 WHAT S NEW INTRODUCING THE NEW INTEL XEON SCALABLE PROCESSOR BREAKTHROUGH PERFORMANCE

More information

NVMe Over Fabrics: Scaling Up With The Storage Performance Development Kit

NVMe Over Fabrics: Scaling Up With The Storage Performance Development Kit NVMe Over Fabrics: Scaling Up With The Storage Performance Development Kit Ben Walker Data Center Group Intel Corporation 2018 Storage Developer Conference. Intel Corporation. All Rights Reserved. 1 Notices

More information

THE STORAGE PERFORMANCE DEVELOPMENT KIT AND NVME-OF

THE STORAGE PERFORMANCE DEVELOPMENT KIT AND NVME-OF 14th ANNUAL WORKSHOP 2018 THE STORAGE PERFORMANCE DEVELOPMENT KIT AND NVME-OF Paul Luse Intel Corporation Apr 2018 AGENDA Storage Performance Development Kit What is SPDK? The SPDK Community Why are so

More information

Intel optane memory as platform accelerator. Vladimir Knyazkin

Intel optane memory as platform accelerator. Vladimir Knyazkin Intel optane memory as platform accelerator Vladimir Knyazkin 2 Legal Disclaimers Intel technologies features and benefits depend on system configuration and may require enabled hardware, software or service

More information

Copyright 2017 Intel Corporation

Copyright 2017 Intel Corporation Agenda Intel Xeon Scalable Platform Overview Architectural Enhancements 2 Platform Overview 3x16 PCIe* Gen3 2 or 3 Intel UPI 3x16 PCIe Gen3 Capabilities Details 10GbE Skylake-SP CPU OPA DMI Intel C620

More information

Andreas Schneider. Markus Leberecht. Senior Cloud Solution Architect, Intel Deutschland. Distribution Sales Manager, Intel Deutschland

Andreas Schneider. Markus Leberecht. Senior Cloud Solution Architect, Intel Deutschland. Distribution Sales Manager, Intel Deutschland Markus Leberecht Senior Cloud Solution Architect, Intel Deutschland Andreas Schneider Distribution Sales Manager, Intel Deutschland Legal Disclaimers 2016 Intel Corporation. Intel, the Intel logo, Xeon

More information

Intel SSD Data center evolution

Intel SSD Data center evolution Intel SSD Data center evolution March 2018 1 Intel Technology Innovations Fill the Memory and Storage Gap Performance and Capacity for Every Need Intel 3D NAND Technology Lower cost & higher density Intel

More information

Intel Architecture 2S Server Tioga Pass Performance and Power Optimization

Intel Architecture 2S Server Tioga Pass Performance and Power Optimization Intel Architecture 2S Server Tioga Pass Performance and Power Optimization Terry Trausch/Platform Architect/Intel Inc. Whitney Zhao/HW Engineer/Facebook Inc. Agenda Tioga Pass Feature Overview Intel Xeon

More information

SPDK China Summit Ziye Yang. Senior Software Engineer. Network Platforms Group, Intel Corporation

SPDK China Summit Ziye Yang. Senior Software Engineer. Network Platforms Group, Intel Corporation SPDK China Summit 2018 Ziye Yang Senior Software Engineer Network Platforms Group, Intel Corporation Agenda SPDK programming framework Accelerated NVMe-oF via SPDK Conclusion 2 Agenda SPDK programming

More information

Changpeng Liu. Senior Storage Software Engineer. Intel Data Center Group

Changpeng Liu. Senior Storage Software Engineer. Intel Data Center Group Changpeng Liu Senior Storage Software Engineer Intel Data Center Group Legal Notices and Disclaimers Intel technologies features and benefits depend on system configuration and may require enabled hardware,

More information

Fast-track Hybrid IT Transformation with Intel Data Center Blocks for Cloud

Fast-track Hybrid IT Transformation with Intel Data Center Blocks for Cloud Fast-track Hybrid IT Transformation with Intel Data Center Blocks for Cloud Kyle Corrigan, Cloud Product Line Manager, Intel Server Products Group Wagner Diaz, Product Marketing Engineer, Intel Data Center

More information

Data-Centric Innovation Summit ALPER ILKBAHAR VICE PRESIDENT & GENERAL MANAGER MEMORY & STORAGE SOLUTIONS, DATA CENTER GROUP

Data-Centric Innovation Summit ALPER ILKBAHAR VICE PRESIDENT & GENERAL MANAGER MEMORY & STORAGE SOLUTIONS, DATA CENTER GROUP Data-Centric Innovation Summit ALPER ILKBAHAR VICE PRESIDENT & GENERAL MANAGER MEMORY & STORAGE SOLUTIONS, DATA CENTER GROUP tapping data value, real time MOUNTAINS OF UNDERUTILIZED DATA Challenge Shifting

More information

Jim Pappas Director of Technology Initiatives, Intel Vice-Chair, Storage Networking Industry Association (SNIA) December 07, 2018

Jim Pappas Director of Technology Initiatives, Intel Vice-Chair, Storage Networking Industry Association (SNIA) December 07, 2018 Jim Pappas Director of Technology Initiatives, Intel Vice-Chair, Storage Networking Industry Association (SNIA) December 07, 2018 jim@intel.com 1 How did this Effort Start? Memristor MRAM Carbon Nanotube

More information

Accelerating NVMe-oF* for VMs with the Storage Performance Development Kit

Accelerating NVMe-oF* for VMs with the Storage Performance Development Kit Accelerating NVMe-oF* for VMs with the Storage Performance Development Kit Jim Harris Principal Software Engineer Intel Data Center Group Santa Clara, CA August 2017 1 Notices and Disclaimers Intel technologies

More information

Changpeng Liu. Cloud Storage Software Engineer. Intel Data Center Group

Changpeng Liu. Cloud Storage Software Engineer. Intel Data Center Group Changpeng Liu Cloud Storage Software Engineer Intel Data Center Group Notices & Disclaimers Intel technologies features and benefits depend on system configuration and may require enabled hardware, software

More information

Munara Tolubaeva Technical Consulting Engineer. 3D XPoint is a trademark of Intel Corporation in the U.S. and/or other countries.

Munara Tolubaeva Technical Consulting Engineer. 3D XPoint is a trademark of Intel Corporation in the U.S. and/or other countries. Munara Tolubaeva Technical Consulting Engineer 3D XPoint is a trademark of Intel Corporation in the U.S. and/or other countries. notices and disclaimers Intel technologies features and benefits depend

More information

unleashed the future Intel Xeon Scalable Processors for High Performance Computing Alexey Belogortsev Field Application Engineer

unleashed the future Intel Xeon Scalable Processors for High Performance Computing Alexey Belogortsev Field Application Engineer the future unleashed Alexey Belogortsev Field Application Engineer Intel Xeon Scalable Processors for High Performance Computing Growing Challenges in System Architecture The Walls System Bottlenecks Divergent

More information

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme FUT3056BU VMware vsphere Scales on the Amazing Next-Gen Intel Xeon Architecture VMworld 2017 Content: Not for publication Tom Adelmeyer, Richard A. Brunner, Principal Engineer, Intel Principal Engineer,

More information

Accelerating NVMe I/Os in Virtual Machine via SPDK vhost* Solution Ziye Yang, Changpeng Liu Senior software Engineer Intel

Accelerating NVMe I/Os in Virtual Machine via SPDK vhost* Solution Ziye Yang, Changpeng Liu Senior software Engineer Intel Accelerating NVMe I/Os in Virtual Machine via SPDK vhost* Solution Ziye Yang, Changpeng Liu Senior software Engineer Intel @optimistyzy Notices & Disclaimers Intel technologies features and benefits depend

More information

Achieving 2.5X 1 Higher Performance for the Taboola TensorFlow* Serving Application through Targeted Software Optimization

Achieving 2.5X 1 Higher Performance for the Taboola TensorFlow* Serving Application through Targeted Software Optimization white paper Internet Discovery Artificial Intelligence (AI) Achieving.X Higher Performance for the Taboola TensorFlow* Serving Application through Targeted Software Optimization As one of the world s preeminent

More information

Re- I m a g i n i n g D a t a C e n t e r S t o r a g e a n d M e m o r y

Re- I m a g i n i n g D a t a C e n t e r S t o r a g e a n d M e m o r y Intel Innovations Re- I m a g i n i n g D a t a C e n t e r S t o r a g e a n d M e m o r y March 2018 Greg Matson, Director of SSD Strategic Planning and Product Marketing Legal Disclaimer Intel may make

More information

Scott Oaks, Oracle Sunil Raghavan, Intel Daniel Verkamp, Intel 03-Oct :45 p.m. - 4:30 p.m. Moscone West - Room 3020

Scott Oaks, Oracle Sunil Raghavan, Intel Daniel Verkamp, Intel 03-Oct :45 p.m. - 4:30 p.m. Moscone West - Room 3020 Scott Oaks, Oracle Sunil Raghavan, Intel Daniel Verkamp, Intel 03-Oct-2017 3:45 p.m. - 4:30 p.m. Moscone West - Room 3020 Big Data Talk Exploring New SSD Usage Models to Accelerate Cloud Performance 03-Oct-2017,

More information

Jim Harris. Principal Software Engineer. Intel Data Center Group

Jim Harris. Principal Software Engineer. Intel Data Center Group Jim Harris Principal Software Engineer Intel Data Center Group Notices & Disclaimers Intel technologies features and benefits depend on system configuration and may require enabled hardware, software or

More information

LS-DYNA Performance on Intel Scalable Solutions

LS-DYNA Performance on Intel Scalable Solutions LS-DYNA Performance on Intel Scalable Solutions Nick Meng, Michael Strassmaier, James Erwin, Intel nick.meng@intel.com, michael.j.strassmaier@intel.com, james.erwin@intel.com Jason Wang, LSTC jason@lstc.com

More information

A New Key-value Data Store For Heterogeneous Storage Architecture Intel APAC R&D Ltd.

A New Key-value Data Store For Heterogeneous Storage Architecture Intel APAC R&D Ltd. A New Key-value Data Store For Heterogeneous Storage Architecture Intel APAC R&D Ltd. 1 Agenda Introduction Background and Motivation Hybrid Key-Value Data Store Architecture Overview Design details Performance

More information

High-performance Deep learning at scale with Intel architecture

High-performance Deep learning at scale with Intel architecture High-performance Deep learning at scale with Intel architecture 2018 HPC Swiss Conference Lugano, Switzerland April 9 th, 2018 Claire Bild, Intel Legal Notices & disclaimers INFORMATION IN THIS DOCUMENT

More information

Ceph BlueStore Performance on Latest Intel Server Platforms. Orlando Moreno Performance Engineer, Intel Corporation May 10, 2018

Ceph BlueStore Performance on Latest Intel Server Platforms. Orlando Moreno Performance Engineer, Intel Corporation May 10, 2018 Ceph BlueStore Performance on Latest Intel Server Platforms Orlando Moreno Performance Engineer, Intel Corporation May 10, 2018 Legal Disclaimers 2017 Intel Corporation. Intel, the Intel logo, Xeon and

More information

OPENSHMEM AND OFI: BETTER TOGETHER

OPENSHMEM AND OFI: BETTER TOGETHER 4th ANNUAL WORKSHOP 208 OPENSHMEM AND OFI: BETTER TOGETHER James Dinan, David Ozog, and Kayla Seager Intel Corporation [ April, 208 ] NOTICES AND DISCLAIMERS Intel technologies features and benefits depend

More information

EPYC VIDEO CUG 2018 MAY 2018

EPYC VIDEO CUG 2018 MAY 2018 AMD UPDATE CUG 2018 EPYC VIDEO CRAY AND AMD PAST SUCCESS IN HPC AMD IN TOP500 LIST 2002 TO 2011 2011 - AMD IN FASTEST MACHINES IN 11 COUNTRIES ZEN A FRESH APPROACH Designed from the Ground up for Optimal

More information

INTEL HPC DEVELOPER CONFERENCE FUEL YOUR INSIGHT

INTEL HPC DEVELOPER CONFERENCE FUEL YOUR INSIGHT INTEL HPC DEVELOPER CONFERENCE FUEL YOUR INSIGHT INTEL HPC DEVELOPER CONFERENCE FUEL YOUR INSIGHT UPDATE ON OPENSWR: A SCALABLE HIGH- PERFORMANCE SOFTWARE RASTERIZER FOR SCIVIS Jefferson Amstutz Intel

More information

Välkommen. Intel Anders Huge

Välkommen. Intel Anders Huge Välkommen Intel Anders Huge Transformative Technology from Intel A n d e r s H u g e I n t e l Why intel INTEL CORPORATION 5 TRANSFORMING BUSINESS MODERN BUSINESS DEMANDS Intel VISION Accelerate workplace

More information

Data and Intelligence in Storage Carol Wilder Intel Corporation

Data and Intelligence in Storage Carol Wilder Intel Corporation Data and Intelligence in Storage Carol Wilder carol.a.wilder@intel.com Intel Corporation 1 Legal Notices/Disclaimer Intel technologies features and benefits depend on system configuration and may require

More information

Daniel Verkamp, Software Engineer

Daniel Verkamp, Software Engineer Daniel Verkamp, Software Engineer Notices and Disclaimers Intel technologies features and benefits depend on system configuration and may require enabled hardware, software or service activation. Learn

More information

Essential Performance and Advanced Security

Essential Performance and Advanced Security Product brief Intel Xeon E-2100 Processor Essential Performance and Advanced Security for Entry Server, Secure Cloud, and Entry Workstation Solutions Performance and Security, Intelligently Designed for

More information

H.J. Lu, Sunil K Pandey. Intel. November, 2018

H.J. Lu, Sunil K Pandey. Intel. November, 2018 H.J. Lu, Sunil K Pandey Intel November, 2018 Issues with Run-time Library on IA Memory, string and math functions in today s glibc are optimized for today s Intel processors: AVX/AVX2/AVX512 FMA It takes

More information

Building an Open Memory-Centric Computing Architecture using Intel Optane Frank Ober Efstathios Efstathiou Oracle Open World 2017 October 3, 2017

Building an Open Memory-Centric Computing Architecture using Intel Optane Frank Ober Efstathios Efstathiou Oracle Open World 2017 October 3, 2017 Building an Memory-Centric Computing Architecture using Intel Optane Frank Ober Efstathios Efstathiou Oracle World 2017 October 3, 2017 Agenda The legal stuff Why Memory Centric Computing? Overview of

More information

Accelerate Deep Learning Inference with openvino toolkit

Accelerate Deep Learning Inference with openvino toolkit Accelerate Deep Learning Inference with openvino toolkit Priyanka Bagade, IoT Developer Evangelist, Intel Core and Visual Computing Group Optimization Notice Intel s compilers may or may not optimize to

More information

Storage and Memory Hierarchy in HPC: New Paradigm and New Solutions with Intel Dr. Jean-Laurent Philippe

Storage and Memory Hierarchy in HPC: New Paradigm and New Solutions with Intel Dr. Jean-Laurent Philippe Storage and Memory Hierarchy in HPC: New Paradigm and New Solutions with Intel Dr. Jean-Laurent Philippe Senior EMEA HPC Technical Specialist Intel Data Center Group Legal Disclaimer Intel may make changes

More information

Accelerating Data Center Workloads with FPGAs

Accelerating Data Center Workloads with FPGAs Accelerating Data Center Workloads with FPGAs Enno Lübbers NorCAS 2017, Linköping, Sweden Intel technologies features and benefits depend on system configuration and may require enabled hardware, software

More information

Andrzej Jakowski, Armoun Forghan. Apr 2017 Santa Clara, CA

Andrzej Jakowski, Armoun Forghan. Apr 2017 Santa Clara, CA Andrzej Jakowski, Armoun Forghan Apr 2017 Santa Clara, CA Legal Disclaimers Intel technologies features and benefits depend on system configuration and may require enabled hardware, software or service

More information

Intel Speed Select Technology Base Frequency - Enhancing Performance

Intel Speed Select Technology Base Frequency - Enhancing Performance Intel Speed Select Technology Base Frequency - Enhancing Performance Application Note April 2019 Document Number: 338928-001 You may not use or facilitate the use of this document in connection with any

More information

Colin Cunningham, Intel Kumaran Siva, Intel Sandeep Mahajan, Oracle 03-Oct :45 p.m. - 5:30 p.m. Moscone West - Room 3020

Colin Cunningham, Intel Kumaran Siva, Intel Sandeep Mahajan, Oracle 03-Oct :45 p.m. - 5:30 p.m. Moscone West - Room 3020 Colin Cunningham, Intel Kumaran Siva, Intel Sandeep Mahajan, Oracle 03-Oct-2017 4:45 p.m. - 5:30 p.m. Moscone West - Room 3020 Big Data Talk Exploring New SSD Usage Models to Accelerate Cloud Performance

More information

S8765 Performance Optimization for Deep- Learning on the Latest POWER Systems

S8765 Performance Optimization for Deep- Learning on the Latest POWER Systems S8765 Performance Optimization for Deep- Learning on the Latest POWER Systems Khoa Huynh Senior Technical Staff Member (STSM), IBM Jonathan Samn Software Engineer, IBM Evolving from compute systems to

More information

ISA-L Performance Report Release Test Date: Sept 29 th 2017

ISA-L Performance Report Release Test Date: Sept 29 th 2017 Test Date: Sept 29 th 2017 Revision History Date Revision Comment Sept 29 th, 2017 1.0 Initial document for release 2 Contents Audience and Purpose... 4 Test setup:... 4 Intel Xeon Platinum 8180 Processor

More information

Intel s Architecture for NFV

Intel s Architecture for NFV Intel s Architecture for NFV Evolution from specialized technology to mainstream programming Net Futures 2015 Network applications Legal Disclaimer INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION

More information

IFS RAPS14 benchmark on 2 nd generation Intel Xeon Phi processor

IFS RAPS14 benchmark on 2 nd generation Intel Xeon Phi processor IFS RAPS14 benchmark on 2 nd generation Intel Xeon Phi processor D.Sc. Mikko Byckling 17th Workshop on High Performance Computing in Meteorology October 24 th 2016, Reading, UK Legal Disclaimer & Optimization

More information

Re-Architecting Cloud Storage with Intel 3D XPoint Technology and Intel 3D NAND SSDs

Re-Architecting Cloud Storage with Intel 3D XPoint Technology and Intel 3D NAND SSDs Re-Architecting Cloud Storage with Intel 3D XPoint Technology and Intel 3D NAND SSDs Jack Zhang yuan.zhang@intel.com, Cloud & Enterprise Storage Architect Santa Clara, CA 1 Agenda Memory Storage Hierarchy

More information

Accelerating HPC. (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing

Accelerating HPC. (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing Accelerating HPC (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing SAAHPC, Knoxville, July 13, 2010 Legal Disclaimer Intel may make changes to specifications and product

More information

Bei Wang, Dmitry Prohorov and Carlos Rosales

Bei Wang, Dmitry Prohorov and Carlos Rosales Bei Wang, Dmitry Prohorov and Carlos Rosales Aspects of Application Performance What are the Aspects of Performance Intel Hardware Features Omni-Path Architecture MCDRAM 3D XPoint Many-core Xeon Phi AVX-512

More information

POWER YOUR CREATIVITY WITH THE INTEL CORE X-SERIES PROCESSOR FAMILY

POWER YOUR CREATIVITY WITH THE INTEL CORE X-SERIES PROCESSOR FAMILY Product Brief POWER YOUR CREATIVITY WITH THE INTEL CORE X-SERIES PROCESSOR FAMILY The Ultimate Creator PC Platform Made to create, the latest X-series processor family is powered by up to 18 cores and

More information

A New Key-Value Data Store For Heterogeneous Storage Architecture

A New Key-Value Data Store For Heterogeneous Storage Architecture A New Key-Value Data Store For Heterogeneous Storage Architecture brien.porter@intel.com wanyuan.yang@intel.com yuan.zhou@intel.com jian.zhang@intel.com Intel APAC R&D Ltd. 1 Agenda Introduction Background

More information

Small File I/O Performance in Lustre. Mikhail Pershin, Joe Gmitter Intel HPDD April 2018

Small File I/O Performance in Lustre. Mikhail Pershin, Joe Gmitter Intel HPDD April 2018 Small File I/O Performance in Lustre Mikhail Pershin, Joe Gmitter Intel HPDD April 2018 Overview Small File I/O Concerns Data on MDT (DoM) Feature Overview DoM Use Cases DoM Performance Results Small File

More information

Srinivas Chennupaty. Intel Corporation, 2018

Srinivas Chennupaty. Intel Corporation, 2018 Srinivas Chennupaty Intel Corporation, 2018 Notices and Disclaimers This document contains information on products, services and/or processes in development. All information provided here is subject to

More information

Hardware and Software Co-Optimization for Best Cloud Experience

Hardware and Software Co-Optimization for Best Cloud Experience Hardware and Software Co-Optimization for Best Cloud Experience Khun Ban (Intel), Troy Wallins (Intel) October 25-29 2015 1 Cloud Computing Growth Connected Devices + Apps + New Services + New Service

More information

Mohan J. Kumar Intel Fellow Intel Corporation

Mohan J. Kumar Intel Fellow Intel Corporation OCP Initiatives and Intel Implementations Mohan J. Kumar Intel Fellow Intel Corporation Agenda Open Firmware Firmware at Scale Platform Attestation Summary Open Firmware UEFI-based Open Firmware (for Intel-based

More information

Jim Harris. Principal Software Engineer. Data Center Group

Jim Harris. Principal Software Engineer. Data Center Group Jim Harris Principal Software Engineer Data Center Group Notices and Disclaimers Intel technologies features and benefits depend on system configuration and may require enabled hardware, software or service

More information

DPDK Intel NIC Performance Report Release 18.02

DPDK Intel NIC Performance Report Release 18.02 DPDK Intel NIC Performance Report Test Date: Mar 14th 2018 Author: Intel DPDK Validation team Revision History Date Revision Comment Mar 15th, 2018 1.0 Initial document for release 2 Contents Audience

More information

DPDK Performance Report Release Test Date: Nov 16 th 2016

DPDK Performance Report Release Test Date: Nov 16 th 2016 Test Date: Nov 16 th 2016 Revision History Date Revision Comment Nov 16 th, 2016 1.0 Initial document for release 2 Contents Audience and Purpose... 4 Test setup:... 4 Intel Xeon Processor E5-2699 v4 (55M

More information

What s P. Thierry

What s P. Thierry What s new@intel P. Thierry Principal Engineer, Intel Corp philippe.thierry@intel.com CPU trend Memory update Software Characterization in 30 mn 10 000 feet view CPU : Range of few TF/s and

More information

DPDK Intel Cryptodev Performance Report Release 18.08

DPDK Intel Cryptodev Performance Report Release 18.08 DPDK Intel Cryptodev Performance Report Test Date: August 7th 2018 Author: Intel DPDK Validation team Revision History Date Revision Comment August 7th, 2018 1.0 Initial document for release 2 Contents

More information

DEVITO AUTOMATED HIGH-PERFORMANCE FINITE DIFFERENCES FOR GEOPHYSICAL EXPLORATION

DEVITO AUTOMATED HIGH-PERFORMANCE FINITE DIFFERENCES FOR GEOPHYSICAL EXPLORATION DEVITO AUTOMATED HIGH-PERFORMANCE FINITE DIFFERENCES FOR GEOPHYSICAL EXPLORATION F. Luporini 1, C. Yount 4, M. Louboutin 3, N. Kukreja 1, P. Witte 2, M. Lange 5, P. Kelly 1, F. Herrmann 3, G.Gorman 1 1Imperial

More information

HPC Advisory COUNCIL

HPC Advisory COUNCIL Inside AI HPC Advisory COUNCIL Lugano 2017 Scalable Systems for Distributed Deep Learning Benchmarking, Performance Optimization and Architectures Gaurav kaul SYSTEMS ARCHITECT, INTEL DATACENTRE GROUP

More information

Using CNN Across Intel Architecture

Using CNN Across Intel Architecture white paper Artificial Intelligence Object Classification Intel AI Builders Object Classification Using CNN Across Intel Architecture Table of Contents Abstract...1 1. Introduction...1 2. Setting up a

More information

Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Intel Xeon Processor E7 v2 Family-Based Platforms

Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Intel Xeon Processor E7 v2 Family-Based Platforms Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Family-Based Platforms Executive Summary Complex simulations of structural and systems performance, such as car crash simulations,

More information

Enabling the future of Artificial intelligence

Enabling the future of Artificial intelligence Enabling the future of Artificial intelligence Contents AI Overview Intel Nervana AI products Hardware Software Intel Nervana Deep Learning Platform Learn more - Intel Nervana AI Academy Artificial Intelligence,

More information

A U G U S T 8, S A N T A C L A R A, C A

A U G U S T 8, S A N T A C L A R A, C A A U G U S T 8, 2 0 1 8 S A N T A C L A R A, C A Data-Centric Innovation Summit LISA SPELMAN VICE PRESIDENT & GENERAL MANAGER INTEL XEON PRODUCTS AND DATA CENTER MARKETING Increased integration and optimization

More information

IN-PERSISTENT-MEMORY COMPUTING WITH JAVA ERIC KACZMAREK INTEL CORPORATION

IN-PERSISTENT-MEMORY COMPUTING WITH JAVA ERIC KACZMAREK INTEL CORPORATION IN-PERSISTENT-MEMORY COMPUTING WITH JAVA ERIC KACZMAREK INTEL CORPORATION LEGAL DISCLAIMER & OPTIMIZATION NOTICE INFORMATION IN THIS DOCUMENT IS PROVIDED AS IS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL

More information

Evaluation of AMD EPYC

Evaluation of AMD EPYC Evaluation of AMD Chris Hollowell HEPiX Fall 2018, PIC Spain What is? is a new line of x86_64 server CPUs from AMD based on their Zen microarchitecture Same microarchitecture used in

More information

Ziye Yang. NPG, DCG, Intel

Ziye Yang. NPG, DCG, Intel Ziye Yang NPG, DCG, Intel Agenda What is SPDK? Accelerated NVMe-oF via SPDK Conclusion 2 Agenda What is SPDK? Accelerated NVMe-oF via SPDK Conclusion 3 Storage Performance Development Kit Scalable and

More information

THE IN-PLACE WORKING STORAGE TIER OPPORTUNITIES FOR SOFTWARE INNOVATORS KEN GIBSON, INTEL, DIRECTOR MEMORY SW ARCHITECTURE

THE IN-PLACE WORKING STORAGE TIER OPPORTUNITIES FOR SOFTWARE INNOVATORS KEN GIBSON, INTEL, DIRECTOR MEMORY SW ARCHITECTURE THE IN-PLACE WORKING STORAGE TIER OPPORTUNITIES FOR SOFTWARE INNOVATORS KEN GIBSON, INTEL, DIRECTOR MEMORY SW ARCHITECTURE I/O LATENCY WILL SOON EXCEED MEDIA LATENCY 30 NVM Tread 25 NVM xfer Controller

More information

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme FUT3040BU Storage at Memory Speed: Finally, Nonvolatile Memory Is Here Rajesh Venkatasubramanian, VMware, Inc Richard A Brunner, VMware, Inc #VMworld #FUT3040BU Disclaimer This presentation may contain

More information

Accelerating Microsoft SQL Server Performance With NVDIMM-N on Dell EMC PowerEdge R740

Accelerating Microsoft SQL Server Performance With NVDIMM-N on Dell EMC PowerEdge R740 Accelerating Microsoft SQL Server Performance With NVDIMM-N on Dell EMC PowerEdge R740 A performance study with NVDIMM-N Dell EMC Engineering September 2017 A Dell EMC document category Revisions Date

More information

Ravindra Babu Ganapathi

Ravindra Babu Ganapathi 14 th ANNUAL WORKSHOP 2018 INTEL OMNI-PATH ARCHITECTURE AND NVIDIA GPU SUPPORT Ravindra Babu Ganapathi Intel Corporation [ April, 2018 ] Intel MPI Open MPI MVAPICH2 IBM Platform MPI SHMEM Intel MPI Open

More information

DPDK Intel NIC Performance Report Release 18.05

DPDK Intel NIC Performance Report Release 18.05 DPDK Intel NIC Performance Report Test Date: Jun 1th 2018 Author: Intel DPDK Validation team Revision History Date Revision Comment Jun 4th, 2018 1.0 Initial document for release 2 Contents Audience and

More information

Accelerate Finger Printing in Data Deduplication Xiaodong Liu & Qihua Dai Intel Corporation

Accelerate Finger Printing in Data Deduplication Xiaodong Liu & Qihua Dai Intel Corporation Accelerate Finger Printing in Data Deduplication Xiaodong Liu & Qihua Dai Intel Corporation Legal Disclaimer INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO LICENSE, EXPRESS

More information

March NVM Solutions Group

March NVM Solutions Group March 2017 NVM Solutions Group Ideally one would desire an indefinitely large memory capacity such that any particular word would be immediately available. It does not seem possible physically to achieve

More information

WITH INTEL TECHNOLOGIES

WITH INTEL TECHNOLOGIES WITH INTEL TECHNOLOGIES Commitment Is to Enable The Best Democratize technologies Advance solutions Unleash innovations Intel Xeon Scalable Processor Family Delivers Ideal Enterprise Solutions NEW Intel

More information

Outline. Motivation Parallel k-means Clustering Intel Computing Architectures Baseline Performance Performance Optimizations Future Trends

Outline. Motivation Parallel k-means Clustering Intel Computing Architectures Baseline Performance Performance Optimizations Future Trends Collaborators: Richard T. Mills, Argonne National Laboratory Sarat Sreepathi, Oak Ridge National Laboratory Forrest M. Hoffman, Oak Ridge National Laboratory Jitendra Kumar, Oak Ridge National Laboratory

More information

Data center: The center of possibility

Data center: The center of possibility Data center: The center of possibility Diane bryant Executive vice president & general manager Data center group, intel corporation Data center: The center of possibility The future is Thousands of Clouds

More information

DPDK Intel NIC Performance Report Release 17.08

DPDK Intel NIC Performance Report Release 17.08 DPDK Intel NIC Performance Report Test Date: Aug 23th 2017 Author: Intel DPDK Validation team Revision History Date Revision Comment Aug 24th, 2017 1.0 Initial document for release 2 Contents Audience

More information

Capabilities and System Benefits Enabled by NVDIMM-N

Capabilities and System Benefits Enabled by NVDIMM-N Capabilities and System Benefits Enabled by NVDIMM-N Bob Frey Arthur Sainio SMART Modular Technologies August 7, 2018 Santa Clara, CA 1 NVDIMM-N Maturity and Evolution If there's one takeaway you should

More information

Deep learning prevalence. first neuroscience department. Spiking Neuron Operant conditioning First 1 Billion transistor processor

Deep learning prevalence. first neuroscience department. Spiking Neuron Operant conditioning First 1 Billion transistor processor WELCOME TO Operant conditioning 1938 Spiking Neuron 1952 first neuroscience department 1964 Deep learning prevalence mid 2000s The Turing Machine 1936 Transistor 1947 First computer science department

More information

Extremely Fast Distributed Storage for Cloud Service Providers

Extremely Fast Distributed Storage for Cloud Service Providers Solution brief Intel Storage Builders StorPool Storage Intel SSD DC S3510 Series Intel Xeon Processor E3 and E5 Families Intel Ethernet Converged Network Adapter X710 Family Extremely Fast Distributed

More information

Future of datacenter STORAGE. Carol Wilder, Niels Reimers,

Future of datacenter STORAGE. Carol Wilder, Niels Reimers, Future of datacenter STORAGE Carol Wilder, carol.a.wilder@intel.com Niels Reimers, niels.reimers@intel.com Legal Notices/disclaimer Intel technologies features and benefits depend on system configuration

More information

The Transition to PCI Express* for Client SSDs

The Transition to PCI Express* for Client SSDs The Transition to PCI Express* for Client SSDs Amber Huffman Senior Principal Engineer Intel Santa Clara, CA 1 *Other names and brands may be claimed as the property of others. Legal Notices and Disclaimers

More information

Graphics Performance Analyzer for Android

Graphics Performance Analyzer for Android Graphics Performance Analyzer for Android 1 What you will learn from this slide deck Detailed optimization workflow of Graphics Performance Analyzer Android* System Analysis Only Please see subsequent

More information

Intel HPC Technologies Outlook

Intel HPC Technologies Outlook Intel HPC Technologies Outlook Andrey Semin Principal Engineer, HPC Technology Manager, EMEA October 19 th, 2015 ZKI Tagung AK Supercomputing Munich, Germany Legal Disclaimers INFORMATION IN THIS DOCUMENT

More information

Intel tools for High Performance Python 데이터분석및기타기능을위한고성능 Python

Intel tools for High Performance Python 데이터분석및기타기능을위한고성능 Python Intel tools for High Performance Python 데이터분석및기타기능을위한고성능 Python Python Landscape Adoption of Python continues to grow among domain specialists and developers for its productivity benefits Challenge#1:

More information

DPDK Vhost/Virtio Performance Report Release 18.11

DPDK Vhost/Virtio Performance Report Release 18.11 DPDK Vhost/Virtio Performance Report Test Date: December 3st 2018 Author: Intel DPDK Validation Team Revision History Date Revision Comment December 3st, 2018 1.0 Initial document for release 2 Contents

More information

Intel Xeon Scalable Processor for HPC 나승구이사

Intel Xeon Scalable Processor for HPC 나승구이사 Intel Xeon Scalable Processor for HPC 나승구이사 Growing Challenges in HPC System Bottlenecks The Walls Divergent Workloads Machine learning hpc Big Data visualization Barriers to Extending Usage Optimizing

More information

Accelerate Machine Learning on macos with Intel Integrated Graphics. Hisham Chowdhury May 23, 2018

Accelerate Machine Learning on macos with Intel Integrated Graphics. Hisham Chowdhury May 23, 2018 Accelerate Machine Learning on macos with Intel Integrated Graphics Hisham Chowdhury May 23, 2018 Apple Machine Learning Stack Machine Learning Application 1 Machine Learning Application 2 Vision Natural

More information

Engineers can be significantly more productive when ANSYS Mechanical runs on CPUs with a high core count. Executive Summary

Engineers can be significantly more productive when ANSYS Mechanical runs on CPUs with a high core count. Executive Summary white paper Computer-Aided Engineering ANSYS Mechanical on Intel Xeon Processors Engineer Productivity Boosted by Higher-Core CPUs Engineers can be significantly more productive when ANSYS Mechanical runs

More information

Are You Insured Against Your Noisy Neighbor Sunku Ranganath, Intel Corporation Sridhar Rao, Spirent Communications

Are You Insured Against Your Noisy Neighbor Sunku Ranganath, Intel Corporation Sridhar Rao, Spirent Communications Are You Insured Against Your Noisy Neighbor Sunku Ranganath, Intel Corporation Sridhar Rao, Spirent Communications @SunkuRanganath, @ngignir Legal Disclaimer 2018 Intel Corporation. Intel, the Intel logo,

More information

Performance and Energy Efficiency of the 14 th Generation Dell PowerEdge Servers

Performance and Energy Efficiency of the 14 th Generation Dell PowerEdge Servers Performance and Energy Efficiency of the 14 th Generation Dell PowerEdge Servers This white paper details the performance improvements of Dell PowerEdge servers with the Intel Xeon Processor Scalable CPU

More information