Mike Greenfield, Intel MultiCore 7 Workshop September 27 and 28, 2017 National Center for Atmospheric Research in Boulder, Colorado

Size: px
Start display at page:

Download "Mike Greenfield, Intel MultiCore 7 Workshop September 27 and 28, 2017 National Center for Atmospheric Research in Boulder, Colorado"

Transcription

1 Mike Greenfield, Intel Multi 7 Workshop September 27 and 28, 2017 National Center for Atmospheric Research in ulder, Colorado *

2 Legal Disclaimers Intel technologies may require enabled hardware, specific software, or services activation. Performance varies depending on system configuration. Check with your system manufacturer or retailer. For more complete information about performance and benchmark results, visit Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more complete information visit Cost reduction scenarios described are intended as examples of how a given Intel- based product, in the specified circumstances and configurations, may affect future costs and provide cost savings. Circumstances will vary. Intel does not guarantee any costs or cost reduction. For more information go to All information provided here is subject to change without notice. Contact your Intel representative to obtain the latest Intel product specifications and roadmaps. No computer system can be absolutely secure. Statements in this document that refer to Intel s plans and expectations for the quarter, the year, and the future, are forward-looking statements that involve a number of risks and uncertainties. A detailed discussion of the factors that could affect Intel s results and plans is included in Intel s SEC filings, including the annual report on Form 10-K. Intel, the Intel logo, Xeon, Intel vpro, Intel Xeon Phi, Look Inside., are trademarks of Intel Corporation in the U.S. and/or other countries. *Other names and brands may be claimed as the property of others. Microsoft, Windows, and the Windows logo are trademarks, or registered trademarks of Microsoft Corporation in the United States and/or other countries Intel Corporation. 2

3 Optimization Notice Optimization Notice: Intel's compilers may or may not optimize to the same degree for non-intel microprocessors for optimizations that are not unique to Intel microprocessors. These optimizations include SSE2, SSE3, and SSSE3 instruction sets and other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors. Certain optimizations not specific to Intel microarchitecture are reserved for Intel microprocessors. Please refer to the applicable product User and Reference Guides for more information regarding the specific instruction sets covered by this notice. Notice revision #

4 Key Points for today s session Earth System Models/Centers exploiting Intel Architecture today Intel Roadmap Update: KNL, The Intel Xeon Scalable Processor family, AVX-512, SSF and 3D XPoint Selected notes on E2E Workload Optimization and Performance Portability Advancing the performance of Earth System Models on systems based upon Intel Architecture 4

5 Earth System Model Innovation on Intel Architecture Hetero IO Optimizations 84 Climate NERSC 3/20 NESAP ESM projects ESM exploited 8% CORI Source: R Gerber, ICAS a huge portion of contemporary Operational Forecast and Climate Research Centers 5

6 Intel Scalable System Framework A Holistic Solution for All HPC Needs Compute Fabric Memory / Storage Software Small Clusters Through Supercomputers Compute and Data-Centric Computing Standards-Based Programmability On-Premise and Cloud-Based Intel Xeon Processors Intel Xeon Phi Processors Intel FPGAs and Server Solutions Intel Solutions for Lustre* Intel Optane Technology 3D XPoint Technology Intel SSDs Intel Omni-Path Architecture Intel Silicon Photonics Intel Ethernet Intel HPC Orchestrator Intel Software Tools Intel Cluster Ready Program Intel Supported SDVis 31

7 How It Works Industry-Leading Compute Intel Xeon Processors Intel Xeon Phi Processors Intel FPGAs Intel Omni-Path Architecture Intel Silicon Photonics Fast, Cost-Effective Data Movement Innovative Technologies Intel Ethernet Compute Fabric *Other names and brands may be claimed as the property of others. Memory / Storage Software Intel SSDs Intel SW Defined Visualization Fast, Reliable Access to Data 3D XPoint Technology Intel Optane Technology Intel Solutions for Lustre* software Intel HPC Orchestrator Intel Software Tools Intel Cluster Ready Program Ease of Deployment and Management Tight Integration and Co-Design Reference Architecture Benefits Compatibility Bandwidth Density Latency Power Cost 3

8 Intel Xeon Phi Processor Architecture Self-ot Processor Binary-compatibility with Xeon, 3+ TFLOPS 1 (DP) On-package memory 16GB, up to 490 GB/s STREAM TRIAD Platform Memory Up to 384GB (6ch DDR MHz) Other Key Features TILE: (up to 36) 2VPU 2D Mesh Architecture Out-of-Order s 3X Single-Thread vs. KNC Intel AVX-512 Instructions Scatter/Gather Engine Integrated Fabric - OPA HUB 1MB L2 2VPU Enhanced Intel Atom cores based on Silvermont Microarchitecture DDR4 x4 DMI2 to PCH 36 Lanes PCIe* Gen3 (x16, x16, x4) MCDRAM MCDRAM MCDRAM MCDRAM Processor Package DDR4 Tile EDC (Embedded DRAM Controller) IMC (Integrated Memory Controller) IIO (Integrated I/O Controller) 1 Theoretical peak performance 8

9 Delivering Performance for Deep Learning Workloads ACCELERATE Hardware Capabilities Optimize Deep Learning Software Align Developer Ecosystem Available Now Start training models today using Intel Xeon Phi Available Late 2017 Intel Xeon Phi Processor Knights Mill Up to 4x performance over current processor for Deep Learning workloads* Directly Optimized Frameworks Optimizing these frameworks on Intel Xeon & Xeon Phi processors Libraries/Languages Intel MKL MKL-DNN Nervana Graph Tuned for Intel processors Current & next generation Training Tools Community Benefit from expert-led trainings, hands-on workshops, exclusive remote access, and more! Gain access to the latest libraries, frameworks, tools, and technologies from Intel to accelerate you AI project Collaborate with industry luminaries, developers, students, and Intel engineers Delivering hardware optimized for deep learning Optimized via framework & library enhancements Ensure intel solutions are easy to use and readily available *Intel Xeon Phi processor Knights Mill up to 4x estimated performance improvement over Intel Xeon Phi processor 7290 BASELINE: Intel Xeon Phi Processor 7290 (16GB, 1.50 GHz, 72 core) with 192 GB Total Memory on Red Hat Enterprise Linux* 6.7 kernel using MKL 11.3 Update 4, Relative performance 1.0 Knights Mill: Results have been estimated or simulated using internal Intel analysis or architecture simulation or modeling, and provided to you for informational purposes. Any differences in your system hardware, software or configuration may affect your actual performance. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your 9 contemplated purchases, including the performance of that product when combined with other products. For more information go to

10 2-socket+ Intel Xeon Roadmap Thurley Platform Romley Platform Grantley Platform Purley Platform Intel Microarchitecture Codenamed Nehalem Intel Microarchitecture Codenamed Sandy Bridge Intel Microarchitecture Codenamed Haswell Intel Microarchitecture Codenamed Skylake Nehalem Westmere Sandy Bridge Ivy Bridge Haswell Broadwell Skylake Cascade Lake 45nm 32nm 32nm 22nm 22nm 14nm 14nm New Microarchitecture New Microarchitecture New Microarchitecture New Microarchitecture Brickland Platform is Ivy Bridge-EX, Haswell-EX, and Broadwell-EX Skylake microarchitecture delivers ~10% (geomean) IPC improvement v. Broadwell 10

11 INTEL XEON SCALABLE processors The Foundation for Agile, Secure, Workload-Optimized Hybrid Cloud UP TO UP TO 2, 4 & DDR4 M 2666 HZ WITH UP TO 28 CORES 8 SOCKET SUPPORT 1.5 TB WITH UP TO TOPLINE MEMORY CHANNEL BANDWIDTH HIGHEST ACCELERATOR THROUGHPUT 3 UPI LINKS MAINSTREAM UP TO 22 CORES 2 & 4 UP TO 3 SOCKET SUPPORT UPI LINKS RELIABILITY, AVAILABILITY ADVANCED AND SERVICEABILITY Good SCALABLE PERFORMANCE AT LOW POWER STANDARD RAS MODERATE TASKS INTEL TURBO BOOST TECHNOLOGY AND INTEL HYPER-THREADING TECHNOLOGY FOR MODERATE WORKLOADS Efficient ENTRY SCALABLE PERFORMANCE HARDWARE-ENHANCED SECURITY STANDARD RAS Light TASKS ENTRY PERFORMANCE, Price Sensitive FOR LIGHT WORKLOADS ENTRY 11

12 Typical 2-socket configuration Intel Xeon E5 v4 (2016) Intel Xeon Purley Scalable (2017) (2017) CPU Intel QPI CPU CPU Intel UPI CPU PCIe* x4 x8 x4 DMI 2 x8 DMI LBG ** 3x16 PCIe* 1x100G Intel OP Fabric x4 3x16 PCIe* 1x100G Intel OP Fabric Four DDR4 memory channels up to 24 DIMMs Up to 80 PCIe lanes Two QPI links (up to 9.6 GT/s) DDR4 DIMMs PCIe* uplink connection for Intel QuickAssist Technology and Intel Ethernet ** Six DDR4 memory channels up to 24 DIMMs Up to 96 PCIe lanes Two UPI links (up to 10.4 GT/s); up to 3 UPI links in 4S and 8S configurations Integrated Intel Omni-Path Architecture (Fabric) 12

13 /QPII /QPII /QPII /QPII /QPII /QPII /QPII /QPII /QPII /QPII /QPII /QPII New Mesh Interconnect Architecture Intel Xeon Processor E7 family (24-core die) Intel Xeon Scalable Processor (28-core die) QPI QPI Link Link R3QPI QPI Agent PCI-E PCI-E PCI-E PCI-E X16 X16 X8 X4 (ESI) Ux PCU CB DMA R2PCI IOAPIC IIO 2x UPI x 20 PCIe* * x16 PCIe x16 DMI x 4 CBDMA On Pkg PCIe x16 1x UPI x20 PCIe x16 U D P N SKX SKX SKX SKX SKX SKX /QPII U P D N /QPII D N U P DDR4 MC MC DDR4 /QPII U P D N /QPII D N U P DDR4 DDR4 SKX SKX SKX SKX DDR4 DDR4 /QPII U P D N /QPII D N U P /QPII U P D N /QPII D N U P SKX SKX SKX SKX SKX SKX /QPII U P D N /QPII D N U P /QPII U D P N U D P N /QPII D N U P SKX SKX SKX SKX SKX SKX UP DN SKX SKX SKX SKX SKX SKX DDR Home Agent Mem Ctlr DDR DDR Home Agent Mem Ctlr DDR CHA Caching and Home Agent ; SF Snoop Filter; Last Level ; SKX Skylake Server ; UPI Intel UltraPath Interconnect Mesh Improves Scalability with Higher Bandwidth and Reduced Latencies 13

14 Re-Architected L2 & L3 Hierarchy Previous Architectures Shared L3 /core (inclusive) Intel Xeon Scalable Processor Architecture Shared L MB/core (non-inclusive) L2 (256KB private) L2 (256KB private) L2 (256KB private) L2 (1MB private) L2 (1MB private) L2 (1MB private) On-chip cache balance shifted from shared-distributed (prior architectures) to private-local (Skylake architecture): Shared-distributed shared-distributed L3 is primary cache Private-local private L2 becomes primary cache with shared L3 used as overflow cache Shared L3 changed from inclusive to non-inclusive: Inclusive (prior architectures) L3 has copies of all lines in L2 Non-inclusive (Skylake architecture) lines in L2 may not exist in L3 Skylake-SP cache hierarchy architected specifically for Data center use case 14

15 Intel Advanced Vector Extensions-512 (AVX-512) 512-bit wide vectors 32 operand registers 8 64b mask registers Embedded broadcast Embedded rounding Microarchitecture Instruction Set SP FLOPs / cycle DP FLOPs / cycle Skylake Intel AVX-512 & FMA Haswell / Broadwell Intel AVX2 & FMA Sandybridge Intel AVX (256b) 16 8 Nehalem SSE (128b) 8 4 Intel AVX-512 Instruction Types AVX-512-F AVX-512-VL AVX-512-BW AVX-512-DQ AVX-512-CD AVX-512 Foundation Instructions Vector Length Orthogonality : ability to operate on sub-512 vector sizes 512-bit Byte/Word support Additional D/Q/SP/DP instructions (converts, transcendental support, etc.) Conflict Detect : used in vectorizing loops with potential address conflicts Powerful instruction set for data-parallel computation 15

16 Frequency AVX2 AVX512 Non-AVX AVX2 Non-AVX Frequency Behavior While Running Intel AVX Code s running non-avx, Intel AVX2 light/heavy, and Intel AVX-512 light/heavy code have different turbo frequency limits Frequency of each core is determined independently based on workload demand Mixed Workloads Non-AVX_Turbo AVX2_Turbo AVX512_Turbo Code Type SSE AVX2-Light (without FP & int-mul) All Frequency Limit Non-AVX All Turbo Non-AVX_Base AVX2_Base AVX512_Base AVX2-Heavy (FP & int-mul) AVX512-Light (without FP & int-mul) AVX2 All Turbo s AVX512-Heavy (FP & int-mul) AVX512 All Turbo AVX512 AVX2 Non-AVX s using AVX-512 s using AVX2 s not using AVX 16

17 Normalized to SSE4.2 GFLOPs/GHz GFLOPs, System Power Frequency Normalized to SSE4.2 GFLOPs/Watt Performance and Efficiency with Intel AVX LINPACK Performance SSE4.2 AVX AVX2 AVX512 GFLOPs Power (W) Frequency (GHz) Source as of June 2017: Intel internal measurements on platform with Xeon Platinum 8180, Turbo enabled, UPI=10.4, SNC1, 6x32GB DDR per CPU, 1 DPC. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products GFLOPs / Watt SSE4.2 AVX AVX2 AVX GFLOPs / GHz SSE4.2 AVX AVX2 AVX512 Intel AVX-512 delivers significant performance and efficiency gains 17

18 Integrated Intel Omni-Path Architecture Platform Benefits - Maximized I/O Density per Node Up to TWO additional PCIe x16 slots are available for maximizing I/O density 1 x16 Significantly more I/O capacity for compute or storage nodes 1 Compute Node GPU GPU GPU GPU GPU GPU SKX-F OPA HFI IFP Cable IFT Card Intel Xeon Processor-F Intel Xeon Processor-F HFI HFI SKX-F or SKX OPA HFI Storage Node or File System Server SKUS WITH INTEGRATED INTEL OMNI-PATH ARCHITECTURE FABRIC Class SKU s Base Non-AVX Speed (GHz) TDP (W) Platinum x F Platinum 8160F Gold 6148F Gold 6142F Gold 6138F Gold 6130F Gold 6126F Intel Xeon Processor-F HFI Intel Xeon Processor-F HFI 1 For illustrative purposes only. Assumes each CPU socket is configured with all 48 PCIe lanes routed to three x16 slots, or 96 total lanes for a 2S Purley platform. PCIe slot count and PCIe device support will vary by OEM platform, so check with your OEM for more details.

19 3D XPoint 1. 3D XPoint is the next generation non-volatile memory technology by Intel and Micron. 2. Intel SSDs with 3D XPoint media came to market in 2017 (Optane) 3. DDR4 socket compatible, Intel DIMMs based on 3D XPoint technology will be supported on next generation data center platform, code-named Purley. 4. On a 2S Xeon server or workstation, Intel DIMMs can offer up to 2X system memory capacity at significantly lower cost per GB than DRAM 5. Intel DIMMs can deliver big memory benefits to existing OS and apps without any modification in the OS or apps 6. Intel DIMMs will co-exist with conventional DDR4 DRAM DIMMs on same platform 7. Intel has sampled DIMMs to select customers 8. Intel DIMMs will be supported on a new version of Skylake in mid

20 Premise End to End Earth System Simulations Architecture Citations (conventional and novel) Kernel Extractions from End to End Earth System Simulations Goal: Simulate more representative systems at greater fidelity, faster, consuming less power and with maximum developer productivity Basic Principles: Balanced System Data Centric Approaches Decrease latency at every level of integration Minimize data movement and reformatting Exploit Industry Standards Respect and enhance customer IP Accommodate load imbalance Exploit and improve s, Vectors, Memory 20

21 Earth System Model Proxy How often is the profile spread across a moderate Number of functions? How often can Kernel speedups and Kernel Energy Reductions translate into Complete E2E workload speedups? Benefit is the management of balance of Kernel Speedup vs data movement + other overheads Durability of the optimized code across the range of simulation use cases 21

22 Geometry Mapping Grid of Physical System Sizes may be dictated by physics or by collected sensor data SW Instantiation of the model Decomposition in ranks, threads and vector loops Possible cache blocking Machine Geometry #s Vector SIMD Width sizes and levels Adaptive Work decomposition Load balance 22

23 Geometry Mapping Grid of Physical System Sizes may be dictated by physics or by collected sensor data SW Instantiation of the model Decomposition in ranks, threads and vector loops Possible cache blocking Machine Geometry #s Vector SIMD Width sizes and levels Adaptive Work decomposition Load balance 23

24 Geometry Mapping Grid of Physical System Sizes may be dictated by physics or by collected sensor data SW Instantiation of the model Decomposition in ranks, threads and vector loops Possible cache blocking Machine Geometry #s Vector SIMD Width sizes and levels Adaptive Work decomposition Load balance 24

25 Geometry Mapping Grid of Physical System Sizes may be dictated by physics or by collected sensor data SW Instantiation of the model Decomposition in ranks, threads and vector loops Possible cache blocking Machine Geometry #s Vector SIMD Width sizes and levels Adaptive Work decomposition Load balance 25

26 Current Best Known Methods B e n e f i t Vectorization AVX-512 Hybridization Resource Search and eliminate scalability limiters (iterative process) 512 bit vector registers Masking Architecture Full Intel compiler and library support 4 way NUMA platform 18c/socket Huge memory and IO capacity Blocking App C = A * B C = A * B do j = 1, n do i = 1, m do k = 1, l c(i,j) = c(i,j) + a(i,k)*b(k,j) enddo enddo enddo HBM (High Bandwidth Memory) DDRn Numa HBMn Numa 26

27 Position Paper on High Performance Computing Needs in Earth System Prediction (National Earth System Prediction Capability, Silver Spring, MD) April 28, 2017 Selected quotes 1. We advocate for a shift in processor design to increase emphasis on memory bandwidth, so that Earth System models run more efficiently and better serve the public need The present high-water mark of 6% of peak performance achieved for a well designed weather and climate prediction model 14 falls short of what is needed to advance weather and climate prediction in the next decade. References: [1] Carman, Jessie, Thomas Clune, Francis Giraldo, Mark Govett, Brian Gross, Anke Kamrath, Tsengdar Lee, David McCarren, John Michalakes, Scott Sandgathe, Tim Whitcomb Position paper on high performance computing needs in Earth system prediction. National Earth System Prediction Capability. [14] Muller, Andreas, et al. Strong Scaling for Numerical Weather Prediction at Petascale with the Atmospheric Model NUMA, submitted to the International Journal of High-Performance Computing Applications,

28 Tuning the implementation of the radiation scheme ACRANEB2 Per Berg and Jacob Weismann Poulsen, DMI 2 nd ESCAPE Dissemination and Training Workshop, September 2017 Case Study for the refactoring of Radiation kernels Explore portability vs performance and power impact Key conclusions: Portable Competitive Performance SW Refactoring >> Porting impact for Perf and Energy th CPU and GPU are performant (in both absolute and relative sense) in perf and energy when independently refactored. Source: Tuning the implementation of the radiation scheme ACRANEB2, Per Berg and Jacob Weismann Poulsen, DMI 2 nd ESCAPE Dissemination and Training Workshop, Sept 2017 SKX-8180 measurements provided by Intel, Sept 2017 Optimization Notice:

29 Additional Insights from case study Source: Personal communications with Per Berg and Jacob Weismann Poulsen, DMI Source: Tuning the implementation of the radiation scheme ACRANEB2, Per Berg and Jacob Weismann Poulsen, DMI 2 nd ESCAPE Dissemination and Training Workshop, Sept 2017 Setting Expectations for the Performance Portability between Companion Accelerator and Many Systems John M Levesque, DOE Center of Excellence Performance Portability meeting, 2017, Cray Quote The best performance on the GPU does not perform well on KNL and state-of-the-art Xeon The best performance on KNL performs well on Xeon and okay on the GPU Optimizations were more durable across several generations of Xeon and KNL than across several generations of GPU. Speedups are less interesting and speak mostly to legacy; Only the time to solution should be pursued and compared. GPU optimized version for the main loop currently requires 4x sloc-count (affects development and maintainability). Optimization Notice:

30 Summary The Intel Xeon Scalable Processors: Now available Part of the Intel Scalable System Framework ;Spans Processors, Memory, Storage, CFS, Fabric, Software Balanced System design critical for advancing ESM simulation capability; Basic principles introduced ESM kernel optimizations greatly outpace Generalized End to End workload improvements DMI ACRANEB2 SW refactoring shows value of SW refactoring and limits of performance portability 30

31

Hubert Nueckel Principal Engineer, Intel. Doug Nelson Technical Lead, Intel. September 2017

Hubert Nueckel Principal Engineer, Intel. Doug Nelson Technical Lead, Intel. September 2017 Hubert Nueckel Principal Engineer, Intel Doug Nelson Technical Lead, Intel September 2017 Legal Disclaimer Intel technologies features and benefits depend on system configuration and may require enabled

More information

Dr Christopher Dahnken. SSG DRD EMEA Datacenter

Dr Christopher Dahnken. SSG DRD EMEA Datacenter Dr Christopher Dahnken SSG DRD EMEA Datacenter Legal Disclaimer & Optimization Notice INFORMATION IN THIS DOCUMENT IS PROVIDED AS IS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL

More information

FAST FORWARD TO YOUR <NEXT> CREATION

FAST FORWARD TO YOUR <NEXT> CREATION FAST FORWARD TO YOUR CREATION THE ULTIMATE PROFESSIONAL WORKSTATIONS POWERED BY INTEL XEON PROCESSORS 7 SEPTEMBER 2017 WHAT S NEW INTRODUCING THE NEW INTEL XEON SCALABLE PROCESSOR BREAKTHROUGH PERFORMANCE

More information

Copyright 2017 Intel Corporation

Copyright 2017 Intel Corporation Agenda Intel Xeon Scalable Platform Overview Architectural Enhancements 2 Platform Overview 3x16 PCIe* Gen3 2 or 3 Intel UPI 3x16 PCIe Gen3 Capabilities Details 10GbE Skylake-SP CPU OPA DMI Intel C620

More information

Intel Xeon Scalable Processor for HPC 나승구이사

Intel Xeon Scalable Processor for HPC 나승구이사 Intel Xeon Scalable Processor for HPC 나승구이사 Growing Challenges in HPC System Bottlenecks The Walls Divergent Workloads Machine learning hpc Big Data visualization Barriers to Extending Usage Optimizing

More information

Ultimate Workstation Performance

Ultimate Workstation Performance Product brief & COMPARISON GUIDE Intel Scalable Processors Intel W Processors Ultimate Workstation Performance Intel Scalable Processors and Intel W Processors for Professional Workstations Optimized to

More information

April 2 nd, Bob Burroughs Director, HPC Solution Sales

April 2 nd, Bob Burroughs Director, HPC Solution Sales April 2 nd, 2019 Bob Burroughs Director, HPC Solution Sales Today - Introducing 2 nd Generation Intel Xeon Scalable Processors how Intel Speeds HPC performance Work Time System Peak Efficiency Software

More information

IFS RAPS14 benchmark on 2 nd generation Intel Xeon Phi processor

IFS RAPS14 benchmark on 2 nd generation Intel Xeon Phi processor IFS RAPS14 benchmark on 2 nd generation Intel Xeon Phi processor D.Sc. Mikko Byckling 17th Workshop on High Performance Computing in Meteorology October 24 th 2016, Reading, UK Legal Disclaimer & Optimization

More information

Fast-track Hybrid IT Transformation with Intel Data Center Blocks for Cloud

Fast-track Hybrid IT Transformation with Intel Data Center Blocks for Cloud Fast-track Hybrid IT Transformation with Intel Data Center Blocks for Cloud Kyle Corrigan, Cloud Product Line Manager, Intel Server Products Group Wagner Diaz, Product Marketing Engineer, Intel Data Center

More information

Outline. Motivation Parallel k-means Clustering Intel Computing Architectures Baseline Performance Performance Optimizations Future Trends

Outline. Motivation Parallel k-means Clustering Intel Computing Architectures Baseline Performance Performance Optimizations Future Trends Collaborators: Richard T. Mills, Argonne National Laboratory Sarat Sreepathi, Oak Ridge National Laboratory Forrest M. Hoffman, Oak Ridge National Laboratory Jitendra Kumar, Oak Ridge National Laboratory

More information

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme FUT3056BU VMware vsphere Scales on the Amazing Next-Gen Intel Xeon Architecture VMworld 2017 Content: Not for publication Tom Adelmeyer, Richard A. Brunner, Principal Engineer, Intel Principal Engineer,

More information

unleashed the future Intel Xeon Scalable Processors for High Performance Computing Alexey Belogortsev Field Application Engineer

unleashed the future Intel Xeon Scalable Processors for High Performance Computing Alexey Belogortsev Field Application Engineer the future unleashed Alexey Belogortsev Field Application Engineer Intel Xeon Scalable Processors for High Performance Computing Growing Challenges in System Architecture The Walls System Bottlenecks Divergent

More information

Accelerating HPC. (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing

Accelerating HPC. (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing Accelerating HPC (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing SAAHPC, Knoxville, July 13, 2010 Legal Disclaimer Intel may make changes to specifications and product

More information

Andreas Schneider. Markus Leberecht. Senior Cloud Solution Architect, Intel Deutschland. Distribution Sales Manager, Intel Deutschland

Andreas Schneider. Markus Leberecht. Senior Cloud Solution Architect, Intel Deutschland. Distribution Sales Manager, Intel Deutschland Markus Leberecht Senior Cloud Solution Architect, Intel Deutschland Andreas Schneider Distribution Sales Manager, Intel Deutschland Legal Disclaimers 2016 Intel Corporation. Intel, the Intel logo, Xeon

More information

Intel HPC Technologies Outlook

Intel HPC Technologies Outlook Intel HPC Technologies Outlook Andrey Semin Principal Engineer, HPC Technology Manager, EMEA October 19 th, 2015 ZKI Tagung AK Supercomputing Munich, Germany Legal Disclaimers INFORMATION IN THIS DOCUMENT

More information

Efficient Parallel Programming on Xeon Phi for Exascale

Efficient Parallel Programming on Xeon Phi for Exascale Efficient Parallel Programming on Xeon Phi for Exascale Eric Petit, Intel IPAG, Seminar at MDLS, Saclay, 29th November 2016 Legal Disclaimers Intel technologies features and benefits depend on system configuration

More information

LS-DYNA Performance on Intel Scalable Solutions

LS-DYNA Performance on Intel Scalable Solutions LS-DYNA Performance on Intel Scalable Solutions Nick Meng, Michael Strassmaier, James Erwin, Intel nick.meng@intel.com, michael.j.strassmaier@intel.com, james.erwin@intel.com Jason Wang, LSTC jason@lstc.com

More information

Munara Tolubaeva Technical Consulting Engineer. 3D XPoint is a trademark of Intel Corporation in the U.S. and/or other countries.

Munara Tolubaeva Technical Consulting Engineer. 3D XPoint is a trademark of Intel Corporation in the U.S. and/or other countries. Munara Tolubaeva Technical Consulting Engineer 3D XPoint is a trademark of Intel Corporation in the U.S. and/or other countries. notices and disclaimers Intel technologies features and benefits depend

More information

Intel s Architecture for NFV

Intel s Architecture for NFV Intel s Architecture for NFV Evolution from specialized technology to mainstream programming Net Futures 2015 Network applications Legal Disclaimer INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION

More information

Intel Xeon Phi архитектура, модели программирования, оптимизация.

Intel Xeon Phi архитектура, модели программирования, оптимизация. Нижний Новгород, 2017 Intel Xeon Phi архитектура, модели программирования, оптимизация. Дмитрий Прохоров, Дмитрий Рябцев, Intel Agenda What and Why Intel Xeon Phi Top 500 insights, roadmap, architecture

More information

Bei Wang, Dmitry Prohorov and Carlos Rosales

Bei Wang, Dmitry Prohorov and Carlos Rosales Bei Wang, Dmitry Prohorov and Carlos Rosales Aspects of Application Performance What are the Aspects of Performance Intel Hardware Features Omni-Path Architecture MCDRAM 3D XPoint Many-core Xeon Phi AVX-512

More information

A U G U S T 8, S A N T A C L A R A, C A

A U G U S T 8, S A N T A C L A R A, C A A U G U S T 8, 2 0 1 8 S A N T A C L A R A, C A Data-Centric Innovation Summit LISA SPELMAN VICE PRESIDENT & GENERAL MANAGER INTEL XEON PRODUCTS AND DATA CENTER MARKETING Increased integration and optimization

More information

NVMe Over Fabrics: Scaling Up With The Storage Performance Development Kit

NVMe Over Fabrics: Scaling Up With The Storage Performance Development Kit NVMe Over Fabrics: Scaling Up With The Storage Performance Development Kit Ben Walker Data Center Group Intel Corporation 2018 Storage Developer Conference. Intel Corporation. All Rights Reserved. 1 Notices

More information

Intel Architecture for HPC

Intel Architecture for HPC Intel Architecture for HPC Georg Zitzlsberger georg.zitzlsberger@vsb.cz 1st of March 2018 Agenda Salomon Architectures Intel R Xeon R processors v3 (Haswell) Intel R Xeon Phi TM coprocessor (KNC) Ohter

More information

Intel Architecture 2S Server Tioga Pass Performance and Power Optimization

Intel Architecture 2S Server Tioga Pass Performance and Power Optimization Intel Architecture 2S Server Tioga Pass Performance and Power Optimization Terry Trausch/Platform Architect/Intel Inc. Whitney Zhao/HW Engineer/Facebook Inc. Agenda Tioga Pass Feature Overview Intel Xeon

More information

Philippe Thierry Sr Staff Engineer Intel Corp.

Philippe Thierry Sr Staff Engineer Intel Corp. HPC@Intel Philippe Thierry Sr Staff Engineer Intel Corp. IBM, April 8, 2009 1 Agenda CPU update: roadmap, micro-μ and performance Solid State Disk Impact What s next Q & A Tick Tock Model Perenity market

More information

Intel Xeon Phi архитектура, модели программирования, оптимизация.

Intel Xeon Phi архитектура, модели программирования, оптимизация. Нижний Новгород, 2016 Intel Xeon Phi архитектура, модели программирования, оптимизация. Дмитрий Прохоров, Intel Agenda What and Why Intel Xeon Phi Top 500 insights, roadmap, architecture How Programming

More information

Intel SSD Data center evolution

Intel SSD Data center evolution Intel SSD Data center evolution March 2018 1 Intel Technology Innovations Fill the Memory and Storage Gap Performance and Capacity for Every Need Intel 3D NAND Technology Lower cost & higher density Intel

More information

Trends in systems and how to get efficient performance

Trends in systems and how to get efficient performance Trends in systems and how to get efficient performance Martin Hilgeman HPC Consultant martin.hilgeman@dell.com The landscape is changing We are no longer in the general purpose era the argument of tuning

More information

Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Intel Xeon Processor E7 v2 Family-Based Platforms

Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Intel Xeon Processor E7 v2 Family-Based Platforms Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Family-Based Platforms Executive Summary Complex simulations of structural and systems performance, such as car crash simulations,

More information

Data center: The center of possibility

Data center: The center of possibility Data center: The center of possibility Diane bryant Executive vice president & general manager Data center group, intel corporation Data center: The center of possibility The future is Thousands of Clouds

More information

IXPUG 16. Dmitry Durnov, Intel MPI team

IXPUG 16. Dmitry Durnov, Intel MPI team IXPUG 16 Dmitry Durnov, Intel MPI team Agenda - Intel MPI 2017 Beta U1 product availability - New features overview - Competitive results - Useful links - Q/A 2 Intel MPI 2017 Beta U1 is available! Key

More information

THE STORAGE PERFORMANCE DEVELOPMENT KIT AND NVME-OF

THE STORAGE PERFORMANCE DEVELOPMENT KIT AND NVME-OF 14th ANNUAL WORKSHOP 2018 THE STORAGE PERFORMANCE DEVELOPMENT KIT AND NVME-OF Paul Luse Intel Corporation Apr 2018 AGENDA Storage Performance Development Kit What is SPDK? The SPDK Community Why are so

More information

INTEL HPC DEVELOPER CONFERENCE FUEL YOUR INSIGHT

INTEL HPC DEVELOPER CONFERENCE FUEL YOUR INSIGHT INTEL HPC DEVELOPER CONFERENCE FUEL YOUR INSIGHT INTEL HPC DEVELOPER CONFERENCE FUEL YOUR INSIGHT UPDATE ON OPENSWR: A SCALABLE HIGH- PERFORMANCE SOFTWARE RASTERIZER FOR SCIVIS Jefferson Amstutz Intel

More information

EARLY EVALUATION OF THE CRAY XC40 SYSTEM THETA

EARLY EVALUATION OF THE CRAY XC40 SYSTEM THETA EARLY EVALUATION OF THE CRAY XC40 SYSTEM THETA SUDHEER CHUNDURI, SCOTT PARKER, KEVIN HARMS, VITALI MOROZOV, CHRIS KNIGHT, KALYAN KUMARAN Performance Engineering Group Argonne Leadership Computing Facility

More information

Accelerating Insights In the Technical Computing Transformation

Accelerating Insights In the Technical Computing Transformation Accelerating Insights In the Technical Computing Transformation Dr. Rajeeb Hazra Vice President, Data Center Group General Manager, Technical Computing Group June 2014 TOP500 Highlights Intel Xeon Phi

More information

Dr. Jean-Laurent PHILIPPE, PhD EMEA HPC Technical Sales Specialist. With Dell Amsterdam, October 27, 2016

Dr. Jean-Laurent PHILIPPE, PhD EMEA HPC Technical Sales Specialist. With Dell Amsterdam, October 27, 2016 Dr. Jean-Laurent PHILIPPE, PhD EMEA HPC Technical Sales Specialist With Dell Amsterdam, October 27, 2016 Legal Disclaimers Intel technologies features and benefits depend on system configuration and may

More information

Technologies and application performance. Marc Mendez-Bermond HPC Solutions Expert - Dell Technologies September 2017

Technologies and application performance. Marc Mendez-Bermond HPC Solutions Expert - Dell Technologies September 2017 Technologies and application performance Marc Mendez-Bermond HPC Solutions Expert - Dell Technologies September 2017 The landscape is changing We are no longer in the general purpose era the argument of

More information

What s P. Thierry

What s P. Thierry What s new@intel P. Thierry Principal Engineer, Intel Corp philippe.thierry@intel.com CPU trend Memory update Software Characterization in 30 mn 10 000 feet view CPU : Range of few TF/s and

More information

Achieving 2.5X 1 Higher Performance for the Taboola TensorFlow* Serving Application through Targeted Software Optimization

Achieving 2.5X 1 Higher Performance for the Taboola TensorFlow* Serving Application through Targeted Software Optimization white paper Internet Discovery Artificial Intelligence (AI) Achieving.X Higher Performance for the Taboola TensorFlow* Serving Application through Targeted Software Optimization As one of the world s preeminent

More information

Knights Corner: Your Path to Knights Landing

Knights Corner: Your Path to Knights Landing Knights Corner: Your Path to Knights Landing James Reinders, Intel Wednesday, September 17, 2014; 9-10am PDT Photo (c) 2014, James Reinders; used with permission; Yosemite Half Dome rising through forest

More information

Intel Knights Landing Hardware

Intel Knights Landing Hardware Intel Knights Landing Hardware TACC KNL Tutorial IXPUG Annual Meeting 2016 PRESENTED BY: John Cazes Lars Koesterke 1 Intel s Xeon Phi Architecture Leverages x86 architecture Simpler x86 cores, higher compute

More information

Introduction: Modern computer architecture. The stored program computer and its inherent bottlenecks Multi- and manycore chips and nodes

Introduction: Modern computer architecture. The stored program computer and its inherent bottlenecks Multi- and manycore chips and nodes Introduction: Modern computer architecture The stored program computer and its inherent bottlenecks Multi- and manycore chips and nodes Motivation: Multi-Cores where and why Introduction: Moore s law Intel

More information

Essential Performance and Advanced Security

Essential Performance and Advanced Security Product brief Intel Xeon E-2100 Processor Essential Performance and Advanced Security for Entry Server, Secure Cloud, and Entry Workstation Solutions Performance and Security, Intelligently Designed for

More information

Visualizing and Finding Optimization Opportunities with Intel Advisor Roofline feature. Intel Software Developer Conference London, 2017

Visualizing and Finding Optimization Opportunities with Intel Advisor Roofline feature. Intel Software Developer Conference London, 2017 Visualizing and Finding Optimization Opportunities with Intel Advisor Roofline feature Intel Software Developer Conference London, 2017 Agenda Vectorization is becoming more and more important What is

More information

Enabling Performance-per-Watt Gains in High-Performance Cluster Computing

Enabling Performance-per-Watt Gains in High-Performance Cluster Computing WHITE PAPER Appro Xtreme-X Supercomputer with the Intel Xeon Processor E5-2600 Product Family Enabling Performance-per-Watt Gains in High-Performance Cluster Computing Appro Xtreme-X Supercomputer with

More information

Intel Architecture for Software Developers

Intel Architecture for Software Developers Intel Architecture for Software Developers 1 Agenda Introduction Processor Architecture Basics Intel Architecture Intel Core and Intel Xeon Intel Atom Intel Xeon Phi Coprocessor Use Cases for Software

More information

Intel HPC Portfolio September Emiliano Politano Technical Account Manager

Intel HPC Portfolio September Emiliano Politano Technical Account Manager Intel HPC Portfolio September 2014 Emiliano Politano Technical Account Manager Legal Disclaimers Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors.

More information

Srinivas Chennupaty. Intel Corporation, 2018

Srinivas Chennupaty. Intel Corporation, 2018 Srinivas Chennupaty Intel Corporation, 2018 Notices and Disclaimers This document contains information on products, services and/or processes in development. All information provided here is subject to

More information

The knight makes his play for the crown Phi & Omni-Path Glenn Rosenberg Computer Insights UK 2016

The knight makes his play for the crown Phi & Omni-Path Glenn Rosenberg Computer Insights UK 2016 The knight makes his play for the crown Phi & Omni-Path Glenn Rosenberg Computer Insights UK 2016 2016 Supermicro 15 Minutes Two Swim Lanes Intel Phi Roadmap & SKUs Phi in the TOP500 Use Cases Supermicro

More information

Deep learning prevalence. first neuroscience department. Spiking Neuron Operant conditioning First 1 Billion transistor processor

Deep learning prevalence. first neuroscience department. Spiking Neuron Operant conditioning First 1 Billion transistor processor WELCOME TO Operant conditioning 1938 Spiking Neuron 1952 first neuroscience department 1964 Deep learning prevalence mid 2000s The Turing Machine 1936 Transistor 1947 First computer science department

More information

Performance Evaluation of NWChem Ab-Initio Molecular Dynamics (AIMD) Simulations on the Intel Xeon Phi Processor

Performance Evaluation of NWChem Ab-Initio Molecular Dynamics (AIMD) Simulations on the Intel Xeon Phi Processor * Some names and brands may be claimed as the property of others. Performance Evaluation of NWChem Ab-Initio Molecular Dynamics (AIMD) Simulations on the Intel Xeon Phi Processor E.J. Bylaska 1, M. Jacquelin

More information

Re-Architecting Cloud Storage with Intel 3D XPoint Technology and Intel 3D NAND SSDs

Re-Architecting Cloud Storage with Intel 3D XPoint Technology and Intel 3D NAND SSDs Re-Architecting Cloud Storage with Intel 3D XPoint Technology and Intel 3D NAND SSDs Jack Zhang yuan.zhang@intel.com, Cloud & Enterprise Storage Architect Santa Clara, CA 1 Agenda Memory Storage Hierarchy

More information

Innovation Accelerating Mission Critical Infrastructure

Innovation Accelerating Mission Critical Infrastructure Innovation Accelerating Mission Critical Infrastructure Legal Disclaimers INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE,

More information

HPC Technology Trends

HPC Technology Trends HPC Technology Trends High Performance Embedded Computing Conference September 18, 2007 David S Scott, Ph.D. Petascale Product Line Architect Digital Enterprise Group Risk Factors Today s s presentations

More information

Ravindra Babu Ganapathi Product Owner/ Technical Lead Omni Path Libraries, Intel Corp. Sayantan Sur Senior Software Engineer, Intel Corp.

Ravindra Babu Ganapathi Product Owner/ Technical Lead Omni Path Libraries, Intel Corp. Sayantan Sur Senior Software Engineer, Intel Corp. Ravindra Babu Ganapathi Product Owner/ Technical Lead Omni Path Libraries, Intel Corp. Sayantan Sur Senior Software Engineer, Intel Corp. Legal All information provided here is subject to change without

More information

Kevin O Leary, Intel Technical Consulting Engineer

Kevin O Leary, Intel Technical Consulting Engineer Kevin O Leary, Intel Technical Consulting Engineer Moore s Law Is Going Strong Hardware performance continues to grow exponentially We think we can continue Moore's Law for at least another 10 years."

More information

EPYC VIDEO CUG 2018 MAY 2018

EPYC VIDEO CUG 2018 MAY 2018 AMD UPDATE CUG 2018 EPYC VIDEO CRAY AND AMD PAST SUCCESS IN HPC AMD IN TOP500 LIST 2002 TO 2011 2011 - AMD IN FASTEST MACHINES IN 11 COUNTRIES ZEN A FRESH APPROACH Designed from the Ground up for Optimal

More information

Visualizing and Finding Optimization Opportunities with Intel Advisor Roofline feature

Visualizing and Finding Optimization Opportunities with Intel Advisor Roofline feature Visualizing and Finding Optimization Opportunities with Intel Advisor Roofline feature Intel Software Developer Conference Frankfurt, 2017 Klaus-Dieter Oertel, Intel Agenda Intel Advisor for vectorization

More information

Hardware and Software Co-Optimization for Best Cloud Experience

Hardware and Software Co-Optimization for Best Cloud Experience Hardware and Software Co-Optimization for Best Cloud Experience Khun Ban (Intel), Troy Wallins (Intel) October 25-29 2015 1 Cloud Computing Growth Connected Devices + Apps + New Services + New Service

More information

Intel Many Integrated Core (MIC) Architecture

Intel Many Integrated Core (MIC) Architecture Intel Many Integrated Core (MIC) Architecture Karl Solchenbach Director European Exascale Labs BMW2011, November 3, 2011 1 Notice and Disclaimers Notice: This document contains information on products

More information

Intel Workstation Technology

Intel Workstation Technology Intel Workstation Technology Turning Imagination Into Reality November, 2008 1 Step up your Game Real Workstations Unleash your Potential 2 Yesterday s Super Computer Today s Workstation = = #1 Super Computer

More information

Intel Advisor XE Future Release Threading Design & Prototyping Vectorization Assistant

Intel Advisor XE Future Release Threading Design & Prototyping Vectorization Assistant Intel Advisor XE Future Release Threading Design & Prototyping Vectorization Assistant Parallel is the Path Forward Intel Xeon and Intel Xeon Phi Product Families are both going parallel Intel Xeon processor

More information

The Intel Xeon PHI Architecture

The Intel Xeon PHI Architecture The Intel Xeon PHI Architecture (codename Knights Landing ) Dr. Christopher Dahnken Senior Application Engineer Software and Services Group Legal Disclaimer & Optimization Notice INFORMATION IN THIS DOCUMENT

More information

Intel Select Solutions for Professional Visualization with Advantech Servers & Appliances

Intel Select Solutions for Professional Visualization with Advantech Servers & Appliances Solution Brief Intel Select Solution for Professional Visualization Intel Xeon Processor Scalable Family Powered by Intel Rendering Framework Intel Select Solutions for Professional Visualization with

More information

Accelerate Machine Learning on macos with Intel Integrated Graphics. Hisham Chowdhury May 23, 2018

Accelerate Machine Learning on macos with Intel Integrated Graphics. Hisham Chowdhury May 23, 2018 Accelerate Machine Learning on macos with Intel Integrated Graphics Hisham Chowdhury May 23, 2018 Apple Machine Learning Stack Machine Learning Application 1 Machine Learning Application 2 Vision Natural

More information

DEVITO AUTOMATED HIGH-PERFORMANCE FINITE DIFFERENCES FOR GEOPHYSICAL EXPLORATION

DEVITO AUTOMATED HIGH-PERFORMANCE FINITE DIFFERENCES FOR GEOPHYSICAL EXPLORATION DEVITO AUTOMATED HIGH-PERFORMANCE FINITE DIFFERENCES FOR GEOPHYSICAL EXPLORATION F. Luporini 1, C. Yount 4, M. Louboutin 3, N. Kukreja 1, P. Witte 2, M. Lange 5, P. Kelly 1, F. Herrmann 3, G.Gorman 1 1Imperial

More information

Fast forward. To your <next>

Fast forward. To your <next> Fast forward To your Navin Shenoy EXECUTIVE VICE PRESIDENT GENERAL MANAGER, DATA CENTER GROUP CLOUD ECONOMICS INTELLIGENT DATA PRACTICES NETWORK TRANSFORMATION Intel Xeon Scalable Platform The

More information

SPDK China Summit Ziye Yang. Senior Software Engineer. Network Platforms Group, Intel Corporation

SPDK China Summit Ziye Yang. Senior Software Engineer. Network Platforms Group, Intel Corporation SPDK China Summit 2018 Ziye Yang Senior Software Engineer Network Platforms Group, Intel Corporation Agenda SPDK programming framework Accelerated NVMe-oF via SPDK Conclusion 2 Agenda SPDK programming

More information

The Mont-Blanc approach towards Exascale

The Mont-Blanc approach towards Exascale http://www.montblanc-project.eu The Mont-Blanc approach towards Exascale Alex Ramirez Barcelona Supercomputing Center Disclaimer: Not only I speak for myself... All references to unavailable products are

More information

Aim High. Intel Technical Update Teratec 07 Symposium. June 20, Stephen R. Wheat, Ph.D. Director, HPC Digital Enterprise Group

Aim High. Intel Technical Update Teratec 07 Symposium. June 20, Stephen R. Wheat, Ph.D. Director, HPC Digital Enterprise Group Aim High Intel Technical Update Teratec 07 Symposium June 20, 2007 Stephen R. Wheat, Ph.D. Director, HPC Digital Enterprise Group Risk Factors Today s s presentations contain forward-looking statements.

More information

Intel optane memory as platform accelerator. Vladimir Knyazkin

Intel optane memory as platform accelerator. Vladimir Knyazkin Intel optane memory as platform accelerator Vladimir Knyazkin 2 Legal Disclaimers Intel technologies features and benefits depend on system configuration and may require enabled hardware, software or service

More information

Intel High-Performance Computing. Technologies for Engineering

Intel High-Performance Computing. Technologies for Engineering 6. LS-DYNA Anwenderforum, Frankenthal 2007 Keynote-Vorträge II Intel High-Performance Computing Technologies for Engineering H. Cornelius Intel GmbH A - II - 29 Keynote-Vorträge II 6. LS-DYNA Anwenderforum,

More information

ENVISION TECHNOLOGY CONFERENCE. Functional intel (ia) BLA PARTHAS, INTEL PLATFORM ARCHITECT

ENVISION TECHNOLOGY CONFERENCE. Functional intel (ia) BLA PARTHAS, INTEL PLATFORM ARCHITECT ENVISION TECHNOLOGY CONFERENCE Functional Safety @ intel (ia) BLA PARTHAS, INTEL PLATFORM ARCHITECT Legal Notices & Disclaimers This document contains information on products, services and/or processes

More information

Future of datacenter STORAGE. Carol Wilder, Niels Reimers,

Future of datacenter STORAGE. Carol Wilder, Niels Reimers, Future of datacenter STORAGE Carol Wilder, carol.a.wilder@intel.com Niels Reimers, niels.reimers@intel.com Legal Notices/disclaimer Intel technologies features and benefits depend on system configuration

More information

Intel and SAP Realising the Value of your Data

Intel and SAP Realising the Value of your Data Intel and SAP Realising the Value of your Data Scott Pendrey Intel EMEA, Server Product Manager In the Future Systems will be workload optimized Infrastructure will be software defined Analytics will be

More information

Leading at the edge TECHNOLOGY AND MANUFACTURING DAY

Leading at the edge TECHNOLOGY AND MANUFACTURING DAY Leading at the edge IDM ADVANTAGE DR. MURTHY RENDUCHINTALA President - Client, IoT Businesses & Systems Architecture Group Disclosures Intel Technology and Manufacturing Day 2017 occurs during Intel s

More information

Small File I/O Performance in Lustre. Mikhail Pershin, Joe Gmitter Intel HPDD April 2018

Small File I/O Performance in Lustre. Mikhail Pershin, Joe Gmitter Intel HPDD April 2018 Small File I/O Performance in Lustre Mikhail Pershin, Joe Gmitter Intel HPDD April 2018 Overview Small File I/O Concerns Data on MDT (DoM) Feature Overview DoM Use Cases DoM Performance Results Small File

More information

HPCG on Intel Xeon Phi 2 nd Generation, Knights Landing. Alexander Kleymenov and Jongsoo Park Intel Corporation SC16, HPCG BoF

HPCG on Intel Xeon Phi 2 nd Generation, Knights Landing. Alexander Kleymenov and Jongsoo Park Intel Corporation SC16, HPCG BoF HPCG on Intel Xeon Phi 2 nd Generation, Knights Landing Alexander Kleymenov and Jongsoo Park Intel Corporation SC16, HPCG BoF 1 Outline KNL results Our other work related to HPCG 2 ~47 GF/s per KNL ~10

More information

Introduction: Modern computer architecture. The stored program computer and its inherent bottlenecks Multi- and manycore chips and nodes

Introduction: Modern computer architecture. The stored program computer and its inherent bottlenecks Multi- and manycore chips and nodes Introduction: Modern computer architecture The stored program computer and its inherent bottlenecks Multi- and manycore chips and nodes Multi-core today: Intel Xeon 600v4 (016) Xeon E5-600v4 Broadwell

More information

LS-DYNA Performance Benchmark and Profiling. October 2017

LS-DYNA Performance Benchmark and Profiling. October 2017 LS-DYNA Performance Benchmark and Profiling October 2017 2 Note The following research was performed under the HPC Advisory Council activities Participating vendors: LSTC, Huawei, Mellanox Compute resource

More information

High Performance Computing The Essential Tool for a Knowledge Economy

High Performance Computing The Essential Tool for a Knowledge Economy High Performance Computing The Essential Tool for a Knowledge Economy Rajeeb Hazra Vice President & General Manager Technical Computing Group Datacenter & Connected Systems Group July 22 nd 2013 1 What

More information

Intel Xeon Processor E v3 Family

Intel Xeon Processor E v3 Family Intel Xeon Processor E5-2600 v3 Family October 2014 Document Number: 331309-001US All information provided here is subject to change without notice. Contact your Intel representative to obtain the latest

More information

Accelerating Data Center Workloads with FPGAs

Accelerating Data Center Workloads with FPGAs Accelerating Data Center Workloads with FPGAs Enno Lübbers NorCAS 2017, Linköping, Sweden Intel technologies features and benefits depend on system configuration and may require enabled hardware, software

More information

HPC. Accelerating. HPC Advisory Council Lugano, CH March 15 th, Herbert Cornelius Intel

HPC. Accelerating. HPC Advisory Council Lugano, CH March 15 th, Herbert Cornelius Intel 15.03.2012 1 Accelerating HPC HPC Advisory Council Lugano, CH March 15 th, 2012 Herbert Cornelius Intel Legal Disclaimer 15.03.2012 2 INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS.

More information

Benchmark results on Knight Landing (KNL) architecture

Benchmark results on Knight Landing (KNL) architecture Benchmark results on Knight Landing (KNL) architecture Domenico Guida, CINECA SCAI (Bologna) Giorgio Amati, CINECA SCAI (Roma) Roma 23/10/2017 KNL, BDW, SKL A1 BDW A2 KNL A3 SKL cores per node 2 x 18 @2.3

More information

OPENSHMEM AND OFI: BETTER TOGETHER

OPENSHMEM AND OFI: BETTER TOGETHER 4th ANNUAL WORKSHOP 208 OPENSHMEM AND OFI: BETTER TOGETHER James Dinan, David Ozog, and Kayla Seager Intel Corporation [ April, 208 ] NOTICES AND DISCLAIMERS Intel technologies features and benefits depend

More information

LS-DYNA Performance Benchmark and Profiling. October 2017

LS-DYNA Performance Benchmark and Profiling. October 2017 LS-DYNA Performance Benchmark and Profiling October 2017 2 Note The following research was performed under the HPC Advisory Council activities Participating vendors: LSTC, Huawei, Mellanox Compute resource

More information

Data-Centric Innovation Summit NAVEEN RAO CORPORATE VICE PRESIDENT & GENERAL MANAGER ARTIFICIAL INTELLIGENCE PRODUCTS GROUP

Data-Centric Innovation Summit NAVEEN RAO CORPORATE VICE PRESIDENT & GENERAL MANAGER ARTIFICIAL INTELLIGENCE PRODUCTS GROUP Data-Centric Innovation Summit NAVEEN RAO CORPORATE VICE PRESIDENT & GENERAL MANAGER ARTIFICIAL INTELLIGENCE PRODUCTS GROUP Data center logic silicon Tam ~30% cagr Ai is exploding $8-10B Emerging as a

More information

Pactron FPGA Accelerated Computing Solutions

Pactron FPGA Accelerated Computing Solutions Pactron FPGA Accelerated Computing Solutions Intel Xeon + Altera FPGA 2015 Pactron HJPC Corporation 1 Motivation for Accelerators Enhanced Performance: Accelerators compliment CPU cores to meet market

More information

Atos ARM solutions for HPC

Atos ARM solutions for HPC Atos ARM solutions for HPC Eric Eppe Head of Solution Marketing & Portfolio HPC & Quantum Global Business Line Tuesday, March 7th, HPC User Forum, TERATEC Atos HPC and ARM A long time engagement 2012 2013

More information

HETEROGENEOUS HPC, ARCHITECTURAL OPTIMIZATION, AND NVLINK STEVE OBERLIN CTO, TESLA ACCELERATED COMPUTING NVIDIA

HETEROGENEOUS HPC, ARCHITECTURAL OPTIMIZATION, AND NVLINK STEVE OBERLIN CTO, TESLA ACCELERATED COMPUTING NVIDIA HETEROGENEOUS HPC, ARCHITECTURAL OPTIMIZATION, AND NVLINK STEVE OBERLIN CTO, TESLA ACCELERATED COMPUTING NVIDIA STATE OF THE ART 2012 18,688 Tesla K20X GPUs 27 PetaFLOPS FLAGSHIP SCIENTIFIC APPLICATIONS

More information

Ravindra Babu Ganapathi

Ravindra Babu Ganapathi 14 th ANNUAL WORKSHOP 2018 INTEL OMNI-PATH ARCHITECTURE AND NVIDIA GPU SUPPORT Ravindra Babu Ganapathi Intel Corporation [ April, 2018 ] Intel MPI Open MPI MVAPICH2 IBM Platform MPI SHMEM Intel MPI Open

More information

Density Optimized System Enabling Next-Gen Performance

Density Optimized System Enabling Next-Gen Performance Product brief High Performance Computing (HPC) and Hyper-Converged Infrastructure (HCI) Intel Server Board S2600BP Product Family Featuring the Intel Xeon Processor Scalable Family Density Optimized System

More information

Fast and Easy Persistent Storage for Docker* Containers with Storidge and Intel

Fast and Easy Persistent Storage for Docker* Containers with Storidge and Intel Solution brief Intel Storage Builders Storidge ContainerIO TM Intel Xeon Processor Scalable Family Intel SSD DC Family for PCIe*/NVMe Fast and Easy Persistent Storage for Docker* Containers with Storidge

More information

Meet the Increased Demands on Your Infrastructure with Dell and Intel. ServerWatchTM Executive Brief

Meet the Increased Demands on Your Infrastructure with Dell and Intel. ServerWatchTM Executive Brief Meet the Increased Demands on Your Infrastructure with Dell and Intel ServerWatchTM Executive Brief a QuinStreet Excutive Brief. 2012 Doing more with less is the mantra that sums up much of the past decade,

More information

WITH INTEL TECHNOLOGIES

WITH INTEL TECHNOLOGIES WITH INTEL TECHNOLOGIES Commitment Is to Enable The Best Democratize technologies Advance solutions Unleash innovations Intel Xeon Scalable Processor Family Delivers Ideal Enterprise Solutions NEW Intel

More information

Colin Cunningham, Intel Kumaran Siva, Intel Sandeep Mahajan, Oracle 03-Oct :45 p.m. - 5:30 p.m. Moscone West - Room 3020

Colin Cunningham, Intel Kumaran Siva, Intel Sandeep Mahajan, Oracle 03-Oct :45 p.m. - 5:30 p.m. Moscone West - Room 3020 Colin Cunningham, Intel Kumaran Siva, Intel Sandeep Mahajan, Oracle 03-Oct-2017 4:45 p.m. - 5:30 p.m. Moscone West - Room 3020 Big Data Talk Exploring New SSD Usage Models to Accelerate Cloud Performance

More information

Erkenntnisse aus aktuellen Performance- Messungen mit LS-DYNA

Erkenntnisse aus aktuellen Performance- Messungen mit LS-DYNA 14. LS-DYNA Forum, Oktober 2016, Bamberg Erkenntnisse aus aktuellen Performance- Messungen mit LS-DYNA Eric Schnepf 1, Dr. Eckardt Kehl 1, Chih-Song Kuo 2, Dymitrios Kyranas 2 1 Fujitsu Technology Solutions

More information

Intel and Red Hat. Matty Bakkeren Enterprise Technology Specialist

Intel and Red Hat. Matty Bakkeren Enterprise Technology Specialist Intel and Red Hat Matty Bakkeren Enterprise Technology Specialist Legal Disclaimer INFORMATION IN THIS DOCUMENT IS PROVIDED IN CONNECTION WITH INTEL PRODUCTS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL

More information