Accelerating HPC. (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing

Similar documents
HPC in the Multicore Era

FAST FORWARD TO YOUR <NEXT> CREATION

Intel s Architecture for NFV

Philippe Thierry Sr Staff Engineer Intel Corp.

HPC. Accelerating. HPC Advisory Council Lugano, CH March 15 th, Herbert Cornelius Intel

John Hengeveld Director of Marketing, HPC Evangelist

Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Intel Xeon Processor E7 v2 Family-Based Platforms

Intel Many Integrated Core (MIC) Architecture

Intel Workstation Platforms

Expand Your HPC Market Reach and Grow Your Sales with Intel Cluster Ready

High Performance Computing The Essential Tool for a Knowledge Economy

Competitive Advantage in a New Economy

Nehalem Hochleistungsrechnen für reale Anwendungen

Intel Workstation Technology

Ultimate Workstation Performance

IFS RAPS14 benchmark on 2 nd generation Intel Xeon Phi processor

Jason Waxman General Manager High Density Compute Division Data Center Group

April 2 nd, Bob Burroughs Director, HPC Solution Sales

Intel Architecture 2S Server Tioga Pass Performance and Power Optimization

Intel HPC Portfolio September Emiliano Politano Technical Account Manager

Fast-track Hybrid IT Transformation with Intel Data Center Blocks for Cloud

Intel High-Performance Computing. Technologies for Engineering

Microsoft SQL Server in a VMware Environment on Dell PowerEdge R810 Servers and Dell EqualLogic Storage

Quad-Core Intel Xeon Processor-based 3200/3210 Chipset Server Platforms

HPC Technology Trends

Intel Xeon Processor E v3 Family

Intel and Red Hat. Matty Bakkeren Enterprise Technology Specialist

Meet the Increased Demands on Your Infrastructure with Dell and Intel. ServerWatchTM Executive Brief

Intel Xeon Phi Coprocessor. Technical Resources. Intel Xeon Phi Coprocessor Workshop Pawsey Centre & CSIRO, Aug Intel Xeon Phi Coprocessor

Intel s next generation processors for IBM System x servers

EPYC VIDEO CUG 2018 MAY 2018

The Intel Xeon Phi Coprocessor. Dr-Ing. Michael Klemm Software and Services Group Intel Corporation

LS-DYNA Performance Benchmark and Profiling. April 2015

Dependable, Intelligent: Intel Xeon Processor- Based Servers

Accelerating Insights In the Technical Computing Transformation

Innovate. Integrate. Innovate. Integrate.

Making the Cloud Work for You Introducing the Intel Xeon Processor E5 Family

Aim High. Intel Technical Update Teratec 07 Symposium. June 20, Stephen R. Wheat, Ph.D. Director, HPC Digital Enterprise Group

Innovation Accelerating Mission Critical Infrastructure

Desktop 4th Generation Intel Core, Intel Pentium, and Intel Celeron Processor Families and Intel Xeon Processor E3-1268L v3

SU Dual and Quad-Core Xeon UP Server

Intel Architecture Press Briefing

STAR-CCM+ Performance Benchmark and Profiling. July 2014

IBM System x family brochure

QuickSpecs. HP Z600 Workstation Overview

What s P. Thierry

Density Optimized System Enabling Next-Gen Performance

Quad-Core Intel Xeon Processor 5400 Series

6th Generation Intel Core Processor Series

Intel Cluster Toolkit Compiler Edition 3.2 for Linux* or Windows HPC Server 2008*

POWER YOUR CREATIVITY WITH THE INTEL CORE X-SERIES PROCESSOR FAMILY

NVMe Over Fabrics: Scaling Up With The Storage Performance Development Kit

Innovating and Integrating for Communications and Storage

Enhancing Analysis-Based Design with Quad-Core Intel Xeon Processor-Based Workstations

Intel HPC Technologies Outlook

Exactly as much as you need.

Re-Architecting Cloud Storage with Intel 3D XPoint Technology and Intel 3D NAND SSDs

Pactron FPGA Accelerated Computing Solutions

IBM System x family brochure

Intel Architecture for Software Developers

CPMD Performance Benchmark and Profiling. February 2014

Legal Notices and Important Information

45nm Product Press Briefing. Stephen L. Smith Corporate Vice President Director of Group Operations Digital Enterprise Group

Kevin O Leary, Intel Technical Consulting Engineer

3rd Generation Intel Core Processor Family Quad Core Launch Product Information. April 23, 2012

ENVISION TECHNOLOGY CONFERENCE. Functional intel (ia) BLA PARTHAS, INTEL PLATFORM ARCHITECT

Essentials. Expected Discontinuance Q2'15 Limited 3-year Warranty Yes Extended Warranty Available

Agenda. Sun s x Sun s x86 Strategy. 2. Sun s x86 Product Portfolio. 3. Virtualization < 1 >

NAMD Performance Benchmark and Profiling. January 2015

Intel Workstation Platforms

Cloud Transformation: Data center usage models driving Cloud computing innovation. Jake Smith, Advanced Server Technologies Data Center Group Intel

Data Sheet Fujitsu Server PRIMERGY CX250 S2 Dual Socket Server Node

WHITE PAPER FUJITSU PRIMERGY SERVERS PERFORMANCE REPORT PRIMERGY BX924 S2

LS-DYNA Performance Benchmark and Profiling. October 2017

Intel Cluster Ready Allowed Hardware Variances

Hardware and Software Co-Optimization for Best Cloud Experience

Munara Tolubaeva Technical Consulting Engineer. 3D XPoint is a trademark of Intel Corporation in the U.S. and/or other countries.

Performance and Energy Efficiency of the 14 th Generation Dell PowerEdge Servers

Intel Core X-Series. The Ultimate platform with unprecedented scalability. The Intel Core i9 Extreme Edition processor

HP and CATIA HP Workstations for running Dassault Systèmes CATIA

Making the Cloud Work for You Introducing the Intel Xeon Processor E5 Family

Sun Fire X4170 M2 Server Frequently Asked Questions

Intel Compiler. Advanced Technical Skills (ATS) North America. IBM High Performance Computing February 2010 Y. Joanna Wong, Ph.D.

DELL Reference Configuration Microsoft SQL Server 2008 Fast Track Data Warehouse

Essential Performance and Advanced Security

Dell EMC Ready Bundle for HPC Digital Manufacturing Dassault Systѐmes Simulia Abaqus Performance

C-Series Servers Meat and Tators

Performance Optimizations via Connect-IB and Dynamically Connected Transport Service for Maximum Performance on LS-DYNA

Introduction: Modern computer architecture. The stored program computer and its inherent bottlenecks Multi- and manycore chips and nodes

LAMMPS-KOKKOS Performance Benchmark and Profiling. September 2015

Intel Processor-Based Server Selection Guide

Risk Factors. Rev. 4/19/11

Hubert Nueckel Principal Engineer, Intel. Doug Nelson Technical Lead, Intel. September 2017

Faster Metal Forming Solution with Latest Intel Hardware & Software Technology

Copyright 2017 Intel Corporation

Resources Current and Future Systems. Timothy H. Kaiser, Ph.D.

Dell PowerEdge R910 SQL OLTP Virtualization Study Measuring Performance and Power Improvements of New Intel Xeon E7 Processors and Low-Voltage Memory

Intel Advisor XE Future Release Threading Design & Prototyping Vectorization Assistant

HPC Hardware Overview

Intel Core TM Processor i C Embedded Application Power Guideline Addendum

Transcription:

Accelerating HPC (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing SAAHPC, Knoxville, July 13, 2010

Legal Disclaimer Intel may make changes to specifications and product descriptions at any time, without notice. Performance tests and ratings are measured using specific computer systems and/or components and reflect the approximate performance of Intel products as measured by those tests. Any difference in system hardware or software design or configuration may affect actual performance. Buyers should consult other sources of information to evaluate the performance of systems or components they are considering purchasing. For more information on performance tests and on the performance of Intel products, visit Intel Performance Benchmark Limitations Intel does not control or audit the design or implementation of third party benchmarks or Web sites referenced in this document. Intel encourages all of its customers to visit the referenced Web sites or others where similar performance benchmarks are reported and confirm whether the referenced benchmarks are accurate and reflect performance of systems available for purchase. Intel processor numbers are not a measure of performance. Processor numbers differentiate features within each processor family, not across different processor families. See www.intel.com/products/processor_number for details. Intel, processors, chipsets, and desktop boards may contain design defects or errors known as errata, which may cause the product to deviate from published specifications. Current characterized errata are available on request. Intel Virtualization Technology requires a computer system with a processor, chipset, BIOS, virtual machine monitor (VMM) and applications enabled for virtualization technology. Functionality, performance or other virtualization technology benefits will vary depending on hardware and software configurations. Virtualization technology-enabled BIOS and VMM applications are currently in development. 64-bit computing on Intel architecture requires a computer system with a processor, chipset, BIOS, operating system, device drivers and applications enabled for Intel 64 architecture. Performance will vary depending on your hardware and software configurations. Consult with your system vendor for more information. Intel, Intel Xeon, Intel Core microarchitecture, and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Copyright 2010 Intel Corporation

Agenda Accelerating HPC Intel QuickAssist Technology Update Intel MIC Architecture Summary

Still an Insatiable Need for Computing 1 ZFlops 100 EFlops 10 EFlops 1 EFlops 100 PFlops 10 PFlops 1 PFlops 100 TFlops 10 TFlops 1 TFlops 100 GFlops 10 GFlops 1 GFlops Climate Simulation Genomics Research Medical Imaging 100 MFlops 1993 1999 2005 2011 2017 2023 2029 Source: www.top500.org

What are we doing to accelerate HPC applications? Intel Xeon Processors and Intel MIC Architecture. Support Open Attach for Application Accelerators PCIe Gen 2 today, and PCIe Gen 3 in the future. Enabling FSB & QPI-FPGA cache coherent heterogeneous systems. Intel QuickAssist Technology Initiative. OpenCL Standards efforts.

Intel QuickAssist Technology In-socket FPGA Accelerators Multiple accelerator and attach options with software and ecosystem support Performance and scalability based on customer needs and priorities Simplify The Use and Deployment of Accelerators on Intel Architecture Platforms

Moore s Law: Alive and Well at Intel 65nm 2005 45nm 2007 MANUFACTURING 32nm 2009 22nm 15nm 11nm 8nm 2013 * 2015 * 2011 * DEVELOPMENT RESEARCH 2017 * 2019+ Intel Innovation-Enabled Technology Pipeline is Full

High Performance Micro-Architecture for PetaScale Deployments Tick Tock Tick Tock Tick Tock Tick Tock 65nm 45nm 32nm 22nm Core Harpertown Penryn Nehalem Westmere Sandy Bridge Ivy Bridge Future SSSE3 SSE4.1 New instructions: SSE4.2 AES AVX Future - FMA

Intel Xeon Processor 5600 Series Building on Xeon 5500 Leadership Capabilities 130W 95W 80W 60W (6C) 40W (4C) Higher Frequency Greater performance at the same power New 32nm manufacturing Process Delivering more into the same package Intel Xeon 5600 Intel Xeon 5600 More Cores/More cache DDR3 Memory Up to 6 cores, Up to 12MB Cache. Providing more performance for data intensive workloads PCI Express* 2.0 Intel 82599 10GbE Controller Up to 2 x 1333 MHz DIMMs per channel. Greater performance for bandwidth sensitive applications ICH 10/10R Up to 60% More Performance 1 Better Energy Efficiency New Security Features 1 Source: Internal Intel measurements for Xeon X5680 vs. Xeon X5570 on BlackScholes*. See backup for system configurations. New lower power CPU SKU options for Xeon 5600 9

Intel Xeon Processor 7500 Series Super Node Scalability for HPC Technology Advantages Intel 7500 Chipset Nehalem architecture 8-cores Xeon 7500 Xeon 7500 24MB Shared L3 Cache 64 DIMM slots support up to 1 terabyte of memory (4 sockets) Xeon 7 500 Xeon 7500 72 PCIe Gen2 lanes Scaling from 2-256 sockets Intel Virtualization Technologies PCI Express* 2.0 Memory Intel Scalable Memory Buffer 10 Mission Critical Class Reliability features Increased Resources Larger More Complex Problems Scalable Performance ICH 10/10R Intel 82599 10GbE Controller

From Research to Realization. Intel Many Integrated Core Architecture The Newest Addition to the Intel Server Family. Industry s First General Purpose Many Core Architecture

FIXED FUNCTION LOGIC MEMORY and I/O INTERFACES Intel MIC Architecture: An Intel Co-Processor Architecture VECTOR IA CORE VECTOR IA CORE VECTOR IA CORE VECTOR IA CORE COHERENT CACHE COHERENT CACHE INTERPROCESSOR NETWORK COHERENT CACHE COHERENT CACHE COHERENT CACHE COHERENT CACHE INTERPROCESSOR NETWORK COHERENT CACHE COHERENT CACHE VECTOR IA CORE VECTOR IA CORE VECTOR IA CORE VECTOR IA CORE Many cores and many, many more threads Standard IA programming and memory model

Knights Ferry Software development platform Growing availability through 2010 32 cores, 1.2 GHz 128 threads at 4 threads / core 8MB shared coherent cache 1-2GB GDDR5 Bundled with Intel HPC tools Software development platform for Intel MIC architecture

The Knights Family Future Knights Products Knights Corner 1 st Intel MIC product 22nm process >50 Intel Architecture cores Knights Ferry

Single Source Intel MIC Architecture Programming Compilers and Runtimes Intel Xeon processor Intel MIC architecture co-processor Common with Intel Xeon Languages C, C++, Fortran compilers Intel Xeon processor family Intel developer tools and libraries Coding and optimization techniques Ecosystem support Eliminates Need for Dual Programming Architecture

IA Programming Flexibility Instruction Parallelism Data Parallelism Thread Parallelism Cluster / Process Parallelism Serial Code Node Level Fast Scalar performance, Optimized C/C++,FORTRAN, Threading and Performance Libraries, Debug / Analysis Tools Parallel Node Level Multi-core, Multi-Socket, SSE and AVX instructions, OpenMP, Threading Building Blocks, Performance Libraries, Thread Checker, Ct, Cilk Multi-Node / Cluster Level Cluster Tools, MPI Checker Programming choices and standards for range of parallel efficiency

Summary Scale Performance Forward Optimize your software for multi-core to benefit now, - Intel Xeon 5600 and Intel Xeon 7500 today - Future Ready for Intel Xeon Processors + Intel MIC Architecture Continue to innovate on Intel Platforms for those applications that may benefit from accelerators.

Performance Claim Backup Up to 1.6x performance compared to Xeon 5500 series claim supported by a CPU intensive benchmark (Blackscholes). Intel internal measurement. (Feb 25, 2010) Configuration details: - Blackscholes* Baseline Configuration and Score on Benchmark:- Intel pre-production system with two Intel Xeon processor X5570 (2.93 GHz, 8 MB last level cache, 6.4 GT/sec QPI), 24GB memory (6x4GB DDR3-1333), 4 x 150GB 10K RPM SATA RAID0 for scratch, Red Hat* EL 5 Update 4 64-bit OS. Source: Intel internal testing as of February 2010. SunGard v3.0 source code compiled with Intel v11.0 compiler. Elapsed time to run benchmark: 18.74 seconds. New Configuration and Score on Benchmark:- Intel pre-production system with two Intel Xeon processor X5680 (3.33 GHz, 12 MB last level cache, 6.4 GT/sec QPI), 24GB memory (6x4GB DDR3-1333), 4 x 150GB 10K RPM SATA RAID0 for scratch, Red Hat* EL 5 Update 4 64-bit OS. Source: Intel internal testing as of February 2010. SunGard v3.0 source code compiled with Intel v11.0 compiler. Elapsed time to run benchmark: 11.51 seconds. Up to 40% higher performance/watt compared to Intel Xeon Processor 5500 Series claim supported by performance results on a server side java benchmark in conjunction with power consumption across a load line. Intel internal measurement (Jan 15, 2010) Baseline platform: Intel preproduction server platform with two Quad-Core Intel Xeon processor X5570, 2.93 GHz, 8MB L3 cache, 6.4QPI, 8GB memory (4x2GB DDR3-1333), 1 PSU, Microsoft Windows Server 2008 Enterprise SP2. Intel internal measurement as of January 15,2010. New platform: Intel preproduction server platform with two six-core Intel Xeon processor X5670, 2.93 GHz, 12MB L3 cache, 6.4QPI, 8GB memory (4x2GB DDR3-1333), 1 PSU, Microsoft Windows Server 2008 Enterprise SP2. Intel internal measurement as of January 15, 2010. Intel Xeon processor 5600 series with Intel microarchitecture Nehalem delivers similar performance as previous-generation servers but uses up to 30 percent less power Baseline Configuration and Score on Benchmark: Fujitsu PRIMERGY RX300 S5 system with two Intel Xeon processor sx5570 (2.93 GHz, 8MB L3, 6.4 GT/s, Quad-core, 95W TDP), BIOS rev. R1.09, Turbo Enabled, HT Enabled, NUMA Enabled, 5 x Fans, 24 GB (6x4GB DDR3-1333 DR registered ECC), 1 x Fujitsu MBD2147RC 147GB 10K RPM 2.5 SAS HDD, 1x800W PSU, SLES 11 (X86_64) Kernel 2.6.27.19-5-default. Source: Fujitsu Performance Lab testing as of Mar 2010. SPECint_rate_base2006 score: 250. http://docs.ts.fujitsu.com/dl.aspx?id=0140b19d-56e3-4b24-a01e-26b8a80cfe53 New Configuration and Score on Benchmark: Fujitsu PRIMERGY RX300 S6 system with two Intel Xeon processors L5640 (2.26 GHz, 12MB L3, 5.86 GT/s, Hex-core, 60W TDP), BIOS rev R1.00A, Turbo Enabled, HT Enabled, NUMA Enabled, 5 x Fans, 24 GB (6x4GB DDR3-1333 LV DR registered ECC), 1 x Fujitsu MBD2147RC 147GB 10K RPM 2.5 SAS HDD, 1x800W PSU, SLES 11 (X86_64) Kernel 2.6.27.19-5-default. Source: Fujitsu Performance Lab testing as of Mar 2010. SPECint_rate_base2006 score: 250 http://docs.ts.fujitsu.com/dl.aspx?id=4af74e10-24b1-4cf8-bb3b-9c4f5f177389