Intel High-Performance Computing. Technologies for Engineering

Similar documents
Aim High. Intel Technical Update Teratec 07 Symposium. June 20, Stephen R. Wheat, Ph.D. Director, HPC Digital Enterprise Group

HPC. Accelerating. HPC Advisory Council Lugano, CH March 15 th, Herbert Cornelius Intel

HPC Technology Trends

Accelerating HPC. (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing

Intel Many Integrated Core (MIC) Architecture

Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Intel Xeon Processor E7 v2 Family-Based Platforms

Intel Accelerates Supercomputing. Press Briefing June 19, 2007

John Hengeveld Director of Marketing, HPC Evangelist

Munara Tolubaeva Technical Consulting Engineer. 3D XPoint is a trademark of Intel Corporation in the U.S. and/or other countries.

Risk Factors. Rev. 4/19/11

Extending Energy Efficiency. From Silicon To The Platform. And Beyond Raj Hazra. Director, Systems Technology Lab

From Dual Core to Multi-Core, is everybody ready?

Using Intel VTune Amplifier XE for High Performance Computing

Intel Software Development Products Licensing & Programs Channel EMEA

High Performance Computing The Essential Tool for a Knowledge Economy

Intel: Driving the Future of IT Technologies. Kevin C. Kahn Senior Fellow, Intel Labs Intel Corporation

What s P. Thierry

Intel Architecture for Software Developers

Intel Workstation Technology

Intel HPC Technologies Outlook

Technology challenges and trends over the next decade (A look through a 2030 crystal ball) Al Gara Intel Fellow & Chief HPC System Architect

Intel Cluster Toolkit Compiler Edition 3.2 for Linux* or Windows HPC Server 2008*

Understanding Dual-processors, Hyper-Threading Technology, and Multicore Systems

Intel Workstation Platforms

The Stampede is Coming: A New Petascale Resource for the Open Science Community

Accelerating Insights In the Technical Computing Transformation

UPCRC Overview. Universal Computing Research Centers launched at UC Berkeley and UIUC. Andrew A. Chien. Vice President of Research Intel Corporation

Enhancing Analysis-Based Design with Quad-Core Intel Xeon Processor-Based Workstations

Fujitsu s Approach to Application Centric Petascale Computing

IFS RAPS14 benchmark on 2 nd generation Intel Xeon Phi processor

Intel Cluster Ready Allowed Hardware Variances

Accelerate. HP / Intel. CAE Innovation at any Scale with Optimized Simulation Solutions. Performance. Efficiency. Agility

Multi-Core Microprocessor Chips: Motivation & Challenges

Making Supercomputing More Available and Accessible Windows HPC Server 2008 R2 Beta 2 Microsoft High Performance Computing April, 2010

FAST FORWARD TO YOUR <NEXT> CREATION

Graphics Performance Analyzer for Android

Intel Workstation Platforms

Innovating and Integrating for Communications and Storage

Intel Enterprise Processors Technology

HETEROGENEOUS SYSTEM ARCHITECTURE: PLATFORM FOR THE FUTURE

Intel Enterprise Solutions

Intel Parallel Studio XE 2015

Achieving High Performance. Jim Cownie Principal Engineer SSG/DPD/TCAR Multicore Challenge 2013

Performance Optimizations via Connect-IB and Dynamically Connected Transport Service for Maximum Performance on LS-DYNA

Innovate. Integrate. Innovate. Integrate.

Agenda. Optimization Notice Copyright 2017, Intel Corporation. All rights reserved. *Other names and brands may be claimed as the property of others.

CP2K Performance Benchmark and Profiling. April 2011

A U G U S T 8, S A N T A C L A R A, C A

Intel s Architecture for NFV

Intel Xeon Phi Coprocessor. Technical Resources. Intel Xeon Phi Coprocessor Workshop Pawsey Centre & CSIRO, Aug Intel Xeon Phi Coprocessor

Intel VTune Amplifier XE

Getting Started with Intel SDK for OpenCL Applications

Knights Corner: Your Path to Knights Landing

IBM Power Systems HPC Cluster

Race to Exascale: Opportunities and Challenges. Avinash Sodani, Ph.D. Chief Architect MIC Processor Intel Corporation

LS-DYNA Performance Benchmark and Profiling. October 2017

Välkommen. Intel Anders Huge

Introduction CPS343. Spring Parallel and High Performance Computing. CPS343 (Parallel and HPC) Introduction Spring / 29

2 to 4 Intel Xeon Processor E v3 Family CPUs. Up to 12 SFF Disk Drives for Appliance Model. Up to 6 TB of Main Memory (with GB LRDIMMs)

ACCELERATED COMPUTING: THE PATH FORWARD. Jen-Hsun Huang, Co-Founder and CEO, NVIDIA SC15 Nov. 16, 2015

Philippe Thierry Sr Staff Engineer Intel Corp.

HPCG on Intel Xeon Phi 2 nd Generation, Knights Landing. Alexander Kleymenov and Jongsoo Park Intel Corporation SC16, HPCG BoF

Silvermont. Introducing Next Generation Low Power Microarchitecture: Dadi Perlmutter

Parallel Computer Architecture - Basics -

HPE ProLiant. The right compute for the right workload

2009 International Solid-State Circuits Conference Intel Paper Highlights

Fundamentals of Computers Design

Delivering HPC Performance at Scale

HPC in the Multicore Era

MM5 Modeling System Performance Research and Profiling. March 2009

April 2 nd, Bob Burroughs Director, HPC Solution Sales

Computing architectures Part 2 TMA4280 Introduction to Supercomputing

Trends in HPC (hardware complexity and software challenges)

Software Evaluation Guide for ImTOO* YouTube* to ipod* Converter Downloading YouTube videos to your ipod

Toward a Memory-centric Architecture

Resources Current and Future Systems. Timothy H. Kaiser, Ph.D.

Nehalem Hochleistungsrechnen für reale Anwendungen

Birds of a Feather Presentation

Interconnect Challenges in a Many Core Compute Environment. Jerry Bautista, PhD Gen Mgr, New Business Initiatives Intel, Tech and Manuf Grp

Intel tools for High Performance Python 데이터분석및기타기능을위한고성능 Python

SAS workload performance improvements with IBM XIV Storage System Gen3

Efficiently Introduce Threading using Intel TBB

IBM Power Systems solution for SugarCRM

Fundamentals of Computer Design

Technologies and application performance. Marc Mendez-Bermond HPC Solutions Expert - Dell Technologies September 2017

New Intel 45nm Processors. Reinvented transistors and new products

Future Routing Schemes in Petascale clusters

Bei Wang, Dmitry Prohorov and Carlos Rosales

Introduction to Xeon Phi. Bill Barth January 11, 2013

Resources Current and Future Systems. Timothy H. Kaiser, Ph.D.

Intel and Red Hat. Matty Bakkeren Enterprise Technology Specialist

Using Intel VTune Amplifier XE and Inspector XE in.net environment

PORTING CP2K TO THE INTEL XEON PHI. ARCHER Technical Forum, Wed 30 th July Iain Bethune

The Transition to PCI Express* for Client SSDs

ENVISION TECHNOLOGY CONFERENCE. Functional intel (ia) BLA PARTHAS, INTEL PLATFORM ARCHITECT

Intel Architecture Press Briefing

Enabling Performance-per-Watt Gains in High-Performance Cluster Computing

Intel Xeon Phi архитектура, модели программирования, оптимизация.

Continuing Moore s law

(ii) Why are we going to multi-core chips to find performance? Because we have to.

Transcription:

6. LS-DYNA Anwenderforum, Frankenthal 2007 Keynote-Vorträge II Intel High-Performance Computing Technologies for Engineering H. Cornelius Intel GmbH A - II - 29

Keynote-Vorträge II 6. LS-DYNA Anwenderforum, Frankenthal 2007 Intel High-Performance Computing Technologies for Engineering October 2007 Dr. Herbert Cornelius Director Advanced Computing Center Intel EMEA Risk Factors Today s presentation contains forward-looking statements. All statements made that are not historical facts are subject to a number of risks and uncertainties, and actual results may differ materially. Please refer to our most recent Earnings Release and our most recent Form 10-Q or 10-K filing available on our website for more information on the risk factors that could cause actual results to differ. Performance tests and ratings are measured using specific computer systems and/or components and reflect the approximate performance of Intel products as measured by those tests. Any difference in system hardware or software design or configuration may affect actual performance. Buyers should consult other sources of information to evaluate the performance of systems or components they are considering purchasing. For more information on performance tests and on the performance of Intel products, visit Intel Performance Benchmark Limitations (http://www.intel.com/performance/resources/limits.htm). All information provided related to future Intel products and plans is preliminary and subject to change at any time, without notice. THIS DOCUMENT AND RELATED MATERIALS AND INFORMATION ARE PROVIDED "AS IS" WITH NO WARRANTIES, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO ANY IMPLIED WARRANTY OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE, NON-INFRINGEMENT OF INTELLECTUAL PROPERTY RIGHTS, OR ANY WARRANTY OTHERWISE ARISING OUT OF ANY PROPOSAL, SPECIFICATION, OR SAMPLE. INTEL ASSUMES NO RESPONSIBILITY FOR ANY ERRORS CONTAINED IN THIS DOCUMENT AND HAS NO LIABILITIES OR OBLIGATIONS FOR ANY DAMAGES ARISING FROM OR IN CONNECTION WITH THE USE OF THIS DOCUMENT. Copyright Intel Corporation. All rights reserved. A - II - 30

6. LS-DYNA Anwenderforum, Frankenthal 2007 Keynote-Vorträge II The Three Pillars of Science & Engineering Theory Experiment Numerical Simulation High-Performance Computing GFLOPS [10 9 ] TFLOPS [10 [10 12 12 ] PFLOPS [10 [10 15 15 ] yesterday today tomorrow From Supercomputers to Supercomputing A Strategic Investment *Other names and brands may be claimed as the property of others. A - II - 31

Keynote-Vorträge II 6. LS-DYNA Anwenderforum, Frankenthal 2007 CAx Challenges Shorten Design and Development Time Shorten time-to to-market/money Better Products in all respect Reduce expensive and time consuming mechanical tests Reduce material (costs( costs, weight, new) Optimize design specific features Keep competetive advantage Solution: High-Performance Computing (HPC) Numerical analysis and simulations on high- performance computer systems Increase capability and capacity More flexibiity and agility to changes More cost effective Simulate physical difficult tasks Better optimization within the design space Changing the way CAD/CAE tools can be used From serial to simultaneous workflows, reducing the time between an idea and a finished product using HPC Technologies A - II - 32

5 nm 6. LS-DYNA Anwenderforum, Frankenthal 2007 Keynote-Vorträge II Moore s Law is (still) Leading the Chip Industry Transistor density on a chip doubles every 18-24 month Ongoing Transistor Miniaturization 90nm 2003 65nm 2005 45nm 2007 32nm 2009 22nm 2011+ 50 nm 35 nm 30 nm Manufacturing Development Research (forecast) 20 nm 10 nm earch (forecast) 5 nm Moore s Law is (still) leading the Chip Industry Nanowire Future options subject to change; Source: Intel All products, dates, and figures are preliminary and are subject to change without any notice. A - II - 33

Keynote-Vorträge II 6. LS-DYNA Anwenderforum, Frankenthal 2007 45nm Volume Manufacturing in 2007 Penryn Die Photo Higher Performance with lower Power Consumption Evolving Value Proposition 1960s-1980s 1980s PERFORMANCE 1990s PRICE/PERFORMANCE 2000s PRICE/PERFORMANCE/WATT All Segments: HANDHELDS CLIENTS SERVERS HPC A - II - 34

6. LS-DYNA Anwenderforum, Frankenthal 2007 Keynote-Vorträge II Industry Trend to Multi/Many-Core Intel Tera-Scale Computing Research Program: www.intel.com/go/terascale Many-Core Multi-Core Dual-Core Hyper-Threading QUAD-CORE Multi Processor Energy Efficient Petascale with Multi-threaded Cores Intel Processors & HPC Large Node, Scalable Shared Memory Largest shared memory to 1000s of CPUs and up to 1 PB Highest level of system RAS for large system stability I/O and Compute Scalability Clustered applications requiring large memory footprints Tigerton brings a new dimension of MP computing to HPC (2H 07) Optimal for General Purpose Clusters Leading 2-way performance and energy efficiency Best FLOPS/$, increased density with new Intel Quad-Core Processors Enhanced DP platform for HPC and WS (2H 07) Best Bus Bandwidth (UP Cluster) Higher bytes/flop and bandwidth density Form factor, performance/$ for personal & departmental clusters 9000 Sequence 7x00 Sequence 5x00 Sequence 3x00 Sequence Optimized solutions for any HPC workload A - II - 35

Keynote-Vorträge II 6. LS-DYNA Anwenderforum, Frankenthal 2007 Multi-Threaded Cores Intel Tera-Scale Computing Research Program: www.intel.com/go/terascale All Large Core Many Small Cores Mixed Large and Small Core All Small Core Energy Efficient Petascale with Multi-threaded Cores Note: the above pictures don t represent any current or future Intel products Multi/Many-Core Chip Research Future tera-scale chips could use an array of tens to hundreds of cores with reconfigurable caches, as well as special-purpose hardware accelerators utilizing a scalable on-die interconnect fabric. All plans, products, dates, and figures are preliminary and are subject to change without any notice. A - II - 36

6. LS-DYNA Anwenderforum, Frankenthal 2007 Keynote-Vorträge II NEED FOR SPEED & DENSITY Tera-Scale Computing TERAOPS Performance TERABYTES Memory Bandwidth TERABITS I/O Throughput 1 TFLOPS on a Chip 62W Experimental Research Test Chip 100M Transistors 80 Tiles 275mm 2 A - II - 37

Keynote-Vorträge II 6. LS-DYNA Anwenderforum, Frankenthal 2007 TERAFLOP OF PERFORMANCE 8 0 CORES ROUTER 22 mm CORE 13.75 mm Experimental Research Prototype 2006 Intel Corporation. All rights reserved. Source: Intel TERABYTES OF BANDWIDTH Ongoing Research 2006 Intel Corporation. All rights reserved. Source: Intel A - II - 38

6. LS-DYNA Anwenderforum, Frankenthal 2007 Keynote-Vorträge II TERABITS OF I/O-THROUGHPUT LASER MODULATOR PHOTO-DETECTOR LIGHT GUIDES SELF-ALIGNMENT 2006 Intel Corporation. All rights reserved. BUILDING BLOCKS OF SILICON PHOTONICS Source: Intel Industry s s First 45nm High-K Products Coming 2H 07/Q1 07/Q1 0808 Quad Core Dual Core Server Desktop Mobile Intel Forecast A - II - 39

Keynote-Vorträge II 6. LS-DYNA Anwenderforum, Frankenthal 2007 Tick/Tock: Our Model for Sustained Microprocessor Leadership Intel Core Sandy Bridge Penryn Nehalem Westmere NEW Microarchitecture Compaction/ Derivative NEW Microarchitecture Compaction/ Derivative NEW Microarchitecture 65nm 45nm 45nm 32nm 32nm 2006 2007 2008 2009 2010 Forecast Future options subject to change without notice. Potential HW-Accelerator Options Node Fabric Fabric Multi-Cores Geneseo/PCIe (Gen 2) n p On-Die o Socket Intel QuickAssist Technology Other brands and names are the property of their respective owners. A - II - 40 Future options subject to change without notice. Enabling Partnerships

6. LS-DYNA Anwenderforum, Frankenthal 2007 Keynote-Vorträge II Chip revolution poses problems for programmers: Software developers face a culture shock as they grapple with the next generation of microprocessors NewScientist, 10 March 2007 Intel s s Software Tools and Support for HPC Intel Cluster Toolkit Intel Thread Checker Intel Thread Profiler Intel Threading Building Blocks Cluster Tools ISN & ISC ISS Performance Compatibility Support Productivity Cross-Platform www.intel.com/software A - II - 41

Keynote-Vorträge II 6. LS-DYNA Anwenderforum, Frankenthal 2007 Levels of Parallelism Intel SW Development Tools for HPC Serial Core Level Multi-Core/SMP Core/SMP- Node level Multi-Node Cluster Level Grid Level Programming Model Implementation Correctness & Debugging C/C++ FORTRAN95 IDB Auto-Parallelization OpenMP* TBB Thread Checker ClusterOpenMP Intel MPI Message Checker IDB-MPP Intel GPE (UNICORE) Performance Libraries Performance Analysis MKL IPP VTune MKL IPP VTune Thread Profiler Cluster MKL Trace Analyzer *Other names and brands may be claimed as the property of others. Intel Software Tools for Parallelism Architectural Analysis Introduce Parallelism Confidence / Correctness Optimize / Tune Visualization of parallel (threaded or MPI) application execution and communication behavior give valuable insights for application architects and programmers Highly optimized OpenMP* and MPI library and run-time system for scalable solutions MKL threaded and distributed mathematical library Detecting actual and potential Threading and MPI programming and API issues - to address challenges unique to parallel programming Valuable insights for performance and scalability tuning of threaded and MPI applications *Other names and brands may be claimed as the property of others. A - II - 42

6. LS-DYNA Anwenderforum, Frankenthal 2007 Keynote-Vorträge II Intel Cluster Ready A Program to make it easier for end users to buy, deploy and use clusters. Backed by reference implementations on Intel Server Platforms (recipes) Including tools to check compliance A three-way collaboration between System Vendors/Integrators, Software Vendors, and Intel. Systems Vendors (OEMs) and Integrators solutions certified as compliant with the specification May resell reference recipes or implement their own ISVs applications registered as compliant with the specification Registered applications will run out-of of-the box on any compliant cluster More than a Cluster Solution Specification Covering the Globe With Software R&D Labs Over 50 Development Sites Across More Than 20 Countries A - II - 43

Keynote-Vorträge II 6. LS-DYNA Anwenderforum, Frankenthal 2007 Conclusions End-Users: The available compute power will continue to increase with flexible system architectures to meet the applications demand and requirements. Software Vendors: Extract the available computational power via parallelization on all levels (multi-core, clusters) utilizing parallel software development tools for new and advanced functionality. IT: Energy and cost efficient solutions continue to evolve providing the needed flexibility and business agility with improved capacity and capabilities. A - II - 44