Future of Compute is Big Data. Mark Seager Intel Fellow, CTO for HPC Ecosystem Intel Corporation

Similar documents
High Performance Computing The Essential Tool for a Knowledge Economy

Mark Seager Intel Fellow, CTO for the HPC Ecosystem RackScale Design Group. Presented to icas14, Annecy France September 11, 2017

Technology challenges and trends over the next decade (A look through a 2030 crystal ball) Al Gara Intel Fellow & Chief HPC System Architect

unleashed the future Intel Xeon Scalable Processors for High Performance Computing Alexey Belogortsev Field Application Engineer

Re-Architecting Cloud Storage with Intel 3D XPoint Technology and Intel 3D NAND SSDs

Evolving Small Cells. Udayan Mukherjee Senior Principal Engineer and Director (Wireless Infrastructure)

Intel Many Integrated Core (MIC) Architecture

Intel HPC Technologies Outlook

April 2 nd, Bob Burroughs Director, HPC Solution Sales

RE-IMAGINING THE DATACENTER. Lynn Comp Director of Datacenter Solutions and Technologies

Efficient Parallel Programming on Xeon Phi for Exascale

HPCG on Intel Xeon Phi 2 nd Generation, Knights Landing. Alexander Kleymenov and Jongsoo Park Intel Corporation SC16, HPCG BoF

Maximize Performance and Scalability of RADIOSS* Structural Analysis Software on Intel Xeon Processor E7 v2 Family-Based Platforms

Accelerating Insights In the Technical Computing Transformation

Drive Recovery Panel

Intel s Architecture for NFV

What s P. Thierry

Dr. Jean-Laurent PHILIPPE, PhD EMEA HPC Technical Sales Specialist. With Dell Amsterdam, October 27, 2016

IXPUG 16. Dmitry Durnov, Intel MPI team

Intel Xeon Phi Coprocessor. Technical Resources. Intel Xeon Phi Coprocessor Workshop Pawsey Centre & CSIRO, Aug Intel Xeon Phi Coprocessor

Simón Viñals Larruga. Director de Tecnología, Intel Corp. Iberia. HPCadmintech, Madrid, Marzo 2016

OpenCL* and Microsoft DirectX* Video Acceleration Surface Sharing

Intel SSD Data center evolution

Interconnect Your Future

Välkommen. Intel Anders Huge

Performance Evaluation of NWChem Ab-Initio Molecular Dynamics (AIMD) Simulations on the Intel Xeon Phi Processor

Intel & Lustre: LUG Micah Bhakti

Intel. Rack Scale Design: A Deeper Perspective on Software Manageability for the Open Compute Project Community. Mohan J. Kumar Intel Fellow

Sample for OpenCL* and DirectX* Video Acceleration Surface Sharing

Desktop 4th Generation Intel Core, Intel Pentium, and Intel Celeron Processor Families and Intel Xeon Processor E3-1268L v3

Andreas Schneider. Markus Leberecht. Senior Cloud Solution Architect, Intel Deutschland. Distribution Sales Manager, Intel Deutschland

Bei Wang, Dmitry Prohorov and Carlos Rosales

Intel Atom Processor Based Platform Technologies. Intelligent Systems Group Intel Corporation

Small File I/O Performance in Lustre. Mikhail Pershin, Joe Gmitter Intel HPDD April 2018

Fast-track Hybrid IT Transformation with Intel Data Center Blocks for Cloud

Expand Your HPC Market Reach and Grow Your Sales with Intel Cluster Ready

Ravindra Babu Ganapathi

Optimization of Lustre* performance using a mix of fabric cards

Extending Energy Efficiency. From Silicon To The Platform. And Beyond Raj Hazra. Director, Systems Technology Lab

Intel Atom Processor D2000 Series and N2000 Series Embedded Application Power Guideline Addendum January 2012

Omni-Path Cluster Configurator

Intel and the Future of Consumer Electronics. Shahrokh Shahidzadeh Sr. Principal Technologist

Risk Factors. Rev. 4/19/11

Gen-Z Memory-Driven Computing

Intel Core TM i7-4702ec Processor for Communications Infrastructure

investor meeting SANTA CLARA Diane Bryant Senior Vice President & General Manager Data Center Group

Becca Paren Cluster Systems Engineer Software and Services Group. May 2017

Intel optane memory as platform accelerator. Vladimir Knyazkin

12th ANNUAL WORKSHOP 2016 NVME OVER FABRICS. Presented by Phil Cayton Intel Corporation. April 6th, 2016

Opportunities and Challenges in Sparse Linear Algebra on Many-Core Processors with High-Bandwidth Memory

A U G U S T 8, S A N T A C L A R A, C A

LNet Roadmap & Development. Amir Shehata Lustre * Network Engineer Intel High Performance Data Division

Data and Intelligence in Storage Carol Wilder Intel Corporation

Real World Development examples of systems / iot

Intel Xeon Processor E v3 Family

HPC. Accelerating. HPC Advisory Council Lugano, CH March 15 th, Herbert Cornelius Intel

Intel Omni-Path Architecture

Vectorization Advisor: getting started

Forging a Future in Memory: New Technologies, New Markets, New Applications. Ed Doller Chief Technology Officer

Munara Tolubaeva Technical Consulting Engineer. 3D XPoint is a trademark of Intel Corporation in the U.S. and/or other countries.

Lustre Beyond HPC. Presented to the Lustre* User Group Beijing October 2013

Non-Volatile Memory Cache Enhancements: Turbo-Charging Client Platform Performance

Intel Omni-Path Fabric Manager GUI Software

Bitonic Sorting. Intel SDK for OpenCL* Applications Sample Documentation. Copyright Intel Corporation. All Rights Reserved

Intel Atom Processor E6xx Series Embedded Application Power Guideline Addendum January 2012

Accelerating Real-Time Big Data. Breaking the limitations of captive NVMe storage

Accelerating HPC. (Nash) Dr. Avinash Palaniswamy High Performance Computing Data Center Group Marketing

CSD3 The Cambridge Service for Data Driven Discovery. A New National HPC Service for Data Intensive science

Mobility: Innovation Unleashed!

Sayantan Sur, Intel. ExaComm Workshop held in conjunction with ISC 2018

Intel Core TM Processor i C Embedded Application Power Guideline Addendum

ENVISION TECHNOLOGY CONFERENCE. Ethernet TSN Overview ANIL N. KUMAR, INTEL PRINCIPAL ENGINEER

Intel Cluster Checker 3.0 webinar

The Transition to PCI Express* for Client SSDs

SSDs Going Mainstream Do you believe? Tom Rampone Intel Vice President/General Manager Intel NAND Solutions Group

Agenda. Optimization Notice Copyright 2017, Intel Corporation. All rights reserved. *Other names and brands may be claimed as the property of others.

Innovation Accelerating Mission Critical Infrastructure

Intel and SAP Realising the Value of your Data

IFS RAPS14 benchmark on 2 nd generation Intel Xeon Phi processor

MICHAL MROZEK ZBIGNIEW ZDANOWICZ

Data-Centric Innovation Summit ALPER ILKBAHAR VICE PRESIDENT & GENERAL MANAGER MEMORY & STORAGE SOLUTIONS, DATA CENTER GROUP

A Breakthrough in Non-Volatile Memory Technology FUJITSU LIMITED

DDN About Us Solving Large Enterprise and Web Scale Challenges

Jim Pappas Director of Technology Initiatives, Intel Vice-Chair, Storage Networking Industry Association (SNIA) December 07, 2018

Innovating and Integrating for Communications and Storage

Create a Flexible, Scalable High-Performance Storage Cluster with WekaIO Matrix

HPCG Results on IA: What does it tell about architecture?

Intel Data Center Manager. Data center IT agility and control

Intel Cache Acceleration Software for Windows* Workstation

Contributors: Surabhi Jain, Gengbin Zheng, Maria Garzaran, Jim Cownie, Taru Doodi, and Terry L. Wilmarth

Olaf Weber Senior Software Engineer SGI Storage Software. Amir Shehata Lustre Network Engineer Intel High Performance Data Division

FlashGrid Software Enables Converged and Hyper-Converged Appliances for Oracle* RAC

OpenMP * 4 Support in Clang * / LLVM * Andrey Bokhanko, Intel

Bitonic Sorting Intel OpenCL SDK Sample Documentation

THE STORAGE PERFORMANCE DEVELOPMENT KIT AND NVME-OF

Entry-level Intel RAID RS3 Controller Family

ABySS Performance Benchmark and Profiling. May 2010

Theory and Practice of the Low-Power SATA Spec DevSleep

Re- I m a g i n i n g D a t a C e n t e r S t o r a g e a n d M e m o r y

IBM Data Center Networking in Support of Dynamic Infrastructure

Transcription:

Future of Compute is Big Data Mark Seager Intel Fellow, CTO for HPC Ecosystem Intel Corporation

Traditional HPC is scientific simulation First ever Nobel prize for HPC takes the experiment into cyberspace Chemical reactions occur at lightning speed; electrons jump between atoms hidden from the prying eyes of scientists. The Nobel Laureates in Chemistry 2013 have made it possible to map the mysterious ways of chemistry by using computers. Detailed knowledge of chemical processes makes it possible to optimize catalysts, drugs and solar cells. Spanning multiple scales by combining quantum mechanics and molecular dynamics 2 Martin Karplus Michael Levitt Arieh Warshel The Nobel Prize in Chemistry 2013 was awarded jointly to Martin Karplus, Michael Levitt and Arieh Warshel "for the development of multi-scale models for complex chemical systems".

Other emerging technical computing usage models are driven by Big Data Government & Research Commerce & Industry New Users & New Uses My goal is simple. It is complete understanding of the universe, why it is as it is, and why it exists at all Stephen Hawking Better Products Digital Twin Faster Time to Market Reduced R&D New Business Models Data Services Genomics Clinical Information From Diagnosis to personalized treatments quickly Basic Science Business Transformation Data-Driven Discovery Transform data into useful knowledge Source: IDC: Worldwide Technical Computing Server 2013 2017 Forecast; Other brands, names, and images are the property of their respective owners. Other brands, names and images may be claimed as the property of others

Data-Driven Discovery Hypothesis Formation Modeling & Prediction Drug Discovery Treatment Optimization Astronomy Particle Physics Public Policy Trend Analysis Genome Data Medical Imaging Clininical Trials Sensor Data Images Sim Data Census Data Text A/V Surveys Life Sciences Physical Sciences Social Sciences

Data-Driven Discovery in Science CERN NextBio 600 million collisions / sec 1 human genome = 1 petabyte 5 Detecting 1 in 1 trillion events to help find the Higgs Boson Finding patterns in clinical and genome data at scale can cure cancer and other diseases

Source: Rick Stevens, Argonne Nat Lab Project Moonshot for Cancer: Predictive Oncology Pilot 2 6

Convergence of Driving Forces Intelligent Devices Cloud HPC Big Data 19B Connected devices by 2016 1 $200B Cloud services In 2016 2 2X Annual growth in supercomputing FLOPS 3 15PB Data collected in 1 year at CERN 4 1 Source: Cisco Visual Networking Index (VNI) Forecast (2011-2016) 2 Source: Gartner Worldwide IT Spending Forecast, 2Q12 Update 3 Source: Top 500 list: Top 10 change from November 2007 to November 2012 4 Source: CERN * Other names and brands may be claimed as the property of others.

Intelligent Devices - New Era of Computing Enabling an Industry of Pervasive Computing Focused Segments Health Smart Mfg Automotive Sector Retail Sector Gaming Infotainment Visual Retailing Imaging Mil/Aero ADAS Autonomous Transactional Retailing Retailing Services Energy DSS CONNECTED MANAGED SECURED DATA ANALYTICS SAFETY

Example: CIA Mission Drives The Cycle * Other names and brands may be claimed as the property of others.

Intel Scalable System Framework A Holistic Solution for All HPC Needs Compute Fabric Memory / Storage Software Small Clusters Through Peta and Exascale Compute and Data-Centric Computing Standards-Based Programmability IA and HPC Ecosystem Enabling On-Premise and Cloud-Based Intel Xeon Processors Intel Xeon Phi Processors Intel FPGAs and Server Solutions Intel Solutions for Lustre* Intel Optane Technology 3D XPoint Technology Intel SSDs Intel Omni-Path Architecture Intel Silicon Photonics Intel Ethernet Intel HPC Orchestrator Intel Software Tools Intel Cluster Ready Program Intel Supported SDVis 1

SSF: Enabling Configurability & Scalability from components to racks to clusters SSF Path To Exascale SSF for Scalable Clusters Cluster Rack Chassis Simple, dense, & configurable I/O Compute Adaptive & configurable Right sized & configurable Memory Enough capacity to support apps Xeon or Xeon-Phi based on workload needs Compute flexibly aggregated Lowest latency compute to compute interconnect I/O Topologies for best performance Configurable I/O bandwidth director switch Burst buffer to decouple storage from I/O 2

Higher Bandwidth. Lower Latency and Capacity Processor Tighter System-Level Integration Innovative Memory-Storage Hierarchy Today Future Compute Compute Intel SSF Memory Caches Caches Memory Bus Local memory is now faster & in processor package On-Package High Bandwidth Memory* Local Memory Much larger memory capacities keep data in local memory Intel DIMMs based on 3D XPoint Technology Local Memory Compute Node I/O Node Remote Storage Local Storage SSD Storage Parallel File System (Hard Drive Storage) I/O Node storage moves to compute node Some remote data moves onto I/O node Intel Optane Technology SSDs Burst Buffer Node with Intel Optane Technology SSDs Parallel File System (Hard Drive Storage) *cache, memory or hybrid mode 18

Bringing Memory Back Into Balance High Bandwidth, On-Package Memory Intel SSF Memory Up to 16GB with Knights Landing 5x the Bandwidth vs DDR4 1, >400 GB/s 1 >5x More Energy Efficient vs GDDR5 2 >3x More Dense vs GDDR5 2 3 Modes of Operation Flat Mode: Acts as Memory Cache Mode: Acts as Cache Hybrid Mode: Mix of Cache and Flat 1 Projected result based on internal Intel analysis of STREAM benchmark using a Knights Landing processor with 16GB of ultra high-bandwidth versus DDR4 memory with all channels populated. 2 Projected result based on internal Intel analysis comparison of 16GB of ultra high-bandwidth memory to 16GB of GDDR5 memory used in the Intel Xeon Phi coprocessor 7120P. 1 9

NAND Flash and 3D XPoint Technology 3D MLC and TLC NAND 3D XPoint Technology Selector Memory Cell Enabling highest capacity SSDs at the lowest price Enabling highest performance SSDs and expanding use cases

15 3D Xpoint TM Technology

PERFORMANCE CPU-Fabric Integration with the Intel Omni-Path Architecture KEY VALUE VECTORS Performance Density Cost Power Reliability Tighter Integration Multi-chip Package Integration Intel OPA Intel OPA Future Generations Additional integration, improvements, and features Next generation Intel Xeon Phi processor Future Intel Xeon processor (14nm) Intel SSF Fabric Intel OPA HFI Card Intel Xeon Phi 7220 (KNL) processor Next generation Intel Xeon processor Intel Xeon processor E5-2600 v3 TIME 16

OmniPath is Optimized for scalability Intel SSF Fabric SWITCH CHIPS REQUIRED 1 1000 900 800 700 600 500 400 300 200 100 FEWER SWITCHES REQUIRED InfiniBand* 36-port switch 0 Nodes 249 499 749 999 1249 1499 1749 1999 2249 2499 2749 2999 3249 3499 NODES Intel OPA 48-port switch 1000 500 0 More Servers Same Budget 750 Servers 932 Mellanox EDR Intel OPA Up to 24% more servers 1 1 Configuration assumes a 750-node cluster, and number of switch chips required is based on a full bisectional bandwidth (FBB) Fat-Tree configuration. Intel OPA uses one fully-populated 768-port director switch, and Mellanox EDR solution uses a combination of 648-port director switches and 36-port edge switches. Intel and Mellanox component pricing from www.kernelsoftware.com, with prices as of May 5, 2016. Compute node pricing based on Dell PowerEdge R730 server from www.dell.com, with prices as of November 3, 2015. Intel OPA pricing based on estimated reseller pricing based on projected Intel MSRP pricing at time of launch. * Other names and brands may be claimed as property of others. 17

SWITCH Compute Fabric SWITCH SAN SWITCH IO Fabric CN CN... CN CN CN... CN New storage paradigm for data intensive systems HOT Crystal Ridge ~200 TB/s ~15 PB ION ION WARM 3D NAND ~10-50 TB/s ~150 PB MDS MDS OSS OSS GW GW COLD SSD+HDD ~1-2 TB/s ~250 PB MDS MDS OSS OSS SSF Enables HPC+HPDA workloads System components can be configured to match workload requirements Enables new access methodologies (DAOS) to create new generation applications Incremental improvements to Lustre to provide enhanced performance for existing applications Intel SSF SOFTWARE Distributed Asynchronous Object Storage... CN CN... CN ION Load/Store Get/Put DAOS-M Extreme BW+LL Get/Put DAOS LNET High BW+IOPS DAOS LNET High file create Integrated Fabric Persistent Memory Block Storage Applications Tools Top-level APIs DAOS-Cache/Teiring DAOS-Sharding/Resilience DAOS-Persistent Memory 18

Benefit from Intel s long-standing investments Systems Architecture Manufacturing Leadership Energy Efficient Performance Security Software Global Ecosystem

Legal Information INFORMATION IN THIS DOCUMENT IS PROVIDED AS IS. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. INTEL ASSUMES NO LIABILITY WHATSOEVER AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO THIS INFORMATION INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. Intel product plans in this presentation do not constitute Intel plan of record product roadmaps. Please contact your Intel representative to obtain Intel's current plan of record product roadmaps. Intel's compilers may or may not optimize to the same degree for non-intel microprocessors for optimizations that are not unique to Intel microprocessors. These optimizations include SSE2, SSE3, and SSE3 instruction sets and other optimizations. Intel does not guarantee the availability, functionality, or effectiveness of any optimization on microprocessors not manufactured by Intel. Microprocessor-dependent optimizations in this product are intended for use with Intel microprocessors. Certain optimizations not specific to Intel microarchitecture are reserved for Intel microprocessors. Please refer to the applicable product User and Reference Guides for more information regarding the specific instruction sets covered by this notice. Notice revision #20110804 All products, computer systems, dates, and figures specified are preliminary based on current expectations, and are subject to change without notice. Intel processor numbers are not a measure of performance. Processor numbers differentiate features within each processor family, not across different processor families. Go to: http://www.intel.com/products/processor_number Intel, processors, chipsets, and desktop boards may contain design defects or errors known as errata, which may cause the product to deviate from published specifications. Current characterized errata are available on request. Intel, Intel Xeon, Intel Xeon Phi, the Intel Xeon Phi logo, the Intel Xeon logo and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Intel does not control or audit the design or implementation of third party benchmark data or Web sites referenced in this document. Intel encourages all of its customers to visit the referenced Web sites or others where similar performance benchmark data are reported and confirm whether the referenced benchmark data are accurate and reflect performance of systems available for purchase. Other names and brands may be claimed as the property of others. Copyright 2013, Intel Corporation. All rights reserved.