Fujitsu SC 18. Human Centric Innovation Co-creation for Success 2018 FUJITSU

Similar documents
A Breakthrough in Non-Volatile Memory Technology FUJITSU LIMITED

NEXTGenIO Performance Tools for In-Memory I/O

ECMWF's Next Generation IO for the IFS Model and Product Generation

ECMWF s Next Generation IO for the IFS Model

I/O Profiling Towards the Exascale

Analyzing I/O Performance on a NEXTGenIO Class System

Windows Support for PM. Tom Talpey, Microsoft

Windows Support for PM. Tom Talpey, Microsoft

DataON and Intel Select Hyper-Converged Infrastructure (HCI) Maximizes IOPS Performance for Windows Server Software-Defined Storage

Flash Memory Summit Persistent Memory - NVDIMMs

Approaches to I/O Scalability Challenges in the ECMWF Forecasting System

Gen-Z Memory-Driven Computing

HPE ProLiant Gen10. Franz Weberberger Presales Consultant Server

NVMFS: A New File System Designed Specifically to Take Advantage of Nonvolatile Memory

Data-Centric Innovation Summit ALPER ILKBAHAR VICE PRESIDENT & GENERAL MANAGER MEMORY & STORAGE SOLUTIONS, DATA CENTER GROUP

Innovator, Disruptor or Laggard, Where will your storage applications live? Next generation storage

IBM Power Systems HPC Cluster

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme

IME (Infinite Memory Engine) Extreme Application Acceleration & Highly Efficient I/O Provisioning

Fast and Easy Persistent Storage for Docker* Containers with Storidge and Intel

THE IN-PLACE WORKING STORAGE TIER OPPORTUNITIES FOR SOFTWARE INNOVATORS KEN GIBSON, INTEL, DIRECTOR MEMORY SW ARCHITECTURE

March NVM Solutions Group

Persistent Memory. High Speed and Low Latency. White Paper M-WP006

Leveraging Software-Defined Storage to Meet Today and Tomorrow s Infrastructure Demands

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme

DDN. DDN Updates. DataDirect Neworks Japan, Inc Nobu Hashizume. DDN Storage 2018 DDN Storage 1

Cisco and Cloudera Deliver WorldClass Solutions for Powering the Enterprise Data Hub alerts, etc. Organizations need the right technology and infrastr

PM Support in Linux and Windows. Dr. Stephen Bates, CTO, Eideticom Neal Christiansen, Principal Development Lead, Microsoft

Intel Select Solutions for Professional Visualization with Advantech Servers & Appliances

CS6453. Data-Intensive Systems: Rachit Agarwal. Technology trends, Emerging challenges & opportuni=es

Intel SSD Data center evolution

ADVANCED IN-MEMORY COMPUTING USING SUPERMICRO MEMX SOLUTION

Introducing NVDIMM-X: Designed to be the World s Fastest NAND-Based SSD Architecture and a Platform for the Next Generation of New Media SSDs

SoftFlash: Programmable Storage in Future Data Centers Jae Do Researcher, Microsoft Research

Using NVDIMM under KVM. Applications of persistent memory in virtualization

Architected for Performance. NVMe over Fabrics. September 20 th, Brandon Hoff, Broadcom.

Efficient Memory Mapped File I/O for In-Memory File Systems. Jungsik Choi, Jiwon Kim, Hwansoo Han

Toward a Memory-centric Architecture

Re- I m a g i n i n g D a t a C e n t e r S t o r a g e a n d M e m o r y

Impact on Application Development: SNIA NVM Programming Model in the Real World. Andy Rudoff pmem SW Architect, Intel

NVM Express TM Ecosystem Enabling PCIe NVMe Architectures

Hewlett Packard Enterprise HPE GEN10 PERSISTENT MEMORY PERFORMANCE THROUGH PERSISTENCE

NVDIMM-N Cookbook: A Soup-to-Nuts Primer on Using NVDIMM-Ns to Improve Your Storage Performance

Adrian Proctor Vice President, Marketing Viking Technology

Full Featured with Maximum Flexibility for Expansion

Expand In-Memory Capacity at a Fraction of the Cost of DRAM: AMD EPYCTM and Ultrastar

APIs for Persistent Memory Programming

IBM Power Advanced Compute (AC) AC922 Server

SGI Overview. HPC User Forum Dearborn, Michigan September 17 th, 2012

Accessing NVM Locally and over RDMA Challenges and Opportunities

NVM Express over Fabrics Storage Solutions for Real-time Analytics

DDN. DDN Updates. Data DirectNeworks Japan, Inc Shuichi Ihara. DDN Storage 2017 DDN Storage

STAR-CCM+ Performance Benchmark and Profiling. July 2014

PCIe Storage Beyond SSDs

Results from TSUBAME3.0 A 47 AI- PFLOPS System for HPC & AI Convergence

PowerServe HPC/AI Servers

Storage and Memory Hierarchy in HPC: New Paradigm and New Solutions with Intel Dr. Jean-Laurent Philippe

Accelerating Microsoft SQL Server Performance With NVDIMM-N on Dell EMC PowerEdge R740

TECHNICAL OVERVIEW ACCELERATED COMPUTING AND THE DEMOCRATIZATION OF SUPERCOMPUTING

New Approach to Unstructured Data

What You can Do with NVDIMMs. Rob Peglar President, Advanced Computation and Storage LLC

Benchmark: In-Memory Database System (IMDS) Deployed on NVDIMM

Store Process Analyze Collaborate Archive Cloud The HPC Storage Leader Invent Discover Compete

Create a Flexible, Scalable High-Performance Storage Cluster with WekaIO Matrix

HCI: Hyper-Converged Infrastructure

3D Xpoint Status and Forecast 2017

Evolution of Rack Scale Architecture Storage

Density Optimized System Enabling Next-Gen Performance

Building NVLink for Developers

Creating Storage Class Persistent Memory With NVDIMM

Data center day. Non-volatile memory. Rob Crooke. August 27, Senior Vice President, General Manager Non-Volatile Memory Solutions Group

DDN About Us Solving Large Enterprise and Web Scale Challenges

VEXATA FOR ORACLE. Digital Business Demands Performance and Scale. Solution Brief

Session 201-B: Accelerating Enterprise Applications with Flash Memory

VMware vsphere Virtualization of PMEM (PM) Richard A. Brunner, VMware

Flavors of Memory supported by Linux, their use and benefit. Christoph Lameter, Ph.D,

Evaluation of Intel Memory Drive Technology Performance for Scientific Applications

Apache Spark Graph Performance with Memory1. February Page 1 of 13

A New Key-value Data Store For Heterogeneous Storage Architecture Intel APAC R&D Ltd.

Headline in Arial Bold 30pt. Visualisation using the Grid Jeff Adie Principal Systems Engineer, SAPK July 2008

GEN-Z AN OVERVIEW AND USE CASES

Technology Advancement in SSDs and Related Ecosystem Changes

Capabilities and System Benefits Enabled by NVDIMM-N

Emerging Technologies for HPC Storage

System Memory at a Fraction of the DRAM Cost

Improved Solutions for I/O Provisioning and Application Acceleration

Fast Forward Storage & I/O. Jeff Layton (Eric Barton)

Flash Storage with 24G SAS Leads the Way in Crunching Big Data

HPE SimpliVity. The new powerhouse in hyperconvergence. Boštjan Dolinar HPE. Maribor Lancom

Digital Co-Creation with Hybrid IT. Fujitsu World Tour 2017

Quobyte The Data Center File System QUOBYTE INC.

Jim Pappas Director of Technology Initiatives, Intel Vice-Chair, Storage Networking Industry Association (SNIA) December 07, 2018

Universal Storage. Innovation to Break Decades of Tradeoffs VASTDATA.COM

Copyright 2012 EMC Corporation. All rights reserved.

FUJITSU Server PRIMERGY CX400 M4 Workload-specific power in a modular form factor. 0 Copyright 2018 FUJITSU LIMITED

HPC Storage Use Cases & Future Trends

Copyright 2012 EMC Corporation. All rights reserved.

NVDIMM Overview. Technology, Linux, and Xen

Datasheet Nutanix Enterprise Cloud on PRIMERGY

IBM Power AC922 Server

Transcription:

Fujitsu SC 18 Human Centric Innovation Co-creation for Success 2018 FUJITSU

A Breakthrough in Non-Volatile Memory Technology & 2018 FUJITSU

IT needs to accelerate time-to-market Situation: End users and applications need instant access to data to progress faster and create better business outcomes Challenge: Today most IT infrastructures use separate data storage in addition to the main computer systems Conventional storage systems can t keep up with the speed of DRAM memory I/O operations increasingly impact application performance For I/O-intensive applications a system is only as fast as the storage, regardless of memory buildout In-memory computing can address this issue to an extent, but it is unproven and traditional memory is volatile Solution: Servers with non-volatile memory, which provides speed and data persistence 2 2018 FUJITSU

Next Generation I/O Solutions for Exascale EU Horizon 2020 NEXTGenIO Project Remove I/O bottlenecks through exploitation of non-volatile memory (NVRAM) technologies Bridge the gap between memory (fast access/small capacity) and storage (slow access/large capacity) Create NEXTGenIO hardware platform and validate NVRAM applicability for high performance and data-intensive computing Develop system software for HPC applications to exploit the NEXTGenIO platform NVRAM features. ECMWF runs a time-critical operational Weather Forecasting system that features a highly intensive HPC IO workload. As such, ECMWF is exploring the usage of NVRAM technology within HPC environments to tackle the ever-growing demands for high-density, high-contention IO systems. Validate the performance benefits of NVRAM using a mix of HPC workloads The entire project was driven by a strong co-design philosophy by the following partners: Coordinator Hardware technology System software Software tools Applications The NEXTGenIO project has received funding from the European Union s Horizon 2020 Research and Innovation programed under Grant Agreement no. 671951 3 2018 FUJITSU

NEXTGenIO Platform Development A key output of NEXTGenIO will be a prototype system based on the new NVRAM technology Fujitsu is responsible for the hardware prototype using Intel Optane DC persistent memory technology and system software (e.g. BIOS, drivers, ) Prototype system will be used to explore use of NVRAM technology for I/O intensive high-performance scientific computing Development team and production located at Augsburg, Germany 4 2018 FUJITSU

Growing Need for New Class of Memory Data Analytics, AI & Deep Learning Enormous amount of data to be processed Data/compute locality is important Keep data (large data sets) in fast memory close to compute for transfer and reinforcement learning Multiple users can share the same data Build in resilience thanks to NVRAM persistence 5 2018 FUJITSU

Growing Need for New Class of Memory Engineering Apps & Computational Steering Simulations are extremely compute intensive. Engineers have no insight what is happening during simulation e.g. of a car in a wind tunnel (OpenFOAM ) Check what happens WHILE a simulation is running, to tweak and adjust designs on-the-fly Typical use cases in car/aerospace design, such as OpenFOAM, Star CCM+, Fluent (any engineering application with visualization and steering function would benefit) Workflow: keeps all data in NVRAM and avoids need for I/O to permanent storage 6 2018 FUJITSU

Growing Need for New Class of Memory Traditional & High-end HPC Typical use cases are weather and climate modelling which are very I/O intensive Weather modelling: precise forecasts are calculated in short cycles (e.g. 4 a day) producing enormous amounts of data I/O write (and read back) operations not required if data stays in NVRAM significant reduction of runtime Additionally, streamed data (from different sources, e.g. mobile phones, sensors) can be used on the cluster / node WHILE the data is being processed 7 2018 FUJITSU

How it Works 8 2018 FUJITSU

NVRAM Technology Significantly Changes Memory and Storage Hierarchies NVRAM: Non-Volatile RAM In NEXTGenIO: Intel Optane DC Persistent Memory With up to 512GB, significantly larger capacity than DRAM Hosted in the DIMM slots, controlled by a standard CPU memory controller Comparable performance to DRAM; significantly faster than PCIe-attached SSDs 9 2018 FUJITSU

Software Stack NVDIMM Support in NEXTGenIO irmc Out-of-Band profiles/redfish DCPMM FW NVDIMM Firmware BIOS Platform settings, ACPI Rev. 6.2 e.g. NFIT Table Basic Systemware SLURM Data & Energy Aware Job Scheduler ipmctl (CLI, API, Mon) Manage device e.g. regions, FW DRAM Replacement Huge Memory Regions Hardware managed, DRAM as cache High Capacity and affordable Memory Unmodified Applications OS Linux CentOS 7.5 ndctl Manage namespaces High Speed Storage gekkofs, echofs, DAOS, ext4, fs without DAX option High-Speed classic Storage paradigm Persistent System Memory PMDK Persistent memory libs FS DAX File-system with DAX option DEVDAX Modified Applications New storage paradigm In-Memory, Big-Data, HPC, etc. Operating Systems Tools & Libs Usage Model Intel Optane DC Usage pattern App Impact Applications 10 2018 FUJITSU

Persistent Memory Introduction What is persistent memory? Persistent memory modules based on Intel Optane DC persistent memory Breakthrough in non-volatile memory for servers using future Intel Xeon Scalable CPUs Memory-like performance, byte-addressable Accessed through load/store instructions Why does it matter? Adds a new tier between DRAM and block storage Larger capacity, higher endurance, consistent low latency Just a few instructions instead of ten thousands of instructions to get data persistent stored Application can direct read/write from/to Intel Optane DC persistent memory without use of I/O links Memory & storage latency gaps 11 2018 FUJITSU

How fast is Fast Enough Goods for Production, Data for the CPU container ship grouped shipment Express train grouped shipment high-bay storage Goods can be taken individually Manufacturing *source Windmöller & Hölscher Shanghai Hamburg Berlin ~3.000-10.000 Kilometer. ~3-10 msec. ~80-300 Kilometer. ~80-300 µsec. 100-300 Meter 100-300 nsec. HDD Block Device NVMe SSD Block Device 12 2018 FUJITSU

Why is NVRAM so Much Faster? Main Memory Addr.Space Write op. DDR 1 2 Main Memory Addr.Space DDR 1 2 3 PCIe 6 PCIe SAS 4 5 SAS 13 2018 FUJITSU

System Software The system software stack (developed in NEXTGenIO) sits between the hardware and the user applications, exposes the infrastructure to the user level and enables direct interaction Job scheduler (SLURM) Object store as alternatives to file systems DAOS (Distributed Application Object Storage) dataclay Multi-node NVRAM file system echofs & GekkoFS (a collaboration with JGU university) Key goal: Platform must be usable as is for legacy applications 14 2018 FUJITSU

System Software Deep Dive on dataclay and echofs Application object store as alternative to file systems: dataclay Persistent data can be stored using the same data model used by the application in memory. dataclay is a platform that: enables storing and computing data in persistent memory facilities the location of the required data by an applications after a failure facilities the sharing of persistent data by several applications using the same memory data model Multi-node NVRAM file system: echofs & GekkoFS (collaboration with Johannes Gutenberg University Mainz) Leverages NVM providing a local and distributed filesystem for legacy applications. Allows automatizing stage-in and stageout using the job-scheduler to provide new scheduling and coordination options. Interfaces with applications using a POSIX interface with some features disabled to increase performance 15 2018 FUJITSU

Access Through User-Level Filesystem Allow legacy applications to transparently benefit from new storage layers Make new layers readily available to applications Hide I/O stack complexity from applications Construct collaborative burst buffer assigned to a batch job scheduler Echofs & GekkoFS DAOS DataClay 16 2018 FUJITSU

NEXTGenIO Prototype Rack Login / Management Node Xeon Platinum 8260M 24C 2.4GHz 165W 6x 16GB RAM 2x 256GB NVDIMM 1x 1GE (Mgmt) 1x 10GE 1x Omni-Path 1x NVIDIA Quadro M4000 (FH, 120W) 3x SATA-SSD ~2TB Boot Node Xeon Platinum 8260M 24C 2.4GHz 165W 3x 16GB RAM 1x 1GE (Mgmt) 1x 10GE 1x SATA-SSD 400GB Compute Node Xeon Platinum 8260M 24C 2.4GHz 165W 12x 16GB RAM 12x 256GB NVDIMM 2x 1GE (1x Mgmt / 1x LAN) 2x Omni-Path Remote Boot / (no internal boot device HDD / SSD / M.2) Gateway Node Xeon Platinum 8260M 24C 2.4GHz 165W 12x 16 GB RAM 2x 1GE (1x Mgmt / 1x LAN) 2x Omni-Path 2x 2-ch IB-FDR (LP) NVM up to 32x12x1MIOPs = 384MIOPs @ 1/1µs (r/w) 32x12x0.25TB = 96TB 12MIOPs, 3TB per Node Rack / 42 HU populated CN: 32 x 1 = 32 L/M: 2 x 1 = 2 B/N: 2 x 1 = 2 1/10GE: 2 x 1 = 2 OPA: 2 x 1 = 2 GW: 2 x 1 = 2 17 2018 FUJITSU

Fujitsu Sans Light abcdefghijklmnopqrstuvwxyz ABCDEFGHIJKLMNOPQRSTUVWXYZ 0123456789! $%^&*()_+-=[]{}; #:@~,./<>?\ ~ ª - ±²³µ ¹º¼½¾ ÀÁÂÃÄÅÇÈÆÉÊËÌÍÎÏÐÑÒÓÔÕÖ ØÙÚÛÜÝÞßàáâãäåæçèéêëìíîïðñòóôõö øùúûü ýþÿđıœœššÿžžƒʼˆˇˉ - ⁰⁴⁵⁶⁷⁸⁹₀₁₂₃₄₅₆₇₈₉ Ω fifl Fujitsu Sans abcdefghijklmnopqrstuvwxyz 0123456789! $%^&*()_+-=[]{}; #:@~,./<>?\ ~ ª - ±²³µ ¹º¼½¾ ÀÁÂÃÄÅÇÈÆÉÊËÌÍÎÏÐÑÒÓÔÕÖ ØÙÚÛÜÝÞßàáâãäåæçèéêëìíîïðñòóôõö øùúû üýþÿđıœœššÿžžƒʼˆˇˉ - ⁰⁴⁵⁶⁷⁸⁹₀₁₂₃₄₅₆₇₈₉ Ω fifl Fujitsu Sans Medium abcdefghijklmnopqrstuvwxyz 0123456789! $%^&*()_+- =[]{}; #:@~,./<>?\ ~ ª - ±²³µ ¹º¼½¾ ÀÁÂÃÄÅÇÈÆÉÊËÌÍÎÏÐÑÒÓÔÕÖ ØÙÚÛÜÝÞßàáâãäåæçèéêëìíîïðñòóôõö øùú ûüýþÿđıœœššÿžžƒʼˆˇˉ - ⁰⁴⁵⁶⁷⁸⁹₀₁₂₃₄₅₆₇₈₉ Ω fifl