HOW TO BUILD A MODERN AI

HOW TO BUILD A MODERN AI FOR THE UNKNOWN IN MODERN DATA
2016 PURE STORAGE INC.

Official Languages Act (1969/1988)
Translation Bureau

DAWN OF THE 4TH INDUSTRIAL REVOLUTION
BIG DATA, AI DRIVING CHANGE IN EVERY INDUSTRY
1st Revolution (1760-1820s): Steam Power. Rural to Industrial.
2nd Revolution (1870-1914): Electricity. Industrial to Mass Production.
3rd Revolution (1980-2010): PC. Mass Production to Digital.
4th Revolution (2010-now): AI, Big Data & IoT. Digital to Intelligence.

DATA IS VITAL TO MACHINE LEARNING
OBSERVATION BY PROF. ANDREW NG, AI LUMINARY

VALUABLE DATA STUCK IN NEUTRAL
LEGACY, RETROFIT STORAGE BUILT ON SERIAL TECHNOLOGIES, PERFORMANCE GAP GROWING
STORAGE TECHNOLOGY NOT KEEPING UP: Gap Will Only Grow Worse
Deep Learning Compute Required: 15x in 2 Years
Compute Delivered: 10x in 2 Years
SSD/Disk Performance Delivered: ~Flat in 2 Years
[Chart: PERFORMANCE GAP, 2015 to 2017]
LEGACY & RETROFIT STORAGE: Built on Decade-Old Serial Technology
Newer Technologies Retrofitted: Object Translation Layer, NFS Software Stack
Decade-old Protocol & SW: SAS (Serial Attached SCSI), SATA, Disk Emulation Software
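
The growth figures on this slide can be turned into a rough gap calculation. The sketch below is illustrative only. It assumes the three multipliers cover the same two-year window, that "~Flat" means exactly 1.0x, and that growth compounds evenly per year; the slide states none of these.

```python
# Two-year growth multipliers as quoted on the slide.
# Treating "~Flat" as exactly 1.0x is an assumption made for this sketch.
GROWTH_OVER_TWO_YEARS = {
    "deep_learning_compute_required": 15.0,
    "compute_delivered": 10.0,
    "ssd_disk_performance_delivered": 1.0,
}


def annualized(multiplier, years=2):
    """Per-year growth factor, assuming even compounding (an assumption)."""
    return multiplier ** (1.0 / years)


def gap(numerator, denominator):
    """How many times faster one quantity grew than another over the window."""
    return GROWTH_OVER_TWO_YEARS[numerator] / GROWTH_OVER_TWO_YEARS[denominator]


if __name__ == "__main__":
    for name, mult in GROWTH_OVER_TWO_YEARS.items():
        print(f"{name}: {mult:g}x in 2 years, about {annualized(mult):.2f}x per year")
    print("compute delivered vs storage:",
          gap("compute_delivered", "ssd_disk_performance_delivered"))
    print("compute required vs delivered:",
          gap("deep_learning_compute_required", "compute_delivered"))
```

Under these assumptions, delivered compute outgrew storage performance by 10x over the window, and required compute outgrew delivered compute by 1.5x.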

STORAGE FOR AI: BOTH A TRUCK AND A RACE CAR
TRUCK: CAPACITY, LARGE FILES, THROUGHPUT, SEQUENTIAL ACCESS
RACE CAR: CONCURRENCY, SMALL FILES, LATENCY, RANDOM ACCESS

SOUL OF DGX-1 IS PARALLEL

SOUL OF FLASHBLADE IS PARALLEL
POWERING 75 BLADE-SCALE IN SINGLE IP WITH PURITY FOR FLASHBLADE
NATIVE OBJECT
NATIVE NFS/SMB
KEY VALUE
KEY-VALUE DATABASE STORE FOR DISTRIBUTED PARTITIONS
BILLIONS & BILLIONS OF OBJECTS
Note: 75 blade feature is subject to GA release.

THREE ESSENTIAL THINGS FOR AI
FRAMEWORKS & APPLICATIONS
COMPUTE: FROM CPU TO GPU SERVERS
STORAGE: POWER ENTIRE AI PIPELINE

WIDE RANGE OF NEEDS IN THE PIPELINE
SIGNIFICANT CHALLENGE TO LEGACY STORAGE
INGEST: From sensors, machines, & user generated
CLEAN & TRANSFORM: CPU Servers
EXPLORE: GPU Server
TRAIN: GPU Production Cluster
[Chart: IO PROFILE per stage]

REAL WORLD PIPELINE IN AN AUTONOMOUS CAR COMPANY
INGEST
CLEAN, LABEL, RESIZE: CPU Servers
EXPLORE: GPU Server
TRAIN: GPU Production Cluster
INFERENCE IN VIRTUAL WORLD: GPU Production Cluster
10s OF PB COLD STORAGE

AI SYSTEMS DESIGN PATTERNS
GOAL IS TO KEEP THE GPUs 100% BUSY
FULL TRAINING WORKFLOW: decode, scale, evaluate, forward-propagation, update, back-propagation
RESOURCES: I/O, CPU, GPU
BENCHMARK SETUP
Setup #1: DGX-1 with 4x Local SSDs
Setup #2: DGX-1 with 1x FlashBlade
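
One common way to approach the goal of keeping the GPUs busy is to overlap data loading and preprocessing with the compute step, so the accelerator does not sit idle waiting on I/O. The sketch below shows that pattern with a bounded queue and a background thread from the Python standard library. It is not the benchmark code behind these slides: `load_and_decode`, `train_step`, and the sleep durations are hypothetical stand-ins for the I/O, CPU, and GPU stages.

```python
import queue
import threading
import time


def load_and_decode(index):
    """Hypothetical stand-in for the I/O and CPU stages (read, decode, scale)."""
    time.sleep(0.01)  # pretend storage read plus decode
    return f"batch-{index}"


def train_step(batch):
    """Hypothetical stand-in for the GPU stages (forward, backward, update)."""
    time.sleep(0.01)  # pretend accelerator work
    return batch


def producer(num_batches, out_queue):
    """Fill the queue ahead of the consumer, then signal the end."""
    for i in range(num_batches):
        out_queue.put(load_and_decode(i))  # blocks while the queue is full
    out_queue.put(None)  # sentinel: no more batches


def run_prefetched(num_batches, prefetch_depth=4):
    """Train while a background thread prepares upcoming batches."""
    batches = queue.Queue(maxsize=prefetch_depth)
    worker = threading.Thread(
        target=producer, args=(num_batches, batches), daemon=True
    )
    worker.start()
    processed = []
    while True:
        batch = batches.get()
        if batch is None:
            break
        processed.append(train_step(batch))
    worker.join()
    return processed


def run_serial(num_batches):
    """Baseline: load, then train, one batch at a time, with no overlap."""
    return [train_step(load_and_decode(i)) for i in range(num_batches)]


if __name__ == "__main__":
    for fn in (run_serial, run_prefetched):
        start = time.perf_counter()
        fn(20)
        print(f"{fn.__name__}: {time.perf_counter() - start:.2f} s")
```

With these stand-in timings the prefetched loop should finish noticeably sooner than the serial one, because loading and compute overlap. Real gains depend on whether storage can supply batches as fast as the GPUs consume them.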

RESULT: FLASHBLADE vs LOCAL SSDs
TENSORFLOW TRAINING BENCHMARK WITH RESNET-50
TIME TO SOLUTIONS (HOURS)
NVIDIA DGX-1 + Local SSDs: 1.8 Hours to process 10M images
NVIDIA DGX-1 + Pure FlashBlade: 1.7 Hours to process 10M images (6% Faster)

RESULT: 3X FASTER END-TO-END
TENSORFLOW TRAINING BENCHMARK WITH RESNET-50
TIME TO SOLUTIONS (HOURS)
NVIDIA DGX-1 + Local SSDs: 3.4 Hours to load 8TB into SSDs, then 1.8 Hours to process 10M images
NVIDIA DGX-1 + Pure FlashBlade: 1.7 Hours to process 10M images (305% Faster)
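
The headline ratios on the two result slides can be checked against the quoted hours. The sketch below is plain arithmetic on the figures shown. Because those figures are rounded to one decimal place, the computed ratio can differ slightly from the percentage printed on the slide. The constant and function names are illustrative.

```python
# Hours as quoted on the slides (rounded to one decimal place).
LOAD_TO_LOCAL_SSD_HOURS = 3.4   # copy 8 TB onto local SSDs before training
TRAIN_LOCAL_SSD_HOURS = 1.8     # process 10M images from local SSDs
TRAIN_FLASHBLADE_HOURS = 1.7    # process 10M images from FlashBlade


def speedup(baseline_hours, new_hours):
    """Ratio of baseline time to new time; 2.0 means half the time."""
    return baseline_hours / new_hours


# Training only: compare the two processing times directly.
training_only = speedup(TRAIN_LOCAL_SSD_HOURS, TRAIN_FLASHBLADE_HOURS)

# End to end: the local-SSD setup also pays the time to load the data first.
end_to_end = speedup(
    LOAD_TO_LOCAL_SSD_HOURS + TRAIN_LOCAL_SSD_HOURS,
    TRAIN_FLASHBLADE_HOURS,
)

if __name__ == "__main__":
    print(f"training only: {training_only:.2f}x")
    print(f"end to end:    {end_to_end:.2f}x")
```

On the rounded inputs this gives about 1.06x for training alone and about 3.06x end to end, in line with the "6% Faster" and "3X FASTER" headlines.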

ANALYTICS FOR PRODUCTION DATA
TUNED FOR EVERYTHING
DATA PLATFORM FOR BOTH TRAINING AND INFERENCING WORKLOADS
TRAINING
ANALYSIS ON INFERENCE DATA
DEPLOY TRAINED MODELS

MAKING AUTONOMOUS CARS POSSIBLE BY 2021
Zenuity, a joint venture of Volvo and Autoliv, aims to build autonomous driving software for production vehicles by 2021. They chose to build their deep learning infrastructure with NVIDIA DGX-1 servers and Pure FlashBlade systems to accelerate their AI initiative.

AUTONOMOUS VEHICLE SOFTWARE COMPANY
4 NVIDIA DGX-1: DL Training Cluster
2 PURE FLASHBLADE: Over 1 PB of training data w/ performance headroom
ENTIRE AI PIPELINE ON A SINGLE HUB: Preprocessing, Exploring, and Training on FlashBlade

AI EXPANDED OUR VIEW OF THE WORLD
STRUCTURED (BLOCK, DBs, VMs)
DBs & Apps
UNSTRUCTURED (FILE, OBJECT, KEY-VALUE CONTAINERS)
ALL-FLASH ARRAYS
VM Farms
Tier2 Apps

FLASHBLADE: INDUSTRY'S FIRST DATA HUB PURPOSE-BUILT FOR AI & DEEP LEARNING
BLADE: Powerful, Elastic Data Processing & Storage Unit
PURITY: Massively Distributed Software for Limitless Scale
SCALE-OUT FABRIC: Software-defined fabric that scales linearly with more data & clients