Parallel File Systems. John White Lawrence Berkeley National Lab

Topics
  - Defining a File System
  - Our Specific Case for File Systems
  - Parallel File Systems
  - A Survey of Current Parallel File Systems
  - Implementation

What is a File System?
Simply, a method for ensuring:
  - A Unified Access Method to Data
  - Organization (in a technical sense)
  - Data Integrity
  - Efficient Use of Hardware

The HPC Application (our application)
  - Large Node Count
  - High IO Code (small file operations)
  - High Throughput Code (large files fast)
  - You Can Never Provide Too Much Capacity

What's the Problem With Tradition?
  - NFS/CIFS/AFP/NAS is slow
    - Single point of contact for both data and metadata
    - Protocol Overhead
    - File based locking
  - We want parallelism from the application to disk
  - We Need a Single Namespace
  - We Need Truly Massive Aggregate Throughput (stop thinking MB/s)
  - Bottlenecks are Inherent to Architecture
  - Most Importantly:

Researchers Just Don't Care
  - They want their data available everywhere
  - They hate transferring data (this bears repeating)
  - Their code wants the data several cycles ago
  - If they have to learn new IO APIs, they commonly won't use them, period
  - An increasing number aren't aware their code is inefficient

Performance in Aggregate: A Specific Case
  - File system capable of 5 GB/s
  - Researcher running an analysis of past stock ticker data
  - 10 independent processes per node, 10+ nodes, sometimes 1000+ processes
  - Was running into performance issues
  - In reality, the code was hitting 90% of the file system's peak performance
  - 100s of processes choking each other
  - Efficiency is key
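The arithmetic behind this case can be sketched in a few lines. This is a rough illustration only: it assumes the aggregate bandwidth is split evenly across processes and uses decimal units (1 GB = 1000 MB). The function name `per_process_share` is made up for this example. The 5 GB/s and 90% figures come from the slide.

```python
def per_process_share(aggregate_gb_per_s: float, processes: int) -> float:
    """Even split of aggregate bandwidth, returned in MB/s per process.

    Assumption: perfectly fair sharing, which real workloads rarely get.
    """
    if processes <= 0:
        raise ValueError("processes must be positive")
    return aggregate_gb_per_s * 1000.0 / processes


# Figures from the slide: 5 GB/s peak, job observed at about 90% of it.
PEAK_GB_PER_S = 5.0
OBSERVED_FRACTION = 0.90

for procs in (100, 1000):
    share = per_process_share(PEAK_GB_PER_S * OBSERVED_FRACTION, procs)
    print(f"{procs:>5} processes -> {share:.1f} MB/s each")
```

Under these assumptions each of 1000 processes sees only a few MB/s even though the file system as a whole is nearly saturated, which is why the job looked slow from the researcher's side.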

Parallel File Systems
  - A File System That Provides
    - Access to Massive Amounts of Data at Large Client Counts
    - Simultaneous Client Access at Sub-File Levels
    - Striping at Sub-File Levels
    - Massive Scalability
    - A Method to Aggregate Large Numbers of Disks
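Simultaneous access at sub-file levels can be illustrated with plain POSIX calls. The sketch below is a single-machine stand-in, not a parallel file system: threads play the role of clients, the file lives in a temporary directory, and the names `write_region`, `parallel_write`, `demo` and the 4096-byte `BLOCK` are assumptions made for the example. It relies on `os.pwrite`, which is available on Unix-like systems.

```python
import os
import tempfile
from concurrent.futures import ThreadPoolExecutor

BLOCK = 4096  # bytes owned by each writer (illustrative size)


def write_region(path: str, rank: int) -> int:
    """Write one BLOCK-sized region at this writer's own offset."""
    payload = bytes([ord("A") + rank]) * BLOCK
    fd = os.open(path, os.O_WRONLY)
    try:
        # pwrite takes an explicit offset, so writers never share a file position.
        return os.pwrite(fd, payload, rank * BLOCK)
    finally:
        os.close(fd)


def parallel_write(path: str, writers: int) -> None:
    # Pre-size the file so every region exists before the writers start.
    with open(path, "wb") as f:
        f.truncate(writers * BLOCK)
    with ThreadPoolExecutor(max_workers=writers) as pool:
        list(pool.map(lambda rank: write_region(path, rank), range(writers)))


def demo(writers: int = 4) -> bytes:
    """Run the writers against a temporary file and return its contents."""
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "shared.dat")
        parallel_write(path, writers)
        with open(path, "rb") as f:
            return f.read()


if __name__ == "__main__":
    data = demo()
    print(len(data), data[:1], data[-1:])
```

Because every writer owns a disjoint byte range, no writer overwrites another's data. On a parallel file system, byte-range (rather than whole-file) locking is what lets this pattern scale across nodes.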

Popular Parallel File Systems
  - Lustre
    - Purchased by Intel
    - Support offerings from Intel, Whamcloud and numerous vendors
    - Object based
    - Growing feature list
      - Information Lifecycle Management
      - Wide Area mounting support
      - Data replication and Metadata clustering planned
    - Open source
      - Large and growing install base, vibrant community
      - Open compatibility

Popular Parallel File Systems
  - GPFS
    - IBM, born around 1993 as Tiger Shark multimedia file system
    - Support direct from vendor
    - AIX, Linux, some Windows
    - Ethernet and Infiniband support
    - Wide Area Support
    - ILM
    - Distributed metadata and locking
    - Matured storage pool support
    - Replication

Licensing Landscape
  - GPFS (A Story of a Huge Feature Set at a Huge Cost)
    - Binary
    - IBM licensing
      - Per Core
      - Site-Wide
  - Lustre
    - Open
    - Paid Licensing available tied to support offerings

Striping Files
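A minimal sketch of how striping maps a byte offset in a file to a storage target, assuming a simple round-robin layout with a fixed stripe size. Real file systems make the layout configurable per file or directory. The names `StripeLocation` and `locate` are invented for this example and do not correspond to any Lustre or GPFS API.

```python
from typing import NamedTuple


class StripeLocation(NamedTuple):
    target: int         # index of the storage target holding the byte
    target_offset: int  # byte offset within that target's object


def locate(offset: int, stripe_size: int, stripe_count: int) -> StripeLocation:
    """Map a file offset to (target, offset on target) under round-robin striping."""
    if offset < 0 or stripe_size <= 0 or stripe_count <= 0:
        raise ValueError("offset must be >= 0; stripe_size and stripe_count must be > 0")
    stripe_index = offset // stripe_size          # which stripe of the file
    target = stripe_index % stripe_count          # stripes rotate over the targets
    full_rounds = stripe_index // stripe_count    # earlier stripes already on this target
    target_offset = full_rounds * stripe_size + offset % stripe_size
    return StripeLocation(target, target_offset)


if __name__ == "__main__":
    MIB = 1024 * 1024
    for off in (0, 1 * MIB, 4 * MIB, 5 * MIB + 10):
        print(off, locate(off, stripe_size=MIB, stripe_count=4))
```

With a 1 MiB stripe over 4 targets, consecutive megabytes of one file land on different targets, so a large sequential read or write is served by several disks at once.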

SAN
  - All nodes have access to storage fabric, all LUNs

Direct Connect
  - A separate storage cluster hosts and exports via common fabric

Berkeley Research Computing
  - Current Savio Scratch File System
    - Lustre 2.5
    - 210 TB of DDN 9900
    - ~10 GB/s ideal throughput
    - Accessible on all nodes
  - Future
    - Lustre 2.5 or GPFS 4.1
    - ~1 PB+ capacity
    - ~20 GB/s throughput
    - Vendor yet to be determined

Berkeley Research Computing
  - Access Methods
    - Available on every node
      - POSIX
      - MPI-IO
  - Data Transfer
    - Globus Online
      - Ideal for large transfers
      - Restartable
      - Tuned for large networks and long distance
      - Easy to use graphical interface online
    - SCP/SFTP
      - Well known
      - Suitable for quick and dirty transfers

Current Technological Landscape
  - Tiered Storage (Storage Pools)
    - When you have multiple storage needs within a single namespace
    - SSD/FC for jobs, metadata (Tier0)
    - SATA for capacity (Tier1)
    - Tape for long-term/archival (Tier2)
  - ILM
    - Basically, perform actions on data per a rule set
    - Migration to Tape
    - Fast Tier 0 storage use case
    - Purge Policies
    - Replication
  - Dangers of metadata operations
  - Long term storage
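"Perform actions on data per a rule set" can be sketched as a first-match rule evaluator. This is an illustration only, not the policy language of GPFS or Lustre: the `FileInfo` fields, the rule names, the `/scratch/` path prefix and every threshold below are hypothetical values chosen for the example.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass(frozen=True)
class FileInfo:
    path: str
    pool: str               # "tier0" (SSD/FC), "tier1" (SATA) or "tier2" (tape)
    days_since_access: int


@dataclass(frozen=True)
class Rule:
    name: str
    matches: Callable[[FileInfo], bool]
    action: str


# Hypothetical rule set; thresholds and paths are invented for illustration.
RULES: List[Rule] = [
    Rule("purge-old-scratch",
         lambda f: f.path.startswith("/scratch/") and f.days_since_access > 60,
         "purge"),
    Rule("demote-idle-tier0",
         lambda f: f.pool == "tier0" and f.days_since_access > 7,
         "migrate:tier1"),
    Rule("archive-cold-tier1",
         lambda f: f.pool == "tier1" and f.days_since_access > 180,
         "migrate:tier2"),
]


def decide(info: FileInfo, rules: List[Rule] = RULES) -> Optional[str]:
    """Return the action of the first matching rule, or None to leave the file alone."""
    for rule in rules:
        if rule.matches(info):
            return rule.action
    return None


if __name__ == "__main__":
    samples = [
        FileInfo("/scratch/run42/out.dat", "tier0", 90),
        FileInfo("/projects/model/state.h5", "tier0", 10),
        FileInfo("/projects/archive/2012.tar", "tier1", 200),
    ]
    for s in samples:
        print(s.path, "->", decide(s))
```

Evaluating rules like these means scanning metadata for every file, which is one reason the slide flags the dangers of metadata operations: a policy run over a very large namespace is itself a heavy workload.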

Further Information
  - Berkeley Research Computing: http://research-it.berkeley.edu/brc
  - HPCS at LBNL: http://scs.lbl.gov/
  - Email: jwhite@lbl.gov