HPC File Systems and Storage. Irena Johnson University of Notre Dame Center for Research Computing

Similar documents
Introducing Panasas ActiveStor 14

Storage for High-Performance Computing Gets Enterprise Ready

CA485 Ray Walshe Google File System

Data Management. Parallel Filesystems. Dr David Henty HPC Training and Support

COMP283-Lecture 3 Applied Database Management

Effective Use of CSAIL Storage

IST346. Data Storage

Zadara Enterprise Storage in

New HPE 3PAR StoreServ 8000 and series Optimized for Flash

IBM Spectrum NAS. Easy-to-manage software-defined file storage for the enterprise. Overview. Highlights

BeoLink.org. Design and build an inexpensive DFS. Fabrizio Manfredi Furuholmen. FrOSCon August 2008

Chapter Two. Lesson A. Objectives. Exploring the UNIX File System and File Security. Understanding Files and Directories

Shared Object-Based Storage and the HPC Data Center

An Introduction to GPFS

Isilon Performance. Name

Feedback on BeeGFS. A Parallel File System for High Performance Computing

Storage Optimization with Oracle Database 11g

High Performance Computing Course Notes High Performance Storage

UCS Invicta: A New Generation of Storage Performance. Mazen Abou Najm DC Consulting Systems Engineer

Chapter 10: Mass-Storage Systems

CS370 Operating Systems

Xcellis Technical Overview: A deep dive into the latest hardware designed for StorNext 5

Chapter 10: Mass-Storage Systems. Operating System Concepts 9 th Edition

SurFS Product Description

Mass-Storage Structure

Veritas NetBackup on Cisco UCS S3260 Storage Server

Input/Output. Today. Next. Principles of I/O hardware & software I/O software layers Disks. Protection & Security

Linux+ Guide to Linux Certification, Third Edition. Chapter 6 Advanced Installation

Introducing SUSE Enterprise Storage 5

IOPStor: Storage Made Easy. Key Business Features. Key Business Solutions. IOPStor IOP5BI50T Network Attached Storage (NAS) Page 1 of 5

Storage Technologies - 3

SolidFire and Ceph Architectural Comparison

Sun Lustre Storage System Simplifying and Accelerating Lustre Deployments

Storage Update and Storage Best Practices for Microsoft Server Applications. Dennis Martin President, Demartek January 2009 Copyright 2009 Demartek

Computer Architecture 计算机体系结构. Lecture 6. Data Storage and I/O 第六讲 数据存储和输入输出. Chao Li, PhD. 李超博士

Introduction to Virtualization. From NDG In partnership with VMware IT Academy

Panasas Performance Scale-Out NAS-Architectural Overview. Panasas ActiveStor Solution: Architectural Overview WHITE PAPER

Dell Fluid Data solutions. Powerful self-optimized enterprise storage. Dell Compellent Storage Center: Designed for business results

Evaluating Cloud Storage Strategies. James Bottomley; CTO, Server Virtualization

FILE SYSTEMS. CS124 Operating Systems Winter , Lecture 23

Isilon Scale Out NAS. Morten Petersen, Senior Systems Engineer, Isilon Division

CSCI-GA Database Systems Lecture 8: Physical Schema: Storage

White Paper. EonStor GS Family Best Practices Guide. Version: 1.1 Updated: Apr., 2018

StarWind Storage Appliance

Storage Systems. Storage Systems

EI 338: Computer Systems Engineering (Operating Systems & Computer Architecture)

Virtual File System -Uniform interface for the OS to see different file systems.

COSC6376 Cloud Computing Lecture 17: Storage Systems

Panasas Corporate & Product Overview

IBM IBM Open Systems Storage Solutions Version 4. Download Full Version :

A GPFS Primer October 2005

Choosing Hardware and Operating Systems for MySQL. Apr 15, 2009 O'Reilly MySQL Conference and Expo Santa Clara,CA by Peter Zaitsev, Percona Inc

Chapter 12: Mass-Storage

NAS When, Why and How?

CSE 451: Operating Systems Winter Redundant Arrays of Inexpensive Disks (RAID) and OS structure. Gary Kimura

Chapter 12: Mass-Storage

Chapter 12: Mass-Storage

Network Storage Appliance

CS 470 Spring Distributed Web and File Systems. Mike Lam, Professor. Content taken from the following:

OpenAFS A HPC filesystem? Rich Sudlow Center for Research Computing University of Notre Dame

File System. Computadors Grau en Ciència i Enginyeria de Dades. Xavier Verdú, Xavier Martorell

PC-based data acquisition II

Current Topics in OS Research. So, what s hot?

Realizing the Promise of SANs

The advantages of architecting an open iscsi SAN

Coordinating Parallel HSM in Object-based Cluster Filesystems

V. Mass Storage Systems

IBM Spectrum Protect Version Introduction to Data Protection Solutions IBM

CS 470 Spring Distributed Web and File Systems. Mike Lam, Professor. Content taken from the following:

DELL POWERVAULT NX3500. A Dell Technical Guide Version 1.0

Storage Area Network (SAN)

Brent Welch. Director, Architecture. Panasas Technology. HPC Advisory Council Lugano, March 2011

Distributed Systems. Hajussüsteemid MTAT Distributed File Systems. (slides: adopted from Meelis Roos DS12 course) 1/25

Get More Out of Storage with Data Domain Deduplication Storage Systems

PRESENTATION TITLE GOES HERE

Chapter 10: Mass-Storage Systems

Mass-Storage Systems. Mass-Storage Systems. Disk Attachment. Disk Attachment

CS3600 SYSTEMS AND NETWORKS

1Z0-433

Discover CephFS TECHNICAL REPORT SPONSORED BY. image vlastas, 123RF.com

SOLO NETWORK (11) (21) (31) (41) (48) (51) (61)

SATA RAID For The Enterprise? Presented at the THIC Meeting at the Sony Auditorium, 3300 Zanker Rd, San Jose CA April 19-20,2005

HYBRID STORAGE TM. WITH FASTier ACCELERATION TECHNOLOGY

Quobyte The Data Center File System QUOBYTE INC.

General Purpose Storage Servers

Storage. CS 3410 Computer System Organization & Programming

TECHNICAL OVERVIEW OF NEW AND IMPROVED FEATURES OF EMC ISILON ONEFS 7.1.1

Object Storage: Redefining Bandwidth for Linux Clusters

File Systems. Kartik Gopalan. Chapter 4 From Tanenbaum s Modern Operating System

Linux Files and the File System

Copyright 2012, Oracle and/or its affiliates. All rights reserved.

NST6000 UNIFIED HYBRID STORAGE. Performance, Availability and Scale for Any SAN and NAS Workload in Your Environment

Deploying Celerra MPFSi in High-Performance Computing Environments

EMC Solutions for Backup to Disk EMC Celerra LAN Backup to Disk with IBM Tivoli Storage Manager Best Practices Planning

Storage Area Networks SAN. Shane Healy

The File Systems Evolution. Christian Bandulet, Sun Microsystems

Parallel File Systems Compared

CS 550 Operating Systems Spring File System

EMC Celerra CNS with CLARiiON Storage

HPC Growing Pains. IT Lessons Learned from the Biomedical Data Deluge

Transcription:

HPC File Systems and Storage Irena Johnson University of Notre Dame Center for Research Computing

HPC (High Performance Computing) Aggregating computer power for higher performance than that of a typical desktop computer/workstation for solving large problems in science, engineering, business Large systems perform calculations Data access is critical HPC: Compute node Head node File System Storage Networking

File Collection of data/information: Document Picture Audio or video stream Application Other collection of data

Metadata The information that describes the data contained in files: Size Date created Date modified Location on disk Permissions (who can view/modify your file)

File System Definition from TLDP (The Linux Documentation Project): "On a UNIX system, everything is a file; if something is not a file, it is a process*. Many Types of File Systems Not all file systems are equal Designed for different uses Data is organized in different ways Some are faster than others Some are more robust/reliable Some support large storage drives *Process - a task (a process is started when a program is initiated)

UNIX/Linux File System Hierarchical file structure Tree-structured file system (upside down tree) Everything starts from the root directory / and expands into sub-directories and so forth Unlike Windows which uses drives

UNIX/Linux File System / /bin/ /boot/ /dev/ /etc/ /home/ /lib/ /media/ /mnt/ /opt/ /proc/ /bea/ /ed/ /jen/

UNIX/Linux File System Serial File System (Traditional) A single server controls the users and data Can be faster for one user No redundancy Simple

UNIX/Linux File System Distributed / Parallel File System Data is spread out across many systems on a network Single shared global namespace Supports multiple users (can be distributed) Supports high bandwidth More storage than on a single system Fault tolerant Reliable Scalable Complex

Parallel File System Clients Clients Clients Metadata Parallel Read/Write Metadata servers Metadata servers Management Storage Devices

Parallel File System Breaks up a data set and distributes (stripes), the blocks to multiple storage drives (local and/or remote servers). Users do not need to know the physical location of the data blocks to retrieve a file. Data access is done via a global namespace. A metadata server stores the file name, location, owner, access permissions. Reads and writes data to distributed storage devices using multiple I/O paths concurrently. Capacity and bandwidth can be scaled. Storage - high availability, mirroring, replication, snapshots.

File Systems at CRC AFS (Andrew File System) Developed in 1982, part of the Andrew project at Carnegie Mellon University. Named after Andrew Carnegie and Andrew Mellon Client-server architecture Federated file sharing Provides location independence Scalable Secure (Kerberos for authentication and ACL - access control lists on directories for users and groups) Available for a wide range of heterogeneous systems - UNIX/ Linux, MacOS X, and Microsoft Windows

File Systems at CRC Panasas - High Performance Parallel scratch File System /scratch365 Parallel access to data Data is striped across multiple storage nodes, providing increased capacity and/or performance Concurrent reading and writing (scalable performance to individual files) Global Namespace - all compute nodes accessing the storage see the same namespace (same name and pathname); management is done through one system only

Overview CRC File Systems Purpose File System Type, Full Name File Access Space Available Aggregated bandwidth (approx.) Globally accessible home and project directories User s Home Directories AFS - crc.nd.edu /afs/crc.nd.edu/user/first/netid Group Directories AFS crc.nd.edu /afs/crc.nd.edu/group/ Pseudo-temporary File System Directly using OpenAFS client (open source) $HOME Directly using OpenAFS client 100GB - 2TB volume 100GB - 2TB volume up to 70-85 MB/sec per node - Approximately 200 MB/sec aggregated using multiple nodes Panasas High Performance Parallel scratch file system /scratch365/netid Directly using Panasas proprietary pants client 500GB - 1TB 70-90 MB/sec per node with 1 Gb network Local File Systems Node local temporary scratch file system Local disks /scratch (link to /tmp) Directly - shared with other users on node R815-500GB HP DL160 - d6copt - 100GB IBM/Lenovo nx360m4-400gb Daccssfe - 5TB R815-H700 RAIDctrl -250-300 MB/sec HP DL160 -d6copt 50-60 MB/sec IBM/Lenovo 90-100 MB/sec daccssfe - 800-1,000 MB/sec

RAID Redundant Array(s) of Inexpensive/Independent Disks Physical disks bound together with hardware or software Used to create larger filesystems out of standard drive technology Configurations optimize cost vs capability RAID Levels: 0, 1, 3, 4, 5, 6, 0+1, 1+0 RAID 0 - striped (performance and capacity) RAID 1 - mirrored (read performance, fault tolerance FT) RAID 5 - striped with distributed parity (performance, capacity, FT N+1) RAID 6 - striped with distributed parity (performance, capacity, FT N+2) https://searchstorage.techtarget.com/definition/raid

Data Storage How information is kept in a digital format that may be retrieved later Computers/Laptops/Tablets/Smartphones/other devices - all store data Hard drive/disk/flash drive/ssd (solid state data)/cloud Is not the same as RAM memory * Hard drive - think long term memory * RAM - think short term memory

Data Storage Types File-based storage Block-based storage Object-based storage

File Storage Also called file-level or file-based storage You use file storage when you access documents/pictures saved in files on your computer Data is stored as a single piece of information inside a file, inside a directory A single path to data Hierarchical in nature (called tree-structured system) Oldest type of storage Inexpensive Simple

Block Storage Breaks a file into individual blocks of data The blocks are stored as separate pieces of data No need for file-folder structure because each block of data has a unique address The smaller blocks of data spread out to where is most efficient The storage system software pulls all the blocks back together to assemble the file when accessed The more data you need to store, the better

Block Storage Used in storage-area network (SAN) environments where data is stored in volumes (blocks) Data is divided into blocks (can be different sizes) which are stored separately on hard drive(s) Consistent I/O performance, low latency connectivity More expensive, complex Good for data that has to be frequently accessed and updated Usage examples: database storage; applications like Java

Object Storage Also called object-based storage Files are broken into units called objects and spread out among hardware The objects are kept in a single repository, instead of being kept as files in directories or as blocks on servers The blocks of data that make up a file, the metadata is kept into a storage pool Unique identifier assigned to the object Cost efficient: you only pay for what you use Usage examples: big data, web applications, backup archives Good for data that doesn t need to be modified (just READ)

File/Block/Object Storage Comparison File-based storage Block-based storage Object-based storage Transaction units Files Blocks Objects Protocols CIFS, NFS SCSI, FiberChannel, SATA Metadata File-system attributes Recommended for Shared file data Strength Simplified access and management of shared files File-system attributes Transactional data, frequently changing data High performance Web services (XMLbased messaging) Custom metadata Static file data, cloud storage Scalable, distributed access

SAN (Storage Area Network) dedicated high-speed network that interconnects and shares pools of storage devices to multiple servers each server accesses the shared storage as if it were directly attached to it raw storage is treated as a pool of resources which can be centrally managed and allocated highly scalable - capacity can be added as needed disadvantages: cost and complexity Clients Clients Clients Network Storage Network Metadata server

NAS (Network Attached Storage) dedicated file storage device that provides nodes within same network file-based storage via Ethernet connection storage appliance, connected to a network switch reliable, flexible highly scalable network storage speed Clients Clients Clients NAS Storage Network Network

Panasas - object-based storage cluster Performance improves with scale - linear scalability Data protection improves with scale Scalable storage - easy to access, deploy and manage

Panasas - ActiveStor Parallel scale-out NAS storage appliance Complete hardware and software storage solution Implements: Parallel, object-based filesystem Global namespace Strict client cache coherency Network Attached Storage (NAS) - Panasas DirectFlow (pnfs, CIFS, NFS) - rpm package for Linux (also MAC OS supported) Scale-out NAS - serving parallel access to data (data is striped across multiple storage nodes, providing increased capacity and/or performance) Parallel File System - concurrent reading and writing (data for a single file is striped across multiple storage nodes to provide scalable performance to individual files) Global Namespace - all compute nodes accessing the storage see the same namespace (same name and pathname); management is done through 1 system only

Panasas 1 node of Panasas architecture ActiveStor16 Hybrid - Storage media is a combination of HDD and SDD: - Hard Drives - for larger files - SSD (Flash) - for small files or metadata (no moving parts - solid state) Scalable data solution Redundant power modules - two Redundant battery module (power backup) Redundant switch modules (connected to the network, provide access to storage) Storage blade: 2 HDD, 1 Flash, CPU, RAM Mem Director blade: serves File system metadata, legacy protocol (NFS, CIFS) 1 shelf = 1 Director Blade + 10 Storage Blades

Panasas Storage blade: 2 HDD, 1 Flash, CPU, RAM Mem Director blade 1. Serves File System metadata (file s location, owner, permissions, size of file) 2. The gateway to the storage for standard legacy protocols (NFS, CIFS) DirectFlow - direct access between compute clients and storage

ActiveStor16-7 Shelves: - 2 Director Blade per shelf - 9 Storage Blades per shelf Panasas at CRC Total Director Blades: 14 Total Storage Blades: 63 Total Capacity 771.12 TB Data space Used 296.02 TB Metadata Used 8.59 TB Free Space 437.04 TB