Storage Aggregation for Performance & Availability:
|
|
- Daniel Little
- 6 years ago
- Views:
Transcription
1 Storage Aggregation for Performance & Availability: The Path from Physical RAID to Virtual Objects Univ Minnesota, Third Intelligent Storage Workshop Garth Gibson CTO, Panasas, and Assoc Prof, CMU May 4, 2005
2 Cluster Computing: The New Wave Traditional Computing Monolithic Computers Cluster Computing Linux Compute Cluster Issues Complex Scaling Limited BW & I/O Islands of storage Inflexible Expensive Single data path Monolithic Storage? But Parallel data paths Scale to bigger box Scale: File & total BW File & total capacity Load & capacity balancing lower $ / Gbps Slide 2 April 26, 2005
3 Panasas: Next Gen Cluster Storage Scalable performance Parallel data paths to compute nodes Scale clients, network and capacity As capacity grows, performance grows Simplified and dynamic management Robust, shared file access by many clients Seamless growth within single namespace eliminates time-consuming admin tasks Integrated HW/SW solution Optimizes performance and manageability Ease of integration and support ActiveScale Storage Cluster Single Step: Perform job directly from high I/O Panasas Storage Cluster Control path Compute Cluster Parallel data paths Metadata Managers Object Storage Devices Slide 3 April 26, 2005
4 Rooted Firmly in RAID May 4, 2005
5 Birth of RAID ( ) Member of 4th Berkeley RISC CPU design team (SPUR: 84-89) Dave Patterson decides CPU design is a solved problem Sends me to figure out how storage plays in SYSTEM PERFORMANCE IBM 3380 disk is 4 arms in a 7.5 GB washing machine box SLED: Single Large Expensive Disk New PC industry demands cost effective 100 MB 3.5 disks Enabled by new SCSI embedded controller architecture Use many PC disks for parallelism SIGMOD88: A case for RAID PS. $10-20 per MB (~1000X now) 100 MB/arm (~1000X now) IO/sec/arm (5X now) Slide 5 April 26, 2005
6 But RAID is really about Availability Arrays have more Hard Disk Assemblies (HDAs) -- more failures Apply replication and/or error/erasure detection codes Mirroring wastes 50% space; RAID wastes 1/N Mirroring halves, RAID 5 quarters small write bandwidth Slide 6 April 26, 2005
7 Off to CMU & More Availability Parity Declustering spreads RAID groups to reduce MTTR Each parity disk block protects fewer than all data disk blocks (C) Virtualizing RAID group lessens recovery work Faster recovery or better user response time during recovery or mixture of both RAID over X? X = Independent fault domains Disk is easiest X Parity declustering is first step in RAID virtualization Slide 7 April 26, 2005
8 Network-Attached Secure Disks (NASD, 95-99) May 4, 2005
9 Storage Interconnect Evolution Outboard circuitry increases over time (VLSI density) Hardware (#hosts, #disks, #paths) sharing increases over time Logical (information) sharing limited by host SW 1995: Fibrechannel packetizes SCSI over a near general network Outboard circuitry increases over time (VLSI density) Hardware (#hosts, #disks, #paths) sharing increases over time Logical (information) sharing limited by host SW 1995: Fibrechannel packetizes SCSI over a near general network Packetized SCSI will go over any network (iscsi) What do we next do with device intelligence? Slide 9 April 26, 2005
10 Storage as First Class Network Citizen Direct transfer between client and storage Exploit scalable switched cluster area networking Split file service into: primitives (in drive) and policies (in manager) Slide 10 April 26, 2005
11 NASD Architecture Before NASD there was store&forward Server-Attached Disks (SAD) Move access control, consistency out-of-band and cache decisions Raise storage abstraction: encapsulate layout, offload data access Slide 11 April 26, 2005
12 Fine Grain Access Enforcement State of art is VPN of all out-of-band clients, all sharable data and metadata Accident prone & vulnerable to subverted client; analogy to single-address space computing File manager Private Communication NASD Integrity/Privacy Secret Key 2: CapArgs, CapKey 1: Request for access Object Storage uses a digitally signed, objectspecific capabilities on each request CapKey= MAC SecretKey (CapArgs) CapArgs= ObjID, Version, Rights, Expiry,... ReqMAC = MAC CapKey (Req,NonceIn) 3: CapArgs, Req, NonceIn, ReqMAC Client Secret Key NASD 4: Reply, NonceOut, ReplyMAC ReplyMAC = MAC CapKey (Reply,NonceOut) Slide 12 April 26, 2005
13 Scalable File System Taxonomy May 4, 2005
14 Today s Ubiquitous NFS ADVANTAGES Familiar, stable & reliable Widely supported by vendors Competitive market DISADVANTAGES Capacity doesn t scale Bandwidth doesn t scale Cluster by customer-exposed namespace partitioning Clients File Servers Storage Net Disk arrays Host Net Exported Sub- File System Slide 14 April 26, 2005
15 Scale Out 1: Forwarding NFS Servers Bind many file servers into single system image with forwarding Mount point binding less relevant, allows DNS-style balancing, more manageable Control and data traverse mount point path (in band) passing through two servers Single file and single file system bandwidth limited by backend server & storage Tricord, Spinnaker File Server Cluster Clients Disk arrays Host Net Storage Net Slide 15 April 26, 2005
16 Scale Out File Service w/ Out-of-Band Client sees many storage addresses, accesses in parallel Zero file servers in data path allows high bandwidth thru scalable networking Mostly built on block-based SANs where servers trust all clients E.g.: IBM SanFS, IBM GPFS, EMC HighRoad, Sun QFS, SGI CXFS, etc Beginning to be built on Object storage (SANs too) E.g. Panasas DirectFlow, Lustre, Clients Storage File Servers Slide 16 April 26, 2005
17 Object Storage Standards May 4, 2005
18 Object Storage Architecture An evolutionary improvement to standard SCSI storage interface (OSD) Offload most data path work from server to intelligent storage Finer granularity of security: protect & manage one file at a time Raises level of abstraction: Object is container for related data Operations: Read block Write block Storage understands how blocks of a file are related -> self-management Per Object Extensible Attributes key expansion of storage functionality Addressing: Block range Allocation: External Block Based Disk Security At Volume Level Slide 18 April 26, 2005 Operations: Create object Delete object Read object Write object Addressing: [object, byte range] Allocation: Internal Object Based Disk Security At Source: Intel Object Level
19 Why I Like Objects Objects are flexible, autonomous Variable length data with layout metadata encapsulated SNIA Shared Storage Model, 2001 Share one access control decision Amortize object metadata Defer allocation, smarter caching With extensible attributes E.g. size, timestamps, ACLs, + Some updated inline by device Extensible programming model Metadata server decisions are signed & cached at clients, enforced at device Rights and object map small relative to block allocation map Clients can be untrusted b/c limited exposure of bugs & attacks (only authorized object data) Cache decisions (& maps) replaced transparently -- dynamic remapping -- virtualization Slide 19 April 26, 2005
20 OSD is now an ANSI Standard Panasas NSIC NASD SNIA/T10 OSD OSD Standard CMU NASD Lustre Key Players: T10/SCSI ratifies OSD command set 9/27/04 Co-chaired by IBM and Seagate, protocol is a general framework (transport independent) & KEY: Resulting standard will build options for customers looking to deploy objects Slide 20 April 26, 2005
21 ActiveScale Storage Cluster May 4, 2005
22 Object Storage Systems Expect wide variety of Object Storage Devices Disk array subsystem Ie. LLNL with Lustre Smart disk for objects 2 SATA disks 240/500 GB Prototype Seagate OSD Highly integrated, single disk Orchestrates system activity Balances objects across OSDs Stores up to 5 TBs per shelf 16-Port GE Switch Blade 4 Gbps per shelf to cluster Slide 22 April 26, 2005
23 BladeServer Storage Cluster Integrated GE Switch Shelf Front 1 DB, 10 SB Battery Module (2 Power units) Shelf Rear DirectorBlade Midplane routes GE, power StorageBlade Slide 23 April 26, 2005
24 How Do Panasas Objects Scale? Scale capacity, bandwidth, reliability by striping according to small map File Comprised of: User Data Attributes Layout DATA Scalable Object Map 1. Purple OSD & Object 2. Gold OSD & Object 3. Red OSD & Object Plus stripe size, RAID level Slide 24 April 26, 2005
25 Object Storage Bandwidth Scalable Bandwidth demonstrated with GE switching GB/sec Object Storage Devices Lab results Slide 25 April 26, 2005
26 ActiveScale SW Architecture Mgr DB ActiveScale DirectorBlades Realm & Performance Mgrs + Web Mgmt Server Virtual Sub Mgrs File File File Mgr Mgr Mgr Quota Quota Stor Stor Stor Mgr Mgr Mgr Mgr Mgr Mgr RPC ActiveScale Protocol Servers NDMP Buffer Cache DirectFLOW Client RAID NFS NFS CIFS CIFS OSD / iscsi TCP/IP NTP + DHCP Server DirectFLOW DirectFLOW DirectFLOW StorageBlades Mgmt Agent DFLOW fs DFLOW fs RAID DFLOW 0 fs RAID 0 RAID 0 Zero NVRAM Zero Copy NVRAM Zero Copy Cache NVRAM Copy Cache Cache OSD / iscsi TCP/IP DirectFLOW DirectFLOW DirectFLOW NFS CIFS UNIX Windows POSIX App NT App Slide 26 April 26, 2005 Direct- FLOW RAID OSD / iscsi TCP/IP POSIX App Local Buffer Cache DirectFLOW Client Linux VFS
27 DirectFLOW Client Standard installable file system for Linux POSIX syscalls with common performance exceptions Reduce serialization & media updates (e.g. durable read timestamps) Add create parameters for file width, RAID level, stripe unit size Add open in concurrent write mode; where app handles consistency Mount options Local (NFS) mount anywhere in client, or Global (AFS) mount at /panfs on all clients Cache consistency and callbacks Need a callback on a file in order to cache any file state Exclusive, read-only, read-write (non cacheable), concurrent write Callbacks are really leases so AWOL clients lose promises Slide 27 April 26, 2005
28 Fault Tolerance Overall up/down state of blades Subset of managers track overall state with heartbeats Maintain identical state with quorum/consensus Per file RAID: no parity for unused capacity RAID level per file; small files mirror RAID5 for large files First step toward policy quality of storage associated w/ data Scalable rebuilt rate Large files striped on all blades Recon XOR on all managers Rebuilt files spread on all blades Slide 28 April 26, 2005
29 Manageable Storage Clusters Snapshots: consistency for copying, backing up Copy-on-write duplication of contents of objects Named as /.snapshot/juliansnaptimestamp/filename Snaps can be scheduled, auto-deleted Soft volumes: grow management without physical constraints Volumes can be quota bounded, unbounded, or just send on threshold Multiple volumes can share space of a set of shelves (double disk failure domain) Capacity and load balancing: seamless use of growing set of blades All blades track capacity & load; manager aggregates & ages utilization metrics Unbalanced systems influence allocation; can trigger moves Adding a blade simply makes a system unbalanced for awhile Slide 29 April 26, 2005
30 Out-of-band DF & Clustered NAS Slide 30 April 26, 2005
31 Scale Out 1b: Any NFS Server Will Do Single server does all data transfer in single system image Servers share access to all storage and hand off role of accessing storage Control and data traverse mount point path (in band) passing through one server Hand off cheap: difference between 100% & 0% colocated front & back end was about zero SFS97 NFS op/s and < 10% latency increase for Panasas NFS Allows single file & single filesystem capacity & bandwidth to scale with server cluster Clients File Server Cluster Disk arrays Host Net Storage Net Slide 31 April 26, 2005
32 Performance & Scalability for all Workloads Objects: breakthrough data throughput AND random I/O Slide 32 April 26, 2005 Source: SPEC.org & Panasas
33 ActiveScale In Practice May 4, 2005
34 Los Alamos Lightning 1400 nodes and 60TB (120 TB): Ability to deliver ~ 3 GB/s* * (~6 GB/s) Panasas 12 shelves Lightning 1400 nodes 4 4 Switch 4 Slide 34 April 26, 2005
35 Progress in the Field Seismic Processing Genomics Sequencing HPC Simulations Post-Production Rendering Slide 35 April 26, 2005
36 pnfs - Parallel NFS May 4, 2005
37 Out-of-Band Interoperability Issues ADVANTAGES Capacity scaling Faster bandwidth scaling DISADVANTAGES Requires client kernel addition Many non-interoperable solutions Not necessarily able to replace NFS EXAMPLE FEATURES POSIX Plus & Minus Global mount point Fault tolerant cache coherence RAID 0, 1, 5 & snapshots Distributed metadata and online growth, upgrade Clients Vendor X Kernel Patch/RPM Storage Vendor X File Servers Slide 37 April 26, 2005
38 Standards Efforts: Parallel NFS (pnfs) December 2003 U. Michigan workshop Whitepapers at U. Michigan CITI, EMC, IBM, Johns Hopkins, Network Appliance, Panasas, Spinnaker Networks, Veritas, ZForce NFSv4 extensions enabling clients to access data in parallel storage devices of multiple standard flavors Parallel NFS request routing / file virtualization Extend block or object layout information to clients, enabling parallel direct access IETF pnfs Documents: draft-gibson-pnfs-problem-statement-01.txt draft-gibson-pnfs-reqs-00.txt draft-welch-pnfs-ops-01.txt Slide 38 April 26, 2005
39 Multiple Data Server Protocols Inclusiveness favors success Three (or more) flavors of out-ofband metadata attributes: BLOCKS: SBC/FCP/FC or SBC/iSCSI for files built on blocks OBJECTS: OSD/iSCSI/TCP/IP/GE for files built on objects FILES: NFS/ONCRPC/TCP/IP/GE for files built on subfiles Inode-level encapsulation in server and client code NFSv4 extended w/ orthogonal disk metadata attributes pnfs server Local File system Client Apps pnfs IFS pnfs Disk driver disk metadata grant & revoke 1. SBC (blocks) 2. OSD (objects) 3. NFS (files) Slide 39 April 26, 2005
40 Object Storage Value Proposition NFS NFS CIFS DirectFLOW Linux Compute Cluster DirectFLOW Panasas Storage Cluster Large datasets, checkpointing Analysis & Simulation Results Gateway Scalable Shared Storage Value Proposition Larger surveys; more active analysis Applications leverage full bandwidth Simplified storage management Results gateway Slide 40 April 26, 2005 Analysis & Visualization Tools
41 Next Generation Network Storage Garth Gibson May 4, 2005
Object Storage: Redefining Bandwidth for Linux Clusters
Object Storage: Redefining Bandwidth for Linux Clusters Brent Welch Principal Architect, Inc. November 18, 2003 Blocks, Files and Objects Block-base architecture: fast but private Traditional SCSI and
More informationNASD: Network-Attached Secure Disks
NASD: Network-Attached Secure Disks Garth Gibson garth.gibson@cmu.edu also David Nagle, Khalil Amiri,Jeff Butler, Howard Gobioff, Charles Hardin, Nat Lanza, Erik Riedel, David Rochberg, Chris Sabol, Marc
More informationThe Last Bottleneck: How Parallel I/O can improve application performance
The Last Bottleneck: How Parallel I/O can improve application performance HPC ADVISORY COUNCIL STANFORD WORKSHOP; DECEMBER 6 TH 2011 REX TANAKIT DIRECTOR OF INDUSTRY SOLUTIONS AGENDA Panasas Overview Who
More informationObject-based Storage (OSD) Architecture and Systems
Object-based Storage (OSD) Architecture and Systems Erik Riedel, Seagate Technology April 2007 Abstract Object-based Storage (OSD) Architecture and Systems The Object-based Storage Device interface standard
More informationParallel NFS (pnfs) Garth Gibson CTO & co-founder, Panasas Assoc. Professor, Carnegie Mellon University.
Parallel NFS (pnfs) Garth Gibson CTO & co-founder, Panasas Assoc. Professor, Carnegie Mellon University www.panasas.com www.panasas.com Panasas Parallel Storage: A Little Background Panasas Storage Cluster
More informationBrent Welch. Director, Architecture. Panasas Technology. HPC Advisory Council Lugano, March 2011
Brent Welch Director, Architecture Panasas Technology HPC Advisory Council Lugano, March 2011 Panasas Background Technology based on Object Storage concepts invented by company founder, Garth Gibson, a
More informationThe Last Bottleneck: How Parallel I/O can attenuate Amdahl's Law
The Last Bottleneck: How Parallel I/O can attenuate Amdahl's Law ERESEARCH AUSTRALASIA, NOVEMBER 2011 REX TANAKIT DIRECTOR OF INDUSTRY SOLUTIONS AGENDA Parallel System Parallel processing goes mainstream
More informationHPC File Systems and Storage. Irena Johnson University of Notre Dame Center for Research Computing
HPC File Systems and Storage Irena Johnson University of Notre Dame Center for Research Computing HPC (High Performance Computing) Aggregating computer power for higher performance than that of a typical
More informationSystem input-output, performance aspects March 2009 Guy Chesnot
Headline in Arial Bold 30pt System input-output, performance aspects March 2009 Guy Chesnot Agenda Data sharing Evolution & current tendencies Performances: obstacles Performances: some results and good
More informationpnfs Update A standard for parallel file systems HPC Advisory Council Lugano, March 2011
pnfs Update A standard for parallel file systems HPC Advisory Council Lugano, March 2011 Brent Welch welch@panasas.com Panasas, Inc. 1 Why a Standard for Parallel I/O? NFS is the only network file system
More informationObject-based Storage Devices (OSD) T10 Standard
Object-based Storage Devices (OSD) T10 Standard Erik Riedel Seagate Research Motivation for OSD Improved device and data sharing Platform-dependent metadata moved to device Systems need only agree on naming
More informationLustre A Platform for Intelligent Scale-Out Storage
Lustre A Platform for Intelligent Scale-Out Storage Rumi Zahir, rumi. May 2003 rumi.zahir@intel.com Agenda Problem Statement Trends & Current Data Center Storage Architectures The Lustre File System Project
More informationCoordinating Parallel HSM in Object-based Cluster Filesystems
Coordinating Parallel HSM in Object-based Cluster Filesystems Dingshan He, Xianbo Zhang, David Du University of Minnesota Gary Grider Los Alamos National Lab Agenda Motivations Parallel archiving/retrieving
More informationComing Changes in Storage Technology. Be Ready or Be Left Behind
Coming Changes in Storage Technology Be Ready or Be Left Behind Henry Newman, CTO Instrumental Inc. hsn@instrumental.com Copyright 2008 Instrumental, Inc. 1of 32 The Future Will Be Different The storage
More informationCA485 Ray Walshe Google File System
Google File System Overview Google File System is scalable, distributed file system on inexpensive commodity hardware that provides: Fault Tolerance File system runs on hundreds or thousands of storage
More informationHPC I/O and File Systems Issues and Perspectives
HPC I/O and File Systems Issues and Perspectives Gary Grider, LANL LA-UR-06-0473 01/2006 Why do we need so much HPC Urgency and importance of Stockpile Stewardship mission and schedule demand multiphysics
More informationThe File Systems Evolution. Christian Bandulet, Sun Microsystems
The s Evolution Christian Bandulet, Sun Microsystems SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA. Member companies and individuals may use this material in presentations
More informationParallel Data Lab Research Overview
Parallel Data Lab Research Overview Garth A. Gibson http://www.cs.cmu.edu/web/groups/pdl/ Reliable, Parallel Storage Subsystems configurable architectures; rapid prototyping Discovering and Managing Storage
More informationpnfs and Linux: Working Towards a Heterogeneous Future
CITI Technical Report 06-06 pnfs and Linux: Working Towards a Heterogeneous Future Dean Hildebrand dhildebz@umich.edu Peter Honeyman honey@umich.edu ABSTRACT Anticipating terascale and petascale HPC demands,
More informationCurrent Topics in OS Research. So, what s hot?
Current Topics in OS Research COMP7840 OSDI Current OS Research 0 So, what s hot? Operating systems have been around for a long time in many forms for different types of devices It is normally general
More informationGFS: The Google File System
GFS: The Google File System Brad Karp UCL Computer Science CS GZ03 / M030 24 th October 2014 Motivating Application: Google Crawl the whole web Store it all on one big disk Process users searches on one
More informationCustomer Success Story Los Alamos National Laboratory
Customer Success Story Los Alamos National Laboratory Panasas High Performance Storage Powers the First Petaflop Supercomputer at Los Alamos National Laboratory Case Study June 2010 Highlights First Petaflop
More informationIntroducing Panasas ActiveStor 14
Introducing Panasas ActiveStor 14 SUPERIOR PERFORMANCE FOR MIXED FILE SIZE ENVIRONMENTS DEREK BURKE, PANASAS EUROPE INTRODUCTION TO PANASAS Storage that accelerates the world s highest performance and
More informationHPC Considerations for Scalable Multidiscipline CAE Applications on Conventional Linux Platforms. Author: Correspondence: ABSTRACT:
HPC Considerations for Scalable Multidiscipline CAE Applications on Conventional Linux Platforms Author: Stan Posey Panasas, Inc. Correspondence: Stan Posey Panasas, Inc. Phone +510 608 4383 Email sposey@panasas.com
More informationNFS-Ganesha w/gpfs FSAL And Other Use Cases
NFS-Ganesha w/gpfs FSAL And Other Use Cases Marc Eshel, JV, Jim, Philippe 2/26/2013 1 SN MP via Age ntx (opt ion al) Log gin g Stat s Ad min GS SA PI The modules within NFS-Ganesha Has h Tab les clients
More informationAn Introduction to GPFS
IBM High Performance Computing July 2006 An Introduction to GPFS gpfsintro072506.doc Page 2 Contents Overview 2 What is GPFS? 3 The file system 3 Application interfaces 4 Performance and scalability 4
More informationDistributed File Systems II
Distributed File Systems II To do q Very-large scale: Google FS, Hadoop FS, BigTable q Next time: Naming things GFS A radically new environment NFS, etc. Independence Small Scale Variety of workloads Cooperation
More informationSun N1: Storage Virtualization and Oracle
OracleWorld 2003 Session 36707 - Sun N1: Storage Virtualization and Oracle Glenn Colaco Performance Engineer Sun Microsystems Performance and Availability Engineering September 9, 2003 Background PAE works
More informationFile Server Scaling with Network-Attached Secure Disks (NASD)
File Server Scaling with Network-Attached Secure Disks (NASD) Garth A. Gibson, CMU, http://www.pdl.cs.cmu.edu/nasd David Nagle, Khalil Amiri, Fay Chang, Eugene Feinberg, Howard Gobioff, Chen Lee, Berend
More informationCS370 Operating Systems
CS370 Operating Systems Colorado State University Yashwant K Malaiya Spring 2018 Lecture 24 Mass Storage, HDFS/Hadoop Slides based on Text by Silberschatz, Galvin, Gagne Various sources 1 1 FAQ What 2
More informationThe Parallel NFS Bugaboo. Andy Adamson Center For Information Technology Integration University of Michigan
The Parallel NFS Bugaboo Andy Adamson Center For Information Technology Integration University of Michigan Bugaboo? Bugaboo a source of concern. the old bugaboo of inflation still bothers them [n] an imaginary
More informationpnfs BOF SC
pnfs BOF SC08 2008-11-19 Spencer Shepler, StorSpeed Bruce Fields, CITI (University of Michigan) Sorin Faibish, EMC Roger Haskin, IBM Ken Gibson, LSI Joshua Konkle, NetApp Brent Welch, Panasas Bill Baker,
More informationShared Object-Based Storage and the HPC Data Center
Shared Object-Based Storage and the HPC Data Center Jim Glidewell High Performance Computing BOEING is a trademark of Boeing Management Company. Computing Environment Cray X1 2 Chassis, 128 MSPs, 1TB memory
More informationPANASAS TIERED PARITY ARCHITECTURE
PANASAS TIERED PARITY ARCHITECTURE Larry Jones, Matt Reid, Marc Unangst, Garth Gibson, and Brent Welch White Paper May 2010 Abstract Disk drives are approximately 250 times denser today than a decade ago.
More informationpnfs: Blending Performance and Manageability
pnfs: Blending Performance and Manageability Lisa Week and Piyush Shivam Sun Microsystems Data-Intensive Applications Examples: Data mining, oil and gas, weather modeling 100s to 1000s of compute nodes..
More informationNFS: Naming indirection, abstraction. Abstraction, abstraction, abstraction! Network File Systems: Naming, cache control, consistency
Abstraction, abstraction, abstraction! Network File Systems: Naming, cache control, consistency Local file systems Disks are terrible abstractions: low-level blocks, etc. Directories, files, links much
More informationStorage Area Network (SAN)
Storage Area Network (SAN) 1 Outline Shared Storage Architecture Direct Access Storage (DAS) SCSI RAID Network Attached Storage (NAS) Storage Area Network (SAN) Fiber Channel and Fiber Channel Switch 2
More informationParallel File Systems Compared
Parallel File Systems Compared Computing Centre (SSCK) University of Karlsruhe, Germany Laifer@rz.uni-karlsruhe.de page 1 Outline» Parallel file systems (PFS) Design and typical usage Important features
More informationA GPFS Primer October 2005
A Primer October 2005 Overview This paper describes (General Parallel File System) Version 2, Release 3 for AIX 5L and Linux. It provides an overview of key concepts which should be understood by those
More informationCOS 318: Operating Systems. NSF, Snapshot, Dedup and Review
COS 318: Operating Systems NSF, Snapshot, Dedup and Review Topics! NFS! Case Study: NetApp File System! Deduplication storage system! Course review 2 Network File System! Sun introduced NFS v2 in early
More informationOpen Source Storage. Ric Wheeler Architect & Senior Manager April 30, 2012
Open Source Storage Architect & Senior Manager rwheeler@redhat.com April 30, 2012 1 Linux Based Systems are Everywhere Used as the base for commercial appliances Enterprise class appliances Consumer home
More informationEMC Celerra CNS with CLARiiON Storage
DATA SHEET EMC Celerra CNS with CLARiiON Storage Reach new heights of availability and scalability with EMC Celerra Clustered Network Server (CNS) and CLARiiON storage Consolidating and sharing information
More informationObject-Based Cluster Storage Systems. Brent Welch & David Nagle Panasas, Inc
Object-Based Cluster Storage Systems Brent Welch & David Nagle {welch,dnagle}@panasas.com Panasas, Inc New Object Storage Architecture Raises storage s level of abstraction From logical blocks to objects
More informationToday: Coda, xfs! Brief overview of other file systems. Distributed File System Requirements!
Today: Coda, xfs! Case Study: Coda File System Brief overview of other file systems xfs Log structured file systems Lecture 21, page 1 Distributed File System Requirements! Transparency Access, location,
More informationpnfs, parallel storage for grid and enterprise computing Joshua Konkle, NetApp, Inc.
pnfs, parallel storage for grid and enterprise computing Joshua Konkle, NetApp, Inc. SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA. Member companies and individuals
More informationShared Parallel Filesystems in Heterogeneous Linux Multi-Cluster Environments
LCI HPC Revolution 2005 26 April 2005 Shared Parallel Filesystems in Heterogeneous Linux Multi-Cluster Environments Matthew Woitaszek matthew.woitaszek@colorado.edu Collaborators Organizations National
More informationData Management. Parallel Filesystems. Dr David Henty HPC Training and Support
Data Management Dr David Henty HPC Training and Support d.henty@epcc.ed.ac.uk +44 131 650 5960 Overview Lecture will cover Why is IO difficult Why is parallel IO even worse Lustre GPFS Performance on ARCHER
More informationRAIDIX Data Storage Solution. Clustered Data Storage Based on the RAIDIX Software and GPFS File System
RAIDIX Data Storage Solution Clustered Data Storage Based on the RAIDIX Software and GPFS File System 2017 Contents Synopsis... 2 Introduction... 3 Challenges and the Solution... 4 Solution Architecture...
More informationNFSv4.1 Using pnfs PRESENTATION TITLE GOES HERE. Presented by: Alex McDonald CTO Office, NetApp
NFSv4.1 Using pnfs PRESENTATION TITLE GOES HERE Presented by: Alex McDonald CTO Office, NetApp Webcast Presenter Alex McDonald joined NetApp in 2005, after more than 30 years in a variety of roles with
More informationParallel File Systems for HPC
Introduction to Scuola Internazionale Superiore di Studi Avanzati Trieste November 2008 Advanced School in High Performance and Grid Computing Outline 1 The Need for 2 The File System 3 Cluster & A typical
More informationLustre overview and roadmap to Exascale computing
HPC Advisory Council China Workshop Jinan China, October 26th 2011 Lustre overview and roadmap to Exascale computing Liang Zhen Whamcloud, Inc liang@whamcloud.com Agenda Lustre technology overview Lustre
More informationParallel NFS Dr. Oliver Tennert, Head of Technology. sambaxp 2008 Göttingen,
Parallel NFS Dr. Oliver Tennert, Head of Technology sambaxp 2008 Göttingen, 18.04.2008 Overview Motivation: Why Parallel NFS? What is pnfs? How does it work? Some numbers 2 Siberia 30.06.1908 massive explosion
More informationDistributed File Systems
Distributed File Systems Today l Basic distributed file systems l Two classical examples Next time l Naming things xkdc Distributed File Systems " A DFS supports network-wide sharing of files and devices
More informationUK LUG 10 th July Lustre at Exascale. Eric Barton. CTO Whamcloud, Inc Whamcloud, Inc.
UK LUG 10 th July 2012 Lustre at Exascale Eric Barton CTO Whamcloud, Inc. eeb@whamcloud.com Agenda Exascale I/O requirements Exascale I/O model 3 Lustre at Exascale - UK LUG 10th July 2012 Exascale I/O
More informationIntroduction to iscsi
Introduction to iscsi As Ethernet begins to enter into the Storage world a new protocol has been getting a lot of attention. The Internet Small Computer Systems Interface or iscsi, is an end-to-end protocol
More informationSee what s new: Data Domain Global Deduplication Array, DD Boost and more. Copyright 2010 EMC Corporation. All rights reserved.
See what s new: Data Domain Global Deduplication Array, DD Boost and more 2010 1 EMC Backup Recovery Systems (BRS) Division EMC Competitor Competitor Competitor Competitor Competitor Competitor Competitor
More informationHP AutoRAID (Lecture 5, cs262a)
HP AutoRAID (Lecture 5, cs262a) Ali Ghodsi and Ion Stoica, UC Berkeley January 31, 2018 (based on slide from John Kubiatowicz, UC Berkeley) Array Reliability Reliability of N disks = Reliability of 1 Disk
More informationMass-Storage Structure
Operating Systems (Fall/Winter 2018) Mass-Storage Structure Yajin Zhou (http://yajin.org) Zhejiang University Acknowledgement: some pages are based on the slides from Zhi Wang(fsu). Review On-disk structure
More informationFast Forward I/O & Storage
Fast Forward I/O & Storage Eric Barton Lead Architect 1 Department of Energy - Fast Forward Challenge FastForward RFP provided US Government funding for exascale research and development Sponsored by 7
More informationScalable I/O, File Systems, and Storage Networks R&D at Los Alamos LA-UR /2005. Gary Grider CCN-9
Scalable I/O, File Systems, and Storage Networks R&D at Los Alamos LA-UR-05-2030 05/2005 Gary Grider CCN-9 Background Disk2500 TeraBytes Parallel I/O What drives us? Provide reliable, easy-to-use, high-performance,
More informationToward An Integrated Cluster File System
Toward An Integrated Cluster File System Adrien Lebre February 1 st, 2008 XtreemOS IP project is funded by the European Commission under contract IST-FP6-033576 Outline Context Kerrighed and root file
More informationHIGH PERFORMANCE COMPUTING: MODELS, METHODS, & MEANS PARALLEL FILE I/O 1
Prof. Thomas Sterling Chirag Dekate Department of Computer Science Louisiana State University March 29 th, 2011 HIGH PERFORMANCE COMPUTING: MODELS, METHODS, & MEANS PARALLEL FILE I/O 1 Topics Introduction
More informationDeploying Celerra MPFSi in High-Performance Computing Environments
Deploying Celerra MPFSi in High-Performance Computing Environments Best Practices Planning Abstract This paper describes the EMC Celerra MPFSi approach to scalable high-performance data sharing, with a
More informationThe advantages of architecting an open iscsi SAN
Storage as it should be The advantages of architecting an open iscsi SAN Pete Caviness Lefthand Networks, 5500 Flatiron Parkway, Boulder CO 80301, Ph: +1-303-217-9043, FAX: +1-303-217-9020 e-mail: pete.caviness@lefthandnetworks.com
More informationGFS: The Google File System. Dr. Yingwu Zhu
GFS: The Google File System Dr. Yingwu Zhu Motivating Application: Google Crawl the whole web Store it all on one big disk Process users searches on one big CPU More storage, CPU required than one PC can
More informationNext-Generation NVMe-Native Parallel Filesystem for Accelerating HPC Workloads
Next-Generation NVMe-Native Parallel Filesystem for Accelerating HPC Workloads Liran Zvibel CEO, Co-founder WekaIO @liranzvibel 1 WekaIO Matrix: Full-featured and Flexible Public or Private S3 Compatible
More informationAn introduction to GPFS Version 3.3
IBM white paper An introduction to GPFS Version 3.3 Scott Fadden, IBM Corporation Contents 1 Overview 2 What is GPFS? 2 The file system 2 Application interfaces 3 Performance and scalability 3 Administration
More informationThe Evolution of File Systems
Presenter: Thomas Rivera Senior Technical Associate, Hitachi Systems Author: Christian Bandulet Principal Engineer, Oracle SNIA Legal Notice The material contained in this tutorial is copyrighted by the
More informationDELL POWERVAULT NX3500. A Dell Technical Guide Version 1.0
DELL POWERVAULT NX3500 A Dell Technical Guide Version 1.0 DELL PowerVault NX3500, A Dell Technical Guide THIS TRANSITION GUIDE IS FOR INFORMATIONAL PURPOSES ONLY, AND MAY CONTAIN TYPOGRAPHICAL ERRORS AND
More informationStructuring PLFS for Extensibility
Structuring PLFS for Extensibility Chuck Cranor, Milo Polte, Garth Gibson PARALLEL DATA LABORATORY Carnegie Mellon University What is PLFS? Parallel Log Structured File System Interposed filesystem b/w
More informationCrossing the Chasm: Sneaking a parallel file system into Hadoop
Crossing the Chasm: Sneaking a parallel file system into Hadoop Wittawat Tantisiriroj Swapnil Patil, Garth Gibson PARALLEL DATA LABORATORY Carnegie Mellon University In this work Compare and contrast large
More informationStorage Performance Through Standards
Storage Performance Through Standards Bjorn Andersson HPC Advisory Workshop, March 21-23 2011 Lugano, Switzerland Who is BlueArc Private Company, founded in 1998 Headquarters in San Jose, CA Highest Performing
More informationCephFS A Filesystem for the Future
CephFS A Filesystem for the Future David Disseldorp Software Engineer ddiss@suse.com Jan Fajerski Software Engineer jfajerski@suse.com Introduction to Ceph Distributed storage system based on RADOS Scalable
More informationCSE 153 Design of Operating Systems
CSE 153 Design of Operating Systems Winter 2018 Lecture 22: File system optimizations and advanced topics There s more to filesystems J Standard Performance improvement techniques Alternative important
More informationChapter 11: Implementing File Systems
Chapter 11: Implementing File Systems Operating System Concepts 99h Edition DM510-14 Chapter 11: Implementing File Systems File-System Structure File-System Implementation Directory Implementation Allocation
More informationGlamour: An NFSv4-based File System Federation
Glamour: An NFSv4-based File System Federation Jon Haswell Mgr NAS Systems IBM Almaden Research Center haswell@us.ibm.com Based on work by Carl Burnett, Jim Myers, Manoj Naik, Steven Parkes, Renu Tewari,
More informationStorage Optimization with Oracle Database 11g
Storage Optimization with Oracle Database 11g Terabytes of Data Reduce Storage Costs by Factor of 10x Data Growth Continues to Outpace Budget Growth Rate of Database Growth 1000 800 600 400 200 1998 2000
More informationDell Fluid File System Version 6.0 Support Matrix
Dell Fluid File System Version 6.0 Support Matrix Notes, Cautions, and Warnings NOTE: A NOTE indicates important information that helps you make better use of your product. CAUTION: A CAUTION indicates
More informationLinux File Systems: Challenges and Futures Ric Wheeler Red Hat
Linux File Systems: Challenges and Futures Ric Wheeler Red Hat Overview The Linux Kernel Process What Linux Does Well Today New Features in Linux File Systems Ongoing Challenges 2 What is Linux? A set
More informationClient Server & Distributed System. A Basic Introduction
Client Server & Distributed System A Basic Introduction 1 Client Server Architecture A network architecture in which each computer or process on the network is either a client or a server. Source: http://webopedia.lycos.com
More informationThe ASCI/DOD Scalable I/O History and Strategy Run Time Systems and Scalable I/O Team Gary Grider CCN-8 Los Alamos National Laboratory LAUR
The ASCI/DOD Scalable I/O History and Strategy Run Time Systems and Scalable I/O Team Gary Grider CCN-8 Los Alamos National Laboratory LAUR 042787 05/2004 Parallel File Systems and Parallel I/O Why - From
More informationWhy software defined storage matters? Sergey Goncharov Solution Architect, Red Hat
Why software defined storage matters? Sergey Goncharov Solution Architect, Red Hat sgonchar@redhat.com AGENDA Storage and Datacenter evolution Red Hat Storage portfolio Red Hat Gluster Storage Red Hat
More informationCloud Computing CS
Cloud Computing CS 15-319 Distributed File Systems and Cloud Storage Part I Lecture 12, Feb 22, 2012 Majd F. Sakr, Mohammad Hammoud and Suhail Rehman 1 Today Last two sessions Pregel, Dryad and GraphLab
More information! Design constraints. " Component failures are the norm. " Files are huge by traditional standards. ! POSIX-like
Cloud background Google File System! Warehouse scale systems " 10K-100K nodes " 50MW (1 MW = 1,000 houses) " Power efficient! Located near cheap power! Passive cooling! Power Usage Effectiveness = Total
More informationCOM Verification. PRESENTATION TITLE GOES HERE Alan G. Yoder, Ph.D. SNIA Technical Council Huawei Technologies, LLC
COM Verification PRESENTATION TITLE GOES HERE Alan G. Yoder, Ph.D. SNIA Technical Council Huawei Technologies, LLC Outline COM overview How they work Verifying the COMs SNIA Emerald TM Training ~ June
More informationCOS 318: Operating Systems. Journaling, NFS and WAFL
COS 318: Operating Systems Journaling, NFS and WAFL Jaswinder Pal Singh Computer Science Department Princeton University (http://www.cs.princeton.edu/courses/cos318/) Topics Journaling and LFS Network
More informationpnfs, parallel storage for grid, virtualization and database computing Joshua Konkle, Chair NFS SIG
pnfs, parallel storage for grid, virtualization and database computing Joshua Konkle, Chair NFS SIG SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA. Member companies
More informationChapter 10: Mass-Storage Systems
Chapter 10: Mass-Storage Systems Silberschatz, Galvin and Gagne 2013 Chapter 10: Mass-Storage Systems Overview of Mass Storage Structure Disk Structure Disk Attachment Disk Scheduling Disk Management Swap-Space
More informationChapter 12: File System Implementation
Chapter 12: File System Implementation Chapter 12: File System Implementation File-System Structure File-System Implementation Directory Implementation Allocation Methods Free-Space Management Efficiency
More informationChapter 10: Mass-Storage Systems. Operating System Concepts 9 th Edition
Chapter 10: Mass-Storage Systems Silberschatz, Galvin and Gagne 2013 Chapter 10: Mass-Storage Systems Overview of Mass Storage Structure Disk Structure Disk Attachment Disk Scheduling Disk Management Swap-Space
More informationSystem that permanently stores data Usually layered on top of a lower-level physical storage medium Divided into logical units called files
System that permanently stores data Usually layered on top of a lower-level physical storage medium Divided into logical units called files Addressable by a filename ( foo.txt ) Usually supports hierarchical
More informationComparing File (NAS) and Block (SAN) Storage
Comparing File (NAS) and Block (SAN) Storage January 2014 Contents Abstract... 3 Introduction... 3 Network-Attached Storage... 3 Storage Area Network... 4 Networks and Storage... 4 Network Roadmaps...
More informationCOSC 6374 Parallel Computation. Parallel I/O (I) I/O basics. Concept of a clusters
COSC 6374 Parallel I/O (I) I/O basics Fall 2010 Concept of a clusters Processor 1 local disks Compute node message passing network administrative network Memory Processor 2 Network card 1 Network card
More informationCrossing the Chasm: Sneaking a parallel file system into Hadoop
Crossing the Chasm: Sneaking a parallel file system into Hadoop Wittawat Tantisiriroj Swapnil Patil, Garth Gibson PARALLEL DATA LABORATORY Carnegie Mellon University In this work Compare and contrast large
More informationStorage and File System
COS 318: Operating Systems Storage and File System Andy Bavier Computer Science Department Princeton University http://www.cs.princeton.edu/courses/archive/fall10/cos318/ Topics Storage hierarchy File
More informationOPERATING SYSTEM. Chapter 12: File System Implementation
OPERATING SYSTEM Chapter 12: File System Implementation Chapter 12: File System Implementation File-System Structure File-System Implementation Directory Implementation Allocation Methods Free-Space Management
More informationChapter 10: File System Implementation
Chapter 10: File System Implementation Chapter 10: File System Implementation File-System Structure" File-System Implementation " Directory Implementation" Allocation Methods" Free-Space Management " Efficiency
More informationiscsi Technology Brief Storage Area Network using Gbit Ethernet The iscsi Standard
iscsi Technology Brief Storage Area Network using Gbit Ethernet The iscsi Standard On February 11 th 2003, the Internet Engineering Task Force (IETF) ratified the iscsi standard. The IETF was made up of
More informationDistributed Systems 16. Distributed File Systems II
Distributed Systems 16. Distributed File Systems II Paul Krzyzanowski pxk@cs.rutgers.edu 1 Review NFS RPC-based access AFS Long-term caching CODA Read/write replication & disconnected operation DFS AFS
More informationMassively Scalable File Storage. Philippe Nicolas, KerStor
Philippe Nicolas, KerStor SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA. Member companies and individuals may use this material in presentations and literature under
More information