High Performance Computing Course Notes High Performance Storage

Similar documents
Associate Professor Dr. Raed Ibraheem Hamed

Mass-Storage Structure

Definition of RAID Levels

Storage Devices for Database Systems

CSE 120. Operating Systems. March 27, 2014 Lecture 17. Mass Storage. Instructor: Neil Rhodes. Wednesday, March 26, 14

Section 5 MUST BE COMPLETED BY: 12/5

CSE 380 Computer Operating Systems

CSE 451: Operating Systems Winter Redundant Arrays of Inexpensive Disks (RAID) and OS structure. Gary Kimura

Today: Secondary Storage! Typical Disk Parameters!

Failover Solutions with Open-E Data Storage Server (DSS V6)

Chapter 11: File System Implementation. Objectives

CS 554: Advanced Database System

COSC 6385 Computer Architecture Storage Systems

Storage and File Structure

I/O Management and Disk Scheduling. Chapter 11

Computer Architecture 计算机体系结构. Lecture 6. Data Storage and I/O 第六讲 数据存储和输入输出. Chao Li, PhD. 李超博士

Linux+ Guide to Linux Certification, Third Edition. Chapter 6 Advanced Installation

Chapter 10 Storage and File Structure

Storage System COSC UCB

Chapter 12: Mass-Storage

IST346. Data Storage

Storage Update and Storage Best Practices for Microsoft Server Applications. Dennis Martin President, Demartek January 2009 Copyright 2009 Demartek

CISC 7310X. C11: Mass Storage. Hui Chen Department of Computer & Information Science CUNY Brooklyn College. 4/19/2018 CUNY Brooklyn College

RAID (Redundant Array of Inexpensive Disks)

Chapter 6. Storage and Other I/O Topics

Appendix D: Storage Systems

COMP283-Lecture 3 Applied Database Management

Computer-System Organization (cont.)

Chapter 12: Mass-Storage Systems. Operating System Concepts 8 th Edition,

I/O CANNOT BE IGNORED

Tape pictures. CSE 30341: Operating Systems Principles

I/O CANNOT BE IGNORED

UC Santa Barbara. Operating Systems. Christopher Kruegel Department of Computer Science UC Santa Barbara

RAID. Redundant Array of Inexpensive Disks. Industry tends to use Independent Disks

Lecture 23: Storage Systems. Topics: disk access, bus design, evaluation metrics, RAID (Sections )

Computer Organization and Structure. Bing-Yu Chen National Taiwan University

MASS-STORAGE STRUCTURE

CS143: Disks and Files

Storage systems. Computer Systems Architecture CMSC 411 Unit 6 Storage Systems. (Hard) Disks. Disk and Tape Technologies. Disks (cont.

Database Systems. November 2, 2011 Lecture #7. topobo (mit)

Module 13: Secondary-Storage

Chapter 14: Mass-Storage Systems

Guide to SATA Hard Disks Installation and RAID Configuration

Computer Organization and Technology External Memory

The term "physical drive" refers to a single hard disk module. Figure 1. Physical Drive

Storage Systems. Storage Systems

Chapter 14: Mass-Storage Systems

Chapter 13: Mass-Storage Systems. Disk Scheduling. Disk Scheduling (Cont.) Disk Structure FCFS. Moving-Head Disk Mechanism

Chapter 13: Mass-Storage Systems. Disk Structure

Chapter 6 - External Memory

Configuring Storage Profiles

STORAGE MANAGEMENT USING OPENFILER II

COT 4600 Operating Systems Fall 2009

CS5460: Operating Systems Lecture 20: File System Reliability

CSE325 Principles of Operating Systems. Mass-Storage Systems. David P. Duggan. April 19, 2011

SPECIFICATION FOR NETWORK ATTACHED STORAGE (NAS) TO BE FILLED BY BIDDER. NAS Controller Should be rack mounted with a form factor of not more than 2U

Part IV I/O System. Chapter 12: Mass Storage Structure

Storage Area Network (SAN)

Modern RAID Technology. RAID Primer A Configuration Guide

Storage. CS 3410 Computer System Organization & Programming

Che-Wei Chang Department of Computer Science and Information Engineering, Chang Gung University

Silberschatz, et al. Topics based on Chapter 13

HPSS RAIT. A high performance, resilient, fault-tolerant tape data storage class. 1

COSC 6374 Parallel Computation. Parallel I/O (I) I/O basics. Concept of a clusters

Business Continuity and Disaster Recovery. Ed Crowley Ch 12

COSC 6385 Computer Architecture. Storage Systems

Lecture 23: I/O Redundant Arrays of Inexpensive Disks Professor Randy H. Katz Computer Science 252 Spring 1996

Lecture 21: Reliable, High Performance Storage. CSC 469H1F Fall 2006 Angela Demke Brown

Server Systems and Motherboards

Storage Profiles. Storage Profiles. Storage Profiles, page 12

CSE 451: Operating Systems Spring Module 18 Redundant Arrays of Inexpensive Disks (RAID)


FAULT TOLERANT SYSTEMS

NVIDIA RAID Installation Guide

RAID Tower XIII (RT134SDEU3)

CSCI-GA Database Systems Lecture 8: Physical Schema: Storage

Part IV I/O System Chapter 1 2: 12: Mass S torage Storage Structur Structur Fall 2010

Chapter 6. Storage and Other I/O Topics. ICE3003: Computer Architecture Fall 2012 Euiseong Seo

Guide to SATA Hard Disks Installation and RAID Configuration

Everything You Wanted To Know About Storage (But Were Too Proud To Ask) The Basics

HPC File Systems and Storage. Irena Johnson University of Notre Dame Center for Research Computing

Lecture 13. Storage, Network and Other Peripherals

Homework, etc. Computer Science Foundations

Administrivia. CMSC 411 Computer Systems Architecture Lecture 19 Storage Systems, cont. Disks (cont.) Disks - review

Mladen Stefanov F48235 R.A.I.D

Chapter 9: Peripheral Devices: Magnetic Disks

Lecture: Storage, GPUs. Topics: disks, RAID, reliability, GPUs (Appendix D, Ch 4)

Database Management Systems, 2nd edition, Raghu Ramakrishnan, Johannes Gehrke, McGraw-Hill

Computer System Architecture

PC-based data acquisition II

BBM371- Data Management. Lecture 2: Storage Devices

NVIDIA RAID Installation Guide

Virtual Memory. Reading. Sections 5.4, 5.5, 5.6, 5.8, 5.10 (2) Lecture notes from MKP and S. Yalamanchili

Guide to SATA Hard Disks Installation and RAID Configuration

Disk Scheduling. Based on the slides supporting the text

After the Attack. Business Continuity. Planning and Testing Steps. Disaster Recovery. Business Impact Analysis (BIA) Succession Planning

Readings. Storage Hierarchy III: I/O System. I/O (Disk) Performance. I/O Device Characteristics. often boring, but still quite important

Replication Solutions with Open-E Data Storage Server (DSS) April 2009

Chapter 10: Mass-Storage Systems

VMware vsphere Clusters in Security Zones

Transcription:

High Performance Computing Course Notes 2008-2009 2009 High Performance Storage

Storage devices Primary storage: register (1 CPU cycle, a few ns) Cache (10-200 cycles, 0.02-0.5us) Main memory Local main memory (0.2-4us) NUMA (2-10 x latency of local memory) Secondary storage: Magnetic disk (2-20ms) Solid state disk (0.05-0.5ms) Cache in storage controller (0.05-0.5ms) Tertiary storage Removable media: tapes, floppies, CDs (ms-minutes) Tape library (few seconds few minutes) 2

Hard disk vs. solid state drive a) 2.5-inch hard disk b) solid state drive 3

Tape library - Consisting of - Tape - Tape drive - tape slots - Barcode reader - robot 4

Disks 5

Disk failure and metrics Mean Time Between Failures (MTBF): the average time between failures of a disk MTBF= (downtime-uptime)/number-of-failures Annual failure rate (AFR): number of failures per year AFR=running-hours-per-year/MTBF AFR disks =N disks *AFR disk 6

Solutions for disk failures Full redundancy Replication (mirroring) Partial Redundancy Parity information 7

RAID RAID: Redundant Arrays of Independent Disks Goals: increased data reliability and increased I/O performance Main concepts in RAID Mirroring Parity stripping Advantages: High capability High performance: data stripe Graceful degrading One disk fails, only that disk needs to be replaced 8

RAID Disadvantage: failures AFR disks =N disks *AFR disk Solution Redundancy: 1) replication/mirroring: need more space 2) parity: recover from single disk failure; need more operations to maintain parity info and recover 9

Parity Parity calculation is performed using XOR. XOR operator is "true" if and only if one of its two operands is true Property of XOR: If D p =D 1 XOR D k XOR D n, then D k = D p XOR D 1 D k-1 XOR D k+1 XOR D n Therefore, if any data is lost, we can recover the data from parity and the remaining data Advantages: only one of the "N+1" drives contains redundancy information Disadvantages: parity information has to be computed every time the data is updated 10

Disk arrays taxonomy RAID levels 0: stripping without redundancy 1: full copy mirroring 3: separate disk for parity 4: similar as RAID 3 5: rotated distributed parity 6: double parity They are just classifications rather than a ordered list 11

RAID levels RAID0 Stripped without redundancy Data can be read off in parallel Any disk failure destroys the entire array RAID1 Mirrored Array continues to operate so long as at least one drive is functioning 12

RAID3 Striped set with dedicated parity single parity disk is a bottleneck for writing Byte-level striping (typically under 1k) RAID4 Identical to RAID 3 but does block-level striping instead of byte-level striping The block can be of any size 13

RAID5 Striped set with distributed parity the array is not destroyed by a single drive failure Upon drive failure, any subsequent reads can be calculated from the distributed parity The array will have data loss in the event of a second drive failure 14

RAID6 Striped set with dual parity. Provides fault tolerance from two drive failures 15

16

17

Network Attached Storage (NAS) Follows a client/server design A NAS head acts as the interface between the NAS and network clients The NAS appears on the network as a single "node" that is the IP address of the head device Clients access a NAS over an Ethernet connection The NAS devices require no monitor, keyboard or mouse and run an embedded os NAS uses file-based application protocols such as NFS (Network File System) 18

Storage Area Networks (SANs( SANs) An architecture to attach remote storage devices to computer servers in such a way that the storage devices appear as locally attached to the OS The data can be accessed in blocks Use FibreChannel protocol to access data 19

NAS vs. SAN 20