Disk for Real-time Data. Presented at the THIC Meeting at the Sony Auditorium, 3300 Zanker Rd, San Jose CA March 4-5, 2003

Similar documents
Storage Systems. Storage Systems

Storage. Hwansoo Han

I/O CANNOT BE IGNORED

SCRAMNet GT. A New Technology for Shared-Memor y Communication in High-Throughput Networks. Technology White Paper

Specifying Storage Servers for IP security applications

CPS104 Computer Organization and Programming Lecture 18: Input-Output. Outline of Today s Lecture. The Big Picture: Where are We Now?

Computer Organization and Structure. Bing-Yu Chen National Taiwan University

Ref: Chap 12. Secondary Storage and I/O Systems. Applied Operating System Concepts 12.1

Chapter 6. Storage and Other I/O Topics

SATA RAID For The Enterprise? Presented at the THIC Meeting at the Sony Auditorium, 3300 Zanker Rd, San Jose CA April 19-20,2005

Introduction I/O 1. I/O devices can be characterized by Behavior: input, output, storage Partner: human or machine Data rate: bytes/sec, transfers/sec

COS 318: Operating Systems. Storage Devices. Vivek Pai Computer Science Department Princeton University

Chapter 6. Storage and Other I/O Topics

DELL EMC DATA DOMAIN SISL SCALING ARCHITECTURE

COS 318: Operating Systems. Storage Devices. Kai Li Computer Science Department Princeton University

I/O CANNOT BE IGNORED

CISC 7310X. C11: Mass Storage. Hui Chen Department of Computer & Information Science CUNY Brooklyn College. 4/19/2018 CUNY Brooklyn College

Lecture 23: Storage Systems. Topics: disk access, bus design, evaluation metrics, RAID (Sections )

Chapter 10: Mass-Storage Systems

Chapter 10: Mass-Storage Systems. Operating System Concepts 9 th Edition

Storage Technologies - 3

Appendix D: Storage Systems

Today: Secondary Storage! Typical Disk Parameters!

Iomega REV Drive Data Transfer Performance

EMC CLARiiON Backup Storage Solutions

Application debugging USB Bus utilization graph

u Covered: l Management of CPU & concurrency l Management of main memory & virtual memory u Currently --- Management of I/O devices

Readings. Storage Hierarchy III: I/O System. I/O (Disk) Performance. I/O Device Characteristics. often boring, but still quite important

Lecture 23: I/O Redundant Arrays of Inexpensive Disks Professor Randy H. Katz Computer Science 252 Spring 1996

Lecture 29. Friday, March 23 CS 470 Operating Systems - Lecture 29 1

COS 318: Operating Systems. Storage Devices. Jaswinder Pal Singh Computer Science Department Princeton University

Mass-Storage Systems. Mass-Storage Systems. Disk Attachment. Disk Attachment

EMC CLARiiON CX3 Series FCP

PC-based data acquisition II

Milestone Solution Partner IT Infrastructure Components Certification Report

Computer Architecture Computer Science & Engineering. Chapter 6. Storage and Other I/O Topics BK TP.HCM

Advanced Database Systems

Chapter 12: Mass-Storage

Advances in Storage Connectivity ATTO Ultra320 SCSI The Next Generation A Technical Brief

Computer Science 146. Computer Architecture

Chapter 6. Storage and Other I/O Topics. ICE3003: Computer Architecture Fall 2012 Euiseong Seo

Reference Architecture

Input/Output. Today. Next. Principles of I/O hardware & software I/O software layers Disks. Protection & Security

Priority Traffic CSCD 433/533. Advanced Networks Spring Lecture 21 Congestion Control and Queuing Strategies

HIGH SPEED CONNECTIVITY BETWEEN AN ID-1 TAPE RECORDER AND HIGH PERFORMANCE COMPUTERS THIC MEETING, JANUARY 22-24, DATATAPE Incorporated

IBM Power 755 server. High performance compute node for scalable clusters using InfiniBand architecture interconnect products.

Matrox Imaging White Paper

CS370 Operating Systems

V. Mass Storage Systems

Device-Functionality Progression

Chapter 12: I/O Systems. I/O Hardware

IMPROVING THE PERFORMANCE, INTEGRITY, AND MANAGEABILITY OF PHYSICAL STORAGE IN DB2 DATABASES

Administrivia. CMSC 411 Computer Systems Architecture Lecture 19 Storage Systems, cont. Disks (cont.) Disks - review

PeerStorage Arrays Unequalled Storage Solutions

CS2410: Computer Architecture. Storage systems. Sangyeun Cho. Computer Science Department University of Pittsburgh

EMC Symmetrix DMX Series The High End Platform. Tom Gorodecki EMC

Topics in Computer System Performance and Reliability: Storage Systems!

HD Tune Pro manual version 4.50 HD Tune Pro manual

Kadence. Kadence Systems Co South Pointe Dr. #107 Laguna Hills, Ca Phone: (949) Fax: (949)

Mass-Storage Structure

Milestone Solution Partner IT Infrastructure Components Certification Report

Performance of relational database management

VX3000-E Unified Network Storage

IBM _` p5 570 servers

Some material adapted from Mohamed Younis, UMBC CMSC 611 Spr 2003 course slides Some material adapted from Hennessy & Patterson / 2003 Elsevier

Department of Computer Engineering University of California at Santa Cruz. File Systems. Hai Tao

PESIT Bangalore South Campus

Quiz for Chapter 6 Storage and Other I/O Topics 3.10

Definition of RAID Levels

IBM TotalStorage Enterprise Storage Server Model 800

Module 4 STORAGE NETWORK BACKUP & RECOVERY

CS3600 SYSTEMS AND NETWORKS

How to Choose the Right Bus for Your Measurement System

Models HP 900GB SAS 10K SFF HDD HP 1.2TB SAS 10K SFF HDD

I/O Buffering and Streaming

Name: uteid: 1. CS439: Fall 2011 Midterm 2

Module 12: I/O Systems

NAND Interleaving & Performance

Cache Memory COE 403. Computer Architecture Prof. Muhamed Mudawar. Computer Engineering Department King Fahd University of Petroleum and Minerals

Chapter 11. I/O Management and Disk Scheduling

Chapter 13: Mass-Storage Systems. Disk Scheduling. Disk Scheduling (Cont.) Disk Structure FCFS. Moving-Head Disk Mechanism

Chapter 13: Mass-Storage Systems. Disk Structure

CS370: Operating Systems [Fall 2018] Dept. Of Computer Science, Colorado State University

Computer Architecture A Quantitative Approach, Fifth Edition. Chapter 2. Memory Hierarchy Design. Copyright 2012, Elsevier Inc. All rights reserved.

Performance Evaluation of iscsi based storage network with cluster mirroring. Rahul Jangid 1, Mr. Rajendra Purohit 2

Operating Systems 2010/2011

Securing Data-at-Rest

Computer Architecture 计算机体系结构. Lecture 6. Data Storage and I/O 第六讲 数据存储和输入输出. Chao Li, PhD. 李超博士

CS370: Operating Systems [Fall 2018] Dept. Of Computer Science, Colorado State University

Unit 3 and Unit 4: Chapter 4 INPUT/OUTPUT ORGANIZATION

Storage Update and Storage Best Practices for Microsoft Server Applications. Dennis Martin President, Demartek January 2009 Copyright 2009 Demartek

Virtual Memory. Reading. Sections 5.4, 5.5, 5.6, 5.8, 5.10 (2) Lecture notes from MKP and S. Yalamanchili

VX1800 Series Unified Network Storage

SYSTEM UPGRADE, INC Making Good Computers Better. System Upgrade Teaches RAID

Chapter 6. Storage and Other I/O Topics

Models Smart Array 6402/128 Controller B21 Smart Array 6404/256 Controller B21

IBM System Storage DS5020 Express

Condusiv s V-locity VM Accelerates Exchange 2010 over 60% on Virtual Machines without Additional Hardware

Thomas Polzer Institut für Technische Informatik


Transcription:

Disk for Real-time Data Phil Brunelle Conduant Corporation 1501 S. Sunset St., Suite C, Longmont, CO 80501 Phone:303-485-2721 FAX: 303-485-5104 E-mail: phil@conduant.com Presented at the THIC Meeting at the Sony Auditorium, 3300 Zanker Rd, San Jose CA 95134-1940 March 4-5, 2003

Disk for Real-time Data

There s a Universe of Data out there Just Waiting

Topics Whats and whys of Real-time Data? Why disk? Understanding the Architecture Specsmanship Performance graphs Summary

Whats and Whys of Real-time Data?

What is Real-time Data? Is a data stream produced by or representing a physical event with specific temporal characteristics While recording, a stream of data that must be recorded at a continuous rate to accurately capture the temporal nature of the data While playing back, a stream of data that must be reproduced at a specific continuous rate to accurately represent the temporal nature of the data

Why record and play back real-time data? Why record? Why play back? To capture details of physical events for analysis with data processing techniques or through human examination So that we can play it back in real-time To reproduce a physical event in the same manner that the event originally occurred To produce a physical event with a computer-generated representation of realtime data (i.e. simulation)

What is the difference between data processing and real-time storage? Data Processing Storage: Maximizes overall throughput of many small, unrelated transactions, rather than being most efficient on any one Generally not concerned with variable access times for any individual transaction Logically sequential data physically scatters through the course of normal operations Requires exactness of data but NOT temporal repeatability Real-time Storage: Sustains continuous throughput of a single transaction Uses various techniques to compensate for the unpredictable mechanical delays inherent in magnetic storage devices Data records and remains as a continuous linear record Generally requires both exactness of data and temporal repeatability

What is the Real-time storage problem? Source Real-time rate = 60MB/s Data Path Burst, Average, and Instantaneous Rates Real-time rate = 60MB/s FC Burst rate = 100MB/s Average rate = 60MB/s Instantaneous rate = 30 to 90 MB/s Data is lost during periods when instantaneous rate falls below 60MB/s even though average and burst rates are higher than 60MB/s

What are some real-time applications? Radar Imaging Stress Measurements Radio Astronomy Torpedo runs Entertainment Surveillance Virtual Reality Law Enforcement Digital Radio Missile tracking GPS Acoustics Code/logic/network trace Instrumentation Semiconductor Test Telemetry

Why Disk?

Why Disk? Value Performance RAS (Reliability/Availability/Serviceability) Many packaging options Integration Environmental Archival Futures

Why Disk? Value Low cost per gigabyte ($1.40/GB for IDE on 1/30/03) Low cost per megabyte/second Negligible cost to maintain Low cost to upgrade Minimal cost for on-site sparing Reduced power, cooling, footprint/tb Economical data mirror for redundancy

Why Disk? Performance: Scalable, multiple drives can be ganged together sustain very high data rates (gigabits or 10s of gigabits/sec and beyond) Fast random access during playback (10s of milliseconds to totally different locations) Rapid offline data duplication, local or remote High-speed real-time remote data mirror Simultaneous continuous recording and random data review

Why Disk? Reliability/Availability/Serviceability Low MTBF (< 1 in 10 14 bits read) per drive Ramp loaded heads Good shock/vibration using 3-1/2 drives, better with 2-1/2 drives, and best with Flash disks or specifically designed packaging Economical on-site sparing minimizes MTTR (mean time to repair) Replacement drives available at store down the street or overnight from mail order Specifications based on HGST (formerly IBM) DeskStar 120.

Why Disk? Many packaging options:

Why Disk? Integration Easily integrated with most common computer systems Remote control and data access over Ethernet Seamless infinite recording (TK200)

Why Disk? Environmental/Facilities Low power and reduced dissipation (~10W/drive operational, less with 2-1/2 drives) Reduced footprint (4.05GB/in 3 with 250GB drives, 2TB per TK200 removable drawer) Reduced noise (~ 3 bels/drive) Specifications based on Hitachi (formerly IBM) DeskStar 120.

Why Disk? Archival Low cost per uncompressed gigabyte ($1.40/GB for IDE on 1/30/03) High Density (4.05GB/in 3 with 250GB drives, 2TB per removable unit) Conduant TK200

Why Disk? Futures: It just keeps getting better, better, and better!

Understanding the System Architecture

Not all recording systems are alike Total system performance is dependent on: Data organization on disk Individual disk performance Dedicated Disk/Data engine Data path Operating System/File System Data buffer DSA (Dynamic Storage Allocation)

Understanding the Architecture Data Organization - Sequential is the natural organization for both disk drives and real-time data - Sequential yields high performance, predictability, and repeatability - Fragmented or scattered is BAD! - OS/File System organized is fragmented!

Understanding the Architecture Disk Performance: Bit rate between media and head (Mbits/sec) Head switch and incremental seek delays Overhead fields (Inter-sector gaps, ECC, PLL sync, etc.) Head position from OD (Outer Diameter) Maximum interface transfer rate

Understanding the Architecture Dedicated Disk/Data engine is GOOD! Can ensure physical sequentiality Predictable, repeatable, performance Responds to asynchronous events at hardware speeds Pushes drives to their inherent capabilities Higher level of resource utilization

Understanding the Architecture Customer data rate defines the minimum instantaneous data rate Best data path Point-to-point, single pass Dedicated (unshared) Simple protocol

Understanding the Architecture Operating System/File System not designed for real-time data Responds to asynchronous events at software speeds Unpredictable shared resource Layers of abstraction produce logically sequential but physically scattered data Drives operate at a fraction of their inherent capabilities A compromise to meet the needs of all kinds of users and services Storage destination committed when file is opened for writing

Understanding the Architecture Data buffer More is better Think number of seconds at a given data rate, not number of megabytes Larger buffer allows more time for disk drive error recovery

Understanding the Architecture DSA (Dynamic Storage Allocation) Priority on getting data to magnetic media above all Compensates for underperforming drive with underutilized drive Picks up where data buffer leaves off

Simple solution: The first solution people think of is riddled with problems: Same data moves twice on same PCI bus Same data is addressed twice in system memory Real-time data competes with processor instruction fetches Data moves from source to disk under control of application software Storage is under control of OS Data organized per OS file system is not physically sequentially Source card must include data buffer sufficient to account for when PCI bus is busy with other traffic

Card-to-card and dedicated real-time engine. A significant improvement: Data moves only once on PCI bus, thus doubling possible bandwidth DMA engine on source card pushes data directly to storage controller with no software intervention Dedicated hardware engine for disk storage independently controls with primitive commands to ensure physically sequential recording Source card must include data buffer sufficient to compensate for when PCI bus is busy with other traffic

Add a direct customer data path into the storage controller for the best solution. Similar to card-to-card solution except that a dedicated data interface: Reduces or eliminates the need for buffer FIFOs on the source card Totally eliminates the possibility of computer interference in data path Simplifies the interface protocol due to lack of required arbitration

Specsmanship

Specsmanship Watch out for Magic Numbers: usually theoretically maximum burst rates that are not sustainable 40,80,160, and 320 MB/s for SCSI 100 and 200 MB/s for FC Highly impressive for sales purposes, worthless for the realtime recorder application

Specsmanship Benchmark software: Fine for relative comparisons of different storage systems under identical conditions Optimized to test raw disk performance does NOT reflect performance across the complete data path Does not reflect performance of application software Full data path performance is generally much less than disk benchmark under OS

Specsmanship Ask the supplier: How did he arrive at the minimum sustained data rates from data source to disk? If he is being optimistic or conservative? Why? If the performance is contingent on certain conditions? To explain his architecture from data source to disk and back. Do you see any problems? If his company or a strategic partner owns the IP? If not, he may not really know the details and he may not be able to directly address problems should they arise.

Performance Graphs

Summary

Summary Think Minimum Instantaneous Data Rate, NOT burst data rate and NOT average sustained data rate Real-time data can require the exactness of data processing data but definitely includes the temporal component Current IDE disk technology, when coupled with controller designed for real-time storage, fills most of the required and desired aspects for real-time recording and playback Understand the specifications, how they were arrived at, and what the seller will guarantee.

There s a Universe of Data out there Just Waiting Thank You!