Overview of the Storage Systems Research Center Darrell Long & Ethan Miller Jack Baskin School of Engineering
|
|
- Poppy Nelson
- 5 years ago
- Views:
Transcription
1 Overview of the Storage Systems Research Center Darrell Long & Ethan Miller Jack Baskin School of Engineering
2 The SSRC in one slide Research Challenges Exascale capacity & scalability erformance Security Reliability & self-healing New storage technologies Improved metadata & indexing SSRC features 2+2 core faculty 20 graduate students High degree of collaboration Close cooperation with sponsors High visibility Research Thrusts Archival storage New metadata & indexing approaches New storage technologies Deduplication Reliable & secure storage Large-scale object-based storage SSRC sponsors Dept. of Energy, NSF, LLNL, LANL Data Domain, Hitachi, IBM, LSI, NetApp, Seagate, Symantec Over $1 million per year 2
3 Recent SSRC research Archival storage Secure long-term storage: OTSHARDS (USENIX 2007) Brick-based archival storage: ergamum (FAST 2008) Managing exascale archives (DSW 2008) Metadata and file system indexing Understanding current workloads (USENIX 2008) Scalable metadata indexes (FAST 2009) Deduplication Scalable, partitioned indexes (KDD 2007) Secure deduplication (StorageSS 2008) Secure petabyte-scale storage (SC 07) Reliability Large-stripe redundancy for disks (StorageSS 2007) Finding good erasure codes (DSN 2008) 3
4 External collaborations etascale Data Storage Institute (DSI) Three universities (UCSC, CMU, Michigan) Five national laboratories (LANL, NERSC, ORNL, NL, Sandia) NetApp Non-volatile memory for large-scale file servers Archival storage Scalable indexing Symantec Deduplication for virtual machines IBM Storage class memories Hewlett ackard Deduplication Reliability in file and storage systems 4
5 A few of our projects... Archival Storage Evolvable and Secure archives, Long-term management Deduplication VM deduplication, Secure deduplication, Scalable indexing, On-line deduplication etascale Storage Security and Reliability Secure petascale storage, Quotas, Store, Forget & Check Reliability Disk reliability, Disaster codes, Restructuring RAID Indexing Highly scalable metadata indexing Content-based access 5
6 ergamum: Evolvable Archival Storage Archival storage requires archival solutions Legislated requirements for professional data ersonal histories recorded digitally (photos, documents,...) Traditional systems too expensive to buy and operate Archives must scale over many dimensions (time, technology, capacity vendors,...) Approach: distributed network of intelligent, disk-based devices Function independently, cooperate in inter-device redundancy Static cost control: standardized interfaces, commodity hardware Operational cost control Low power hardware Keep disks spun-down 6
7 ower-efficient Data rotection Goal: provide data protection without wasting energy Approach: Scale the response to the problem Redundancy Group R R R R R R R R R R R R R R R R R R Redundancy Group R R R R R R R R R R R R R R R R R R Redundancy Group Tome 0 Tome 1 Tome 2 Intra-disk protection from latent sector errors Inter-disk protection from device loss Trees of signatures for algebraic consistency checking ower aware encoding and rebuild strategies R R R R R R Tome 3 7
8 OTSHARDS: Secure Long-Term Storage roblem: over potentially indefinite data lifetimes......computationally-bound mechanisms may eventually fail...change and loss are constants (keys, cryptosystems,...) Trust the consensus of groups, not individuals Store data on multiple, independently managed archives Full collaboration should allow data recovery Use alternatives to computation-bound security Unconditionally secure mechanisms (e.g. secret splitting) Make targeted attacks difficult Make aberrant behavior noticeable 8
9 Key Concept: Secret Splitting User Object Fragment Fragment Fragment Shard Shard Shard Shard X Shard Y arity Shard Archive 0 Archive 1 Archive 2 Archive 3 Redundancy Group 9
10 Data Deduplication Data deduplication is typically achieved by chunking files Different ways to chunk data Variable and fixed size chunking Analyze the chunks to see if there are any interesting properties Frequency of chunks per chunking scheme Chunk locality Chunks occurring in sequence each time the first occurs What about three or four chunks? 10
11 Managing Deduplication in Large Scale Archival Systems Index lookup: an integral part of the deduplication process in archival systems A lookup index can consist of chunk signatures, shingles In large scale archival systems index lookup becomes a performance bottleneck Index size increases Query, update performance suffers Network bandwidth under-utilized Turnaround time not satisfactory Approaches used today: Out-of-band deduplication In-band deduplication Goal: partition the index while preserving efficient lookups 11
12 artitioning the Chunk Index Goal: artition chunk index I into multiple partitions I0,, IK-1 When adding a new file fn to the archive: Extract chunk signatures Choose m partitions using the document routing algorithm (m: routing factor) Route all the chunks to m partitions, m < K Add the chunk signatures to each of the m partitions When identifying redundancies within a file fq Extract chunk signatures Choose m partitions using the document routing algorithm Route all the features to m partitions, m < K Query each of the m partitions to identify redundant chunks Goal: Minimal loss of compression while m being much smaller than K 12
13 VM Deduplication Huge opportunity for saving space in virtualized environments. Efficient with homogeneous VMs. More images, more savings. Half space saving even with heterogeneous VMs. Collection of Unbuntu VMs 13
14 Secure deduplication Naive deduplication of encrypted data cannot work. Solution: deduplicate the data first! But then how to you encrypt? Encrypt so that if you know the data that was deduplicated, you can encrypt, otherwise impossible to guess. Solution: Convergent encryption of chunks (with appropriate data structures). Hash Key File Chunking Chunk Data Encrypt Chunk Map Chunk Chunk Chunk Hash ID ID ID 14
15 Secure file systems Scalable security for Ceph At most one ticket per file, regardless of how many clients and OSDs ublic-key operations amortized across many uses Quotas in Ceph Ensure usage within limits for near-zero cost and 100% catching of cheaters Remote verification of stored data Verify consistency of stored erasure-coded data Secure against collusion by all remote servers 15
16 Scalable Security in Ceph Extended capabilities Can authorize I/O for any number of clients to any number of whole files Reduce number of capabilities Automatic revocation Capability expiration capability revocation Revocation without any contact Secure delegation Delegate access rights to other clients Shift security to key possession 16
17 Quotas in Ceph Separate allocation and quota management using a digital cash-based system model. Quota management server acts as a bank. Clients withdraw vouchers from the quota server for a user and store for later use. Clients spend vouchers for users in order to purchase storage from storage servers. Storage servers periodically update the quota server about user storage. Cheaters are caught by the bank at defined intervals. Vouchers are cryptographically-protected byte sequences {epoch, expiry, user, amount, serial} auth
18 Store, Forget and Check Systems store data on remote nodes Remote nodes may not be trustworthy Data owner must check to ensure that data is really stored Two current approaches: Read data from multiple sites and check for consistency Generate checksum remotely and compare to checksum of local data We developed an efficient algorithm that does not require keeping a local copy of the data Storage utility provides remotely managed storage Client sends data to the SS then client retrieves data as needed Trust issue: how can client tell if SS is doing its job? Read data, check (public key-based) signature Read data, decrypt, check secure hash and object ID 18
19 Reliable Storage (Advanced use of erasure codes) Cannot treat erasure codes as a black-box! Must do more than choose m and n Code choice affects performance, reliability and power Goal: smart application of codes in emerging storage systems NVRAM-based storage systems Arrays of heterogeneous devices ower-managed archives Secure storage systems 19
20 Surviving disasters in storage Storage systems need to survive failures CUs / networks / power supplies fail: no data loss (usually) Drives fail: data loss! Typical solution: use RAID RAIDs don t provide enough reliability for extreme cases (multiple failures) RAIDs that can survive multiple failures are often slow Our solution: disaster recovery codes Handle common case (single failure) quickly Handle uncommon case (multiple failure) correctly but somewhat more slowly 20
21 How effective is this? Just a single NVRAM parity disk reduces the chance of data loss dramatically One disk of NVRAM in a system with hundreds of disks isn t that expensive... 1 Data Loss robabilities for Mirroring and Single NVRAM arity robability of Data Loss Num OSD Failures No_arity Single_arity 21
22 Using non-volatile memory technologies File systems for byte-addressable NVRAM Highly compressible metadata: 3 regular file systems High performance: compressed metadata is faster Rich linking structures Reliable file systems in NVRAM Algebraic signatures ensure B blocks are correct arity (Reed-Solomon) provides redundancy for sets of blocks Necessary given relatively low reliability of NVRAM rotects against software errors as well (in conjunction with good file system design) View-based file systems 22
23 Reliable NVRAM arrays Flash memory is unreliable Relatively high error rates More reads higher error rates More writes higher error rates How can reliable systems rely on flash? Use multi-level redundancy for arrays of flash memory Choose codes at each level knowing that higher levels are present to correct errors Low levels detect errors (and correct a few) Higher levels correct erasures Tradeoffs in performance and reliability 23
24 Distributed Indexing Storage systems are rapidly increasing in size etabytes of data, billions of files, thousands of users Too large to manage with existing tools! Need a scalable way to find and access files Users spend too much time organizing their data Administrators need scalable tools to manage systems Search is an emerging solution Common on desktops (Spotlight, WinFS) However, these solutions cannot scale to large systems Need scalable search for large storage systems 24
25 Spyglass: Scalable Distributed Search Designed to allow fast search and indexing across a data center and its entire history! Optimized for non-selective, hierarchical data sets Storage nodes partition namespace into individual indexes reserves locality of data properties, exploits hierarchical namespace Index is a KD-Tree, a k- dimensional search tree Fast K dimensional search for non-selective data 25
26 Conclusions The SSRC is a very active research group Lots of interesting projects Many collaborators from academia, industry, and the national laboratories Several students graduating each year We welcome your involvement! Graduate Student internships (get them early) Recruiting researchers (employees) Sponsoring research Visiting professors Visitors from your laboratory 26
Cloud-related Storage Research in Santa Cruz
Cloud-related Storage Research in Santa Cruz Darrell Long University of California, Santa Cruz Trading Storage for Computation (and vice versa) 2 Trade Storage for Computation Inputs rocess Result Storing
More informationPergamum Replacing Tape with Energy Efficient, Reliable, Disk- Based Archival Storage
ergamum Replacing Tape with Energy Efficient, Reliable, Disk- Based Archival Storage Mark W. Storer Kevin M. Greenan Ethan L. Miller Kaladhar Voruganti* University of California, Santa Cruz *Network Appliance
More informationWhere d My Photos Go? Challenges in Preserving Digital Data for the Long Term
Where d My hotos Go? Challenges in reserving Digital Data for the Long Term Ethan L. Miller Storage Systems Research Center Center for Research in Intelligent Storage University of California, Santa Cruz
More informationStore, Forget & Check: Using Algebraic Signatures to Check Remotely Administered Storage
Store, Forget & Check: Using Algebraic Signatures to Check Remotely Administered Storage Ethan L. Miller & Thomas J. E. Schwarz Storage Systems Research Center University of California, Santa Cruz What
More informationSoftware-defined Storage: Fast, Safe and Efficient
Software-defined Storage: Fast, Safe and Efficient TRY NOW Thanks to Blockchain and Intel Intelligent Storage Acceleration Library Every piece of data is required to be stored somewhere. We all know about
More informationHPSS Treefrog Summary MARCH 1, 2018
HPSS Treefrog Summary MARCH 1, 2018 Disclaimer Forward looking information including schedules and future software reflect current planning that may change and should not be taken as commitments by IBM
More informationDynamic Metadata Management for Petabyte-scale File Systems
Dynamic Metadata Management for Petabyte-scale File Systems Sage Weil Kristal T. Pollack, Scott A. Brandt, Ethan L. Miller UC Santa Cruz November 1, 2006 Presented by Jae Geuk, Kim System Overview Petabytes
More informationReliable and Efficient Metadata Storage and Indexing Using NVRAM
Reliable and Efficient Metadata Storage and Indexing Using NVRAM Ethan L. Miller Kevin Greenan Andrew Leung Darrell Long Avani Wildani (and others) Storage Systems Research Center University of California,
More informationDeduplication Storage System
Deduplication Storage System Kai Li Charles Fitzmorris Professor, Princeton University & Chief Scientist and Co-Founder, Data Domain, Inc. 03/11/09 The World Is Becoming Data-Centric CERN Tier 0 Business
More informationPANASAS TIERED PARITY ARCHITECTURE
PANASAS TIERED PARITY ARCHITECTURE Larry Jones, Matt Reid, Marc Unangst, Garth Gibson, and Brent Welch White Paper May 2010 Abstract Disk drives are approximately 250 times denser today than a decade ago.
More informationFile systems CS 241. May 2, University of Illinois
File systems CS 241 May 2, 2014 University of Illinois 1 Announcements Finals approaching, know your times and conflicts Ours: Friday May 16, 8-11 am Inform us by Wed May 7 if you have to take a conflict
More informationCS-580K/480K Advanced Topics in Cloud Computing. Object Storage
CS-580K/480K Advanced Topics in Cloud Computing Object Storage 1 When we use object storage When we check Facebook, twitter Gmail Docs on DropBox Check share point Take pictures with Instagram 2 Object
More informationENCRYPTED DATA MANAGEMENT WITH DEDUPLICATION IN CLOUD COMPUTING
ENCRYPTED DATA MANAGEMENT WITH DEDUPLICATION IN CLOUD COMPUTING S KEERTHI 1*, MADHAVA REDDY A 2* 1. II.M.Tech, Dept of CSE, AM Reddy Memorial College of Engineering & Technology, Petlurivaripalem. 2. Assoc.
More informationSpyglass: Fast, Scalable Metadata Search for Large-Scale Storage Systems
Spyglass: Fast, Scalable Metadata Search for Large-Scale Storage Systems Andrew W Leung Ethan L Miller University of California, Santa Cruz Minglong Shao Timothy Bisson Shankar Pasupathy NetApp 7th USENIX
More informationOasis: An Active Storage Framework for Object Storage Platform
Oasis: An Active Storage Framework for Object Storage Platform Yulai Xie 1, Dan Feng 1, Darrell D. E. Long 2, Yan Li 2 1 School of Computer, Huazhong University of Science and Technology Wuhan National
More information朱义普. Resolving High Performance Computing and Big Data Application Bottlenecks with Application-Defined Flash Acceleration. Director, North Asia, HPC
October 28, 2013 Resolving High Performance Computing and Big Data Application Bottlenecks with Application-Defined Flash Acceleration 朱义普 Director, North Asia, HPC DDN Storage Vendor for HPC & Big Data
More informationCOS 318: Operating Systems. NSF, Snapshot, Dedup and Review
COS 318: Operating Systems NSF, Snapshot, Dedup and Review Topics! NFS! Case Study: NetApp File System! Deduplication storage system! Course review 2 Network File System! Sun introduced NFS v2 in early
More informationCeph: A Scalable, High-Performance Distributed File System PRESENTED BY, NITHIN NAGARAJ KASHYAP
Ceph: A Scalable, High-Performance Distributed File System PRESENTED BY, NITHIN NAGARAJ KASHYAP Outline Introduction. System Overview. Distributed Object Storage. Problem Statements. What is Ceph? Unified
More informationDisclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme
STO1926BU A Day in the Life of a VSAN I/O Diving in to the I/O Flow of vsan John Nicholson (@lost_signal) Pete Koehler (@vmpete) VMworld 2017 Content: Not for publication #VMworld #STO1926BU Disclaimer
More informationHP AutoRAID (Lecture 5, cs262a)
HP AutoRAID (Lecture 5, cs262a) Ion Stoica, UC Berkeley September 13, 2016 (based on presentation from John Kubiatowicz, UC Berkeley) Array Reliability Reliability of N disks = Reliability of 1 Disk N
More informationECS High Availability Design
ECS High Availability Design March 2018 A Dell EMC white paper Revisions Date Mar 2018 Aug 2017 July 2017 Description Version 1.2 - Updated to include ECS version 3.2 content Version 1.1 - Updated to include
More informationThe What, Why and How of the Pure Storage Enterprise Flash Array. Ethan L. Miller (and a cast of dozens at Pure Storage)
The What, Why and How of the Pure Storage Enterprise Flash Array Ethan L. Miller (and a cast of dozens at Pure Storage) Enterprise storage: $30B market built on disk Key players: EMC, NetApp, HP, etc.
More informationCSE380 - Operating Systems
CSE380 - Operating Systems Notes for Lecture 17-11/10/05 Matt Blaze, Micah Sherr (some examples by Insup Lee) Implementing File Systems We ve looked at the user view of file systems names, directory structure,
More informationHP AutoRAID (Lecture 5, cs262a)
HP AutoRAID (Lecture 5, cs262a) Ali Ghodsi and Ion Stoica, UC Berkeley January 31, 2018 (based on slide from John Kubiatowicz, UC Berkeley) Array Reliability Reliability of N disks = Reliability of 1 Disk
More information[537] RAID. Tyler Harter
[537] RAID Tyler Harter Review Disks/Devices Device Protocol Variants Status checks: polling vs. interrupts Data: PIO vs. DMA Control: special instructions vs. memory-mapped I/O Disks Doing an I/O requires:
More informationHYDRAstor: a Scalable Secondary Storage
HYDRAstor: a Scalable Secondary Storage 7th TF-Storage Meeting September 9 th 00 Łukasz Heldt Largest Japanese IT company $4 Billion in annual revenue 4,000 staff www.nec.com Polish R&D company 50 engineers
More informationHow to Reduce Data Capacity in Objectbased Storage: Dedup and More
How to Reduce Data Capacity in Objectbased Storage: Dedup and More Dong In Shin G-Cube, Inc. http://g-cube.kr Unstructured Data Explosion A big paradigm shift how to generate and consume data Transactional
More informationFinal Examination CS 111, Fall 2016 UCLA. Name:
Final Examination CS 111, Fall 2016 UCLA Name: This is an open book, open note test. You may use electronic devices to take the test, but may not access the network during the test. You have three hours
More informationSolidFire and Ceph Architectural Comparison
The All-Flash Array Built for the Next Generation Data Center SolidFire and Ceph Architectural Comparison July 2014 Overview When comparing the architecture for Ceph and SolidFire, it is clear that both
More informationCopyright 2010 EMC Corporation. Do not Copy - All Rights Reserved.
1 Using patented high-speed inline deduplication technology, Data Domain systems identify redundant data as they are being stored, creating a storage foot print that is 10X 30X smaller on average than
More informationHPSS RAIT. A high performance, resilient, fault-tolerant tape data storage class. 1
HPSS RAIT A high performance, resilient, fault-tolerant tape data storage class http://www.hpss-collaboration.org 1 Why RAIT? HPSS supports striped tape without RAIT o Conceptually similar to RAID 0 o
More informationWhite Paper. Nexenta Replicast
White Paper Nexenta Replicast By Caitlin Bestler, September 2013 Table of Contents Overview... 3 Nexenta Replicast Description... 3 Send Once, Receive Many... 4 Distributed Storage Basics... 7 Nexenta
More informationHyper-converged Secondary Storage for Backup with Deduplication Q & A. The impact of data deduplication on the backup process
Hyper-converged Secondary Storage for Backup with Deduplication Q & A The impact of data deduplication on the backup process Table of Contents Introduction... 3 What is data deduplication?... 3 Is all
More informationA product by CloudFounders. Wim Provoost Open vstorage
A product by CloudFounders Wim Provoost (@wimpers_be) Open vstorage (@openvstorage) http://www.openvstorage.com CloudFounders vrun Converged infrastructure that combines the benefits of the hyperconverged
More informationDELL EMC DATA DOMAIN SISL SCALING ARCHITECTURE
WHITEPAPER DELL EMC DATA DOMAIN SISL SCALING ARCHITECTURE A Detailed Review ABSTRACT While tape has been the dominant storage medium for data protection for decades because of its low cost, it is steadily
More informationActiveScale Erasure Coding and Self Protecting Technologies
WHITE PAPER AUGUST 2018 ActiveScale Erasure Coding and Self Protecting Technologies BitSpread Erasure Coding and BitDynamics Data Integrity and Repair Technologies within The ActiveScale Object Storage
More informationThe Effectiveness of Deduplication on Virtual Machine Disk Images
The Effectiveness of Deduplication on Virtual Machine Disk Images Keren Jin & Ethan L. Miller Storage Systems Research Center University of California, Santa Cruz Motivation Virtualization is widely deployed
More informationDeduplication File System & Course Review
Deduplication File System & Course Review Kai Li 12/13/13 Topics u Deduplication File System u Review 12/13/13 2 Storage Tiers of A Tradi/onal Data Center $$$$ Mirrored storage $$$ Dedicated Fibre Clients
More informationCSE380 - Operating Systems. Communicating with Devices
CSE380 - Operating Systems Notes for Lecture 15-11/4/04 Matt Blaze (some examples by Insup Lee) Communicating with Devices Modern architectures support convenient communication with devices memory mapped
More informationToday s Papers. Array Reliability. RAID Basics (Two optional papers) EECS 262a Advanced Topics in Computer Systems Lecture 3
EECS 262a Advanced Topics in Computer Systems Lecture 3 Filesystems (Con t) September 10 th, 2012 John Kubiatowicz and Anthony D. Joseph Electrical Engineering and Computer Sciences University of California,
More informationActiveScale Erasure Coding and Self Protecting Technologies
NOVEMBER 2017 ActiveScale Erasure Coding and Self Protecting Technologies BitSpread Erasure Coding and BitDynamics Data Integrity and Repair Technologies within The ActiveScale Object Storage System Software
More informationRemote Data Checking: Auditing the Preservation Status of Massive Data Sets on Untrusted Store
Remote Data Checking: Auditing the Preservation Status of Massive Data Sets on Untrusted Store Randal Burns randal@cs.jhu.edu www.cs.jhu.edu/~randal/ Department of Computer Science, Johns Hopkins Univers
More informationChapter 11: File System Implementation. Objectives
Chapter 11: File System Implementation Objectives To describe the details of implementing local file systems and directory structures To describe the implementation of remote file systems To discuss block
More informationGuide. A small business guide to data storage and backup
Guide A small business guide to data storage and backup 0345 600 3936 www.sfbcornwall.co.uk Contents Introduction... 3 Why is data storage and backup important?... 4 Benefits of cloud storage technology...
More informationScaling Without Sharding. Baron Schwartz Percona Inc Surge 2010
Scaling Without Sharding Baron Schwartz Percona Inc Surge 2010 Web Scale!!!! http://www.xtranormal.com/watch/6995033/ A Sharding Thought Experiment 64 shards per proxy [1] 1 TB of data storage per node
More informationDeduplication and Incremental Accelleration in Bacula with NetApp Technologies. Peter Buschman EMEA PS Consultant September 25th, 2012
Deduplication and Incremental Accelleration in Bacula with NetApp Technologies Peter Buschman EMEA PS Consultant September 25th, 2012 1 NetApp and Bacula Systems Bacula Systems became a NetApp Developer
More informationHYDRAstor: a Scalable Secondary Storage
HYDRAstor: a Scalable Secondary Storage 7th USENIX Conference on File and Storage Technologies (FAST '09) February 26 th 2009 C. Dubnicki, L. Gryz, L. Heldt, M. Kaczmarczyk, W. Kilian, P. Strzelczak, J.
More informationChunkStash: Speeding Up Storage Deduplication using Flash Memory
ChunkStash: Speeding Up Storage Deduplication using Flash Memory Biplob Debnath +, Sudipta Sengupta *, Jin Li * * Microsoft Research, Redmond (USA) + Univ. of Minnesota, Twin Cities (USA) Deduplication
More informationV. Mass Storage Systems
TDIU25: Operating Systems V. Mass Storage Systems SGG9: chapter 12 o Mass storage: Hard disks, structure, scheduling, RAID Copyright Notice: The lecture notes are mainly based on modifications of the slides
More informationRAIDIX Data Storage Solution. Clustered Data Storage Based on the RAIDIX Software and GPFS File System
RAIDIX Data Storage Solution Clustered Data Storage Based on the RAIDIX Software and GPFS File System 2017 Contents Synopsis... 2 Introduction... 3 Challenges and the Solution... 4 Solution Architecture...
More informationWhat's new in Jewel for RADOS? SAMUEL JUST 2015 VAULT
What's new in Jewel for RADOS? SAMUEL JUST 2015 VAULT QUICK PRIMER ON CEPH AND RADOS CEPH MOTIVATING PRINCIPLES All components must scale horizontally There can be no single point of failure The solution
More informationApril 2010 Rosen Shingle Creek Resort Orlando, Florida
Data Reduction and File Systems Jeffrey Tofano Chief Technical Officer, Quantum Corporation Today s Agenda File Systems and Data Reduction Overview File System and Data Reduction Integration Issues Reviewing
More informationTake Back Lost Revenue by Activating Virtuozzo Storage Today
Take Back Lost Revenue by Activating Virtuozzo Storage Today JUNE, 2017 2017 Virtuozzo. All rights reserved. 1 Introduction New software-defined storage (SDS) solutions are enabling hosting companies to
More informationBlock or Object Storage?
Block or Object? MARKET REPORT SPONSORED BY image vlastas, 123RF.com Block or Object? MARKET REPORT Block or Object? If you re buying a storage solution, whether in the form of a software-defined storage
More informationNetApp Clustered Data ONTAP 8.2 Storage QoS Date: June 2013 Author: Tony Palmer, Senior Lab Analyst
ESG Lab Spotlight NetApp Clustered Data ONTAP 8.2 Storage QoS Date: June 2013 Author: Tony Palmer, Senior Lab Analyst Abstract: This ESG Lab Spotlight explores how NetApp Data ONTAP 8.2 Storage QoS can
More informationDATA DOMAIN INVULNERABILITY ARCHITECTURE: ENHANCING DATA INTEGRITY AND RECOVERABILITY
WHITEPAPER DATA DOMAIN INVULNERABILITY ARCHITECTURE: ENHANCING DATA INTEGRITY AND RECOVERABILITY A Detailed Review ABSTRACT No single mechanism is sufficient to ensure data integrity in a storage system.
More informationCOS 318: Operating Systems. File Systems. Topics. Evolved Data Center Storage Hierarchy. Traditional Data Center Storage Hierarchy
Topics COS 318: Operating Systems File Systems hierarchy File system abstraction File system operations File system protection 2 Traditional Data Center Hierarchy Evolved Data Center Hierarchy Clients
More informationSimplified. Software-Defined Storage INSIDE SSS
Software-Defined Storage INSIDE SSS Overcome SDS Challenges Page 2 Simplified Choose the Right Workloads for SDS Using Microsoft Storage Spaces Page 7 The need for agility, scalability, and cost savings
More informationWednesday, May 3, Several RAID "levels" have been defined. Some are more commercially viable than others.
Wednesday, May 3, 2017 Topics for today RAID: Level 0 Level 1 Level 3 Level 4 Level 5 Beyond RAID 5 File systems RAID revisited Several RAID "levels" have been defined. Some are more commercially viable
More informationCeph: A Scalable, High-Performance Distributed File System
Ceph: A Scalable, High-Performance Distributed File System S. A. Weil, S. A. Brandt, E. L. Miller, D. D. E. Long Presented by Philip Snowberger Department of Computer Science and Engineering University
More informationWhy Datrium DVX is Best for VDI
Why Datrium DVX is Best for VDI 385 Moffett Park Dr. Sunnyvale, CA 94089 844-478-8349 www.datrium.com Technical Report Introduction Managing a robust and growing virtual desktop infrastructure in current
More informationComing Changes in Storage Technology. Be Ready or Be Left Behind
Coming Changes in Storage Technology Be Ready or Be Left Behind Henry Newman, CTO Instrumental Inc. hsn@instrumental.com Copyright 2008 Instrumental, Inc. 1of 32 The Future Will Be Different The storage
More informationRBF: A New Storage Structure for Space- Efficient Queries for Multidimensional Metadata in OSS
RBF: A New Storage Structure for Space- Efficient Queries for Multidimensional Metadata in OSS Yu Hua 1, Dan Feng 1, Hong Jiang 2, Lei Tian 1 1 School of Computer, Huazhong University of Science and Technology,
More informationCSE 153 Design of Operating Systems
CSE 153 Design of Operating Systems Winter 2018 Lecture 22: File system optimizations and advanced topics There s more to filesystems J Standard Performance improvement techniques Alternative important
More informationStorage and File System
COS 318: Operating Systems Storage and File System Andy Bavier Computer Science Department Princeton University http://www.cs.princeton.edu/courses/archive/fall10/cos318/ Topics Storage hierarchy File
More informationWhite paper ETERNUS CS800 Data Deduplication Background
White paper ETERNUS CS800 - Data Deduplication Background This paper describes the process of Data Deduplication inside of ETERNUS CS800 in detail. The target group consists of presales, administrators,
More informationStaggeringly Large Filesystems
Staggeringly Large Filesystems Evan Danaher CS 6410 - October 27, 2009 Outline 1 Large Filesystems 2 GFS 3 Pond Outline 1 Large Filesystems 2 GFS 3 Pond Internet Scale Web 2.0 GFS Thousands of machines
More informationEMC Data Domain for Archiving Are You Kidding?
EMC Data Domain for Archiving Are You Kidding? Bill Roth / Bob Spurzem EMC EMC 1 Agenda EMC Introduction Data Domain Enterprise Vault Integration Data Domain NetBackup Integration Q & A EMC 2 EMC Introduction
More informationTurning Object. Storage into Virtual Machine Storage. White Papers
Turning Object Open vstorage is the World s fastest Distributed Block Store that spans across different Datacenter. It combines ultrahigh performance and low latency connections with a data integrity that
More informationQuota enforcement for high-performance distributed storage systems
Quota enforcement for high-performance distributed storage systems Kristal T. Pollack, Darrell D. E. Long, Richard A. Golding, Ralph A. Becker-Szendy IBM Almaden Research Center, San Jose, CA Benjamin
More informationProvisioning with SUSE Enterprise Storage. Nyers Gábor Trainer &
Provisioning with SUSE Enterprise Storage Nyers Gábor Trainer & Consultant @Trebut gnyers@trebut.com Managing storage growth and costs of the software-defined datacenter PRESENT Easily scale and manage
More informationNovember 7, DAN WILSON Global Operations Architecture, Concur. OpenStack Summit Hong Kong JOE ARNOLD
November 7, 2013 DAN WILSON Global Operations Architecture, Concur dan.wilson@concur.com @tweetdanwilson OpenStack Summit Hong Kong JOE ARNOLD CEO, SwiftStack joe@swiftstack.com @joearnold Introduction
More informationDeploying De-Duplication on Ext4 File System
Deploying De-Duplication on Ext4 File System Usha A. Joglekar 1, Bhushan M. Jagtap 2, Koninika B. Patil 3, 1. Asst. Prof., 2, 3 Students Department of Computer Engineering Smt. Kashibai Navale College
More informationStore Process Analyze Collaborate Archive Cloud The HPC Storage Leader Invent Discover Compete
Store Process Analyze Collaborate Archive Cloud The HPC Storage Leader Invent Discover Compete 1 DDN Who We Are 2 We Design, Deploy and Optimize Storage Systems Which Solve HPC, Big Data and Cloud Business
More informationEncrypted Data Deduplication in Cloud Storage
Encrypted Data Deduplication in Cloud Storage Chun- I Fan, Shi- Yuan Huang, Wen- Che Hsu Department of Computer Science and Engineering Na>onal Sun Yat- sen University Kaohsiung, Taiwan AsiaJCIS 2015 Outline
More informationA GPFS Primer October 2005
A Primer October 2005 Overview This paper describes (General Parallel File System) Version 2, Release 3 for AIX 5L and Linux. It provides an overview of key concepts which should be understood by those
More informationAn Application Awareness Local Source and Global Source De-Duplication with Security in resource constraint based Cloud backup services
An Application Awareness Local Source and Global Source De-Duplication with Security in resource constraint based Cloud backup services S.Meghana Assistant Professor, Dept. of IT, Vignana Bharathi Institute
More information-Presented By : Rajeshwari Chatterjee Professor-Andrey Shevel Course: Computing Clusters Grid and Clouds ITMO University, St.
-Presented By : Rajeshwari Chatterjee Professor-Andrey Shevel Course: Computing Clusters Grid and Clouds ITMO University, St. Petersburg Introduction File System Enterprise Needs Gluster Revisited Ceph
More informationUniversal Storage. Innovation to Break Decades of Tradeoffs VASTDATA.COM
Universal Storage Innovation to Break Decades of Tradeoffs F e b r u a r y 2 0 1 9 AN END TO DECADES OF STORAGE COMPLEXITY AND COMPROMISE SUMMARY When it s possible to store all of your data in a single
More informationStorage and File Structure. Classification of Physical Storage Media. Physical Storage Media. Physical Storage Media
Storage and File Structure Classification of Physical Storage Media Overview of Physical Storage Media Magnetic Disks RAID Tertiary Storage Storage Access File Organization Organization of Records in Files
More informationCh 11: Storage and File Structure
Ch 11: Storage and File Structure Overview of Physical Storage Media Magnetic Disks RAID Tertiary Storage Storage Access File Organization Organization of Records in Files Data-Dictionary Dictionary Storage
More informationSC17 - Overview
HPSS @ SC17 - Overview High Performance Storage System The value and benefits of the HPSS service offering http://www.hpss-collaboration.org 1 We are storage industry thought leaders HPSS is a development
More informationCS 111. Operating Systems Peter Reiher
Operating System Principles: Accessing Remote Data Operating Systems Peter Reiher Page 1 Outline Data on other machines Remote file access architectures Challenges in remote data access Security Reliability
More informationWas ist dran an einer spezialisierten Data Warehousing platform?
Was ist dran an einer spezialisierten Data Warehousing platform? Hermann Bär Oracle USA Redwood Shores, CA Schlüsselworte Data warehousing, Exadata, specialized hardware proprietary hardware Introduction
More informationThe Data-Protection Playbook for All-flash Storage KEY CONSIDERATIONS FOR FLASH-OPTIMIZED DATA PROTECTION
The Data-Protection Playbook for All-flash Storage KEY CONSIDERATIONS FOR FLASH-OPTIMIZED DATA PROTECTION The future of storage is flash The all-flash datacenter is a viable alternative You ve heard it
More informationLOAD BALANCING AND DEDUPLICATION
LOAD BALANCING AND DEDUPLICATION Mr.Chinmay Chikode Mr.Mehadi Badri Mr.Mohit Sarai Ms.Kshitija Ubhe ABSTRACT Load Balancing is a method of distributing workload across multiple computing resources such
More informationChapter 10: Mass-Storage Systems
Chapter 10: Mass-Storage Systems Silberschatz, Galvin and Gagne 2013 Chapter 10: Mass-Storage Systems Overview of Mass Storage Structure Disk Structure Disk Attachment Disk Scheduling Disk Management Swap-Space
More informationRAID SEMINAR REPORT /09/2004 Asha.P.M NO: 612 S7 ECE
RAID SEMINAR REPORT 2004 Submitted on: Submitted by: 24/09/2004 Asha.P.M NO: 612 S7 ECE CONTENTS 1. Introduction 1 2. The array and RAID controller concept 2 2.1. Mirroring 3 2.2. Parity 5 2.3. Error correcting
More informationHPSS Treefrog Introduction.
HPSS Treefrog Introduction Disclaimer Forward looking information including schedules and future software reflect current planning that may change and should not be taken as commitments by IBM or the other
More informationEMC DATA DOMAIN OPERATING SYSTEM
EMC DATA DOMAIN OPERATING SYSTEM Powering EMC Protection Storage ESSENTIALS High-Speed, Scalable Deduplication Up to 31 TB/hr performance Reduces requirements for backup storage by 10 to 30x and archive
More informationUsing Cloud Services behind SGI DMF
Using Cloud Services behind SGI DMF Greg Banks Principal Engineer, Storage SW 2013 SGI Overview Cloud Storage SGI Objectstore Design Features & Non-Features Future Directions Cloud Storage
More informationUK LUG 10 th July Lustre at Exascale. Eric Barton. CTO Whamcloud, Inc Whamcloud, Inc.
UK LUG 10 th July 2012 Lustre at Exascale Eric Barton CTO Whamcloud, Inc. eeb@whamcloud.com Agenda Exascale I/O requirements Exascale I/O model 3 Lustre at Exascale - UK LUG 10th July 2012 Exascale I/O
More informationExtreme Storage Performance with exflash DIMM and AMPS
Extreme Storage Performance with exflash DIMM and AMPS 214 by 6East Technologies, Inc. and Lenovo Corporation All trademarks or registered trademarks mentioned here are the property of their respective
More informationTIBX NEXT-GENERATION ARCHIVE FORMAT IN ACRONIS BACKUP CLOUD
TIBX NEXT-GENERATION ARCHIVE FORMAT IN ACRONIS BACKUP CLOUD 1 Backup Speed and Reliability Are the Top Data Protection Mandates What are the top data protection mandates from your organization s IT leadership?
More informationFederated Array of Bricks Y Saito et al HP Labs. CS 6464 Presented by Avinash Kulkarni
Federated Array of Bricks Y Saito et al HP Labs CS 6464 Presented by Avinash Kulkarni Agenda Motivation Current Approaches FAB Design Protocols, Implementation, Optimizations Evaluation SSDs in enterprise
More informationToday: Coda, xfs! Brief overview of other file systems. Distributed File System Requirements!
Today: Coda, xfs! Case Study: Coda File System Brief overview of other file systems xfs Log structured file systems Lecture 21, page 1 Distributed File System Requirements! Transparency Access, location,
More informationFile. File System Implementation. File Metadata. File System Implementation. Direct Memory Access Cont. Hardware background: Direct Memory Access
File File System Implementation Operating Systems Hebrew University Spring 2009 Sequence of bytes, with no structure as far as the operating system is concerned. The only operations are to read and write
More informationCS 162 Operating Systems and Systems Programming Professor: Anthony D. Joseph Spring Lecture 18: Naming, Directories, and File Caching
CS 162 Operating Systems and Systems Programming Professor: Anthony D. Joseph Spring 2004 Lecture 18: Naming, Directories, and File Caching 18.0 Main Points How do users name files? What is a name? Lookup:
More informationTen things hyperconvergence can do for you
Ten things hyperconvergence can do for you Francis O Haire Director, Technology & Strategy DataSolutions Evolution of Enterprise Infrastructure 1990s Today Virtualization Server Server Server Server Scale-Out
More informationCONFIGURATION GUIDE WHITE PAPER JULY ActiveScale. Family Configuration Guide
WHITE PAPER JULY 2018 ActiveScale Family Configuration Guide Introduction The world is awash in a sea of data. Unstructured data from our mobile devices, emails, social media, clickstreams, log files,
More information