Low-cost BYO Mass Storage Project

Similar documents
Storage Optimization with Oracle Database 11g

Storage Systems. Storage Systems

DAHA AKILLI BĐR DÜNYA ĐÇĐN BĐLGĐ ALTYAPILARIMIZI DEĞĐŞTĐRECEĞĐZ

Effective Use of CSAIL Storage

Take control of storage performance

Storage Update and Storage Best Practices for Microsoft Server Applications. Dennis Martin President, Demartek January 2009 Copyright 2009 Demartek

IT Certification Exams Provider! Weofferfreeupdateserviceforoneyear! h ps://

The World s Fastest Backup Systems

TCO REPORT. NAS File Tiering. Economic advantages of enterprise file management

Today: Secondary Storage! Typical Disk Parameters!

Dell PowerVault MD Family. Modular storage. The Dell PowerVault MD storage family

Data Centers and Cloud Computing

Data Centers and Cloud Computing. Slides courtesy of Tim Wood

Deep Storage for Exponential Data. Nathan Thompson CEO, Spectra Logic

Assessing performance in HP LeftHand SANs

Evaluation Report: Improving SQL Server Database Performance with Dot Hill AssuredSAN 4824 Flash Upgrades

Data Centers and Cloud Computing. Data Centers

THE SUMMARY. CLUSTER SERIES - pg. 3. ULTRA SERIES - pg. 5. EXTREME SERIES - pg. 9

Introducing Tegile. Company Overview. Product Overview. Solutions & Use Cases. Partnering with Tegile

SurFS Product Description

Exam : Title : Storage Sales V2. Version : Demo

Design a Remote-Office or Branch-Office Data Center with Cisco UCS Mini

TECHNICAL SPECIFICATIONS + TECHNICAL OFFER

Backup Solutions with (DSS) July 2009

The Microsoft Large Mailbox Vision

The Oracle Database Appliance I/O and Performance Architecture

StarWind Virtual SAN Free

IBM Storwize V7000 Unified

Flash Decisions: Which Solution is Right for You?

COSC6376 Cloud Computing Lecture 17: Storage Systems

VSTOR Vault Mass Storage at its Best Reliable Mass Storage Solutions Easy to Use, Modular, Scalable, and Affordable

The Fastest And Most Efficient Block Storage Software (SDS)

Data Protection in 2006 A look into strategy, technology and design

LEVERAGING FLASH MEMORY in ENTERPRISE STORAGE

Hyper-converged Secondary Storage for Backup with Deduplication Q & A. The impact of data deduplication on the backup process

Nexsan Storage Optimizes Virtual Operating Environment for The Clark Enersen Partners

Is Tape Really Cheaper Than Disk?

Sun Lustre Storage System Simplifying and Accelerating Lustre Deployments

Opendedupe & Veritas NetBackup ARCHITECTURE OVERVIEW AND USE CASES

Evaluating Cloud Storage Strategies. James Bottomley; CTO, Server Virtualization

Design a Remote-Office or Branch-Office Data Center with Cisco UCS Mini

Copyright 2012, Oracle and/or its affiliates. All rights reserved.

FOUR WAYS TO LOWER THE COST OF REPLICATION

ZFS User Conference Large Scale Homelab Backups Mike Trogni

Nimble Storage vs HPE 3PAR: A Comparison Snapshot

50 TB. Traditional Storage + Data Protection Architecture. StorSimple Cloud-integrated Storage. Traditional CapEx: $375K Support: $75K per Year

Clouds, Convergence & Consolidation

Identifying and Eliminating Backup System Bottlenecks: Taking Your Existing Backup System to the Next Level

EMC Backup and Recovery for Microsoft Exchange 2007

HP X9000 Network Storage Systems (NAS)

Storageflex HA3969 High-Density Storage: Key Design Features and Hybrid Connectivity Benefits. White Paper

TCC, so your business continues

Where s Your Third Copy?

IBM BladeCenter S Competitive Summary

The advantages of architecting an open iscsi SAN

A Look at CLARiiON with ATA CX Series Disk Drives and Enclosures

Surveon NVR Series Overview

Introduction To Computer Hardware. Hafijur Rahman

COMP283-Lecture 3 Applied Database Management

It s Time to Move Your Critical Data to SSDs Introduction

Deploy a High-Performance Database Solution: Cisco UCS B420 M4 Blade Server with Fusion iomemory PX600 Using Oracle Database 12c

SOLO NETWORK (11) (21) (31) (41) (48) (51) (61)

HYBRID STORAGE TM. WITH FASTier ACCELERATION TECHNOLOGY

Executive Summary. The Need for Shared Storage. The Shared Storage Dilemma for the SMB. The SMB Answer - DroboElite. Enhancing your VMware Environment

MD4 esata. 4-Bay Rack Mount Chassis. User Manual February 6, v1.0

Introducing SUSE Enterprise Storage 5

QNAP OpenStack Ready NAS For a Robust and Reliable Cloud Platform

Introduction to I/O and Disk Management

Stargate Studios Builds Highly Efficient Storage Infrastructure with Nexsan

Introduction to I/O and Disk Management

IBM System Storage DCS3700

If you knew then...what you know now. The Why, What and Who of scale-out storage

Fujitsu PRIMERGY Servers Portfolio

Cold Storage: The Road to Enterprise Ilya Kuznetsov YADRO

PrepAwayExam. High-efficient Exam Materials are the best high pass-rate Exam Dumps

IBM Netfinity ServeRAID-3H and -3L Ultra2 SCSI Adapters

Veritas NetBackup on Cisco UCS S3260 Storage Server

Storage Devices for Database Systems

Creating the Fastest Possible Backups Using VMware Consolidated Backup. A Design Blueprint

How to Pick SQL Server Hardware

The IBM Storwize V3700: Meeting the Big Data Storage Needs of SMBs

Taurus Super-S Combo

DELL POWERVAULT MD FAMILY MODULAR STORAGE THE DELL POWERVAULT MD STORAGE FAMILY

Shared Storage Solutions. for Collaborative Media Production Networks

Dell PowerVault MD Family. Modular storage. The Dell PowerVault MD storage family

vstart 50 VMware vsphere Solution Specification

THESUMMARY. ARKSERIES - pg. 3. ULTRASERIES - pg. 5. EXTREMESERIES - pg. 9

Quantifying FTK 3.0 Performance with Respect to Hardware Selection

LEVERAGING A PERSISTENT HARDWARE ARCHITECTURE

Isilon: Raising The Bar On Performance & Archive Use Cases. John Har Solutions Product Manager Unstructured Data Storage Team

Storage Best Practices for Microsoft Server Applications

IBM N Series. Store the maximum amount of data for the lowest possible cost. Matthias Rettl Systems Engineer NetApp Austria GmbH IBM Corporation

Automated Storage Tiering on Infortrend s ESVA Storage Systems

White Paper. A System for Archiving, Recovery, and Storage Optimization. Mimosa NearPoint for Microsoft

Upgrading and Repairing: Build a PC with Scott Mueller

The Future of Business Depends on Software Defined Storage (SDS) How SSDs can fit into and accelerate an SDS strategy

THE FUTURE OF BUSINESS DEPENDS ON SOFTWARE DEFINED STORAGE (SDS)

Distributed Systems. 31. The Cloud: Infrastructure as a Service Paul Krzyzanowski. Rutgers University. Fall 2013

AN ALTERNATIVE TO ALL- FLASH ARRAYS: PREDICTIVE STORAGE CACHING

Artico NAS Storage Appliance Site Planning Guide

Transcription:

Low-cost BYO Mass Storage Project James Cizek Unix Systems Manager Academic Computing and Networking Services

The Problem Reduced Budget Storage needs growing Storage needs changing (Tiered Storage) Long term (Archiving) storage needs growing 2

Projected Needs (2009 Survey) 5,000 Research Data Storage Need (TBytes) 4,000 3,000 2,000 1,000-4,914 1,384 219 Now 2 Years 5 Years 3

The Goal Find a mass storage solution that won t break the bank Vendors sell high-speed, costly systems (suitable for Amazon, Google, etc.), but we want slower, low-cost Looking at vendor offerings, we decided to roll our own Maximize TB/$$ with reasonable assurance that data are redundant and safe 4

Some Understandings Approached this project as Secondary or Tier 2 type storage, not intended to replace extremely fast, ultra-reliable, expensive disk systems Realized that device management, support, and component failure need to be addressed 5

A starting point Online backup company Backblaze opensourced their storage pod design, see https://www.backblaze.com/petabytes-on-a-budget-howto-build-cheap-cloud-storage.html Thought that starting with a proven design would eliminate many unknowns and speed up our design process Turned out to be helpful, but ran into many of our own headaches 6

The BackBlaze design 7

BackBlaze vs. CSU design goals Realized that the BackBlaze design didn t exactly meet our requirements No redundant power supplies Cheap SATA cards didn t take advantage of performance available by having large number of spinning hard drives Case too small to accommodate server-class motherboard Single system hard drive is single point of failure. Realized the need to over-engineer cooling and vibration reduction (2 major contributors to drive failure) Chassis was red instead of CSU green! 8

CSU design changes Lengthened case by 3 inches to accommodate dual CPU server-class motherboard Added more RAM for file system buffering (6 GB compared to BackBlaze 4GB) Added larger, redundant power supplies - individual supply can run entire case Used Enterprise grade drives instead of consumer grade, after much research Drives selected have vibration sense / damping Replaced cheap SATA cards with highperformance PCI-e cards 9

CSU chassis nearing completion 10

CSU chassis nearing completion 11

DIY storage- V2.0 As time progressed, CSU found a commercial offering that nearly replicates our Green Box Offered in 36 drive smart and 45 drive dumb models Price point equivalent to in-house unit New technology (SAS expander vs. SATA multiplier) offers twice the speed and far more flexibility (More OS choices, drive choices) 12

DYI storage- V2.0 (cont) Although dampens the DIY spirit of the project, the new chassis offers more reliability, expansion and speed than Green box Still functionally identical, but can be built from opening the box to installed in rack in 2 hours! Allows multiple hosts to access simultaneously (i.e. Windows, Unix) 13

CSU Commercial Chassis 14

CSU Commercial Chassis 15

CSU Commercial Chassis 16

Costs (Greenbox V1.0) Case: $700 1 TB Drives: $99 x 45 ($4,455) Motherboard / Processors / Memory: $900 Power Supplies: $200 SATA cards: $300 Ethernet card with iscsi offload: $350 SATA Multipliers: $45 x 9 ($405) Fans/Cables/Hardware/DVD/Mounts/etc.: $1,000 Total: 45 Raw TB for $8,310! 17

Costs (Silverbox V2.0) Case: $3600 (includes MB, memory, etc..) 2 TB Drives: $189 x 36 ($6,804) Total: 72 Raw TB for $10,404! 18

Initial Performance Indicates that these units will be more than adequate as secondary and long term storage May be fast enough to replace expensive primary storage in the future (Lot of spindles) Has proven expandability is easy and cost effective 19

Cost / Performance Comparison As primary storage: Current models ~$25K for 1 Terabyte As secondary storage: Current models ~$8K for 16 Terabytes Our system: 72 Terabytes for ~$10K In initial performance testing, our system beats the best published performance numbers of both our primary and secondary storage systems 20

Challenges ahead Support management (What happens when a disk fails?) Backup and protection of stored data Mirroring units Avoid backing up to enterprise backup system Data storage and protection policies Parallel file system 21

Where will this be useful? Library digital repository Research computing HPC, tier 2 Campus wide Cloud storage Second or Third Tier storage for your Enterprise backups Email/File archiving Database snapshots kept for long term (LMS) 22

In Summary At $139 / Terabyte, this solution provides mass storage cheaper than anything found to date Low price allows building 2 and mirroring (Much less expensive than tape backup!) Space, power, and cooling savings are substantial over other offerings Provides a simple solution to allow *all* research data to be kept from a project instead of discarding portions Flexible enough to fit many applications where large data storage is a necessity Reusable! After project completion, can be reconfigured to fit needs on the fly 23

Questions? james@colostate.edu 24