Large-scale Archival Storage - a brief overview for the HEP use case -

Similar documents
Deep Storage for Exponential Data. Nathan Thompson CEO, Spectra Logic

Study of the viability of a Green Storage for the ALICE-T1. Eduardo Murrieta Técnico Académico: ICN - UNAM

A Ten Year ( ) Storage Landscape LTO Tape Media, HDD, NAND

PASIG Disk Trends. Oracle Storage Technology 101 Session. Philippe Deverchère EMEA Storage CTO. September 16, 2014

Storage: The Insatiable Demand.when does it end?

Progress of the Development of High Performance Removable Storage at InPhase Technologies for Application to Archival Storage

DMF-UG Mike Grayson Solutions Architect APAC

Whither Hard Disk Archives? Dave Anderson Seagate Technology 6/2016

Report from CHEP2015. Vladimir Sapunenko INFN- CNAF

Advanced Information Storage 05

<Insert Picture Here> Tape Technologies April 4, 2011

<Insert Picture Here> Oracle Storage

Fujifilm 2015 Conference Into Tomorrow with Tape Technology Investing in the Future. Nathan Thompson CEO & Founder Spectra Logic Corporation

Seagate Technology Revenue: Driven by Product, Cloud Storage?

IBM s 3592 Storage Solution: A Taste of the Future

Invest in New Technologies or Divest in Market Share

Brendan Lelieveld-Amiro, Director of Product Development StorageQuest Inc. December 2012

Scaling the Areal Density Mountain. Dave Anderson Seagate

Low-cost BYO Mass Storage Project

Multi-terabyte Tape System (MTS/ATP) Advanced Tape Technology Development, NIST. Rich Jewett Imation Corporation

The future of data archives and nearline storage

Effizientes Speichern von Cold-Data

Global Headquarters: 5 Speen Street Framingham, MA USA P F

IT Certification Exams Provider! Weofferfreeupdateserviceforoneyear! h ps://

TAPE $AVES: COST ENERGY DATA COMPANY.

Mark Geenen TrendFocus Presented at the THIC Meeting at the Sony Auditorium, 3300 Zanker Rd, San Jose CA March 9-10, 2004

Emerging Information Storage Technology A Technologist Viewpoint. Gordon Hughes, Associate Director, UCSD CMRR Center for Magnetic Recording Research

Seagate Point of View Cloud and Data Center Trend

Flash Storage with 24G SAS Leads the Way in Crunching Big Data

Storage Systems. Storage Systems

Costefficient Storage with Dataprotection

William Stallings Computer Organization and Architecture 8 th Edition. Chapter 6 External Memory

IT Certification Exams Provider! Weofferfreeupdateserviceforoneyear! h ps://

By 2014, World-Wide file based

Using Simulation to Design Scalable and Cost-Efficient Archival Storage Systems

Table 6.1 Physical Characteristics of Disk Systems

Cold Storage: The Road to Enterprise Ilya Kuznetsov YADRO

The Benefits of Enterprise-Class in a Small Form Factor. Determining Whether Enterprise Class in a Smaller Library is the Right Choice

Tech Talk on HPC. Ken Claffey. VP, Cloud Systems. May 2016

Lenovo Enterprise Portfolio

PMR: Innovation Achieved Implications for a New Era of Hard Disk Drive Technology

THE FUTURE OF STORAGE

Modular Drive Process System Design for Optimal Factory Efficiency September 9, 2010

Panasonic Optical Data Archiver freeze-ray

SSD Architecture Considerations for a Spectrum of Enterprise Applications. Alan Fitzgerald, VP and CTO SMART Modular Technologies

PRODUCT OVERVIEW SPECTRALOGIC.COM

Cat Herding. Why It s Time for a Millennial Approach to Storage. Cloud Expo East Western Digital Corporation All rights reserved 01/25/2016

Tape in the Microsoft Datacenter: The Good and Bad of Tape as a Target for Cloud-based Archival Storage

Accelerate with IBM Storage:

STATUS OF OPTICAL STORAGE IN JAPAN

Data oriented job submission scheme for the PHENIX user analysis in CCJ

Discovering Computers Fundamentals, 2011 Edition. Living in a Digital World

Session: Hardware Topic: Disks. Daniel Chang. COP 3502 Introduction to Computer Science. Lecture. Copyright August 2004, Daniel Chang

HP Storage Summit 2015 Transform Now.

Microsoft Exchange Server 2010 workload optimization on the new IBM PureFlex System

SAS Technical Update Connectivity Roadmap and MultiLink SAS Initiative Jay Neer Molex Corporation Marty Czekalski Seagate Technology LLC

Online data storage service strategy for the CERN computer Centre G. Cancio, D. Duellmann, M. Lamanna, A. Pace CERN, Geneva, Switzerland

Achieving Energy Efficiency in Data Storage for the Zettabyte Era

dcache tape pool performance Niklas Edmundsson HPC2N, Umeå University

HPSS Treefrog Summary MARCH 1, 2018

Storage Technology Outlook & and how the Internet fits into 1TB of IBM Watson. Dr Axel Köster Enterprise Storage Technologist ESCC Mainz

The Benefits of Solid State in Enterprise Storage Systems. David Dale, NetApp

SMART SERVER AND STORAGE SOLUTIONS FOR GROWING BUSINESSES

SSDs Driving Greater Efficiency in Data Centers

SDLT 600 Performance Whitepaper SDLT 600 Outperforms LTO 2 and AIT-3

16/06/56. Secondary Storage. Secondary Storage. Secondary Storage The McGraw-Hill Companies, Inc. All rights reserved.

Liz Conner Senior Research Analyst, Storage Systems & Personal Storage

Removable Disk Storage Successes & Flameouts: What Can We Learn from the Past as We Move Forward?

Chaz Stevens Director of Marketing

Windows Servers In Microsoft Azure

Cloudian Sizing and Architecture Guidelines

Product Development Rev II

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme

Combining HP StoreOnce and HP StoreEver Tape

Storage. CS 3410 Computer System Organization & Programming

ARCHIVE AND RECORDS MANAGEMENT

Objectives Overview. Chapter 7 Types of Storage. Instructor: M. Imran Khalil. MSc-IT 1st semester Fall Discovering Computers 2012

Exam : Title : Storage Sales V2. Version : Demo

Forging a Future in Memory: New Technologies, New Markets, New Applications. Ed Doller Chief Technology Officer

SurFS Product Description

LTO-8 The Future of Storage is Here

The QM2 PCIe Expansion Card Gives Your NAS a Performance Boost! QM2-2S QM2-2P QM2-2S10G1T QM2-2P10G1T

Technical Training. David Barrett-Hague Head of Sales and Marketing

Analysts Weigh In On Persistent Memory

High-Energy Physics Data-Storage Challenges

Market analysis report published on May, 2013

LaCie 12big Thunderbolt 3. Clement Barberis

Reconstruyendo una Nube Privada con la Innovadora Hiper-Convergencia Infraestructura Huawei FusionCube Hiper-Convergente

Long- Term Storage Panel Session

IBM TS4300 Tape Library

SGI Overview. HPC User Forum Dearborn, Michigan September 17 th, 2012

Agenda. Sun s x Sun s x86 Strategy. 2. Sun s x86 Product Portfolio. 3. Virtualization < 1 >

Storage Systems for Shingled Disks

Subodh Kulkarni Executive Director, R&D

Unveiling the new QM2 M.2 SSD/10GbE PCIe Expansion cards

Enterprise Ceph: Everyway, your way! Amit Dell Kyle Red Hat Red Hat Summit June 2016

STORING DATA: DISK AND FILES

Sun and Oracle. Kevin Ashby. Oracle Technical Account Manager. Mob:

High Volumes Storage Fundamentials V2

Solution Brief: XenData Digital Video Archives in a Dalet Environment

Transcription:

Large-scale Archival Storage - a brief overview for the HEP use case - GDB, 13/9/2017 Germán Cancio CERN IT/ST 1

Agenda Status of tape market and technology Alternatives to tape Disk Optical Holographic 2

Tape Market dominated by LTO consortium (~95%) IBM, HP, Quantum (drives) + Fujifilm, Sony (media) Oracle resells LTO drives from IBM Enterprise tape (IBM+Oracle) ~4% IBM: Latest: TS1155 @ 15TB, 350MB/s introduced May 2017 Oracle: Latest T10KD @ 8TB, 250MB/s introduced Sept 2013 Tape drive head technology: From GMR to TMR GMR has reached its density limits (HDD s: stopped in 2004) TMR requires substantial R&D and manufacturing retooling TS1155 uses TMR, LTO-8 will use it Large-scale libraries (>=10K slots) Oracle, IBM, Spectra Logic, now also Quantum 3

LTO and IBM enterprise tape roadmap (source: IBM) Expected EOY 2017 Released Q2 2017 8/6/2015 DPHEP Collaboration Workshop 4

Oracle enterprise tape roadmap 5

Tape drive head manufacturing (Source: Spectra) 6

Tape drive head manufacturing (Source: Spectra) 7

08/2017: Sony/IBM demo (using CoPtCr) 201Gb/in 2 ~330TB tape IBM TS1155 9.6Gb/in 2 04/2015: Fuji/IBM demo (using BaFe) 123Gb/in 2 ~220TB tape

Tape Market evolution LTO media shipments decreasing since ~2007 Consolidation, competition of disk and cloud solutions LTO media units shipped, 200 2016 (source: LTO consortium) ~ 20M units per year (~40 EB) Media price ~10-15CHF/TB(*), but decay has slowed down (-20%/yr over last 4 years) Two remaining media manufacturers (TDK exitus 2014) (*)Media is ~50% of tape TCO (add drive and library HW, maintenance) SPoF as IBM only remaining (major) manufacturer + R&D of tape drive technology Will this market sustain to drive (enterprise and LTO) tape research (new heads, new media) and production? 9

Disk (Spinning) disk market: WD (41%), Seagate (37%), Toshiba (22%) ~600EB/year, decreasing since 2010 Nearline (high capacity, high quality) drives used in HEP: ~10% of market sales Increased competition from cloud & SSD for notebooks and enterprise disks Source: Statista / B.P.S. Source: Wikipedia 10

Disk Technology Shingled recording disks (SMR) ~2013 Helium filled disks (more platters) ~2013 HAMR disks not before 2018(?) Complex technology, laser+new media, reliability+cost are issues Capacity evolution 14-16TB in 3-12 months 20TB in 2020 100TB by ~2025 with HAMR/HDMR Pricing for nearline disks decreasing ~14%/year, currently ~35-40CHF/TB Very shaky price evolution SSD s vs HDD s? Shipped capacity in 2016: ~45EB (roughly 7.5% of shipped HDD capacity expected to grow to ~20% around 2021) Large investments required for SSD manufacturing (200-300B$) SSD/TB price for capacity in foreseeable future still expected O(10)x of disk/tb price 11

Disk Servers for Archival? Current CERN EOS disk servers: one CPU node + 2x24 enterprise-class capacity disks. JBOD with 2 replica. Disk cost is 75% TCO is ~3x of tape CHF/TB Looking into optimising CHF/TB to close gap with tape. Ongoing investigations: Using desktop disks à la BackBlaze (30%-40% cheaper) No warranty / certification for our use case Measure reliability of different models/vendors (integrity, failure rates) using SMART and EOS monitoring Compensate lower reliability with higher redundancy Review operating procedures: Let disks die rather than replacing Two servers in production for ALICE (successfully so far) Monster servers for optimising disk-to-infrastructure cost ratio Testing 192 disks (2 trays with 8x24 HDD s each) on one server Up to 1.1PB raw with 6TB disks Evaluate different file system and redundancy layouts (ZFS pools, RAID, EOS erasure encoding) -> f(capacity, reliability, performance) 12

Disk Servers for Archival? Current CERN EOS disk servers: one CPU node + 2x24 enterprise-class capacity disks. JBOD with 2 replica. Disk cost is 75% TCO is ~3x of tape CHF/TB Looking into optimising CHF/TB to close gap with tape. Ongoing investigations: Using desktop disks à la BackBlaze (30%-40% cheaper) No warranty / certification for our use case Measure reliability of different models/vendors (integrity, failure rates) using SMART and EOS monitoring Compensate lower reliability with higher redundancy Review operating procedures: Let disks die rather than replacing Two servers in production for ALICE (successfully so far) Monster servers for optimising disk-to-infrastructure cost ratio Testing 192 disks (2 trays with 8x24 HDD s each) on one server Up to 1.1PB raw with 6TB disks Evaluate different file system and redundancy layouts (ZFS pools, RAID, EOS erasure encoding) -> f(capacity, reliability, performance) Roberto Valverde/CERN Roberto Valverde/CERN 13

Optical Archival Disk evolution of Blu-Ray Collaboration between Sony & Panasonic Max capacity 300GB disks (WORM only) 140MB/s write, 280MB/s read Reliability (Blu-Ray UBER 10-12 ) -> erasure coding overhead Roadmap to 1TB but no timeline nor new products 1TB by now 2020 (was 2010, then 2012..) Consumer market for optical disks vanishing Behind magnetic storage in capacity, performance, reliability Media volumes, pricing? Cheaper than HDD Libraries announced by Sony (Everspan) and Panasonic (Freeze-Ray) Robots mounting media trays for 4x16 disks (Everspan) / 12 disks (Freeze-Ray) Up to 14 expansion media racks @ 13PB raw (Everspan) -> ~180PB Up to 64 drives / library (Everspan) Customers? Everspan evaluation started by LANL 14

Holographic reference beam CCD readout Record information across media volume, not just surface High densities using different recording angles, wavelengths, position on single media location Potentially, O(GB)/mm 3, fast R/W rates 2015 Demo: 2Tb/in 2 -> ~770GB/in 3 signal modulation media Prototypes, even ECMA standards 300GB/disk, 20MB/s Small companies: InPhase (bankrupt in 2011), Akonia Holographics Research @ IBM Almaden Labs (until ~2000); GE No products on the market, nor any signs of upcoming ones 15

Summary Tape is still the most cost-effective archival solution for HEP, but concerns about the long-term sustainability of a contracting tape market dominated by a single technology provider Disk market also contracting but from a wider base in terms of volume and vendors Possible opportunities by exploring massive cheap disk setups for archival in order to close the 3x cost gap wrt tape Optical archival storage has recently seen a kind of revival, but no noticeable market impact yet Will holographic ever surface? 16

References Storage Technology and Markets (B. Panzer-Steindel, CERN IT CTO) IBM enterprise and LTO product roadmap Spectra Logic presentation to the DMF UG, 2017 INSIC consortium tape technology roadmap 2015-2025 LTO consortium 2016 tape capacity shipments Clipper Group TCO study ASTC disk technology roadmap, 2016 Monster node testing @ CERN/IT: Roberto.Valverde.Cameselle@cern.ch BackBlaze blog: Storage Pod Sony Everspan specs Akonia Holographics 2Tbit/in 2 press release 17

Source: IBM 18