PRESENTATION TITLE GOES HERE. Understanding Architectural Trade-offs in Object Storage Technologies

Similar documents
LTFS Bulk Transfer Standard PRESENTATION TITLE GOES HERE

Multi-Cloud Storage: Addressing the Need for Portability and Interoperability

Mark Rogov, Dell EMC Chris Conniff, Dell EMC. Feb 14, 2018

Use Cases for iscsi and FCoE: Where Each Makes Sense

Mobile and Secure Healthcare: Encrypted Objects and Access Control Delegation

Deploying Public, Private, and Hybrid. Storage Cloud Environments

Scaling Data Center Application Infrastructure. Gary Orenstein, Gear6

Interoperable Cloud Storage with the CDMI Standard. Mark Carlson, SNIA TC and Oracle Co-Chair, SNIA Cloud Storage TWG

OpenStack Manila An Overview of Manila Liberty & Mitaka

Tiered File System without Tiers. Laura Shepard, Isilon

Application Recovery. Andreas Schwegmann / HP

Trends in Worldwide Media and Entertainment Storage

ADVANCED DEDUPLICATION CONCEPTS. Thomas Rivera, BlueArc Gene Nagle, Exar

Tom Sas HP. Author: SNIA - Data Protection & Capacity Optimization (DPCO) Committee

LEVERAGING FLASH MEMORY in ENTERPRISE STORAGE

ADVANCED DATA REDUCTION CONCEPTS

Expert Insights: Expanding the Data Center with FCoE

Data Deduplication Methods for Achieving Data Efficiency

Everything You Wanted To Know About Storage (But Were Too Proud To Ask) The Basics

Cloud Archive and Long Term Preservation Challenges and Best Practices

Architectural Principles for Networked Solid State Storage Access

The File Systems Evolution. Christian Bandulet, Sun Microsystems

ASPECTS OF DEDUPLICATION. Dominic Kay, Oracle Mark Maybee, Oracle

Create a Smarter and More Economic Cloud Storage Architecture. November 7, 2018

Fibre Channel vs. iscsi. January 31, 2018

A Promise Kept: Understanding the Monetary and Technical Benefits of STaaS Implementation. Mark Kaufman, Iron Mountain

Storage in combined service/product data infrastructures. Craig Dunwoody CTO, GraphStream Incorporated

Massively Scalable File Storage. Philippe Nicolas, KerStor

SAS: Today s Fast and Flexible Storage Fabric. Rick Kutcipal President, SCSI Trade Association Product Planning and Architecture, Broadcom Limited

Trends in Data Protection and Restoration Technologies. Mike Fishman, EMC 2 Corporation

Interoperable Cloud Storage with the CDMI Standard. Mark Carlson, SNIA TC and Oracle Chair, SNIA Cloud Storage TWG

Virtualization Practices:

Expert Panel: Cloud Storage Initiatives: A Storage Developer Conference Preview August 4, 2015

The Role of WAN Optimization in Cloud Infrastructures. Josh Tseng, Riverbed

Adrian Proctor Vice President, Marketing Viking Technology

Notes & Lessons Learned from a Field Engineer. Robert M. Smith, Microsoft

EVERYTHING YOU WANTED TO KNOW ABOUT STORAGE, BUT WERE TOO PROUD TO ASK Part Cyan Storage Management. September 28, :00 am PT

Tutorial. A New Standard for IP Based Drive Management. Mark Carlson SNIA Technical Council Co-Chair

Virtualization Practices: Providing a Complete Virtual Solution in a Box

The Evolution of File Systems

Facing an SSS Decision? SNIA Efforts to Evaluate SSS Performance. Ray Lucchesi Silverton Consulting, Inc.

SNIA Tutorial 1 A CASE FOR FLASH STORAGE HOW TO CHOOSE FLASH STORAGE FOR YOUR APPLICATIONS

Effective Storage Tiering for Databases

Analysis and Optimization. Carl Waldspurger Irfan Ahmad CloudPhysics, Inc.

Planning For Persistent Memory In The Data Center. Sarah Jelinek/Intel Corporation

Restoration Technologies

SAS: Today s Fast and Flexible Storage Fabric

Extending RDMA for Persistent Memory over Fabrics. Live Webcast October 25, 2018

An Introduction to Key Management for Secure Storage. Walt Hubis, LSI Corporation

Deploying Software Defined Storage for the Enterprise with Ceph. PRESENTATION TITLE GOES HERE Paul von Stamwitz Fujitsu

Ron Emerick, Oracle Corporation

Cassandra, MongoDB, and HBase. Cassandra, MongoDB, and HBase. I have chosen these three due to their recent

Overview and Current Topics in Solid State Storage

The Benefits of Solid State in Enterprise Storage Systems. David Dale, NetApp

Felix Xavier CloudByte Inc.

How to create a synthetic workload test. Eden Kim, CEO Calypso Systems, Inc.

Everything You Wanted To Know About Storage: Part Teal The Buffering Pod

Felix Xavier CloudByte Inc.

A Gentle Introduction to Ceph

Optimize Storage Efficiency & Performance with Erasure Coding Hardware Offload. Dror Goldenberg VP Software Architecture Mellanox Technologies

Overview and Current Topics in Solid State Storage

in Transition to the Cloud David A. Chapa, CTE EVault, a Seagate Company

Jeff Dodson / Avago Technologies

IP-Based Object Drives Now Have a Management Standard

Centralized vs. Distributed A Great Storage Debate. Live Webcast September 11, :00 am PT

SRM: Can You Get What You Want? John Webster, Evaluator Group.

WHAT HAPPENS WHEN THE FLASH INDUSTRY GOES TO TLC? Luanne M. Dauber, Pure Storage

Storage Clouds. Marty Stogsdill Oracle

Trends in Data Protection and Restoration Technologies. Jason Iehl, NetApp

Apples to Apples, Pears to Pears in SSS performance Benchmarking. Esther Spanjer, SMART Modular

Storage Performance Management Overview. Brett Allison, IntelliMagic, Inc.

Design and Implementations of FCoE for the DataCenter. Mike Frase, Cisco Systems

STORAGE CONSOLIDATION WITH IP STORAGE. David Dale, NetApp

Ramin Elahi, Adjunct Faculty UC Santa Cruz Silicon Valley

Introduction to NoSQL Databases

Why software defined storage matters? Sergey Goncharov Solution Architect, Red Hat

iscsi : A loss-less Ethernet fabric with DCB Jason Blosil, NetApp Gary Gumanow, Dell

Scale-out Data Deduplication Architecture

Strategies for Single Instance Storage. Michael Fahey Hitachi Data Systems

ActiveScale Erasure Coding and Self Protecting Technologies

Achieving the Potential of a Fully Distributed Storage System

Challenges for Data Driven Systems

SNIA Tutorial 3 EVERYTHING YOU WANTED TO KNOW ABOUT STORAGE: Part Teal Queues, Caches and Buffers

Blockchain Beyond Bitcoin. Mark O Connell

SRM: Can You Get What You Want? John Webster Principal IT Advisor, Illuminata

New Oracle NoSQL Database APIs that Speed Insertion and Retrieval

Scale-out Storage Solution and Challenges Mahadev Gaonkar igate

CISC 7610 Lecture 2b The beginnings of NoSQL

Architecture of a Real-Time Operational DBMS

High Availability Using Fault Tolerance in the SAN. Wendy Betts, IBM Mark Fleming, IBM

ActiveScale Erasure Coding and Self Protecting Technologies

Scale-Out Architectures for Secondary Storage

EVERYTHING YOU WANTED TO KNOW ABOUT STORAGE, BUT WERE TOO PROUD TO ASK Where Does My Data Go? August 1, :00 am PT

Milestone Solution Partner IT Infrastructure Components Certification Report

Understanding Your Virtual Workload. Irfan Ahmad CTO CloudPhysics

New Fresh Storage Approach for New IT Challenges Laurent Denel Philippe Nicolas OpenIO

Business Benefits of Policy Based Data De-Duplication Data Footprint Reduction with Quality of Service (QoS) for Data Protection

Decentralized Distributed Storage System for Big Data

Database Availability and Integrity in NoSQL. Fahri Firdausillah [M ]

NFSv4.1 Using pnfs PRESENTATION TITLE GOES HERE. Presented by: Alex McDonald CTO Office, NetApp

Transcription:

Object Storage 201 PRESENTATION TITLE GOES HERE Understanding Architectural Trade-offs in Object Storage Technologies

SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA unless otherwise noted. Member companies and individual members may use this material in presentations and literature under the following conditions: Any slide or slides used must be reproduced in their entirety without modification The SNIA must be acknowledged as the source of any material used in the body of any document containing material from these presentations. This presentation is a project of the SNIA Education Committee. Neither the author nor the presenter is an attorney and nothing in this presentation is intended to be, or should be construed as legal advice or an opinion of counsel. If you need legal advice or a legal opinion please contact your attorney. The information presented herein represents the author's personal opinion and current understanding of the relevant issues involved. The author, the presenter, and the SNIA do not assume any responsibility or liability for damages arising out of any reliance on or use of this information. NO WARRANTIES, EXPRESS OR IMPLIED. USE AT YOUR OWN RISK. 2

Today s Presenters Alex McDonald, SNIA CSI Cloud Storage Initiative Chair - NetApp Vishnu Vardhan, Sr Mgr, Product Management, NetApp Twitter: @vardhanv Paul S. Levy System s Engineer & Architect Intel Storage Division 3

Agenda Object store market requirements How do object stores scale? Architectural choices for object stores Conclusions 4

Object store market requirements Expected Scale Performance Requirements Latency (Low latency vs Medium latency workloads) Throughput (High throughput vs Low throughput workloads) Key Features 100s of Billions of objects Multiple sites Data protection policies Business logic policies Erasure coding (leading to an object count explosion) Multiple / complex failure domains (disk, node, rack, data center, business unit) Operations and Management Requirements Adding nodes & sites Changing capacity of existing nodes and related balancing / re-balancing System upgrades and Data Migration Long lived systems, with topology changes and required rebalancing 5 *Sources: Google: Out Mobile Planet (26 apps/phone), Intel est.

Agenda Object store market requirements How do object stores scale? Architectural choices for object stores Conclusions 6

Scale-Out Properties Shared nothing architecture Ubiquitous Ethernet to tie things together Horizontally scaling of Independent services Load Balancing Load balancing across a large number of nodes Placement Service Determines location of where data needs to be placed Storage Service Basic storage service Tiering and Policy management Execute tiering & policy actions Access Control, audit & security Provide authentication, audit and security services Administration and Maintenance Handle topology changes (disks, nodes, racks, sites), expansions, contractions, hardware refreshes, system upgrades 7

How do Object Stores Scale? Functional Decomposition application Controller Essential functions are segmented and optimized Service for horizontal scaling Controller Admin & Maintenance Gateway Client interface Load Balancing Data Placement Tiering, Policy Management & Data protection System metadata Mapping information (Topology) Object Metadata Admin & Maintenance Storage Servers Information Repository Admin Service Storage Service

Storage Nodes - Typical Storage Nodes store information. Functionally they: Store Objects Support the Private Network Support Maintenance activities Defines Blast Radius 9

Controllers (Proxy Nodes) Typical Controller Nodes are the brains of the Object Store. Functionally they: Controllers (Proxy Nodes) Provide a gateway Client interface Load Balancing Data Placement System metadata Mapping information (Topology) Object Metadata Tiering, Policy Management & Data protection Object Request Storage Nodes Zone/Region A Controllers (Proxy Nodes) Storage Nodes Zone/Region B 10

Agenda Object store market requirements How do object stores scale? Architectural choices for object stores Conclusions 11

Architectural choices TODAY S DISCUSSION Controller Nodes Provide a gateway Client interface Load Balancing Data Placement System metadata Mapping information (Topology) Object Metadata Tiering, Policy Management & Data protection Admin Nodes Admin and Maintenance procedures Storage Nodes Information repository 12

Architectural choices Controller Nodes Provide a gateway Client interface Load Balancing Data Placement System metadata Mapping information (Topology) Object Metadata Tiering, Policy Management & Data protection Admin Nodes Admin and Maintenance procedures Storage Nodes Information repository 13

Consistent Hash Generates a repeatable sequence of hash values based on the input key and topology State Less Every element in the system knows data location given the Key and Topology Extreme Performance, Stateless, Consistent (implementations vary) 14

Consistent Hash Pros State Less Scales to trillions of Objects Fast Cons Consistent Topology Dependent Collision avoidance to guarantee uniform distribution Data movement required for adding storage (Topology Change) Should replace failed nodes for best performance 15

Centralized Database Organized collection data in structured sets that form tabular relationships between a key and a value. Stateful Located in one place Moderately Performant, Stateful, Consistent 16

Centralized Database Pros Consistent always returns the latest copy Flexible Placement Scales to Billions of Objects Maintenance and Expansion has manageable impact Cons State Full Needs protection from Single point of failure Centralized Bottleneck must be managed Not as performant as Consistent Hash or scalable as No-SQL 17

NoSQL Databases that contain relationships other than structured tabular sets. (e.g. KeyValue, Graph, Document) Distributed across multiple systems No single point of failure Consistent Hashing Distributed Virtual Nodes Data Protection High Performance, De-Centralized, State-full, Eventually Consistent 18

No-SQL Pros Not Centralized Flexible Placement Scales to Trillions of Objects Distributed across multiple sites Partition tolerant Highly performant @ scale Cons Eventually Consistent 19

Data Placement Tradeoff s Performance and scalability Comparison of options Attributes Consistent hashing Centralized DB s NoSQL DB s Lookup Latency Real time / fixed time DB size dependent 2001 us Throughput Network topology & system design Network topology & system design Network topology & system design Scalability No limit DB size dependent 100 s of Billions2 Consistency Strong Strong Eventual Examples Ceph, Swift MySql, Hadoop Name Node MongoDB, Cassandra, HBASE 1. Assumes Cassandra with a mostly read workload with 8 shards. (http://planetcassandra.org/nosql-performance-benchmarks/ ) Converting Ops / Sec into latency 2. http://www.mongodb.com/scale 20

Architectural choices Controller Nodes Provide a gateway Client interface Load Balancing Data Placement System metadata Mapping information (Topology) Object Metadata Tiering, Policy Management & Data protection Admin Nodes Admin and Maintenance procedures Storage Nodes Information repository 21

Object Meta-Data storage Infinitely scalable Limited usability apart from serving the meta-data back to the application Country = France Store in a centralized database Country = France Store with the data (e.g in the XATTR of the file-system) Single point of failure and scalability challenges Store in a NoSql database Scale to 100 s of billions of objects Policy based on business logic and application metadata Analytics on the metadata Country = France NoSQL DB Country = France 22

Tiering, policy management and data protection Consistent Hashing Simplistic policy Keep 3 copies in Rack 1, Use erasure coding instead of replication No per-object policies Ingest only policy NoSQL & Centralized DB approaches Keep 3 copies in Rack 1 Keep 3 copies in Rack 1 for 10 days, then keep 2 copies for 30 days, then keep 1 copy (erasure coded) If Object metadata = CEO s email keep 5 copies If object metadata = Europe stay within London datacenter Per object policies 23

Architectural choices Controller Nodes Provide a gateway Client interface Load Balancing Data Placement System metadata Mapping information (Topology) Object Metadata Tiering, Policy Management & Data protection Admin Nodes Admin and Maintenance procedures Storage Nodes Information repository 24

Operations Maintenance and Growth Scenarios to consider Adding a disk, node, rack, site Reducing capacity of a node (Rebalance) Disk, node, rack, site failure (Rebuild) Hardware refresh (and migrating data to a new node) Upgrade Hash based systems must maintain hash order when topology changes. (DIAG) DB Based system can re-balance over time Long lived systems, with topology changes and required rebalancing Systems live for many generations, topology changes are common 25

Architectural choices Controller Nodes Provide a gateway Client interface Load Balancing Data Placement System metadata Mapping information (Topology) Object Metadata Tiering, Policy Management & Data protection Admin Nodes Admin and Maintenance procedures Storage Nodes Information repository 26

Existing file-systems vs custom file-systems Use existing filesystems Leverage years of file-system hardening Limited by / Advantages of file-system scalability at a per node level EXT4 vs XFS vs BTRFS Custom filesystem Optimize for specific use cases by reducing unnecessary file-system overhead Minimize storage overhead Reduce operating system latency LevelDB/RocksDB 27

Streaming vs Store & forward architectures Streaming vs Store & Forward Streaming architectures Write - Do not wait for a full object to be written, before you make placement decisions make them on the fly Read - Do not wait for a full object to be assembled before you respond to a read Streaming Lower time to first byte Enabling scaling to very large object sizes Compress and encrypt data on the fly Can restore lost fragments instead of whole objects Ability to support random reads Store and forward architectures Lower complexity Store & Forward 28

Agenda Object store market requirements How do object stores scale? Architectural choices for object stores Conclusions 29

Performance and scalability in the real world Real world performance Most object storage latencies in the 10 s of ms Limited by WAN & LAN latencies / throughput At scale - Object sizes matter, data protection schemes matter Real world scalability Ability to handle topology changes (disks, nodes, racks, sites), expansions, contractions, hardware refreshes, system upgrades Real world scale trillions of objects vs 100 s of billions, single name-space versus 5 namespaces? Real world consistency Use case specific decision Simultaneous modification of same object needs strong consistency properties 30

Architecture tradeoffs Simple to deploy Scalable How the systems scale beyond a few PBs, Manageable How will they be managed across multiple years and deployments Flexible Ability to incorporate new features and capabilities 31

Object storage Market Requirements versus Your requirements Expected Scale Performance Requirements Latency (Low latency vs Medium latency workloads ) Throughput (High throughput vs Low throughput workloads) Key Features 100s of Billions of objects Multiple sites Data protection policies Business logic policies Erasure coding (leading to an object count explosion) Multiple / complex failure domains (disk, node, rack, data center, business unit) Operations and Management Requirements Adding nodes & sites Changing capacity of existing nodes and related balancing / re-balancing System upgrades and Data Migration Long lived systems, with topology changes and required rebalancing 32

After This Webcast This webcast and a copy of the slides will be posted to the SNIA Cloud Storage Initiative (CSI) website and available on-demand http://www.snia.org/forum/csi/knowledge/webcasts A full Q&A from this webcast, including answers to questions we couldn't get to today, will be posted to the SNIA-CSI blog http://www.sniacloud.com/ The original Object Storage 101 webcast is available on-demand at the SNIA-ESF website http://www.snia.org/forums/esf/knowledge/webcasts 33

Conclusion Thank You 34