Evaluating Cloud Storage Strategies. James Bottomley; CTO, Server Virtualization


Introduction to Storage
Attachments:
- Local (direct, cheap): SAS, SATA
- Remote (SAN, NAS; expensive): FC, net
Types:
- Block: spinning disk drive, SSD, RAID unit
- File: NFS, CEPH
- Object: RADOS, PCS
Profit from the cloud

Storage Performance Comparison

Storage Cost Comparison

A Closer Look at the Terms
Block device
- A unit of storage
- May be divided inflexibly (by partitioning)
- Usually locally attached, but may be on a SAN
File-based storage
- Exports views of a filesystem via NFS, CIFS or other protocols
- Is flexible: storage in views can be expanded and contracted on the fly
- Suffers from metadata issues on the server
Object storage
- Really just means a flexible block device
- May be expanded and contracted on the fly
- Easily administrable (unlike LUN partitioning in SANs)

Storage Types Comparison
(Diagram; labels listed in source order, running from Cloud Utility to Hosting Utility)
- Cloud Utility
- Simple Web API; no easy way to update objects; slow
- CEPH, Gluster: object size tuning problem
- Tuned to disk image size objects; designed for rapid update; scalable B/W
- Inelastic; hard to aggregate; attached to individual systems
- Slightly elastic; fixed size; good B/W; dedicated network; based on SAN; limited scaling
- Hosting Utility

Object vs File and the Metadata Problem
A large number of Cloud storage systems are file based
- CEPH, Gluster
The specific problem is that updating any file requires a change in the metadata
- This produces hotness in the journal
- As well as locking hierarchy issues
- And communication with the metadata server
- All of which slow the operations down
Object storage only uses metadata when objects are resized, created or destroyed
- Using a fixed size object incurs no metadata overhead whatsoever
So objects providing virtual environment roots allow efficient embedded filesystems with zero metadata overhead
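The contrast between per-write metadata updates and fixed-size objects can be sketched with a toy model. This is an illustration only: the class names, the `metadata_updates` counter and the `demo` helper are hypothetical and do not describe how CEPH, Gluster or any other real system accounts for metadata.

```python
class FileStore:
    """Toy file-based store: every write changes metadata (size, mtime)."""

    def __init__(self):
        self.files = {}
        self.metadata_updates = 0

    def write(self, name, offset, payload):
        buf = self.files.setdefault(name, bytearray())
        end = offset + len(payload)
        if end > len(buf):
            buf.extend(b"\0" * (end - len(buf)))  # file grows on demand
        buf[offset:end] = payload
        self.metadata_updates += 1  # assumed: one metadata change per write


class FixedObjectStore:
    """Toy object store: metadata changes only on create."""

    def __init__(self):
        self.objects = {}
        self.metadata_updates = 0

    def create(self, name, size):
        self.objects[name] = bytearray(size)
        self.metadata_updates += 1  # creation is a metadata operation

    def write(self, name, offset, payload):
        buf = self.objects[name]
        end = offset + len(payload)
        if end > len(buf):
            raise ValueError("write past the end of a fixed-size object")
        buf[offset:end] = payload  # in-place update, no metadata change


def demo(writes=100):
    """Apply the same write pattern to both stores and return the counters."""
    files, objects = FileStore(), FixedObjectStore()
    objects.create("ve-root", 4096)
    for i in range(writes):
        files.write("ve-root", i * 8, b"x" * 8)
        objects.write("ve-root", i * 8, b"x" * 8)
    return files.metadata_updates, objects.metadata_updates
```

Under these assumptions, `demo()` counts one metadata update per write for the file store and a single update (the creation) for the fixed-size object.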

FUSE Issues
FUSE (Filesystem in Userspace) is the Linux userspace filesystem interface
Main problem is it's incredibly SLOW
However, it is very useful, so a large number of cloud filesystems use it
- Gluster
Parallels originally avoided using it. However, now we've decided we'll fix it for everyone
Parallels engineers are currently interacting with the Linux filesystems and FUSE lists
The objective is to add write caching and mtime fixes to accelerate FUSE
Tests show we can get ~95% of the performance of a natively written filesystem

Consistency
Strong Consistency is hard to achieve in clusters
- Strong Consistency means that all updates are seen immediately after they are committed
- Strong consistency is most often violated across cluster reconfigurations
- Ironically, this is precisely when you usually need it (HA)
- Sheepdog, CEPH, PStorage
Eventual Consistency is the norm
- Means that all updates are eventually seen, but may not be immediately visible after they are committed
- SWIFT, Gluster (which does have a much slower strong consistency quorum enforcement mode)
Weak Consistency
- Does not guarantee write ordering and visibility
- Too weak to be useful for most cloud storage
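One common way to reason about quorum enforcement is the overlap rule: with N replicas, a write acknowledged by W of them and a read that consults R of them are guaranteed to share at least one replica when R + W > N. The sketch below checks that rule by brute force. It is a generic model, not a description of how Gluster, CEPH or PStorage implement consistency, and the function name is hypothetical.

```python
from itertools import combinations


def read_always_sees_write(n_replicas, write_quorum, read_quorum):
    """Return True if every possible read set overlaps every possible write set.

    Overlap means at least one replica consulted by the read holds the
    latest committed write, which is the condition for a strongly
    consistent read in this simplified model.
    """
    replicas = range(n_replicas)
    return all(
        set(write_set) & set(read_set)
        for write_set in combinations(replicas, write_quorum)
        for read_set in combinations(replicas, read_quorum)
    )
```

With three replicas, `read_always_sees_write(3, 2, 2)` holds, while `read_always_sees_write(3, 1, 1)` does not: a reader may consult a replica the write never reached, which is the window eventual consistency leaves open.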

Performance and Scalability
Cloud storage must be designed to scale not just per node, but also per Virtual Environment per node
This requires that there be no bottlenecks connecting a virtual environment to storage
- Sheepdog problem: it uses a single-threaded per-node gateway process, causing its scalability per VE to be poor
Ideally, a direct connection should be made between the virtual environment using the object and the storage providing it, with no intermediate broker
- Or using an intermediate broker tuned for scalability
Chunking (large block size for objects) also improves performance
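Chunking can be illustrated by the arithmetic a client would need in order to address storage directly: a byte range in an object is split into per-chunk pieces, each of which could then be sent straight to whichever server holds that chunk. This is a minimal sketch; the function name and the chunk size used in the example are assumptions, not values from the slides.

```python
def split_request(offset, length, chunk_size):
    """Split a byte range into (chunk_index, offset_in_chunk, length) pieces."""
    if offset < 0 or length < 0 or chunk_size <= 0:
        raise ValueError("offset and length must be >= 0, chunk_size > 0")
    pieces = []
    while length > 0:
        index, within = divmod(offset, chunk_size)
        # Never cross a chunk boundary within a single piece.
        n = min(length, chunk_size - within)
        pieces.append((index, within, n))
        offset += n
        length -= n
    return pieces
```

For example, with a hypothetical 8-byte chunk size, `split_request(6, 10, 8)` yields `[(0, 6, 2), (1, 0, 8)]`. Larger chunks mean fewer pieces per request and therefore fewer round trips.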

Requirements for Hosting Storage
The cardinal hosting requirement is that existing local storage should be repurposed as generic object based storage for
1. Supporting existing hosting environments and additional services
2. Enabling the provision of Cloud Services
Equating to the technical requirements
1. Performance must be wire speed SATA (100MB/s)
   - Tuned exactly for GB objects containing small files
2. Storage must be object based to avoid metadata issues
3. Objects should be capable of rapid random read/write updates
4. Storage bandwidth should scale linearly with the cluster

Simple Requirements for Additional Benefits
Hosting Enhancements
1. Free storage from individual nodes
   - Easy, fast migration of Virtual Environments
   - High Availability
2. Simple and efficient resizing with assist for legacy roots (ext3)
   - Makes storage easier to sell in increments
3. Cloning and Snapshotting
   - Value add for templating block based roots
   - Permits easy backup
4. Redundancy
   - Allows different storage SLAs for different prices
Cloud Enhancements (Ideal Storage Solution)
1. Dropbox-like services
2. Storage as a Service (like S3)
3. Storage on Demand
4. Tiered Storage Pricing

Ideal solution
Technical Specs
- Metadata is the key to improving performance
- Large static objects with rapid updates have fixed metadata
- 100MB/s performance over gigabit ethernet (no 10GE requirement)
Avoid
- Anything like a filesystem (CEPH, Gluster) because of
  - Locking problems
  - Speed issues with per file need to consult metadata
- Anything using FUSE (Gluster)
  - At least anything using FUSE without the Parallels acceleration patches
- Anything with a single threaded connection multiplexor (sheepdog)
  - Per cluster is worse (kills all scalability)
  - Per node is still bad (kills VE scalability)
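The 100MB/s target can be put next to the raw capacity of a gigabit link with some simple arithmetic. The sketch below uses decimal units (1 MB = 10^6 bytes) and ignores Ethernet, IP and storage protocol overhead, which is an assumption; the function names are hypothetical.

```python
def line_rate_mb_per_s(link_gbit_per_s):
    """Raw line rate in MB/s (decimal units), ignoring protocol overhead."""
    return link_gbit_per_s * 1e9 / 8 / 1e6


def link_utilisation(target_mb_per_s, link_gbit_per_s):
    """Fraction of the raw line rate that a target throughput consumes."""
    return target_mb_per_s / line_rate_mb_per_s(link_gbit_per_s)


def transfer_seconds(object_bytes, rate_mb_per_s):
    """Time to move an object of the given size at a sustained rate."""
    return object_bytes / (rate_mb_per_s * 1e6)
```

A 1 Gbit/s link carries at most 125 MB/s, so 100MB/s is 80% of the raw line rate, and a 1 GB object takes about 10 seconds to move at that speed. This is why the target is reachable without 10GE, but leaves little headroom.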

Introducing Parallels Cloud Storage
Why Choose Us?
- We're the experts in the field (we studied the problem)
- We fixed FUSE
- We redid the Linux loop device to work efficiently for virtual environment roots
  - In collaboration with Oracle, who did the Direct I/O patches
- Loop device also modified to do snapshotting and legacy filesystem resizing
- All the necessary infrastructure patches are upstream in Linux
  - Or are moving that way
What we provide
- Complete leverage of existing local node storage
- Strong Consistency and Redundancy
- Wire speed transfers because of optimised data architecture
  - Up to 100MB/s/node over 1GigE
- Hot object tiering and SSD caching

Parallels Cloud Storage Architecture

Future Features
Chunk Server based snapshotting
De-duplication
Thin Provisioning
- Actual storage size can appear much larger than in-use backing store because of sparsity of objects
- Also provides ability to do dynamic in-place upgrades of actual storage capacity
Innovative redundancy algorithms
Geographic Object Replication for advanced disaster recovery
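Thin provisioning through sparse objects can be sketched as a volume that advertises a large virtual size but allocates backing chunks only when they are first written. This is a toy in-memory model under stated assumptions: the `ThinVolume` class, its method names and the chunk size are hypothetical and do not reflect the Parallels implementation.

```python
class ThinVolume:
    """Toy thin-provisioned volume: chunks are allocated on first write."""

    def __init__(self, virtual_size, chunk_size):
        self.virtual_size = virtual_size
        self.chunk_size = chunk_size
        self.chunks = {}  # chunk index -> bytearray, only for written chunks

    @property
    def allocated(self):
        """Bytes of backing store actually in use."""
        return len(self.chunks) * self.chunk_size

    def grow(self, new_virtual_size):
        """Enlarge the advertised size without allocating any backing store."""
        if new_virtual_size < self.virtual_size:
            raise ValueError("this sketch only supports growing")
        self.virtual_size = new_virtual_size

    def write(self, offset, payload):
        if offset < 0 or offset + len(payload) > self.virtual_size:
            raise ValueError("write outside the volume")
        pos = 0
        while pos < len(payload):
            index, within = divmod(offset + pos, self.chunk_size)
            n = min(len(payload) - pos, self.chunk_size - within)
            if index not in self.chunks:
                self.chunks[index] = bytearray(self.chunk_size)
            self.chunks[index][within:within + n] = payload[pos:pos + n]
            pos += n

    def read(self, offset, length):
        if offset < 0 or length < 0 or offset + length > self.virtual_size:
            raise ValueError("read outside the volume")
        out = bytearray()
        while length > 0:
            index, within = divmod(offset, self.chunk_size)
            n = min(length, self.chunk_size - within)
            chunk = self.chunks.get(index)
            # Unwritten (sparse) regions read back as zeros.
            out += chunk[within:within + n] if chunk is not None else bytes(n)
            offset += n
            length -= n
        return bytes(out)
```

A volume created as `ThinVolume(1 << 30, 4096)` advertises 1 GiB, yet after a single 3-byte write its `allocated` property reports only one 4096-byte chunk. Calling `grow` changes the advertised size without touching the backing store, which mirrors the in-place capacity upgrade described above.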

Conclusions
Getting Cloud storage right for current hosting needs is not a simple problem
- The basic construction of many cloud storage offerings is unsuitable to hosting provider environments
Parallels has devoted considerable study and effort to mapping the needs of hosters on to cloud storage
Parallels has studied the strengths and weaknesses of current cloud storage offerings and incorporated the best into our cloud storage offerings
- While attempting to eliminate all the negative issues
- And improve performance
Parallels will leverage (and enhance) open source to achieve the best cloud storage system for hosters