Block or Object Storage?

Similar documents
Discover CephFS TECHNICAL REPORT SPONSORED BY. image vlastas, 123RF.com

Choosing an Interface

SUSE Enterprise Storage Technical Overview

Introducing SUSE Enterprise Storage 5

Software-defined Storage: Fast, Safe and Efficient

Using Cloud Services behind SGI DMF

Cloud Computing: Making the Right Choice for Your Organization

Choosing the Right Container Infrastructure for Your Organization

PRESENTATION TITLE GOES HERE. Understanding Architectural Trade-offs in Object Storage Technologies

OpenStack and Beyond Built on ProphetStor Federator

Hyper-Converged Infrastructure: Providing New Opportunities for Improved Availability

-Presented By : Rajeshwari Chatterjee Professor-Andrey Shevel Course: Computing Clusters Grid and Clouds ITMO University, St.

RED HAT CEPH STORAGE ROADMAP. Cesar Pinto Account Manager, Red Hat Norway

A Robust, Flexible Platform for Expanding Your Storage without Limits

Chapter 11: Implementing File Systems. Operating System Concepts 8 th Edition,

Chapter 11: File System Implementation. Objectives

STORAGE PROTOCOLS. Storage is a major consideration for cloud initiatives; what type of disk, which

White Paper. Nexenta Replicast

The General Parallel File System

RAIDIX Data Storage Solution. Clustered Data Storage Based on the RAIDIX Software and GPFS File System

Move Up to an OpenStack Private Cloud and Lose the Vendor Lock-in

Disaster Recovery Guide

White P C aper Title Here arbonite Cloud Migration Te T c e hnica ic l a G l g uide VM VM

Fully journaled filesystems. Low-level virtualization Filesystems on RAID Filesystems on Flash (Filesystems on DVD)

iscsi Target Usage Guide December 15, 2017

Virtual Memory. Chapter 8

virtual machine block storage with the ceph distributed storage system sage weil xensummit august 28, 2012

Bringing OpenStack to the Enterprise. An enterprise-class solution ensures you get the required performance, reliability, and security

vsan Mixed Workloads First Published On: Last Updated On:

Main Points. File layout Directory layout

RAIDIX Data Storage Solution. Highly Scalable Data Storage System for Postproduction Studios and Broadcasting Corporations

SurFS Product Description

Evaluating Cloud Storage Strategies. James Bottomley; CTO, Server Virtualization

Provisioning with SUSE Enterprise Storage. Nyers Gábor Trainer &

ZYNSTRA TECHNICAL BRIEFING NOTE

IBM Spectrum NAS. Easy-to-manage software-defined file storage for the enterprise. Overview. Highlights

SwiftStack and python-swiftclient

CLOUD COMPUTING IT0530. G.JEYA BHARATHI Asst.Prof.(O.G) Department of IT SRM University

File Management By : Kaushik Vaghani

Da-Wei Chang CSIE.NCKU. Professor Hao-Ren Ke, National Chiao Tung University Professor Hsung-Pin Chang, National Chung Hsing University

Turning Object. Storage into Virtual Machine Storage. White Papers

Chapter 8 Virtual Memory

Chapter 10: File System Implementation

Key metrics for effective storage performance and capacity reporting

VMware vsphere VMFS First Published On: Last Updated On:

CHAPTER 11: IMPLEMENTING FILE SYSTEMS (COMPACT) By I-Chen Lin Textbook: Operating System Concepts 9th Ed.

What to Look for in a Partner for Software-Defined Data Center (SDDC)

Maximize the Speed and Scalability of Your MuleSoft ESB with Solace

THE ZADARA CLOUD. An overview of the Zadara Storage Cloud and VPSA Storage Array technology WHITE PAPER

OPERATING SYSTEM. Chapter 12: File System Implementation

SEP sesam Backup & Recovery to SUSE Enterprise Storage. Hybrid Backup & Disaster Recovery

CA485 Ray Walshe Google File System

A fields' Introduction to SUSE Enterprise Storage TUT91098

CPSC 426/526. Cloud Computing. Ennan Zhai. Computer Science Department Yale University

Chapter 11: Implementing File Systems

Scality RING on Cisco UCS: Store File, Object, and OpenStack Data at Scale

The Leading Parallel Cluster File System

Eliminate the Complexity of Multiple Infrastructure Silos

IME (Infinite Memory Engine) Extreme Application Acceleration & Highly Efficient I/O Provisioning

Deploying Software Defined Storage for the Enterprise with Ceph. PRESENTATION TITLE GOES HERE Paul von Stamwitz Fujitsu

hot plug RAID memory technology for fault tolerance and scalability

IBM Spectrum NAS, IBM Spectrum Scale and IBM Cloud Object Storage

An Overview of CORAID Technology and ATA-over-Ethernet (AoE)

CLOUD-SCALE FILE SYSTEMS

Data Protection for Virtualized Environments

Chapter 11: Implementing File-Systems

Ceph Intro & Architectural Overview. Abbas Bangash Intercloud Systems

Finding a needle in Haystack: Facebook's photo storage

Adobe Digital Marketing s IT Transformation with OpenStack

Chapter 12: File System Implementation

Data safety for digital business. Veritas Backup Exec WHITE PAPER. One solution for hybrid, physical, and virtual environments.

ECONOMICAL, STORAGE PURPOSE-BUILT FOR THE EMERGING DATA CENTERS. By George Crump

Write a technical report Present your results Write a workshop/conference paper (optional) Could be a real system, simulation and/or theoretical

StorNext 3.0 Product Update: Server and Storage Virtualization with StorNext and VMware

Veeam with Cohesity Data Platform

CS370 Operating Systems

How to Reduce Data Capacity in Objectbased Storage: Dedup and More

PANASAS TIERED PARITY ARCHITECTURE

File System Internals. Jo, Heeseung

Cassandra, MongoDB, and HBase. Cassandra, MongoDB, and HBase. I have chosen these three due to their recent

Operating Systems. Week 9 Recitation: Exam 2 Preview Review of Exam 2, Spring Paul Krzyzanowski. Rutgers University.

UNLEASH YOUR APPLICATIONS

Acceleration Systems Technical Overview. September 2014, v1.4

EI 338: Computer Systems Engineering (Operating Systems & Computer Architecture)

Storage Virtualization Explained

WWW. FUSIONIO. COM. Fusion-io s Solid State Storage A New Standard for Enterprise-Class Reliability Fusion-io, All Rights Reserved.

MEMORY MANAGEMENT/1 CS 409, FALL 2013

powered by Cloudian and Veritas

EMC ScaleIO In The Enterprise: The Citi Experience. Tamir Segal Head of ScaleIO Marketing Eliot A. Wilson Senior Engineer, Citi

VIRTUAL MEMORY. Operating Systems 2015 Spring by Euiseong Seo

Bacula Systems Virtual Machine Performance Backup Suite

Rio-2 Hybrid Backup Server

Simplified Storage Migration for Microsoft Cluster Server

VMware vsphere Clusters in Security Zones

Using MySQL in a Virtualized Environment. Scott Seighman Systems Engineer Sun Microsystems

How CloudEndure Disaster Recovery Works

Virtual Machines Disco and Xen (Lecture 10, cs262a) Ion Stoica & Ali Ghodsi UC Berkeley February 26, 2018

VMWARE VSAN LICENSING GUIDE - MARCH 2018 VMWARE VSAN 6.6. Licensing Guide

GlusterFS and RHS for SysAdmins

Linux and Xen. Andrea Sarro. andrea.sarro(at)quadrics.it. Linux Kernel Hacking Free Course IV Edition

Transcription:

Block or Object? MARKET REPORT SPONSORED BY image vlastas, 123RF.com

Block or Object? MARKET REPORT Block or Object? If you re buying a storage solution, whether in the form of a software-defined storage system or a hardware device, one of the first questions you ll need to consider is whether to choose a product based on block or object storage. The differences between block and object storage are largely invisible in the configuration and daily operation of the solution, but behind the scenes, these two technologies behave quite differently. If you want to get the most for your storage dollar, you ll need to compare these techniques carefully before deciding on a solution for your storage environment. the user just hits Save, and the details of splitting up and saving the blocks happen behind the scenes. Block storage evolved for a number of reasons, including n writing data to the disk in fixed-sized increments was a logical solution in an era where the average file size exceeded the available memory, because limited memory required the operating system to write the data to disk in increments anyway. FILE Understanding Block Block storage has been around for years and was the most common storage format in the era of spinning media such as floppy disks and the conventional hard drive. In a block storage environment (Figure 1), a data storage object, such as a file, is broken into fixed-sized chunks, and the chunks (or blocks) are then stored on the storage medium. The filesystem abstracts this storage process from the user, so FIGURE 1: Block A file is broken into fixed-size blocks for storage on disk 2 WWW.ADMIN-MAGAZINE.COM

n breaking the data into chunks allowed the system to store blocks wherever they would fit on the disk, with the filesystem itself providing a means for reconstructing the original file by recovering the stored blocks. The block storage concept was quite successful for its time and forms the basis for most of the conventional filesystems we know today, such as Microsoft s NTFS, Apple s HFS+ series, and the Linux Ext filesystems. Some forms of block storage have made the transition to the era of virtual systems and cloud computing. Many block storage systems today use virtual storage techniques to allow a hardware appliance or distributed, softwaredefined infrastructure to behave like a hard disk or other block device. A Area Network (SAN) is an appliance or software system that appears to the network as a block storage device. Behind the simple block-storage interface, some SANs provide distributed, fault-tolerant storage. A SAN can come as a closed hardware appliance, or it can be implemented through software. Another recent block-storage technology related to a SAN is iscsi, which stands for Internet Small Computer Systems Interface. SCSI is a well known protocol for communication between a computer and a block storage device. iscsi (Internet SCSI) extends the SCSI protocol to allow access to block-storage devices over TCP/IP networks. Of course, an ordinary file server that stores files on a hard disk and uses a conventional filesystem, such as Ext or NTFS, is another form of block storage solution. Block storage can lead to inefficient use of the disk. Increasing the block size improves write times on larger files, but it can waste disk space (Figure 2). A filesystem with a block size of 4096 bytes must allocate a full block to a file that is only a few bytes long. Likewise, a file that does not break up evenly at the block boundary, such as a file that is 4.5 blocks long, requires the allocation of 5 blocks and therefore wastes half a block of storage. The theoretical average amount of wasted space is half a block per file. As you can see, in an environment with relatively few, larger files, the wasted space is negliable, but on a system with many small files, the lost storage could be significant. A couple of other issues have arisen concerning block storage in the data center age: n The division of the files and storage medium into fixed-size chunks does not scale well to very large disks and file sizes. n The block system, in which the contents of a single file might be scattered over several blocks, does not lend itself to associating metadata with a file. Although block storage remains a popular alternative for some storage scenarios, the limitations of the block storage paradigm have led FILE FILE Pro and Con Breaking up and reassembling the data into blocks takes time, which can reduce performance in some use cases. On the other hand, block storage is a better choice in the case of a transactional database, which requires low latency and frequent writes and re-writes to the same file. WWW.ADMIN-MAGAZINE.COM 4 BLOCKS FIGURE 2: Depending on the use case, block storage could make inefficient use of disk resources. A full block is allocated even for a file of only a few bytes. The allocated space for a larger file must exceed the actual file size unless file size is an exact multiple of block size. 3

MARKET REPORT The storage model used to save data to the storage medium doesn t have to be the same as the model presented to the network. The modern world of virtualization and application interfaces allows a much richer array of choices for building a storage environment. On real networks, the optimum technology for interfacing with storage clients and server systems might not be the optimum for intersome cloud vendors, such as Amazon, Google, and several other technology companies to adopt an alternative solution. Something New: Object Object storage evolved to address some of the limitations of the block storage model. The object storage concept has gained popularity in recent years as an alternative to block storage systems. One important innovation of object storage is that the location of the data is determined by parameters derived from the filename and other available metadata. In other words, the system knows where to find the file contents without searching for it in a look-up table or consulting a separate service that manages and oversees storage on the disk. In theory, object storage does not break the file into chunks but, instead, stores a whole file together in contiguous or near-contiguous space. In practice, some cluster-based object storage systems use a technique called erasure coding as a built-in form of data protection. Erasure coding breaks the data into fragments for storage within the cluster, encoding additional information with the fragments that will allow the object to be reconstructed if part of the data is lost. Erasure coding provides more fault tolerance than a conventional RAID system, because it continues to offer data protection even after a disk failure. Erasure encoding also reduces the time and overhead for reconstructing data. Pro and Con Eliminating the need to break the file into blocks improves scalability for large files and filesystems. At the same time, deriving the storage location from a hash based on the filename eliminates the overhead of a broker process or lookup table for managing the storage space. The improved support for metadata enhances the opportunity for systems and applications to gain knowledge of the data with minimal overhead. The erasure coding technique used with object storage is more fault-tolerant than RAID, and it offers more efficient reconstruction of data following a disk failure. On the other hand, erasure encoding increases latency and therefore isn't well suited for low-latency scenarios. The complexity of object storage systems, with their distributed technologies and sophisticated hash algorithms, is typically too much for single-user scenarios. Object storage is best suited for busy enterprise networks with large amounts of storage capacity. Which is Best? Both block and object storage have uses in today s storage environment. Block storage is often preferrable for transactional systems such as a relational database and other uses that require low latency. The block storage environment also has benedits for scenarios that require frequent reading or rewriting of previously stored data. Object storage is best suited to files of arbitrary size and very large disks. The object storage envionment puts the emphasis on storing, and the object format is optimized for situations that require fast storage and infrequent or moderately frequent later access. Better scalability makes object storage a better solution for large filesystems. Both block and object storage support a variety of software-defined storage solutions (see Table 1). Interfaces 4 WWW.ADMIN-MAGAZINE.COM

TABLE 1: Software-Defined Solutions Core Interfaces Open Source License? Model Block Filesystem Object iscsi REST Scality Ring No Object Via OpenStack Yes Yes No YES Datacore VirtualSAN No Block Yes No No No No ScaleIO No Block Yes No No Yes Yes Vmware VSAN No Block Yes No No No No Ceph Yes Object Yes Yes Yes Yes Yes facing with the storage medium. As shown in Figure 3, the software-defined storage system can layer various interface components onto the underlying infrastructure that connects with the storage medium. The Ceph software-defined storage solution (refer to Table 1) is an example of a solution that supports a rich collection of front-end interfaces for interacting with the network in a variety of ways. Ceph provides efficient object-based storage at the hardware level, and on the front end, Ceph can appear to the network as a block storage, object storage, or a network filesys- Block Filesystem Software-Defined Object tem. Because Ceph is open source, it is easy to modify and extend. Part of the power of virtual storage and the software-defined storage revolution is the ability to offer different interfaces and protocols to a variety of different storage clients. The flexibility lets a software-defined storage solution slip easily into a complex enterprise scenario. For instance, a group of user workstations that are configured to interact with a conventional file server can access the storage cluster through a network filesystem. An application that expects to write directly to a disk can use the block storage interface, and a local cloud or virtual storage infrastructure can leverage the efficiencies of object-based storage technology through an object storage front end. SUSE Enterprise : Software-Defined for the Enterprise SUSE Enterprise is a robust, well integrated software-defined storage solution. SUSE Enterprise is based on Ceph, so it offers the benefts of back-end object storage techology while providing a variety of interfaces for offering storage services to network clients. FIGURE 3: A software-defined storage system can support several interfaces for network clients that handle data in dissimilar ways. WWW.ADMIN-MAGAZINE.COM SUSE Enterprise is tailored for enterprise networks, so it provides the benefits of open source with the rigorous testing, reliability, and support required for the enterprise. In addition to the basic filesystem, block, and 5

MARKET REPORT object interfaces, SUSE Enterprise includes support for iscsi and provides a RESTful gateway to support Amazon S3 and OpenStack Swift cloud frameworks. The SUSE Enterprise environment is shown in Figure 4. SUSE Enterprise is actually a version of SUSE Enterprise Linux, with additional components and extensions to support Ceph and a rich software-defined storage environment. A SUSE Enterprise cluster consists of three or more object storage daemons and at least two monitor daemons. The client requires only minimal knowledge of the network, and storage decisions are distributed through the cluster, improving efficiency and minimizing write times. The SUSE Enterprise cluster provides erasure encoding for continuous fault tolerance. And the cluster is self-healing. If a node on the cluster goes down, the other cluster nodes adapt, redistributing the workload and continuing to operate. You can build your SUSE Enterprise cluster from inexpensive comodity hardware at a fraction of the cost of proproprietary, appli- Block Clients File Server Clients ance-based systems. The open source technologies at the core of SUSE Enterprise encourage innovation and offer natural immunity from vendor lock-in. Conclusion Block storage is a long-established means for storing data to a conventional hard disk, and recent developments have extended the block storage paradigm to network storage cluster environments. Although block storage often makes inefficient use of the disk space, it still offers performance benefits for transactional database systems and environments with frequent reads and rewrites. Object-based storage is a more recent technology that uses disk space more efficiently and scales better to larger large disks and distributed clusters. Object storage is optimized for fast writes and less frequent later access. Modern-day virtual storage environments let the storage system present a number of interface systems to the network regardless of the back-end disk storage technology. Ceph is a self-healing, open source solution with a robust object-based backend and a collection of interface components Local Cloud Object for supporting filesystem, block, and object storage clients. iscsi (Network Block ) FIGURE 4: SUSE Enterprise. SUSE Enterprise Cluster RESTful Gateway SUSE Enterprise brings the power of Ceph to corporate enterprise systems, with enterprise-grade testing and certification. In addition to the filesystem, object, and block storage components included with vanilla versions of Ceph, SUSE Enterprise adds advanced tools for cluster management and configuration, as well as iscsi support and a strong affinity for OpenStack environments. 6 WWW.ADMIN-MAGAZINE.COM