Cloud object storage : the right way. Orit Wasserman Open Source Summit 2018

Similar documents
Cloud object storage in Ceph. Orit Wasserman Fosdem 2017

Ceph Rados Gateway. Orit Wasserman Fosdem 2016

AWS Storage Gateway. Not your father s hybrid storage. University of Arizona IT Summit October 23, Jay Vagalatos, AWS Solutions Architect

Object storage platform How it can help? Martin Lenk, Specialist Senior Systems Engineer Unstructured Data Solution, Dell EMC

INTRODUCTION TO CEPH. Orit Wasserman Red Hat August Penguin 2017

AWS Certified Solutions Architect - Associate 2018 (SAA-001)

CEPHALOPODS AND SAMBA IRA COOPER SNIA SDC

Exam : Implementing Microsoft Azure Infrastructure Solutions

Developing Microsoft Azure Solutions (70-532) Syllabus

Scaling Massive Content Stores in the Cloud. CloudExpo New York June Alfresco Founder & CTO

Developing Microsoft Azure Solutions (70-532) Syllabus

November 7, DAN WILSON Global Operations Architecture, Concur. OpenStack Summit Hong Kong JOE ARNOLD

Deploying Software Defined Storage for the Enterprise with Ceph. PRESENTATION TITLE GOES HERE Paul von Stamwitz Fujitsu

THE ZADARA CLOUD. An overview of the Zadara Storage Cloud and VPSA Storage Array technology WHITE PAPER

AWS Administration. Suggested Pre-requisites Basic IT Knowledge

ROCK INK PAPER COMPUTER

Developing Microsoft Azure Solutions (70-532) Syllabus

Amazon Web Services (AWS) Solutions Architect Intermediate Level Course Content

SCALITY ZENKO. Freedom & control across Hybrid IT and Multi-Cloud

AWS Storage Gateway. Amazon S3. Amazon EFS. Amazon Glacier. Amazon EBS. Amazon EC2 Instance. storage. File Block Object. Hybrid integrated.

RED HAT CEPH STORAGE ROADMAP. Cesar Pinto Account Manager, Red Hat Norway

Red Hat Storage Server for AWS

OpenStack SwiftOnFile: User Identity for Cross Protocol Access Demystified Dean Hildebrand, Sasikanth Eda Sandeep Patil, Bill Owen IBM

CEPH DATA SERVICES IN A MULTI- AND HYBRID CLOUD WORLD. Sage Weil - Red Hat OpenStack Summit

architecting block and object geo-replication solutions with ceph sage weil sdc

Object Storage Service. Product Introduction. Issue 04 Date HUAWEI TECHNOLOGIES CO., LTD.

Storage Made Easy Enterprise File Share and Sync Fabric Architecture

Niraj Kumar Lead Azure Architect, MCT( Microsoft Certified Trainer)

-Presented By : Rajeshwari Chatterjee Professor-Andrey Shevel Course: Computing Clusters Grid and Clouds ITMO University, St.

Dell EMC CIFS-ECS Tool

Amazon Web Services and Feb 28 outage. Overview presented by Divya

Overview of AWS Security - Database Services

OnCommand Cloud Manager 3.2 Deploying and Managing ONTAP Cloud Systems

Security on AWS(overview) Bertram Dorn EMEA Specialized Solutions Architect Security and Compliance

Document Sub Title. Yotpo. Technical Overview 07/18/ Yotpo

Amazon Glacier. Developer Guide API Version

TIBCO StreamBase 10 Distributed Computing and High Availability. November 2017

Elastic Cloud Storage (ECS)

THE CEPH POWER SHOW. Episode 2 : The Jewel Story. Daniel Messer Technical Marketing Red Hat Storage. Karan Singh Sr. Storage Architect Red Hat Storage

AWS Storage Optimization. AWS Whitepaper

8/3/17. Encryption and Decryption centralized Single point of contact First line of defense. Bishop

The New Economics of Cloud Storage


GlusterFS Architecture & Roadmap

IBM Spectrum NAS, IBM Spectrum Scale and IBM Cloud Object Storage

Google Cloud Platform for Systems Operations Professionals (CPO200) Course Agenda

Next Generation Storage for The Software-Defned World

4) An organization needs a data store to handle the following data types and access patterns:

Azure File Sync. Webinaari

Introducing Amazon Elastic File System (EFS)

GFS: The Google File System

Unstructured Data in the Cloud with ECS

How To Guide: Long Term Archive for Rubrik. Using SwiftStack Storage as a Long Term Archive for Rubrik

Get the Most Out of GoAnywhere: Achieving Cloud File Transfers and Integrations

Ceph Intro & Architectural Overview. Abbas Bangash Intercloud Systems

Eight Tips for Better Archives. Eight Ways Cloudian Object Storage Benefits Archiving with Veritas Enterprise Vault

IBM Compose Managed Platform for Multiple Open Source Databases

Microsoft Azure BLOB Storage

Amazon Web Services (AWS) Training Course Content

Amazon S3 Glacier. Developer Guide API Version

CloudBerry Backup for Windows 5.9

Storage for HPC, HPDA and Machine Learning (ML)

LINUX, WINDOWS(MCSE),

Oracle 10g and IPv6 IPv6 Summit 11 December 2003

PrepAwayExam. High-efficient Exam Materials are the best high pass-rate Exam Dumps

At Course Completion Prepares you as per certification requirements for AWS Developer Associate.

REFERENCE ARCHITECTURE Quantum StorNext and Cloudian HyperStore

SERVERS TO SERVICES HOW MICROSOFT AZURE CAN MODERNISE YOUR IT INFRASTRUCTURE. Joey Lau 9 November 2017

Opendedupe & Veritas NetBackup ARCHITECTURE OVERVIEW AND USE CASES

Best Practices in Designing Cloud Storage based Archival solution Sreenidhi Iyangar & Jim Rice EMC Corporation

ECE 7650 Scalable and Secure Internet Services and Architecture ---- A Systems Perspective

Amazon Web Services Training. Training Topics:

Azure SQL Database Training. Complete Practical & Real-time Trainings. A Unit of Sequel Gate Innovative Technologies Pvt. Ltd.

Cloud Computing /AWS Course Content

ProfileUnity with FlexApp Technology

Enabling Data Control in a Multi-Cloud World

Ambry: LinkedIn s Scalable Geo- Distributed Object Store

Azure Certification BootCamp for Exam (Developer)

What s New in Oracle Cloud Infrastructure Object Storage Classic. Topics: On Oracle Cloud. Oracle Cloud

OPEN HYBRID CLOUD. ALEXANDRE BLIN CLOUD BUSINESS DEVELOPMENT Red Hat France

Side Load Feature Nasuni Corporation Boston, MA

Developing Microsoft Azure Solutions

Object Storage Level 100

Datacenter replication solution with quasardb

GFS: The Google File System. Dr. Yingwu Zhu

Discover the all-new CacheMount

TestkingPass. Reliable test dumps & stable pass king & valid test questions

Release Notes Release (December 4, 2017)... 4 Release (November 27, 2017)... 5 Release

NPTEL Course Jan K. Gopinath Indian Institute of Science

Developing Microsoft Azure and Web Services. Course Code: 20487C; Duration: 5 days; Instructor-led

Questions and Answers No.:81-90(143q)

! Design constraints. " Component failures are the norm. " Files are huge by traditional standards. ! POSIX-like

Agenda. Introduction Storage Primer Block Storage Shared File Systems Object Store On-Premises Storage Integration

Yves Goeleven. Solution Architect - Particular Software. Shipping software since Azure MVP since Co-founder & board member AZUG

Designing Modern Apps Using New Capabilities in Microsoft Azure SQL Database. Bill Gibson, Principal Program Manager, SQL Database

IBM Aspera Direct-to-Cloud Storage

Red Hat Quay 2.9 Deploy Red Hat Quay on OpenShift

Distributed File Systems II

A New Key-value Data Store For Heterogeneous Storage Architecture Intel APAC R&D Ltd.

Security Overview of the BGI Online Platform

Transcription:

Cloud object storage : the right way Orit Wasserman Open Source Summit 2018 1

About me 20+ years of development 10+ in open source: Nested virtualization for KVM Maintainer of live migration in Qemu/kvm 4 years as Ceph core developer at Red Hat Architect at lightbits labs 2

Cloud object storage: the right way Introduction to cloud object storage Features: Multipart upload Versioning Life cycle Prefix Static website Security DR Summary 3

Introduction to cloud object storage 4

Object storage Flat namespace Objects are immutable Range Read Rich Metadata: 5 Ownership (Users and tenants) ACL User metadata

Cloud object storage 6 Restful API Common clouds: AWS S3 Swift (openstack) Google cloud storage Azure blob storage Ceph Digital Ocean

Example: Media 7

Example: Documents 8

When to use cloud object storage Cloud or large scale environment Lots of large objects that are rarely updated. Small objects that are updated infrequently and are not performance sensitive. Hard drives 9

When not to use cloud object storage If the application does lots of inplace writes inside big files. Change workload to larger writes Divide big file into smaller ones Legacy application 10 File on object (NFS on RGW, s3fs )

Cloud object storage features 11

Multipart upload 12 Upload a single object as a set of parts Transaction: Initiate Upload parts Complete

Multipart upload 13 Improved throughput Quick recovery from any network issues Pause and resume object uploads Begin an upload before you know the final object size Instead of FS rename

Multipart upload pitfalls 14 Due to the performance impact not recommend for small objects Regular upload is up to 5 GB Check your framework/sdk defaults! Orphans...

Versioning 15 Keeps the previous copy of the object in case of overwrite or deletion Problem: space usage

Life cycle 16 Configure automatic object transition: Expiration: used to clean old objects, older versions and failed multipart uploads Tiering: move object to colder storage

virtual hierarchy Add a prefix to an object Listing a sub folder by listing objs with a specific prefix 17

Static website Host a static website directly from the cloud object storage 18

Security 19

Signature: AWS4 More secure: Key is not part of the request All requests are signed Streaming support Not all SDK use it by default or even support it 20

Protocol and transport Encrypt the traffic High performance penalty Options: 21 Tunneling Terminate at the load balancers like HAProxy and use http for your internal network

Encryption Server side encryption is not enough Use client side encryption: SSE-C: Customer provided keys SSE-KMS: Key management service 22

Bucket and Object ACL 23 Owner System/Admin user Other users: Read/Write/Read ACP/Write ACP/Full control

Canned ACL 24

Be careful of public buckets 25

Bucket and Users policy 26 Access policies for users and buckets: Grant access from multiple accounts Cross account permission Read only for anonymous users Restricting access to a IP specific

Grant access from multiple accounts { "Version":"2012-10-17", "Statement":[ { "Sid":"AddCannedAcl", "Effect":"Allow", "Principal": {"AWS": ["arn:aws:iam::111122223333:root","arn:aws:iam::444455556666:root"]}, "Action":["s3:PutObject","s3:PutObjectAcl"], "Resource":["arn:aws:s3:::examplebucket/*"], "Condition":{"StringEquals":{"s3:x-amz-acl":["public-read"]}} } ] } 27

Secure Token Service Provides a temporary token to access the cloud storage Assume rule Used by storage class and glacier 28

Disaster Recovery 29

Test your DR plan! 30

Solution: geo replication Global object storage clusters with a single namespace Enables deployment of clusters across multiple geographic locations Clusters synchronize, allowing users to read from or write to the closest one Disaster recovery in case of a zone failure 31

RGW Multisite definitions 32 Realm - namespace Zone - represent a geographical location, cannot cross clusters ZoneGroup - group of replicating zones Period - current realm configuration. Updates are local and are only applied when committed.

How does the replication works Metadata ops 33 User and bucket updates Small amount of data rare updates wide effect Synchronous Meta master (master zone in the master zonegroup) Data ops Objects update Large amount of data Frequent operations Only affects a single object Asynchronous All zones

RGW default setup Created automatically first time radosgw runs without any multisite configuration 34 Zonegroup: default Zone: default Radosgw Radosgw

RGW local configuration Used to set zonegroup parameters like: 35 Default bucket index shards Placement target Realm: myrealm Zonegroup: local-zg Zone: local-zone Radosgw Radosgw

RGW Simple DR configuration Realm: myrealm Zonegroup: us Zone: us-west Radosgw Radosgw 36 Realm: myrealm Zonegroup: us Zone: us-east Radosgw Radosgw

RGW Local and replicated data configuration Realm: myrealm Zonegroup: local-zg Zone: local-zone Radosgw Radosgw 37 Realm: myrealm Zonegroup: us Zone: us-west Radosgw Radosgw Realm: myrealm Zonegroup: us Zone: us-east Radosgw Radosgw

Cloud sync Replicate your data to public cloud for DR Radosgw Radosgw 38

One cloud is not enough Disaster recovery to a different public cloud Replicate your private cloud data to public cloud 39

Metadata search 40 API to query based on object metadata Integration with ElasticSearch

Summary Object storage was designed for large scale and for the cloud Use object storage api to get all it advance features. Make sure your data is safe! Test your DR plan! Use Ceph for private cloud object storage! 41

github.com/oritwas @oritwas 42