Best Practices in Designing Cloud Storage based Archival solution Sreenidhi Iyangar & Jim Rice EMC Corporation

Similar documents
Multi-Cloud Storage: Addressing the Need for Portability and Interoperability

shiftz Citrix virtual desktops and applications on Microsoft Azure Lieven Van de Walle

Object Storage Service. Product Introduction. Issue 04 Date HUAWEI TECHNOLOGIES CO., LTD.

CogniFit Technical Security Details

Opendedupe & Veritas NetBackup ARCHITECTURE OVERVIEW AND USE CASES

Interoperable Cloud Storage with the CDMI Standard. Mark Carlson, SNIA TC and Oracle Co-Chair, SNIA Cloud Storage TWG

Microsoft Azure BLOB Storage

Cloud FastPath: Highly Secure Data Transfer

How To Guide: Long Term Archive for Rubrik. Using SwiftStack Storage as a Long Term Archive for Rubrik

Eight Tips for Better Archives. Eight Ways Cloudian Object Storage Benefits Archiving with Veritas Enterprise Vault

Using Cohesity with Amazon Web Services (AWS)

REFERENCE ARCHITECTURE Quantum StorNext and Cloudian HyperStore

Developing Enterprise Cloud Solutions with Azure

Executive Summary SOLE SOURCE JUSTIFICATION. Microsoft Integration

Sentinet for Microsoft Azure SENTINET

Introduction to Digital Archiving and IBM archive storage options

Interoperable Cloud Storage with the CDMI Standard. Mark Carlson, SNIA TC and Oracle Chair, SNIA Cloud Storage TWG

Elastic Cloud Storage (ECS)

CDMI Support to Object Storage in Cloud K.M. Padmavathy Wipro Technologies

SECURE, FLEXIBLE ON-PREMISE STORAGE WITH EMC SYNCPLICITY AND EMC ISILON

Unstructured Data in the Cloud with ECS

FIREFLY ARCHITECTURE: CO-BROWSING AT SCALE FOR THE ENTERPRISE

Backup Solution. User Guide. Issue 01 Date

Dell EMC CIFS-ECS Tool

Datacenter replication solution with quasardb

WINDOWS AZURE QUEUE. Table of Contents. 1 Introduction

Developing Microsoft Azure Solutions: Course Agenda

Nasuni UniFS a True Global File System

ARCHITECTING WEB APPLICATIONS FOR THE CLOUD: DESIGN PRINCIPLES AND PRACTICAL GUIDANCE FOR AWS

Cloud Archive and Long Term Preservation Challenges and Best Practices

Amazon AWS-Solution-Architect-Associate Exam

Hosted Azure for your business. Build virtual servers, deploy with flexibility, and reduce your hardware costs with a managed cloud solution.

More on Testing and Large Scale Web Apps

Rio-2 Hybrid Backup Server

Advanced iscsi Management April, 2008

Acronis Backup. Acronis, All rights reserved. Dual headquarters in Switzerland and Singapore. Dual headquarters in Switzerland and Singapore

Cloud object storage : the right way. Orit Wasserman Open Source Summit 2018

IBM Tivoli Storage Manager Version Introduction to Data Protection Solutions IBM

Document Sub Title. Yotpo. Technical Overview 07/18/ Yotpo

Introduction. The Safe-T Solution

Nasuni for Oil & Gas. The Challenge: Managing the Global Flow of File Data to Improve Time to Market and Profitability

DATA PROTECTION FOR THE CLOUD

The Latest EMC s announcements

Pentaho and Microsoft Azure

Niraj Kumar Lead Azure Architect, MCT( Microsoft Certified Trainer)

Course Outline. Lesson 2, Azure Portals, describes the two current portals that are available for managing Azure subscriptions and services.

Large-Scale Web Applications

Using the Cisco ACE Application Control Engine Application Switches with the Cisco ACE XML Gateway

Business and IT Challenges

IBM Spectrum Protect Version Introduction to Data Protection Solutions IBM

COURSE 20487B: DEVELOPING WINDOWS AZURE AND WEB SERVICES

IBM TS7700 grid solutions for business continuity

Tintri Cloud Connector

Title: Planning AWS Platform Security Assessment?

Exam : Implementing Microsoft Azure Infrastructure Solutions

Hedvig as backup target for Veeam

Azure Integration Services

Top 40 Cloud Computing Interview Questions

IBM Archiving Solution DB2 CommonStore for Lotus Domino

Cisco CloudCenter Solution with Cisco ACI: Common Use Cases

Developing Windows Azure and Web Services

SAML-Based SSO Solution

SNIA Cloud Storage TWG

ECS High Availability Design

TechTour. Winning Technology Roadmaps. Sig Knapstad. Cutting Edge

EBOOK. FROM DISASTER RECOVERY TO ACTIVE-ACTIVE: NuoDB AND MULTI-DATA CENTER DEPLOYMENTS

MODERN ARCHIVING WITH ELASTIC CLOUD STORAGE (ECS)

DISASTER RECOVERY IN AN EMC DISKXTENDER FOR WINDOWS ENVIRONMENT

WHITE PAPER Cloud FastPath: A Highly Secure Data Transfer Solution

ITP 140 Mobile Technologies. Mobile Topics

Technology Outsourcing with Cloud Computing. Understanding the Opportunity

Cloud Storage. Patrick Osborne Director of Product Management. Sam Fineberg Distinguished Technologist.

Twilio cloud communications SECURITY

WHY SECURE MULTI-TENANCY WITH DATA DOMAIN SYSTEMS?

SECURE CLOUD BACKUP AND RECOVERY

Simplifying Collaboration in the Cloud

Developing Microsoft Azure and Web Services. Course Code: 20487C; Duration: 5 days; Instructor-led

Storage Made Easy Enterprise File Share and Sync Fabric Architecture

Dell EMC Surveillance for IndigoVision Body-Worn Cameras

Develop Mobile Front Ends Using Mobile Application Framework A - 2

Simplify and Improve DB2 Administration by Leveraging Your Storage System

Developing Microsoft Azure Solutions (70-532) Syllabus

Content Addressed Storage (CAS)

How Teridion Works. A Teridion Technical Paper. A Technical Overview of the Teridion Virtual Network. Teridion Engineering March 2017

Cisco Cloud Application Centric Infrastructure

Course Outline. Developing Microsoft Azure Solutions Course 20532C: 4 days Instructor Led

Secure Messaging Large File Sharing

Backup & Recovery on AWS

Naresh Information Technologies

The Nasuni Security Model

The Three Data Challenges

DISTRIBUTED SYSTEMS [COMP9243] Lecture 8a: Cloud Computing WHAT IS CLOUD COMPUTING? 2. Slide 3. Slide 1. Why is it called Cloud?

SQL Azure. Abhay Parekh Microsoft Corporation

5 OPPORTUNITIES TO DELIVER BUSINESS VALUE WITH THE CLOUD

Cloud Computing Lectures. Cloud Security

YOUR APPLICATION S JOURNEY TO THE CLOUD. What s the best way to get cloud native capabilities for your existing applications?

A Close-up Look at Potential Future Enhancements in Tivoli Storage Manager

ECE 7650 Scalable and Secure Internet Services and Architecture ---- A Systems Perspective

Global Data Plane. The Cloud is not enough: Saving IoT from the Cloud & Toward a Global Data Infrastructure PRESENTED BY MEGHNA BAIJAL

DATA INTEGRITY TECHNIQUES IN CLOUD: AN ANALYSIS

Transcription:

Best Practices in Designing Cloud Storage based Archival solution Sreenidhi Iyangar & Jim Rice EMC Corporation

Abstract Cloud storage facilitates the use case of digital archiving for long periods of time by transparently providing scalable storage resources. With ever increasing amount of data to be preserved for legal and compliance reasons, cloud storage when designed correctly, can provide a low cost solution in a geographically distributed environment. This presentation highlights the key considerations while developing an archive product using cloud storage based on REST interface. It goes on to highlight the design choices while developing a file based archiving solution to cloud storage using EMC Atmos as an example. The aspects covered in the slides are security, performance, using vendor neutral APIs, developing portable application irrespective of the backend cloud supported, taking advantage of geographically spread cloud storage nodes, faster searches and an efficient disaster recovery mechanism

Learning Objectives Considerations while developing an archival application with cloud storage Security and performance aspects while designing an application for cloud storage Leverage cloud vendor provided capabilities in your application 3

Agenda Archiving Cloud Storage EMC Atmos Development considerations Performance Security Metadata feature Flexibility References Acknowledgements Q & A 4

Archiving SNIA - A collection of data objects, perhaps with associated metadata, in a storage system whose primary purpose is the long-term preservation and retention of that data. Source: 100 Year Archive Requirements Survey 5

Cloud Storage SNIA - Delivery over a network of appropriately configured virtual storage and related data services, based on a request for a given service level Pay for what you use No Storage devices Maintenance offloaded to service providers Image Source: www.indianweb2.com EMC Atmos Amazon WebServices Windows Azure 6

EMC Atmos EMC Atmos is globally accessible cloud storage platform to manage content in modern content-rich applications, highscale content infrastructures and cloud service provider environment. 7

REST API Stands for REpresentational State Transfer Standard programming interface for Apps integrating with Cloud Methods POST GET PUT DELETE 8

Challenges in design Key considerations in cloud storage based archival solution Performance Security Retention Searching Flexibility

Performance Improvements Issues to be considered by application Archiving application could be far away from storage; high latency Data access in reasonable amount of time Amazon S3 - Peak requests: 200,000+ per sec (source: www.datacenterknowledge.com Low cost low performance Design consideration for performance optimization in data transfer Asynchronous Transfer of Packets Load balancing Read & write from multiple nodes in round robin fashion Optimum buffer size for chunk transfer Send header and data together with small files 10

Security - Data Data access considerations Data resides somewhere else Hacks during data transfer Avoiding blocked/unavailable port numbers Provide data protection options, such as, Facility to use HTTPS for secure data transfer Based on security level of the network, option to choose HTTP or HTTPS Expose cloud storage data at-rest encryption choice to users through the application based on rules Securing REST messages with signatures (hash of the request and secret ID) Port number modification 11

Security - Access Persisting access restriction to the cloud Access from anywhere at anytime Access only to legitimate users Data protection Authenticating by signatures (combination of User ID and request header) User level authorization using ACL Pre-authenticated URL to an object 12

Search & DR Aspects to address archival needs for Fast search with high accuracy Disaster recovery Mechanism to identify objects written Expose Name = value(s) pair tagging to end user Tag acts as index for the object Fast search by querying tag/token Disaster recovery by tagging unique-id with every object written Versioning of objects (files) 13

Retention, Replication Data availability considerations Retain data >50 years or permanently (Source: 100 Year Archive Requirements Survey) No DUDL Inconsistent policies across applications Aid in data life cycle Specific value in Name = value pair activates exact policy Policies could provide various Retention Replication Striping across servers Deletion 14

Vendor Neutral Multi-Vendor issues to be considered by application Design to support multiple vendors Moving data to different vendor No feature compromise Use vendor neutral APIs Separate business logic from REST/SOAP communication A layer above commands Features exposed to user independent of vendor Language independence Vendor provided SDK

Flexibility Optimizations for accessing cloud storage objects Sub-directory specific Disaster recovery Fast reading If available, use flexible view of files to optimize access Writing files in namespace (file system hierarchy) Query under specific path Fast access by reading with object ID 16

References 1. SNIA Definitions - http://www.snia.org/education/dictionary 2. EMC Atmos Programmers Guide - https://community.emc.com/docs/doc-3481 3. 100 year archiving requirements survey http://www.snia.org/sites/default/files2/100yratf_archive-requirements- Survey_20070619.pdf 4. EMC Atmos http://www.emc.com/storage/atmos/atmos.htm#! 5. Cloud Archiving http://www.snia.org/sites/default/education/tutorials/2010/spring/cloud/paulfield_cloud_arc hiving.pdf 17

Acknowledgement We are thankful to Vinodraj Daniel (vinodraj.daniel@emc.com) - Engineering Manager, EMC Corporation for their ideas and suggestions 18

Questions? You can email queries to jim.rice@emc.com or sreenidhi.iyangar@emc.com 19