BARNS: Backup and Recovery for NoSQL Databases

Size: px
Start display at page:

Download "BARNS: Backup and Recovery for NoSQL Databases"

Transcription

1 BARNS: Backup and Recovery for NoSQL Databases Atish Kathpal, Priya Sehgal Advanced Technology Group, NetApp 1

2 Why Backup/Restore NoSQL DBs? Customers are directly ingesting into NoSQL Security breach are on the rise e.g. ransomware attacks on MongoDB [1] and recent WannaCrypt exploits Fat-finger errors eventually propagate to replicas Ransomware Sandbox deployments for test/dev Bring up shadow clusters of different cardinality (from production cluster snapshots) Compliance and regulatory requirements IDC, 2016 report [2] lists data-protection and retention as one of the top infrastructural requirements for NoSQL 2 [1] [2] Nadkarni A., Polyglot Persistence: Insights on NoSQL Adoption and the Resulting Impact on Infrastructure. IDC Feb.

3 NoSQL Database Classes From Backup/Restore Service Perspective Master-slave Authoritative copy of each partition is contained in the master node that we can backup. Loss of primary node leads to shard/partition-unavailability until new leader is elected. Example: MongoDB, Redis, Oracle NoSQL, MarkLogic Master-less Data is scattered across nodes using consistent hashing techniques, no single node has all data for a given partition Eventual consistency: Unavailability of a destination node does not lead to write-failure, data is eventually replicated Example: Cassandra, Couchbase 3

4 NoSQL DBs Hosted on Shared Storage High-level, conceptual deployment architecture Backup: Leverage storage snapshots Restore: Leverage cloning NoSQL DB software (Node1) NoSQL DB software (Node2) NoSQL DB software (Node3) Filesystem (DB Journal, Logs, Data Files) Filesystem (DB Journal, Logs, Data Files) Filesystem (DB Journal, Logs, Data Files) LUN-1 LUN-2 LUN-3 Shared Storage Array (Snapshots, Cloning, Compression, Deduplication, Encryption, Cloud Integrations) 4

5 Backup/Restore Challenges Cluster-consistency at scale Cluster/App quiesce significantly hampers application performance. Cross node consistency not guaranteed Take crash consistent snapshots Post process crash consistent snapshots (in a sand-box) using NoSQL DB stack to reach an cluster-consistent state Space Efficiency Replica set data copies do not de-duplicate small row sizes, scattered across nodes (Cassandra) and unique ids added by storage engines (MongoDB) DB performs compression and encryption Remove replicas logically (application aware backup) Topology Changes Commodity nodes, at scale of s of nodes. E.g., Primary node might be unreachable while taking backup in case of MongoDB Storage snapshots do not have context about cluster topology Use cases may require restore to a test/dev cluster of different cardinality Save Cluster topology and storage mapping as part of backup Insignificant block based deduplication Write: (K1, V1), (K3, V3) Cassandra N1 Replication factor = 2 Write: (K1, V1), (K2, V2) Cassandra N2 Shared Storage cluster LUN1: K1, K3 LUN2: K1, K2 LUN3: K2, K3 Write: (K2, V2), (K3, V3) Cassandra N3 Existing open source utilities like Mongodump and Cassandra snapshots suffer from above challenges. 5

6 NoSQL Data Protection Challenges Master-Less Databases Client, Update K1 (CL.ONE) Challenges: Fault tolerance Backup may capture stale data due to eventual consistency Higher restore times, since Cassandra will perform repairs during restore Ack Update: (K1, V11, Tnew) Cassandra N1 Update: (K1, V11, Tnew) Cassandra N2 Cassandra N3 Snapshot of LUN2 will point to stale data Shared Storage cluster LUN1: (K1, V1, Told), (K1, V11, Tnew) LUN2: (K1, V1, Told), LUN3: 6

7 BARNS Architecture Addresses challenges of: 1. Taking cluster-consistent backup at scale 2. Taking storage efficient backups (through replica removal) 3. Enables recovery/cloning to different cluster topologies Mongo APIs MongoDB Cluster Lib_MongoDB Light Weight Backup Process (LWB) Lib_Store1 APIs Lib_DB Post- Process (PP) Lib_Storage Cassandra Cluster Lib_Cassandra Restore Process Lib_Store2 Cassandra APIs APIs Stop_balancer() Get_topology() Check_cluster_health() Prepare_Backup_Topo( Etc. Get_lun_mappings() Take_snapshots() Lun_clone() Vault_backup() Etc. 7 Store1 Store2

8 BARNS Solution: Cassandra Master-less Distributed Database 8

9 Phase 1: Light-weight Backup Phase C1 Token1 Production Cluster C2 C3 Token2 Token3 C4 Token4 BARNS: LWB 1. Capture token assignment of each node 2. Store mapping of LUNs à Tokens L1 L2 L3 L4 CL1 D1 CL2 D2 CL3 D3 CL4 D4 3. Take snapshots of L1 to L4 Backup Metadata example CL1 CL2 CL3 CL4 D1 D2 D3 D4 sn1 sn2 sn3 sn4 9

10 Phase2: Post Process Phase Part1: Flush Commitlogs PP Node(s) BARNS: PP-Phase P1 P2 P3 P4 CL_sn1 CL_sn2 CL_sn3 CL_sn4 1. Clone the LWB Snapshot LUNs 2. Mount on different PP processes or different nodes 3. Start Cassandra processes P1 to P4 CL1 D1 CL2 D2 CL3 D3 CL4 D4 4. Flush CommitLogs E.g. Data K1, V1, T1 K2, V2, T1 K1,V11, T2 10

11 Phase2: Post Process Phase Part2: Compaction BARNS: PP-Phase PP Node /cassandra unionfs 1. Using UnionFS mount all snapshot clones as readonly 2. Create a new volume and LUN for full backup 3. Mount it as RW through UnionFS 4. Let PP node have all tokens of prod cluster 5. Start Cassandra 6. Start compaction process 7. Final compacted files will be stored on Fullbackup LUN Data 1 Data 2 Data 3 Data 4 K1, V1, T1 K1, V1, T1 K1, V1, T1 K3,V3, T3 K2, V2, T1 K1, V11, T2 K2, V2, T1 K2, V22, T4 K1,V11, T2 K3,V3, T3 CL_sn1 CL_sn2 CL_sn3 CL_sn4 Data Compact K1, V11, T2 K2, V22, T1 K3,V3, T3 Full_bk_lun Keeping single copy of data in the backup => ~66% reduction in backup storage requirements 11

12 Cassandra Restore The post-process step enables cloning to different restore/clone topologies Token1 Token2 N1 Token3 Token4 N2 Clones Full Backup K1, V11, T2 K2, V22, T1 Full_bk_lun K3,V3, T3 12

13 Evaluation - Cassandra Backup and Restore Full Backup LWB <10 secs pp-flush - ~40 secs pp-compact time increases by 35-40% à incremental backup Production cluster 4 nodes 4 iscsi LUNs Commitlog and SSTables for a node on same LUN Cassandra 4.0 Post Process Node 2 CPUs 8GB RAM YCSB to ingest data Restore time less than ~2 mins (irrespective of cluster size and data set size) 13

14 BARNS Solution: MongoDB Master-Slave Distributed Database Ø Check the paper or just attend the poster session J 14

15 Summary Tracking replicas and cluster topologies is important for taking backups and performing flexible topology restores Existing open-source solutions have several inefficiencies like need for repairs after restore, lack of storage efficiency in backup and poor integrations with shared storage Opportunity to provide efficient backup and restore through light-weight snapshots and clones 15

16 Thank You. 16

BARNS: Towards Building Backup and Recovery for NoSQL Databases

BARNS: Towards Building Backup and Recovery for NoSQL Databases BARNS: Towards Building Backup and Recovery for NoSQL Databases Atish Kathpal, NetApp Abstract While NoSQL databases are gaining popularity for business applications, they pose unique challenges towards

More information

Document Sub Title. Yotpo. Technical Overview 07/18/ Yotpo

Document Sub Title. Yotpo. Technical Overview 07/18/ Yotpo Document Sub Title Yotpo Technical Overview 07/18/2016 2015 Yotpo Contents Introduction... 3 Yotpo Architecture... 4 Yotpo Back Office (or B2B)... 4 Yotpo On-Site Presence... 4 Technologies... 5 Real-Time

More information

Cloud Backup and Recovery for Healthcare and ecommerce

Cloud Backup and Recovery for Healthcare and ecommerce Get Your Cloud Backup On Cloud Backup and Recovery for Healthcare and ecommerce Peter Smails, Vice President, Marketing & Business Development Shalabh Goyal, Director, Product Management October 12 th,

More information

MongoDB Architecture

MongoDB Architecture VICTORIA UNIVERSITY OF WELLINGTON Te Whare Wananga o te Upoko o te Ika a Maui MongoDB Architecture Lecturer : Dr. Pavle Mogin SWEN 432 Advanced Database Design and Implementation Advanced Database Design

More information

How to Scale MongoDB. Apr

How to Scale MongoDB. Apr How to Scale MongoDB Apr-24-2018 About me Location: Skopje, Republic of Macedonia Education: MSc, Software Engineering Experience: Lead Database Consultant (since 2016) Database Consultant (2012-2016)

More information

NoSQL Databases MongoDB vs Cassandra. Kenny Huynh, Andre Chik, Kevin Vu

NoSQL Databases MongoDB vs Cassandra. Kenny Huynh, Andre Chik, Kevin Vu NoSQL Databases MongoDB vs Cassandra Kenny Huynh, Andre Chik, Kevin Vu Introduction - Relational database model - Concept developed in 1970 - Inefficient - NoSQL - Concept introduced in 1980 - Related

More information

Finding Consistency in an Inconsistent World: Towards Deep Semantic Understanding of Scale-out Distributed Databases

Finding Consistency in an Inconsistent World: Towards Deep Semantic Understanding of Scale-out Distributed Databases USENIX HotStorage2016 Finding Consistency in an Inconsistent World: Towards Deep Semantic Understanding of Scale-out Distributed Databases Neville Carvalho, Hyojun Kim, Maohua Lu, Prasenjit Sarkar, Rohit

More information

Migrating to Cassandra in the Cloud, the Netflix Way

Migrating to Cassandra in the Cloud, the Netflix Way Migrating to Cassandra in the Cloud, the Netflix Way Jason Brown - @jasobrown Senior Software Engineer, Netflix Tech History, 1998-2008 In the beginning, there was the webapp and a single database in a

More information

Nutanix Tech Note. Virtualizing Microsoft Applications on Web-Scale Infrastructure

Nutanix Tech Note. Virtualizing Microsoft Applications on Web-Scale Infrastructure Nutanix Tech Note Virtualizing Microsoft Applications on Web-Scale Infrastructure The increase in virtualization of critical applications has brought significant attention to compute and storage infrastructure.

More information

Course Content MongoDB

Course Content MongoDB Course Content MongoDB 1. Course introduction and mongodb Essentials (basics) 2. Introduction to NoSQL databases What is NoSQL? Why NoSQL? Difference Between RDBMS and NoSQL Databases Benefits of NoSQL

More information

Hedvig as backup target for Veeam

Hedvig as backup target for Veeam Hedvig as backup target for Veeam Solution Whitepaper Version 1.0 April 2018 Table of contents Executive overview... 3 Introduction... 3 Solution components... 4 Hedvig... 4 Hedvig Virtual Disk (vdisk)...

More information

SOLUTION BRIEF Fulfill the promise of the cloud

SOLUTION BRIEF Fulfill the promise of the cloud SOLUTION BRIEF Fulfill the promise of the cloud NetApp Solutions for Amazon Web Services Fulfill the promise of the cloud NetApp Cloud Volumes Service for AWS: Move and manage more workloads faster Many

More information

A Non-Relational Storage Analysis

A Non-Relational Storage Analysis A Non-Relational Storage Analysis Cassandra & Couchbase Alexandre Fonseca, Anh Thu Vu, Peter Grman Cloud Computing - 2nd semester 2012/2013 Universitat Politècnica de Catalunya Microblogging - big data?

More information

THE COMPLETE GUIDE COUCHBASE BACKUP & RECOVERY

THE COMPLETE GUIDE COUCHBASE BACKUP & RECOVERY THE COMPLETE GUIDE COUCHBASE BACKUP & RECOVERY INTRODUCTION Driven by the need to remain competitive and differentiate themselves, organizations are undergoing digital transformations and becoming increasingly

More information

Microsoft SQL Server HA and DR with DVX

Microsoft SQL Server HA and DR with DVX Microsoft SQL Server HA and DR with DVX 385 Moffett Park Dr. Sunnyvale, CA 94089 844-478-8349 www.datrium.com Technical Report Introduction A Datrium DVX solution allows you to start small and scale out.

More information

Introduction to OpenStack Trove

Introduction to OpenStack Trove Introduction to OpenStack Trove A multi-database deployment Doug Shelley - Tesora Michael Basnight - Rackspace Agenda What is Trove How to Install Trove A look at the Trove API Multi-Database demonstration

More information

Your Complete Guide to Backup and Recovery for MongoDB

Your Complete Guide to Backup and Recovery for MongoDB Your Complete Guide to Backup and Recovery for MongoDB EBOOK Your Complete Guide to Backup and Recovery for MongoDB Table of Contents Part I: Backup and Recovery for MongoDB Part II: Customer Case Study

More information

Spotify. Scaling storage to million of users world wide. Jimmy Mårdell October 14, 2014

Spotify. Scaling storage to million of users world wide. Jimmy Mårdell October 14, 2014 Cassandra @ Spotify Scaling storage to million of users world wide! Jimmy Mårdell October 14, 2014 2 About me Jimmy Mårdell Tech Product Owner in the Cassandra team 4 years at Spotify

More information

Trends in Data Protection and Restoration Technologies. Mike Fishman, EMC 2 Corporation

Trends in Data Protection and Restoration Technologies. Mike Fishman, EMC 2 Corporation Trends in Data Protection and Restoration Technologies Mike Fishman, EMC 2 Corporation SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA unless otherwise noted. Member

More information

How do we build TiDB. a Distributed, Consistent, Scalable, SQL Database

How do we build TiDB. a Distributed, Consistent, Scalable, SQL Database How do we build TiDB a Distributed, Consistent, Scalable, SQL Database About me LiuQi ( 刘奇 ) JD / WandouLabs / PingCAP Co-founder / CEO of PingCAP Open-source hacker / Infrastructure software engineer

More information

Executive Summary SOLE SOURCE JUSTIFICATION. Microsoft Integration

Executive Summary SOLE SOURCE JUSTIFICATION. Microsoft Integration Executive Summary Commvault Simpana software delivers the unparalleled advantages and benefits of a truly holistic approach to data management. It is one product that contains individually licensable modules

More information

Hadoop An Overview. - Socrates CCDH

Hadoop An Overview. - Socrates CCDH Hadoop An Overview - Socrates CCDH What is Big Data? Volume Not Gigabyte. Terabyte, Petabyte, Exabyte, Zettabyte - Due to handheld gadgets,and HD format images and videos - In total data, 90% of them collected

More information

Opendedupe & Veritas NetBackup ARCHITECTURE OVERVIEW AND USE CASES

Opendedupe & Veritas NetBackup ARCHITECTURE OVERVIEW AND USE CASES Opendedupe & Veritas NetBackup ARCHITECTURE OVERVIEW AND USE CASES May, 2017 Contents Introduction... 2 Overview... 2 Architecture... 2 SDFS File System Service... 3 Data Writes... 3 Data Reads... 3 De-duplication

More information

The Definitive Guide to Backup and Recovery for Cassandra

The Definitive Guide to Backup and Recovery for Cassandra The Definitive Guide to Backup and Recovery for Cassandra EBOOK The Definitive Guide to Backup and Recovery for Cassandra Table of Contents Executive Summary 1. Cassandra Technology Overview 2. The Need

More information

Introduction to Database Services

Introduction to Database Services Introduction to Database Services Shaun Pearce AWS Solutions Architect 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved Today s agenda Why managed database services? A non-relational

More information

Kubernetes Integration with Virtuozzo Storage

Kubernetes Integration with Virtuozzo Storage Kubernetes Integration with Virtuozzo Storage A Technical OCTOBER, 2017 2017 Virtuozzo. All rights reserved. 1 Application Container Storage Application containers appear to be the perfect tool for supporting

More information

Cohesity Flash Protect for Pure FlashBlade: Simple, Scalable Data Protection

Cohesity Flash Protect for Pure FlashBlade: Simple, Scalable Data Protection Cohesity Flash Protect for Pure FlashBlade: Simple, Scalable Data Protection Executive overview Pure Storage FlashBlade is the industry s first all-flash, scale-out file storage purpose-built for high-performance

More information

Backup Edition Comparison OVERVIEW

Backup Edition Comparison OVERVIEW Backup 12.5 Edition Comparison Available in and editions, protects data on 21 platforms in on-premises and remote systems, in private and public clouds, and on mobile devices. With active protection from

More information

Simple Data Protection for the Cloud Era

Simple Data Protection for the Cloud Era Simple Data Protection for the Era Enterprise Data Protection, Simplified Struggling with data protection? You re not alone. Many storage admins are faced with the challenge of protecting an increasing

More information

Backup License Comparison OVERVIEW

Backup License Comparison OVERVIEW Backup 12.5 Comparison Acronis Backup 12.5 is the world s easiest and fastest backup solution, protecting data on 21 platforms, whether data is located on-premises, in remote systems, in private and public

More information

Distributed PostgreSQL with YugaByte DB

Distributed PostgreSQL with YugaByte DB Distributed PostgreSQL with YugaByte DB Karthik Ranganathan PostgresConf Silicon Valley Oct 16, 2018 1 CHECKOUT THIS REPO: github.com/yugabyte/yb-sql-workshop 2 About Us Founders Kannan Muthukkaruppan,

More information

TIBX NEXT-GENERATION ARCHIVE FORMAT IN ACRONIS BACKUP CLOUD

TIBX NEXT-GENERATION ARCHIVE FORMAT IN ACRONIS BACKUP CLOUD TIBX NEXT-GENERATION ARCHIVE FORMAT IN ACRONIS BACKUP CLOUD 1 Backup Speed and Reliability Are the Top Data Protection Mandates What are the top data protection mandates from your organization s IT leadership?

More information

Dell EMC Unity: Data Protection & Copy Data Management Options. Ryan Poulin Product Technologist Midrange & Entry Solutions Group

Dell EMC Unity: Data Protection & Copy Data Management Options. Ryan Poulin Product Technologist Midrange & Entry Solutions Group Dell EMC Unity: Data Protection & Copy Data Management Options Ryan Poulin Product Technologist Midrange & Entry Solutions Group What s New with Dell EMC Unity 650F 550F 450F 350F Optimized for All-Flash

More information

Axway API Management 7.5.x Cassandra Best practices. #axway

Axway API Management 7.5.x Cassandra Best practices. #axway Axway API Management 7.5.x Cassandra Best practices #axway Axway API Management 7.5.x Cassandra Best practices Agenda Apache Cassandra - Overview Apache Cassandra - Focus on consistency level Apache Cassandra

More information

Using Cohesity with Amazon Web Services (AWS)

Using Cohesity with Amazon Web Services (AWS) Using Cohesity with Amazon Web Services (AWS) Achieve your long-term retention and archival objectives for secondary data Cohesity DataPlatform is a hyperconverged secondary data and application solution

More information

Backup and Recovery Best Practices With Tintri VMstore

Backup and Recovery Best Practices With Tintri VMstore Backup and Recovery Best Practices With Tintri VMstore Backup and Recovery Best Practices with Tintri VMstore TECHNICAL BEST PRACTICES PAPER, Revision 1.0, April 10, 2014 Contents Contents Introduction

More information

MongoDB Backup & Recovery Field Guide

MongoDB Backup & Recovery Field Guide MongoDB Backup & Recovery Field Guide Tim Vaillancourt Percona Speaker Name `whoami` { name: tim, lastname: vaillancourt, employer: percona, techs: [ mongodb, mysql, cassandra, redis, rabbitmq, solr, mesos

More information

Making Non-Distributed Databases, Distributed. Ioannis Papapanagiotou, PhD Shailesh Birari

Making Non-Distributed Databases, Distributed. Ioannis Papapanagiotou, PhD Shailesh Birari Making Non-Distributed Databases, Distributed Ioannis Papapanagiotou, PhD Shailesh Birari Dynomite Ecosystem Dynomite - Proxy layer Dyno - Client Dynomite-manager - Ecosystem orchestrator Dynomite-explorer

More information

Cohesity DataPlatform Protecting Individual MS SQL Databases Solution Guide

Cohesity DataPlatform Protecting Individual MS SQL Databases Solution Guide Cohesity DataPlatform Protecting Individual MS SQL Databases Solution Guide Abstract This solution guide outlines the workflow for creating backups with Microsoft SQL Server databases and Cohesity Data

More information

OpenStack Trove and DBaaS: Impedance Match?

OpenStack Trove and DBaaS: Impedance Match? OpenStack Trove and DBaaS: Impedance Match? June 11, 2015 2014 EnterpriseDB Corporation. All rights reserved. 1 Introduction Fred Dalrymple EDB, product manager, Postgres Plus Cloud Database Representing

More information

The Definitive Guide to MongoDB Backup and Recovery

The Definitive Guide to MongoDB Backup and Recovery The Definitive Guide to MongoDB Backup and Recovery EBOOK The Definitive Guide to MongoDB Backup and Recovery Table of Contents Executive Summary 1. MongoDB Technology Overview 2. The Need for Backup &

More information

Introducing RecoverX 2.5

Introducing RecoverX 2.5 Backup & Recovery for Modern Applications Introducing RecoverX 2.5 Shalabh Goyal, Director, Product Management Kedar Hiremath, Product Marketing Manager November 16 th, 2017 What We Will Cover Today What

More information

The course modules of MongoDB developer and administrator online certification training:

The course modules of MongoDB developer and administrator online certification training: The course modules of MongoDB developer and administrator online certification training: 1 An Overview of the Course Introduction to the course Table of Contents Course Objectives Course Overview Value

More information

Tintri & Veeam VM Backup & Replication Best Practices. John Phillips Strategic Alliances and Technical Marketing Ryan Post Systems Engineer

Tintri & Veeam VM Backup & Replication Best Practices. John Phillips Strategic Alliances and Technical Marketing Ryan Post Systems Engineer Tintri & Veeam VM Backup & Replication Best Practices John Phillips Strategic Alliances and Technical Marketing Ryan Post Systems Engineer 1 VM-aware Storage from Tintri Stores VMs and vdisks (only!) No

More information

THE COMPLETE GUIDE HADOOP BACKUP & RECOVERY

THE COMPLETE GUIDE HADOOP BACKUP & RECOVERY THE COMPLETE GUIDE HADOOP BACKUP & RECOVERY INTRODUCTION Driven by the need to remain competitive and differentiate themselves, organizations are undergoing digital transformations and becoming increasingly

More information

NoSQL BENCHMARKING AND TUNING. Nachiket Kate Santosh Kangane Ankit Lakhotia Persistent Systems Ltd. Pune, India

NoSQL BENCHMARKING AND TUNING. Nachiket Kate Santosh Kangane Ankit Lakhotia Persistent Systems Ltd. Pune, India NoSQL BENCHMARKING AND TUNING Nachiket Kate Santosh Kangane Ankit Lakhotia Persistent Systems Ltd. Pune, India Today large variety of available NoSQL options has made it difficult for developers to choose

More information

Distributed System. Gang Wu. Spring,2018

Distributed System. Gang Wu. Spring,2018 Distributed System Gang Wu Spring,2018 Lecture7:DFS What is DFS? A method of storing and accessing files base in a client/server architecture. A distributed file system is a client/server-based application

More information

Acronis Backup 12.5 License Comparison incl. cloud deployment functionality

Acronis Backup 12.5 License Comparison incl. cloud deployment functionality DATA SHEET ACRONIS BACKUP LICENSING COMPARISON Acronis Backup 12.5 License Comparison incl. cloud deployment functionality 1 VIRTUAL HOST OFFICE 365 VIRTUAL HOST OFFICE 365 Storage disks, NAS, SAN, Acronis

More information

Protecting Hyper-V Environments

Protecting Hyper-V Environments TECHNICAL WHITE PAPER: BACKUP EXEC TM 2014 PROTECTING HYPER-V ENVIRONMENTS Backup Exec TM 2014 Technical White Paper Protecting Hyper-V Environments Technical White Papers are designed to introduce Symantec

More information

Adaptation in distributed NoSQL data stores

Adaptation in distributed NoSQL data stores Adaptation in distributed NoSQL data stores Kostas Magoutis Department of Computer Science and Engineering University of Ioannina, Greece Institute of Computer Science (ICS) Foundation for Research and

More information

ApsaraDB for Redis. Product Introduction

ApsaraDB for Redis. Product Introduction ApsaraDB for Redis is compatible with open-source Redis protocol standards and provides persistent memory database services. Based on its high-reliability dual-machine hot standby architecture and seamlessly

More information

Data Protection Guide

Data Protection Guide SnapCenter Software 4.0 Data Protection Guide For Custom Plug-ins March 2018 215-12932_C0 doccomments@netapp.com Table of Contents 3 Contents Deciding on whether to read the SnapCenter Data Protection

More information

Construct a High Efficiency VM Disaster Recovery Solution. Best choice for protecting virtual environments

Construct a High Efficiency VM Disaster Recovery Solution. Best choice for protecting virtual environments Construct a High Efficiency VM Disaster Recovery Solution Best choice for protecting virtual environments About NAKIVO Established in the USA since 2012 Provides data protection solutions for VMware, Hyper-V

More information

Architecture of a Real-Time Operational DBMS

Architecture of a Real-Time Operational DBMS Architecture of a Real-Time Operational DBMS Srini V. Srinivasan Founder, Chief Development Officer Aerospike CMG India Keynote Thane December 3, 2016 [ CMGI Keynote, Thane, India. 2016 Aerospike Inc.

More information

EMC Virtual Infrastructure for Microsoft Exchange 2007

EMC Virtual Infrastructure for Microsoft Exchange 2007 EMC Virtual Infrastructure for Microsoft Exchange 2007 Enabled by EMC Replication Manager, EMC CLARiiON AX4-5, and iscsi Reference Architecture EMC Global Solutions 42 South Street Hopkinton, MA 01748-9103

More information

Deploying Software Defined Storage for the Enterprise with Ceph. PRESENTATION TITLE GOES HERE Paul von Stamwitz Fujitsu

Deploying Software Defined Storage for the Enterprise with Ceph. PRESENTATION TITLE GOES HERE Paul von Stamwitz Fujitsu Deploying Software Defined Storage for the Enterprise with Ceph PRESENTATION TITLE GOES HERE Paul von Stamwitz Fujitsu Agenda Yet another attempt to define SDS Quick Overview of Ceph from a SDS perspective

More information

SnapCenter Software 4.0 Concepts Guide

SnapCenter Software 4.0 Concepts Guide SnapCenter Software 4.0 Concepts Guide May 2018 215-12925_D0 doccomments@netapp.com Table of Contents 3 Contents Deciding whether to use the Concepts Guide... 7 SnapCenter overview... 8 SnapCenter architecture...

More information

Copyright 2010 EMC Corporation. Do not Copy - All Rights Reserved.

Copyright 2010 EMC Corporation. Do not Copy - All Rights Reserved. 1 Using patented high-speed inline deduplication technology, Data Domain systems identify redundant data as they are being stored, creating a storage foot print that is 10X 30X smaller on average than

More information

Nutanix White Paper. Hyper-Converged Infrastructure for Enterprise Applications. Version 1.0 March Enterprise Applications on Nutanix

Nutanix White Paper. Hyper-Converged Infrastructure for Enterprise Applications. Version 1.0 March Enterprise Applications on Nutanix Nutanix White Paper Hyper-Converged Infrastructure for Enterprise Applications Version 1.0 March 2015 1 The Journey to Hyper-Converged Infrastructure The combination of hyper-convergence and web-scale

More information

Availability for the Modern Data Center on FlexPod Introduction NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only

Availability for the Modern Data Center on FlexPod Introduction NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only Availability for the Modern Data Center on FlexPod Introduction 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only Abstract Veeam Availability Suite v8 leverages NetApp storage

More information

Virtual Server Agent for VMware VMware VADP Virtualization Architecture

Virtual Server Agent for VMware VMware VADP Virtualization Architecture Virtual Server Agent for VMware VMware VADP Virtualization Architecture Published On: 11/19/2013 V10 Service Pack 4A Page 1 of 18 VMware VADP Virtualization Architecture - Virtual Server Agent for VMware

More information

Distributed File Systems II

Distributed File Systems II Distributed File Systems II To do q Very-large scale: Google FS, Hadoop FS, BigTable q Next time: Naming things GFS A radically new environment NFS, etc. Independence Small Scale Variety of workloads Cooperation

More information

Trends in Data Protection and Restoration Technologies. Jason Iehl, NetApp

Trends in Data Protection and Restoration Technologies. Jason Iehl, NetApp Trends in Data Protection and Restoration Technologies Jason Iehl, NetApp SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA unless otherwise noted. Member companies and

More information

Google File System, Replication. Amin Vahdat CSE 123b May 23, 2006

Google File System, Replication. Amin Vahdat CSE 123b May 23, 2006 Google File System, Replication Amin Vahdat CSE 123b May 23, 2006 Annoucements Third assignment available today Due date June 9, 5 pm Final exam, June 14, 11:30-2:30 Google File System (thanks to Mahesh

More information

StorageCraft OneXafe and Veeam 9.5

StorageCraft OneXafe and Veeam 9.5 TECHNICAL DEPLOYMENT GUIDE NOV 2018 StorageCraft OneXafe and Veeam 9.5 Expert Deployment Guide Overview StorageCraft, with its scale-out storage solution OneXafe, compliments Veeam to create a differentiated

More information

Scylla Open Source 3.0

Scylla Open Source 3.0 SCYLLADB PRODUCT OVERVIEW Scylla Open Source 3.0 Scylla is an open source NoSQL database that offers the horizontal scale-out and fault-tolerance of Apache Cassandra, but delivers 10X the throughput and

More information

Integrated Data Protection

Integrated Data Protection Integrated Data Protection Subra Mohan Director, Product Management SnapCenter and Data Protection Solutions 1 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only Agenda NetApp Data

More information

The Data Protection Rule and Hybrid Cloud Backup

The Data Protection Rule and Hybrid Cloud Backup The 3-2-1 Data Protection Rule and Hybrid Cloud Backup IT teams are under extreme pressure to improve backup, disaster recovery and data protection to eliminate downtime and facilitate digital transformation.

More information

MongoDB and Mysql: Which one is a better fit for me? Room 204-2:20PM-3:10PM

MongoDB and Mysql: Which one is a better fit for me? Room 204-2:20PM-3:10PM MongoDB and Mysql: Which one is a better fit for me? Room 204-2:20PM-3:10PM About us Adamo Tonete MongoDB Support Engineer Agustín Gallego MySQL Support Engineer Agenda What are MongoDB and MySQL; NoSQL

More information

Data Protection Modernization: Meeting the Challenges of a Changing IT Landscape

Data Protection Modernization: Meeting the Challenges of a Changing IT Landscape Data Protection Modernization: Meeting the Challenges of a Changing IT Landscape Tom Clark IBM Distinguished Engineer, Chief Architect Software 1 Data growth is continuing to explode Sensors & Devices

More information

The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into

The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material,

More information

Evaluating Cloud Storage Strategies. James Bottomley; CTO, Server Virtualization

Evaluating Cloud Storage Strategies. James Bottomley; CTO, Server Virtualization Evaluating Cloud Storage Strategies James Bottomley; CTO, Server Virtualization Introduction to Storage Attachments: - Local (Direct cheap) SAS, SATA - Remote (SAN, NAS expensive) FC net Types - Block

More information

Cohesity Architecture White Paper. Building a Modern, Web-Scale Architecture for Consolidating Secondary Storage

Cohesity Architecture White Paper. Building a Modern, Web-Scale Architecture for Consolidating Secondary Storage Architecture White Paper Building a Modern, Web-Scale Architecture for Consolidating Secondary Storage The Band-Aid Effect : The Hidden Cost of Trying to Make Legacy Solutions Work The combination of explosive

More information

IBM Spectrum Protect Version Introduction to Data Protection Solutions IBM

IBM Spectrum Protect Version Introduction to Data Protection Solutions IBM IBM Spectrum Protect Version 8.1.2 Introduction to Data Protection Solutions IBM IBM Spectrum Protect Version 8.1.2 Introduction to Data Protection Solutions IBM Note: Before you use this information

More information

Server Fault Protection with NetApp Data ONTAP Edge-T

Server Fault Protection with NetApp Data ONTAP Edge-T Technical Report Server Fault Protection with NetApp Data ONTAP Edge-T Jeff Whitaker, NetApp March 2013 TR-4154 TABLE OF CONTENTS 1 Introduction... 3 2 Backup and Disaster Recovery Technology... 4 2.1

More information

Understanding Virtual System Data Protection

Understanding Virtual System Data Protection Understanding Virtual System Data Protection Server virtualization is the most important new technology introduced in the data center in the past decade. It has changed the way we think about computing

More information

XtremIO Business Continuity & Disaster Recovery. Aharon Blitzer & Marco Abela XtremIO Product Management

XtremIO Business Continuity & Disaster Recovery. Aharon Blitzer & Marco Abela XtremIO Product Management XtremIO Business Continuity & Disaster Recovery Aharon Blitzer & Marco Abela XtremIO Product Management Agenda XtremIO Current BC/DR Offerings New BC/DR Offering Benefits of New Offering Technical Overview

More information

How Microsoft Built MySQL, PostgreSQL and MariaDB for the Cloud. Santa Clara, California April 23th 25th, 2018

How Microsoft Built MySQL, PostgreSQL and MariaDB for the Cloud. Santa Clara, California April 23th 25th, 2018 How Microsoft Built MySQL, PostgreSQL and MariaDB for the Cloud Santa Clara, California April 23th 25th, 2018 Azure Data Service Architecture Share Cluster with SQL DB Azure Infrastructure Services Azure

More information

EBOOK. NetApp ONTAP Cloud FOR MICROSOFT AZURE ENTERPRISE DATA MANAGEMENT IN THE CLOUD

EBOOK. NetApp ONTAP Cloud FOR MICROSOFT AZURE ENTERPRISE DATA MANAGEMENT IN THE CLOUD EBOOK NetApp ONTAP Cloud FOR MICROSOFT AZURE ENTERPRISE DATA MANAGEMENT IN THE CLOUD NetApp ONTAP Cloud for Microsoft Azure The ONTAP Cloud Advantage 3 Enterprise-Class Data Management 5 How ONTAP Cloud

More information

Protecting Miscrosoft Hyper-V Environments

Protecting Miscrosoft Hyper-V Environments Protecting Miscrosoft Hyper-V Environments Who should read this paper Technical White Papers are designed to introduce Veritas partners and end users to key technologies and technical concepts that are

More information

Cassandra - A Decentralized Structured Storage System. Avinash Lakshman and Prashant Malik Facebook

Cassandra - A Decentralized Structured Storage System. Avinash Lakshman and Prashant Malik Facebook Cassandra - A Decentralized Structured Storage System Avinash Lakshman and Prashant Malik Facebook Agenda Outline Data Model System Architecture Implementation Experiments Outline Extension of Bigtable

More information

Move Amazon RDS MySQL Databases to Amazon VPC using Amazon EC2 ClassicLink and Read Replicas

Move Amazon RDS MySQL Databases to Amazon VPC using Amazon EC2 ClassicLink and Read Replicas Move Amazon RDS MySQL Databases to Amazon VPC using Amazon EC2 ClassicLink and Read Replicas July 2017 2017, Amazon Web Services, Inc. or its affiliates. All rights reserved. Notices This document is provided

More information

A. Deduplication rate is less than expected, accounting for the remaining GSAN capacity

A. Deduplication rate is less than expected, accounting for the remaining GSAN capacity Volume: 326 Questions Question No: 1 An EMC Avamar customer s Gen-1 system with 4 TB of GSAN capacity has reached read-only threshold. The customer indicates that the deduplicated backup data accounts

More information

Copyright 2012 EMC Corporation. All rights reserved.

Copyright 2012 EMC Corporation. All rights reserved. 1 BACKUP BUILT FOR VMWARE Mark Twomey Technical Director, The Office Of The CTO 2 Agenda Market Forces Optimized VMware Backup Backup And Recovery For VCE Vblock Protecting vcloud Director Customer Success

More information

New Oracle NoSQL Database APIs that Speed Insertion and Retrieval

New Oracle NoSQL Database APIs that Speed Insertion and Retrieval New Oracle NoSQL Database APIs that Speed Insertion and Retrieval O R A C L E W H I T E P A P E R F E B R U A R Y 2 0 1 6 1 NEW ORACLE NoSQL DATABASE APIs that SPEED INSERTION AND RETRIEVAL Introduction

More information

MySQL Cluster Ed 2. Duration: 4 Days

MySQL Cluster Ed 2. Duration: 4 Days Oracle University Contact Us: +65 6501 2328 MySQL Cluster Ed 2 Duration: 4 Days What you will learn This MySQL Cluster training teaches you how to install and configure a real-time database cluster at

More information

NoSQL systems. Lecture 21 (optional) Instructor: Sudeepa Roy. CompSci 516 Data Intensive Computing Systems

NoSQL systems. Lecture 21 (optional) Instructor: Sudeepa Roy. CompSci 516 Data Intensive Computing Systems CompSci 516 Data Intensive Computing Systems Lecture 21 (optional) NoSQL systems Instructor: Sudeepa Roy Duke CS, Spring 2016 CompSci 516: Data Intensive Computing Systems 1 Key- Value Stores Duke CS,

More information

GFS: The Google File System. Dr. Yingwu Zhu

GFS: The Google File System. Dr. Yingwu Zhu GFS: The Google File System Dr. Yingwu Zhu Motivating Application: Google Crawl the whole web Store it all on one big disk Process users searches on one big CPU More storage, CPU required than one PC can

More information

5 Fundamental Strategies for Building a Data-centered Data Center

5 Fundamental Strategies for Building a Data-centered Data Center 5 Fundamental Strategies for Building a Data-centered Data Center June 3, 2014 Ken Krupa, Chief Field Architect Gary Vidal, Solutions Specialist Last generation Reference Data Unstructured OLTP Warehouse

More information

SnapCenter Software 2.0 Installation and Setup Guide

SnapCenter Software 2.0 Installation and Setup Guide SnapCenter Software 2.0 Installation and Setup Guide July 2017 215-11357_C0 doccomments@netapp.com Table of Contents 3 Contents Deciding whether to read this information... 7 SnapCenter overview... 8

More information

EMC RecoverPoint. EMC RecoverPoint Support

EMC RecoverPoint. EMC RecoverPoint Support Support, page 1 Adding an Account, page 2 RecoverPoint Appliance Clusters, page 3 Replication Through Consistency Groups, page 4 Group Sets, page 22 System Tasks, page 24 Support protects storage array

More information

Master Services Agreement:

Master Services Agreement: This Service Schedule for Hosted Backup Services v8.0.0 (the Service ) marketed as RecoveryVault replaces all previously signed / incorporated version(s) of the Service Schedule(s) for Hosted Backup Services

More information

CS 655 Advanced Topics in Distributed Systems

CS 655 Advanced Topics in Distributed Systems Presented by : Walid Budgaga CS 655 Advanced Topics in Distributed Systems Computer Science Department Colorado State University 1 Outline Problem Solution Approaches Comparison Conclusion 2 Problem 3

More information

INTRODUCTION TO XTREMIO METADATA-AWARE REPLICATION

INTRODUCTION TO XTREMIO METADATA-AWARE REPLICATION Installing and Configuring the DM-MPIO WHITE PAPER INTRODUCTION TO XTREMIO METADATA-AWARE REPLICATION Abstract This white paper introduces XtremIO replication on X2 platforms. XtremIO replication leverages

More information

Google File System. Arun Sundaram Operating Systems

Google File System. Arun Sundaram Operating Systems Arun Sundaram Operating Systems 1 Assumptions GFS built with commodity hardware GFS stores a modest number of large files A few million files, each typically 100MB or larger (Multi-GB files are common)

More information

Modernize Your Backup and DR Using Actifio in AWS

Modernize Your Backup and DR Using Actifio in AWS FOR AWS Modernize Your Backup and DR Using Actifio in AWS 150105H FOR AWS Modernize Your Backup and DR Using Actifio in AWS What is Actifio? Actifio virtualizes the data that s the lifeblood of business.

More information

EMC Data Protection for Microsoft

EMC Data Protection for Microsoft EMC Data Protection for Microsoft Featuring Industry Perspectives from IDC 7 November 2013 Ashish Nadkarni, IDC Research Director, Storage Systems @Ashish_Nadkarni Phil George, EMC Backup Recovery Systems

More information

White paper ETERNUS CS800 Data Deduplication Background

White paper ETERNUS CS800 Data Deduplication Background White paper ETERNUS CS800 - Data Deduplication Background This paper describes the process of Data Deduplication inside of ETERNUS CS800 in detail. The target group consists of presales, administrators,

More information

NOSQL DATABASE SYSTEMS: DECISION GUIDANCE AND TRENDS. Big Data Technologies: NoSQL DBMS (Decision Guidance) - SoSe

NOSQL DATABASE SYSTEMS: DECISION GUIDANCE AND TRENDS. Big Data Technologies: NoSQL DBMS (Decision Guidance) - SoSe NOSQL DATABASE SYSTEMS: DECISION GUIDANCE AND TRENDS h_da Prof. Dr. Uta Störl Big Data Technologies: NoSQL DBMS (Decision Guidance) - SoSe 2017 163 Performance / Benchmarks Traditional database benchmarks

More information

Asigra Cloud Backup Provides Comprehensive Virtual Machine Data Protection Including Replication

Asigra Cloud Backup Provides Comprehensive Virtual Machine Data Protection Including Replication Datasheet Asigra Cloud Backup Provides Comprehensive Virtual Machine Data Protection Including Replication Virtual Machines (VMs) have become a staple of the modern enterprise data center, but as the usage

More information