BARNS: Backup and Recovery for NoSQL Databases
|
|
- Myrtle Banks
- 6 years ago
- Views:
Transcription
1 BARNS: Backup and Recovery for NoSQL Databases Atish Kathpal, Priya Sehgal Advanced Technology Group, NetApp 1
2 Why Backup/Restore NoSQL DBs? Customers are directly ingesting into NoSQL Security breach are on the rise e.g. ransomware attacks on MongoDB [1] and recent WannaCrypt exploits Fat-finger errors eventually propagate to replicas Ransomware Sandbox deployments for test/dev Bring up shadow clusters of different cardinality (from production cluster snapshots) Compliance and regulatory requirements IDC, 2016 report [2] lists data-protection and retention as one of the top infrastructural requirements for NoSQL 2 [1] [2] Nadkarni A., Polyglot Persistence: Insights on NoSQL Adoption and the Resulting Impact on Infrastructure. IDC Feb.
3 NoSQL Database Classes From Backup/Restore Service Perspective Master-slave Authoritative copy of each partition is contained in the master node that we can backup. Loss of primary node leads to shard/partition-unavailability until new leader is elected. Example: MongoDB, Redis, Oracle NoSQL, MarkLogic Master-less Data is scattered across nodes using consistent hashing techniques, no single node has all data for a given partition Eventual consistency: Unavailability of a destination node does not lead to write-failure, data is eventually replicated Example: Cassandra, Couchbase 3
4 NoSQL DBs Hosted on Shared Storage High-level, conceptual deployment architecture Backup: Leverage storage snapshots Restore: Leverage cloning NoSQL DB software (Node1) NoSQL DB software (Node2) NoSQL DB software (Node3) Filesystem (DB Journal, Logs, Data Files) Filesystem (DB Journal, Logs, Data Files) Filesystem (DB Journal, Logs, Data Files) LUN-1 LUN-2 LUN-3 Shared Storage Array (Snapshots, Cloning, Compression, Deduplication, Encryption, Cloud Integrations) 4
5 Backup/Restore Challenges Cluster-consistency at scale Cluster/App quiesce significantly hampers application performance. Cross node consistency not guaranteed Take crash consistent snapshots Post process crash consistent snapshots (in a sand-box) using NoSQL DB stack to reach an cluster-consistent state Space Efficiency Replica set data copies do not de-duplicate small row sizes, scattered across nodes (Cassandra) and unique ids added by storage engines (MongoDB) DB performs compression and encryption Remove replicas logically (application aware backup) Topology Changes Commodity nodes, at scale of s of nodes. E.g., Primary node might be unreachable while taking backup in case of MongoDB Storage snapshots do not have context about cluster topology Use cases may require restore to a test/dev cluster of different cardinality Save Cluster topology and storage mapping as part of backup Insignificant block based deduplication Write: (K1, V1), (K3, V3) Cassandra N1 Replication factor = 2 Write: (K1, V1), (K2, V2) Cassandra N2 Shared Storage cluster LUN1: K1, K3 LUN2: K1, K2 LUN3: K2, K3 Write: (K2, V2), (K3, V3) Cassandra N3 Existing open source utilities like Mongodump and Cassandra snapshots suffer from above challenges. 5
6 NoSQL Data Protection Challenges Master-Less Databases Client, Update K1 (CL.ONE) Challenges: Fault tolerance Backup may capture stale data due to eventual consistency Higher restore times, since Cassandra will perform repairs during restore Ack Update: (K1, V11, Tnew) Cassandra N1 Update: (K1, V11, Tnew) Cassandra N2 Cassandra N3 Snapshot of LUN2 will point to stale data Shared Storage cluster LUN1: (K1, V1, Told), (K1, V11, Tnew) LUN2: (K1, V1, Told), LUN3: 6
7 BARNS Architecture Addresses challenges of: 1. Taking cluster-consistent backup at scale 2. Taking storage efficient backups (through replica removal) 3. Enables recovery/cloning to different cluster topologies Mongo APIs MongoDB Cluster Lib_MongoDB Light Weight Backup Process (LWB) Lib_Store1 APIs Lib_DB Post- Process (PP) Lib_Storage Cassandra Cluster Lib_Cassandra Restore Process Lib_Store2 Cassandra APIs APIs Stop_balancer() Get_topology() Check_cluster_health() Prepare_Backup_Topo( Etc. Get_lun_mappings() Take_snapshots() Lun_clone() Vault_backup() Etc. 7 Store1 Store2
8 BARNS Solution: Cassandra Master-less Distributed Database 8
9 Phase 1: Light-weight Backup Phase C1 Token1 Production Cluster C2 C3 Token2 Token3 C4 Token4 BARNS: LWB 1. Capture token assignment of each node 2. Store mapping of LUNs à Tokens L1 L2 L3 L4 CL1 D1 CL2 D2 CL3 D3 CL4 D4 3. Take snapshots of L1 to L4 Backup Metadata example CL1 CL2 CL3 CL4 D1 D2 D3 D4 sn1 sn2 sn3 sn4 9
10 Phase2: Post Process Phase Part1: Flush Commitlogs PP Node(s) BARNS: PP-Phase P1 P2 P3 P4 CL_sn1 CL_sn2 CL_sn3 CL_sn4 1. Clone the LWB Snapshot LUNs 2. Mount on different PP processes or different nodes 3. Start Cassandra processes P1 to P4 CL1 D1 CL2 D2 CL3 D3 CL4 D4 4. Flush CommitLogs E.g. Data K1, V1, T1 K2, V2, T1 K1,V11, T2 10
11 Phase2: Post Process Phase Part2: Compaction BARNS: PP-Phase PP Node /cassandra unionfs 1. Using UnionFS mount all snapshot clones as readonly 2. Create a new volume and LUN for full backup 3. Mount it as RW through UnionFS 4. Let PP node have all tokens of prod cluster 5. Start Cassandra 6. Start compaction process 7. Final compacted files will be stored on Fullbackup LUN Data 1 Data 2 Data 3 Data 4 K1, V1, T1 K1, V1, T1 K1, V1, T1 K3,V3, T3 K2, V2, T1 K1, V11, T2 K2, V2, T1 K2, V22, T4 K1,V11, T2 K3,V3, T3 CL_sn1 CL_sn2 CL_sn3 CL_sn4 Data Compact K1, V11, T2 K2, V22, T1 K3,V3, T3 Full_bk_lun Keeping single copy of data in the backup => ~66% reduction in backup storage requirements 11
12 Cassandra Restore The post-process step enables cloning to different restore/clone topologies Token1 Token2 N1 Token3 Token4 N2 Clones Full Backup K1, V11, T2 K2, V22, T1 Full_bk_lun K3,V3, T3 12
13 Evaluation - Cassandra Backup and Restore Full Backup LWB <10 secs pp-flush - ~40 secs pp-compact time increases by 35-40% à incremental backup Production cluster 4 nodes 4 iscsi LUNs Commitlog and SSTables for a node on same LUN Cassandra 4.0 Post Process Node 2 CPUs 8GB RAM YCSB to ingest data Restore time less than ~2 mins (irrespective of cluster size and data set size) 13
14 BARNS Solution: MongoDB Master-Slave Distributed Database Ø Check the paper or just attend the poster session J 14
15 Summary Tracking replicas and cluster topologies is important for taking backups and performing flexible topology restores Existing open-source solutions have several inefficiencies like need for repairs after restore, lack of storage efficiency in backup and poor integrations with shared storage Opportunity to provide efficient backup and restore through light-weight snapshots and clones 15
16 Thank You. 16
BARNS: Towards Building Backup and Recovery for NoSQL Databases
BARNS: Towards Building Backup and Recovery for NoSQL Databases Atish Kathpal, NetApp Abstract While NoSQL databases are gaining popularity for business applications, they pose unique challenges towards
More informationDocument Sub Title. Yotpo. Technical Overview 07/18/ Yotpo
Document Sub Title Yotpo Technical Overview 07/18/2016 2015 Yotpo Contents Introduction... 3 Yotpo Architecture... 4 Yotpo Back Office (or B2B)... 4 Yotpo On-Site Presence... 4 Technologies... 5 Real-Time
More informationCloud Backup and Recovery for Healthcare and ecommerce
Get Your Cloud Backup On Cloud Backup and Recovery for Healthcare and ecommerce Peter Smails, Vice President, Marketing & Business Development Shalabh Goyal, Director, Product Management October 12 th,
More informationMongoDB Architecture
VICTORIA UNIVERSITY OF WELLINGTON Te Whare Wananga o te Upoko o te Ika a Maui MongoDB Architecture Lecturer : Dr. Pavle Mogin SWEN 432 Advanced Database Design and Implementation Advanced Database Design
More informationHow to Scale MongoDB. Apr
How to Scale MongoDB Apr-24-2018 About me Location: Skopje, Republic of Macedonia Education: MSc, Software Engineering Experience: Lead Database Consultant (since 2016) Database Consultant (2012-2016)
More informationNoSQL Databases MongoDB vs Cassandra. Kenny Huynh, Andre Chik, Kevin Vu
NoSQL Databases MongoDB vs Cassandra Kenny Huynh, Andre Chik, Kevin Vu Introduction - Relational database model - Concept developed in 1970 - Inefficient - NoSQL - Concept introduced in 1980 - Related
More informationFinding Consistency in an Inconsistent World: Towards Deep Semantic Understanding of Scale-out Distributed Databases
USENIX HotStorage2016 Finding Consistency in an Inconsistent World: Towards Deep Semantic Understanding of Scale-out Distributed Databases Neville Carvalho, Hyojun Kim, Maohua Lu, Prasenjit Sarkar, Rohit
More informationMigrating to Cassandra in the Cloud, the Netflix Way
Migrating to Cassandra in the Cloud, the Netflix Way Jason Brown - @jasobrown Senior Software Engineer, Netflix Tech History, 1998-2008 In the beginning, there was the webapp and a single database in a
More informationNutanix Tech Note. Virtualizing Microsoft Applications on Web-Scale Infrastructure
Nutanix Tech Note Virtualizing Microsoft Applications on Web-Scale Infrastructure The increase in virtualization of critical applications has brought significant attention to compute and storage infrastructure.
More informationCourse Content MongoDB
Course Content MongoDB 1. Course introduction and mongodb Essentials (basics) 2. Introduction to NoSQL databases What is NoSQL? Why NoSQL? Difference Between RDBMS and NoSQL Databases Benefits of NoSQL
More informationHedvig as backup target for Veeam
Hedvig as backup target for Veeam Solution Whitepaper Version 1.0 April 2018 Table of contents Executive overview... 3 Introduction... 3 Solution components... 4 Hedvig... 4 Hedvig Virtual Disk (vdisk)...
More informationSOLUTION BRIEF Fulfill the promise of the cloud
SOLUTION BRIEF Fulfill the promise of the cloud NetApp Solutions for Amazon Web Services Fulfill the promise of the cloud NetApp Cloud Volumes Service for AWS: Move and manage more workloads faster Many
More informationA Non-Relational Storage Analysis
A Non-Relational Storage Analysis Cassandra & Couchbase Alexandre Fonseca, Anh Thu Vu, Peter Grman Cloud Computing - 2nd semester 2012/2013 Universitat Politècnica de Catalunya Microblogging - big data?
More informationTHE COMPLETE GUIDE COUCHBASE BACKUP & RECOVERY
THE COMPLETE GUIDE COUCHBASE BACKUP & RECOVERY INTRODUCTION Driven by the need to remain competitive and differentiate themselves, organizations are undergoing digital transformations and becoming increasingly
More informationMicrosoft SQL Server HA and DR with DVX
Microsoft SQL Server HA and DR with DVX 385 Moffett Park Dr. Sunnyvale, CA 94089 844-478-8349 www.datrium.com Technical Report Introduction A Datrium DVX solution allows you to start small and scale out.
More informationIntroduction to OpenStack Trove
Introduction to OpenStack Trove A multi-database deployment Doug Shelley - Tesora Michael Basnight - Rackspace Agenda What is Trove How to Install Trove A look at the Trove API Multi-Database demonstration
More informationYour Complete Guide to Backup and Recovery for MongoDB
Your Complete Guide to Backup and Recovery for MongoDB EBOOK Your Complete Guide to Backup and Recovery for MongoDB Table of Contents Part I: Backup and Recovery for MongoDB Part II: Customer Case Study
More informationSpotify. Scaling storage to million of users world wide. Jimmy Mårdell October 14, 2014
Cassandra @ Spotify Scaling storage to million of users world wide! Jimmy Mårdell October 14, 2014 2 About me Jimmy Mårdell Tech Product Owner in the Cassandra team 4 years at Spotify
More informationTrends in Data Protection and Restoration Technologies. Mike Fishman, EMC 2 Corporation
Trends in Data Protection and Restoration Technologies Mike Fishman, EMC 2 Corporation SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA unless otherwise noted. Member
More informationHow do we build TiDB. a Distributed, Consistent, Scalable, SQL Database
How do we build TiDB a Distributed, Consistent, Scalable, SQL Database About me LiuQi ( 刘奇 ) JD / WandouLabs / PingCAP Co-founder / CEO of PingCAP Open-source hacker / Infrastructure software engineer
More informationExecutive Summary SOLE SOURCE JUSTIFICATION. Microsoft Integration
Executive Summary Commvault Simpana software delivers the unparalleled advantages and benefits of a truly holistic approach to data management. It is one product that contains individually licensable modules
More informationHadoop An Overview. - Socrates CCDH
Hadoop An Overview - Socrates CCDH What is Big Data? Volume Not Gigabyte. Terabyte, Petabyte, Exabyte, Zettabyte - Due to handheld gadgets,and HD format images and videos - In total data, 90% of them collected
More informationOpendedupe & Veritas NetBackup ARCHITECTURE OVERVIEW AND USE CASES
Opendedupe & Veritas NetBackup ARCHITECTURE OVERVIEW AND USE CASES May, 2017 Contents Introduction... 2 Overview... 2 Architecture... 2 SDFS File System Service... 3 Data Writes... 3 Data Reads... 3 De-duplication
More informationThe Definitive Guide to Backup and Recovery for Cassandra
The Definitive Guide to Backup and Recovery for Cassandra EBOOK The Definitive Guide to Backup and Recovery for Cassandra Table of Contents Executive Summary 1. Cassandra Technology Overview 2. The Need
More informationIntroduction to Database Services
Introduction to Database Services Shaun Pearce AWS Solutions Architect 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved Today s agenda Why managed database services? A non-relational
More informationKubernetes Integration with Virtuozzo Storage
Kubernetes Integration with Virtuozzo Storage A Technical OCTOBER, 2017 2017 Virtuozzo. All rights reserved. 1 Application Container Storage Application containers appear to be the perfect tool for supporting
More informationCohesity Flash Protect for Pure FlashBlade: Simple, Scalable Data Protection
Cohesity Flash Protect for Pure FlashBlade: Simple, Scalable Data Protection Executive overview Pure Storage FlashBlade is the industry s first all-flash, scale-out file storage purpose-built for high-performance
More informationBackup Edition Comparison OVERVIEW
Backup 12.5 Edition Comparison Available in and editions, protects data on 21 platforms in on-premises and remote systems, in private and public clouds, and on mobile devices. With active protection from
More informationSimple Data Protection for the Cloud Era
Simple Data Protection for the Era Enterprise Data Protection, Simplified Struggling with data protection? You re not alone. Many storage admins are faced with the challenge of protecting an increasing
More informationBackup License Comparison OVERVIEW
Backup 12.5 Comparison Acronis Backup 12.5 is the world s easiest and fastest backup solution, protecting data on 21 platforms, whether data is located on-premises, in remote systems, in private and public
More informationDistributed PostgreSQL with YugaByte DB
Distributed PostgreSQL with YugaByte DB Karthik Ranganathan PostgresConf Silicon Valley Oct 16, 2018 1 CHECKOUT THIS REPO: github.com/yugabyte/yb-sql-workshop 2 About Us Founders Kannan Muthukkaruppan,
More informationTIBX NEXT-GENERATION ARCHIVE FORMAT IN ACRONIS BACKUP CLOUD
TIBX NEXT-GENERATION ARCHIVE FORMAT IN ACRONIS BACKUP CLOUD 1 Backup Speed and Reliability Are the Top Data Protection Mandates What are the top data protection mandates from your organization s IT leadership?
More informationDell EMC Unity: Data Protection & Copy Data Management Options. Ryan Poulin Product Technologist Midrange & Entry Solutions Group
Dell EMC Unity: Data Protection & Copy Data Management Options Ryan Poulin Product Technologist Midrange & Entry Solutions Group What s New with Dell EMC Unity 650F 550F 450F 350F Optimized for All-Flash
More informationAxway API Management 7.5.x Cassandra Best practices. #axway
Axway API Management 7.5.x Cassandra Best practices #axway Axway API Management 7.5.x Cassandra Best practices Agenda Apache Cassandra - Overview Apache Cassandra - Focus on consistency level Apache Cassandra
More informationUsing Cohesity with Amazon Web Services (AWS)
Using Cohesity with Amazon Web Services (AWS) Achieve your long-term retention and archival objectives for secondary data Cohesity DataPlatform is a hyperconverged secondary data and application solution
More informationBackup and Recovery Best Practices With Tintri VMstore
Backup and Recovery Best Practices With Tintri VMstore Backup and Recovery Best Practices with Tintri VMstore TECHNICAL BEST PRACTICES PAPER, Revision 1.0, April 10, 2014 Contents Contents Introduction
More informationMongoDB Backup & Recovery Field Guide
MongoDB Backup & Recovery Field Guide Tim Vaillancourt Percona Speaker Name `whoami` { name: tim, lastname: vaillancourt, employer: percona, techs: [ mongodb, mysql, cassandra, redis, rabbitmq, solr, mesos
More informationMaking Non-Distributed Databases, Distributed. Ioannis Papapanagiotou, PhD Shailesh Birari
Making Non-Distributed Databases, Distributed Ioannis Papapanagiotou, PhD Shailesh Birari Dynomite Ecosystem Dynomite - Proxy layer Dyno - Client Dynomite-manager - Ecosystem orchestrator Dynomite-explorer
More informationCohesity DataPlatform Protecting Individual MS SQL Databases Solution Guide
Cohesity DataPlatform Protecting Individual MS SQL Databases Solution Guide Abstract This solution guide outlines the workflow for creating backups with Microsoft SQL Server databases and Cohesity Data
More informationOpenStack Trove and DBaaS: Impedance Match?
OpenStack Trove and DBaaS: Impedance Match? June 11, 2015 2014 EnterpriseDB Corporation. All rights reserved. 1 Introduction Fred Dalrymple EDB, product manager, Postgres Plus Cloud Database Representing
More informationThe Definitive Guide to MongoDB Backup and Recovery
The Definitive Guide to MongoDB Backup and Recovery EBOOK The Definitive Guide to MongoDB Backup and Recovery Table of Contents Executive Summary 1. MongoDB Technology Overview 2. The Need for Backup &
More informationIntroducing RecoverX 2.5
Backup & Recovery for Modern Applications Introducing RecoverX 2.5 Shalabh Goyal, Director, Product Management Kedar Hiremath, Product Marketing Manager November 16 th, 2017 What We Will Cover Today What
More informationThe course modules of MongoDB developer and administrator online certification training:
The course modules of MongoDB developer and administrator online certification training: 1 An Overview of the Course Introduction to the course Table of Contents Course Objectives Course Overview Value
More informationTintri & Veeam VM Backup & Replication Best Practices. John Phillips Strategic Alliances and Technical Marketing Ryan Post Systems Engineer
Tintri & Veeam VM Backup & Replication Best Practices John Phillips Strategic Alliances and Technical Marketing Ryan Post Systems Engineer 1 VM-aware Storage from Tintri Stores VMs and vdisks (only!) No
More informationTHE COMPLETE GUIDE HADOOP BACKUP & RECOVERY
THE COMPLETE GUIDE HADOOP BACKUP & RECOVERY INTRODUCTION Driven by the need to remain competitive and differentiate themselves, organizations are undergoing digital transformations and becoming increasingly
More informationNoSQL BENCHMARKING AND TUNING. Nachiket Kate Santosh Kangane Ankit Lakhotia Persistent Systems Ltd. Pune, India
NoSQL BENCHMARKING AND TUNING Nachiket Kate Santosh Kangane Ankit Lakhotia Persistent Systems Ltd. Pune, India Today large variety of available NoSQL options has made it difficult for developers to choose
More informationDistributed System. Gang Wu. Spring,2018
Distributed System Gang Wu Spring,2018 Lecture7:DFS What is DFS? A method of storing and accessing files base in a client/server architecture. A distributed file system is a client/server-based application
More informationAcronis Backup 12.5 License Comparison incl. cloud deployment functionality
DATA SHEET ACRONIS BACKUP LICENSING COMPARISON Acronis Backup 12.5 License Comparison incl. cloud deployment functionality 1 VIRTUAL HOST OFFICE 365 VIRTUAL HOST OFFICE 365 Storage disks, NAS, SAN, Acronis
More informationProtecting Hyper-V Environments
TECHNICAL WHITE PAPER: BACKUP EXEC TM 2014 PROTECTING HYPER-V ENVIRONMENTS Backup Exec TM 2014 Technical White Paper Protecting Hyper-V Environments Technical White Papers are designed to introduce Symantec
More informationAdaptation in distributed NoSQL data stores
Adaptation in distributed NoSQL data stores Kostas Magoutis Department of Computer Science and Engineering University of Ioannina, Greece Institute of Computer Science (ICS) Foundation for Research and
More informationApsaraDB for Redis. Product Introduction
ApsaraDB for Redis is compatible with open-source Redis protocol standards and provides persistent memory database services. Based on its high-reliability dual-machine hot standby architecture and seamlessly
More informationData Protection Guide
SnapCenter Software 4.0 Data Protection Guide For Custom Plug-ins March 2018 215-12932_C0 doccomments@netapp.com Table of Contents 3 Contents Deciding on whether to read the SnapCenter Data Protection
More informationConstruct a High Efficiency VM Disaster Recovery Solution. Best choice for protecting virtual environments
Construct a High Efficiency VM Disaster Recovery Solution Best choice for protecting virtual environments About NAKIVO Established in the USA since 2012 Provides data protection solutions for VMware, Hyper-V
More informationArchitecture of a Real-Time Operational DBMS
Architecture of a Real-Time Operational DBMS Srini V. Srinivasan Founder, Chief Development Officer Aerospike CMG India Keynote Thane December 3, 2016 [ CMGI Keynote, Thane, India. 2016 Aerospike Inc.
More informationEMC Virtual Infrastructure for Microsoft Exchange 2007
EMC Virtual Infrastructure for Microsoft Exchange 2007 Enabled by EMC Replication Manager, EMC CLARiiON AX4-5, and iscsi Reference Architecture EMC Global Solutions 42 South Street Hopkinton, MA 01748-9103
More informationDeploying Software Defined Storage for the Enterprise with Ceph. PRESENTATION TITLE GOES HERE Paul von Stamwitz Fujitsu
Deploying Software Defined Storage for the Enterprise with Ceph PRESENTATION TITLE GOES HERE Paul von Stamwitz Fujitsu Agenda Yet another attempt to define SDS Quick Overview of Ceph from a SDS perspective
More informationSnapCenter Software 4.0 Concepts Guide
SnapCenter Software 4.0 Concepts Guide May 2018 215-12925_D0 doccomments@netapp.com Table of Contents 3 Contents Deciding whether to use the Concepts Guide... 7 SnapCenter overview... 8 SnapCenter architecture...
More informationCopyright 2010 EMC Corporation. Do not Copy - All Rights Reserved.
1 Using patented high-speed inline deduplication technology, Data Domain systems identify redundant data as they are being stored, creating a storage foot print that is 10X 30X smaller on average than
More informationNutanix White Paper. Hyper-Converged Infrastructure for Enterprise Applications. Version 1.0 March Enterprise Applications on Nutanix
Nutanix White Paper Hyper-Converged Infrastructure for Enterprise Applications Version 1.0 March 2015 1 The Journey to Hyper-Converged Infrastructure The combination of hyper-convergence and web-scale
More informationAvailability for the Modern Data Center on FlexPod Introduction NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only
Availability for the Modern Data Center on FlexPod Introduction 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only Abstract Veeam Availability Suite v8 leverages NetApp storage
More informationVirtual Server Agent for VMware VMware VADP Virtualization Architecture
Virtual Server Agent for VMware VMware VADP Virtualization Architecture Published On: 11/19/2013 V10 Service Pack 4A Page 1 of 18 VMware VADP Virtualization Architecture - Virtual Server Agent for VMware
More informationDistributed File Systems II
Distributed File Systems II To do q Very-large scale: Google FS, Hadoop FS, BigTable q Next time: Naming things GFS A radically new environment NFS, etc. Independence Small Scale Variety of workloads Cooperation
More informationTrends in Data Protection and Restoration Technologies. Jason Iehl, NetApp
Trends in Data Protection and Restoration Technologies Jason Iehl, NetApp SNIA Legal Notice The material contained in this tutorial is copyrighted by the SNIA unless otherwise noted. Member companies and
More informationGoogle File System, Replication. Amin Vahdat CSE 123b May 23, 2006
Google File System, Replication Amin Vahdat CSE 123b May 23, 2006 Annoucements Third assignment available today Due date June 9, 5 pm Final exam, June 14, 11:30-2:30 Google File System (thanks to Mahesh
More informationStorageCraft OneXafe and Veeam 9.5
TECHNICAL DEPLOYMENT GUIDE NOV 2018 StorageCraft OneXafe and Veeam 9.5 Expert Deployment Guide Overview StorageCraft, with its scale-out storage solution OneXafe, compliments Veeam to create a differentiated
More informationScylla Open Source 3.0
SCYLLADB PRODUCT OVERVIEW Scylla Open Source 3.0 Scylla is an open source NoSQL database that offers the horizontal scale-out and fault-tolerance of Apache Cassandra, but delivers 10X the throughput and
More informationIntegrated Data Protection
Integrated Data Protection Subra Mohan Director, Product Management SnapCenter and Data Protection Solutions 1 2014 NetApp, Inc. All rights reserved. NetApp Proprietary Limited Use Only Agenda NetApp Data
More informationThe Data Protection Rule and Hybrid Cloud Backup
The 3-2-1 Data Protection Rule and Hybrid Cloud Backup IT teams are under extreme pressure to improve backup, disaster recovery and data protection to eliminate downtime and facilitate digital transformation.
More informationMongoDB and Mysql: Which one is a better fit for me? Room 204-2:20PM-3:10PM
MongoDB and Mysql: Which one is a better fit for me? Room 204-2:20PM-3:10PM About us Adamo Tonete MongoDB Support Engineer Agustín Gallego MySQL Support Engineer Agenda What are MongoDB and MySQL; NoSQL
More informationData Protection Modernization: Meeting the Challenges of a Changing IT Landscape
Data Protection Modernization: Meeting the Challenges of a Changing IT Landscape Tom Clark IBM Distinguished Engineer, Chief Architect Software 1 Data growth is continuing to explode Sensors & Devices
More informationThe following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into
The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material,
More informationEvaluating Cloud Storage Strategies. James Bottomley; CTO, Server Virtualization
Evaluating Cloud Storage Strategies James Bottomley; CTO, Server Virtualization Introduction to Storage Attachments: - Local (Direct cheap) SAS, SATA - Remote (SAN, NAS expensive) FC net Types - Block
More informationCohesity Architecture White Paper. Building a Modern, Web-Scale Architecture for Consolidating Secondary Storage
Architecture White Paper Building a Modern, Web-Scale Architecture for Consolidating Secondary Storage The Band-Aid Effect : The Hidden Cost of Trying to Make Legacy Solutions Work The combination of explosive
More informationIBM Spectrum Protect Version Introduction to Data Protection Solutions IBM
IBM Spectrum Protect Version 8.1.2 Introduction to Data Protection Solutions IBM IBM Spectrum Protect Version 8.1.2 Introduction to Data Protection Solutions IBM Note: Before you use this information
More informationServer Fault Protection with NetApp Data ONTAP Edge-T
Technical Report Server Fault Protection with NetApp Data ONTAP Edge-T Jeff Whitaker, NetApp March 2013 TR-4154 TABLE OF CONTENTS 1 Introduction... 3 2 Backup and Disaster Recovery Technology... 4 2.1
More informationUnderstanding Virtual System Data Protection
Understanding Virtual System Data Protection Server virtualization is the most important new technology introduced in the data center in the past decade. It has changed the way we think about computing
More informationXtremIO Business Continuity & Disaster Recovery. Aharon Blitzer & Marco Abela XtremIO Product Management
XtremIO Business Continuity & Disaster Recovery Aharon Blitzer & Marco Abela XtremIO Product Management Agenda XtremIO Current BC/DR Offerings New BC/DR Offering Benefits of New Offering Technical Overview
More informationHow Microsoft Built MySQL, PostgreSQL and MariaDB for the Cloud. Santa Clara, California April 23th 25th, 2018
How Microsoft Built MySQL, PostgreSQL and MariaDB for the Cloud Santa Clara, California April 23th 25th, 2018 Azure Data Service Architecture Share Cluster with SQL DB Azure Infrastructure Services Azure
More informationEBOOK. NetApp ONTAP Cloud FOR MICROSOFT AZURE ENTERPRISE DATA MANAGEMENT IN THE CLOUD
EBOOK NetApp ONTAP Cloud FOR MICROSOFT AZURE ENTERPRISE DATA MANAGEMENT IN THE CLOUD NetApp ONTAP Cloud for Microsoft Azure The ONTAP Cloud Advantage 3 Enterprise-Class Data Management 5 How ONTAP Cloud
More informationProtecting Miscrosoft Hyper-V Environments
Protecting Miscrosoft Hyper-V Environments Who should read this paper Technical White Papers are designed to introduce Veritas partners and end users to key technologies and technical concepts that are
More informationCassandra - A Decentralized Structured Storage System. Avinash Lakshman and Prashant Malik Facebook
Cassandra - A Decentralized Structured Storage System Avinash Lakshman and Prashant Malik Facebook Agenda Outline Data Model System Architecture Implementation Experiments Outline Extension of Bigtable
More informationMove Amazon RDS MySQL Databases to Amazon VPC using Amazon EC2 ClassicLink and Read Replicas
Move Amazon RDS MySQL Databases to Amazon VPC using Amazon EC2 ClassicLink and Read Replicas July 2017 2017, Amazon Web Services, Inc. or its affiliates. All rights reserved. Notices This document is provided
More informationA. Deduplication rate is less than expected, accounting for the remaining GSAN capacity
Volume: 326 Questions Question No: 1 An EMC Avamar customer s Gen-1 system with 4 TB of GSAN capacity has reached read-only threshold. The customer indicates that the deduplicated backup data accounts
More informationCopyright 2012 EMC Corporation. All rights reserved.
1 BACKUP BUILT FOR VMWARE Mark Twomey Technical Director, The Office Of The CTO 2 Agenda Market Forces Optimized VMware Backup Backup And Recovery For VCE Vblock Protecting vcloud Director Customer Success
More informationNew Oracle NoSQL Database APIs that Speed Insertion and Retrieval
New Oracle NoSQL Database APIs that Speed Insertion and Retrieval O R A C L E W H I T E P A P E R F E B R U A R Y 2 0 1 6 1 NEW ORACLE NoSQL DATABASE APIs that SPEED INSERTION AND RETRIEVAL Introduction
More informationMySQL Cluster Ed 2. Duration: 4 Days
Oracle University Contact Us: +65 6501 2328 MySQL Cluster Ed 2 Duration: 4 Days What you will learn This MySQL Cluster training teaches you how to install and configure a real-time database cluster at
More informationNoSQL systems. Lecture 21 (optional) Instructor: Sudeepa Roy. CompSci 516 Data Intensive Computing Systems
CompSci 516 Data Intensive Computing Systems Lecture 21 (optional) NoSQL systems Instructor: Sudeepa Roy Duke CS, Spring 2016 CompSci 516: Data Intensive Computing Systems 1 Key- Value Stores Duke CS,
More informationGFS: The Google File System. Dr. Yingwu Zhu
GFS: The Google File System Dr. Yingwu Zhu Motivating Application: Google Crawl the whole web Store it all on one big disk Process users searches on one big CPU More storage, CPU required than one PC can
More information5 Fundamental Strategies for Building a Data-centered Data Center
5 Fundamental Strategies for Building a Data-centered Data Center June 3, 2014 Ken Krupa, Chief Field Architect Gary Vidal, Solutions Specialist Last generation Reference Data Unstructured OLTP Warehouse
More informationSnapCenter Software 2.0 Installation and Setup Guide
SnapCenter Software 2.0 Installation and Setup Guide July 2017 215-11357_C0 doccomments@netapp.com Table of Contents 3 Contents Deciding whether to read this information... 7 SnapCenter overview... 8
More informationEMC RecoverPoint. EMC RecoverPoint Support
Support, page 1 Adding an Account, page 2 RecoverPoint Appliance Clusters, page 3 Replication Through Consistency Groups, page 4 Group Sets, page 22 System Tasks, page 24 Support protects storage array
More informationMaster Services Agreement:
This Service Schedule for Hosted Backup Services v8.0.0 (the Service ) marketed as RecoveryVault replaces all previously signed / incorporated version(s) of the Service Schedule(s) for Hosted Backup Services
More informationCS 655 Advanced Topics in Distributed Systems
Presented by : Walid Budgaga CS 655 Advanced Topics in Distributed Systems Computer Science Department Colorado State University 1 Outline Problem Solution Approaches Comparison Conclusion 2 Problem 3
More informationINTRODUCTION TO XTREMIO METADATA-AWARE REPLICATION
Installing and Configuring the DM-MPIO WHITE PAPER INTRODUCTION TO XTREMIO METADATA-AWARE REPLICATION Abstract This white paper introduces XtremIO replication on X2 platforms. XtremIO replication leverages
More informationGoogle File System. Arun Sundaram Operating Systems
Arun Sundaram Operating Systems 1 Assumptions GFS built with commodity hardware GFS stores a modest number of large files A few million files, each typically 100MB or larger (Multi-GB files are common)
More informationModernize Your Backup and DR Using Actifio in AWS
FOR AWS Modernize Your Backup and DR Using Actifio in AWS 150105H FOR AWS Modernize Your Backup and DR Using Actifio in AWS What is Actifio? Actifio virtualizes the data that s the lifeblood of business.
More informationEMC Data Protection for Microsoft
EMC Data Protection for Microsoft Featuring Industry Perspectives from IDC 7 November 2013 Ashish Nadkarni, IDC Research Director, Storage Systems @Ashish_Nadkarni Phil George, EMC Backup Recovery Systems
More informationWhite paper ETERNUS CS800 Data Deduplication Background
White paper ETERNUS CS800 - Data Deduplication Background This paper describes the process of Data Deduplication inside of ETERNUS CS800 in detail. The target group consists of presales, administrators,
More informationNOSQL DATABASE SYSTEMS: DECISION GUIDANCE AND TRENDS. Big Data Technologies: NoSQL DBMS (Decision Guidance) - SoSe
NOSQL DATABASE SYSTEMS: DECISION GUIDANCE AND TRENDS h_da Prof. Dr. Uta Störl Big Data Technologies: NoSQL DBMS (Decision Guidance) - SoSe 2017 163 Performance / Benchmarks Traditional database benchmarks
More informationAsigra Cloud Backup Provides Comprehensive Virtual Machine Data Protection Including Replication
Datasheet Asigra Cloud Backup Provides Comprehensive Virtual Machine Data Protection Including Replication Virtual Machines (VMs) have become a staple of the modern enterprise data center, but as the usage
More information