Trends in Data Protection and Restoration Technologies. Jason Iehl, Network Appliance Michael Rowan, Consultant

Similar documents
Trends in Data Protection CDP and VTL

Trends in Data Protection and Restoration Technologies. Jason Iehl, NetApp

Trends in Data Protection and Restoration Technologies. Mike Fishman, EMC 2 Corporation

Restoration Technologies

Disk and Tape Backup Mechanisms

ADVANCED DEDUPLICATION CONCEPTS. Thomas Rivera, BlueArc Gene Nagle, Exar

ADVANCED DATA REDUCTION CONCEPTS

Chapter 11. SnapProtect Technology

Continuous Data Protection Solving the Problem of Restoration June, 2008

Copyright 2010 EMC Corporation. Do not Copy - All Rights Reserved.

A Crash Course In Wide Area Data Replication. Jacob Farmer, CTO, Cambridge Computer

Tom Sas HP. Author: SNIA - Data Protection & Capacity Optimization (DPCO) Committee

ECE Engineering Robust Server Software. Spring 2018

1 Quantum Corporation 1

Disk-to-Disk backup. customer experience. Harald Burose. Architect Hewlett-Packard

Netapp Exam NS0-510 NCIE-Backup & Recovery Implementation Engineer Exam Version: 7.0 [ Total Questions: 216 ]

50 TB. Traditional Storage + Data Protection Architecture. StorSimple Cloud-integrated Storage. Traditional CapEx: $375K Support: $75K per Year

Application Recovery. Andreas Schwegmann / HP

Storage Virtualization II Effective Use of Virtualization - focusing on block virtualization -

IBM Storage Software Strategy

How NAS Systems Participate in Data Protection. Paul Massiglia, Symantec Corporation

MOVING TOWARDS ZERO DOWNTIME FOR WINTEL Caddy Tan 21 September Leaders Have Vision visionsolutions.com 1

How Symantec Backup solution helps you to recover from disasters?

WHITE PAPER: DATA PROTECTION. Veritas NetBackup for Microsoft Exchange Server Solutions Guide. Larry Cadloff February 2009

Chapter 10 Protecting Virtual Environments

Get More Out of Storage with Data Domain Deduplication Storage Systems

TSM Paper Replicating TSM

Balakrishnan Nair. Senior Technology Consultant Back Up & Recovery Systems South Gulf. Copyright 2011 EMC Corporation. All rights reserved.

Data Deduplication Methods for Achieving Data Efficiency

STORAGE CONSOLIDATION WITH IP STORAGE. David Dale, NetApp

Storage Virtualization II. - focusing on block virtualization -

STORAGE CONSOLIDATION WITH IP STORAGE. David Dale, NetApp

Identifying and Eliminating Backup System Bottlenecks: Taking Your Existing Backup System to the Next Level

EMC DATA DOMAIN PRODUCT OvERvIEW

The Nuances of Backup and Recovery Solutions

SAP HANA Disaster Recovery with Asynchronous Storage Replication

Storage Virtualization I What, Why, Where and How? Rob Peglar Xiotech Corporation

Boost your data protection with NetApp + Veeam. Schahin Golshani Technical Partner Enablement Manager, MENA

Simplify and Improve DB2 Administration by Leveraging Your Storage System

The storage challenges of virtualized environments

Backups and archives: What s the scoop?

Chapter 12 Backup and Recovery

ECE Enterprise Storage Architecture. Fall 2018

HP D2D & STOREONCE OVERVIEW

Dell DR4000 Replication Overview

Protect enterprise data, achieve long-term data retention

Backup and archiving need not to create headaches new pain relievers are around

Simplify and Improve IMS Administration by Leveraging Your Storage System

Business Continuity and Disaster Recovery. Ed Crowley Ch 12

Veeam and HP: Meet your backup data protection goals

A Promise Kept: Understanding the Monetary and Technical Benefits of STaaS Implementation. Mark Kaufman, Iron Mountain

VPLEX & RECOVERPOINT CONTINUOUS DATA PROTECTION AND AVAILABILITY FOR YOUR MOST CRITICAL DATA IDAN KENTOR

SAP with Oracle on UNIX and NFS with NetApp Clustered Data ONTAP and SnapManager for SAP 3.4

Exploring Options for Virtualized Disaster Recovery

What We ll Do Today. Brainstorm three future scenarios Explore five dials we can turn Discuss the probable futures Q&A

Veeam Availability Solution for Cisco UCS: Designed for Virtualized Environments. Solution Overview Cisco Public

Chapter 1. Storage Concepts. CommVault Concepts & Design Strategies:

Integrated Data Protection

10 Reasons Why Your DR Plan Won t Work

Redefine Data Protection: Next Generation Backup & Business Continuity Solutions

INTRODUCTION TO XTREMIO METADATA-AWARE REPLICATION

Archiving, Backup, and Recovery for Complete the Promise of Virtualisation Unified information management for enterprise Windows environments

Dell PowerVault DL2100 Powered by CommVault

ECONOMICAL, STORAGE PURPOSE-BUILT FOR THE EMERGING DATA CENTERS. By George Crump

Oracle Zero Data Loss Recovery Appliance (ZDLRA)

The Best Storage for Virtualized Environments

Nutanix Tech Note. Virtualizing Microsoft Applications on Web-Scale Infrastructure

Arcserve Unified Data Protection Virtualization Solution Brief

Brendan Lelieveld-Amiro, Director of Product Development StorageQuest Inc. December 2012

Simplify and Improve IMS Administration by Leveraging Your Storage System

Redefine Data Protection: Next Generation Backup And Business Continuity

EMC CLARiiON CX3-40. Reference Architecture. Enterprise Solutions for Microsoft Exchange 2007

Hitachi Adaptable Modular Storage and Hitachi Workgroup Modular Storage

Availability and the Always-on Enterprise: Why Backup is Dead

As storage networking technology

NEC Express5800 R320f Fault Tolerant Servers & NEC ExpressCluster Software

Solution Brief: Archiving with Harmonic Media Application Server and ProXplore

Chapter 3 `How a Storage Policy Works

SPECIFICATION FOR NETWORK ATTACHED STORAGE (NAS) TO BE FILLED BY BIDDER. NAS Controller Should be rack mounted with a form factor of not more than 2U


Using Computer Associates BrightStor ARCserve Backup with Microsoft Data Protection Manager

STOREONCE OVERVIEW. Neil Fleming Mid-Market Storage Development Manager. Copyright 2010 Hewlett-Packard Development Company, L.P.

Hitachi Adaptable Modular Storage and Workgroup Modular Storage

SAP HANA Disaster Recovery with Asynchronous Storage Replication

SRM: Can You Get What You Want? John Webster, Evaluator Group.

IBM Spectrum Protect Version Introduction to Data Protection Solutions IBM

Next Generation Data Protection and Recovery Solution. Den Ng

Copyright 2012 EMC Corporation. All rights reserved.

A Centralized Data Protection Application for Cross Vendor Storage Systems Prateek Sinha & Nishi Gupta Tata Consultancy Services

Benefits of Multi-Node Scale-out Clusters running NetApp Clustered Data ONTAP. Silverton Consulting, Inc. StorInt Briefing

The Need for a Terminology Bridge. May 2009

Server Fault Protection with NetApp Data ONTAP Edge-T

Step-by-Step Guide. Round the clock backup of everything with On- & Off-site Data Protection

Virtualization with Arcserve Unified Data Protection

Branch Office Data Consolidation

DAHA AKILLI BĐR DÜNYA ĐÇĐN BĐLGĐ ALTYAPILARIMIZI DEĞĐŞTĐRECEĞĐZ

A CommVault White Paper: Business Continuity: Architecture Design Guide

DEMYSTIFYING DATA DEDUPLICATION A WHITE PAPER

Disaster Recovery-to-the- Cloud Best Practices

DELL EMC DATA DOMAIN EXTENDED RETENTION SOFTWARE

Transcription:

Trends in Data Protection and Restoration Technologies Jason Iehl, Network Appliance Michael Rowan, Consultant

SNIA Legal Notices The material contained in this tutorial is copyrighted by the SNIA Member companies and individuals may use this material in presentations and literature under the following conditions: Any slide or slides used must be reproduced without modification The SNIA must be acknowledged as source of any material used in the body of any document containing material from these presentations This presentation is a project of the SNIA Education Committee 2

Storage Networking Industry Association SNIA is the trade group for storage networks ensuring that storage networks become complete and trusted solutions across the IT community - http://www.snia.org SNIA Dictionary of Storage Networking Terminology is an excellent resource http://www.snia.org/dictionary SNIA Tutorials are on-line: http://www.snia.org/education/tutorials 3

About SNIA and the DMF About the Storage Networking Industry Association (SNIA) SNIA s primary goal is to ensure that storage networks become complete and trusted solutions across the IT community For additional information about SNIA see www.snia.org SNIA s Dictionary of Storage Networking Terminology is online at www.snia.org/dictionary About the SNIA Data Management Forum (DMF) The DMF is a sub-group of SNIA acting as the worldwide authority on Data Management, Data Protection and ILM www.snia-dmf.org The DMF is a collaborative storage industry resource available to anyone responsible for the accessibility and integrity of their organization s information. Data Protection Initiative (DPI) DMF Information Lifecycle Management Initiative (ILMI) Long term Archive and Compliance Storage Initiative (LTACSI) Defining new approaches and best practices for data protection and recovery Developing, teaching and promoting ILM practices, implementation methods, and benefits Addressing challenges in developing, securing, and retaining long-term digital archives 4

Trends in Data Protection and Restoration Technologies We will cover the following technologies: Application Consistency Snapshots Backup to disk VTL CDP Replication What each is, its value, what is good and not so good, integration into infrastructure. 5

Outline Traditional data protection challenges Application Consistency Snapshots Backup to disk VTL CDP Replication Evolving data protection SNIA resources 6

Traditional Data Protection Challenges Backup window Failed, inconsistent, unreliable recovery Recovery time too long (poor RTO) Negative production impact Protection gaps (poor RPO) Disaster recovery Regulatory compliance and corporate governance Costly and inefficient Disruptive to change 7

Protection Based on Recovery Years Days Hrs Mins Secs Secs Mins Hrs Days Recovery Point Recovery Time Protection Methods Recovery Methods Tape Backups Vaults Archival Capture on Write Disk Backups Snapshots Synthetic Backup Real Time Replication Instant Recovery Disk Restores Tape Restores Point-in-Time Roll Back Search & Retrieve Enabling Technologies Tape & Automation Virtual Tape Library Deduplication Continuous Data Protection Snapshots 8

Disk-based Data Protection Data Protection is about Data Availability Backup is NOT a business requirement -- Availability is. Data protection as an extension of data availability - Restoration focused Disk-Assisted and Disk-based protection methods Speed Backup Windows Recovery Time Objectives Random Access Data set, application object or sub-object recovery - quickly ILM More gradations in lifecycle (B&W -> shades of grey) 9

Application Consistency When an application is running during the copy process, various techniques are available to ensure data consistency Much like the open files issue when backing up a file system that is in use, applications (like databases, messaging systems, etc) allow for different approaches to capturing a holistic picture of the applications data during a copy process (such as a snapshot, a mirror-split, or CDP protection). It is important to understand the consistency semantics of your application so that your data protection copies are recoverable. 10

Cold Protective Copy Old School approach: Bring application down Create copy (or mark timeline as a cold moment ) Bring application back online Requires downtime How much is dependent on technology, implementation and environment Can be from seconds to hours This is a true backup window Actual downtime during backup copy operation 11

To Quiesce or Not? Cold Snapshot Less complex, but backup window is downtime Application Consistent Snapshot Application intervention Application dependent Hot Backup or Online Backup Atomic or Crash Consistent Snapshot Ability to take snapshot for entire dataset at exactly same moment Can be done in multiple ways Recovery domain same as high availability systems 12

Application Assisted Protective Copy A hot or online backup or copy - is done while application is online and active Application coordination is invoked to facilitate copy Often done in stages across data set or using a transaction log for shorter durations Performance of application often degrades during hot window Also known as a backup window Application isn t offline, but isn t performing at optimal levels Application performance often continues degrading as the window expands or transaction flow increases Warnings Not all applications support this operative mode Source of complexity -- high failure rates (test often and thoroughly) 13

Crash Consistent Crash Consistent or Atomic Protective Copy An atomic instant picture of the data Application must be capable of crash recovery Can it be clustered? No application interaction Application isn t involved in the snapshot or CDP operation No (snap/operation) related performance issues Backup window is eliminated completely Simplifies operation run-book and backups Warnings Not all applications can do crash recovery Not all disk protections can do atomic operations across a wide set of disks 14

Snapshots A disk based instant copy that captures the original data at a specific point in time. Snapshots can be read-only or read-write. A fully usable copy of a defined collection of data that contains an image of the data as it appeared at the point in time at which the copy was initiated. A snapshot may be either a duplicate or a replicate of the data it represents. www.snia.org/dictionary 15

Snapshot of Networked Storage Terminology: Snapshot, Checkpoint, Point-in-Time, Stable Image = Any technology that presents a consistent point-in-time view of changing data. Many implementations exist. Why? Allows for complete backup or restore, with application downtime measured in minutes (or less) Most vendors: Image only = (entire Volume) Backup/Restore of individual files is possible If conventional backup is done from snapshot Or, if file-map is stored with Image backup 16

Full Copy Snapshots Keep multiple RAID-1 sets of data Cease RAID-1 operations to one set of the disks Freezes the contents of that set at a specific point in time Can be done by a variety of sources Host based logical volume manager Fabric based logical volume manager or virtualizer Array based functions Frozen copy is not dependent on current copy for content Better tolerance for failure Be careful: If alternate set is dependent (cage, power, director), this independence is compromised 17

Differential Snapshots Keeps track of changes to the primary copy Uses a combination of the change set and the primary disks to save/present snapshot Different approaches optimize for different use cases Copy On Write (CoW) Redirect On Write (RoW) Write Anywhere (WA) 18

Copy On Write Primary disks remain current Whenever a Write operation arrives, it is held: First the current contents of the write-destination are read in The old-contents from the primary disk is saved off somewhere and indexed The new write is now allowed to pass through Read path of current disks remains optimized Write path of current disks is potentially impacted Read/Write path of snapshot disks impacted 19

Redirect on Write Primary disks are frozen New write operations to primary disks are stored in a journal (and indexed) To read current copy, the journal is checked first To read the snapshot copy, the primary disks are used When snapshot is dissolved, write journal must be applied to primary disks to catch up Read path of snapshot is optimized Write path of current disks is optimized (no copy) Read path of current disks is potentially impacted 20

Write Anywhere All disk blocks are virtualized Current disk is represented by a map to real blocks -- not directly mapped Disk storage is larger than maps present New writes, instead of overwritting blocks, are directed to free blocks Maps are kept for now and potentially for multiple snapshots Reference counts are kept for blocks in-use Performance doesn t generally change primary/snapshot Performance can be impacted by fragmentation depending implimentation 21

Snapshot Comparison Upsides Downsides Applications Full Copy Snapshot No cost during snapshot process Can be used for DR - independent copy Massive storage cost 1x of storage per RPO Like disk - expensive Often in the same disk frame Loss of DR component Consider re-sync time in schedules Disaster Recovery Near zero backup window 24x7 operations Faster restore Can do no-copy restore Most run-books require copy Can help with data repurposing Differential Copy Snapshot Less storage consumption - typically 10-20% Depends on churn Typically can take advantage of cheaper disk Performance impacts while snapshot exists Multiple implementations to optimize performance impact Most vendors don t offer multiple implementations - pick at onset Leverages main copy - not DR capable Backup source Near zero backup window 24x7 operations Fast restore copy based by definition Can help with data repurposing Beware performance impact 22

Backup to Disk What and Why? Part of the Backup Application itself Easy to implement Vendors may have low cost or no cost option Can be directed to any type of disk or connection FC, SATA, DAS, NAS, etc Allows backup application to attain better tape utilization streaming Especially when backing up slower clients Single/Volume Restores are faster than tape What to watch out for Can be faster than tape, slower than VTL (generally) Current versions of enterprise backup applications support it May charge more for advanced functionality Backup window issues may still exist - not instant 23

Introduction to VTL What: Originally designed for mainframe Moved into open systems market Fits within the backup environment Easy to deploy and integrate Takes advantage of current processes Reduces tape media handling IP / FC SAN VTL Backup Server Why: Improved speed and reliability Speed of higher end tape without the downside Ability to achieve high performance with or without the use of multiplexing Enables faster access to data for single file/folder restores No detached tape leaders or mechanical failures Trends in Data Protection - CDP and VTL 2007 Storage Networking Industry Association. All 2007 Rights Storage Reserved. Networking Industry Association. All Rights Reserved. Tape Library 24

The VTL Difference Easy to manage in traditional backup software environment: Works like normal tape library Fits into existing backup and restore processes Viewed as open systems cartridges, robot, tape drives, and in some cases even a mail slot Standard tape copy, cloning, or vaulting functions apply for off-site copies Used to replicate data to physical tape for long term retention Cost effective solution Leverages lower cost disk Can extend the life of current physical tape investment Used as a front-end to the backup process Tape is still used for longer term retention Trends in Data Protection - CDP and VTL 2007 Storage Networking Industry Association. All 2007 Rights Storage Reserved. Networking Industry Association. All Rights Reserved. 25

VTL Deployment Scenario 1 Scenario 2 Backup Server Backup Server VTL VTL IP / FC SAN IP / FC SAN Tape Library Tape Library Backup Media Servers Backup Media Servers Trends in Data Protection - CDP and VTL 2007 Storage Networking Industry Association. All 2007 Rights Storage Reserved. Networking Industry Association. All Rights Reserved. 26

VTL Best Practices Use as primary backup target to reduce backup window Still take advantage of tape as backup target where appropriate Add storage to enable additional restore time objectives Rules the same rules for libraries and tape when physically connecting Virtual drives per connection Tape redeployment Eject process, controlled by the backup software Offsite requirements Bandwidth, connectivity, time to complete tape copies Trends in Data Protection - CDP and VTL 2007 Storage Networking Industry Association. All 2007 Rights Storage Reserved. Networking Industry Association. All Rights Reserved. 27

Introduction to CDP What: Capture every change as it occurs Protected copy in a secondary location Recover to any point in time How: Block-based File-based Application-based Why: Implementations of true CDP today are delivering zero data loss, zero backup window and simple recovery. CDP customers can protect all data at all times and recover directly to any point in time. Trends in Data Protection - CDP and VTL 2007 Storage Networking Industry Association. All 2007 Rights Storage Reserved. Networking Industry Association. All Rights Reserved. 28

The CDP Difference Replication is not CDP: Maintains only a current copy of the data May be combined with some snapshot capabilities Snapshots are not CDP: Snapshots are scheduled events Data loss possible if crash or corruption happens between snaps Snapshots frequently to same system as primary Lack continuous index with embedded knowledge of relationship of data to files, folders, application and server Scheduled events are not CDP: Scheduled backup processes Log collection for database style applications, rolling transactions forwards or backwards Trends in Data Protection - CDP and VTL 2007 Storage Networking Industry Association. All 2007 Rights Storage Reserved. Networking Industry Association. All Rights Reserved. 29

Traditional Recovery Last Known- Good Image Analyze APPLICATION DOWNTIME Application Restarted Modifications Since Last Image Analyze Restore* Recover Recovery Point Objective Drives *10TB = 4 hours from disk, 12.5 hours from tape Recovery Time Objective 30

Recovery with CDP Last known good image APPLICATION DOWNTIME Application Restarted Analyze Recover Offers a Closer Recovery Point Instant Restore Trends in Data Protection - CDP and VTL 2007 Storage Networking Industry Association. All 2007 Rights Storage Reserved. Networking Industry Association. All Rights Reserved....and a Shorter Recovery Time 31

CDP Deployment Desktops / Clients CDP Repository Application / Database Servers IP / FC SAN Email / File Servers SAN Storage 32

CDP Implementation Models Database Database Server Applications Network Server Generic File System NFS, CIFS Files Virtual Blocks Application-based File-based Volume Manager Logical Blocks Device Driver Storage Network Trends in Data Protection - CDP and VTL 2007 Storage Networking Industry Association. All 2007 Rights Storage Reserved. Networking Industry Association. All Rights Reserved. Block-based 33

Consistency Logical Coherence In electronic data management, a set of data is said to be consistent when the data can be correctly and unambiguously interpreted by an application. Data Consistency is THE critical issue in data recovery 34

Remote Replication What and Why? Extension of full snapshots, delta snapshots, DP images or CDP (depending on implementation/vendor) File or block replication across a communications pipe synchronously or asynchronously (remote) Combines DR with corruption protection Extend tier 1 data protection to second and thirds data tier s Can be used to consolidate tape creation to central location What to watch out for Speed of light/ bandwidth issues Site-to-site recovery can be awkward -- local corruption issues better dealt with locally 35

Solving the problem Desktops / Clients CDP Repository Application / Database Servers IP Network IP / FC SAN VTL Email / File Servers SAN Storage w/ Snapshots Tape Library Backup Media Servers 36

SNIA Resources Related tutorials Disk and Tape Backup Mechanisms Disk Based Restoration Technology Visit the Data Management Forum website at http://www.snia-dmf.org Data Protection Buyers Guide available 37

Q&A / Feedback Please send any questions or comments on this presentation to the SNIA: trackvirtualization@snia.org Many thanks to the following individuals for their contributions to this tutorial. SNIA Education Committee Frank Bunn Curt Kolovson Ben Kuo John Logan Gene Nagle Rob Peglar Abbott Schindler Nik Simpson Wolfgang Singer David Thiel 38

Q&A / Feedback Please send any comments on this tutorial to SNIA: trackdatamgmt@snia.org Many thanks to the following individuals for their contributions to this tutorial: SNIA Data protection Initiative SNIA Data Management Forum SNIA Tech Council Nancy Clay Mike Rowan SW Worth Mike Fishman Jason Iehl Get Involved! www.snia-dmf.org - Find a passion - Join a committee - Gain knowledge & influence - Make a difference 39

Thank you for your feedback Questions and Answers 40