Optimizing for Recovery

Similar documents
A Close-up Look at Potential Future Enhancements in Tivoli Storage Manager

IBM Tivoli Storage Manager Version Introduction to Data Protection Solutions IBM

IBM Spectrum Protect Version Introduction to Data Protection Solutions IBM

C Q&As. IBM Tivoli Storage Manager V7.1 Implementation. Pass IBM C Exam with 100% Guarantee

EMC Solutions for Backup to Disk EMC Celerra LAN Backup to Disk with IBM Tivoli Storage Manager Best Practices Planning

IBM Exam C IBM Tivoli Storage Manager V7.1 Fundamentals Version: 6.0 [ Total Questions: 63 ]

Exam Name: IBM Tivoli Storage Manager V6.2

IBM IBM Tivoli Storage Manager V5.4 Administration. Download Full Version :

IBM Tivoli Storage Manager V5.1 Implementation Exam # Sample Test

C demo. Number: Passing Score: 800 Time Limit: 120 min File Version: 1.0. Version:

IBM Spectrum Protect Node Replication

PASS4TEST. IT Certification Guaranteed, The Easy Way! We offer free update service for one year

IBM Spectrum Storage Professional Certification Program

Tivoli Storage Manager V3.7 Performance Update Joseph Giulianelli IBM San Jose

IBM C IBM Tivoli Storage Manager V7.1 Fundamentals. Download Full Version :

IBM Tivoli Storage Manager 6

Exam Questions c

Insights into TSM/HSM for UNIX and Windows

IBM C IBM Tivoli Storage Manager V7.1 Implementation.

Pass4Sure.C _70.Q&A

TSM Studio Dataview's and Dataview Commands. TSM Studio

Chapter 2 CommVault Data Management Concepts

IBM C IBM Tivoli Storage Manager V7.1 Fundamentals. Download Full Version :

TSM Node Replication Deep Dive and Best Practices

Passit4sure.C questions C IBM Tivoli Storage Manager V7.1 Implementation

Configuring Tivoli Storage Manager for Mirapoint NDMP Backup

Preview of Future Enhancements in Tivoli Storage Manager

IBM IBM Tivoli Storage Manager V5.5 Implementation. Download Full Version :

TSM Paper Replicating TSM

ZertSoft % Erfolgsgarantie und 100% Geld-Zurück Garantie

TSM HSM Explained. Agenda. Oxford University TSM Symposium Introduction. How it works. Best Practices. Futures. References. IBM Software Group

Chapter 12 Backup and Recovery

Backup management with D2D for HP OpenVMS

A Guided Tour of the TSM Client for Windows

Oracle Secure Backup 12.1 Technical Overview

IBM Tivoli Storage Manager for HP-UX Version Installation Guide IBM

Chapter 7. GridStor Technology. Adding Data Paths. Data Paths for Global Deduplication. Data Path Properties

Announcing Oracle Secure Backup 10.3: Fastest, Most Cost-Effective Oracle Backup

EXAMGOOD QUESTION & ANSWER. Accurate study guides High passing rate! Exam Good provides update free of charge in one year!

A GPFS Primer October 2005

Veritas NetBackup OpenStorage Solutions Guide for Disk

Aspects of Information Lifecycle Management

C exam. IBM Tivoli Storage Manager V7.1.1 Administration

IBM IBM Open Systems Storage Solutions Version 4. Download Full Version :

IBM Spectrum Protect for Enterprise Resource Planning Data Protection for SAP for Oracle Version 8.1. Installation and User's Guide IBM

C Exam Questions Demo IBM. Exam Questions C

Exam Name: IBM Tivoli Storage Manager V6.2

A. Deduplication rate is less than expected, accounting for the remaining GSAN capacity

RealTime. RealTime. Real risks. Data recovery now possible in minutes, not hours or days. A Vyant Technologies Product. Situation Analysis

The Google File System

IBM Spectrum Protect for Enterprise Resource Planning Data Protection for SAP HANA Version Installation and User's Guide IBM

Technology Insight Series

Chapter 11. SnapProtect Technology

IBM Tivoli Storage Manager for Databases Version Data Protection for Microsoft SQL Server Installation and User's Guide IBM

C exam. Number: C Passing Score: 800 Time Limit: 120 min File Version:

EMC CLARiiON Backup Storage Solutions

TSM Studio Server Alerting User Guide

SwiftStack Object Storage

REO SERIES. AN-DBP-065a: Configuring IBM Tivoli Storage Manager 5.3 and Windows Server 2003 with Fibre Channel- Connected REO VTL

TRIM Integration with Data Protector

IBM A EXAM QUESTIONS & ANSWERS

Effective Planning and Use of IBM Spectrum Protect Container Storage Pools and Data Deduplication

Exam : A : Assess: Fundamentals of Applying Tivoli Storage Management V3. Title. Version : Demo

E DECE-IE. A Success Guide to Prepare- Dell EMC Avamar Expert for Implementation Engineers. edusum.com

Providing a first class, enterprise-level, backup and archive service for Oxford University

IBM Spectrum Protect for Enterprise Resource Planning: Data Protection for SAP for DB2 Version 8.1. Installation and User's Guide IBM

Administration Guide - NetApp File Archiver

INTEROPERABILITY OF AVAMAR AND DISKXTENDER FOR WINDOWS

VCS-276.exam. Number: VCS-276 Passing Score: 800 Time Limit: 120 min File Version: VCS-276

WHY DO I NEED FALCONSTOR OPTIMIZED BACKUP & DEDUPLICATION?

EMC Data Domain for Archiving Are You Kidding?

Protect enterprise data, achieve long-term data retention

Gerd W. Becker Oxford, 27. September

Dell EMC Unity: Built-In Hybrid Cloud & Software Defined Capabilities. Wei Chen Product Technologist Midrange & Entry Systems

IBM IBM Storage Sales, Version 7. Download Full Version :

Efficiently Backing up Terabytes of Data with pgbackrest. David Steele

Backups and archives: What s the scoop?

CLOUD-SCALE FILE SYSTEMS

Next Generation of Data Recovery with TSM 6

Backup Solutions with (DSS) July 2009

Exam Questions C

Distributed Systems - III

The NetBackup catalog resides on the disk of the NetBackup master server. The catalog consists of the following parts:

C13: Files and Directories: System s Perspective

CPSC 421 Database Management Systems. Lecture 11: Storage and File Organization

Chapter 9 Protecting Client Data

IBM V7000 Unified R1.4.2 Asynchronous Replication Performance Reference Guide

Configuring IBM Spectrum Protect for IBM Spectrum Scale Active File Management

An Introduction to GPFS

Oracle Secure Backup: Achieve 75 % Cost Savings with Your Tape Backup

IBM Tivoli Storage Manager HSM for Windows Version 7.1. Administration Guide

IBM Tivoli Storage Manager for AIX Version Installation Guide IBM

Chapter 17: Distributed-File Systems. Operating System Concepts 8 th Edition,

Restoring the Server to Bare Metal

Chapter 4 Data Movement Process

Configuring Short RPO with Actifio StreamSnap and Dedup-Async Replication

IBM. DFSMS Introduction. z/os. Version 2 Release 3 SC

GFS: The Google File System. Dr. Yingwu Zhu

SPECIFICATION FOR NETWORK ATTACHED STORAGE (NAS) TO BE FILLED BY BIDDER. NAS Controller Should be rack mounted with a form factor of not more than 2U

Title : Symantec NetBackup 7.0 for UNIX (STS) Version : DEMO

Transcription:

IBM Software Group Dave Cannon IBM Tivoli Storage Management Development Oxford University TSM Symposium 2003 Setting Expectations This session describes TSM functions and methods for optimizing recovery of client data Most of the techniques described in this presentation must be implemented before recovery is needed, so plan ahead This is not a performance tuning session We won t cover disaster recovery or bare-metal restore There is not a magic, one-size-fits-all solution for optimal recovery 2 1

Topics General principles Exploiting disk storage Sequential-access storage Special situations Potential future enhancements 3 General Principles 4 2

Tradeoffs Restore performance Other considerations Cost of storage Capacity requirements Device characteristics Backup constraints Backup window Network bandwidth Impact to client Efficient data transfer within storage hierarchy Policy goals Number of versions Retention time 5 Prioritizing Data Since there are tradeoffs (costs) associated with fast recovery, prioritize data to optimize recovery at least for most critical data Consider using different storage pool hierarchies and methods for different nodes, data types, or objects Disk usually allows fastest recovery for small-medium files (<100MB) High-performance tape may be optimal for large objects (databases or file system images) Critical Files Less-critical Files Databases Disk only Disk and tape High-performance tape 6 3

Prioritizing Recovery Scenarios Single file or multiple files stored together Tracking of objects by TSM database optimizes restore Multiple files (such as a directory) backed up at different times Fragmentation of data in sequential-access storage pools may make this challenging Presentation will emphasize approaches to optimize for this scenario Entire file system Effects of fragmentation can be mitigated with image backup/restore Databases or other applications with large objects Restore of large applications is typically fast To optimize, see publications for your application protection product 7 Exploiting Disk Storage 8 4

Disk Storage Basics Disk storage can provide optimal restore for small/medium files No delays for mounting of volumes Fast positioning to data within volume Simultaneous volume access from multiple sessions Random-access disk storage pools defined with device class of DISK Sequential-access disk storage pools defined using device class with DEVTYPE=FILE 9 Random vs. Sequential Disk Space allocation/tracking TSM caching Space recovery (no cache) Recovery of cache space Aggregate reconstruction Concurrent volume access Multi-session client restore Target for LAN-free backup Can be used for copy pools Migration/stg pool backup Parallel migration processes Migration to collocated pool Pools can span file systems Database regression Random-access Random 4KB blocks Optional (backup overhead) When file is deleted/moved When space is needed Not supported Yes One session for all volumes Not supported No By node and file space Yes Very efficient Yes Must audit all volumes Sequential-access Sequential within volume Not supported When volume is reclaimed Not applicable During volume reclamation Not supported One session per volume Yes, via SANergy Yes By volume No Moderately efficient No Reuse delay avoids audit 10 5

Sequential-access Storage Optimizing restore from sequential-access storage such as tape can be a challenge Mounting of storage volumes Positioning to data to be restored Fragmentation Techniques described in this section can be applied to sequential disk storage, but are most beneficial for other sequential storage media 11 Fragmentation of Data in Sequential Storage Pools Active Inactive Time Expired Tape volumes Data required for a restore can be fragmented Within a volume Across multiple volumes Fragmentation increases restore time 12 6

Reducing Fragmentation of Sequential Media Expiration and reclamation Collocation Limit number of versions File-system image backup and restore Full backups 13 Expiration and Reclamation Initial backup Expiration deletes files per policy Reclamation consolidates data in sequential storage pools leading to Reduced storage requirement Improved restore performance Perform expiration and reclamation regularly to reduce fragmentation Incremental backups File expiration Automated space reclamation Incremental backups Time 14 7

Collocation Minimizes number of sequential-access volumes used for a client Node File space Directory tree (using virtual mount point and collocation by file space) Improves restore performance by reducing volume mounts Can minimize volume contention during concurrent restores of data for multiple nodes or file spaces Reduces benefits of multi-session restore operations Increases number of volumes required Can increase processing for data transfer to collocated media 15 Limit Number of Versions Older versions expire earlier allowing reclamation to consolidate remaining data Restore operations locate data faster because there are fewer versions to skip Tailor policies for critical data to retain necessary versions while also considering impact to restore 16 8

File-system Image Backup and Restore Optimizes restore of entire file system by eliminating fragmentation of data to be restored Can be combined with incremental backups to allow restore of file system to arbitrary point in time Incremental backups may be fragmented which extends restore time Perform image backups often enough to minimize impact of fragmentation (before more than 20% of data has changed) If available, snapshot technology minimizes impact to client applications 17 Full Backups Periodic full selective backup collocates data, which can greatly improve restore time Consider using a different node for the full backup Avoids versioning files that have not changed To restore to point in time using two nodes Restore from last full backup previous to required point in time Restore files that changed since full backup Could be used for critical file spaces or directory trees 18 9

Minimizing the Effects of Fragmentation Move Nodedata command Multi-session restore Parallel restore operations Separate node to optimize restore of active data 19 Move Nodedata Command Moves files for specified node, file space, and data type residing in a specified sequential-access storage pool Prepares for rapid client restore by Consolidating node data within a sequential-access pool Moving data to disk for fast access Reconstruction option removes unused space within aggregates Requires lead time before restore Consolidation Files for node BOB Movement to Disk Files for node BOB move nodedata bob fromstgpool=tapepool move nodedata bob fromstgpool=tapepool tostgpool=diskpool 20 10

Multi-session Restore Session 1 Session 2 Session 3 Storage pool volumes Client Server Restore uses multiple client-server sessions for a single file space Provides improved throughput Limited by Number of sequential-access volumes with data to be restored plus one session for random-access disk Mount points Client's resourceutilization option Most effective when data is spread over multiple sequential volumes Works only for no-query restore (unrestricted wildcard) 21 Parallel Restore Operations Restore 1 Restore 2 Restore 3 Client Server Divide data to be restored and use separate restore operation for each data subset File spaces Directory trees May cause volume contention unless Data can be restored from random-access disk OR Data required for each restore is located on separate sequential volume (e.g., collocation by file space) 22 11

Separate Node to Optimize Restore of Active Data Incremental backup of node X Incremental backup of node X_ACTIVE Client Server Node X For restore of inactive versions Progressive incremental backup Data stored in disk-tape hierarchy Policy allows active/inactive versions as required Node X_ACTIVE For optimal restore of active versions Progressive incremental backup Data stored in FILE storage pool to allow aggregate reconstruction Policy allows only active version (or possibly active plus small number of inactive versions) 23 Special Situations 24 12

Using NDMP to Protect and Recover NAS Data High-performance, scalable backup of NAS file servers using Network Data Management Protocol (NDMP) Full file-system image Differential file-system image (files that have changed since last full backup) NDMP-controlled restore Full file-system image Full file-system image plus one differential image Selected files and/or directories using the image table of contents Data flow for backup/restore is LAN-free and outboard of TSM client/server 25 Classic vs. No-query Restore Classic restore processing 1. Client queries server for information about files to be restored 2. Server sends file information to client 3. Client sorts files by storage location 4. Client sends restore request for each file in optimal restore order 5. Server sends each specified file to client No-query restore (NQR) processing 1. Client sends restore specification to server 2. Server sends matching files to client 26 13

Classic vs. No-query Restore (continued) Classic vs. NQR tradeoffs NQR reduces client-server interaction and client memory requirement NQR allows multi-session restores, but classic restore does not NQR is usually faster for restoring entire file systems or large directories Classic restore may be faster for restoring directories if data is highly fragmented across sequential volumes with only a small amount of data on each volume NQR is automatically used if both of the following apply The file specification is an unrestricted wildcard (/home/mydata/*) None of the following are used: inactive, latest, pick, fromdate, todate To force classic restore, use?* in the file specification (home/mydata/?*) 27 LTO Tape Apply techniques as for other sequential media IC33920 TSM server uses read rather than locate to improve performance for positioning to fragmented data Not available on Windows Recent Windows LTO device driver (5.0.6.4) can improve performance significantly for Gen 1 drives Use LTO Gen 2 rather than Gen 1 Faster streaming Improved locate performance Improved I/O for Windows Note that larger capacity of Gen 2 media may reduce potential benefit of multi-session restore 28 14

Potential Future Enhancements Collocation groups Disk storage enhancements Collocation of active data Restore of individual files from file system image Increased tape block size for Windows 29 15