DAR (DIRECT ACCESS RECOVERY) - A LOOK UNDER THE COVERS

Similar documents
WHITE PAPER: ENTERPRISE SOLUTIONS

MANAGING MULTI-VENDOR SANS WITH VERITAS SANPOINT CONTROL

VERITAS NetBackup 6.0 Enterprise Server INNOVATIVE DATA PROTECTION DATASHEET. Product Highlights

EMC Disk Library with NetApp Filers

The NetBackup catalog resides on the disk of the NetBackup master server. The catalog consists of the following parts:

INTRODUCING VERITAS BACKUP EXEC SUITE

VERITAS StorageCentral 5.2

Configuring Veritas NetBackup Version 5.1 for NDMP Remote Backup on a Solaris

The VERITAS VERTEX Initiative. The Future of Data Protection

IMPROVING THE PERFORMANCE, INTEGRITY, AND MANAGEABILITY OF PHYSICAL STORAGE IN DB2 DATABASES

Symantec NetBackup 7 for VMware

WHITE PAPER. Recovery of a Single Microsoft Exchange 2000 Database USING VERITAS EDITION FOR MICROSOFT EXCHANGE 2000

Quest NetVault Backup Plug-in for NDMP. Application Notes for Dell FluidFS NAS Appliances. NCG x-EN-01 03/20/13

USING ISCSI AND VERITAS BACKUP EXEC 9.0 FOR WINDOWS SERVERS BENEFITS AND TEST CONFIGURATION

Designing Data Protection Strategies for Oracle Databases

Veritas InfoScale Enterprise for Oracle Real Application Clusters (RAC)

Veritas NetBackup for NDMP Administrator's Guide

Veritas NetBackup for NDMP Administrator's Guide

VERITAS Storage Foundation 4.0 TM for Databases

EMC CLARiiON Backup Storage Solutions

Designing Data Protection Strategies for Oracle Databases

SwiftStack Object Storage

TECHNICAL OVERVIEW. Solaris 2.6, 7, 8, 9 with VxFS 3.4 HP-UX 11, 11.11, 11i with Online JFS 3.3 SGI/IRIX F with XFS

VERITAS Volume Replicator. Successful Replication and Disaster Recovery

VERITAS NetBackup 6.0 Advanced Client

VERITAS Volume Manager for Windows 2000

Veritas NetBackup Copilot for Oracle Configuration Guide. Release 3.1 and 3.1.1

Simplified Storage Migration for Microsoft Cluster Server

Implementing an NDMP Backup Solution Using NetBackup 6.5 and 7.0 on NetApp Storage R.Balakrishnan, NetApp January 2010 TR-3815

Admin Plus Pack Option. ExecView Web Console. Backup Exec Admin Console

NetBackup Copilot for Oracle Configuration Guide. Release 2.7.1

An Automated DVD-R Recording Solution for Veritas' Backup Exec

Backup Exec 9.0 for Windows Servers. SAN Shared Storage Option

Chapter 11: File System Implementation

WHITE PAPER. VERITAS Edition for Microsoft Exchange Migrating Your Exchange Storage To a Storage Area Network with

Chapter 11: File System Implementation

Data Domain OpenStorage Primer

Chapter 11: Implementing File Systems

Reducing Costs in the Data Center Comparing Costs and Benefits of Leading Data Protection Technologies

Veritas NetBackup 6.5 Clients and Agents

PERFORMANCE TUNING TECHNIQUES FOR VERITAS VOLUME REPLICATOR

Setting Up the Dell DR Series System as an NFS Target on Amanda Enterprise 3.3.5

BACKUP AND RECOVERY FOR ORACLE DATABASE 11g WITH EMC DEDUPLICATION A Detailed Review

Configuring Veritas NetBackup Version 4.5 for NDMP

Technology Insight Series

Setting Up the DR Series System as an NFS Target on Amanda Enterprise 3.3.5

Veritas Storage Foundation for Windows by Symantec

Data Protection and Synchronization for Desktop and Laptop Users VERITAS BACKUP EXEC 9.1 FOR WINDOWS SERVERS DESKTOP AND LAPTOP OPTION

Chapter 11: Implementing File Systems

Data Sheet: Storage Management Veritas Storage Foundation for Oracle RAC from Symantec Manageability and availability for Oracle RAC databases

VERITAS Storage Foundation 4.0 for Windows

Veritas NetBackup OpenStorage Solutions Guide for Disk

VERITAS NetBackup for SAP R/3 Environment Protection

Chapter 2 CommVault Data Management Concepts

VERITAS Cluster Server. QuickStart. Product Overview

EMC Data Domain for Archiving Are You Kidding?

Unicode and the Implications of Its Implementation

Veritas NetBackup Copilot for Oracle Configuration Guide. Release 2.7.2

Quest NetVault Backup Plug-in for SnapMirror To Tape. User s Guide. version 7.6. Version: Product Number: NTG EN-01 NTG

EMC DiskXtender for NAS Release 3.1

NetApp AltaVault Cloud-Integrated Storage Appliances

BEST PRACTICES GUIDE FOR DATA PROTECTION WITH FILERS RUNNING FCP

Controlling Costs and Driving Agility in the Datacenter

VERITAS SANPoint Storage Appliance Overview of an Open Platform for the Implementation of Intelligent Storage Server

VERITAS Backup Exec for Windows NT/2000 Intelligent Disaster Recovery

Veritas NetBackup Copilot for Oracle Configuration Guide. Release 2.7.3

IM B09 Best Practices for Backup and Recovery of VMware - DRAFT v1

VERITAS Foundation Suite for HP-UX

EMC Solutions for Backup to Disk EMC Celerra LAN Backup to Disk with IBM Tivoli Storage Manager Best Practices Planning

Comparison: Microsoft Logical Disk Manager (LDM) and VERITAS Volume Manager

Chapter 14: File-System Implementation

Using Symantec NetBackup 6.5 with Symantec Security Information Manager 4.7

Oracle Secure Backup 12.1 Technical Overview

Veritas NetBackup Appliance Fibre Channel Guide

Cutting Backup and Recovery Time in Half with Solutions from Symantec

Optimizing and Managing File Storage in Windows Environments

Plug-in for VMware vcenter

Backup Exec Subscription Licensing Guide

NetApp AltaVault Cloud-Integrated Storage Appliances

EMC DATA DOMAIN OPERATING SYSTEM

Quest NetVault Backup Plug-in for NDMP

Enterprise Vault Whitepaper Enterprise Vault Integration with Veritas Products

WHY DO I NEED FALCONSTOR OPTIMIZED BACKUP & DEDUPLICATION?

Quest DR Series Disk Backup Appliances

Monitoring NetApp Products

Veritas Storage Foundation for Windows by Symantec

Chapter 11. SnapProtect Technology

Data Protection for Virtualized Environments

OPERATING SYSTEM. Chapter 12: File System Implementation

1 Backup. Introduction

WHITE PAPER. Oracle Backup and Recovery Essentials INFORMATION THAT EVERY ORACLE DATABASE ADMINISTRATOR SHOULD KNOW

Symantec NetBackup PureDisk Compatibility Matrix Created August 26, 2010

Best Practices for Using Symantec Online Storage for Backup Exec

Contingency Planning and Disaster Recovery

Executive Summary SOLE SOURCE JUSTIFICATION. Microsoft Integration

CS720 - Operating Systems

NetVault Backup Client and Server Sizing Guide 3.0

VERITAS StorageCentral 5.2

NetVault Backup Client and Server Sizing Guide 2.1

Symantec Design of DP Solutions for UNIX using NBU 5.0. Download Full Version :

Transcription:

DAR (DIRECT ACCESS RECOVERY) - A LOOK UNDER THE COVERS B. Roth April 22, 2003 VERITAS ARCHITECT NETWORK

TABLE OF CONTENTS Summary... 3 An Introduction to NDMP... 3 NDMP History... 3 How NDMP Works... 4 DAR... 5 Dissecting fh_info... 5 The Data Recovery Interface... 7 File History and the Data Management Application... 8 Disabling DAR... 8 Comparing DAR and non-dar Single File Recovery A Case Study... 9 The Results... 9 Conclusion... 10 VERITAS ARCHITECT NETWORK 2

Summary The Network Data Management Protocol (NDMP) is an open communications standard, developed to improve interoperability between network attached storage (NAS) filers, tape equipment, and backup and recovery software. Direct Access Recovery (DAR) is an optional capability of NDMP that addresses the need to quickly restore a single file from a stream of backup data containing potentially millions of individual files. Vendors of backup and recovery software - known as data management applications (DMA) in the parlance of NDMP have, until recently, failed to address shortcomings in handling DAR processing. For storage administrators this has resulted in a trade-off between faster backups and faster file recovery. The single-file recovery capabilities provided by DAR rely on file history information generated at the NAS device during a backup operation. When a NAS volume contains predominantly small files the file history information can overload the DMA, slowing the performance of backup processing. For storage administrators, under continual pressure to fit backup processing within shorter and shorter windows of time, disabling the generation of file history information, and consequently disabling DAR capabilities, provides a substantial boost in backup performance. The penalty, however, is dramatically slower single-file recovery. This article looks under the covers of NDMP, providing insight into the mechanics of DAR and the important role played by the NDMP-compliant DMA catalog and data recovery interface. Armed with this information administrators will be in a better position to make intelligent choices about how to accomplish efficient single file restore in an NDMP managed environment. An Introduction to NDMP The maturing of network storage architectures, and the insatiable appetite of business applications for storage capacity, have created a buoyant market for NAS devices. These products have been widely deployed within the enterprise. In their wake, however, they have created a significant interoperability problem. Network-based backup and recovery processing has traditionally required the installation of client-side binaries on the storage platform. The client software processes command and control instructions and channels data to a backup medium, usually tape. The availability of many different NAS device implementations, using operating system software that ranges from the standard to the obscure - with a liberal scattering of the proprietary thrown in for good measure has created difficulties for vendors of DMA software. Coding, maintaining, and supporting client binaries for every NAS environment is simply beyond the capabilities of the DMA vendors. Storage administrators, tasked with protecting enterprise data resources, cannot exclude a storage device from the backup schedule just because the DMA vendor does not support it. This leaves administrators with two choices: backup via scalability limiting sneaker-net physically walking to each NAS device, inserting a tape, and entering backup commands; or time-consuming and bandwidthhogging NFS and CIFS volume mounts across the network. NDMP History NDMP was developed to solve the data management interoperability problem. The protocol provides benefits to all parties involved in the backup and recovery of storage resources: vendors of DMA software no longer need to develop and support client-side code for every conceivable NAS environment; vendors of NAS devices and tape equipment need not worry about third-party software running on their products, potentially creating performance and reliability problems outside of their VERITAS ARCHITECT NETWORK 3

control; and storage administrators gain centralized control of heterogeneous, distributed storage devices, and a greater selection of standards-based DMA products. How NDMP Works Figure 1: Simple NDMP Configuration. NDMP separates data management intelligence from the physical location of storage hardware. The architecture of NDMP is based on a client-server model and minimizes the need for administrative logic at NAS and tape devices, such as scheduling and media management. The DMA is an NDMP client, and initiates the physical transfer of backup data at an NDMP server (see Figure 1). Backup data flows between the NDMP data and tape services at the NDMP server, without interruption from the DMA. Separating command information from physical backup data, NDMP significantly improves a storage administrator s ability to support backup and recovery processing for many NAS resources. A single, centrally located DMA is able to manage backup and recovery processing for heterogeneous NAS devices. VERITAS ARCHITECT NETWORK 4

DAR DMA software determines what NAS device contents are written to an NDMP backup stream but it is not aware of the location of individual data files within the stream. Recovering a single file from an NDMP backup is time consuming. The entire backup tape must be read to locate the individual file to be restored. DAR overcomes this problem. Verifying NAS and DMS Implementation Limits Using DAR-enabled recovery, the DMA is able to specify the exact location of a file in a backup data stream. The NDMP server need only read the data applicable to the single file being restored, reducing the amount of information processed and significantly reducing recovery time. The key enabling technology of DAR is the NDMP file history interface. The NDMP server on the NAS device returns fh_info data to the DMA during a backup operation. The fh_info data contains the byte- or record-offset of a file relative to the start of the backup stream. This information can later be used to quickly locate the file on tape. The format of the file history data sent from the NDMP server to the DMA depends on the type of backup operation. File name backup methods, like tar and cpio, generate "add path" formatted history messages, Although the NDMP standard has been widely adopted by both NAS equipment manufacturers and DMA software vendors, support for DAR may not be as common. Check with your DMA and NAS vendors to see if DAR can be enabled in your environment. Directory level DAR provides support for recovery of an entire subdirectory from an NDMP backup. As with basic DAR some NAS and DMA vendors may not support directory level DAR. Check with your vendor before assuming that subdirectory recovery is available. A flexible response to DAR support can provide administrators with the tools they need to perform DARbased recovery of multiple files during a single recovery session, and allow those files to exist in different subdirectories. Once again, assuming that a NAS device or DMA product can provide this level of support may not be wise. Check with your vendor to see if multi-file and multisubdirectory recoveries are supported. requiring no additional processing at the DMA. However, i-node backup methods, such as dump, generate "add directory or "add node" formatted messages. Because the full pathname of a file is not available at the time of an i-node type backup, the DMA must perform additional conversion to pathbased format, enabling administrators initiating recovery operations to easily interpret the data. Dissecting fh_info Figure 2: Sample Unix File Structure. The details within the fh_info data generated by the NDMP server can best be illustrated using an example. Figure 2 shows a simple Unix file structure. VERITAS ARCHITECT NETWORK 5

Figure 3: Ls lir Output. The results of running a recursive i-node list command (ls lir) on the example file system, when mounted to a Solaris 9 client, are displayed in Figure 3. Executing a full volume NDMP backup (dump) on the sample file system prompts the NDMP server at the NAS device to transmit external data representation (XDR) encoded i-node file history data over a TCP/IP connection to the DMA (Figure 4). This information must be received, decoded from XDR, processed, and then stored in a catalog or database for subsequent use in a recovery. The DMA must also record tape volume information along with i-node and fh_info data, to avoid confusion when recovering data from multi-volume backups. Figure 4: I-node Output From NDMP Server Using Dump Backup Command. The output from the dump command (Figure 4) contains the following information for the highlighted file:! i-node number of the file (node:99)! file name (sample_file_2) this data is obtained by cross-referencing fh_info data, from Figure 4, with ls lir command output, from Figure 3, using i-node as a key VERITAS ARCHITECT NETWORK 6

! tape positioning information used by DAR (fh_info:2189312)! NDMP file system type (s_fs_type:0)! file type (ftype:4)! file modification time (mtime:1041532387)! file last access time (atime:1041532387)! file status modification time (ctime:1041532387)! file owner (owner:0)! file group (group:1)! system file attribute (fattr:81a4)! file size, in bytes, (size:23)! number of links to this file (links:0) The Data Recovery Interface When a data recovery operation is initiated the DMA instructs the NDMP tape service to load the tape volume containing the file to be restored. The tape header information is then read, to validate that the correct tape has been loaded, and the NDMP Mover is prepared. The Mover is a component of the NDMP tape service responsible for transferring data from the backup tape to the NDMP data service which eventually writes the data back to the NAS file system. The DMA uses i-node and fh_info values, for the file being recovered, to establish the NDMP Mover Window. The NDMP Mover Window represents a range of contiguous blocks to be read by the Mover. During non-dar recovery the Mover Window will cover the entire contents of the backup tape. DAR-base recovery uses the file history information, stored by the DMA during backup processing, to confine reading of the tape to blocks that contain data relevant to the recovery of the appropriate file. This significantly reduces the amount of information read from tape. Figure 5: NDMP Tape Server Output. VERITAS ARCHITECT NETWORK 7

Figure 5 shows a sample log of data captured from the NDMP tape server during recovery. Note that the DIRECT environment variable has been set to Y, and that the fh_info and i-node data from the catalog have been passed to the NDMP tape server. File History and the Data Management Application Figure 6: File History Producer-Consumer Relationship The NDMP standard dictates that during backup processing the NDMP server on the NAS device and the DMA exist in a producer-consumer relationship. This linkage means that if the DMA cannot consume file history information at the same rate that the NAS device produces it, the entire process slows to match the pace of DMA consumption. For NAS devices containing relatively large files, the impact of DMA file history processing is negligible. However, small files, like the 23-byte file sample_file_2 in the previous example, can generate more file history information than backup data. If a NAS volume contains predominantly small files, and the DMA is not configured to efficiently receive, decode, process and store fh_info data, the processing of file history information is likely to significantly slow backup processing. Disabling DAR The importance of efficient processing of file history data at the DMA cannot be overstated. NAS equipment manufacturers, sensitive to the needs of their storage administrator customers, have responded to complaints about backup performance with a switch to turn off the generation of file history information at the NDMP server. By setting the HIST environment variable to N, at the DMA platform, a storage administrator can quickly improve backup performance. This performance boost can prove costly, however, if single-file recovery is later required. VERITAS ARCHITECT NETWORK 8

Comparing DAR and non-dar Single File Recovery A Case Study The following test case uses a simple configuration to demonstrate the difference between DAR and non-dar single file recovery. The test case configuration consisted of: a Network Appliance F760 filer, running Data ONTAP version 6.2.1, with a single fibre channel attached disk shelf containing seven 72 GB disk drives; a SunBlade 100 platform, running Solaris 9, connected to the filer with a 100Base-Tx crossover cable; and an IBM 3483 L18 tape library, containing a single IBM 3580 LTO tape drive, directly attached to the filer by SCSI cable. A two-drive root volume and five-drive data volume were configured on the seven NAS filer disks. The data volume - /vol/vol1 - was populated using a shell script that recursively copied the Solaris /usr/sbin directory contents to new folders immediately under the /vol/vol1 root directory. The shell script created 10,572 copies of /usr/sbin, each in its own folder. The output from a df bk command, showing volume free blocks, files, and allocation statistics for the root directory, can be seen in Figure 7. Figure 7: df bk command on sample volume /vol/vol1 Two full volume backups were executed on the NAS filer using VERITAS NetBackup 4.5 software, feature pack 3. The first backup was written to tape volume 7AATL1, the second began on tape volume 7AATL1 and completed on tape volume 8AATL1. The Results The table in Figure 8 shows the results of recovering a single file from the backup tapes under a variety DAR, non-dar, and multi-tape configurations. In all cases in the demonstration the DAR-enabled single file restore completed in substantially less time than the non-dar enabled restore. While corner cases may exist that show elongated restore durations for DAR recoveries, these were not identified or investigated as part of this study. VERITAS Netbackup 4.5 Software and DAR VERITAS NetBackup 4.5 software has been significantly improved to address growing customer demand for efficient single file recovery processing. The algorithms that manage file history data associated with DAR have been reengineered, enabling information transmitted to NetBackup software by the NDMP server to be processed much faster. A new high-performance binary catalog has replaced the older ASCII catalog used by prior versions of NetBackup software. The binary catalog has increased the efficiency of file history store and retrieval operations and reduced the volume of storage space consumed by each backup. Informal tests of these changes show a 10-times improvement in efficiency of NetBackup software DAR processing. VERITAS ARCHITECT NETWORK 9

Figure 8: Comparing DAR enabled and DAR disabled restores Conclusion The popularity of the NDMP standard with storage vendors is an indication of how positively enterprise storage customers have responded to the benefits of greater interoperability. But, if the vendors of DMA software do not improve file history processing to eliminate the backup performance issues, many of the benefits of NDMP will never be realized. DAR offers administrators, without question, the fastest method of restoring a single file from an NDMPgenerated backup. Unfortunately, this is not where the story ends. For storage administrators to successfully employ DAR in their networked storage environments, DMA vendors must improve DAR support in their products. VERITAS ARCHITECT NETWORK 10

VERITAS Software Corporation Corporate Headquarters 350 Ellis Street Mountain View, CA 94043 650-527-8000 or 866-837-4827 For additional information about VERITAS Software, its products, VERITAS Architect Network, or the location of an office near you, please call our corporate headquarters or visit our Web site at www.veritas.com. Copyright (c) 2003 VERITAS Software Corporation. All rights reserved. VERITAS, the VERITAS Logo, NetBackup, and all other VERITAS product names and slogans are trademarks or registered trademarks of VERITAS Software Corporation. VERITAS, the VERITAS Logo, and NetBackup Reg. U.S. Pat. & Tm. Off. Other product names and/or slogans mentioned herein may be trademarks or registered trademarks of their respective companies.