DEDUPLICATION BASICS

Similar documents
Preserving the World s Most Important Data. Yours. SYSTEMS AT-A-GLANCE: KEY FEATURES AND BENEFITS

WHITE PAPER: ENTERPRISE SOLUTIONS. Disk-Based Data Protection Achieving Faster Backups and Restores and Reducing Backup Windows

RECOVERY SCALABLE STORAGE

Archiving, Backup, and Recovery for Complete the Promise of Virtualisation Unified information management for enterprise Windows environments

Symantec Backup Exec Blueprints

HP Dynamic Deduplication achieving a 50:1 ratio

Symantec NetBackup 7 for VMware

The Business Case to deploy NetBackup Appliances and Deduplication

EMC DATA DOMAIN PRODUCT OvERvIEW

Reducing Costs in the Data Center Comparing Costs and Benefits of Leading Data Protection Technologies

Managing Data Growth and Storage with Backup Exec 2012

Protect enterprise data, achieve long-term data retention

Quest DR Series Disk Backup Appliances

Data Protection for Cisco HyperFlex with Veeam Availability Suite. Solution Overview Cisco Public

1 Quantum Corporation 1

Veeam Availability Solution for Cisco UCS: Designed for Virtualized Environments. Solution Overview Cisco Public

Balakrishnan Nair. Senior Technology Consultant Back Up & Recovery Systems South Gulf. Copyright 2011 EMC Corporation. All rights reserved.

White paper: Agentless Backup is Not a Myth. Agentless Backup is Not a Myth

Symantec NetBackup 5200

EMC DATA DOMAIN OPERATING SYSTEM

Backup and Recovery Best Practices

Data safety for digital business. Veritas Backup Exec WHITE PAPER. One solution for hybrid, physical, and virtual environments.

ECONOMICAL, STORAGE PURPOSE-BUILT FOR THE EMERGING DATA CENTERS. By George Crump

50 TB. Traditional Storage + Data Protection Architecture. StorSimple Cloud-integrated Storage. Traditional CapEx: $375K Support: $75K per Year

Quest DR Series Disk Backup Appliances

Data Sheet FUJITSU Storage ETERNUS CS200c S3

Optimizing and Managing File Storage in Windows Environments

Backup Appliances. Geir Aasarmoen og Kåre Juvkam

Backup and archiving need not to create headaches new pain relievers are around

Oracle Zero Data Loss Recovery Appliance (ZDLRA)

HP StorageWorks D2D Backup Systems and StoreOnce

Technology Insight Series

Symantec Backup Exec Blueprints

Symantec Backup Exec 2012

LTO and Magnetic Tapes Lively Roadmap With a roadmap like this, how can tape be dead?

Understanding Virtual System Data Protection

VMware vsphere Data Protection 5.8 TECHNICAL OVERVIEW REVISED AUGUST 2014

Hitachi Adaptable Modular Storage and Hitachi Workgroup Modular Storage

Protecting Microsoft SharePoint

Integrating RDX QuikStation into QNAP NAS Backup

Deep Dive on SimpliVity s OmniStack A Technical Whitepaper

Tivoli Storage Manager for Virtual Environments: Data Protection for VMware Solution Design Considerations IBM Redbooks Solution Guide

Hyper-converged Secondary Storage for Backup with Deduplication Q & A. The impact of data deduplication on the backup process

Lab Validation Report

Scalar i500. The Intelligent Midrange Tape Library Platform FEATURES AND BENEFITS

HP s VLS9000 and D2D4112 deduplication systems

Technology Insight Series

WHITE PAPER. DATA DEDUPLICATION BACKGROUND: A Technical White Paper

The storage challenges of virtualized environments

Thinking Different: Simple, Efficient, Affordable, Unified Storage

Arcserve Cloud Frequently Asked Questions

Tyler Orick SJCOE/CEDR Joan Wrabetz Eversync Solutions

Cost savings of disk-based backup using a Dell PowerVault DL Backup to Disk Appliance powered by Symantec Backup Exec 2010 R2 vs.

Mostafa Magdy Senior Technology Consultant Saudi Arabia. Copyright 2011 EMC Corporation. All rights reserved.

Backup and Restore Strategies

Chapter 1. Storage Concepts. CommVault Concepts & Design Strategies:

DELL EMC DATA DOMAIN SISL SCALING ARCHITECTURE

DELL EMC Backup and recovery hardware and software solutions overview.

Combining HP StoreOnce and HP StoreEver Tape

Data Sheet FUJITSU Storage ETERNUS CS200c S4

Credit: Dell/ExaGrid/MR2 Technical Lunch Event, Downtown Los Angeles. ExaGrid Systems (Up to 130TB per GRID) -

TOP REASONS TO CHOOSE DELL EMC OVER VEEAM

Backup Appliances: Key Players and Criteria for Selection

WHITE PAPER. How Deduplication Benefits Companies of All Sizes An Acronis White Paper

Copyright 2010 EMC Corporation. Do not Copy - All Rights Reserved.

See what s new: Data Domain Global Deduplication Array, DD Boost and more. Copyright 2010 EMC Corporation. All rights reserved.

IM-B23: What s New with NetBackup Appliances

Dell DR4000 Replication Overview

Hitachi Adaptable Modular Storage and Workgroup Modular Storage

IBM ProtecTIER and Netbackup OpenStorage (OST)

How to solve your backup problems with HP StoreOnce

NetBackup & Backup Exec OST Configuration Guide Instructions

QuickSpecs. HP StoreOnce D2D Backup Systems. Overview. Retired

Simplify Backups. Dell PowerVault DL2000 Family

WHY DO I NEED FALCONSTOR OPTIMIZED BACKUP & DEDUPLICATION?

Protect Your Data At Every Point Possible. Reduce risk while controlling costs with Dell EMC and Intel #1 in data protection 1

De-dupe: It s not a question of if, rather where and when! What to Look for and What to Avoid

DXI-SERIES CORE CONCEPTS EXPLAINED for systems with DXi 2.3 and later software

Accelerate the Journey to 100% Virtualization with EMC Backup and Recovery. Copyright 2010 EMC Corporation. All rights reserved.

Construct a High Efficiency VM Disaster Recovery Solution. Best choice for protecting virtual environments

ADVANCED DEDUPLICATION CONCEPTS. Thomas Rivera, BlueArc Gene Nagle, Exar

Deduplication has been around for several

HP Simply StoreIT: Cut through the confusion of storage

VMware Backup and Replication using Vembu VMBackup

Veritas NetBackup 6.5 Clients and Agents

Executive Summary. The Need for Shared Storage. The Shared Storage Dilemma for the SMB. The SMB Answer - DroboElite. Enhancing your VMware Environment

Get More Out of Storage with Data Domain Deduplication Storage Systems

StorageCraft OneXafe and Veeam 9.5

SnapServer Family (XSD & XSR)

VMware vsan 6.6. Licensing Guide. Revised May 2017

DELL EMC DATA DOMAIN EXTENDED RETENTION SOFTWARE

Dell PowerVault DL2100 Powered by CommVault

Arcserve UDP 7000 Appliance Series Revolution, Not Just Evolution

Virtualizing SQL Server 2008 Using EMC VNX Series and VMware vsphere 4.1. Reference Architecture

Veritas NetBackup on Cisco UCS S3260 Storage Server

Cost savings of disk-based backup using the Dell PowerVault DL2100 powered by Symantec Backup Exec 2010 vs. tape-based backup

Zero Data Loss Recovery Appliance DOAG Konferenz 2014, Nürnberg

A Robust, Flexible Platform for Expanding Your Storage without Limits

EMC Data Domain for Archiving Are You Kidding?

Global Headquarters: 5 Speen Street Framingham, MA USA P F

Transcription:

DEDUPLICATION BASICS

4 DEDUPE BASICS 6 WHAT IS DEDUPLICATION 8 METHODS OF DEDUPLICATION 10 DEDUPLICATION EXAMPLE

12 HOW DO DISASTER RECOVERY & ARCHIVING FIT IN? 14 DEDUPLICATION FOR EVERY BUDGET QUANTUM DXi4000 Series 16 DEDUPLICATION FOR EVERY BUDGET QUANTUM NDX-8 and NDX-12 18 DEDUPLICATION FOR EVERY BUDGET QUANTUM RDX

4 DEDUPE BASICS Data deduplication is a breakthrough technology that has changed the way data is protected in businesses of all sizes. It can reduce the amount of backup data by up to 90%, which leads to smaller-capacity backup devices making disk-based backup more affordable. Additionally, deduplication dramatically reduces network traffic, enabling simple, cost-effective replication (copying of data) to remote sites for disaster recovery....reduce the amount of backup data by up to 90%

5 The end result is a comprehensive backup and disaster recovery solution that saves money and takes less time to manage, while also providing faster and easier data restoration. IN THE NEXT FEW PAGES, you ll see how deduplication works, how it fits into your existing data protection infrastructure, how it works with replication and disaster recovery, and a sampling of market-leading products in various price ranges that take advantage of this technology. PRIMARY BACKUP REPLICATION DEDUPLICATED DATA Deduplicated Data DEDUPLICATED DATA Deduplicated Data

6 WHAT IS DEDUPLICATION? The least efficient method of backup is repeatedly copying the data from your primary disk drives to backup disk drives. To save disk space, and therefore cost, you might choose to overwrite the previous backups, but if you need an earlier version of a file (such as before you inadvertently deleted the most important part), it will be gone. Standard backup software helps somewhat by saving only new files or files that have changed since previous backups (called an incremental backup or single-instance storage), but that is also inefficient because if you change a single word in a document, the entire document is still backed up again. DATA DEDUPLICATION takes a different approach. The software looks inside the files and stores only the unique file segments, called blocks, that aren t already backed up. This greatly reduces storage requirements over conventional backup processes.

7 When you want to retrieve the backed up file as it looked on a certain date, the software intelligently reconstructs it, putting all the changes in the right places, and presents you with the version you requested. To you, it looks like the system backed up the complete file each time, but in reality, it only backed up the file once and then added a few pieces here and there. As a result, a great deal of disk space is saved. 5 copies of same file Data Stored 5 files with unique title pages Data Stored Imagine five identical Powerpoint files stored on different computers. Deduplication will recognize the common content and will only back up one copy. Now, imagine that each person changed the first page of the PowerPoint presentation to include their name and the title, for example. Deduplication software will look into the components that make up each file, and will only copy the unique content.

8 METHODS OF DEDUPLICATION There are several approaches to deduplication, which can be combined in various ways. SOFTWARE-BASED Deduplication software runs either on backup clients or on media servers. Software-based dedupe reduces network traffic; however, this approach can tie up system resources and typically has slower performance than purpose-built dedupe appliances. APPLIANCE-BASED Backup data is sent to a purpose-built device, such as Quantum s DXi, and deduplication occurs there, using hardware and built-in software designed to optimize the deduplication process. This generally gives higher performance, and it doesn t increase the load on backup clients or media servers. In addition, appliances are compatible with existing backup applications, so a company can standardize software across all backup systems, including both disk and tape.

9 FILE-BASED Smaller-scale deduplication systems tend to be file-based and perform the action on each file separately. This reduces the processing power requirements, but doesn t save as much disk space as more sophisticated systems. For the example on page 7, file-based deduplication would have stored each PowerPoint presentation, in its entirety, as it recognizes duplicate files but does not look inside the files for duplicate content. GLOBAL DEDUPLICATION A common pool of file components is shared across machines and networks. The smallest components across this common pool are compared and only stored only once, thus providing the greatest savings.

10 A DEDUPE EXAMPLE As with any other technology, it s important to understand the business case for adopting deduplication. Backing up to disk is appealing, but with today s exponential data growth, it can quickly become very expensive. That s where data deduplication pays its way. If you can avoid storing redundant data while still ensuring data integrity, your system has a huge advantage. You not only save on hardware acquisition costs, but also on the ongoing costs of managing and maintaining these resources, perhaps at multiple locations. The advantage is even greater in virtual environments. Deduplication rates may climb as high as 100:1 because duplicate files, cloned machines and test machines significantly increase the amount of redundant data across the infrastructure.

11 AS AN EXAMPLE OF THE BENEFITS PROVIDED BY DEDUPLICATION, assume a company s data set begins at 3 terabytes and grows 40% a year. Management wants backups done daily, retaining the most recent data for 30 days and monthly backups retained for three years. By the end of this period, deduplication will have reduced the required disk space more than eightfold... Retaining Monthly Backups For 3 Years Year 1 Year 2 Year 3 Data To Protect, TB (40% annual growth) 3.0 4.2 5.9 Disk Capacity Used, TB (normal 2:1 backup compression) 5.6 17.1 42.5 Deduplicated Backups, TB (2:1 first backup, 20:1 subsequent) 2.1 3.2 4.8...a powerful way to control data growth and save money.

12 HOW DO DISASTER RECOVERY & ARCHIVING FIT IN? As we have seen, deduplication of your important data makes backup storage much more cost effective. But no backup system is complete without off-site storage, which can combine disaster recovery with long-term archiving. This is typically accomplished by replication, which sends a copy of the backup data over a wide-area network (WAN) to another site, or by transporting removable media, such as tape or removable disks. These approaches can also be combined, depending upon your company s particular needs. Replication is the most convenient and efficient solution, since it is usually automated. Before deduplication, bandwidth consumption made replication impractical, but now as only a small fraction of your raw data needs to be sent, replication is cost-effective and enjoys widespread adoption. This means you can have an off-site image of all your backup data every day.

13 Removable disk or tape is the more traditional method of disaster recovery protection and continues to be the primary method for long-term data retention, or archiving. Here, the physical media is taken to an off-site location on a regular basis, such as weekly. After a predetermined period of time, when sufficient new copies have been made, the media is brought back to the original site and reused (called a rotation cycle). This would not be done if the data is intended to be archived for long-term storage, such as for government regulation compliance. Tape can also play a role in the replication approach, where data that has been replicated to a disk array or data protection appliance is ultimately offloaded to a tape library for low-cost, long-term storage. This process too can be automated. Replication is the most convenient and efficient solution for disaster recovery

14 DEDUPLICATION FOR EVERY BUDGET Quantum DXi4000 Series The most powerful products for small and mid-sized businesses are the DXi4000 Series backup appliances. They range from 5TB to 135TB of usable capacity and offer Capacity-on-Demand (CoD) scalability by simply activating a software license. These appliances combine high performance and affordability. They are easy to install and integrate with all leading backup software. They can also be linked to larger DXi units at the data center, to centralize data protection from branch offices and provide easy archiving to tape libraries.

15 DXi4701 Affordable, high-performance deduplication appliances featuring Capacity-on-Demand scalability for fast, easy growth. Scalable capacities from 5-135TB usable. The DXi4701 offers Capacity-on-Demand (CoD) scalability from 5-135TB by simply activating a license key scalability with no additional hardware to purchase or install Provides up to 5.0TB/hour native performance faster than leading competitors Reduces typical disk capacity needs by 90% or more through deduplication Integrates smoothly with all leading backup applications, including NetBackup and Backup Exec

16 DEDUPLICATION FOR EVERY BUDGET Quantum NDX-8 and NDX-12 NAS Storage The Quantum NDX Series of network-attached storage (NAS) appliances for backup, disaster recovery and primary storage offer a complete storage and data protection solution for small and medium businesses. The NDX-8d and NDX-12d Data Protection Appliances utilize client-side deduplication, to hold 5x more data than nondeduplicated NAS, while reducing network traffic by 90%. Operations are centrally managed through the NAS, and there are no software agents to install on clients being backed up (agentless). The NDX-8d and NDX-12d back up Windows systems including desktops, servers, virtual machines, Exchange, SQL Server and SharePoint. The Quantum NDX-8 and NDX-12 NAS are designed for primary storage and boast more powerful processors and RAM than competitors, while featuring enterprise-class hard drives for ultimate speed and RAID reliability. NDX offers true business class storage that fits the budget of small and medium businesses.

17 NDX-8d and NDX-12d Data Protection Appliance with Deduplication. Protects two years of data or more, while maintaining greater history than tape or other disk Controls data growth, reducing the amount of data stored and network traffic by up to 90% through deduplication Centrally managed from the NAS unit through a simple, easy-to-use interface Comes standard with easy to use DATASTOR Shield deduplication software Agentless backup no software to install on the clients Designed for Windows environments NDX-8 and NDX-12 NAS Storage Business class storage for networked environments Use as a file server or as iscsi direct attached storage Faster, more reliable with Intel Dual Core i3 3.3GHz processor, enterprise-class HDDs and 4GB of RAM Compatible, consistent Windows storage server operating system Available in a tower or 1U rackmount configuration

18 DEDUPLICATION FOR EVERY BUDGET Quantum RDX The Quantum RDX and RDX 8000 disk-based removable storage systems are entry-level data protection solutions for small and mid-sized businesses. They combine the best features of disk fast access and fast restores with the best features of tape removability for disaster recovery and an easy-to-use backup process. The single-cartridge standalone RDX includes full-featured backup software that integrates data deduplication for a single server, while the RDX 8000 is an eight-slot library that optionally includes an enterprise-level version of the same software, with agentless, client-side deduplication to minimize network traffic. Both are plug-and-play so you can be up and running in minutes. Using removable, industry-standard RDX cartridges with interchangeable capacities (currently up to 1TB native capacity per cartridge before deduplication), the RDX systems make it easy to take data off-site and also scale up as your data grows.

19 RDX 8000 8-slot disk-based library that stores up to 8TB like a tape autoloader but uses RDX removable disk cartridges instead Take cartridges off-site for disaster recovery Available with easy to use DATASTOR Shield deduplication software, reducing cartridge use by 2/3 (also available without this software) Simple network-attached iscsi connectivity Ultimate scalability uses any RDX cartridge from 160GB to 1TB compatible with future capacities, increasing capacity instantly without having to buy more hardware

20 NOTES

NOTES 21

22 LEARN MORE For more information about data deduplication technology and data protection in general, consult a knowledgeable reseller in your area. He or she can analyze your backup needs and configure the right system for your company. For more details about the Quantum deduplication products mentioned in this booklet, visit www.quantum.com/dxi. In North America, you can also talk with a product representative in the Quantum Call Center, toll-free at 866-809-5230.

23 ABOUT QUANTUM Quantum is a proven global expert in Data Protection and Big Data management, providing specialized storage solutions for physical, virtual and cloud environments. From small businesses to major enterprises, more than 100,000 customers have trusted Quantum to help maximize the value of their data by protecting and preserving it over its entire lifecycle. With Quantum, customers can Be Certain they re able to adapt in a changing world keeping more data longer, bridging from today to tomorrow, and reducing costs. See how at www.quantum.com. www.quantum.com 1.866.809.5230 2014 Quantum Corporation. All rights reserved. Quantum, the Quantum logo, DXi, Scalar, StorNext and Vision are registered trademarks of Quantum Corporation and its affiliates in the United States and/or other countries. All other trademarks are the property of their respective owners.

BE CERTAIN ST00899A-v03 January 2014