Why In-Place Copy Data Management is The Better Choice

Similar documents
Catalogic DPX: Backup Done Right

The Data-Protection Playbook for All-flash Storage KEY CONSIDERATIONS FOR FLASH-OPTIMIZED DATA PROTECTION

What s New in Catalogic ECX 2.5

Understanding Virtual System Data Protection

ECONOMICAL, STORAGE PURPOSE-BUILT FOR THE EMERGING DATA CENTERS. By George Crump

Solution Brief: Commvault HyperScale Software

Symantec NetBackup 7 for VMware

Inside Track Research Note. In association with. Everyone s talking about Dedupe Where does dedupe and data compression make sense?

Red Hat Enterprise Virtualization (RHEV) Backups by SEP

Hedvig as backup target for Veeam

How to Protect Your Small or Midsized Business with Proven, Simple, and Affordable VMware Virtualization

Hyper-Converged Infrastructure: Providing New Opportunities for Improved Availability

Composable Infrastructure for Public Cloud Service Providers

Scale-Out Architectures for Secondary Storage

IBM Spectrum Protect Plus

The Technology Behind Datrium Cloud DVX

HYCU and ExaGrid Hyper-converged Backup for Nutanix

Data Protection for Virtualized Environments

XTREMIO: TRANSFORMING APPLICATIONS, ENABLING THE AGILE DATA CENTER

Dell DR4000 Replication Overview

Discover the all-flash storage company for the on-demand world

Flash Decisions: Which Solution is Right for You?

Modernize Your Backup and DR Using Actifio in AWS

Optimizing and Managing File Storage in Windows Environments

The Data Protection Rule and Hybrid Cloud Backup

PRESENTATION TITLE GOES HERE

Deep Dive on SimpliVity s OmniStack A Technical Whitepaper

Veeam with Cohesity Data Platform

Simplifying HDS Thin Image (HTI) Operations

Hyper-converged Secondary Storage for Backup with Deduplication Q & A. The impact of data deduplication on the backup process

Considerations to Accurately Measure Solid State Storage Systems

Business Benefits of Policy Based Data De-Duplication Data Footprint Reduction with Quality of Service (QoS) for Data Protection

Veritas NetBackup Appliance Family OVERVIEW BROCHURE

De-dupe: It s not a question of if, rather where and when! What to Look for and What to Avoid

Cisco HyperFlex and the F5 BIG-IP Platform Accelerate Infrastructure and Application Deployments

DEDUPLICATION BASICS

HP Dynamic Deduplication achieving a 50:1 ratio

PURITY FLASHRECOVER REPLICATION. Native, Data Reduction-Optimized Disaster Recovery Solution

Availability and the Always-on Enterprise: Why Backup is Dead

Cohesity Architecture White Paper. Building a Modern, Web-Scale Architecture for Consolidating Secondary Storage

VMware vsan 6.6. Licensing Guide. Revised May 2017

WHITE PAPER PernixData FVP

Why Datrium DVX is Best for VDI

White paper ETERNUS CS800 Data Deduplication Background

São Paulo. August,

StarWind Virtual SAN Free

HCI: Hyper-Converged Infrastructure

VMWARE PROTECTION WITH DELL EMC NETWORKER 9

ADVANCED DEDUPLICATION CONCEPTS. Thomas Rivera, BlueArc Gene Nagle, Exar

Cisco HyperConverged Infrastructure

Managing Data Growth and Storage with Backup Exec 2012

MODERNISE WITH ALL-FLASH. Intel Inside. Powerful Data Centre Outside.

NetApp SolidFire and Pure Storage Architectural Comparison A SOLIDFIRE COMPETITIVE COMPARISON

EBOOK. NetApp ONTAP Cloud FOR MICROSOFT AZURE ENTERPRISE DATA MANAGEMENT IN THE CLOUD

IBM Real-time Compression and ProtecTIER Deduplication

Actifio Test Data Management

Protect enterprise data, achieve long-term data retention

HPE Synergy HPE SimpliVity 380

1 Quantum Corporation 1

VMWARE VSAN LICENSING GUIDE - MARCH 2018 VMWARE VSAN 6.6. Licensing Guide

FOCUS ON THE FACTS: SOFTWARE-DEFINED STORAGE

Warsaw. 11 th September 2018

UNLEASH YOUR APPLICATIONS

Nutanix Tech Note. Virtualizing Microsoft Applications on Web-Scale Infrastructure

EMC Integrated Infrastructure for VMware. Business Continuity

Protecting Mission-Critical Application Environments The Top 5 Challenges and Solutions for Backup and Recovery

Rok: Decentralized storage for the cloud native world

Take Back Lost Revenue by Activating Virtuozzo Storage Today

Modernize Without. Compromise. Modernize Without Compromise- All Flash. All-Flash Portfolio. Haider Aziz. System Engineering Manger- Primary Storage

Remove complexity in protecting your virtual infrastructure with. IBM Spectrum Protect Plus. Data availability made easy. Overview

Commvault with Cohesity Data Platform

Veritas Storage Foundation for Oracle RAC from Symantec

7 Ways Compellent Optimizes VMware Server Virtualization WHITE PAPER FEBRUARY 2009

Using Self-Protecting Storage to Lower Backup TCO

Storage in combined service/product data infrastructures. Craig Dunwoody CTO, GraphStream Incorporated

Nimble Storage Adaptive Flash

ADVANCED DATA REDUCTION CONCEPTS

Overview Brochure. Veritas NetBackup TM Appliance Family

DISASTER RECOVERY TESTING, YOUR EXCUSES, AND HOW TO WIN

Virtuozzo Containers

Veeam Availability Solution for Cisco UCS: Designed for Virtualized Environments. Solution Overview Cisco Public

Storage Performance Validation for Panzura

Boost your data protection with NetApp + Veeam. Schahin Golshani Technical Partner Enablement Manager, MENA

New Approach to Unstructured Data

Arcserve Solutions for Amazon Web Services (AWS)

Understanding Managed Services

Move, manage, and run SAP applications in the cloud. SAP-Certified Infrastructure from IBM Cloud

STORAGE CONSOLIDATION AND THE SUN ZFS STORAGE APPLIANCE

Disaster Recovery Guide

Virtualization & On-Premise Cloud

Microsoft DPM Meets BridgeSTOR Advanced Data Reduction and Security

Virtual Desktop Infrastructure (VDI) Bassam Jbara

THE COMPLETE GUIDE COUCHBASE BACKUP & RECOVERY

VMware vsphere Clusters in Security Zones

Accelerate the Journey to 100% Virtualization with EMC Backup and Recovery. Copyright 2010 EMC Corporation. All rights reserved.

Credit: Dell/ExaGrid/MR2 Technical Lunch Event, Downtown Los Angeles. ExaGrid Systems (Up to 130TB per GRID) -

Vendor: Citrix. Exam Code: 1Y Exam Name: Designing Citrix XenDesktop 7.6 Solutions. Version: Demo

Storage Solutions for VMware: InfiniBox. White Paper

EMC NetWorker Backup Solution for SAP HANA. Data Protection for Big Data

Why Scale-Out Big Data Apps Need A New Scale- Out Storage

Transcription:

Why In-Place Copy Data Management is The Better Choice Copy Data Management (CDM) is increasingly being recognized as a musthave technology for the modern data center. But there remains a fundamental difference in approach among vendors: should you use In-Place CDM or go with an Out-of-Band, appliance-based model? Let s look at the differences, but first let s agree on what CDM means in the first place. What is Copy Data Management? Copy Data Management is managing the creation, use, distribution, retention and clean-up of data copies. Different feature sets may or may not be added on top of the solution; things like automation, user self-service options, REST APIs for DevOps integration, reporting and so on (Catalogic offers all of these features). The goal of CDM is to eliminate the many manual steps currently used to create and deliver copies like getting rid of scripting and to centralize operations into one tool rather than several. The goal being greater efficiency in storage consumption and much more importantly dramatically lowering operational overhead and improving reliability. There are two main architectural approaches to CDM: the Out-of-Band method and the In-Place method. Let s look at them.

The Out-of-Band Approach In the Out-of-Band (OoB) model, data is moved from the source production storage to a target storage device; that is, a separate storage stack different from the primary. The data is typically moved by a client agent residing on the host server. The copy method is usually some kind of changed-block model. While this reduces the host impact compared to traditional backup, the impact is not zero as it will affect CPU utilization and host bandwidth. The target device is usually an appliance, though some vendors offer bring your own storage software versions as well in the form of a virtualized appliance. It is important to realize that the appliance is a separate storage device with its own controller and firmware. This can be significant and will be discussed further below. When using the data copies (for reporting, dev-test, etc.), a workload server connects to the OoB appliance. If changes need to go back to the production side, they would be copied back to production over the network. Multiple workload servers can connect to a single copy, which is parceled out via snapshot images. This is how efficiency benefits are realized. But performance characteristics of the OoB appliance of course will be different than the production storage. The Out-of-Band approach can be visualized as follows: 2 of 5

The In-Place Approach Catalogic uses an In-Place Copy Data Management model. This In-Place approach makes use of primary storage array copy processes snapshots, replication and clones as well as any data reduction inherent in the array (deduplication, compression). Catalogic ECX functions as a control plane and doesn t replace any storage array features; it makes use of them, providing features like automation, self-service, application integration (for data consistency, log management, etc.), hypervisor integration, and so on. There is no separate storage device involved. Catalogic communicates to both the host server and the production storage, driving copy processes. In this model, there is no data movement as storage snapshots are created In-Place. Workload server access is provided by using the storage cloning features and connecting the snapshot clone to the server. With the In-Place approach, recoveries or data copying are far quicker because little or no data movement is involved, and nothing traverses the network. This approach can be visualized as follows: The benefits of this approach are apparent. No additional storage footprint required No separate storage device to manage Makes use of existing storage resources and features Maximizes current hardware investment Instant data capture; no host impact Maintains any array data reduction (no need to rehydrate when copying to alternate tier) There is one aspect of the In-Place model that should be discussed. Not everyone is comfortable with running non-production workloads on production storage. Even though all-flash arrays have plenty of IOPs to spare and can run non-prod workloads without performance impact (unlike with spinning disk), many organizations simply won t allow non-prod work to happen on prod storage. 3 of 5

Catalogic provides an easy solution for that. Since Catalogic ECX can manage replication as well as snapshots, it s simple enough to replicate to an alternate array (which can be in the same storage rack or physically separated) and run nonprod workloads on the alternate array. This is visualized as follows: In this way, you have complete array separation. A major benefit of this approach is that by using array replication, you maintain all the data reduction from the primary array, thereby greatly decreasing the amount of data replicated. And no data goes through the host, which limits network and host impact and reduces complexity. There s something else to consider, as we learned from end-users. Consider the case of Dev-Test. In the Out-of-Band model, your Dev work is being done on a storage system that is fundamentally different than your production storage. It has different software, a different file system, different disks, which all equate to different performance characteristics. Therefore, if you are doing development work on a different system, when you move the application back to production you can t be certain it will work properly. An element of risk remains. And when something does go wrong, it inevitably leads to fingerpointing as development blames IT for an infrastructure problem and IT blames development for problems with their code. Troubleshooting this kind of problem can be nightmarish, because the source of the problem could be anywhere on either side of the equation. In the Catalogic approach, the same array type is used in prod and non-prod, so all your Dev work is done on a storage stack that is 100% identical to production. Even the copy processes use the native array software. Catalogic does not introduce any software changes; it only manages the process by talking to the array APIs. This is a no-risk infrastructure model for development. 4 of 5

Conclusion In some ways, this choice in approaches is reminiscent of the early days of VTLs and deduplication. There was a huge battle about the better approach: inline dedupe during backup, or post-processing dedupe after backup completes. The battle raged for a few years, until hardware improvements made it impossible to argue anymore: in-line dedupe won, because it was clearly more efficient. A similar shakeup is happening in the copy data space. Out-of-band was the first approach that came to market, but In-Place is proving to be the better choice. The differences are straightforward. The Out-of-Band model requires you to deploy an entirely different storage hardware/software stack with different operational and performance behaviors. And incidentally, the appliances are often quite expensive. But the costs are more than the initial purchase price. You must manage and maintain a separate storage environment just for the sake of your copies. And you also have to consider scale. Catalogic has seen users with thousands of database instances, each of which spins off 8 or 10 copies for non-prod workloads. How many copy appliances will you need to deliver on this kind of workload demand? Catalogic s In-Place approach lets you maximize the storage investment you ve already made (investments in both money and operational know-how). If required by budget limits and allowed by policy, you can run nonproduction workloads on the production array. That is, with an easy, non-disruptive software tool you can bring automation, self-service and other operational enhancements to your existing storage footprint. If you need to deploy a second array (or third, fourth, fifth ), Catalogic fully manages operations across nodes: production array snapshots for recovery (scheduling, retention, easy recovery workflows, etc.) and replication (scheduling, retention, easy use of data at the second node, etc.) to put data on a separate frame. Your choices in a copy data management approach are clear, and we think the better choice is clear as well. Copyright Catalogic Software 2017 50 Tice Boulevard, Suite 110 Woodcliff Lake, NJ 07677 201.249.8980 U.S.A. catalogicsoftware.com United Kingdom: +44 (0) 207 712 1667 Germany: +49 (0) 2102 307614 0 Netherlands: + 31 (0) 20 347 23 88 Catalogic is a registered trademark of Catalogic Software Inc. ECX is a trademark of Catalogic Software Inc. All other company and product names used herein may be the trademarks of their respective companies., JANUARY 2017