A Dell technical white paper By Fabian Salamanca, Javier Jiménez, and Leopoldo Orona

Similar documents
Reference Architecture for Dell VIS Self-Service Creator and VMware vsphere 4

Enhancing Oracle VM Business Continuity Using Dell Compellent Live Volume

Dell DR4000 Replication Overview

A Dell Technical White Paper Dell Virtualization Solutions Engineering

Deployment of VMware Infrastructure 3 on Dell PowerEdge Blade Servers

Dell PowerVault MD Family. Modular Storage. The Dell PowerVault MD Storage Family

Dell Server Migration Utility (SMU)

Microsoft SharePoint Server 2010 on Dell Systems

Dell PowerVault MD3600f/MD3620f Remote Replication Functional Guide

Dell Compellent Storage Center with CommVault Simpana 9.0. Best Practices

EMC Backup and Recovery for Microsoft Exchange 2007

Microsoft SQL Server in a VMware Environment on Dell PowerEdge R810 Servers and Dell EqualLogic Storage

Overview of Dell AIM Integration with Microsoft System Center 2012

VMware Infrastructure Update 1 for Dell PowerEdge Systems. Deployment Guide. support.dell.com

Storage Consolidation with the Dell PowerVault MD3000i iscsi Storage

Dell Fluid Data solutions. Powerful self-optimized enterprise storage. Dell Compellent Storage Center: Designed for business results

vstart 50 VMware vsphere Solution Overview

vstart 50 VMware vsphere Solution Specification

Dell PowerVault NX1950 configuration guide for VMware ESX Server software

VMware Infrastructure 3.5 for Dell PowerEdge Systems. Deployment Guide. support.dell.com

VMware Infrastructure Update 1 for Dell PowerEdge Systems. Deployment Guide. support.dell.com

DELL POWERVAULT NX3500. A Dell Technical Guide Version 1.0

Deployment of VMware ESX 2.5.x Server Software on Dell PowerEdge Blade Servers

Overview. Implementing Fibre Channel SAN Boot with the Oracle ZFS Storage Appliance. January 2014 By Tom Hanvey; update by Peter Brouwer Version: 2.

Real-time Protection for Microsoft Hyper-V

Deploying and Managing Dell Big Data Clusters with StackIQ Cluster Manager

FOUR WAYS TO LOWER THE COST OF REPLICATION

How to Increase VMware Application Availability with Shared Storage

Access Control Policies

Dell Compellent Storage Center. Microsoft Server 2008 R2 Hyper-V Best Practices for Microsoft SCVMM 2012

Data Protection Using Premium Features

Microsoft Office SharePoint Server 2007

EMC Backup and Recovery for Microsoft Exchange 2007 SP1. Enabled by EMC CLARiiON CX4-120, Replication Manager, and VMware ESX Server 3.

Competitive Power Savings with VMware Consolidation on the Dell PowerEdge 2950

DELL POWERVAULT MD FAMILY MODULAR STORAGE THE DELL POWERVAULT MD STORAGE FAMILY

EMC Business Continuity for Microsoft Applications

iscsi Boot from SAN with Dell PS Series

Overview. Implementing Fibre Channel SAN Boot with Oracle's Sun ZFS Storage Appliance. August By Tom Hanvey; update by Peter Brouwer

Huawei OceanStor ReplicationDirector Software Technical White Paper HUAWEI TECHNOLOGIES CO., LTD. Issue 01. Date

VMware VAAI Integration. VMware vsphere 5.0 VAAI primitive integration and performance validation with Dell Compellent Storage Center 6.

Active System Manager Release 8.2 Compatibility Matrix

Disaster Recovery-to-the- Cloud Best Practices

EMC Virtual Infrastructure for Microsoft Exchange 2010 Enabled by EMC Symmetrix VMAX, VMware vsphere 4, and Replication Manager

Veritas NetBackup Appliance Fibre Channel Guide

EMC CLARiiON CX3-40. Reference Architecture. Enterprise Solutions for Microsoft Exchange 2007

Protecting VMware vsphere/esx Environments with CA ARCserve

Always Available Dell Storage SC Series Date: October 2015 Author: Brian Garrett, VP ESG Lab

DELL Reference Configuration Microsoft SQL Server 2008 Fast Track Data Warehouse

Four-Socket Server Consolidation Using SQL Server 2008

Dell PowerVault MD Family. Modular storage. The Dell PowerVault MD storage family

EMC CLARiiON CX3-40. Reference Architecture. Enterprise Solutions for Microsoft Exchange Enabled by MirrorView/S

Veritas Storage Foundation for Windows by Symantec

Best Practices for Deploying a Mixed 1Gb/10Gb Ethernet SAN using Dell EqualLogic Storage Arrays

Functional Testing of SQL Server on Kaminario K2 Storage

Optimizing and Managing File Storage in Windows Environments

DELL EMC UNITY: REPLICATION TECHNOLOGIES

LIFECYCLE MANAGEMENT FOR ORACLE RAC 12c WITH EMC RECOVERPOINT

High performance and functionality

Dell Data Protection for VMware Horizon View Infrastructures

LEVERAGING A PERSISTENT HARDWARE ARCHITECTURE

Accelerating Microsoft SQL Server 2016 Performance With Dell EMC PowerEdge R740

MOVING TOWARDS ZERO DOWNTIME FOR WINTEL Caddy Tan 21 September Leaders Have Vision visionsolutions.com 1

DATA PROTECTION IN A ROBO ENVIRONMENT

SAN Implementation (SANIW)

Protecting VMware vsphere/esx Environments with Arcserve

EMC Virtual Infrastructure for Microsoft Applications Data Center Solution

Deploying Solaris 11 with EqualLogic Arrays

White Paper. A System for Archiving, Recovery, and Storage Optimization. Mimosa NearPoint for Microsoft

Cost and Performance benefits of Dell Compellent Automated Tiered Storage for Oracle OLAP Workloads

EMC Integrated Infrastructure for VMware. Business Continuity

Proactive maintenance and adaptive power management using Dell OpenManage Systems Management for VMware DRS Clusters

Veritas NetBackup on Cisco UCS S3260 Storage Server

INFOBrief. Dell PowerEdge Key Points

Veritas Storage Foundation for Windows by Symantec

DELL TM PowerVault TM DL Backup-to-Disk Appliance

VMWARE PROTECTION WITH DELL EMC NETWORKER 9

Lifecycle Controller 2 Release 1.0 Version Readme

Copyright 2015 EMC Corporation. All rights reserved. Published in the USA.

How CloudEndure Disaster Recovery Works

OpenManage TM Integration for VMware vcenter FAQ

Virtualization Support in Dell Management Console v1.0

INFOBrief. Dell OpenManage Client Administrator 3.0. Key Points

Dell Reference Configuration for Large Oracle Database Deployments on Dell EqualLogic Storage

Deployment of VMware ESX 2.5 Server Software on Dell PowerEdge Blade Servers

iscsi Target Usage Guide December 15, 2017

Data Migration from Dell PS Series or PowerVault MD3 to Dell EMC SC Series Storage using Thin Import

DELL POWERVAULT NX3500 INTEGRATION WITHIN A MICROSOFT WINDOWS ENVIRONMENT

Using Dell Repository Manager to Update Your Local Repository

Get on-premise private cloud with enterprise managed services.

Hitachi Unified Compute Platform Pro for VMware vsphere

Virtual Disaster Recovery

INFOBrief. Dell-IBRIX Cluster File System Solution. Key Points

Protecting Mission-Critical Application Environments The Top 5 Challenges and Solutions for Backup and Recovery

Setting Up the Dell DR Series System on Veeam

Microsoft E xchange 2010 on VMware

A Reference Architecture for SAP Solutions on Dell and SUSE

Microsoft SQL Server 2012 Fast Track Reference Architecture Using PowerEdge R720 and Compellent SC8000

VMware Site Recovery Manager 5.x guidelines for the IBM Storwize family

Dell Compellent Storage Center

Dell FS8600 with VMware vsphere

Transcription:

Implementing cost-effective disaster recovery A Dell technical white paper By Fabian Salamanca, Javier Jiménez, and Leopoldo Orona

THIS WHITE PAPER IS FOR INFORMATIONAL PURPOSES ONLY, AND MAY CONTAIN TYPOGRAPHICAL ERRORS AND TECHNICAL INACCURACIES. THE CONTENT IS PROVIDED AS IS, WITHOUT EXPRESS OR IMPLIED WARRANTIES OF ANY KIND. Copyright 2011 Dell Inc. All rights reserved. Reproduction of this material in any manner whatsoever without the express written permission of Dell Inc. is strictly forbidden. For more information, contact Dell. Dell, the Dell logo, the Dell badge, Dell Compellent, Data Instant Replay, Fluid Data, PowerConnect, PowerEdge, Remote Instant Replay, and Storage Center are trademarks of Dell Inc. Microsoft is a registered trademark and Hyper-V is a trademark of Microsoft Corporation in the United States and/or other countries. Red Hat and Enterprise Linux are registered trademarks of Red Hat, Inc. VMware is a registered trademark and vsphere is a trademark of VMware, Inc. Other trademarks and trade names may be used in this document to refer to either the entities claiming the marks and names or their products. Dell Inc. disclaims any proprietary interest in trademarks and trade names other than its own. November 2011 ii

Contents Introduction... 1 Accelerating storage area network based replication... 2 Changing workloads dynamically... 2 Leveraging a foundation for efficient business continuity... 3 Managing storage replication... 3 Configuring persona management... 4 Simulating the disaster recovery process... 6 Summary... 11 Figures Figure 1. Implementing a disaster recovery time line for enhanced business continuity... 1 Figure 2. Configuring a dual-site disaster recovery simulation... 4 Figure 3. Creating a volume using the main site storage view... 5 Figure 4. Viewing storage at the disaster recovery site... 5 Figure 5. Creating a disaster recovery persona... 6 Figure 6. Confirming disaster recovery persona image details... 7 Figure 7. Stopping the replication process to simulate a failure... 7 Figure 8. Listing site volume Replay history... 8 Figure 9. Verifying a site persona is running properly... 8 Figure 10. Executing the disaster recovery script... 10 Figure 11. Confirming workload recovery in a persona startup log... 10 iii

Introduction Today s working environment requires the availability of business-critical applications to ensure the successful operation of the organization, and IT departments are seeking innovative, cost-effective ways to provide business continuity. As a result, IT organizations are escalating their efforts to protect mission-critical applications such as e-mail, Internet presence, enterprise resource planning (ERP), and customer relationship management (CRM) from sudden disruption or downtime. Although high availability clustering provides local protection, critical applications also require geographical protection. The stakes for preserving business continuity are high: among organizations that experience a major loss of business data, a significant number face critical problems, and only a few are able to overcome them. Organizations creating a business continuity plan should begin by identifying critical applications and functions requiring protection. Next, they should delineate the recovery time objective (RTO), which specifies the maximum allowable time to restore each critical process after an adverse event occurs. Then they should define the recovery point objective (RPO), which targets the maximum acceptable amount of data that is at risk of loss after an adverse event occurs. Time to data (TTD) is the time required for retrieving backup data and delivering it to the recovery site (see Figure 1). RTO and RPO are key measures that drive the configuration of a disaster recovery implementation, which also affect its cost. Reduced RTO and RPO translate into an enhanced business continuity response and a cost-effective disaster recovery implementation. Figure 1. Implementing a disaster recovery time line for enhanced business continuity Organizations taking an approach to deploy cost-effective business continuity and data replication can leverage Dell Advanced Infrastructure Manager (AIM) a component of the Dell Virtual Integrated System (VIS) portfolio with Dell Compellent Storage Center storage area network (SAN) arrays. This 1

architecture is designed to provide reliable data replication, OS image integrity, and efficient workload provisioning. The configuration referenced in this document implements Fiber Channel for OS image and data access, IP and Ethernet networks for long-distance, WAN-based replication using Internet SCSI (iscsi) connectivity, and AIM to manage the provisioning of workload identities and network configuration. Accelerating storage area network based replication Dell Compellent Storage Center SANs are designed to actively and effectively manage enterprise data throughout its life cycle, giving organizations the agility to constantly adapt in a dynamic business environment. Together, the Dell Fluid Data architecture, storage virtualization, advanced software, and modular hardware deliver enhanced efficiency, ease of use, and security. Built-in intelligence and automation help ensure that data is available when and where it is needed, and an open, persistent hardware platform scales in line with business needs to help protect storage infrastructure over the long term. Dell Compellent Storage Center SANs are also designed to improve RTO and RPO while accelerating replication and recovery operations, helping reduce time consumed by management tasks as well as capacity and bandwidth costs, and increasing business continuity. Its thin replication feature allows for multisite replication that is more cost-efficient than traditional approaches to replication. Thin replication allows IT organizations to realize performance benefits using built-in bandwidth simulation and shaping. This feature helps align bandwidth procurement with estimated requirements based on actual data traffic flow, and transfer rates can be customized based on link speed, time of day, and priority. Thin provisioning in Dell Compellent Storage Center SANs is designed to reduce the disk space that organizations consume and free their bandwidth resources. Volumes are created based on only written data, and thin replication enables intelligent transfer of only changed blocks of data thereafter. Dell Compellent Storage Center SANs also offer a technology-agnostic approach that provides the flexibility and scalability to support synchronous or asynchronous replication, Fibre Channel or iscsi connectivity, and bidirectional, point-to-point, or multipoint configurations. With the ability to create continuous snapshots Replays created with Dell Compellent Data Instant Replay software between local and remote sites, Dell Compellent Storage Center SANs provide unlimited recovery points and block-level management that allows for very rapid recovery. Remote replication can be tested with just a few mouse clicks, without disrupting production environments. Changing workloads dynamically Dell AIM enables data centers to react in real time to changing business needs by dynamically changing which servers are running and how those servers are connected to the network and storage. IT services can be dynamically provisioned or reconfigured in minutes, helping increase the utilization of existing assets, recover quickly if problems occur, and rapidly scale services to support business processes. AIM creates dynamic workloads that can leverage a flexible, cable-once data center infrastructure by coordinating end-to-end network, storage, and computing resources. A dynamic workload, or persona, is a server environment captured on disk. It comprises the OS, the optional AIM agent software, application software, and storage and networking settings including either iscsi or Fibre Channel. A persona can also include other settings required to run an application on a server, either a virtual server or a physical server. This personality includes persistent 2

identification settings to help ensure the persona has access to the same resources no matter which changes in an AIM-managed data center may occur. One type of persona, the network-booted persona, is a workload that is able to boot on any validated component, using its personality to always access the same resources. For example, at any given time, a network-booted persona can run on a physical server such as a Dell PowerEdge R610 server, and after business hours it can run on either a VMware vsphere -based or Microsoft Hyper-V -based virtual machine to save power and cooling costs. This dynamic capability enables the workload to use resources as needed on demand for example, using low-cost hardware or virtual machines when load is expected to be minimal and retargeting to high-performance gear when load is expected to be high. All data related to a network-booted persona resides in a SAN, enabling IT organizations to leverage the management benefits associated with SANs. For example, if a network-booted persona resides within a Dell Compellent virtual storage array, it benefits from the Dell Fluid Data architecture by using Data Instant Replay software for backup and recovery and Dell Compellent Remote Instant Replay software for long-distance replication. Leveraging a foundation for efficient business continuity Dell AIM and Dell Compellent Storage Center SANs offer a rich set of complementary features that together form a cost-effective foundation for an efficient approach to business continuity. In September 2011, Dell engineers configured a disaster recovery scenario utilizing two simulated sites that support production operation of workloads, replication, and disaster recovery. Two racks of equipment were used for this example infrastructure. One simulated the main site in Mexico City, Mexico, and the other simulated the disaster recovery site in Round Rock, Texas. In this scenario, the main site runs a SAN-booted persona using the Red Hat Enterprise Linux 5 64-bit OS on a Dell PowerEdge R610 server (see Figure 2). The volume where this persona resides is to be replicated to the disaster recovery site using Dell Compellent Remote Instant Replay. At the disaster recovery site this persona will be booted on a VMware ESX 4.1 virtual machine. Configurations at both sites include the following components: Dell Compellent Series 40 controller A 3.5-inch Serial Attached SCSI (SAS) disk array enclosure (DAE) Dell PowerConnect 6224 switch for local area network (LAN) and wide area network (WAN) access Brocade Silkworm 300 Fibre Channel switch ESX virtual machine running Dell AIM release 3.4.1 Primary challenges to achieving a successful disaster recovery configuration include the following: Data replication Physical-to-virtual (P2V) migration on the fly also known as retargeting Virtual LAN (VLAN) management SAN access management Managing storage replication A 20 GB volume was created at the main site using Dell Compellent Enterprise Manager software (see Figure 3). An easy-to-use wizard allows administrators to create the replication volume at the disaster 3

recovery site automatically and at exactly the same time. This feature is available for any single volume created using the Dell Compellent Enterprise Manager software. Figure 2. Configuring a dual-site disaster recovery simulation The volume created at the main site is presented to a physical server using 8 Gbps Fibre Channel connectivity and its World Wide Port Number (WWPN). Proper zoning was configured at the Brocade Silkworm Fibre Channel switch (see Figure 4). In this scenario, iscsi connectivity, and not Fibre Channel connectivity, is deployed at the disaster recovery site. As a result, when the disaster recovery volume has been created through the automated process and first replication of data has completed, the disaster recovery volume is mapped to the original persona using iscsi after stopping the replication process momentarily. Then, a unique iscsi Qualified Name (IQN), which was previously configured in the persona image at the main site, is already present at the disaster recovery volume. AIM can then be used to manage the replicated persona s access to the SAN. The replication process then begins anew to keep transferring new data. Configuring persona management The persona to be replicated is created and located at both sites, and the protected workload runs the Red Hat Enterprise Linux 5 64-bit OS. Note: Because the scope of this scenario is related to data replication and disaster recovery, local persona retargeting is not shown in this disaster recovery example. 4

Figure 3. Creating a volume using the main site storage view Figure 4. Viewing storage at the disaster recovery site SAN zoning had already been implemented using the WWPN of the host bus adapter (HBA) and WWPN and World Wide Node Name (WWNN) of the Dell Compellent Storage Center SAN. Both servers the Dell PowerEdge R610 running the persona at the main site and the VMware ESX based virtual server at the remote site have already been discovered. 1 1 For information on discovering physical servers, VMRacks, and VMware virtualized servers, please refer to the Dell Advanced Infrastructure Manager Release 3.4.1 User s Guide that is provided with the AIM software. 5

The persona at the main site was created by installing the OS directly on the master volume, with SAN booting previously configured in the HBA and BIOS. The LAN on Motherboard (LOM) on the server was connected to the Dell AIM control VLAN. The persona appears on the AIM console after installing the AIM agent software, which enables AIM to stop the persona and then start it successfully. The VMRack and an AIM-managed virtual machine should be created and discovered by the AIM controller at the remote site. This virtual machine is intended to host the disaster recovery persona, which boots from an iscsi volume. The disaster recovery persona was created using the Add Persona Wizard (see Figure 5). The persona was configured with the following settings: Network mode: auto Networking enabled: yes Agent exists: yes iscsi boot image The disaster recovery persona is assigned to the VMRack an ESX-based virtual machine (see Figure 6). Figure 5. Creating a disaster recovery persona Simulating the disaster recovery process A failure test at the main site can be simulated by stopping the replication process between both sites from the same Dell Compellent Enterprise Manager interface (see Figure 7). All replicas configured on the Dell Compellent Storage Center SAN can be tested using the Test Recovery Site Tool without disruption to help improve disaster recovery testing times and processes. 6

Figure 6. Confirming disaster recovery persona image details Figure 7. Stopping the replication process to simulate a failure The disaster recovery volume at the remote site can be activated by using the Dell AIM disaster recovery persona iscsi login process, which is part of its previously configured booting process. Using Dell Compellent Enterprise Manager features helps simplify creating one or several recovery points using available disaster recovery tools to meet specific needs (see Figure 8). 7

Figure 8. Listing site volume Replay history To simulate a failure at the main site, the server running the persona can simply be powered off directly, circumventing a clean shutdown process. The first step following the failure is to get to the data restoration point (see Figure 1). Figure 9 shows the main site persona up and running properly just prior to the simulated failure. Figure 9. Verifying a site persona is running properly Tackling the RTO issue requires using AIM to easily implement a disaster recovery script. For example, the following generic command-line interface (CLI) wrapper and AIM shell script help simplify the development and execution of AIM CLI scripting, and a separate text file containing AIM-related commands is the only input necessary for this wrapper. 8

#!/bin/bash # # ************************************** # # (C) 2011 Dell # # ************************************** SDK=/opt/dell/aim/bin/sdk ACCOUNT=admin PASS=admin SCRIPT=$1 if [ "$SCRIPT" == "" ]; then echo "Usage: $0 [script filename]" exit 1 fi echo "==========================" echo "DELL AIM SDK wrapper" echo "Running script $SCRIPT" echo "==========================" echo $SDK account=$account password=$pass ifile=$script Many tools can be leveraged to implement similar functionality, which facilitates efficient integration with several open platforms. Figure 10 shows execution of the disaster recovery script. Although this script represents a simple start persona, additional tasks can be included to restore the operating environment of a production workload. For example, test and development environments can be gracefully shut down to make room for business-critical applications that require the additional compute capacity. In addition, creating, configuring, and virtually cabling new networks may be automated before starting hundreds of personas, which can follow a specific order to correctly sequence the start of dependent workloads (see Figure 11). 9

Figure 10. Executing the disaster recovery script Figure 11. Confirming workload recovery in a persona startup log 10

Summary Using Dell AIM workload mobility, Dell Compellent Storage Center SANs, and Dell Fluid Data Architecture, this approach to automated replication for remote disaster recovery demonstrates how several steps in the process can be simplified. In a manner representing an efficient variation to industry standards, managing the comprehensive set of activities required for disaster recovery from the storage point of view in a single interface Dell Compellent Enterprise Manager is now possible. RTO is an important measure in the operations necessary for restoration after data has been restored. Dell AIM helps reduce this target by automating and validating several resource management, virtual and physical synchronization, workload assignment, networking, and storage access tasks. Restoring the workload in this simulation took 1 minute, 27 seconds plus an additional 56 seconds to manually trigger and execute the disaster recovery script to yield a total recovery time of 2 minutes, 23 seconds. Using an integrated monitoring tool instead of a manually executed script may reduce the RTO. Several personas can be configured using the same steps and controlled by the same disaster recovery script. As a result, organizations can gain similar benefits when working with several workloads at one time. 11