Enterprise Solutions for Microsoft Exchange 2007
EMC CLARiiON CX3-40 Metropolitan Exchange Recovery (MER) for Exchange Server
Enabled by MirrorView/S and Replication Manager
Reference Architecture

EMC Global Solutions Operations

EMC Corporation
Corporate Headquarters
Hopkinton MA 01748-9103
1.508.435.1000
www.emc.com
Copyright and Trademark Information

Copyright 2008 EMC Corporation. All rights reserved.

EMC believes the information in this publication is accurate as of its publication date. The information is subject to change without notice.

THE INFORMATION IN THIS PUBLICATION IS PROVIDED AS IS. EMC CORPORATION MAKES NO REPRESENTATIONS OR WARRANTIES OF ANY KIND WITH RESPECT TO THE INFORMATION IN THIS PUBLICATION, AND SPECIFICALLY DISCLAIMS IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.

Use, copying, and distribution of any EMC software described in this publication requires an applicable software license.

For the most up-to-date listing of EMC product names, see EMC Corporation Trademarks on EMC.com. All other trademarks used herein are the property of their respective owners.

Part number: H4147
Contents

About this Document
- Purpose
- Audience
- Scope

Chapter 1 Solution Overview
- The business challenge
- The technology solution
- Solution components
  - EMC CLARiiON CX3-40
  - EMC MirrorView/S
  - EMC SnapView
  - EMC Replication Manager
- Environment profile
- Environment architecture
- Hardware resources
- Software resources

Chapter 2 Solution Details
- Storage
- MirrorView/S
  - Consistency groups
- Replication Manager
- SnapView
  - EMC SnapView and Replication Manager
  - EMC SnapView and MirrorView/S

Chapter 3 Testing Overview and Observations
- Workload generation tools
- Environment monitoring applications
- Testing methods
- Observations

Chapter 4 Conclusion
About this Document

This document describes the reference architecture of an EMC Enterprise solution tested and validated by EMC Global Solutions Operations (GSO). It was produced as part of the EMC Total Customer Experience (TCE) program by the GSO TCE Customer Integration Labs, working collaboratively with Technical Field Consultants, Engineering, and the Microsoft Solutions Practice.

Purpose

The purpose of this solution is to demonstrate the remote synchronous replication and disaster recovery (DR) capabilities of EMC software in an Enterprise Microsoft Exchange 2007 environment where an EMC CLARiiON CX3-40f is used for storage and consolidation. In this solution, EMC's remote synchronous replication and disaster recovery capabilities are tested over a metropolitan area of 50 km. EMC MirrorView/Synchronous (MirrorView/S) is used to provide highly available data protection with zero data loss between the primary and secondary sites, while EMC Replication Manager (RM) and EMC SnapView work together to minimize backup windows. Collectively, EMC's hardware and software work to provide a unique solution for online data availability and disaster recovery.

Audience

This document is intended for technical staff interested in deploying a new or an existing EMC replication and disaster recovery (DR) solution. Executives evaluating such a solution will also find this document useful.
Scope

This document provides an overview of a solution that incorporates emerging requirements and best practices for Microsoft deployments with EMC technologies. An architectural overview and descriptions of the hardware and software components used in this solution are also included.

Note: This document describes only the features and methodologies specific to this solution. For more detailed information on CLARiiON best practices for specific components of this solution, or other EMC solutions, consult the appropriate EMC documentation.
Chapter 1 Solution Overview

This chapter contains the following sections:
- The business challenge
- The technology solution
- Solution components
  - EMC CLARiiON CX3-40
  - EMC MirrorView/S
  - EMC SnapView
  - EMC Replication Manager
- Environment profile
- Environment architecture
- Hardware resources
- Software resources
The business challenge

Businesses today face the challenges of:
- Meeting service-level requirements for recovery time and data loss.
- Adapting to a variety of replication distance requirements.
- Enabling the efficient, effective, and affordable replication of information remotely.
- Performing rapid restart of applications upon failure.
- Controlling costs associated with bandwidth to support remote replication.

For many businesses, it is critical that business information always be available. To protect this information, it is important to have a disaster recovery plan in place to safeguard against disasters that could make the data at the primary site unavailable. Recovery point objective (RPO) defines the amount of acceptable data loss in the event of a disaster. Many businesses require their RPO to be as low as possible.

This solution illustrates how EMC MirrorView/S and Replication Manager with SnapView can:
- Protect data and applications from local and regional disruptions.
- Provide zero data loss.
- Recover quickly from unplanned events.
- Migrate data across geographically dispersed storage systems.
- Limit impact on server resources and networks.
The technology solution

In this solution, MirrorView/S is used to provide protection from site disasters by replicating the Exchange 2007 database and logs to a remote CLARiiON. In addition, EMC Replication Manager 5.0 is used to coordinate local replication.

This solution describes an Enterprise (12,000 users) Exchange environment consisting of two clusters, one on the production side and one on the DR side, each with two active nodes and one passive node. Storage is provided by an EMC CLARiiON CX3-40f on the production side, replicating via MirrorView/S to a second CX3-40f on the DR side.

The solution includes information on:
- Creating a well-performing storage design for Microsoft Exchange 2007 on an EMC CX3-40f with a large and very active database.
- Leveraging the data replication, recovery, and protection capabilities of MirrorView/S and SnapView on the EMC CLARiiON CX3-40f storage system, with Replication Manager used to manage the point-in-time replicas.
- Assessing the impact of creating local clones of production LUNs that are also being replicated to a DR site with MirrorView/S at a distance of 50 km.
- Recovery performance and MirrorView/S interaction.
- Execution of planned and unplanned failure conditions to the remote site.

In partnership with Microsoft and its certified partners, EMC has designed email solutions for Microsoft Exchange 2007 environments using EMC software for the EMC CLARiiON platform.

The Jetstress test results for this solution were used to produce a Microsoft Exchange Solution Review Program (ESRP) submission. The corresponding submission for this solution is entitled EMC CLARiiON CX3-40 MirrorView/S (12,000 user) Storage Solution for Microsoft Exchange Server 2007 and can be found on EMC.com.

Note: The ESRP storage program is designed to facilitate third-party storage testing and solution publishing for Microsoft Exchange Server.
Solution components

This Enterprise Exchange 2007 CX3-40 replication and DR solution reference architecture includes components from EMC, Microsoft, Dell, Brocade, and Cisco. This section briefly describes the EMC components.

EMC CLARiiON CX3-40

The CLARiiON CX3 UltraScale architecture, which is based on a high-performance, high-availability design, enables the CX3 UltraScale series to address a broad range of application environments. The CX3 UltraScale series systems are built on the same redundant modular architecture and run the FLARE storage operating environment. The UltraScale architecture incorporates state-of-the-art CPUs and a high-performance memory subsystem, leveraging low-latency, high-bandwidth PCI Express interconnect technology.

EMC MirrorView/S

EMC MirrorView/S provides highly available data protection across a campus or metropolitan area environment. It provides an online, host-independent, mirrored data storage and protection solution that duplicates production site data (primary) to one or two secondary sites. MirrorView/S is array-based, and therefore does not use any server cycles. It also supports consistency groups, which allow an application to use the secondary images should the primary storage system fail.

EMC SnapView

EMC SnapView is an array-based software product that creates point-in-time snapshots and full-copy clones of production data. Through economical, disk-based consistent split and instant restore of production data, SnapView cost-effectively accelerates backup and recovery. SnapView runs inside the CLARiiON storage processors, providing the ability to efficiently create bit image replicas of a CLARiiON LUN inside the storage system, without requiring any host processing resources.

EMC Replication Manager

EMC's Replication Manager software provides application-centric replication management for EMC storage, improving access to information. It orchestrates the operating system, application, and CLARiiON technology for optimal management and use of data replicas. Businesses can save valuable operations time by using Replication Manager to automate the mounting, dismounting, scheduling, and expiration of replicas.
Environment profile

Table 1 lists the parameters and assumptions for the solution.

Table 1 Environment profile

Parameter | Value
Number and type of storage system | 2 x CX3-40f
Number of Exchange 2007 users | 12,000
Read/Write ratio | 1:1
IOPS | 0.48 IOPS per user
Exchange 2007 server cluster configuration | 3 nodes (2 active and 1 passive)
Number of Exchange 2007 users per server | 6,000
Number of Exchange 2007 storage groups per server | 8
Number of Exchange 2007 mail databases per storage group | 1
Number of Exchange 2007 users per mail database | 750
Size of Exchange 2007 user mailbox | 200 MB
Storage group mailbox database LUN size | 180 GB
Storage group log LUN size | 40 GB
Replication mechanism | MirrorView/S
Number of links | 1
Simulated link distance | 50 km
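The figures in Table 1 can be cross-checked with a few lines of arithmetic. The sketch below is only an illustration: the variable and function names are made up for this example, the aggregate IOPS figure is simply users multiplied by the per-user rate, and the capacity check assumes decimal units (1 GB = 1,000 MB) and ignores database overhead.

```python
# Values copied from Table 1 (environment profile).
USERS_TOTAL = 12_000
IOPS_PER_USER = 0.48
ACTIVE_SERVERS = 2
STORAGE_GROUPS_PER_SERVER = 8
DATABASES_PER_STORAGE_GROUP = 1
USERS_PER_DATABASE = 750
MAILBOX_MB = 200
DB_LUN_GB = 180


def profile_summary():
    """Derive aggregate figures from the Table 1 parameters."""
    users_per_server = (STORAGE_GROUPS_PER_SERVER
                        * DATABASES_PER_STORAGE_GROUP
                        * USERS_PER_DATABASE)
    # round() avoids floating-point noise from the 0.48 multiplier.
    total_iops = round(USERS_TOTAL * IOPS_PER_USER)
    # Assumption: decimal units, no allowance for database overhead.
    mailbox_data_gb = USERS_PER_DATABASE * MAILBOX_MB / 1000
    return {
        "users_per_server": users_per_server,
        "total_users": users_per_server * ACTIVE_SERVERS,
        "total_iops": total_iops,
        "read_iops": total_iops // 2,   # 1:1 read/write ratio
        "write_iops": total_iops // 2,
        "mailbox_data_gb_per_database": mailbox_data_gb,
        "db_lun_headroom_gb": DB_LUN_GB - mailbox_data_gb,
    }


if __name__ == "__main__":
    for key, value in profile_summary().items():
        print(f"{key}: {value}")
```

Under these assumptions the per-server and total user counts agree with the table (6,000 and 12,000), and each 180 GB database LUN holds roughly 150 GB of mailbox data at the full 200 MB quota.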
Environment architecture

Figure 1 illustrates the overall physical architecture of the solution.

[Figure 1: Enterprise Exchange 2007 CX3-40 MirrorView/S and RM environment. Components labeled in the diagram: Production Mailbox Cluster, Disaster Recovery Mailbox Cluster, Ethernet router, 50 km Metro SAN Extension, RM hosts (both sites), MirrorView/S, dual Brocade 4100 switches (both sites), Production CX3-40f, Disaster Recovery CX3-40f, Client Access and Hub Transport roles (both sites), AD server (both sites), and client load generators.]
Hardware resources

Table 2 lists the hardware resources used in this solution.

Table 2 Hardware resources

Equipment | Quantity | Configuration
Storage | 2 | CX3-40f FC array with FLARE 24 (version 03.24.040.5.011): 64 x 146 GB Fibre Channel, 15k rpm production disks + 2 hot spares; 32 x 146 GB Fibre Channel, 15k rpm clone disks + 1 hot spare
SAN | 4 | Brocade 4100 v5.2.2 (128 ports total, 32 ports per switch)
Active Node Production Exchange servers | 2 | Dell 6850, 4 CPU, 3 GHz, 16 GB RAM, 4 GB Emulex LPe-11002-E HBAs
Active Node DR Exchange servers | 2 | Dell 6850, 4 CPU, 3 GHz, 16 GB RAM, 4 GB Emulex LPe-11002-E HBAs
RM mount host | 4 | Dell 2850, 2 CPU, 3 GHz
Client load generators | 3 | Dell 6850, 4 CPU, 3 GHz, 16 GB RAM
Hub Transport / CAS servers | 2 | Dell 2850, 2 CPU, 3 GHz, 16 GB RAM
Active Directory | 2 | Dell 2850, 2 CPU, 3 GHz
Network switch | 4 | Cisco 3750
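The production disk count in Table 2 is consistent with the building-block sizing described in Chapter 2 (32 disks per 6,000 heavy users). A minimal sketch of that calculation follows; the function name is hypothetical, and hot spares and clone disks are deliberately left out because the document does not give a formula for them.

```python
import math

# Building-block figures stated in Chapter 2 of this document.
USERS_PER_BLOCK = 6_000   # heavy Exchange users supported by one building block
DISKS_PER_BLOCK = 32      # disks in one building block


def production_disks(users: int) -> int:
    """Return the production disk count for a given number of heavy users.

    Rounds up to whole building blocks; hot spares and clone disks are
    not included (assumption: they are sized separately).
    """
    blocks = math.ceil(users / USERS_PER_BLOCK)
    return blocks * DISKS_PER_BLOCK


if __name__ == "__main__":
    print(production_disks(12_000))
```

For the 12,000 users in this solution the result is two building blocks, or 64 disks, which matches the 64 production disks listed for the array.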
Software resources

Table 3 lists the software resources used in this solution.

Table 3 Software resources

Title | Version | Configuration
Windows Server 2003 Enterprise Edition (64 bit) | 2003 (R2) | Cluster (2 active / 1 passive)
Microsoft Exchange | 2007 | Cluster (2 active / 1 passive)
Windows 2003 (64 bit) resource kit | 2003 (SP2) |
PowerPath | 5.0.1.10 |
Solution Enabler | 6.4.2.0 (or later) |
Replication Manager | 5.1 |
Navisphere Host software | R24 |
SnapView | R24 |
MirrorView/S | R24 |
Chapter 2 Solution Details

This chapter contains the following sections:
- Storage
- MirrorView/S
  - Consistency groups
- Replication Manager
- SnapView
  - EMC SnapView and Replication Manager
  - EMC SnapView and MirrorView/S
Storage

This reference architecture uses a storage design concept referred to as Exchange server building blocks. The building block approach defines the number of disks required for a certain number of users per Exchange server. Exchange server building blocks are validated to provide optimum storage performance for the corresponding number of users and their user profile. Supporting a given number of Exchange users (and their user profile) is simply a matter of creating the appropriate number of building blocks and allocating that storage to the Exchange servers.

This solution uses a building block of 32 disks, which supports 6,000 heavy Exchange users. To scale to the required 12,000 users, two building blocks are configured in the CLARiiON storage system and presented to the Exchange servers to hold the databases and logs.

MirrorView/S

In this solution, EMC MirrorView/S is used to maintain synchronous remote copies of production data across a simulated metropolitan area of 50 km. This synchronous replication ensures that there is no data loss in the event of a failure at the primary site.

In a MirrorView/S configuration, a server writes to the source array and the remote array simultaneously. An acknowledgement is sent back to the server once the data has been written to both the source and remote arrays, ensuring a complete transaction record on both arrays. The copy of the data at the remote site is a consistent, restartable copy of the production database. Refer to Figure 2 for more information.

This solution provides customers with requirements for the setup and configuration of the remote replication capabilities of MirrorView/S as part of a disaster recovery solution for the primary Exchange environment.

Consistency groups

Storage system-based consistency groups are available for MirrorView/S solutions. These consistency groups contain a number of mirrors that function together as a single unit. All operations, such as synchronize, fracture, and promote, happen to all of the mirrors in the group. These operations are no longer valid for any single member.

In database applications, this is important because database restartability depends on maintaining write order. Managing these volumes as a consistency group ensures that there is a restartable copy of the database at the DR site.
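The consistency group behavior described above (group operations apply to every mirror, and the same operations are rejected on an individual member) can be modeled in a few lines. This is a conceptual sketch only: the class and method names are hypothetical and do not correspond to the Navisphere or MirrorView/S command set.

```python
class Mirror:
    """Toy model of one remote mirror; not a real MirrorView/S object."""

    def __init__(self, name):
        self.name = name
        self.state = "synchronized"
        self.group = None  # set when the mirror joins a consistency group

    def _apply(self, new_state):
        # Once a mirror belongs to a group, per-mirror operations are rejected.
        if self.group is not None:
            raise RuntimeError(
                f"{self.name} is in group {self.group.name}; "
                "operate on the group instead"
            )
        self.state = new_state

    def fracture(self):
        self._apply("fractured")

    def promote(self):
        self._apply("promoted")


class ConsistencyGroup:
    """Toy model of a consistency group: operations hit every member."""

    def __init__(self, name):
        self.name = name
        self.mirrors = []

    def add(self, mirror):
        mirror.group = self
        self.mirrors.append(mirror)

    def _apply_all(self, new_state):
        for mirror in self.mirrors:
            mirror.state = new_state

    def fracture(self):
        self._apply_all("fractured")

    def promote(self):
        self._apply_all("promoted")


if __name__ == "__main__":
    group = ConsistencyGroup("exchange_cg")       # hypothetical group name
    db, log = Mirror("sg1_db"), Mirror("sg1_log")  # hypothetical mirror names
    group.add(db)
    group.add(log)
    group.promote()
    print(db.state, log.state)
```

Keeping the database and log mirrors in one group is what guarantees that both move to the same state together, which is the property the restartable DR copy relies on.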
Figure 2 provides a high-level overview of a MirrorView/S architecture.

[Figure 2: MirrorView/S overview. Labels in the diagram: Production server, Primary Host, Secondary Host, Limited distance, and numbered arrows 1 through 4.]

The following legend applies to Figure 2:
1. I/O write received from the server to the primary array.
2. I/O is written to primary and secondary images simultaneously.
3. Receipt acknowledgement is provided by the secondary array back to the primary array.
4. Acknowledgement is sent to the server.
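The four steps in the Figure 2 legend can be traced with a small simulation. The sketch below is a simplified model, not array firmware: the class and function names are invented for this example, and the two writes run one after the other here even though the legend describes them as simultaneous. What it preserves is the ordering rule that the server is acknowledged only after the secondary has confirmed the write.

```python
class Array:
    """Toy storage array that keeps written blocks in a dictionary."""

    def __init__(self, name):
        self.name = name
        self.blocks = {}

    def write(self, lba, data):
        self.blocks[lba] = data
        return True  # receipt acknowledgement


def synchronous_write(primary, secondary, lba, data):
    """Perform one mirrored write and return the ordered event trace."""
    trace = ["1: server sends write to primary array"]

    # Step 2: the write goes to both images (sequential in this sketch).
    primary.write(lba, data)
    secondary_ack = secondary.write(lba, data)
    trace.append("2: write applied to primary and secondary images")

    # Step 3: without the secondary's acknowledgement the write cannot complete.
    if not secondary_ack:
        raise IOError("secondary array did not acknowledge the write")
    trace.append("3: secondary array acknowledges to primary array")

    # Step 4: only now is the server told the write is complete.
    trace.append("4: primary array acknowledges to server")
    return trace


if __name__ == "__main__":
    prod, dr = Array("production"), Array("dr")
    for event in synchronous_write(prod, dr, lba=0, data=b"log record"):
        print(event)
```

Because step 4 never precedes step 3, any write the server has seen acknowledged is already present on the secondary image, which is the basis for the zero data loss claim.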
Replication Manager

EMC Replication Manager is a software application that resides on a host and simplifies the creation and management of disk-based replicas. Replication Manager automates the creation of exact point-in-time copies of mission-critical data that can be used for rapid recovery or repurposing. The software manages the creation of the replica from a pool of storage, eliminating the need to preconfigure or define an association with the production volumes. The lifecycle of the replica is also seamlessly managed, ensuring appropriate retention and preventing user error.

Figure 3 provides a high-level overview of Replication Manager.

[Figure 3: Replication Manager overview. Labels in the diagram: Production Host with Agent (IRCCD), Storage Services, and DBMS/FS agents; Replication Manager Console (Java), shown twice; TCP/IP (LAN); Replication Server with Replication Manager policy engines, Replication Manager Service (IRD), and Replication database. Notes in the diagram: "Users can access the software from anywhere on the network" and "Connection is NOT required from Replication Manager Server to the storage array".]
SnapView

EMC SnapView is an array-based local replication technology that allows the creation of mountable snapshots or full-copy clones of production data, enabling you to run testing, decision support, debugging, reporting, or maintenance in parallel without taking applications offline. The process is handled on the EMC CLARiiON storage system and is transparent to the primary server, yet the resulting data is available to a secondary server with the appropriate access privileges.

EMC SnapView and Replication Manager

EMC SnapView and Replication Manager work together to minimize backup windows. This is facilitated by the ability to offload backup cycles from production systems, allowing for the flexibility to conduct backups at any time, without concern for affecting other functions on the system. Integration with Exchange allows organizations to leverage their information storage environment for the easy creation, separation, and reassignment of database replicas.

EMC SnapView and MirrorView/S

EMC SnapView is used in conjunction with MirrorView/S to create a point-in-time image of the primary image or the secondary image at a remote site. With SnapView at the secondary site, Replication Manager can create replicas of the mirror images, mount them to a host at the secondary site, and present them for archival backup to other media. This provides point-in-time replicas of the production data.

It is important to note that a host cannot directly address the secondary image. The benefit of SnapView is that it provides a point-in-time replica of the secondary image that is accessible by a host.
Figure 4 illustrates the data flow of the combined solution components.

[Figure 4: Data flow of combined solution components. Labels in the diagram: Production server, Primary Array and Secondary Array (each with Data and Clone volumes) separated by a limited distance, Local RM Mount Host, and Remote RM Mount Host. Flows in the legend: SnapView, Production I/O, MirrorView/S, and Integrity Check.]
Chapter 3 Testing Overview and Observations

This chapter contains the following sections:
- Workload generation tools
- Environment monitoring applications
- Testing methods
- Observations
Workload generation tools

The following tools were used to generate the workload and to validate the solution:
- Microsoft Exchange Server Jetstress was used to generate I/O on the storage subsystem while it was being replicated over a distance of 50 km using MirrorView/S.
- Microsoft Exchange Load Generator was used to simulate heavy Outlook 2007 online users while the storage subsystem was being replicated over a distance of 50 km using MirrorView/S.

Environment monitoring applications

The following applications were used to monitor and validate the solution:
- Windows Server 2003 Performance Monitor was used to observe:
  - Server RPC latencies
  - Disk response times
  - Memory, network, and processor utilization
  - Various Exchange-specific counters
- Navisphere Analyzer was used to observe:
  - LUN utilization
  - SP utilization
  - Overall I/O throughput and bandwidth
  - Full synchronization and catch-up times (post link failure) while under client load
  - Effect of failure scenarios on performance of the storage array
- Microsoft Exchange Load Generator output was used to observe:
  - Client experience
Testing methods

The following list details the methods used to test the solution:
- Microsoft Exchange Server Jetstress was used to place I/O load on the production volumes to ensure correct performance was achieved.
- Microsoft Exchange Load Generator was used to simulate Outlook 2007 online users while various MirrorView configurations were tested, and also while several simulated failures were introduced to the environment.
- An ANUE distance simulator was used to simulate the 50 km distance between sites.
- Inter-site link failures were simulated by blocking Fibre Channel Inter Switch Link (ISL) switch ports.
- CLARiiON CX3-40f intended failures were simulated by the physical removal of hardware during tests.
- Between each test, Exchange databases were restored to the original load generation initialized state to ensure the accuracy of comparison between tests.
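To put the simulated 50 km distance in perspective, the propagation delay it adds can be estimated with simple arithmetic. The sketch below assumes roughly 5 microseconds per kilometer for light in optical fiber, which is a common rule of thumb and not a figure from this document. It covers propagation only: switch, protocol, and array processing time are excluded, and a replicated write may need more than one round trip depending on the protocol.

```python
# Assumption (not from this document): light in optical fiber travels at
# roughly 200,000 km/s, or about 5 microseconds per kilometer.
FIBER_US_PER_KM = 5.0


def round_trip_us(distance_km: float, us_per_km: float = FIBER_US_PER_KM) -> float:
    """Return the propagation-only round-trip delay in microseconds."""
    return 2 * distance_km * us_per_km


if __name__ == "__main__":
    delay = round_trip_us(50)
    print(f"50 km round trip: about {delay:.0f} microseconds")
```

Under that assumption, each round trip over the simulated link adds about half a millisecond to a synchronous write before the server can be acknowledged.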
Observations

This solution was focused on building and validating an environment that supports 12,000 heavy Exchange 2007 users, while also demonstrating the remote synchronous replication and disaster recovery capabilities of EMC software in an Enterprise Microsoft Exchange 2007 environment where an EMC CLARiiON CX3-40f is used for storage and consolidation.

The following observations were noted:
- CLARiiON CX3-40f storage processor utilization never exceeded acceptable levels, even during scenarios where a storage processor failure was simulated and the entire workload was handled by the single remaining storage processor.
- Microsoft Exchange Load Generator reports passed every test, including during failure scenarios.
- Creation and restore of all replicas using Replication Manager completed as expected.
- MirrorView/S consistency group technology was successfully verified by recovering Exchange 2007 databases on the remote site using the RecoverCMS process. The recovery process took less than 120 seconds on the pre-prepared DR standby MSCS cluster.
Chapter 4 Conclusion

Building an Enterprise Exchange 2007 environment that encompasses synchronous replication and recovery of data over a simulated metropolitan area of 50 km is a complicated endeavor. This reference architecture depicts a validated design using EMC CX3-40f storage systems, EMC MirrorView/S, EMC Replication Manager, and EMC SnapView. The combination of these elements provides a holistic approach to recovery that enables low-impact backups and rapid restores. The solution simplifies and automates local and remote replication while satisfying today's data protection needs.

To learn more about this and other solutions, contact an EMC representative or visit www.emc.com/solutions/microsoft.