A Guide to Architecting the Active/Active Data Center

Similar documents
ScaleArc for SQL Server

Technology Overview ScaleArc. All Rights Reserved.

Highly Available Database Architectures in AWS. Santa Clara, California April 23th 25th, 2018 Mike Benshoof, Technical Account Manager, Percona

Cloud Confidence: Simple Seamless Secure. Dell EMC Data Protection for VMware Cloud on AWS

Software Defined Storage for the Evolving Data Center

The Key to Disaster Recovery

VERITAS Volume Replicator. Successful Replication and Disaster Recovery

MODERNIZE INFRASTRUCTURE

EBOOK. FROM DISASTER RECOVERY TO ACTIVE-ACTIVE: NuoDB AND MULTI-DATA CENTER DEPLOYMENTS

Veritas InfoScale Enterprise for Oracle Real Application Clusters (RAC)

Azure Webinar. Resilient Solutions March Sander van den Hoven Principal Technical Evangelist Microsoft

Data Sheet: Storage Management Veritas Storage Foundation for Oracle RAC from Symantec Manageability and availability for Oracle RAC databases

IBM TS7700 grid solutions for business continuity

High Noon at AWS. ~ Amazon MySQL RDS versus Tungsten Clustering running MySQL on AWS EC2

Real-time Protection for Microsoft Hyper-V

SQL Server HA and DR: A Simple Strategy for Realizing Dramatic Cost Savings

Migrating a critical high-performance platform to Azure with zero downtime

White Paper. How to select a cloud disaster recovery method that meets your requirements.

RA-GRS, 130 replication support, ZRS, 130

SQL Azure. Abhay Parekh Microsoft Corporation

Focus On: Oracle Database 11g Release 2

A Digium Solutions Guide. Switchvox On-Premise Options: Is it Time to Virtualize?

Databases In the Cloud

Citrix SD-WAN for Optimal Office 365 Connectivity and Performance

Modernize Your Storage

CA ARCserve Backup. Benefits. Overview. The CA Advantage

Transform Availability

Disaster Recovery Is A Business Strategy

Maximum Availability Architecture: Overview. An Oracle White Paper July 2002

TECHNICAL WHITE PAPER - MAY 2017 MULTI DATA CENTER POOLING WITH NSX WHITE PAPER

MySQL HA Solutions Selecting the best approach to protect access to your data

How to Lift-and-Shift a Line of Business Application onto Google Cloud Platform

Easy VMware Disaster Recovery & Business Continuity in Amazon Web Services

An Oracle White Paper May Oracle VM 3: Overview of Disaster Recovery Solutions

Become a MongoDB Replica Set Expert in Under 5 Minutes:

Choosing a MySQL HA Solution Today. Choosing the best solution among a myriad of options

Managing Exchange Migration with Enterprise Vault

Virtualizing the SAP Infrastructure through Grid Technology. WHITE PAPER March 2007

The Data Protection Rule and Hybrid Cloud Backup

Solution Brief: Commvault HyperScale Software

Disaster Recovery Guide

Veritas Cluster Server from Symantec

HARNESSING THE HYBRID CLOUD TO DRIVE GREATER BUSINESS AGILITY

Protecting VMware vsphere/esx Environments with CA ARCserve

Carbonite Availability. Technical overview

High Availability through Warm-Standby Support in Sybase Replication Server A Whitepaper from Sybase, Inc.

Data Sheet: High Availability Veritas Cluster Server from Symantec Reduce Application Downtime

THE UTILITY OF DNS TRAFFIC MANAGEMENT

A Practical Guide to Cost-Effective Disaster Recovery Planning

Veritas Storage Foundation for Oracle RAC from Symantec

Verron Martina vspecialist. Copyright 2012 EMC Corporation. All rights reserved.

VERITAS Volume Replicator Successful Replication and Disaster Recovery

HOW TO AVOID STORAGE. Leveraging Flash Memory and Breakthrough Architecture to Accelerate Performance, Simplify Management, and Protect Your Data

INTRODUCING VERITAS BACKUP EXEC SUITE

Hybrid Cloud for Business Communications

EMC Business Continuity for Microsoft Applications

NEC Express5800 R320f Fault Tolerant Servers & NEC ExpressCluster Software

White Paper BC/DR in the Cloud Era

Data Protection for Cisco HyperFlex with Veeam Availability Suite. Solution Overview Cisco Public

Protecting Microsoft SQL Server databases using IBM Spectrum Protect Plus. Version 1.0

Veritas Volume Replicator Option by Symantec

Copyright 2012 EMC Corporation. All rights reserved.

VMware vsphere 4. The Best Platform for Building Cloud Infrastructures

Randy Pagels Sr. Developer Technology Specialist DX US Team AZURE PRIMED

Connectivity to Cloud-First Applications

Advanced Architecture Design for Cloud-Based Disaster Recovery WHITE PAPER

MySQL Multi-Site/Multi-Master Done Right

Hyper-Converged Infrastructure: Providing New Opportunities for Improved Availability

Aurora, RDS, or On-Prem, Which is right for you

Buyer s Guide: DRaaS features and functionality

WHITE PAPER. Header Title. Side Bar Copy. Header Title 5 Reasons to Consider Disaster Recovery as a Service for IBM i WHITEPAPER

Nutanix Tech Note. Virtualizing Microsoft Applications on Web-Scale Infrastructure

Protecting Mission-Critical Application Environments The Top 5 Challenges and Solutions for Backup and Recovery

WEBSCALE CONVERGED APPLICATION DELIVERY PLATFORM

Data Sheet: Storage Management Veritas Storage Foundation by Symantec Heterogeneous online storage management

MySQL Enterprise High Availability

Arcserve Unified Data Protection Virtualization Solution Brief

Perfect Balance of Public and Private Cloud

SOLUTION BRIEF Fulfill the promise of the cloud

Virtualization with Protection for SMBs Using the ReadyDATA 5200

Symantec NetBackup 7 for VMware

5/24/ MVP SQL Server: Architecture since 2010 MCT since 2001 Consultant and trainer since 1992

Copyright 2012 EMC Corporation. All rights reserved.

Integrating Macs with Microsoft DFS A Technical Best Practices Whitepaper

MySQL High Availability


Veritas Resiliency Platform: The Moniker Is New, but the Pedigree Is Solid

A Practitioner s Guide to Migrating Workloads to VMware Cloud on AWS

Virtual protection gets real

New England Data Camp v2.0 It is all about the data! Caregroup Healthcare System. Ayad Shammout Lead Technical DBA

What s New in VMware vsphere Availability

ARCHITECTING WEB APPLICATIONS FOR THE CLOUD: DESIGN PRINCIPLES AND PRACTICAL GUIDANCE FOR AWS

Virtual SAN and vsphere w/ Operations Management

Decrease IT Cost. Using SQL Server 2012 to. eguide. By Michael K. Campbell

Protecting Microsoft Hyper-V 3.0 Environments with Arcserve

Welcome to the. Migrating SQL Server Databases to Azure

Market Report. Scale-out 2.0: Simple, Scalable, Services- Oriented Storage. Scale-out Storage Meets the Enterprise. June 2010.

Microsoft SQL Server HA and DR with DVX

MAXIMIZE YOUR NUTANIX ROI WITH SURELINE SUREedge

Copyright 2012 EMC Corporation. All rights reserved.

Transcription:

White Paper A Guide to Architecting the Active/Active Data Center 2015 ScaleArc. All Rights Reserved.

White Paper The New Imperative: Architecting the Active/Active Data Center Introduction With the average cost of downtime per hour estimated at a whopping $163,674 1, ensuring continuous availability for applications has become a top business priority. Aware of the business ramifications, organizations have gone to great lengths to keep their databases up and running, building disaster recovery (DR) systems to minimize the damage an outage can create. Because of cost pressures, most organizations today are relegated to running active/standby systems. These architectures can take hours to restore operations and recover essential data following a failure. Within a single data center, line of business application recovery can take up to 30 minutes even when adequately set up for DR. For application processes that must run across data centers, recovery time can multiply, adding up to hours of downtime, with lost revenues and unmet service level agreements (SLAs) driving costs higher. By their very design, active/standby scenarios depend on a secondary site that sits idle until a disaster hits. For most organizations, the operational cost of moving to active/active architectures has been too high to justify such a design requires sophisticated application, database, and networking configuration. As a result, few organizations have implemented this approach. New technologies, however, are making it easier and more affordable to architect for active/active operations. The operational cost of moving to active/active architectures has been too high to justify. New technologies, however are making it easier and more affordable to build active/ active operations. Active/Active Data Centers The Gold Standard Running two or more data centers in active/active mode means all data centers can serve application traffic at the same time. In addition to enabling continuous app availability, running the additional data center(s) in active mode also delivers increased capacity and better performance. This additional capacity improves the economics of active/active operations, since the systems are delivering throughput vs. sitting idle. Implementing an active/active architecture requires more than just replicating the database from one data center to another; applications themselves must understand the database infrastructure and know details such as which servers are closer when clusters span different data centers. Without this insight into database topology and locality, applications can t send load optimally. With random load balancing, the application will send as much traffic to the remote data center as the local one, increasing WAN traffic and slowing application performance. Furthermore, understanding locality for writes is essential for supporting cross-data center failover. Applications need to understand both read and write locality reads should be sent to the closest available local server within the same data center, and writes should be sent to the current primary server in the appropriate data center. 1 Aberdeen Group research report, Downtime and Data Loss: How Much Can You Afford? 8/16/2013 2015 ScaleArc. All Rights Reserved. 1

Existing Approaches to Supporting Active/Active Operations The three most common approaches to architecting active/active operations are building mechanisms within the application itself, relying on the database, and creating an abstraction layer. White Paper The New Imperative: Architecting the Active/Active Data Center 1. Building Mechanisms for Active/Active Ops into the Application Building support for active/active operations into the application takes a huge amount of engineering resources and continual maintenance. With systems being accessed not only synchronously (via user devices) but also asynchronously (via API calls from third-party services), supporting user and device locality within applications is highly complex and labor intensive. These architectures get exponentially more complex when you want to span more than two regions. Overall, this approach has suffered from such a poor ROI that few organizations have embraced it. 2. Relying on the Database for Active/Active Ops Relying on the database to achieve active/active operations usually requires a third-party cluster management system (such as Galera) for master/master capabilities. This approach creates a slew of issues, including application consistency restraints that can negatively impact the user experience. In addition, this approach replicates all writes in all data centers, which decreases its value from a capacity standpoint. Building support for active/active operations into the application takes a huge amount of engineering resources and continual maintenance. Few organizations have embraced the design, historically, because of their poor ROI. Building active/active ops within SQL Server has also proven very challenging. AlwaysOn Availability Groups in SQL Server 2012/2014 don t support location or replication awareness. The application is offered a random server for its connection the first replica in the list would frequently not be the closest available server, and without replication monitoring, its data could be stale. Both scenarios can cause performance and data consistency issues that are difficult to diagnose and resolve. In the same manner, because the AlwaysOn Availability Group Listener can t perform read load balancing, all read loads go to only the first available read replica, leaving other read replicas in the group idle. 3. Creating an Abstraction Layer to Support Active/Active According to the database consultancy Pythian, the abstraction layer approach provides a compelling alternative. This architecture gives you a way to abstract away all the database and application variations, says Aaron Lee, managing principal for DevOps at Pythian. Lee contends that for 80% of organizations, using database load balancing software offers the best solution for supporting active/active operations. The software which provides a range of traffic management capabilities including load balancing understands the database architecture, knows the SQL state, and manages the SQL queries based on this deep knowledge. The software supports replication and performance awareness, ensuring quality of service and consistency rules without having to build the capabilities into applications or dealing with the limitations of what a database natively supports. 2015 ScaleArc. All Rights Reserved. 2

White Paper The New Imperative: Architecting the Active/Active Data Center The database load balancing software creates an abstraction layer, separating the apps from the databases. The software delivers load in the most optimal way, with no application or database changes. How ScaleArc Enables Active/Active Operations The ScaleArc software provides a range of traffic management capabilities, enabling an agile data tier. The software supports four key features that make it possible for organizations to easily adopt and benefit from active/active operations: Automatic support for read/write split Geo-aware load balancing Replication-aware serving of data Auto failover within or between data centers This [abstraction layer] architecture gives you a way to abstract away all the database and application variations. -- Aaron Lee Managing Principal for DevOps, Pythian 1. Automatic support for read/write split ScaleArc has the ability to understand which queries are reads vs. writes and then appropriately distribute the traffic accordingly, sending all write transactions to the primary server and load balancing all reads across the available secondary servers. With ScaleArc, you don t need to modify your applications with configurations such as read-intent strings to get the performance and reliability benefits of read/write split. 2. Geo-aware load balancing ScaleArc uses Time To First Byte to understand server response times, so the software will send queries to the servers that are closer and will send more load to those better-performing servers automatically. 3. Replication-aware serving of data ScaleArc monitors replication lag through a replication monitoring engine. The software will automatically avoid sending traffic to a node that has fallen behind the user-set threshold for replication delay. 4. Auto failover within or between data centers ScaleArc leverages the failover architected in the database to provide automatic failover at the application layer. The software recognizes a failover is underway, queues inbound queries to avoid application errors, and then forward the queries once the secondary is promoted to primary. The software can also route read queries to other available secondaries throughout the failover, ensuring continuous application availability. Because the application is directed to communicate with the ScaleArc abstraction layer, changes in the database structure behind that layer are transparent to the application, enabling cross-data center failover with no application changes (see diagram). 2015 ScaleArc. All Rights Reserved. 3

White Paper The New Imperative: Architecting the Active/Active Data Center Site A Web Load Balancer Site B Web Load Balancer App Servers App Servers replication Figure 1. Running in HA mode in two data centers, ScaleArc sits transparently between the database and application servers. Since ScaleArc operates at the SQL protocol layer, it s also completely app agnostic. Apps connect to the ScaleArc instances in each data center, via a common DNS name. As a result, the application thinks it is always accessing the same database server regardless of the server location where ScaleArc forwards the queries. Impact of the Cloud With capacity elasticity and vertical scale, the cloud can be less expensive than on-premise deployments for implementing active/active from a capacity allocation standpoint. The cloud also makes it very simple to migrate operations to a new region to support moves or changing demand. With ScaleArc, you don t need to modify your applications with configurations such as readintent strings to get the performance and reliability benefits of read/write split. 2015 ScaleArc. All Rights Reserved. 4

White Paper Architecting Active/Active Data Centers However, without the ability to migrate IP addresses between machines, most existing HA methodologies don t work in the cloud, as automated failover must also be supported across data centers in different geographies. This shortcoming, along with smaller instances, can make achieving active/active operations more problematic in the cloud. ScaleArc solves this problem in the same manner it enables cross-data center failover for on-premise deployments. Because ScaleArc routes database traffic at Layer 7 or the Application layer, it can also automatically redirect traffic flows across data centers during failure. Reducing Risk and Adding Value ScaleArc s database load balancing software gives organizations a seamless and cost-effective way to implement active/active operations. With the ability to send load to multiple data centers, businesses can better utilize all database capacity. Rather than maintaining a cold DR site, organizations maximize performance and increase uptime with a second site that s always servicing traffic. A live secondary makes it possible to identify potential performance problems in the data stack in advance of a failure. ScaleArc s solution for active/active ops gives enterprises the confidence of knowing the secondary data center is ready to take load. In addition, because all data centers are now active, each site can be smaller. With support for distributed load, organizations don t require as many servers in the primary and secondary data centers about 30% fewer. If a failure happens, the site may slow slightly but will still remain operational. Active/standby operations cost 2.5x to 3x the cost of a single data center, accounting for full redundancy and the operational costs of connecting and maintaining the two sites. Active/active operations, on the other hand, cost about 1.4x to 1.8x, since each site operates less equipment. Organizations can leverage resources more efficiently, while ensuring system reliability and lowering costs. Most importantly, the active/active architecture ensures application continuity, delivering the most value to today s businesses, whether on-premise or in the cloud. To learn more about these best practices, see ScaleArc s Active/Active Operations webinar. ScaleArc s database load balancing software gives organizations a seamless and cost-effective way to implement active/active operations through built-in locality and replication awareness and automated cross-data center failover. 2901 Tasman Drive, Suite 205 Santa Clara, CA 95054 Phone: 1-408-780-2040 Fax: 1-408-427-3748 www.scalearc.com ScaleArc is the leading provider of database load balancing software. The ScaleArc software inserts transparently between applications and databases, creating an agile data tier that provides continuous availability and increased performance for all apps. With ScaleArc, enterprises also gain instant database scalability and a new level of real-time visibility for their application environments, both on prem and in the cloud. Learn more about ScaleArc, our customers, and our partners at www.scalearc.com. 2015 ScaleArc. All Rights Reserved. ScaleArc and the ScaleArc logo are trademarks or registered trademarks of ScaleArc in the United States and other countries. All brand names, product names, or trademarks belong to their respective holders. 1/20/15