RAC on Extended Distance Clusters
Erik Peterson, RAC Development, Oracle Corporation



Agenda
o Benefits of RAC on extended clusters
o Design considerations
o Empirical performance data
o Live customer examples
o Positioning w.r.t. Data Guard
o Summary

Benefits of RAC on Extended Clusters
o Full utilization of resources, no matter where they are located
   o All work gets distributed to all nodes
o Faster recovery from site failure than any other technology in the market
   o Work continues on the remaining site
[Diagram: Site A and Site B sharing one physical database]

Design Considerations
o Connectivity
o Disk mirroring
o Quorum
o Comparing alternatives
o Other considerations

Connectivity
o Redundant connections for public traffic, interconnect, and I/O
[Diagram: Site A and Site B linked by dual public connections, dual private interconnects, and dual SAN connections]

Connectivity
o Distances > 10 km require dark fiber (DWDM or CWDM)
   o Extra benefit of separate dedicated channels on one fiber
o Essential to set up buffer credits for large distances
[Diagram: Site A and Site B connected through a DWDM unit at each site]
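To give a feel for why buffer credits matter at distance, here is a rough sizing sketch. It is not part of the original slides. It assumes about 5 microseconds per km of one-way propagation in fiber, full-size Fibre Channel frames of about 2148 bytes, and roughly 100 MB/s of throughput per 1 Gbps of link speed. The function name and constants are hypothetical; switch vendor guidance should take precedence for real deployments.

```python
import math

# Assumptions (not from the slides): ~5 us/km one-way propagation in fiber,
# ~2148-byte full-size Fibre Channel frames, ~100 MB/s per 1 Gbps of link speed.
PROPAGATION_US_PER_KM = 5.0
FRAME_BYTES = 2148
BYTES_PER_SEC_PER_GBPS = 100e6


def estimate_buffer_credits(distance_km: float, link_gbps: float) -> int:
    """Estimate the buffer-to-buffer credits needed to keep a link busy.

    A sender can only have as many unacknowledged frames in flight as it
    has credits, so it needs enough credits to cover one full round trip.
    """
    round_trip_us = 2 * distance_km * PROPAGATION_US_PER_KM
    frame_time_us = FRAME_BYTES / (link_gbps * BYTES_PER_SEC_PER_GBPS) * 1e6
    return math.ceil(round_trip_us / frame_time_us)


if __name__ == "__main__":
    for km in (10, 25, 50, 100):
        credits = estimate_buffer_credits(km, link_gbps=2)
        print(f"{km:>3} km at 2 Gbps: about {credits} credits")
```

Under these assumptions a 2 Gbps link needs on the order of one credit per km, so a default allocation sized for a local fabric would leave a 100 km link idle most of the time.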

Connectivity Caveats
o Distance: single fiber limit (100 km?)
o Performance: need to minimize latency
   o Latency has a direct effect on synchronous disk mirroring and Cache Fusion operation
   o Use a direct point-to-point connection: additional routers, hubs, or extra switches add latency
o Cost: high cost of DWDM if not already present in the infrastructure

Disk Mirroring
o Need a copy of the data at each location
o Two options exist:
   o Host based mirroring (CLVM)
   o Remote array based mirroring

Host Based Mirroring
o Standard cluster-aware host based LVM solutions (requires a CLVM)
o Disks appear as one set
o All writes get sent to both sets of disks

Array Based Mirroring
o All I/Os get sent to one site and are mirrored to the other
o Example: EMC SRDF
o Longer outage in case of failure of the primary site
[Diagram: primary site mirrored to secondary site]

Mirroring Example: Large UK Bank
o 2 nodes, AIX
o Tested both:
   o 9 km, host based mirroring, Shark storage (<1 minute down)
   o 20 km, array based mirroring (PPRC) with ERCMF (extended remote copy facility), which avoids a manual restart by suspending I/Os until PPRC has done the switch (1-5 minutes down)

Cluster Quorum: Recommendations
o What happens if all communications between sites are lost?
o Use a third site for the quorum device for maximum availability
[Diagram: two main sites plus a third site holding the quorum device]
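The value of a third site can be illustrated with a small majority-vote sketch. It is a simplified model rather than the behavior of any particular clusterware: it assumes three voting disks and that surviving nodes need a strict majority of them, and it only covers a full site failure. Arbitration when both sites are up but cannot talk to each other is left to the clusterware and is not modeled. The disk and site names are hypothetical.

```python
from typing import Dict


def survivors_keep_quorum(voting_disks: Dict[str, str], failed_site: str) -> bool:
    """Return True if nodes outside failed_site still see a strict majority.

    voting_disks maps a voting disk name to the site that hosts it.
    Disks hosted at the failed site are assumed to be unreachable.
    """
    total = len(voting_disks)
    visible = sum(1 for site in voting_disks.values() if site != failed_site)
    return 2 * visible > total


# Two-site layout: the majority of voting disks has to live at one site.
two_site = {"vote1": "A", "vote2": "A", "vote3": "B"}
# Three-site layout: one voting disk per site, the third site acts as tie-breaker.
three_site = {"vote1": "A", "vote2": "B", "vote3": "C"}

if __name__ == "__main__":
    for name, layout in (("two-site", two_site), ("three-site", three_site)):
        for failed in ("A", "B"):
            ok = survivors_keep_quorum(layout, failed)
            print(f"{name}: site {failed} fails, survivors keep quorum: {ok}")
```

In the two-site layout, losing site A takes the majority of voting disks with it, so site B cannot continue on its own. With a voting disk at a third site, either main site can fail and the other still holds two of three votes.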

Primary/Primary or Primary/Secondary?
o Active/Active is different from Primary/Primary
o Primary means the site continues without restart should the other site fail
o Primary/Primary is preferred, but requires:
   o A 3rd site for quorum, and
   o Either 10gR2 Oracle Clusterware and ASM for mirroring, or 3rd-party clusterware plus host based mirroring
o Primary/Secondary if any of these conditions exist:
   o No 3rd site
   o Array based mirroring
   o 9i or 10gR1 Oracle Clusterware
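The rule on this slide can be written as a short decision function. This is only a sketch of the slide's conditions: the function name and the string labels for clusterware and mirroring types are hypothetical, and any combination the slide does not list as supporting Primary/Primary is treated as Primary/Secondary, which is an assumption.

```python
def site_roles(has_third_site: bool, clusterware: str, mirroring: str) -> str:
    """Classify an extended cluster design using the slide's conditions.

    clusterware: "oracle_9i", "oracle_10gR1", "oracle_10gR2", or "third_party"
    mirroring:   "asm", "host", or "array"
    (These labels are illustrative, not product identifiers.)
    """
    oracle_stack = clusterware == "oracle_10gR2" and mirroring == "asm"
    third_party_stack = clusterware == "third_party" and mirroring == "host"
    if has_third_site and (oracle_stack or third_party_stack):
        return "Primary/Primary"
    # Assumption: everything else falls back to Primary/Secondary.
    return "Primary/Secondary"


if __name__ == "__main__":
    print(site_roles(True, "oracle_10gR2", "asm"))    # Primary/Primary
    print(site_roles(False, "oracle_10gR2", "asm"))   # Primary/Secondary
    print(site_roles(True, "third_party", "array"))   # Primary/Secondary
```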

Clusterware Specifics

Clusterware      Versions    Primary/Primary  Limitations
Oracle           9i, 10gR1   N                64 nodes
Oracle           10gR2       Y                100 nodes
Veritas          All         Y                8 (16) nodes; not supported on Linux
HP ServiceGuard  All         Y                16 nodes < 10 km, 2 nodes > 10 km; not supported on Linux
IBM HACMP        All         Y?
Sun Cluster      All         Y                8 nodes

10gR2 Extended RAC on Pure Oracle Stack
o Any site can fail, and the system continues
o Support for generic NFS for the 3rd voting disk (currently limited to Linux)
o ASM used for mirroring DB files
[Diagram: two primary sites, each holding DB files (ASM), OCR, and a voting disk; a third site, reached over a WAN, holds the third voting disk (mounted via NFS or iSCSI)]

Current Limitations of ASM for Extended RAC Mirroring
o Should connectivity between sites be lost, ASM will need to do a full resilvering of lost volumes. Partial resilvering is not yet supported.
o ASM currently will read from any available disk group. No optimization is done to do local reads.
o Both items are currently done by some cluster-aware LVMs, and are planned for a future release of ASM.

Other Considerations
o Needs to look like a local cluster to Oracle, i.e., sharing subnets for:
   o Private interconnect
   o Public VIPs

Performance

Empirical Performance Data
o Unit tests (Oracle/HP test results)
   o Cache Fusion
   o I/O
o Overall application tests (from 4 different sets of tests)

Empirical Performance Data: Cache Fusion Unit Test
[Chart: memory-to-memory block transfer latency (ms, scale 0 to 6) versus distance (local, 25 km, 50 km, 100 km). Series: Low Load - 1 or 2 IC; High Load - 1 IC; High Load - 2 IC]
o ~1 ms increased memory-to-memory block transfer latency over 100 km for all cases
o Results from joint Oracle/HP testing

Empirical Performance Data: I/O Unit Test
[Chart: I/O latency (ms, scale 0 to 14) versus distance (local, 50 km, 100 km)]
o I/O latency increased by 43% over 100 km
o Note: without buffer credits this tested at 120-270% I/O latency degradation
o Results from joint Oracle/HP testing

Empirical Performance Data: Overall Results, Joint Oracle/HP Testing
For 100 km:
o Memory-to-memory messaging latency increased ~1 ms
o I/O latency increased in the ballpark of 43%. This is ~3-4 ms
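A back-of-the-envelope check helps put these figures in context. The sketch below is not part of the original test results. It assumes about 5 microseconds per km of one-way propagation in fiber and uses hypothetical function names. Under that assumption, one round trip over 100 km costs about 1 ms, which is in line with the measured messaging increase, and a 3-4 ms increase that equals 43% implies a local I/O latency of roughly 7 to 9 ms.

```python
# Assumption (not from the slides): ~5 us/km one-way propagation in fiber.
PROPAGATION_US_PER_KM = 5.0


def round_trip_ms(distance_km: float) -> float:
    """Propagation delay for one round trip over the given distance, in ms."""
    return 2 * distance_km * PROPAGATION_US_PER_KM / 1000.0


def implied_local_latency_ms(increase_ms: float, increase_fraction: float) -> float:
    """Local latency implied by an absolute increase and its relative size."""
    return increase_ms / increase_fraction


if __name__ == "__main__":
    for km in (25, 50, 100):
        print(f"{km:>3} km round trip: {round_trip_ms(km):.2f} ms")
    low = implied_local_latency_ms(3.0, 0.43)
    high = implied_local_latency_ms(4.0, 0.43)
    print(f"Implied local I/O latency: {low:.1f} to {high:.1f} ms")
```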

Empirical Performance Data: Overall Application Effect
[Chart: % of local performance (0% to 100%) versus distance (local, 25 km, 50 km, 100 km). Series: No Buffer Credits]
o Untuned example without buffer credits

Empirical Performance Data: Overall Application Effect
[Chart: % of local performance (0% to 100%) versus distance (local, 20/25 km, 50 km, 100 km). Series: Veritas RAC Test; IBM/Oracle RAC Test]
o Tuned example with buffer credits
o Note: differences in results are due to differences in test cases, not in the clusterware used

Oracle 9i - Live Customer Examples

Name                            Release  Nodes  Platform  OS       Clusterware       Stretch Distance (km)
European Electronics firm       9i       2      IBM       AIX      HACMP             8
US Police Department            9i       2      IBM       AIX      HACMP             3
European Government             9i       2      IBM       AIX      HACMP             8
US Broadcaster                  9i       2      IBM       AIX      HACMP             0.2
Austrian Hospital               9i       2      IBM       AIX      HACMP             0.6
Brazilian Credit Union Network  9i       3      IBM       AIX      HACMP             10
UzPromStroyBank                 9i       2      IBM       AIX      HACMP             1.7
US Fortune 100 firm             9i       2      HP        HP-UX    HP Service Guard  2
Brazilian Hospital              9i       2      HP        HP-UX    HP Service Guard  0.5
North American Lottery          9i       4      HP        OpenVMS                    10
European Mobile Operator        9i       3      Sun       Solaris  Veritas Cluster   48
Comic Relief                    9i       3      Sun       Solaris  Sun Cluster       8
German Bank                     9i       2      Sun       Solaris                    12
European Mail                   9i       2      Sun       Solaris  Veritas Cluster   12
European Government             9i       2      Sun       Solaris  Sun Cluster       0.4
UK University                   9i       2      Sun       Solaris  Sun Cluster       0.8
Italian Telco                   9i       2      Sun       Solaris  Sun Cluster       2
Austrian Railways               9i       2      HP        Tru64    TruCluster        1.5
Nordac/Draeger                  9i       4      HP        Tru64    TruCluster        0.3
University of Melbourne         9i       3      HP        Tru64    TruCluster        0.8

Oracle 10g - Live Customer Examples

Name                                  Release  Nodes  Platform  OS       Clusterware  Stretch Distance (km)
Italian Financial Services firm       10g      20     IBM       AIX      HACMP        0.2
Groupe Diffusion Plus                 10g      2      IBM       AIX      Oracle       0.5
Austrian IT Services Provider         10g      2      IBM       AIX      HACMP        1
Daiso Sangyo                          10g      2      HP        HP-UX    Oracle       10
Italian Manufacturer                  10g      4      HP        Linux    Oracle       0.8
Swedish Automotive Parts              10g      2      IBM       Linux    Oracle       2
Austrian Health Provider              10g      2      IBM       Linux    Oracle       0.3
Thomson Legal                         10g      8      Sun       Linux    Oracle       1
German Telecom                        10g      4      Sun       Solaris  Sun Cluster  5
European Bank                         10g      2      Sun       Solaris  Oracle       5
European Electronics Components firm  10g      2      IBM       Windows  Oracle       0.5

RAC on Extended Clusters: Positioning w.r.t. Data Guard

Additional Benefits Data Guard Provides
o Greater disaster protection
   o Greater distance
   o Additional protection against corruptions
o Better for planned maintenance
   o Full rolling upgrades
o More performance neutral at large distances
   o Option to do asynchronous
o If you cannot handle the costs of a DWDM network, Data Guard still works over cheap standard networks.

When does it not work well?
o Distance is too great
   o No fixed cutoff, but as distance increases you are slowing down both Cache Fusion and I/O activity. The impact of this will vary by application.
   o Prototype first if doing this over ~50 km.
o Public networks
   o Too much latency added between the nodes.

Summary: RAC on Extended Cluster
o It works! Proven at customer sites and partner labs.
o Good design is key! Bad design can lead to a badly performing system.
o Data Guard offers additional benefits

References
o Roland Knapp, Daniel Dibbets, Amit Das, Using standard NFS to support a third voting disk on a stretch cluster configuration on Linux, September 2006
o EMEA Joint Solutions Center Oracle/IBM, 10gRAC Release2 High Availability Test Over 2 distant sites on xSeries, July 2005
o Paul Bramy (Oracle), Christine O'Sullivan (IBM), Thierry Plumeau (IBM) at the EMEA Joint Solutions Center Oracle/IBM, Oracle9i RAC Metropolitan Area Network implementation in an IBM pSeries environment, July 2003
o Veritas, VERITAS Volume Manager for Solaris: Performance Brief Remote Mirroring Using VxVM, December 2003
o HP Oracle CTC, Extended Serviceguard cluster configurations. Detailed configuration information for extended RAC on HP-UX clusters, November 2003
o Mai Cutler (HP), Sandy Gruver (HP), Stefan Pommerenk (Oracle), Eliminate the Current Physical Restrictions of a Single Oracle Cluster, OracleWorld San Francisco 2003
o Joseph Algieri & Xavier Dahan (HP), Extended MC/ServiceGuard cluster configurations (Metro clusters), Version 1.4, January 2002
o Michael Hallas and Robert Smyth, Comic Relief Red Nose Day 2003 (RND03), Installing a Three-Node RAC Cluster in a Dual-Site Configuration using an 8 Km DWDM Link, Issue 1, April 2003
o Ray Dutcher, Oracle9i Data Guard: Primary Site and Network Configuration Best Practices, October 2003
o Joseph Meeks, Michael T. Smith, Ashish Ray, Sadhana Kyathappala, Oracle Data Guard 10g Release 2 Fast-Start Failover Best Practices, November 2005
o Tim Read, Architecting Availability & Disaster Recovery Solutions, Sun BluePrints OnLine, April 2006

Questions, Answers, Discussion