Oracle Database 12c Rel. 2 Cluster Health Advisor - How it Works & How to Use it

Similar documents
Machine Learning Diagnostics Using Oracle Autonomous Health Framework

Oracle Database 18c Oracle Autonomous Health Framework O R A C L E W H I T E P A P E R F E B R U A R Y

Oracle Database 12c Release 2 Oracle Autonomous Health Framework O R A C L E W H I T E P A P E R M A R C H

Oracle 12c Grid Infrastructure Management Repository Everything You Wanted To Know

What s Next with Oracle RAC?

RAC Performance Monitoring and Diagnosis using Oracle Enterprise Manager. Kai Yu Senior System Engineer Dell Oracle Solutions Engineering

Oracle Real Application Clusters (RAC) Your way to the Cloud

Autonomous Health Framework User s Guide. 12c Release 2 (12.2)

Oracle Database 11g: Real Application Testing & Manageability Overview

Exadata Monitoring and Management Best Practices

Exadata Database Machine: 12c Administration Workshop Ed 2

Exadata Database Machine: 12c Administration Workshop Ed 2

Exadata Database Machine: 12c Administration Workshop Ed 2 Duration: 5 Days

Oracle Database 12c R2: RAC Administration Ed 2

End-to-end Management with Grid Control. John Abrahams Technology Sales Consultant Oracle Nederland B.V.

Oracle Database 12c Performance Management and Tuning

<Insert Picture Here> Managing Oracle Exadata Database Machine with Oracle Enterprise Manager 11g

Database Performance Analysis Techniques Using Metric Extensions and SPA

Exdata Database Machine: 12c Administration Workshop Ed 2

Exadata Database Machine: 12c Administration Workshop Ed 1

ORACLE ENTERPRISE MANAGER 10g ORACLE DIAGNOSTICS PACK FOR NON-ORACLE MIDDLEWARE

Oracle Diagnostics Pack For Oracle Database

Exadata Database Machine Administration Workshop

Oracle Trace File Analyzer (TFA) Diagnostics for the Cloud

Copyright 2018, Oracle and/or its affiliates. All rights reserved.

Oracle Database 11g: RAC Administration Release 2 NEW

Katharina Römer Principal Sales Consultant STCC Stuttgart ORACLE Deutschland GmbH

Help Us Help You - TFA Collector and the Support Tools Bundle

Oracle Database 11g: Self-Managing Database - The Next Generation

ORACLE RAC DBA COURSE CONTENT

Oracle Database 10G. Lindsey M. Pickle, Jr. Senior Solution Specialist Database Technologies Oracle Corporation

Oracle Database 10g The Self-Managing Database

Oracle - Exadata Database Machine: 12c

How to Troubleshoot Databases and Exadata Using Oracle Log Analytics

Oracle 11g: RAC and Grid Infrastructure Administration Accelerated Release 2

Oracle 11g Release 2 RAC & Grid Infrastructure Administration Course Overview

Oracle Database 12c: Clusterware & RAC Admin Accelerated Ed 1

Oracle Database 11g: New Features for Administrators Release 2

Oracle RAC Course Content

Oracle Clustering: Oracle 11g Real Application Clusters for Administrators

Oracle Real Application Clusters Handbook

Oracle 11g: RAC and Grid Infrastructure Administration Accelerated Release 2

Oracle Database 12c Administration I

Oracle Database 12c: Performance Management and Tuning

Oracle Database 11g: New Features for Administrators DBA Release 2

Enterprise Manager: Scalable Oracle Management

EZY Intellect Pte. Ltd., #1 Changi North Street 1, Singapore

ORACLE 11gR2 DBA. by Mr. Akal Singh ( Oracle Certified Master ) COURSE CONTENT. INTRODUCTION to ORACLE

Load Testing and Monitoring Oracle Real Application Clusters (RAC)

Oracle Database 12c: RAC Administration Ed 1

ORACLE DIAGNOSTICS PACK

Page 1 نشانی: تهران خیابان شهید بهشتی نرسیده به قائم مقام فراهانی پالک طبقه تلفن: فکس:

ORACLE 11g R2 New Features

Understanding Oracle RAC ( ) Internals: The Cache Fusion Edition

Leng Leng Tan Vice President Server Manageability and Diagnosability Oracle Corporation. Arvind Gidwani Kothandapani Subramaniyam

IT115: Oracle Database 12c Administration I

Oracle Database 12c: RAC Administration Ed 1 LVC

Oracle 1Z0-417 Exam Questions and Answers (PDF) Oracle 1Z0-417 Exam Questions 1Z0-417 BrainDumps

Microsoft SQL Server Fix Pack 15. Reference IBM

Oracle Database 11g: Manageability Overview. An Oracle White Paper August 2007

Oralogic Education Systems

Learning Objectives : This chapter provides an introduction to performance tuning scenarios and its tools.

Oracle 11g: RAC and Grid Infrastructure Administration Accelerated Release 2

Oracle Enterprise Manager 12c IBM DB2 Database Plug-in

Oracle Real Application Clusters (RAC) on Oracle Database 18c ORACLE WHITE PAPER FEBRUARY 2018

Toad for Oracle Suite 2017 Functional Matrix

MAXGAUGE for Oracle Web Version 5.3

Copyright 2012, Oracle and/or its affiliates. All rights reserved. Insert Information Protection Policy Classification from Slide 12

Oracle Database 11g: New Features for Oracle 9i DBAs

Oracle Grid Infrastructure 12c Release 2 Cluster Domains O R A C L E W H I T E P A P E R N O V E M B E R

Oracle 11g: RAC and Grid Infrastructure Administration Accelerated Release 2

Oracle 11g: RAC and Grid Infrastructure Administration Accelerated Release 2 NEW

Copyright 2012, Oracle and/or its affiliates. All rights reserved.

Oracle Database 11g: SQL Fundamentals I

Oracle Grid Infrastructure Cluster Domains O R A C L E W H I T E P A P E R F E B R U A R Y

Domain Services Clusters Centralized Management & Storage for an Oracle Cluster Environment Markus Flechtner

Oracle Enterprise Manager 12c Sybase ASE Database Plug-in

Monitoring & Tuning Azure SQL Database

Oracle 1Z0-515 Exam Questions & Answers

An Oracle White Paper June Managing Oracle Exadata with Oracle Enterprise Manager 12c

Optimizing SQL Transactions

The Role of Database Aware Flash Technologies in Accelerating Mission- Critical Databases

What you will learn. interface clients. Learn To:

Oracle Clusterware 12c Release 2 Technical Overview O R A C L E W H I T E P A P E R M A R C H

Oracle Autonomous Database. White Paper Series: The Industry s First Self-Repairing Database

"Charting the Course... Oracle Database 12c: Architecture & Internals. Course Summary

An Oracle White Paper June Manageability with Oracle Database 12c

Oracle Exadata Recipes

Exadata Database Machine Administration Workshop NEW

What is Real Application Testing?

<Insert Picture Here> Exadata MAA Best Practices Series Session 1: E-Business Suite on Exadata

Oracle Database 11g: Real Application Testing & Manageability Overview. An Oracle White Paper June 2007

Safe Harbor Statement

Oracle Database 11g for Experienced 9i Database Administrators

Storage Monitoring Made Easy for DBAs: Diagnosing Performance Problems. Senior Product Manager Consulting Member of Technical Staff

Quality of Service Management

Oracle Database 18c: Quality of Service Management

Oracle TimesTen In-Memory Database 18.1

Oracle Clusterware 18c Technical Overview O R A C L E W H I T E P A P E R F E B R U A R Y

Technical Overview. Jack Smith Sr. Solutions Architect

Transcription:

Oracle Database 12c Rel. 2 Cluster Health Advisor - How it Works & How to Use it Mark V. Scardina - Director Oracle QoS Management & Oracle Autonomous Health Framework

Agenda 1 2 3 4 5 6 7 Introduction Architecture and Operation Details Using CHA from the Command Line Using CHA from EMCC for Alerts and Corrective Actions Using the CHA GUI to Perform Root-Cause Analysis Calibrating CHA to your RAC deployment Q & A Further Information 3

Agenda 1 2 3 4 5 6 7 Introduction Architecture and Operation Details Using CHA from the Command Line Using CHA from EMCC for Alerts and Corrective Actions Using the CHA GUI to Perform Root-Cause Analysis Calibrating CHA to your RAC deployment Q & A Further Information 4

Oracle s Database and Clusterware Tools What if issues were detected before they had an impact? What if you were notified with a specific diagnosis and corrective actions? What if resource bottlenecks threatening SLAs were identified early? What if bottlenecks could be automatically relieved just in time? What if database hangs and node reboots could be eliminated? Cluster Health Monitor Hang Manager EXAchk Quality of Service Management Memory Guard Cluster Verification Utility Cluster Health Advisor Trace File Analyzer ORAchk 5

Introducing Oracle 12c Autonomous Health Framework 6

Introducing Quality of Service Management Cluster Verification Utility ORAchk Oracle 12c Autonomous Health Framework Working For You Continuously Memory Guard Hang Manager Trace File Analyzer Cluster Health Advisor Cluster Health Monitor 7

Oracle Autonomous Health Framework Working for You Continuously Integrates next generation tools running as components - 24/7 Discovers Potential Issues and Notifies or takes Corrective Actions Speeds Issue Diagnosis and Recovery Preserves Database and Server Availability and Performance Autonomously Monitors and Manages resources to maintain SLAs 8

Discovers Potential Cluster & DB Problems - Notifies with Corrective Actions Oracle 12c Cluster Health Advisor 9

Introducing Oracle 12c Cluster Health Advisor Proactive Health Prognostics System Real time monitoring of Oracle RAC database systems and their hosts Early detection of impending as well as ongoing system faults Diagnoses and identifies the most likely root causes Provides corrective actions for targeted triage. Generates alerts and notifications for rapid recovery 10

Agenda 1 2 3 4 5 6 7 Introduction Architecture and Operation Details Using CHA from the Command Line Using CHA from EMCC for Alerts and Corrective Actions Using the CHA GUI to Perform Root-Cause Analysis Calibrating CHA to your RAC deployment Q & A Further Information 11

Cluster Health Advisor (CHA) Architecture Overview cha Cluster node resource Single Java ochad daemon per node Reads Cluster Health Monitor data directly from memory Reads DB ASH data from SMR w/o DB connection Uses OS and DB models and data to perform prognostics Stores analysis and evidence in the GI Management Repository Sends alerts to EMCC Incident Manager per target ochad OS Data CHM Node Health Prognostics Engine OS Model GIMR DB Data Database Health Prognostics Engine DB Model EMCC Alert 12

Cluster Health Advisor (CHA) Operation Overview SRVCTL lifecycle daemon management Enabled by default - Activates when 1 st RAC instance starts New CHACTL command line tool for all local operations Java Interactive GUI Tool available on OTN Integrated into EMCC Incident Manager and notifications Monitoring has no impact on database performance or availability EM Cloud Control ochad OS Data CHM Node Health Prognostics Engine OS Model GIMR DB Data Database Health Prognostics Engine DB Model Local to Cluster SRVCTL CHACTL Client CHA Java GUI Client 13

Cluster Health Advisor - Scope of Problem Detection Best Effort Immediate Guided Diagnosis Over 100 node and database problems have been modeled OS and DB metric predictors identified Problem network model created based upon its signature Problem Detection in 12.2.0.1 includes Interconnect, Global Cache and Cluster Problems Host CPU and Memory, PGA Memory stress IO Performance issues Reconfiguration and Recovery issues Workload and Session abnormal variations 14

Agenda 1 2 3 4 5 6 7 Introduction Architecture and Operation Details Using CHA from the Command Line Using CHA from EMCC for Alerts and Corrective Actions Using the CHA GUI to Perform Root-Cause Analysis Calibrating CHA to your RAC deployment Q & A Further Information 15

Using CHA From the Command Line Overview CHA is enabled by default Autonomously monitors nodes once a RAC DB starts in the cluster RAC or RAC One Node Database must be explicitly monitored opt in CHACTL Command Line supports Start/Stop Monitoring Diagnosing Health Issues and Corrective Actions Model Calibration, Activation and Lifecycle Management CHA Repository Data Lifecycle Management 16

Cluster Health Advisor Command Line Operations Monitoring Your Databases and Nodes with CHACTL Enable CHA monitoring on RAC database with optional model $ chactl monitor database db oltpacdb [-model model_name] Enable CHA monitoring on RAC database with optional verbose $ chactl status verbose monitoring nodes rwsbi06, rwsbi07 using model DEFAULT_CLUSTER monitoring database qosdo1, instances oltpacdb_1, oltpacdb_2 using model DEFAULT_DB 17

CHA Command Line Operations Checking for Health Issues and Corrective Actions with CHACTL QUERY DIAGNOSIS $ chactl query diagnosis -db oltpacdb -start "2016-02-01 02:52:50" -end "2016-02-01 03:19:15" 2016-10-28 01:47:10.0 Database oltpacdb DB Control File IO Performance (oltpacdb_1) [detected] 2016-10-28 01:47:10.0 Database oltpacdb DB Control File IO Performance (oltpacdb_2) [detected] 2016-10-28 02:59:35.0 Database oltpacdb DB Log File Switch (oltpacdb_1) [detected] 2016-10-28 02:59:45.0 Database oltpacdb DB Log File Switch (oltpacdb_2) [detected] Problem: DB Control File IO Performance Description: CHA has detected that reads or writes to the control files are slower than expected. Cause: The Cluster Health Advisor (CHA) detected that reads or writes to the control files were slow because of an increase in disk IO. The slow control file reads and writes may have an impact on checkpoint and Log Writer (LGWR) performance. Action: Separate the control files from other database files and move them to faster disks or Solid State Devices. Problem: DB Log File Switch Description: CHA detected that database sessions are waiting longer than expected for log switch completions. Cause: The Cluster Health Advisor (CHA) detected high contention during log switches because the redo log files were small and the redo logs switched frequently. Action: Increase the size of the redo logs. 18

Cluster Health Advisor Command Line Operations HTML Diagnostic Health Output Available (-html <file_name>) 19

Agenda 1 2 3 4 5 6 7 Introduction Architecture and Operation Details Using CHA from the Command Line Using CHA from EMCC for Alerts and Corrective Actions Using the CHA GUI to Perform Root-Cause Analysis Calibrating CHA to your RAC deployment Q & A Further Information 20

CHA has detected a service degradation due to higher than expected I/O latencies. 21

CHA has detected a service degradation due to higher than expected I/O latencies. Cluster Health Advisor CHA/DB Health CHA detected a for service degradation due to higher than expected I/O latencies. CHA/DB Health: I/O problem Cluster Health Advisor Problem Confidence Action The degradation is caused by a higher than expected utilization of shared storage devices for this database. No evidence of significant increase in I/O demand on the local node. 95.17% Validate whether there is increase in I/O demand on other nodes than the local and find I/O intensive SQL. Add more disks to disk group or move database to faster disks. proddb_1 proddb_2 22

Agenda 1 2 3 4 5 6 7 Introduction Architecture and Operation Details Using CHA from the Command Line Using CHA from EMCC for Alerts and Corrective Actions Using the CHA GUI to Perform Root-Cause Analysis Calibrating CHA to your RAC deployment Q & A Further Information 23

Using the CHA GUI to Perform Root-Cause Analysis Overview Standalone Java GUI Client Must be run on local cluster node Can be run against live GIMR or MDB (dump) file Used internally for development Will be available and maintained on Oracle Technology Network COMING 24

At Postbank we recently installed the Oracle Cluster Health Advisor (CHA) in the Beta release of the RDBMS and Grid Infrastructure 12.2.0.1, and started testing it. Installing CHA and activating the monitoring was simple and easy to manage. Detecting, correlating and diagnosing performance and availability issues proactively and automatically in our complex and dynamic Cloud environment is essential, and the approach taken with CHA looks consistent and promising. We also deployed the CHA GUI which has been designed specifically to visualize abnormalities in OS and RDBMS statistics data streams which are detected and diagnosed by CHA in real time. It stands out as a successful visualization of the real time and historical monitoring data and diagnostics in our Oracle RAC database environment. Jens-Christian Pokolm, Head of Databases Postbank Systems AG 25

Agenda 1 2 3 4 5 6 7 Introduction Architecture and Operation Details Using CHA from the Command Line Using CHA from EMCC for Alerts and Corrective Actions Using the CHA GUI to Perform Root-Cause Analysis Calibrating CHA to your RAC Deployment Q & A Further Information 26

Calibrating CHA to your RAC Deployment Overview Calibration Goal: Increase sensitivity and accuracy with sufficient warning Release ships with conservative models to minimize false warnings DEFAULT_CLUSTER for each cluster node DEFAULT_DB for each database instance Use your own data for periods of normal operations to increase sensitivity Recommended minimum 6 hour period Should include all normal workload phases for that model Models may be changed dynamically online using CHACTL 27

Calibrating CHA to your RAC deployment Choosing a Data Set for Calibration Defining normal $ chactl query calibration cluster timeranges start=2016-10-28 09:00:00,end=2016-10-28 13:00:00 Cluster name : rws0508-mb1 Start time : 2016-10-28 09:00:00 End time : 2016-10-28 13:00:00 Total Samples : 11524 Percentage of filtered data : 100% 1) Disk read (ASM) (Mbyte/sec) MEAN MEDIAN STDDEV MIN MAX 0.11 0.00 2.62 0.00 114.66 <25 <50 <75 <100 >=100 99.87% 0.08% 0.00% 0.02% 0.03% 2) Disk write (ASM) (Mbyte/sec) MEAN MEDIAN STDDEV MIN MAX 0.01 0.00 0.15 0.00 6.77 <50 <100 <150 <200 >=200 100.00% 0.00% 0.00% 0.00% 0.00% 3) Disk throughput (ASM) (IO/sec) MEAN MEDIAN STDDEV MIN MAX 2.20 0.00 31.17 0.00 1100.00 <5000 <10000 <15000 <20000 >=20000 100.00% 0.00% 0.00% 0.00% 0.00% 4) CPU utilization (total) (%) MEAN MEDIAN STDDEV MIN MAX 9.62 9.30 7.95 1.80 77.90 <20 <40 <60 <80 >=80 92.67% 6.17% 1.11% 0.05% 0.00% 28

Calibrating CHA to your RAC deployment Creating a new CHA Model with CHACTL Create and store the new model $ chactl query calibrate cluster model daytime timeranges start=2016-10-28 09:00:00, end=2016-10-28 13:00:00 Begin using the new model $ chactl monitor cluster model daytime Confirm the new model is being used $ chactl status verbose monitoring nodes rwsbi06, rwsbi07 using model daytime monitoring database qosdo1, instances oltpacdb_1, oltpacdb_2 using model DEFAULT_DB 29

Agenda 1 2 3 4 5 6 7 Introduction Architecture and Operation Details Using CHA from the Command Line Using CHA from EMCC for Alerts and Corrective Actions Using the CHA GUI to Perform Root-Cause Analysis Calibrating CHA to your RAC deployment Q & A Further Information 30

For Further Information Oracle 12c Autonomous Health Framework User s Guide Oracle 12c Clusterware Administration and Deployment Guide Oracle Autonomous Health Framework on OTN 31

32