Improve Data Integration with Changed Data Capture. An Oracle Data Integrator Technical Brief Updated December 2006

Similar documents
Technical Upgrade Guidance SEA->SIA migration

An Oracle White Paper October The New Oracle Enterprise Manager Database Control 11g Release 2 Now Managing Oracle Clusterware

An Oracle White Paper November Primavera Unifier Integration Overview: A Web Services Integration Approach

Bulk Processing with Oracle Application Integration Architecture. An Oracle White Paper January 2009

Managing Metadata with Oracle Data Integrator. An Oracle Data Integrator Technical Brief Updated December 2006

Oracle WebCenter Portal 11g Developer Workshop

Advanced Global Intercompany Systems : Transaction Account Definition (TAD) In Release 12

Oracle Fusion Middleware 11g Oracle Access Manager Frequently Asked Questions June 2009

Oracle WebCenter Portal 11g Developer Workshop

Oracle FLEXCUBE Direct Banking Release Dashboard Widgets Transfer Payments User Manual. Part No. E

Oracle Database Lite. Automatic Synchronization White Paper. An Oracle White Paper August 2008

Oracle Database 10g Release 2 Database Vault - Restricting the DBA From Accessing Business Data

Partitioning in Oracle Database 10g Release 2. An Oracle White Paper May 2005

Installation Instructions: Oracle XML DB XFILES Demonstration. An Oracle White Paper: November 2011

Oracle FLEXCUBE Direct Banking Release Corporate Cash Management User Manual. Part No. E

Frequently Asked Questions Oracle Content Management Integration. An Oracle White Paper June 2007

Oracle Enterprise Performance Management Cloud

Configuring Oracle Business Intelligence Enterprise Edition to Support Teradata Database Query Banding

Generate Invoice and Revenue for Labor Transactions Based on Rates Defined for Project and Task

Data Capture Recommended Operating Environments

Oracle Database Vault

August Oracle - GoldenGate Statement of Direction

Creating Custom Project Administrator Role to Review Project Performance and Analyze KPI Categories

Using the Oracle Business Intelligence Publisher Memory Guard Features. August 2013

Oracle Enterprise Data Quality New Features Overview

Performance and Scalability Benchmark: Siebel CRM Release 7 on HP-UX Servers and Oracle9i Database. An Oracle White Paper Released October 2003

Oracle CIoud Infrastructure Load Balancing Connectivity with Ravello O R A C L E W H I T E P A P E R M A R C H

Maximum Availability Architecture. Oracle Best Practices For High Availability

Correction Documents for Poland

Automatic Receipts Reversal Processing

Oracle Fusion General Ledger Hierarchies: Recommendations and Best Practices. An Oracle White Paper April, 2012

JD Edwards EnterpriseOne Licensing

An Oracle Technical White Paper September Oracle VM Templates for PeopleSoft

An Oracle Technical White Paper May Deploying Oracle Beehive with BlackBerry Enterprise Server for MDS Applications

Oracle Data Masking and Subsetting

Siebel CRM Applications on Oracle Ravello Cloud Service ORACLE WHITE PAPER AUGUST 2017

Veritas NetBackup and Oracle Cloud Infrastructure Object Storage ORACLE HOW TO GUIDE FEBRUARY 2018

Oracle WebCenter Portal 11g Developer Workshop

An Oracle White Paper Released April 2008

Oracle Data Provider for.net Microsoft.NET Core and Entity Framework Core O R A C L E S T A T E M E N T O F D I R E C T I O N F E B R U A R Y

Handling Memory Ordering in Multithreaded Applications with Oracle Solaris Studio 12 Update 2: Part 2, Memory Barriers and Memory Fences

Oracle Warehouse Builder 10g Release 2 Integrating Packaged Applications Data

An Oracle White Paper December, 3 rd Oracle Metadata Management v New Features Overview

April Understanding Federated Single Sign-On (SSO) Process

An Oracle White Paper. Released April 2013

Leverage the Oracle Data Integration Platform Inside Azure and Amazon Cloud

Oracle WebCenter Portal 11g Developer Workshop

An Oracle White Paper September Security and the Oracle Database Cloud Service

Technical White Paper August Recovering from Catastrophic Failures Using Data Replicator Software for Data Replication

Next-Generation SOA Infrastructure. An Oracle White Paper May 2007

Working with Time Zones in Oracle Business Intelligence Publisher ORACLE WHITE PAPER JULY 2014

Oracle WebCenter Suite Integrating Secure Enterprise Search

Oracle JD Edwards EnterpriseOne Object Usage Tracking Performance Characterization Using JD Edwards EnterpriseOne Object Usage Tracking

Achieving High Availability with Oracle Cloud Infrastructure Ravello Service O R A C L E W H I T E P A P E R J U N E

Adding Mobile Capability to an Enterprise Application With Oracle Database Lite. An Oracle White Paper June 2007

Oracle FLEXCUBE Direct Banking Release Dashboard Widgets Customer Services User Manual. Part No. E

Oracle DIVArchive Storage Plan Manager

Oracle NoSQL Database For Time Series Data O R A C L E W H I T E P A P E R D E C E M B E R

Sun Fire X4170 M2 Server Frequently Asked Questions

StorageTek ACSLS Manager Software Overview and Frequently Asked Questions

Loading User Update Requests Using HCM Data Loader

Oracle WebCenter Portal 11g Developer Workshop

Oracle WebLogic Portal O R A C L E S T A T EM EN T O F D I R E C T IO N F E B R U A R Y 2016

An Oracle White Paper June StorageTek In-Drive Reclaim Accelerator for the StorageTek T10000B Tape Drive and StorageTek Virtual Storage Manager

Data Capture Recommended Operating Environments

Oracle Data Integrator: An Executive Overview. An Oracle White Paper Updated January 2007

An Oracle Technical Article March Certification with Oracle Linux 4

PeopleSoft Applications Portal and WorkCenter Pages

Tutorial on How to Publish an OCI Image Listing

Oracle Application Server 10g Integration Interconnect. An Oracle Technical White Paper January 2005

WHAT S NEW IN ORACLE USER PRODUCTIVITY KIT PROFESSIONAL

An Oracle White Paper July Methods for Downgrading from Oracle Database 11g Release 2

Oracle Database 10g Workspace Manager Support for Oracle Spatial Topology Data Model. An Oracle White Paper May 2005

An Oracle White Paper October Minimizing Planned Downtime of SAP Systems with the Virtualization Technologies in Oracle Solaris 10

ORACLE S PEOPLESOFT GENERAL LEDGER 9.2 (WITH COMBO EDITING) USING ORACLE DATABASE 11g FOR ORACLE SOLARIS (UNICODE) ON AN ORACLE S SPARC T7-2 Server

Receiving PeopleSoft Message (PeopleTools 8.17) through the Oracle AS PeopleSoft Adapter. An Oracle White Paper September 2008

Oracle Cloud Applications. Oracle Transactional Business Intelligence BI Catalog Folder Management. Release 11+

An Oracle White Paper September Upgrade Methods for Upgrading to Oracle Database 11g Release 2

An Oracle White Paper October Advanced Compression with Oracle Database 11g

Oracle JD Edwards EnterpriseOne Object Usage Tracking Performance Characterization Using JD Edwards EnterpriseOne Object Usage Tracking

Oracle Fusion Middleware

An Oracle White Paper September, Oracle Real User Experience Insight Server Requirements

WHAT S NEW IN ORACLE USER PRODUCTIVITY KIT

Highly Available Forms and Reports Applications with Oracle Fail Safe 3.0

Load Project Organizations Using HCM Data Loader O R A C L E P P M C L O U D S E R V I C E S S O L U T I O N O V E R V I E W A U G U S T 2018

October Oracle Application Express Statement of Direction

Oracle Warehouse Builder 10g Release 2 What is an Expert?

Oracle Application Development Framework Overview

Oracle Application Server 10g Oracle XML Developer s Kit Frequently Asked Questions September, 2005

Oracle Fusion Configurator

Automatic Data Optimization with Oracle Database 12c O R A C L E W H I T E P A P E R S E P T E M B E R

Rapid Bottleneck Identification A Better Way to do Load Testing. An Oracle White Paper June 2008

Oracle Database Vault

Superior Product Variants Software for Multi-Attribute Product Companies. An Oracle White Paper April 2004

RAC Database on Oracle Ravello Cloud Service O R A C L E W H I T E P A P E R A U G U S T 2017

Oracle Grid Infrastructure 12c Release 2 Cluster Domains O R A C L E W H I T E P A P E R N O V E M B E R

Oracle Secure Backup. Getting Started. with Cloud Storage Devices O R A C L E W H I T E P A P E R F E B R U A R Y

VISUAL APPLICATION CREATION AND PUBLISHING FOR ANYONE

Oracle WebCenter Suite Provides Web 2.0 Services for Enterprise Developers. An Oracle White Paper October 2006

Create Individual Membership. This step-by-step guide takes you through the process to create an Individual Membership.

Transcription:

Improve Data Integration with Changed Data Capture An Oracle Data Integrator Technical Brief Updated December 2006

Improve Data Integration with Changed Data Capture: An Oracle Data Integrator Technical Brief Oracle Data Integrator s approach to integration clearly separates the declarative rules (the what ) from the actual implementation (the how ). This declarative design approach dramatically reduces the development and maintenance workload by up to five times over traditional ETL tools. EXECUTIVE OVERVIEW Today s integration project teams face the daunting challenge of deploying integrations that fully meet functional, performance, and quality specifications on time and within budget. Further, these processes must be maintainable over time, and the completed work should be reusable for further, broader integration initiatives. Traditional Extract, Transform, Load (ETL) tools closely intermix data transformation rules with the integration process procedures, requiring the development of both data transformations and data flow. Oracle Data Integrator takes a different approach to integration by clearly separating the declarative rules (the what ) from the actual implementation (the how ). With Oracle Data Integrator, declarative rules describing mappings and transformations are defined graphically, through a drag-and-drop interface, and stored independently from the implementation. Oracle Data Integrator automatically generates the data flow, which can be fine-tuned if required. This declarative design approach dramatically reduces the development and maintenance workload by up to five times over traditional ETL tools. This technical brief describes Oracle Data Integrator s use of Changed Data Capture to track changes in source data changes that are typically performed by other applications. When running integration interfaces, Oracle Data Integrator can reduce the volume of source data processed in the flow by extracting only the changed data. Then, only those changes are moved to target systems, tremendously improving the efficiency of data integration processes. Reducing the volume of source data is useful in many fields, such as data synchronization and replication. And it is essential when setting up an event-driven architecture for integration. INTRODUCTION The conventional approach to data integration involves extracting all data from the source system and then integrating the entire set possibly using an incremental strategy in the target system. This approach, which is suitable in most cases, can be inefficient when the integration process requires real-time data integration. In Improve Data Integration with Changed Data Capture: An Oracle Data Integrator Technical Brief Page 2

such situations, the amount of data involved makes data integration impossible in the given timeframes. Reducing the volume of the source data to only changed data is a major issue in many fields, such as data synchronization and replication. It is an absolute requirement when implementing event-driven integration processes. Basic solutions, such as filtering records according to a timestamp column or changed flag, are possible, but they might require modifications in the applications. In addition, they usually do not sufficiently ensure that all changes are taken into account. Oracle Data Integrator s Changed Data Capture identifies and captures data as it is being inserted, updated, or deleted from datastores, and it makes the changed data available for integration processes. Publish-and-Subscribe Model Changed Data Capture uses a publish-and-subscribe model. This model works in three steps: 1. An identified subscriber, usually an integration process, subscribes to changes that might occur in a datastore. Multiple subscribers can subscribe to these changes. 2. The Changed Data Capture framework captures changes in the datastore and then publishes them for the subscriber. 3. The subscriber an integration process can process the tracked changes at any time and consume these events. Once consumed, events are no longer available for this subscriber. Oracle Data Integrator processes datastore changes in two ways: Regularly in batches (pull mode) for example, processes new orders from the Web site every five minutes and loads them into the operational datastore (ODS) In real time (push mode) as the changes occur for example, when a product is changed in the enterprise resource planning (ERP) system, immediately updates the on-line catalog Improve Data Integration with Changed Data Capture: An Oracle Data Integrator Technical Brief Page 3

Figure 1: The Changed Data Capture (CDC) framework uses a publish-and-subscribe architecture. IMPLEMENTING CHANGED DATA CAPTURE Methods for Tracking Changes Oracle Data Integrator provides two methods for tracking changes from source datastores to the Changed Data Capture framework: triggers and relational database management system (RDBMS) log mining. The triggers method creates triggers on the source tables to track changes as data is inserted, updated, or deleted. This method can be implemented on most RDBMS, but it can have an impact on the transactional performance of the source systems. The second method involves mining the RDBMS logs, which are the internal change history of the database engine. This method has no effect on the system s transactional performance; it is database-specific. This method is supported out-ofthe-box for Oracle Database (through the Log Miner feature) and IBM DB2. The Changed Data Capture framework used to manage changes is generic and open. The change tracking method can be customized, and any third-party change provider can be used to load the framework with changes. Processing the Changes Oracle Data Integrator employs a powerful declarative design approach, Extract- Load, Transform (E-LT), which separates the rules from the implementation details. Its out-of-the-box integration interfaces use and process the tracked changes. Developers define the declarative rules for the captured changes within the integration processes in the Oracle Data Integrator Designer graphical user interface without having to code. With the Oracle Data Integrator Designer, customers declaratively specify set-based maps between sources and targets, and then the system automatically generates the data flow from the set-based maps. The technical processes required for processing the changes captured are implemented in Oracle Data Integrator s Knowledge Modules. Knowledge Modules are scripted modules that contain database and application-specific patterns. The runtime then interprets these modules and optimizes the instructions for targets. Improve Data Integration with Changed Data Capture: An Oracle Data Integrator Technical Brief Page 4

Ensuring Data Consistency Changes frequently involve several datastores at one time. For example, when an order is created, updated, or deleted, it involves both the orders table and the order lines table. When processing a new order line, the new order to which this line is related must be taken into account. Oracle Data Integrator provides a mode of tracking changes, called Consistent Set Changed Data Capture, for this purpose. This mode allows you to process sets of changes that guarantee data consistency. Oracle Data Integrator can reduce the volume of source data processed in the flow by extracting only the changed data. Then, only those changes are propagated to target systems, tremendously improving the efficiency of data integration processes. CONCLUSION Integrating data and applications throughout the enterprise, and presenting a unified view of them, is a complex proposition. Not only are there broad disparities in data structures and application functionality, but there are also fundamental differences in integration architectures. Some integration needs are data oriented, especially those involving large data volumes. Other integration projects lend themselves to an event-oriented architecture for asynchronous or synchronous integration. Changes tracked by Changed Data Capture constitute data events. The ability to track these events and process them regularly in batches or in real time is key to the success of an event-driven integration architecture. Oracle Data Integrator provides rapid implementation and maintenance for all types of integration projects. Improve Data Integration with Changed Data Capture: An Oracle Data Integrator Technical Brief Page 5

Improve Data Integration with Changed Data Capture: An Oracle Data Integrator Technical Brief Updated December 2006 Oracle Corporation World Headquarters 500 Oracle Parkway Redwood Shores, CA 94065 U.S.A. Worldwide Inquiries: Phone: +1.650.506.7000 Fax: +1.650.506.7200 oracle.com Copyright 2007, Oracle. All rights reserved. This document is provided for information purposes only and the contents hereof are subject to change without notice. This document is not warranted to be error-free, nor subject to any other warranties or conditions, whether expressed orally or implied in law, including implied warranties and conditions of merchantability or fitness for a particular purpose. We specifically disclaim any liability with respect to this document and no contractual obligations are formed either directly or indirectly by this document. This document may not be reproduced or transmitted in any form or by any means, electronic or mechanical, for any purpose, without our prior written permission. Oracle, JD Edwards, PeopleSoft, and Siebel are registered trademarks of Oracle Corporation and/or its affiliates. Other names may be trademarks of their respective owners.