CA ERwin Data Profiler

PRODUCT BRIEF: CA ERWIN DATA PROFILER

CA ERWIN DATA PROFILER HELPS ORGANIZATIONS LOWER THE COSTS AND RISK ASSOCIATED WITH DATA INTEGRATION BY PROVIDING REUSABLE, AUTOMATED, CROSS-DATA-SOURCE DISCOVERY, ANALYSIS AND PROFILING.

Overview

The costs associated with poorly documented, or completely undocumented, data sources often represent more than 50 percent of a project's overall budget, putting the success of any data warehouse (DW), master data management (MDM) or integration-oriented data management project at risk. CA ERwin Data Profiler can help you significantly reduce the time and effort involved in understanding and cross-referencing these data sources by assisting in the creation of accurate, reusable data models.

Benefits

CA ERwin Data Profiler reduces the time required to document and cross-reference data sources while increasing the accuracy and depth of your data foundation. As a result, you achieve a continual return on investment (ROI) in the form of reduced integration, delivery and maintenance costs, increased data quality and accuracy and the effective leveraging of strategic information assets for competitive business intelligence (BI) and mission-critical information governance.

The CA Advantage

A low-cost alternative that extends the value of data profiling technology, CA ERwin Data Profiler combines proven techniques with value-added capabilities to help you synchronize the data design within your database. With such functionality as primary key (PK) and foreign key (FK) discovery, multi-source overlap and mapping and metadata inference and exchange, the solution helps you better align your business with CA's Enterprise IT Management (EITM) vision.

When System Analysis Puts Strategic Data Initiatives at Risk

For most organizations, data-source discovery, analysis, documentation and comparison are major resource drains during data-intensive projects. Since the expenses associated with these ongoing tasks often represent a large percentage of a given project's overall cost, strategic initiatives are often delayed, reduced in scope or cancelled altogether due to a lack of funding and resources.

It is critical that you always know what data you have, where it is located, how it is structured and how it is related between disparate systems before implementing a data governance, data management or data integration project. Understanding your environment is extremely useful as a first step because it provides insight into your existing architecture and allows a data modeler to leverage the as-is structures to better design the to-be target. It also provides information that is critical for integrating or migrating your legacy data into a new environment.

For a variety of reasons, however, most companies do not have a good understanding of their existing distributed data landscape. As subject matter experts move on, design specifications disappear or become outdated, and mergers and acquisitions create massive redundancy and semantic reconciliation issues, you must still have a centralized understanding of where your data is located and how you can leverage it to ensure the success of your strategic data initiatives.

The Power of Reusable, Multi-source Data Analysis

A source system analysis and data profiling solution, CA ERwin Data Profiler is the newest addition to the CA ERwin Modeling family, offering integration with visual data modeling to enable data and metadata discovery, analysis, design, documentation and reuse in a single product family. It can be deployed throughout your organization for use in MDM, data migration, data integration, BI and DW projects.
The solution delivers robust data profiling management, cross-system overlap analysis and data model reconciliation that includes:

- Support for Open Database Connectivity (ODBC) and flat-file data sources
- Data profiling capabilities to analyze individual columns within tables
- Fully automated discovery of primary and foreign keys and orphaned rows
- Simultaneous cross-source data analysis of up to 20 data sources to identify overlapping and unique attributes and to discover attribute supersets and subsets
- Interactive, side-by-side previewing and comparing of record values within and between different data sources
- The ability to define and persist critical data elements within data sources
- Export of all discovered metadata for reuse with CA ERwin Data Modeler, which allows the design reality exposed in the instance data to be compared with the data model or database metadata

CA ERwin Data Profiler provides an entry-level product that ensures rapid and comprehensive ROI. By leveraging the solution, you can immediately improve the value of the information that drives your business, as well as increase your ability to respond to ongoing and future business needs.

CA ERwin Data Profiler also includes robust PK-FK discovery and cross-system overlap analysis. Integration with CA ERwin Data Modeler enables you to discover, audit and remediate data-related business rules and unknown design inconsistencies, as well as visualize the resultant discovered and inferred metadata in a data modeling environment, providing maximum reusability of the analyst's work product.

FIGURE A: AUTOMATED DISCOVERY OF PRIMARY KEY-FOREIGN KEY RELATIONSHIPS
The automated discovery and visualization of PK-FK relationships accelerates the understanding and documentation of legacy or loosely defined data structures.

Data Profiling in Your Unique IT Environment

With support for a wide variety of environments, databases and source systems, CA ERwin Data Profiler offers the right combination of flexibility and compatibility, helping you leverage your unique IT components to achieve strategic data initiatives.

SUPPORTED ENVIRONMENT
- Microsoft Windows XP

SUPPORTED DATABASES FOR REPOSITORY AND STAGING AREAS
- Oracle 10g
- IBM DB2 UDB 8.1

SUPPORTED SOURCE SYSTEMS FOR DATA PROFILING AND ANALYSIS
- Relational databases
- Spreadsheets
- VSAM
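The brief does not describe how the product infers keys, so the following is only a minimal sketch of the general idea behind PK-FK and orphaned-row discovery from instance data. It assumes rows are held in memory as Python dictionaries; the function names, table names and column names are hypothetical and are not part of CA ERwin Data Profiler.

```python
def candidate_primary_keys(rows, columns):
    """Return the columns whose values are non-null and unique in every row."""
    keys = []
    for col in columns:
        values = [row[col] for row in rows]
        if None not in values and len(set(values)) == len(values):
            keys.append(col)
    return keys


def orphaned_rows(child_rows, child_col, parent_rows, parent_col):
    """Return child rows whose non-null child_col value has no match in parent_col."""
    parent_values = {row[parent_col] for row in parent_rows}
    return [
        row for row in child_rows
        if row[child_col] is not None and row[child_col] not in parent_values
    ]


# Hypothetical sample data: two small tables from one source.
customers = [
    {"id": 1, "region": "East"},
    {"id": 2, "region": "East"},
    {"id": 3, "region": "West"},
]
orders = [
    {"order_id": 10, "customer_id": 1},
    {"order_id": 11, "customer_id": 3},
    {"order_id": 12, "customer_id": 9},
]

pk = candidate_primary_keys(customers, ["id", "region"])
orphans = orphaned_rows(orders, "customer_id", customers, "id")
print(pk)       # candidate key columns for customers
print(orphans)  # orders that reference a missing customer
```

In this sketch, a column is a foreign key candidate when few or none of its values are orphaned against a candidate primary key. A real tool would also need to handle composite keys and data type compatibility, which are left out here.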

Decreasing Data Analysis Cycles

By reducing the time required to document and cross-reference data sources and increasing the accuracy and depth of understanding of an organization's data foundation, CA ERwin Data Profiler facilitates ongoing ROI with:

- Reduced integration, delivery and maintenance costs
- Increased data quality and accuracy
- Improved utilization of strategic information assets for competitive BI and mission-critical data governance

CA ERwin Data Profiler includes a host of features and functionality engineered to help you not only gain a better understanding of your distributed data landscape, but also increase your level of control over important business information so you can effectively pursue and complete strategic data initiatives.

FIGURE B: FEATURES AND BENEFITS OF CA ERWIN DATA PROFILER
CA ERwin Data Profiler includes functionality that helps you better understand your distributed data landscape and effectively pursue strategic data initiatives.

Column Profiling and Analysis
  Capabilities: Automatically discovers standard column statistics
  Benefits: Improves speed and effectiveness of analysis cycles through consistent and standardized results

Primary Key-Foreign Key Discovery
  Capabilities: Automatically discovers, defines and visualizes relationships within a single data source
  Benefits: Provides insight into legacy data structures and documented metadata, so analysts can account for inherent relationships

Cross-system Attribute Overlap Analysis
  Capabilities: Identifies overlapping and unique attributes, as well as attribute supersets and subsets
  Benefits: Speeds reconciliation of disparate data sources and enables effective identification of transformation requirements

Data Synchronization Analysis
  Capabilities: Validates the global identifiers and ensures that cross-source alignment and synchronization can occur
  Benefits: Ensures data consistency across existing sources and aids in designing well-aligned target structures

CA ERwin Data Modeler Integration
  Capabilities: Creates CA ERwin Data Modeler data models based on metadata inferred from the instance data and profiling results
  Benefits: Provides reusable data for documentation, impact analysis and stakeholder visualization purposes
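As an illustration of the Column Profiling and Analysis feature listed above, the sketch below computes a handful of common column statistics for a single column. The definitions used are assumptions, since the brief does not define them: selectivity is taken as distinct non-null values divided by total rows, `None` stands for NULL and whitespace-only strings count as blank. The function name `profile_column` is hypothetical.

```python
from collections import Counter


def profile_column(values):
    """Compute a few standard statistics for one column of raw values."""
    total = len(values)
    non_null = [v for v in values if v is not None]
    # Assumption: a blank is a string that is empty or whitespace-only.
    blanks = sum(1 for v in non_null if isinstance(v, str) and v.strip() == "")
    counts = Counter(non_null)
    cardinality = len(counts)  # number of distinct non-null values
    mode, mode_count = counts.most_common(1)[0] if counts else (None, 0)
    return {
        "null_count": total - len(non_null),
        "non_null_count": len(non_null),
        "blank_count": blanks,
        "cardinality": cardinality,
        # Assumption: selectivity = distinct values / total rows.
        "selectivity": cardinality / total if total else 0.0,
        "min": min(non_null) if non_null else None,
        "max": max(non_null) if non_null else None,
        "mode": mode,
        "mode_pct": 100.0 * mode_count / total if total else 0.0,
    }


# Hypothetical sample column with one NULL and one blank value.
stats = profile_column(["NY", "CA", "NY", None, "", "TX"])
for name, value in stats.items():
    print(f"{name}: {value}")
```

A selectivity close to 1.0 together with a null count of zero is a common hint that a column may be a key candidate, which is one reason column statistics are usually gathered before key discovery.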

COLUMN PROFILING AND ANALYSIS

Helps reduce analysis cycles and increases their effectiveness through consistent and standardized results. In addition, this functionality enables the automatic discovery of standard column statistics, including:

- Data type
- Value frequency
- Length
- Precision
- Scale
- Formats
- Cardinality
- Selectivity
- Non-null selectivity
- Non-null count
- Min
- Max
- Mode
- Mode %
- Null count
- Blank count

PRIMARY KEY-FOREIGN KEY DISCOVERY

Offers fully automated discovery, definition and visualization of relationships within a single data source. Inferred relationships provide critical structural insights into legacy data structures, as well as points of comparison and confirmation for documented metadata, allowing your analysts and designers to account for inherent relationships when building target systems.

CROSS-SYSTEM ATTRIBUTE OVERLAP ANALYSIS

Performs an automated cross-compare of all columns across many data sources (up to 20) in order to establish a baseline of overlapping data. By leveraging this functionality, you can discover attribute supersets and subsets, as well as overlapping and unique attributes, to speed reconciliation of disparate data sources and effectively identify and document transformation requirements.
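The cross-compare described above can be pictured as a set comparison between the distinct values of every pair of columns from different sources. The sketch below is one simple way to do that under stated assumptions: each source is an in-memory mapping of column name to values, and the names `compare_columns`, `overlap_report`, `crm` and `billing` are hypothetical. It is not the product's algorithm, which the brief does not describe.

```python
from itertools import combinations


def compare_columns(a_values, b_values):
    """Classify how the distinct non-null values of two columns relate."""
    a = {v for v in a_values if v is not None}
    b = {v for v in b_values if v is not None}
    shared = a & b
    if a == b:
        relation = "equal"
    elif a < b:
        relation = "subset"      # every value of A also appears in B
    elif a > b:
        relation = "superset"    # A contains every value of B
    elif shared:
        relation = "overlap"
    else:
        relation = "disjoint"
    return {
        "relation": relation,
        "shared": len(shared),
        "only_a": len(a - b),
        "only_b": len(b - a),
    }


def overlap_report(sources):
    """Compare every column pair across different sources; keep pairs that share values."""
    report = {}
    for (name_a, cols_a), (name_b, cols_b) in combinations(sources.items(), 2):
        for col_a, vals_a in cols_a.items():
            for col_b, vals_b in cols_b.items():
                result = compare_columns(vals_a, vals_b)
                if result["shared"]:
                    report[(f"{name_a}.{col_a}", f"{name_b}.{col_b}")] = result
    return report


# Hypothetical sample: two sources with differently named columns.
sources = {
    "crm": {"cust_id": [1, 2, 3, 4], "state": ["NY", "CA"]},
    "billing": {"account": [2, 3], "st": ["NY", "CA", "TX"]},
}
report = overlap_report(sources)
for pair, result in report.items():
    print(pair, result)
```

Comparing all pairs grows quadratically with the number of columns, so a practical implementation would likely prune candidates first, for example by data type or by the column statistics gathered during profiling.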

FIGURE C: MULTI-SOURCE ATTRIBUTE OVERLAP ANALYSIS
The automated analysis of attribute overlap and mapping between disparate systems significantly reduces cycles and costs associated with traditional data integration initiatives.

DATA SYNCHRONIZATION ANALYSIS

Validates the uniqueness of a potential global identifier in each data source and then confirms that data across sources can be aligned and synchronized using this identifier. This functionality enables you to prototype and test survivorship rules between sources before you move the data into a master structure, helping you ensure quality and consistency across existing sources and well-aligned target structures.

CA ERWIN DATA MODELER INTEGRATION

Enables the creation of data models that can be persisted and reused within CA ERwin Data Modeler for documentation, impact analysis and stakeholder visualization purposes. Based on metadata inferred from the instance data and profiling results in CA ERwin Data Profiler, these models help you achieve proper business alignment, improved end-user understanding and reduced analysis cycles in subsequent projects.
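The two checks named under Data Synchronization Analysis above (identifier uniqueness per source, then cross-source alignment) can be sketched as follows. This is an illustrative assumption of how such a check might look, not the product's implementation: sources are in-memory lists of dictionaries, and the names `is_unique_identifier`, `alignment`, `crm`, `erp` and `global_id` are hypothetical. Survivorship rules are not modeled.

```python
def is_unique_identifier(rows, key):
    """True when every row has a non-null value for key and no value repeats."""
    values = [row.get(key) for row in rows]
    return None not in values and len(set(values)) == len(values)


def alignment(source_rows, key):
    """Check a candidate global identifier across sources.

    source_rows maps a source name to its list of row dicts. Returns None
    when the key is not unique in every source; otherwise reports which
    identifiers appear in all sources and which are missing from each.
    """
    if not all(is_unique_identifier(rows, key) for rows in source_rows.values()):
        return None
    ids = {name: {row[key] for row in rows} for name, rows in source_rows.items()}
    all_ids = set().union(*ids.values())
    common = set(all_ids)
    for id_set in ids.values():
        common &= id_set
    return {
        "in_all_sources": common,
        "missing": {name: all_ids - id_set for name, id_set in ids.items()},
    }


# Hypothetical sample: the same customers held in two systems.
crm = [{"global_id": "A1"}, {"global_id": "A2"}, {"global_id": "A3"}]
erp = [{"global_id": "A2"}, {"global_id": "A3"}, {"global_id": "A4"}]

result = alignment({"crm": crm, "erp": erp}, "global_id")
print(result)
```

Identifiers listed under `missing` are the records that would need a decision before loading a master structure, which is where the survivorship rules mentioned in the brief would come into play.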

The CA Advantage

CA ERwin Data Profiler helps increase the quality of your critical data assets by performing cross-system analysis, generating robust data quality metrics and statistics and validating raw data with database design and architecture. A strong understanding of your existing data infrastructure helps reduce the risk of rework and the cost and time involved in complex data integration, data migration, data warehouse and master data management projects.

The Enterprise IT Management (EITM) Vision

CA ERwin Data Profiler, a key component in the CA ERwin Modeling family of solutions, is also an integral part of CA's Enterprise IT Management (EITM) vision to unify and simplify overall IT management. And with a comprehensive portfolio of modular IT management solutions, CA empowers you to better manage risk, costs and service, helping ensure that IT meets the business needs of your enterprise.

Gaining mastery over business-critical information is a crucial first step in any enterprise-wide IT management initiative. And when integrated as an element of CA's EITM vision, CA ERwin Data Profiler can help you maximize the performance, reliability and efficiency of your overall IT environment. The solution helps you discover, control and manage the data that powers your business, so you can ultimately extend the overarching CA EITM philosophy to operations, storage, security, lifecycle and services management and unify, simplify and secure IT.

Next Steps

CA ERwin Data Profiler is a proven, flexible solution that provides reusable, automated, cross-data-source discovery, analysis and profiling for improved data discovery and control. See how it can enable your organization to achieve strategic data initiatives by reducing the costs and risks associated with data integration. For more information, contact your local CA reseller or go to ca.com/contact/rmdm.

To learn more, and see how CA ERwin Data Profiler can help you unify and simplify data management for better business results, visit ca.com/modeling.

Copyright 2009 CA. All rights reserved. All trademarks, trade names, service marks and logos referenced herein belong to their respective companies. This document is for your informational purposes only. To the extent permitted by applicable law, CA provides this document "As Is" without warranty of any kind, including, without limitation, any implied warranties of merchantability or fitness for a particular purpose, or non-infringement. In no event will CA be liable for any loss or damage, direct or indirect, from the use of this document including, without limitation, lost profits, business interruption, goodwill or lost data, even if CA is expressly advised of the possibility of such damages. CA does not provide legal advice. Neither this document nor any software product referenced herein shall serve as a substitute for the reader's compliance with any laws (including but not limited to any act, statute, regulation, rule, directive, standard, policy, administrative order, executive order, etc. (collectively, "Laws")) referenced herein. The reader should consult with competent legal counsel regarding any such Laws.

334530109

Learn more about how CA can help you transform your business at ca.com