DATA QUALITY STRATEGY. Martin Rennhackkamp

Size: px
Start display at page:

Download "DATA QUALITY STRATEGY. Martin Rennhackkamp"

Transcription

1 DATA QUALITY STRATEGY Martin Rennhackkamp

2 AGENDA Data quality Data profiling Data cleansing Measuring data quality Data quality strategy Why data quality strategy? Implementing the strategy

3 DATA QUALITY OBJECTIVES How can we achieve our business objectives? With appropriate, complete and accurate data Reduce costs in the right Increase context, profits format at the correct time and place for the right purpose. Improving Data warehouse and Business Information Quality Larry English, 1999

4 DATA QUALITY APPROACH Know Your Data (Data Profiling) Fix Your Data (Data Cleansing) Business Application Projects System Replacements Data Warehouse / ETL Data Integration Enhance Your Data (CRM, Householding) Data Quality Initiatives Root Cause Analysis & System / Process Remediation Metadata Management Data Stewardship & Governance Monitor Your Data (Data Profiling)

5 Data quality AGENDA Data profiling Data cleansing Measuring data quality Data quality strategy Why data quality strategy? Implementing the strategy

6 DATA PROFILING Software and analytical techniques thorough knowledge Data: content, structure, correctness and quality Diagnostic step in data quality management

7 ANALYSIS THRASHING Multiple meanings Cleanliness Multiple sources USERS Accuracy Completeness Timeliness History complications Audit and validation

8 DATA PROFILING TECHNIQUES Domain analysis Pattern analysis Statistical analysis Range or threshold checking Cross-table identification Business rule analysis Customer profiling

9 DOMAIN ANALYSIS Data type adherence: numeric and dates Domains: dates, numbers, alphanumeric Nulls and spaces Frequency and % of distinct values Primary and alternate key uniqueness

10 PATTERN ANALYSIS Specific data type formats: ID number, Tax number Postal code Telephone and fax numbers Dates formats Specific data values: N/A unknown

11 STATISTICAL ANALYSIS Record count Sum Mode value occurring most often Minimum / Maximum Percentiles Mean Standard deviation Hash total / control total CRC

12 RANGE ANALYSIS Ranges of valid values zero or negative numbers allowed past or future dates minimum and maximum values Thresholds salary over given value? interest rate <= 0 negative amount fields Sets of valid values discrete set of permitted values e.g. flags (Y/N), types, codes

13 CROSS-TABLE ANALYSIS Referential integrity between two tables B.ForeignKey = A.PrimaryKey Look-up values Possible PK-FK identification if not defined Primary keys Possible primary and alternate key identification Referential integrity in one table Hierarchical references

14 CROSS-TABLE ANALYSIS

15 BUSINESS RULE ANALYSIS Consistency checks of relationships: Every product in the order table in the product table? No autopsy result if date of death is blank. Accident claim reason only if insured item is a vehicle. Can only have pregnancy test if gender = female. Cannot have 0 monthly interest on fixed deposit saving account. Cannot have first payment date before activation date. Richness of profiling product s business rule detection and reporting

16 CUSTOMER PROFILING TOOLS Focus on names and descriptive data Recognize duplicates and variations Standardize international names, phone numbers and addresses USA zip code vs UK postal code USA states vs UK counties vs cantons vs provinces Parse names and addresses Propagate addresses into households

17 ON-GOING PROCESS Periodical data profiling Analysis on source systems Analysis on data warehouse Re-extraction, analysis, cleansing and re-insertion into data warehouse First step in repeating ETL processes Regular data quality benchmarks (CIO dashboard)

18 BI METHODOLOGY Technical Architecture Product Selection & Installation Project Planning Business Requirements Data Profiling Data Warehouse Design Data Warehouse Implementation ETL Specification Information Delivery Specification Data Warehouse Maintenance ETL Implementation Information Delivery Implementation Testing, Deployment & Roll-out Maintenance Project management Information management Metadata management

19 WHY DATA PROFILING? Understand the data Understand requirements Accurate database design Risk mitigation: ETL is the most under-estimated, under-budgeted part of most BI projects ETL code-load-explode Initiate spin-off data quality projects

20 Data quality Data profiling AGENDA Data cleansing Measuring data quality Data quality strategy Why data quality strategy? Implementing the strategy

21 DATA CLEANSING PROJECTS Data entry At data capture time in source system One record at a time interactive mode In contact with user or client Require changes to source system Require changes to capturers KPIs Data extraction At data transfer time from the source system One system at a time Many records at a time batch mode Data integration At data transfer time into data warehouse More than one system at a time different dynamics Many records at a time batch mode Can select best records for consolidation and merging

22 DATA CLEANSING PROJECTS Reporting / down-stream extraction When extracting from data warehouse for data marts, reporting, analytics Many records at a time batch mode Possibly far removed from source system identification Formatting, extraction and transformation rules Could be start of feedback cycle to source system Maintenance On-going across spectrum of systems On-going improvement and raising data quality standard In-place on source or data warehouse system Many records at a time batch mode May introduce reporting inconsistencies over time

23 DATA CLEANSING In transit: During operational system extract* During data warehouse input* During data warehouse / data mart reporting** In place: In operational system In data warehouse / data mart* Techniques: Reject, ignore error, correct, look-up, default value

24 FEEDBACK CYCLE Operational systems Financial Marketing Quality feedback cycle Decision makers On-line Web ERP HR Extract, transform & load Query and analysis tools Operations Data warehouse

25 AGENDA Data quality Data profiling Data cleansing Measuring data quality Data quality strategy Why data quality strategy? Implementing the strategy

26 INHERENT QUALITY MEASURES Data correctness and accuracy Definition conformance Consistency of the meaning and definition of data Completeness (coverage) All required values for all required fields Validity Data values conform to domain and business rules Accuracy to source Data agrees with data values in original source Accuracy to reality Accurate reflection of real-world object or event Precision Values to the right degree of granularity Non-duplication One-to-one correspondence between records and real-world objects or events

27 RAGMATIC QUALITY MEASURES Usefulness of data Accessibility Able to access the data on demand Timeliness Data available to support process within required time table Contextual clarity Data presentation enables understanding of its meaning Avoid misinterpretation Derivation integrity Correctness with which two/more data elements are combined Usability Ease of use of the form of data presentation Rightness Right kind of data with right degree of quality to support a given process

28 MEASURING DATA QUALITY Aspect Measure Completeness Percentage of data available Accuracy Timeliness Delivery Usability Percentage of records with errors Percentage of time data available when needed Percentage of users satisfied with delivery method Subjective: degree to which presentation is purpose-efficient

29 MEASURING DATA QUALITY Completeness Cost Accuracy Delivery Timeliness

30 CIO DASHBOARD

31 APPROACH Cultural issue Business buy-in Responsibilities Methodology, processes and procedures Quality feedback cycle Governance and controls => Data Quality Strategy

32 AGENDA Data quality Data profiling Data cleansing Measuring data quality Data quality strategy Why data quality strategy? Implementing the strategy

33 DATA QUALITY STRATEGY Get stakeholder involvement, budget and go-ahead Hand over to programme management for implementation

34 DATA QUALITY STRATEGY Context Storage Data flows Stewardship Workflow Monitoring Appendices: - Profiling: technique vs storage - Cleansing: Data detail vs storage - Ownership/Stewardship: Data entities vs storage

35 DATA QUALITY STRATEGY Context: The types and usage of data => type of cleansing required specialized data requires specialized cleansing e.g. titles, dates, numbers, addresses

36 DATA QUALITY STRATEGY Context: The types and usage of data => type of cleansing required Storage: System architecture and platforms => where & how cleansing done technology: mainframe / UNIX locality: locally, regionally, internationally formats: tables: files, spreadsheets applications: open (RDBMS), proprietary connectivity: ODBC, proprietary

37 DATA QUALITY STRATEGY Context: The types and usage of data => type of cleansing required Storage: System architecture and platforms => where & how cleansing done Data flows: Capture, movement and migration of data => cleansing opportunity Transactional updates Operational data feeds Purchased data Legacy migration Regular maintenance (batch) Special extracts

38 DATA QUALITY STRATEGY Context: The types and usage of data => type of cleansing required Storage: System architecture and platforms => where & how cleansing done Data flows: Capture, movement and migration of data => cleansing opportunity Stewardship: People and organisation => who is involved Stakeholders: buy-in, mandate, budget Data stewards: roles, responsibilities Organisational structure: impact Education: all stakeholders

39 DATA QUALITY STRATEGY Context: The types and usage of data => type of cleansing required Storage: System architecture and platforms => where & how cleansing done Data flows: Capture, movement and migration of data => cleansing opportunity Stewardship: People and organisation => who is involved Workflow: Tasks and processes involved => process improvement opportunity Eliminate non-value-add tasks Add data improvement tasks

40 DATA QUALITY STRATEGY Context: The types and usage of data => type of cleansing required Storage: System architecture and platforms => where & how cleansing done Data flows: Capture, movement and migration of data => cleansing opportunity Stewardship: People and organisation => who is involved Workflow: Tasks and processes involved => process improvement opportunity Monitoring: Measuring quality over time => Measure against baseline Controls to improve

41 PROFILING STRATEGY Storage Profiling Source system Staging area Data warehouse Data mart Report / analytics Domain analysis D:1, T:1, P:m D:1, T:y, P:m D:1, T:m, P:m D:1, T:m, P:m D:1, T:m, P:m Pattern analysis D:1, T:1, P:m D:1, T:y, P:m D:1, T:m, P:w D:1, T:m, P:w D:1, T:m, P:w Statistical analysis D:1, T:1, P:m D:1, T:y, P:m D:1, T:m, P:d D:1, T:m, P:d D:1, T:m, P:w Range analysis D:1, T:1, P:m D:1, T:y, P:m D:1, T:m, P:d D:1, T:m, P:d D:1, T:m, P:w Relationship analysis D:1, T:1, P:m D:1, T:y, P:m D:1, T:m, P:w D:1, T:m, P:w D:1, T:m, P:w Business rule analysis D:1, T:1, P:m D:1, T:y, P:m D:1, T:m, P:d D:1, T:m, P:d D:1, T:m, P:w D:development T:test P:production 1:once-off d:daily w:weekly m:monthly y:yearly

42 CLEANSING STRATEGY Storage Cleansing Source system Staging area Data warehouse Data mart Report / analytics ACCOUNT Account no Account name Ignore Ignore Reject Ignore Ignore Ignore Ignore Ignore Ignore Ignore Household Ignore Household Lookup Lookup Ignore Contact Ignore Ignore Parse Ignore Ignore Language Ignore Default Lookup Lookup Ignore Address Ignore Ignore Parse Ignore Ignore Postal code Lookup & replace Ignore Lookup Lookup Ignore Balance Ignore Ignore Ignore Ignore Ignore

43 OWNERSHIP / STEWARDSHIP Storage Entity Source system Staging area Data warehouse Data mart Report / analytics ACCOUNT CUSTOMER PRODUCT ORDER DELIVERY HR FINANCE Own: CR Cust: CR Own: CR Cust: CR Own: Prod Cust: Prod Own: Prod Cust: Prod Own: Prod Cust: Prod Own: HR Cust: CTO Own: CFO Cust: FinIT Own: CR Cust: EDW Own: CR Cust: EDW Own: Prod Cust: EDW Own: Prod Cust: EDW Own: Prod Cust: EDW Own: HR Cust: EDW Own: CFO Cust: EDW Own: CR Cust: EDW Own: CR Cust: EDW Own: Prod Cust: EDW Own: Prod Cust: EDW Own: Prod Cust: EDW Own: HR Cust: EDW Own: CFO Cust: EDW Own: CR Cust: CR Own: CR Cust: CR Own: Prod Cust: EDW Own: Prod Cust: EDW Own: Prod Cust: EDW N/A Own: CFO Cust: FinIT Own: CR Cust: CR Own: CR Cust: CR Own: Prod Cust: BI Own: Prod Cust: BI Own: Prod Cust: BI Own: HR Cust: BI Own: CFO Cust: FinIT

44 AGENDA Data quality Data profiling Data cleansing Measuring data quality Data quality strategy Why data quality strategy? Implementing the strategy

45 WHY DETAIL STRATEGY? Control redundant data Get knowledge of data available Avoid inconsistent overlapping data sets Avoid inconsistent reports Avoid late data availability Formally assign data responsibility Set and adhere to standards Get management to understand importance of data quality

46 GAIN CONTROL Set consistent and unassailable data quality controls Define data ownership and stewardship Increase trust in data Improve delivery and responsiveness Consistent use of terminology Technical and business Align with corporate strategy Get buy-in and support from group management level Group-wide vendor license power

47 Data quality Data profiling Data cleansing AGENDA Measuring data quality Data quality strategy Why data quality strategy? Implementing the strategy

48 GOVERNANCES Source systems: - BI readiness / enablement - Levels of data quality as KPIs - Data capturing quality KPIs Data warehouse: - Levels of data quality as KPIs Information usage: - Levels of data quality as KPIs - User training and certification License to drive Metadata compliance: - Source systems - Data warehousing and related processes - Information exploitation Quality control: - End-to-end data quality program - Periodic data quality measurements and feedback - Data quality measures on CIO dashboard - Data quality in all personnel s KPIs

49 RESPONSIBILITIES Business ownership of data: - Data treated as an asset - Aligned with business strategy - Thorough end-to-end process Source data alignment: - Source of data creation and maintenance - Interface requirements and service levels Analysis and certification: - Data quality and acceptance metrics - Acceptance by business-representative data stewards - Agreed-on cleansing transformations Monitoring: - On-going frequent audits - CIO dashboard Change control: - All source system changes that may affect data quality - All data warehouse changes that may affect data quality - Metadata management

50 COMPLICATIONS Existing perceptions Legacy data Informal data (Excel, Access, data on users PCs, etc) Existing tools and systems Accidental data warehouse Yesterday s requirements Existing power-play, politics, agendas and domains of control Uncooperative role-players

51 BEST PRACTICE Don t get into too much detail too soon Don t lead with long-term deliverables (Get short-term on the table and accepted, get the project started) Don t be a theorist Use practical actionable steps Don t commit more than you can deliver Avoid unproved technology Align with corporate strategy (Or fight never-ending unpopular battle) Play the P s correctly Politics and Partnership Sell strategy and approach in organisation Initially hard for buy-in and go-ahead On-going subtly to justify existence and promote workings

52 COMMUNICATION Publicize objectives: What data was corrected Future planes Report progress Users know to what extent to use which data Re-affirm process to users: Commitment to the process Build up confidence in the system Changing Perceptions

53 SUMMARY Data quality management approach Data as corporate resource Effects the bottom line End-to-end data quality process Monitor throughout Fix as close to source as possible Data profiling Key step in diagnosing and monitoring data quality Feed CIO dashboard Data quality strategy Defined contents reviewed implementation Required to establish corporate data quality management program Teamwork, partnership and politics Communication

The Data Organization

The Data Organization C V I T F E P A O TM The Data Organization 1251 Yosemite Way Hayward, CA 94545 (510) 303-8868 rschoenrank@computer.org Business Intelligence Process Architecture By Rainer Schoenrank Data Warehouse Consultant

More information

DATA STEWARDSHIP BODY OF KNOWLEDGE (DSBOK)

DATA STEWARDSHIP BODY OF KNOWLEDGE (DSBOK) DATA STEWARDSHIP BODY OF KNOWLEDGE (DSBOK) Release 2.2 August 2013. This document was created in collaboration of the leading experts and educators in the field and members of the Certified Data Steward

More information

Improving Data Governance in Your Organization. Faire Co Regional Manger, Information Management Software, ASEAN

Improving Data Governance in Your Organization. Faire Co Regional Manger, Information Management Software, ASEAN Improving Data Governance in Your Organization Faire Co Regional Manger, Information Management Software, ASEAN Topics The Innovation Imperative and Innovating with Information What Is Data Governance?

More information

BPS Suite and the OCEG Capability Model. Mapping the OCEG Capability Model to the BPS Suite s product capability.

BPS Suite and the OCEG Capability Model. Mapping the OCEG Capability Model to the BPS Suite s product capability. BPS Suite and the OCEG Capability Model Mapping the OCEG Capability Model to the BPS Suite s product capability. BPS Contents Introduction... 2 GRC activities... 2 BPS and the Capability Model for GRC...

More information

Data Quality in the MDM Ecosystem

Data Quality in the MDM Ecosystem Solution Guide Data Quality in the MDM Ecosystem What is MDM? The premise of Master Data Management (MDM) is to create, maintain, and deliver the most complete and comprehensive view possible from disparate

More information

2 The IBM Data Governance Unified Process

2 The IBM Data Governance Unified Process 2 The IBM Data Governance Unified Process The benefits of a commitment to a comprehensive enterprise Data Governance initiative are many and varied, and so are the challenges to achieving strong Data Governance.

More information

Effective Risk Data Aggregation & Risk Reporting

Effective Risk Data Aggregation & Risk Reporting Effective Risk Data Aggregation & Risk Reporting Presented by: Ilia Bolotine Head, Adastra Business Consulting (Canada) 1 The Evolving Regulatory Landscape in Risk Management A significant lesson learned

More information

IBM Software IBM InfoSphere Information Server for Data Quality

IBM Software IBM InfoSphere Information Server for Data Quality IBM InfoSphere Information Server for Data Quality A component index Table of contents 3 6 9 9 InfoSphere QualityStage 10 InfoSphere Information Analyzer 12 InfoSphere Discovery 13 14 2 Do you have confidence

More information

Full file at

Full file at Chapter 2 Data Warehousing True-False Questions 1. A real-time, enterprise-level data warehouse combined with a strategy for its use in decision support can leverage data to provide massive financial benefits

More information

Enabling efficiency through Data Governance: a phased approach

Enabling efficiency through Data Governance: a phased approach Enabling efficiency through Data Governance: a phased approach Transform your process efficiency, decision-making, and customer engagement by improving data accuracy An Experian white paper Enabling efficiency

More information

CA ERwin Data Profiler

CA ERwin Data Profiler PRODUCT BRIEF: CA ERWIN DATA PROFILER CA ERwin Data Profiler CA ERWIN DATA PROFILER HELPS ORGANIZATIONS LOWER THE COSTS AND RISK ASSOCIATED WITH DATA INTEGRATION BY PROVIDING REUSABLE, AUTOMATED, CROSS-DATA-SOURCE

More information

Trillium Consulting. Data Governance. Optimizing Business Outcomes through Data and Information Assets

Trillium Consulting. Data Governance. Optimizing Business Outcomes through Data and Information Assets Trillium Consulting Data Governance Optimizing Business Outcomes through Data and Information Assets DAMA Indiana Winter Meeting Indianapolis, Indiana January 20, 2011 Jim Orr, Global Director Enterprise

More information

Implementing a Successful Data Governance Program

Implementing a Successful Data Governance Program Implementing a Successful Data Governance Program Mary Anne Hopper Data Management Consulting Manager SAS #AnalyticsX Data Stewardship #analyticsx SAS Data Management Framework BUSINESS DRIVERS DATA GOVERNANCE

More information

DataND Finance. A Journey into Enterprise Data Warehouse

DataND Finance. A Journey into Enterprise Data Warehouse DataND Finance A Journey into Enterprise Data Warehouse About the Presenters Vaibhav Agarwal Chris Frederick Manager, Finance Systems Email: vagarwal@nd.edu Business Intelligence Manager Email: cfreder2@nd.edu

More information

Copyright 2016 Datalynx Pty Ltd. All rights reserved. Datalynx Enterprise Data Management Solution Catalogue

Copyright 2016 Datalynx Pty Ltd. All rights reserved. Datalynx Enterprise Data Management Solution Catalogue Datalynx Enterprise Data Management Solution Catalogue About Datalynx Vendor of the world s most versatile Enterprise Data Management software Licence our software to clients & partners Partner-based sales

More information

Data Management Framework

Data Management Framework The Organization Management Framework Created and Presented By Copyright 2018 Management Is part of the Manage Knowledge, Improvement and Change process of the APQC Process Classification Framework (wwwapqcorg)

More information

Data Governance Central to Data Management Success

Data Governance Central to Data Management Success Data Governance Central to Data Success International Anne Marie Smith, Ph.D. DAMA International DMBOK Editorial Review Board Primary Contributor EWSolutions, Inc Principal Consultant and Director of Education

More information

Standards: Implementation, Certification and Testing Work group Friday, May 8, :00 Pm-1:30 Pm ET.

Standards: Implementation, Certification and Testing Work group Friday, May 8, :00 Pm-1:30 Pm ET. Standards: Implementation, Certification and Testing Work group Friday, May 8, 2015. 12:00 Pm-1:30 Pm ET. Agenda Complete Work group Comments- Group 1 Review Group 2 Comments. 2015 Edition Certification

More information

TECHNOLOGY BRIEF: CA ERWIN DATA PROFILER. Combining Data Profiling and Data Modeling for Better Data Quality

TECHNOLOGY BRIEF: CA ERWIN DATA PROFILER. Combining Data Profiling and Data Modeling for Better Data Quality TECHNOLOGY BRIEF: CA ERWIN DATA PROFILER Combining Data Profiling and Data Modeling for Better Data Quality Table of Contents Executive Summary SECTION 1: CHALLENGE 2 Reducing the Cost and Risk of Data

More information

IT123: SABSA Foundation Training

IT123: SABSA Foundation Training IT123: SABSA Foundation Training IT123 Rev.002 CMCT COURSE OUTLINE Page 1 of 8 Training Description: SABSA is the world s leading open security architecture framework and methodology. SABSA is a top-tobottom

More information

Data Vault Brisbane User Group

Data Vault Brisbane User Group Data Vault Brisbane User Group 26-02-2013 Agenda Introductions A brief introduction to Data Vault Creating a Data Vault based Data Warehouse Comparisons with 3NF/Kimball When is it good for you? Examples

More information

Data Stewardship Core by Maria C Villar and Dave Wells

Data Stewardship Core by Maria C Villar and Dave Wells Data Stewardship Core by Maria C Villar and Dave Wells All rights reserved. Reproduction in whole or part prohibited except by written permission. Product and company names mentioned herein may be trademarks

More information

Business Impacts of Poor Data Quality: Building the Business Case

Business Impacts of Poor Data Quality: Building the Business Case Business Impacts of Poor Data Quality: Building the Business Case David Loshin Knowledge Integrity, Inc. 1 Data Quality Challenges 2 Addressing the Problem To effectively ultimately address data quality,

More information

AVOIDING SILOED DATA AND SILOED DATA MANAGEMENT

AVOIDING SILOED DATA AND SILOED DATA MANAGEMENT AVOIDING SILOED DATA AND SILOED DATA MANAGEMENT Dalton Cervo Author, Consultant, Data Management Expert March 2016 This presentation contains extracts from books that are: Copyright 2011 John Wiley & Sons,

More information

Lecture 18. Business Intelligence and Data Warehousing. 1:M Normalization. M:M Normalization 11/1/2017. Topics Covered

Lecture 18. Business Intelligence and Data Warehousing. 1:M Normalization. M:M Normalization 11/1/2017. Topics Covered Lecture 18 Business Intelligence and Data Warehousing BDIS 6.2 BSAD 141 Dave Novak Topics Covered Test # Review What is Business Intelligence? How can an organization be data rich and information poor?

More information

Metadata Management as a Key Component to Data Governance, Data Stewardship, and Data Quality Management. Wednesday, July 20 th 2016

Metadata Management as a Key Component to Data Governance, Data Stewardship, and Data Quality Management. Wednesday, July 20 th 2016 Metadata Management as a Key Component to Data Governance, Data Stewardship, and Data Quality Management Wednesday, July 20 th 2016 Confidential, Datasource Consulting, LLC 2 Multi-Domain Master Data Management

More information

MDM Partner Summit 2015 Oracle Enterprise Data Quality Overview & Roadmap

MDM Partner Summit 2015 Oracle Enterprise Data Quality Overview & Roadmap MDM Partner Summit 2015 Oracle Enterprise Data Quality Overview & Roadmap Steve Tuck Senior Director, Product Strategy Todd Blackmon Senior Director, Sales Consulting David Gengenbach Sales Consultant

More information

The 360 Solution. July 24, 2014

The 360 Solution. July 24, 2014 The 360 Solution July 24, 2014 Most successful large organizations are organized by lines of businesses (LOBs). This has been a very successful way to organize for the accountability of profit and loss.

More information

1 DATAWAREHOUSING QUESTIONS by Mausami Sawarkar

1 DATAWAREHOUSING QUESTIONS by Mausami Sawarkar 1 DATAWAREHOUSING QUESTIONS by Mausami Sawarkar 1) What does the term 'Ad-hoc Analysis' mean? Choice 1 Business analysts use a subset of the data for analysis. Choice 2: Business analysts access the Data

More information

IBM InfoSphere Master Data Management Version 11 Release 5. Overview IBM SC

IBM InfoSphere Master Data Management Version 11 Release 5. Overview IBM SC IBM InfoSphere Master Data Management Version 11 Release 5 Overview IBM SC27-6718-01 IBM InfoSphere Master Data Management Version 11 Release 5 Overview IBM SC27-6718-01 Note Before using this information

More information

Data Management Glossary

Data Management Glossary Data Management Glossary A Access path: The route through a system by which data is found, accessed and retrieved Agile methodology: An approach to software development which takes incremental, iterative

More information

Oracle Data Integration

Oracle Data Integration Oracle Data Integration The Essential Core of Data Governance with Oracle Enterprise Data Quality CON9539 Martin Boyd Senior Director Product Strategy, Oracle Brian Kleber Director Enterprise Data Management,

More information

TDWI strives to provide course books that are contentrich and that serve as useful reference documents after a class has ended.

TDWI strives to provide course books that are contentrich and that serve as useful reference documents after a class has ended. Previews of TDWI course books offer an opportunity to see the quality of our material and help you to select the courses that best fit your needs. The previews cannot be printed. TDWI strives to provide

More information

IT Briefing. May 17, 2012 Goizueta Business School Room 231

IT Briefing. May 17, 2012 Goizueta Business School Room 231 IT Briefing May 17, 2012 Goizueta Business School Room 231 IT Briefing Agenda Unified Messaging Update ServiceNow - Request 2.0 University Service Desk Security Update Business Intelligence Jay Flanagan

More information

Data Governance. Mark Plessinger / Julie Evans December /7/2017

Data Governance. Mark Plessinger / Julie Evans December /7/2017 Data Governance Mark Plessinger / Julie Evans December 2017 12/7/2017 Agenda Introductions (15) Background (30) Definitions Fundamentals Roadmap (15) Break (15) Framework (60) Foundation Disciplines Engagements

More information

Decision Support. Go-Live Update

Decision Support. Go-Live Update Decision Support Go-Live Update UNLV Decision Support Purpose is to provide knowledgeable campus users with access to data for decision making. Principles embraced: Involvement of decision makers and

More information

Best Practices in Enterprise Data Governance

Best Practices in Enterprise Data Governance Best Practices in Enterprise Data Governance Scott Gidley and Nancy Rausch, SAS WHITE PAPER SAS White Paper Table of Contents Introduction.... 1 Data Governance Use Case and Challenges.... 1 Collaboration

More information

Experiences in Data Quality

Experiences in Data Quality Experiences in Data Quality MIT IQIS 2010 Annette Pence July 14-16, 2010 Approved for Public Release: 10-0686 Distribution Unlimited As a public interest company, MITRE works in partnership with the government

More information

IBM InfoSphere Information Analyzer

IBM InfoSphere Information Analyzer IBM InfoSphere Information Analyzer Understand, analyze and monitor your data Highlights Develop a greater understanding of data source structure, content and quality Leverage data quality rules continuously

More information

Data Quality Framework

Data Quality Framework #THETA2017 Data Quality Framework Mozhgan Memari, Bruce Cassidy The University of Auckland This work is licensed under a Creative Commons Attribution 4.0 International License Two Figures from 2016 The

More information

Data Profiling. A Quick Primer on the What and the Why of Data Integration AUTHORS

Data Profiling. A Quick Primer on the What and the Why of Data Integration AUTHORS A Quick Primer on the What and the Why of Data Integration AUTHORS Shankar Ganesh R Senior Technical Architect Architecture and Technology Services HCL Technologies, Chennai Sathish Kumar Srinivasan Enterprise

More information

IBM Industry Data Models

IBM Industry Data Models IBM Software Group IBM Industry Data Models Usage, Process & Demonstration David Cope EDW Architect Asia Pacific 2007 IBM Corporation The EDW Data Model Business Requirements Analysis Design Planning Data

More information

REPORT 2015/186 INTERNAL AUDIT DIVISION

REPORT 2015/186 INTERNAL AUDIT DIVISION INTERNAL AUDIT DIVISION REPORT 2015/186 Audit of information and communications technology operations in the Secretariat of the United Nations Joint Staff Pension Fund Overall results relating to the effective

More information

Metadata Framework for Resource Discovery

Metadata Framework for Resource Discovery Submitted by: Metadata Strategy Catalytic Initiative 2006-05-01 Page 1 Section 1 Metadata Framework for Resource Discovery Overview We must find new ways to organize and describe our extraordinary information

More information

qwertyuiopasdfghjklzxcvbnmqw ertyuiopasdfghjklzxcvbnmqwert uiopasdfghjklzxcvbnmqwertyuio

qwertyuiopasdfghjklzxcvbnmqw ertyuiopasdfghjklzxcvbnmqwert uiopasdfghjklzxcvbnmqwertyuio qwertyuiopasdfghjklzxcvbnmqw ertyuiopasdfghjklzxcvbnmqwert yuiopasdfghjklzxcvbnmqwertyui opasdfghjklzxcvbnmqwertyuiopa A Tutorial on Checking Data in a Database sdfghjklzxcvbnmqwertyuiopasdf DatabaseAnswers.org

More information

Enabling Data Governance Leveraging Critical Data Elements

Enabling Data Governance Leveraging Critical Data Elements Adaptive Presentation at DAMA-NYC October 19 th, 2017 Enabling Data Governance Leveraging Critical Data Elements Jeff Goins, President, Jeff.goins@adaptive.com James Cerrato, Chief, Product Evangelist,

More information

Efficiency Gains in Inbound Data Warehouse Feed Implementation

Efficiency Gains in Inbound Data Warehouse Feed Implementation Efficiency Gains in Inbound Data Warehouse Feed Implementation Simon Eligulashvili simon.e@gamma-sys.com Introduction The task of building a data warehouse with the objective of making it a long-term strategic

More information

CHAPTER 3 Implementation of Data warehouse in Data Mining

CHAPTER 3 Implementation of Data warehouse in Data Mining CHAPTER 3 Implementation of Data warehouse in Data Mining 3.1 Introduction to Data Warehousing A data warehouse is storage of convenient, consistent, complete and consolidated data, which is collected

More information

<Insert Picture Here> Enterprise Data Management using Grid Technology

<Insert Picture Here> Enterprise Data Management using Grid Technology Enterprise Data using Grid Technology Kriangsak Tiawsirisup Sales Consulting Manager Oracle Corporation (Thailand) 3 Related Data Centre Trends. Service Oriented Architecture Flexibility

More information

Experiences in Data Quality

Experiences in Data Quality Experiences in Data Quality ABSTRACT The MITRE Corporation is committed to helping the Federal government manage its data as an enterprise asset and make the best use of appropriate technologies and services

More information

University of Texas Arlington Data Governance Program Charter

University of Texas Arlington Data Governance Program Charter University of Texas Arlington Data Governance Program Charter Document Version: 1.0 Version/Published Date: 11/2016 Table of Contents 1 INTRODUCTION... 3 1.1 PURPOSE OF THIS DOCUMENT... 3 1.2 SCOPE...

More information

Match data set availability to data resource requirements, including gap analysis and remediation assistance.

Match data set availability to data resource requirements, including gap analysis and remediation assistance. Discovering data/datasets Specify Data Requirements Identify Data Assets Assist customers with clarifying problem statements, use cases, high-level requirements (e.g. goals, objectives) and detailed requirements

More information

Data Clairvoyance. A business approach to data. Real data practitioners, delivering real improvements to your enterprise data assets.

Data Clairvoyance. A business approach to data. Real data practitioners, delivering real improvements to your enterprise data assets. Data Clairvoyance A business approach to data. A professional services firm that provides a very unique and holistic approach that enables your organization to be successful in traversing the data challenges

More information

Business Intelligence and Decision Support Systems

Business Intelligence and Decision Support Systems Business Intelligence and Decision Support Systems (9 th Ed., Prentice Hall) Chapter 8: Data Warehousing Learning Objectives Understand the basic definitions and concepts of data warehouses Learn different

More information

STRATEGIC DATA ORGANISATION SOLUTION

STRATEGIC DATA ORGANISATION SOLUTION STRATEGIC DATA ORGANISATION SOLUTION STRATEGIC DATA ORGANISATION The aim is to be the internal data provider of choice within the firm by: employing governance best practices providing high-quality products

More information

Managing Data Resources

Managing Data Resources Chapter 7 Managing Data Resources 7.1 2006 by Prentice Hall OBJECTIVES Describe basic file organization concepts and the problems of managing data resources in a traditional file environment Describe how

More information

Data Strategies for Efficiency and Growth

Data Strategies for Efficiency and Growth Data Strategies for Efficiency and Growth Date Dimension Date key (PK) Date Day of week Calendar month Calendar year Holiday Channel Dimension Channel ID (PK) Channel name Channel description Channel type

More information

CISM QAE ITEM DEVELOPMENT GUIDE

CISM QAE ITEM DEVELOPMENT GUIDE CISM QAE ITEM DEVELOPMENT GUIDE ISACA 2015. All Rights Reserved. 2 TABLE OF CONTENTS PURPOSE OF THE CISM QAE ITEM DEVELOPMENT GUIDE... 3 PURPOSE OF THE CISM QAE... 3 CISM EXAM STRUCTURE... 3 WRITING QUALITY

More information

Data Quality for PowerCenter Users: Expanding Beyond ETL. Marina Grebenkova Principal Product Manager Informatica

Data Quality for PowerCenter Users: Expanding Beyond ETL. Marina Grebenkova Principal Product Manager Informatica Data Quality for PowerCenter Users: Expanding Beyond ETL Marina Grebenkova Principal Product Manager Informatica 2 Agenda Do you trust your data? What is Data Quality? Data Quality process How it complements

More information

Cisco Director Class SAN Planning and Design Service

Cisco Director Class SAN Planning and Design Service Cisco Director Class SAN Planning and Design Service Rapidly Plan and Deploy a Cisco Director Class MDS Solution for More Efficient Storage Networking Service Overview Cisco s intelligent storage networking

More information

Data Quality Assessment Framework

Data Quality Assessment Framework Data Quality Assessment Framework ABSTRACT Many efforts to measure data quality focus on abstract concepts and cannot find a practical way to apply them. Or they attach to specific issues and cannot imagine

More information

ITIL and IT Service Management

ITIL and IT Service Management Background and Introduction to ITIL and IT Service Management Agenda/Learning Objectives What is ITIL The history of ITIL The key components of version 3 (the Lifecycle) The key advantages and Objectives

More information

Data Protection. Plugging the gap. Gary Comiskey 26 February 2010

Data Protection. Plugging the gap. Gary Comiskey 26 February 2010 Data Protection. Plugging the gap Gary Comiskey 26 February 2010 Data Protection Trends in Financial Services Financial services firms are deploying data protection solutions across their enterprise at

More information

Federal Government. Each fiscal year the Federal Government is challenged CATEGORY MANAGEMENT IN THE WHAT IS CATEGORY MANAGEMENT?

Federal Government. Each fiscal year the Federal Government is challenged CATEGORY MANAGEMENT IN THE WHAT IS CATEGORY MANAGEMENT? CATEGORY MANAGEMENT IN THE Federal Government Each fiscal year the Federal Government is challenged to accomplish strategic goals while reducing spend and operating more efficiently. In 2014, the Federal

More information

ABU DHABI GOVERNMENT DATA MANAGEMENT POLICY VERSION 1.0. ADSIC-DM-PD03 Data Management Policy

ABU DHABI GOVERNMENT DATA MANAGEMENT POLICY VERSION 1.0. ADSIC-DM-PD03 Data Management Policy ABU DHABI GOVERNMENT DATA MANAGEMENT POLICY VERSION 1.0 ADSIC-DM-PD03 Data Management Policy Contents DEFINITIONS 01 1 EXECUTIVE SUMMARY 04 2 INTRODUCTION 05 2.1 Purpose 05 2.2 Scope 05 3 COMPLIANCE AND

More information

to the Enterprise Brussels - Tuesday 20th April 2004 Chris Greenslade Introducing Enterprise Architecture Introducing Enterprise Architecture

to the Enterprise Brussels - Tuesday 20th April 2004 Chris Greenslade Introducing Enterprise Architecture Introducing Enterprise Architecture Introducing Enterprise Architecture to the Enterprise Brussels - Tuesday 20th April 2004 Chris Greenslade Chris@.com 1 of 28 Approach Every situation is different The organization Its history and its current

More information

DATA GOVERNANCE LEADS TO DATA QUALITY

DATA GOVERNANCE LEADS TO DATA QUALITY DATA GOVERNANCE LEADS TO DATA QUALITY Trending. Kash Mehdi Senior Product Specialist and Instructor May 3, 2017 1 Collibra 2017 2017 Collibra Inc How Many of Your Reports Have Good Data Quality? What would

More information

EUROPEAN ICT PROFESSIONAL ROLE PROFILES VERSION 2 CWA 16458:2018 LOGFILE

EUROPEAN ICT PROFESSIONAL ROLE PROFILES VERSION 2 CWA 16458:2018 LOGFILE EUROPEAN ICT PROFESSIONAL ROLE PROFILES VERSION 2 CWA 16458:2018 LOGFILE Overview all ICT Profile changes in title, summary, mission and from version 1 to version 2 Versions Version 1 Version 2 Role Profile

More information

The Data Organization

The Data Organization C V I T F E P A O TM The Data Organization Best Practices Metadata Dictionary Application Architecture Prepared by Rainer Schoenrank January 2017 Table of Contents 1. INTRODUCTION... 3 1.1 PURPOSE OF THE

More information

The Data Organization Yosemite Way Hayward, CA (510) The Data Warehouse Conceptual Data Model

The Data Organization Yosemite Way Hayward, CA (510) The Data Warehouse Conceptual Data Model C V I T F E P A O TM The Data Organization 1251 Yosemite Way Hayward, CA 94545 (510) 303-8868 rschoenrank@computer.org The Data Warehouse Conceptual Data Model By Rainer Schoenrank Data Warehouse Consultant

More information

Guidance Solvency II data quality management by insurers

Guidance Solvency II data quality management by insurers Guidance Solvency II data quality management by insurers De Nederlandsche Bank N.V. Guidance Solvency II data quality management by insurers Guidance document of De Nederlandsche Bank N.V., dated 1 September

More information

Losing Control: Controls, Risks, Governance, and Stewardship of Enterprise Data

Losing Control: Controls, Risks, Governance, and Stewardship of Enterprise Data Losing Control: Controls, Risks, Governance, and Stewardship of Enterprise Data an eprentise white paper tel: 407.591.4950 toll-free: 1.888.943.5363 web: www.eprentise.com Author: Helene Abrams www.eprentise.com

More information

Automating for Agility in the Data Center. Purnima Padmanabhan Jeff Evans BMC Software

Automating for Agility in the Data Center. Purnima Padmanabhan Jeff Evans BMC Software Automating for Agility in the Data Center Purnima Padmanabhan Jeff Evans BMC Software 9/5/2006 Agenda The Situation Challenges Objectives BMC Solution for Data Center Closed-Loop Change Data Center Optimization

More information

Certificate Software Asset Management Essentials Syllabus. Version 2.0

Certificate Software Asset Management Essentials Syllabus. Version 2.0 Certificate Software Asset Management Essentials Syllabus Version 2.0 June 2010 Certificate in Software Asset Management Essentials Leaning Objectives Holders of the ISEB Certificate in SAM Essentials

More information

Using SAS-DataFlux technologies to implement a dedupe key on a retail customer warehouse IN A BANK

Using SAS-DataFlux technologies to implement a dedupe key on a retail customer warehouse IN A BANK Using SAS-DataFlux technologies to implement a dedupe key on a retail customer warehouse IN A BANK Background One of the four largest banks in South Africa Branded Businesses: Nedbank, Old Mutual Bank,

More information

WKU-MIS-B10 Data Management: Warehousing, Analyzing, Mining, and Visualization. Management Information Systems

WKU-MIS-B10 Data Management: Warehousing, Analyzing, Mining, and Visualization. Management Information Systems Management Information Systems Management Information Systems B10. Data Management: Warehousing, Analyzing, Mining, and Visualization Code: 166137-01+02 Course: Management Information Systems Period: Spring

More information

TDWI strives to provide course books that are content-rich and that serve as useful reference documents after a class has ended.

TDWI strives to provide course books that are content-rich and that serve as useful reference documents after a class has ended. Previews of TDWI course books are provided as an opportunity to see the quality of our material and help you to select the courses that best fit your needs. The previews can not be printed. TDWI strives

More information

TX CIO Leadership Journey Texas CIOs Bowden Hight Texas Health and Human Services Commission Tim Jennings Texas Department of Transportation Mark

TX CIO Leadership Journey Texas CIOs Bowden Hight Texas Health and Human Services Commission Tim Jennings Texas Department of Transportation Mark TX CIO Leadership Journey Texas CIOs Bowden Hight Texas Health and Human Services Commission Tim Jennings Texas Department of Transportation Mark Stone Texas A&M University System Moderator Anh Selissen

More information

Work Breakdown Structure

Work Breakdown Structure MossAtre.book Page 491 Sunday, February 9, 2003 7:01 PM APPENDIX Work Breakdown Structure The work breakdown structure in this appendix reflects the contents of the enclosed CD-ROM. TASK_DATA 491 1 Your

More information

THE STATE OF DATA QUALITY

THE STATE OF DATA QUALITY THE STATE OF DATA QUALITY HOW TO START A DATA QUALITY PROGRAM Organizations claim to make decisions based on data, but are they really? Ask a few simple questions of the data they are using for those presumed

More information

SOLUTION BRIEF RSA ARCHER IT & SECURITY RISK MANAGEMENT

SOLUTION BRIEF RSA ARCHER IT & SECURITY RISK MANAGEMENT RSA ARCHER IT & SECURITY RISK MANAGEMENT INTRODUCTION Organizations battle growing security challenges by building layer upon layer of defenses: firewalls, antivirus, intrusion prevention systems, intrusion

More information

DATA MINING AND WAREHOUSING

DATA MINING AND WAREHOUSING DATA MINING AND WAREHOUSING Qno Question Answer 1 Define data warehouse? Data warehouse is a subject oriented, integrated, time-variant, and nonvolatile collection of data that supports management's decision-making

More information

Data Migration Platform

Data Migration Platform appmigrate TM Data Migration Platform QUALITY MANAGEMENT RECONCILIATION MAINTENANCE MIGRATION PROFILING COEXISTENCE CUTOVER PLANNING AND EXECUTION ? Problem Data migration done in the traditional way,

More information

STRATEGIC PLAN

STRATEGIC PLAN STRATEGIC PLAN 2013-2018 In an era of growing demand for IT services, it is imperative that strong guiding principles are followed that will allow for the fulfillment of the Division of Information Technology

More information

On Premise. Service Pack

On Premise. Service Pack On Premise Service Pack 02.0.01 - This Documentation, which includes embedded help systems and electronically distributed materials, (hereinafter referred to as the Documentation ) is for your informational

More information

Managing IT Risk: The ISACA Risk IT Framework. 1 st ISACA Day, Sofia 15 October Charalampos (Haris)Brilakis, CISA

Managing IT Risk: The ISACA Risk IT Framework. 1 st ISACA Day, Sofia 15 October Charalampos (Haris)Brilakis, CISA Managing IT Risk: The ISACA Risk IT Framework Charalampos (Haris)Brilakis, CISA ISACA Athens Chapter BoD / Education Committee Chair Sr. Manager, Internal Audit, Eurobank (Greece) 1 st ISACA Day, Sofia

More information

TDWI Data Modeling. Data Analysis and Design for BI and Data Warehousing Systems

TDWI Data Modeling. Data Analysis and Design for BI and Data Warehousing Systems Data Analysis and Design for BI and Data Warehousing Systems Previews of TDWI course books offer an opportunity to see the quality of our material and help you to select the courses that best fit your

More information

PERSPECTIVE. Effective Data Governance. Abstract

PERSPECTIVE. Effective Data Governance. Abstract PERSPECTIVE Effective Governance Abstract governance is no more just another item that is good to talk about and nice to have, for global data management organizations. This PoV looks into why data governance

More information

Introduction to Data Science

Introduction to Data Science UNIT I INTRODUCTION TO DATA SCIENCE Syllabus Introduction of Data Science Basic Data Analytics using R R Graphical User Interfaces Data Import and Export Attribute and Data Types Descriptive Statistics

More information

On Premise. Service Pack

On Premise. Service Pack On Premise Service Pack 02.0.01 - This Documentation, which includes embedded help systems and electronically distributed materials, (hereinafter referred to as the Documentation ) is for your informational

More information

August Oracle - GoldenGate Statement of Direction

August Oracle - GoldenGate Statement of Direction August 2015 Oracle - GoldenGate Statement of Direction Disclaimer This document in any form, software or printed matter, contains proprietary information that is the exclusive property of Oracle. Your

More information

Data Governance Quick Start

Data Governance Quick Start Service Offering Data Governance Quick Start Congratulations! You ve been named the Data Governance Leader Now What? Benefits Accelerate the initiation of your Data Governance program with an industry

More information

Luncheon Webinar Series June 3rd, Deep Dive MetaData Workbench Sponsored By:

Luncheon Webinar Series June 3rd, Deep Dive MetaData Workbench Sponsored By: Luncheon Webinar Series June 3rd, 2010 Deep Dive MetaData Workbench Sponsored By: 1 Deep Dive MetaData Workbench Questions and suggestions regarding presentation topics? - send to editor@dsxchange.com

More information

Techno Expert Solutions An institute for specialized studies!

Techno Expert Solutions An institute for specialized studies! Getting Started Course Content of IBM Cognos Data Manger Identify the purpose of IBM Cognos Data Manager Define data warehousing and its key underlying concepts Identify how Data Manager creates data warehouses

More information

Web CRM Project. Logical Data Model

Web CRM Project. Logical Data Model Web CRM Project Logical Data Model Prepared by Rainer Schoenrank Data Warehouse Architect The Data Organization 11 December 2007 DRAFT 4/26/2018 Page 1 TABLE OF CONTENTS 1. CHANGE LOG... 5 2. DOCUMENT

More information

Global Address Book. Microsoft Dynamics AX White Paper

Global Address Book. Microsoft Dynamics AX White Paper Microsoft Dynamics AX 2009 Global Address Book White Paper This document provides information about sharing party records in global address book across companies and within companies in Microsoft Dynamics

More information

Overview. Introduction to Data Warehousing and Business Intelligence. BI Is Important. What is Business Intelligence (BI)?

Overview. Introduction to Data Warehousing and Business Intelligence. BI Is Important. What is Business Intelligence (BI)? Introduction to Data Warehousing and Business Intelligence Overview Why Business Intelligence? Data analysis problems Data Warehouse (DW) introduction A tour of the coming DW lectures DW Applications Loosely

More information

Introducing Enterprise Architecture. into the Enterprise

Introducing Enterprise Architecture. into the Enterprise Introducing Enterprise Architecture into the Enterprise Washington - 21st October 2003 Chris Greenslade Chris@Architecting-the-Enterprise.com Introducing Enterprise Architecture 1 of 28 TA P16 1 Approach

More information

Solving the Enterprise Data Dilemma

Solving the Enterprise Data Dilemma Solving the Enterprise Data Dilemma Harmonizing Data Management and Data Governance to Accelerate Actionable Insights Learn More at erwin.com Is Our Company Realizing Value from Our Data? If your business

More information

ITIL Managing Across the Lifecycle Course

ITIL Managing Across the Lifecycle Course ITIL Managing Across the Lifecycle Course Duration: 5 Days Course Delivery: Classroom Language: English Course Overview ITIL 2011 edition is comprised of five core publications: Service Strategy, Service

More information