DATA QUALITY STRATEGY. Martin Rennhackkamp
|
|
- Lorraine Butler
- 6 years ago
- Views:
Transcription
1 DATA QUALITY STRATEGY Martin Rennhackkamp
2 AGENDA Data quality Data profiling Data cleansing Measuring data quality Data quality strategy Why data quality strategy? Implementing the strategy
3 DATA QUALITY OBJECTIVES How can we achieve our business objectives? With appropriate, complete and accurate data Reduce costs in the right Increase context, profits format at the correct time and place for the right purpose. Improving Data warehouse and Business Information Quality Larry English, 1999
4 DATA QUALITY APPROACH Know Your Data (Data Profiling) Fix Your Data (Data Cleansing) Business Application Projects System Replacements Data Warehouse / ETL Data Integration Enhance Your Data (CRM, Householding) Data Quality Initiatives Root Cause Analysis & System / Process Remediation Metadata Management Data Stewardship & Governance Monitor Your Data (Data Profiling)
5 Data quality AGENDA Data profiling Data cleansing Measuring data quality Data quality strategy Why data quality strategy? Implementing the strategy
6 DATA PROFILING Software and analytical techniques thorough knowledge Data: content, structure, correctness and quality Diagnostic step in data quality management
7 ANALYSIS THRASHING Multiple meanings Cleanliness Multiple sources USERS Accuracy Completeness Timeliness History complications Audit and validation
8 DATA PROFILING TECHNIQUES Domain analysis Pattern analysis Statistical analysis Range or threshold checking Cross-table identification Business rule analysis Customer profiling
9 DOMAIN ANALYSIS Data type adherence: numeric and dates Domains: dates, numbers, alphanumeric Nulls and spaces Frequency and % of distinct values Primary and alternate key uniqueness
10 PATTERN ANALYSIS Specific data type formats: ID number, Tax number Postal code Telephone and fax numbers Dates formats Specific data values: N/A unknown
11 STATISTICAL ANALYSIS Record count Sum Mode value occurring most often Minimum / Maximum Percentiles Mean Standard deviation Hash total / control total CRC
12 RANGE ANALYSIS Ranges of valid values zero or negative numbers allowed past or future dates minimum and maximum values Thresholds salary over given value? interest rate <= 0 negative amount fields Sets of valid values discrete set of permitted values e.g. flags (Y/N), types, codes
13 CROSS-TABLE ANALYSIS Referential integrity between two tables B.ForeignKey = A.PrimaryKey Look-up values Possible PK-FK identification if not defined Primary keys Possible primary and alternate key identification Referential integrity in one table Hierarchical references
14 CROSS-TABLE ANALYSIS
15 BUSINESS RULE ANALYSIS Consistency checks of relationships: Every product in the order table in the product table? No autopsy result if date of death is blank. Accident claim reason only if insured item is a vehicle. Can only have pregnancy test if gender = female. Cannot have 0 monthly interest on fixed deposit saving account. Cannot have first payment date before activation date. Richness of profiling product s business rule detection and reporting
16 CUSTOMER PROFILING TOOLS Focus on names and descriptive data Recognize duplicates and variations Standardize international names, phone numbers and addresses USA zip code vs UK postal code USA states vs UK counties vs cantons vs provinces Parse names and addresses Propagate addresses into households
17 ON-GOING PROCESS Periodical data profiling Analysis on source systems Analysis on data warehouse Re-extraction, analysis, cleansing and re-insertion into data warehouse First step in repeating ETL processes Regular data quality benchmarks (CIO dashboard)
18 BI METHODOLOGY Technical Architecture Product Selection & Installation Project Planning Business Requirements Data Profiling Data Warehouse Design Data Warehouse Implementation ETL Specification Information Delivery Specification Data Warehouse Maintenance ETL Implementation Information Delivery Implementation Testing, Deployment & Roll-out Maintenance Project management Information management Metadata management
19 WHY DATA PROFILING? Understand the data Understand requirements Accurate database design Risk mitigation: ETL is the most under-estimated, under-budgeted part of most BI projects ETL code-load-explode Initiate spin-off data quality projects
20 Data quality Data profiling AGENDA Data cleansing Measuring data quality Data quality strategy Why data quality strategy? Implementing the strategy
21 DATA CLEANSING PROJECTS Data entry At data capture time in source system One record at a time interactive mode In contact with user or client Require changes to source system Require changes to capturers KPIs Data extraction At data transfer time from the source system One system at a time Many records at a time batch mode Data integration At data transfer time into data warehouse More than one system at a time different dynamics Many records at a time batch mode Can select best records for consolidation and merging
22 DATA CLEANSING PROJECTS Reporting / down-stream extraction When extracting from data warehouse for data marts, reporting, analytics Many records at a time batch mode Possibly far removed from source system identification Formatting, extraction and transformation rules Could be start of feedback cycle to source system Maintenance On-going across spectrum of systems On-going improvement and raising data quality standard In-place on source or data warehouse system Many records at a time batch mode May introduce reporting inconsistencies over time
23 DATA CLEANSING In transit: During operational system extract* During data warehouse input* During data warehouse / data mart reporting** In place: In operational system In data warehouse / data mart* Techniques: Reject, ignore error, correct, look-up, default value
24 FEEDBACK CYCLE Operational systems Financial Marketing Quality feedback cycle Decision makers On-line Web ERP HR Extract, transform & load Query and analysis tools Operations Data warehouse
25 AGENDA Data quality Data profiling Data cleansing Measuring data quality Data quality strategy Why data quality strategy? Implementing the strategy
26 INHERENT QUALITY MEASURES Data correctness and accuracy Definition conformance Consistency of the meaning and definition of data Completeness (coverage) All required values for all required fields Validity Data values conform to domain and business rules Accuracy to source Data agrees with data values in original source Accuracy to reality Accurate reflection of real-world object or event Precision Values to the right degree of granularity Non-duplication One-to-one correspondence between records and real-world objects or events
27 RAGMATIC QUALITY MEASURES Usefulness of data Accessibility Able to access the data on demand Timeliness Data available to support process within required time table Contextual clarity Data presentation enables understanding of its meaning Avoid misinterpretation Derivation integrity Correctness with which two/more data elements are combined Usability Ease of use of the form of data presentation Rightness Right kind of data with right degree of quality to support a given process
28 MEASURING DATA QUALITY Aspect Measure Completeness Percentage of data available Accuracy Timeliness Delivery Usability Percentage of records with errors Percentage of time data available when needed Percentage of users satisfied with delivery method Subjective: degree to which presentation is purpose-efficient
29 MEASURING DATA QUALITY Completeness Cost Accuracy Delivery Timeliness
30 CIO DASHBOARD
31 APPROACH Cultural issue Business buy-in Responsibilities Methodology, processes and procedures Quality feedback cycle Governance and controls => Data Quality Strategy
32 AGENDA Data quality Data profiling Data cleansing Measuring data quality Data quality strategy Why data quality strategy? Implementing the strategy
33 DATA QUALITY STRATEGY Get stakeholder involvement, budget and go-ahead Hand over to programme management for implementation
34 DATA QUALITY STRATEGY Context Storage Data flows Stewardship Workflow Monitoring Appendices: - Profiling: technique vs storage - Cleansing: Data detail vs storage - Ownership/Stewardship: Data entities vs storage
35 DATA QUALITY STRATEGY Context: The types and usage of data => type of cleansing required specialized data requires specialized cleansing e.g. titles, dates, numbers, addresses
36 DATA QUALITY STRATEGY Context: The types and usage of data => type of cleansing required Storage: System architecture and platforms => where & how cleansing done technology: mainframe / UNIX locality: locally, regionally, internationally formats: tables: files, spreadsheets applications: open (RDBMS), proprietary connectivity: ODBC, proprietary
37 DATA QUALITY STRATEGY Context: The types and usage of data => type of cleansing required Storage: System architecture and platforms => where & how cleansing done Data flows: Capture, movement and migration of data => cleansing opportunity Transactional updates Operational data feeds Purchased data Legacy migration Regular maintenance (batch) Special extracts
38 DATA QUALITY STRATEGY Context: The types and usage of data => type of cleansing required Storage: System architecture and platforms => where & how cleansing done Data flows: Capture, movement and migration of data => cleansing opportunity Stewardship: People and organisation => who is involved Stakeholders: buy-in, mandate, budget Data stewards: roles, responsibilities Organisational structure: impact Education: all stakeholders
39 DATA QUALITY STRATEGY Context: The types and usage of data => type of cleansing required Storage: System architecture and platforms => where & how cleansing done Data flows: Capture, movement and migration of data => cleansing opportunity Stewardship: People and organisation => who is involved Workflow: Tasks and processes involved => process improvement opportunity Eliminate non-value-add tasks Add data improvement tasks
40 DATA QUALITY STRATEGY Context: The types and usage of data => type of cleansing required Storage: System architecture and platforms => where & how cleansing done Data flows: Capture, movement and migration of data => cleansing opportunity Stewardship: People and organisation => who is involved Workflow: Tasks and processes involved => process improvement opportunity Monitoring: Measuring quality over time => Measure against baseline Controls to improve
41 PROFILING STRATEGY Storage Profiling Source system Staging area Data warehouse Data mart Report / analytics Domain analysis D:1, T:1, P:m D:1, T:y, P:m D:1, T:m, P:m D:1, T:m, P:m D:1, T:m, P:m Pattern analysis D:1, T:1, P:m D:1, T:y, P:m D:1, T:m, P:w D:1, T:m, P:w D:1, T:m, P:w Statistical analysis D:1, T:1, P:m D:1, T:y, P:m D:1, T:m, P:d D:1, T:m, P:d D:1, T:m, P:w Range analysis D:1, T:1, P:m D:1, T:y, P:m D:1, T:m, P:d D:1, T:m, P:d D:1, T:m, P:w Relationship analysis D:1, T:1, P:m D:1, T:y, P:m D:1, T:m, P:w D:1, T:m, P:w D:1, T:m, P:w Business rule analysis D:1, T:1, P:m D:1, T:y, P:m D:1, T:m, P:d D:1, T:m, P:d D:1, T:m, P:w D:development T:test P:production 1:once-off d:daily w:weekly m:monthly y:yearly
42 CLEANSING STRATEGY Storage Cleansing Source system Staging area Data warehouse Data mart Report / analytics ACCOUNT Account no Account name Ignore Ignore Reject Ignore Ignore Ignore Ignore Ignore Ignore Ignore Household Ignore Household Lookup Lookup Ignore Contact Ignore Ignore Parse Ignore Ignore Language Ignore Default Lookup Lookup Ignore Address Ignore Ignore Parse Ignore Ignore Postal code Lookup & replace Ignore Lookup Lookup Ignore Balance Ignore Ignore Ignore Ignore Ignore
43 OWNERSHIP / STEWARDSHIP Storage Entity Source system Staging area Data warehouse Data mart Report / analytics ACCOUNT CUSTOMER PRODUCT ORDER DELIVERY HR FINANCE Own: CR Cust: CR Own: CR Cust: CR Own: Prod Cust: Prod Own: Prod Cust: Prod Own: Prod Cust: Prod Own: HR Cust: CTO Own: CFO Cust: FinIT Own: CR Cust: EDW Own: CR Cust: EDW Own: Prod Cust: EDW Own: Prod Cust: EDW Own: Prod Cust: EDW Own: HR Cust: EDW Own: CFO Cust: EDW Own: CR Cust: EDW Own: CR Cust: EDW Own: Prod Cust: EDW Own: Prod Cust: EDW Own: Prod Cust: EDW Own: HR Cust: EDW Own: CFO Cust: EDW Own: CR Cust: CR Own: CR Cust: CR Own: Prod Cust: EDW Own: Prod Cust: EDW Own: Prod Cust: EDW N/A Own: CFO Cust: FinIT Own: CR Cust: CR Own: CR Cust: CR Own: Prod Cust: BI Own: Prod Cust: BI Own: Prod Cust: BI Own: HR Cust: BI Own: CFO Cust: FinIT
44 AGENDA Data quality Data profiling Data cleansing Measuring data quality Data quality strategy Why data quality strategy? Implementing the strategy
45 WHY DETAIL STRATEGY? Control redundant data Get knowledge of data available Avoid inconsistent overlapping data sets Avoid inconsistent reports Avoid late data availability Formally assign data responsibility Set and adhere to standards Get management to understand importance of data quality
46 GAIN CONTROL Set consistent and unassailable data quality controls Define data ownership and stewardship Increase trust in data Improve delivery and responsiveness Consistent use of terminology Technical and business Align with corporate strategy Get buy-in and support from group management level Group-wide vendor license power
47 Data quality Data profiling Data cleansing AGENDA Measuring data quality Data quality strategy Why data quality strategy? Implementing the strategy
48 GOVERNANCES Source systems: - BI readiness / enablement - Levels of data quality as KPIs - Data capturing quality KPIs Data warehouse: - Levels of data quality as KPIs Information usage: - Levels of data quality as KPIs - User training and certification License to drive Metadata compliance: - Source systems - Data warehousing and related processes - Information exploitation Quality control: - End-to-end data quality program - Periodic data quality measurements and feedback - Data quality measures on CIO dashboard - Data quality in all personnel s KPIs
49 RESPONSIBILITIES Business ownership of data: - Data treated as an asset - Aligned with business strategy - Thorough end-to-end process Source data alignment: - Source of data creation and maintenance - Interface requirements and service levels Analysis and certification: - Data quality and acceptance metrics - Acceptance by business-representative data stewards - Agreed-on cleansing transformations Monitoring: - On-going frequent audits - CIO dashboard Change control: - All source system changes that may affect data quality - All data warehouse changes that may affect data quality - Metadata management
50 COMPLICATIONS Existing perceptions Legacy data Informal data (Excel, Access, data on users PCs, etc) Existing tools and systems Accidental data warehouse Yesterday s requirements Existing power-play, politics, agendas and domains of control Uncooperative role-players
51 BEST PRACTICE Don t get into too much detail too soon Don t lead with long-term deliverables (Get short-term on the table and accepted, get the project started) Don t be a theorist Use practical actionable steps Don t commit more than you can deliver Avoid unproved technology Align with corporate strategy (Or fight never-ending unpopular battle) Play the P s correctly Politics and Partnership Sell strategy and approach in organisation Initially hard for buy-in and go-ahead On-going subtly to justify existence and promote workings
52 COMMUNICATION Publicize objectives: What data was corrected Future planes Report progress Users know to what extent to use which data Re-affirm process to users: Commitment to the process Build up confidence in the system Changing Perceptions
53 SUMMARY Data quality management approach Data as corporate resource Effects the bottom line End-to-end data quality process Monitor throughout Fix as close to source as possible Data profiling Key step in diagnosing and monitoring data quality Feed CIO dashboard Data quality strategy Defined contents reviewed implementation Required to establish corporate data quality management program Teamwork, partnership and politics Communication
The Data Organization
C V I T F E P A O TM The Data Organization 1251 Yosemite Way Hayward, CA 94545 (510) 303-8868 rschoenrank@computer.org Business Intelligence Process Architecture By Rainer Schoenrank Data Warehouse Consultant
More informationDATA STEWARDSHIP BODY OF KNOWLEDGE (DSBOK)
DATA STEWARDSHIP BODY OF KNOWLEDGE (DSBOK) Release 2.2 August 2013. This document was created in collaboration of the leading experts and educators in the field and members of the Certified Data Steward
More informationImproving Data Governance in Your Organization. Faire Co Regional Manger, Information Management Software, ASEAN
Improving Data Governance in Your Organization Faire Co Regional Manger, Information Management Software, ASEAN Topics The Innovation Imperative and Innovating with Information What Is Data Governance?
More informationBPS Suite and the OCEG Capability Model. Mapping the OCEG Capability Model to the BPS Suite s product capability.
BPS Suite and the OCEG Capability Model Mapping the OCEG Capability Model to the BPS Suite s product capability. BPS Contents Introduction... 2 GRC activities... 2 BPS and the Capability Model for GRC...
More informationData Quality in the MDM Ecosystem
Solution Guide Data Quality in the MDM Ecosystem What is MDM? The premise of Master Data Management (MDM) is to create, maintain, and deliver the most complete and comprehensive view possible from disparate
More information2 The IBM Data Governance Unified Process
2 The IBM Data Governance Unified Process The benefits of a commitment to a comprehensive enterprise Data Governance initiative are many and varied, and so are the challenges to achieving strong Data Governance.
More informationEffective Risk Data Aggregation & Risk Reporting
Effective Risk Data Aggregation & Risk Reporting Presented by: Ilia Bolotine Head, Adastra Business Consulting (Canada) 1 The Evolving Regulatory Landscape in Risk Management A significant lesson learned
More informationIBM Software IBM InfoSphere Information Server for Data Quality
IBM InfoSphere Information Server for Data Quality A component index Table of contents 3 6 9 9 InfoSphere QualityStage 10 InfoSphere Information Analyzer 12 InfoSphere Discovery 13 14 2 Do you have confidence
More informationFull file at
Chapter 2 Data Warehousing True-False Questions 1. A real-time, enterprise-level data warehouse combined with a strategy for its use in decision support can leverage data to provide massive financial benefits
More informationEnabling efficiency through Data Governance: a phased approach
Enabling efficiency through Data Governance: a phased approach Transform your process efficiency, decision-making, and customer engagement by improving data accuracy An Experian white paper Enabling efficiency
More informationCA ERwin Data Profiler
PRODUCT BRIEF: CA ERWIN DATA PROFILER CA ERwin Data Profiler CA ERWIN DATA PROFILER HELPS ORGANIZATIONS LOWER THE COSTS AND RISK ASSOCIATED WITH DATA INTEGRATION BY PROVIDING REUSABLE, AUTOMATED, CROSS-DATA-SOURCE
More informationTrillium Consulting. Data Governance. Optimizing Business Outcomes through Data and Information Assets
Trillium Consulting Data Governance Optimizing Business Outcomes through Data and Information Assets DAMA Indiana Winter Meeting Indianapolis, Indiana January 20, 2011 Jim Orr, Global Director Enterprise
More informationImplementing a Successful Data Governance Program
Implementing a Successful Data Governance Program Mary Anne Hopper Data Management Consulting Manager SAS #AnalyticsX Data Stewardship #analyticsx SAS Data Management Framework BUSINESS DRIVERS DATA GOVERNANCE
More informationDataND Finance. A Journey into Enterprise Data Warehouse
DataND Finance A Journey into Enterprise Data Warehouse About the Presenters Vaibhav Agarwal Chris Frederick Manager, Finance Systems Email: vagarwal@nd.edu Business Intelligence Manager Email: cfreder2@nd.edu
More informationCopyright 2016 Datalynx Pty Ltd. All rights reserved. Datalynx Enterprise Data Management Solution Catalogue
Datalynx Enterprise Data Management Solution Catalogue About Datalynx Vendor of the world s most versatile Enterprise Data Management software Licence our software to clients & partners Partner-based sales
More informationData Management Framework
The Organization Management Framework Created and Presented By Copyright 2018 Management Is part of the Manage Knowledge, Improvement and Change process of the APQC Process Classification Framework (wwwapqcorg)
More informationData Governance Central to Data Management Success
Data Governance Central to Data Success International Anne Marie Smith, Ph.D. DAMA International DMBOK Editorial Review Board Primary Contributor EWSolutions, Inc Principal Consultant and Director of Education
More informationStandards: Implementation, Certification and Testing Work group Friday, May 8, :00 Pm-1:30 Pm ET.
Standards: Implementation, Certification and Testing Work group Friday, May 8, 2015. 12:00 Pm-1:30 Pm ET. Agenda Complete Work group Comments- Group 1 Review Group 2 Comments. 2015 Edition Certification
More informationTECHNOLOGY BRIEF: CA ERWIN DATA PROFILER. Combining Data Profiling and Data Modeling for Better Data Quality
TECHNOLOGY BRIEF: CA ERWIN DATA PROFILER Combining Data Profiling and Data Modeling for Better Data Quality Table of Contents Executive Summary SECTION 1: CHALLENGE 2 Reducing the Cost and Risk of Data
More informationIT123: SABSA Foundation Training
IT123: SABSA Foundation Training IT123 Rev.002 CMCT COURSE OUTLINE Page 1 of 8 Training Description: SABSA is the world s leading open security architecture framework and methodology. SABSA is a top-tobottom
More informationData Vault Brisbane User Group
Data Vault Brisbane User Group 26-02-2013 Agenda Introductions A brief introduction to Data Vault Creating a Data Vault based Data Warehouse Comparisons with 3NF/Kimball When is it good for you? Examples
More informationData Stewardship Core by Maria C Villar and Dave Wells
Data Stewardship Core by Maria C Villar and Dave Wells All rights reserved. Reproduction in whole or part prohibited except by written permission. Product and company names mentioned herein may be trademarks
More informationBusiness Impacts of Poor Data Quality: Building the Business Case
Business Impacts of Poor Data Quality: Building the Business Case David Loshin Knowledge Integrity, Inc. 1 Data Quality Challenges 2 Addressing the Problem To effectively ultimately address data quality,
More informationAVOIDING SILOED DATA AND SILOED DATA MANAGEMENT
AVOIDING SILOED DATA AND SILOED DATA MANAGEMENT Dalton Cervo Author, Consultant, Data Management Expert March 2016 This presentation contains extracts from books that are: Copyright 2011 John Wiley & Sons,
More informationLecture 18. Business Intelligence and Data Warehousing. 1:M Normalization. M:M Normalization 11/1/2017. Topics Covered
Lecture 18 Business Intelligence and Data Warehousing BDIS 6.2 BSAD 141 Dave Novak Topics Covered Test # Review What is Business Intelligence? How can an organization be data rich and information poor?
More informationMetadata Management as a Key Component to Data Governance, Data Stewardship, and Data Quality Management. Wednesday, July 20 th 2016
Metadata Management as a Key Component to Data Governance, Data Stewardship, and Data Quality Management Wednesday, July 20 th 2016 Confidential, Datasource Consulting, LLC 2 Multi-Domain Master Data Management
More informationMDM Partner Summit 2015 Oracle Enterprise Data Quality Overview & Roadmap
MDM Partner Summit 2015 Oracle Enterprise Data Quality Overview & Roadmap Steve Tuck Senior Director, Product Strategy Todd Blackmon Senior Director, Sales Consulting David Gengenbach Sales Consultant
More informationThe 360 Solution. July 24, 2014
The 360 Solution July 24, 2014 Most successful large organizations are organized by lines of businesses (LOBs). This has been a very successful way to organize for the accountability of profit and loss.
More information1 DATAWAREHOUSING QUESTIONS by Mausami Sawarkar
1 DATAWAREHOUSING QUESTIONS by Mausami Sawarkar 1) What does the term 'Ad-hoc Analysis' mean? Choice 1 Business analysts use a subset of the data for analysis. Choice 2: Business analysts access the Data
More informationIBM InfoSphere Master Data Management Version 11 Release 5. Overview IBM SC
IBM InfoSphere Master Data Management Version 11 Release 5 Overview IBM SC27-6718-01 IBM InfoSphere Master Data Management Version 11 Release 5 Overview IBM SC27-6718-01 Note Before using this information
More informationData Management Glossary
Data Management Glossary A Access path: The route through a system by which data is found, accessed and retrieved Agile methodology: An approach to software development which takes incremental, iterative
More informationOracle Data Integration
Oracle Data Integration The Essential Core of Data Governance with Oracle Enterprise Data Quality CON9539 Martin Boyd Senior Director Product Strategy, Oracle Brian Kleber Director Enterprise Data Management,
More informationTDWI strives to provide course books that are contentrich and that serve as useful reference documents after a class has ended.
Previews of TDWI course books offer an opportunity to see the quality of our material and help you to select the courses that best fit your needs. The previews cannot be printed. TDWI strives to provide
More informationIT Briefing. May 17, 2012 Goizueta Business School Room 231
IT Briefing May 17, 2012 Goizueta Business School Room 231 IT Briefing Agenda Unified Messaging Update ServiceNow - Request 2.0 University Service Desk Security Update Business Intelligence Jay Flanagan
More informationData Governance. Mark Plessinger / Julie Evans December /7/2017
Data Governance Mark Plessinger / Julie Evans December 2017 12/7/2017 Agenda Introductions (15) Background (30) Definitions Fundamentals Roadmap (15) Break (15) Framework (60) Foundation Disciplines Engagements
More informationDecision Support. Go-Live Update
Decision Support Go-Live Update UNLV Decision Support Purpose is to provide knowledgeable campus users with access to data for decision making. Principles embraced: Involvement of decision makers and
More informationBest Practices in Enterprise Data Governance
Best Practices in Enterprise Data Governance Scott Gidley and Nancy Rausch, SAS WHITE PAPER SAS White Paper Table of Contents Introduction.... 1 Data Governance Use Case and Challenges.... 1 Collaboration
More informationExperiences in Data Quality
Experiences in Data Quality MIT IQIS 2010 Annette Pence July 14-16, 2010 Approved for Public Release: 10-0686 Distribution Unlimited As a public interest company, MITRE works in partnership with the government
More informationIBM InfoSphere Information Analyzer
IBM InfoSphere Information Analyzer Understand, analyze and monitor your data Highlights Develop a greater understanding of data source structure, content and quality Leverage data quality rules continuously
More informationData Quality Framework
#THETA2017 Data Quality Framework Mozhgan Memari, Bruce Cassidy The University of Auckland This work is licensed under a Creative Commons Attribution 4.0 International License Two Figures from 2016 The
More informationData Profiling. A Quick Primer on the What and the Why of Data Integration AUTHORS
A Quick Primer on the What and the Why of Data Integration AUTHORS Shankar Ganesh R Senior Technical Architect Architecture and Technology Services HCL Technologies, Chennai Sathish Kumar Srinivasan Enterprise
More informationIBM Industry Data Models
IBM Software Group IBM Industry Data Models Usage, Process & Demonstration David Cope EDW Architect Asia Pacific 2007 IBM Corporation The EDW Data Model Business Requirements Analysis Design Planning Data
More informationREPORT 2015/186 INTERNAL AUDIT DIVISION
INTERNAL AUDIT DIVISION REPORT 2015/186 Audit of information and communications technology operations in the Secretariat of the United Nations Joint Staff Pension Fund Overall results relating to the effective
More informationMetadata Framework for Resource Discovery
Submitted by: Metadata Strategy Catalytic Initiative 2006-05-01 Page 1 Section 1 Metadata Framework for Resource Discovery Overview We must find new ways to organize and describe our extraordinary information
More informationqwertyuiopasdfghjklzxcvbnmqw ertyuiopasdfghjklzxcvbnmqwert uiopasdfghjklzxcvbnmqwertyuio
qwertyuiopasdfghjklzxcvbnmqw ertyuiopasdfghjklzxcvbnmqwert yuiopasdfghjklzxcvbnmqwertyui opasdfghjklzxcvbnmqwertyuiopa A Tutorial on Checking Data in a Database sdfghjklzxcvbnmqwertyuiopasdf DatabaseAnswers.org
More informationEnabling Data Governance Leveraging Critical Data Elements
Adaptive Presentation at DAMA-NYC October 19 th, 2017 Enabling Data Governance Leveraging Critical Data Elements Jeff Goins, President, Jeff.goins@adaptive.com James Cerrato, Chief, Product Evangelist,
More informationEfficiency Gains in Inbound Data Warehouse Feed Implementation
Efficiency Gains in Inbound Data Warehouse Feed Implementation Simon Eligulashvili simon.e@gamma-sys.com Introduction The task of building a data warehouse with the objective of making it a long-term strategic
More informationCHAPTER 3 Implementation of Data warehouse in Data Mining
CHAPTER 3 Implementation of Data warehouse in Data Mining 3.1 Introduction to Data Warehousing A data warehouse is storage of convenient, consistent, complete and consolidated data, which is collected
More information<Insert Picture Here> Enterprise Data Management using Grid Technology
Enterprise Data using Grid Technology Kriangsak Tiawsirisup Sales Consulting Manager Oracle Corporation (Thailand) 3 Related Data Centre Trends. Service Oriented Architecture Flexibility
More informationExperiences in Data Quality
Experiences in Data Quality ABSTRACT The MITRE Corporation is committed to helping the Federal government manage its data as an enterprise asset and make the best use of appropriate technologies and services
More informationUniversity of Texas Arlington Data Governance Program Charter
University of Texas Arlington Data Governance Program Charter Document Version: 1.0 Version/Published Date: 11/2016 Table of Contents 1 INTRODUCTION... 3 1.1 PURPOSE OF THIS DOCUMENT... 3 1.2 SCOPE...
More informationMatch data set availability to data resource requirements, including gap analysis and remediation assistance.
Discovering data/datasets Specify Data Requirements Identify Data Assets Assist customers with clarifying problem statements, use cases, high-level requirements (e.g. goals, objectives) and detailed requirements
More informationData Clairvoyance. A business approach to data. Real data practitioners, delivering real improvements to your enterprise data assets.
Data Clairvoyance A business approach to data. A professional services firm that provides a very unique and holistic approach that enables your organization to be successful in traversing the data challenges
More informationBusiness Intelligence and Decision Support Systems
Business Intelligence and Decision Support Systems (9 th Ed., Prentice Hall) Chapter 8: Data Warehousing Learning Objectives Understand the basic definitions and concepts of data warehouses Learn different
More informationSTRATEGIC DATA ORGANISATION SOLUTION
STRATEGIC DATA ORGANISATION SOLUTION STRATEGIC DATA ORGANISATION The aim is to be the internal data provider of choice within the firm by: employing governance best practices providing high-quality products
More informationManaging Data Resources
Chapter 7 Managing Data Resources 7.1 2006 by Prentice Hall OBJECTIVES Describe basic file organization concepts and the problems of managing data resources in a traditional file environment Describe how
More informationData Strategies for Efficiency and Growth
Data Strategies for Efficiency and Growth Date Dimension Date key (PK) Date Day of week Calendar month Calendar year Holiday Channel Dimension Channel ID (PK) Channel name Channel description Channel type
More informationCISM QAE ITEM DEVELOPMENT GUIDE
CISM QAE ITEM DEVELOPMENT GUIDE ISACA 2015. All Rights Reserved. 2 TABLE OF CONTENTS PURPOSE OF THE CISM QAE ITEM DEVELOPMENT GUIDE... 3 PURPOSE OF THE CISM QAE... 3 CISM EXAM STRUCTURE... 3 WRITING QUALITY
More informationData Quality for PowerCenter Users: Expanding Beyond ETL. Marina Grebenkova Principal Product Manager Informatica
Data Quality for PowerCenter Users: Expanding Beyond ETL Marina Grebenkova Principal Product Manager Informatica 2 Agenda Do you trust your data? What is Data Quality? Data Quality process How it complements
More informationCisco Director Class SAN Planning and Design Service
Cisco Director Class SAN Planning and Design Service Rapidly Plan and Deploy a Cisco Director Class MDS Solution for More Efficient Storage Networking Service Overview Cisco s intelligent storage networking
More informationData Quality Assessment Framework
Data Quality Assessment Framework ABSTRACT Many efforts to measure data quality focus on abstract concepts and cannot find a practical way to apply them. Or they attach to specific issues and cannot imagine
More informationITIL and IT Service Management
Background and Introduction to ITIL and IT Service Management Agenda/Learning Objectives What is ITIL The history of ITIL The key components of version 3 (the Lifecycle) The key advantages and Objectives
More informationData Protection. Plugging the gap. Gary Comiskey 26 February 2010
Data Protection. Plugging the gap Gary Comiskey 26 February 2010 Data Protection Trends in Financial Services Financial services firms are deploying data protection solutions across their enterprise at
More informationFederal Government. Each fiscal year the Federal Government is challenged CATEGORY MANAGEMENT IN THE WHAT IS CATEGORY MANAGEMENT?
CATEGORY MANAGEMENT IN THE Federal Government Each fiscal year the Federal Government is challenged to accomplish strategic goals while reducing spend and operating more efficiently. In 2014, the Federal
More informationABU DHABI GOVERNMENT DATA MANAGEMENT POLICY VERSION 1.0. ADSIC-DM-PD03 Data Management Policy
ABU DHABI GOVERNMENT DATA MANAGEMENT POLICY VERSION 1.0 ADSIC-DM-PD03 Data Management Policy Contents DEFINITIONS 01 1 EXECUTIVE SUMMARY 04 2 INTRODUCTION 05 2.1 Purpose 05 2.2 Scope 05 3 COMPLIANCE AND
More informationto the Enterprise Brussels - Tuesday 20th April 2004 Chris Greenslade Introducing Enterprise Architecture Introducing Enterprise Architecture
Introducing Enterprise Architecture to the Enterprise Brussels - Tuesday 20th April 2004 Chris Greenslade Chris@.com 1 of 28 Approach Every situation is different The organization Its history and its current
More informationDATA GOVERNANCE LEADS TO DATA QUALITY
DATA GOVERNANCE LEADS TO DATA QUALITY Trending. Kash Mehdi Senior Product Specialist and Instructor May 3, 2017 1 Collibra 2017 2017 Collibra Inc How Many of Your Reports Have Good Data Quality? What would
More informationEUROPEAN ICT PROFESSIONAL ROLE PROFILES VERSION 2 CWA 16458:2018 LOGFILE
EUROPEAN ICT PROFESSIONAL ROLE PROFILES VERSION 2 CWA 16458:2018 LOGFILE Overview all ICT Profile changes in title, summary, mission and from version 1 to version 2 Versions Version 1 Version 2 Role Profile
More informationThe Data Organization
C V I T F E P A O TM The Data Organization Best Practices Metadata Dictionary Application Architecture Prepared by Rainer Schoenrank January 2017 Table of Contents 1. INTRODUCTION... 3 1.1 PURPOSE OF THE
More informationThe Data Organization Yosemite Way Hayward, CA (510) The Data Warehouse Conceptual Data Model
C V I T F E P A O TM The Data Organization 1251 Yosemite Way Hayward, CA 94545 (510) 303-8868 rschoenrank@computer.org The Data Warehouse Conceptual Data Model By Rainer Schoenrank Data Warehouse Consultant
More informationGuidance Solvency II data quality management by insurers
Guidance Solvency II data quality management by insurers De Nederlandsche Bank N.V. Guidance Solvency II data quality management by insurers Guidance document of De Nederlandsche Bank N.V., dated 1 September
More informationLosing Control: Controls, Risks, Governance, and Stewardship of Enterprise Data
Losing Control: Controls, Risks, Governance, and Stewardship of Enterprise Data an eprentise white paper tel: 407.591.4950 toll-free: 1.888.943.5363 web: www.eprentise.com Author: Helene Abrams www.eprentise.com
More informationAutomating for Agility in the Data Center. Purnima Padmanabhan Jeff Evans BMC Software
Automating for Agility in the Data Center Purnima Padmanabhan Jeff Evans BMC Software 9/5/2006 Agenda The Situation Challenges Objectives BMC Solution for Data Center Closed-Loop Change Data Center Optimization
More informationCertificate Software Asset Management Essentials Syllabus. Version 2.0
Certificate Software Asset Management Essentials Syllabus Version 2.0 June 2010 Certificate in Software Asset Management Essentials Leaning Objectives Holders of the ISEB Certificate in SAM Essentials
More informationUsing SAS-DataFlux technologies to implement a dedupe key on a retail customer warehouse IN A BANK
Using SAS-DataFlux technologies to implement a dedupe key on a retail customer warehouse IN A BANK Background One of the four largest banks in South Africa Branded Businesses: Nedbank, Old Mutual Bank,
More informationWKU-MIS-B10 Data Management: Warehousing, Analyzing, Mining, and Visualization. Management Information Systems
Management Information Systems Management Information Systems B10. Data Management: Warehousing, Analyzing, Mining, and Visualization Code: 166137-01+02 Course: Management Information Systems Period: Spring
More informationTDWI strives to provide course books that are content-rich and that serve as useful reference documents after a class has ended.
Previews of TDWI course books are provided as an opportunity to see the quality of our material and help you to select the courses that best fit your needs. The previews can not be printed. TDWI strives
More informationTX CIO Leadership Journey Texas CIOs Bowden Hight Texas Health and Human Services Commission Tim Jennings Texas Department of Transportation Mark
TX CIO Leadership Journey Texas CIOs Bowden Hight Texas Health and Human Services Commission Tim Jennings Texas Department of Transportation Mark Stone Texas A&M University System Moderator Anh Selissen
More informationWork Breakdown Structure
MossAtre.book Page 491 Sunday, February 9, 2003 7:01 PM APPENDIX Work Breakdown Structure The work breakdown structure in this appendix reflects the contents of the enclosed CD-ROM. TASK_DATA 491 1 Your
More informationTHE STATE OF DATA QUALITY
THE STATE OF DATA QUALITY HOW TO START A DATA QUALITY PROGRAM Organizations claim to make decisions based on data, but are they really? Ask a few simple questions of the data they are using for those presumed
More informationSOLUTION BRIEF RSA ARCHER IT & SECURITY RISK MANAGEMENT
RSA ARCHER IT & SECURITY RISK MANAGEMENT INTRODUCTION Organizations battle growing security challenges by building layer upon layer of defenses: firewalls, antivirus, intrusion prevention systems, intrusion
More informationDATA MINING AND WAREHOUSING
DATA MINING AND WAREHOUSING Qno Question Answer 1 Define data warehouse? Data warehouse is a subject oriented, integrated, time-variant, and nonvolatile collection of data that supports management's decision-making
More informationData Migration Platform
appmigrate TM Data Migration Platform QUALITY MANAGEMENT RECONCILIATION MAINTENANCE MIGRATION PROFILING COEXISTENCE CUTOVER PLANNING AND EXECUTION ? Problem Data migration done in the traditional way,
More informationSTRATEGIC PLAN
STRATEGIC PLAN 2013-2018 In an era of growing demand for IT services, it is imperative that strong guiding principles are followed that will allow for the fulfillment of the Division of Information Technology
More informationOn Premise. Service Pack
On Premise Service Pack 02.0.01 - This Documentation, which includes embedded help systems and electronically distributed materials, (hereinafter referred to as the Documentation ) is for your informational
More informationManaging IT Risk: The ISACA Risk IT Framework. 1 st ISACA Day, Sofia 15 October Charalampos (Haris)Brilakis, CISA
Managing IT Risk: The ISACA Risk IT Framework Charalampos (Haris)Brilakis, CISA ISACA Athens Chapter BoD / Education Committee Chair Sr. Manager, Internal Audit, Eurobank (Greece) 1 st ISACA Day, Sofia
More informationTDWI Data Modeling. Data Analysis and Design for BI and Data Warehousing Systems
Data Analysis and Design for BI and Data Warehousing Systems Previews of TDWI course books offer an opportunity to see the quality of our material and help you to select the courses that best fit your
More informationPERSPECTIVE. Effective Data Governance. Abstract
PERSPECTIVE Effective Governance Abstract governance is no more just another item that is good to talk about and nice to have, for global data management organizations. This PoV looks into why data governance
More informationIntroduction to Data Science
UNIT I INTRODUCTION TO DATA SCIENCE Syllabus Introduction of Data Science Basic Data Analytics using R R Graphical User Interfaces Data Import and Export Attribute and Data Types Descriptive Statistics
More informationOn Premise. Service Pack
On Premise Service Pack 02.0.01 - This Documentation, which includes embedded help systems and electronically distributed materials, (hereinafter referred to as the Documentation ) is for your informational
More informationAugust Oracle - GoldenGate Statement of Direction
August 2015 Oracle - GoldenGate Statement of Direction Disclaimer This document in any form, software or printed matter, contains proprietary information that is the exclusive property of Oracle. Your
More informationData Governance Quick Start
Service Offering Data Governance Quick Start Congratulations! You ve been named the Data Governance Leader Now What? Benefits Accelerate the initiation of your Data Governance program with an industry
More informationLuncheon Webinar Series June 3rd, Deep Dive MetaData Workbench Sponsored By:
Luncheon Webinar Series June 3rd, 2010 Deep Dive MetaData Workbench Sponsored By: 1 Deep Dive MetaData Workbench Questions and suggestions regarding presentation topics? - send to editor@dsxchange.com
More informationTechno Expert Solutions An institute for specialized studies!
Getting Started Course Content of IBM Cognos Data Manger Identify the purpose of IBM Cognos Data Manager Define data warehousing and its key underlying concepts Identify how Data Manager creates data warehouses
More informationWeb CRM Project. Logical Data Model
Web CRM Project Logical Data Model Prepared by Rainer Schoenrank Data Warehouse Architect The Data Organization 11 December 2007 DRAFT 4/26/2018 Page 1 TABLE OF CONTENTS 1. CHANGE LOG... 5 2. DOCUMENT
More informationGlobal Address Book. Microsoft Dynamics AX White Paper
Microsoft Dynamics AX 2009 Global Address Book White Paper This document provides information about sharing party records in global address book across companies and within companies in Microsoft Dynamics
More informationOverview. Introduction to Data Warehousing and Business Intelligence. BI Is Important. What is Business Intelligence (BI)?
Introduction to Data Warehousing and Business Intelligence Overview Why Business Intelligence? Data analysis problems Data Warehouse (DW) introduction A tour of the coming DW lectures DW Applications Loosely
More informationIntroducing Enterprise Architecture. into the Enterprise
Introducing Enterprise Architecture into the Enterprise Washington - 21st October 2003 Chris Greenslade Chris@Architecting-the-Enterprise.com Introducing Enterprise Architecture 1 of 28 TA P16 1 Approach
More informationSolving the Enterprise Data Dilemma
Solving the Enterprise Data Dilemma Harmonizing Data Management and Data Governance to Accelerate Actionable Insights Learn More at erwin.com Is Our Company Realizing Value from Our Data? If your business
More informationITIL Managing Across the Lifecycle Course
ITIL Managing Across the Lifecycle Course Duration: 5 Days Course Delivery: Classroom Language: English Course Overview ITIL 2011 edition is comprised of five core publications: Service Strategy, Service
More information