Enterprise Architecture led Data Quality Strategy

Jay Barua
Director, IT, Direct General
jay.barua@directgeneral.com

Executive Summary/Abstract

It is estimated that corporations annually lose billions of dollars cleaning up bad information. It is like having one major oil spill every year. Imagine. Why do corporations not use enterprise methods to eliminate bad data before it hits our shores?

I believe data flows through enterprises with interference everywhere. Only a marriage between governance and enterprise quality controls will ensure clean data.

Data and information are not seen as assets. A healthcare company will invest in an interventions quality lab but will not support a data quality lab. Information generated from bad data leaves the company unable to understand the outcomes of interventions or to create better patient engagement programs.

Enterprise architecture needs to define quality strategies and create data quality labs. They are two sides of the same coin.

Corporations annually lose billions of dollars to clean up bad data
Like having one major oil spill every year. Imagine.
[Slide images labeled 1989 and 2010; caption: Yes it can]

Losses due to bad data are unimaginable
- A study by AT Kearney shows that retail companies lose over $40 billion every year due to supply chain inefficiencies.
- 30% of item data in catalogs is in error, and each item costs on average $75 to clean.
- 60% of all invoices have errors, and it takes approximately $220 on average to reconcile each invoice.
- Errors happen because data such as item, price, and units of sale is out of sync between businesses doing B2B commerce.
- Lack of data governance is the largest contributor to the above problem.

Market Demands Clean Data
- The recent upsurge in online retail sales highlights the need for clean data.
- Online retailers spend millions of dollars cleaning inaccurate information entered at checkout.
- Overstock.com spent over $200,000 a year correcting bad addresses that had negatively impacted shipment delivery, raised costs and, above all, made customers unhappy.
- Overstock.com saved more than $1M every year once it adopted an automatic address verification system.
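As a rough illustration of how the per-item and per-invoice figures quoted from the AT Kearney study translate into a cleanup budget, the sketch below multiplies them out. The error rates and unit costs come from the slide; the catalog size and invoice volume are hypothetical numbers chosen only for the example.

```python
# Figures quoted on the slide (AT Kearney study).
ITEM_ERROR_RATE = 0.30        # share of catalog items in error
COST_PER_ITEM = 75            # dollars to clean one item
INVOICE_ERROR_RATE = 0.60     # share of invoices with errors
COST_PER_INVOICE = 220        # dollars to reconcile one invoice


def cleanup_cost(volume, error_rate, unit_cost):
    """Expected yearly cost of cleaning `volume` records."""
    return volume * error_rate * unit_cost


# Hypothetical volumes, not taken from the source.
catalog_items = 10_000
invoices_per_year = 5_000

item_cost = cleanup_cost(catalog_items, ITEM_ERROR_RATE, COST_PER_ITEM)
invoice_cost = cleanup_cost(invoices_per_year, INVOICE_ERROR_RATE, COST_PER_INVOICE)

print(f"Catalog cleanup:        ${item_cost:,.0f}")
print(f"Invoice reconciliation: ${invoice_cost:,.0f}")
print(f"Total:                  ${item_cost + invoice_cost:,.0f}")
```

Swapping in an organization's own volumes gives a first-order figure to set against the cost of a governance program.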

What is Data Quality?
- Wikipedia's definition: data are of high quality "if they are fit for their intended uses in operations, decision making, and planning" (J. M. Juran).
- Some people view data quantitatively.
- Others look at it qualitatively.

Quantitative Data
- Put a number on it and correct it!
- How many customer records in my data warehouse have more than one primary email?
- How many times in a year have we sent wrong emails to wrong customers?
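The first question on the Quantitative Data slide can be answered with a simple count. The following is a minimal sketch, assuming email records are available as (customer_id, email, is_primary) tuples; that layout and the sample rows are hypothetical, not taken from the presentation.

```python
from collections import Counter

# Hypothetical sample rows: (customer_id, email, is_primary)
email_rows = [
    (101, "ann@example.com", True),
    (101, "ann.work@example.com", True),
    (102, "bob@example.com", True),
    (102, "bob.alt@example.com", False),
    (103, "cy@example.com", True),
]


def customers_with_multiple_primary_emails(rows):
    """Return sorted customer ids that have more than one distinct primary email."""
    # Lower-case the address so case variants of one email are not double counted.
    primary = {(cid, email.lower()) for cid, email, is_primary in rows if is_primary}
    counts = Counter(cid for cid, _ in primary)
    return sorted(cid for cid, n in counts.items() if n > 1)


offenders = customers_with_multiple_primary_emails(email_rows)
print(f"{len(offenders)} customer(s) with more than one primary email: {offenders}")
```

Tracking this count over time gives the "number" the slide asks for, and the returned ids are the records to correct.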

Qualitative Data
- Data that is bad for one use can be good for another.
- Customer and email matching processes in a system that takes online orders can differ from those in a system that tracks gift card sales for the same customer.
- The same customer may have different emails in the two systems.

Why does data go bad?
- Unmanageable silos create bad data: different versions of data can create different interpretations.
- Disconnected processes create bad data: a disconnected process acting on a data supply chain can make the data bad.
- Lack of governance creates bad data: lack of governance can create disconnected data.

Can Enterprise led processes help?

Enterprise Architecture
From Incite comes Insight (Courtesy Terrybone)

Let us Create a Data Quality Lab

By Investing in Enterprise Architecture (EA)
- Adopt a Data Quality Strategy that works for you
- Set up a simple Enterprise Data Architecture framework

Build your Data Quality (DQ) strategy with
- Data Governance
- Data Integration
- Metadata Management

Use Data Governance with tactical policing
- Acquire management responsibility
- Appoint and empower data quality inspectors
- Set standard data definitions
- Follow a culturally accepted evolutionary process
- Use end-to-end benchmarked processes

Data Integration promotes rich data
- Evaluate and migrate core legacy systems to a common platform
- Create a comprehensive view of key data: customers and products
- Extend your workflow
- US Xpress saves $6 million a year using data integration methods to clean location information and manage truck idle time. They converted their bad data to rich data.

Data Production Factory (DPF)

            PRODUCT             DATA
INPUT       Raw Materials       Raw Data
PROCESS     Processing          Processing
OUTPUT      Physical Product    Data Product

Analogy between Physical and Data Products (Wang et al. [5])

Metadata Management
- Let us have a name for everything!
- It is important to define data so that correct decisions are taken, which in turn produces cleaner data.
- It is important to define data ancestry so that we can explain how data is transformed in a data production factory.
- Effective schema attribution provides meaning to data.

Seven DQ Enablers
- Completeness: Is all information available?
- Conformity: Does the data agree with the expected formats?
- Consistency: Does data instantiation provide the same value?
- De-duplication: Try to maintain one version of the truth
- Uniqueness: Does a collection of data maintain identity?
- Integrity: Are dependencies between entities maintained?
- Accuracy: Real-world representation
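Several of the enablers above can be expressed as simple ratios over a data set. The sketch below scores completeness, conformity and uniqueness on a handful of hypothetical customer records. The field names, the regular expressions and the scoring rules are assumptions made for illustration; the slides do not prescribe how each enabler is measured.

```python
import re

# Hypothetical customer records; field names are illustrative only.
records = [
    {"id": 1, "email": "ann@example.com", "zip": "37201"},
    {"id": 2, "email": "", "zip": "3720"},
    {"id": 2, "email": "bob@example", "zip": "97201"},
]

# Deliberately simple format rules, assumed for this example.
EMAIL_RE = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")
ZIP_RE = re.compile(r"^\d{5}$")


def completeness(rows, field):
    """Share of rows where the field is present and non-empty."""
    return sum(1 for r in rows if r.get(field)) / len(rows)


def conformity(rows, field, pattern):
    """Share of non-empty values that match the expected format."""
    values = [r[field] for r in rows if r.get(field)]
    if not values:
        return 1.0
    return sum(1 for v in values if pattern.match(v)) / len(values)


def uniqueness(rows, key):
    """Ratio of distinct key values to total rows (1.0 means no duplicates)."""
    keys = [r[key] for r in rows]
    return len(set(keys)) / len(keys)


report = {
    "email completeness": completeness(records, "email"),
    "email conformity": conformity(records, "email", EMAIL_RE),
    "zip conformity": conformity(records, "zip", ZIP_RE),
    "id uniqueness": uniqueness(records, "id"),
}
for name, score in report.items():
    print(f"{name}: {score:.2f}")
```

Consistency, integrity and accuracy usually need a second data set or a reference source to compare against, so they are left out of this single-table sketch.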

Create your EA framework to enable DQ
- Data Profiling
- Data Auditing
- Exception Management
- Data Integration
- Data Architecture

Embed Data Quality in Data Integration
- Clinical research mandates clean data, which is scattered
- SAS Clinical Data Integration brings analysis-ready healthcare data
- Brings healthcare-specific clinical integration knowledge to transformation modules
- Reduces development time spent on custom coding for clinical studies
- Modules adhere to standards like CDISC SDTM

Keep Data Structures Simple
- Design simple and generic data structures
- Maintain lossless joins to preserve data dependencies
- Maintain the right degree of data coupling with fewer parameterized elements

Keep the coupling simple
(Courtesy Craig Borysowich, Chief Technology Tactician)

Justify Data Quality Investment
- Calculate the cost to clean before Data Quality
- Document reduction of errors
- Show decrease of customer service calls
- Sell Data Quality to top bosses

Ultimately Treat Data as an Asset
How do we manage data as a strategic asset?
- Invest dollars
- Time to market, quality, customer satisfaction, compliance, regulations, etc.

And use it as an Investment
- Sell high-quality data to customers
- Use it as a strategic advantage

Enterprise Data Warehouse and BI

Data is the Lifeblood of the Insurance Industry
- Integrate data subject areas and use the combined data to track and measure retention following key touch points such as billing, endorsement, renewals and claims activity, to determine which transactions influence rates and trends
- Effectively use BI techniques to predict, detect and combat fraud
- Provide a single view of a customer to understand their behavior and LTV across channels and segments
- Work with the business to review KPIs and make them standardized and actionable
- Improve data quality through a data governance model

1. Full use of insurance ratios for performance measurement across the organization by using industry-standard key performance metrics.
2. Actionable performance metrics:
   - Loss Ratios
   - Hit Ratios
   - Earned Premium
   - Pure Premium
   - ALAE (Allocated Loss Adjustment Expense)
   - Paid Severity
   - Features
   - Attorney Rep Rate
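Some of the metrics listed above are plain ratios, and standardizing them can begin with agreeing on a single formula for each. The sketch below uses commonly cited definitions; these are assumptions, since the slides do not define the metrics and carriers differ on details such as whether loss adjustment expense is included in the loss ratio. All input figures are hypothetical.

```python
def loss_ratio(incurred_losses, earned_premium):
    """Incurred losses divided by earned premium."""
    return incurred_losses / earned_premium


def hit_ratio(policies_bound, quotes_issued):
    """Share of quotes that became bound policies."""
    return policies_bound / quotes_issued


def pure_premium(incurred_losses, earned_exposures):
    """Average loss per unit of exposure."""
    return incurred_losses / earned_exposures


def paid_severity(paid_losses, paid_claim_count):
    """Average paid amount per paid claim."""
    return paid_losses / paid_claim_count


# Hypothetical figures for one reporting period.
kpis = {
    "loss_ratio": loss_ratio(650_000, 1_000_000),
    "hit_ratio": hit_ratio(120, 400),
    "pure_premium": pure_premium(650_000, 2_000),
    "paid_severity": paid_severity(480_000, 160),
}
for name, value in kpis.items():
    print(f"{name}: {value:,.2f}")
```

Keeping each definition in one shared function is one way to make a KPI "standardized and actionable": every report that calls it computes the metric the same way.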

References
1. Wenfei Fan. Dependencies revisited for improving data quality. Symposium on Principles of Database Systems, ACM SIGMOD-SIGACT-SIGART.
2. Joseph Moses Juran. Quality Control Handbook. New York, McGraw-Hill, 1951.
3. Richard Wang, Yang Lee, Mostapha Ziad. Data Quality (The Kluwer International Series on Advances in Database Systems, Volume 23).
4. David Loshin. White paper: Data Warehouse ROI (Knowledge Integrity/Informatica).
5. Richard Y. Wang, Veda C. Storey, and Christopher P. Firth. A Framework for Analysis of Data Quality Research.
6. http://www.sas.com/resources/factsheet/sas-clinical-data-integration-factsheet.pdf

Questions?
jay_barua@directgeneral.com
503-804-1633
Thank You!