DATA GOVERNANCE LEADS TO DATA QUALITY Trending. Kash Mehdi Senior Product Specialist and Instructor May 3, 2017 1 Collibra 2017 2017 Collibra Inc
How Many of Your Reports Have Good Data Quality? What would you say right now in a board meeting if someone says: Half of the data you have is wrong? Do you even know the quality of your data? How many of your reports have good data quality? What data quality checks do you have in place? What data processes do you have in place? 2 Collibra 2017
3 Collibra 2017
Drive Business Value Through Data Governance Data is growing in volume, velocity, veracity. Where is the value? 4 Collibra 2017 By 2020 more than 20 billion objects will be connected to the Internet of Things - Gartner
What We See in the Market: Data Governance Data governance & stewardship provide the right level of control and trust in data Data infrastructure (IT) Data Consumers (Business) LEADERSHIP CIO ROLES Information Manager, Data Architect, Data Modeler TECHNOLOGY Hadoop, Databases, Data Integration DATA AUTHORITY LEADERSHIP Chief Data Officer ROLES Data Governance Manager, Data Steward TECHNOLOGY Data Stewardship Platform LEADERSHIP CEO, CFO, VP, Marketing ROLES Data Scientist, Business Analyst TECHNOLOGY Visualization, Self-service BI 5 Collibra 2017
Prediction: Data Akin to Gold Internet of Things value add by 2020 1.9 Trillion Source: Gartner 6 Collibra 2017
Why Start with Data Governance Organization Formalize data governance council, stewardship, and working groups on data quality Multi-tier organization support: centralized, decentralized, or federated People & Processes Roles, responsibilities, and controls around data quality Business ownership Workflows and common processes to manage data quality issues and approval of data quality components Collaboration for remediation Data Quality Data quality scorecards, business rules, and metrics Policies and standards Impact analysis Data quality framework While building a data lake what data quality checks and processes to put in place 7 Collibra 2017
8 Collibra 2017 Data Toll Plaza
Example: How a Financial Services Customer is Doing It Formalizing data ownership Building data quality standards and processes Determining what data quality checks and processes to put in place while building a data lake Creating a data quality framework Creating a data quality architecture (data lake example) Determining how the business can be involved while creating data quality framework for collaboration 9 Collibra 2017
Journey to Data Governance Enabling data governance with the project lifecycle 10 Collibra 2017
Enterprise Data Management: Why Now? Supporting our next waves of growth Data quality deteriorates in favor of capability Enable innovation and scalable capabilities to support growth IDEA TORNADO OF GROWTH OPERATIONAL EXCELLENCE ZONE Successful companies focus on growth and getting things done Successful companies drive operational excellence 11 Collibra 2017 Source: Geoffrey A. Moore from the book, Dealing with Darwin (2005)
Year 1: Data Governance Focus Defining and implementing a framework to guide execution of enterprise data governance Data Definitions Data Ownership Data Quality 12 Collibra 2017
Year 1: Data Governance Focus Data Definitions NetApp business glossary Data communities and domains Definition approval 13 Collibra 2017
Year 1: Data Governance Focus Executive Ownership Roles and responsibilities Executive owners, stewards, and stakeholders Exception review process Data governance councils 14 Collibra 2017
Year 1: Data Governance Focus Data Quality Metrics framework Closed loop corrective action Project lifecycle best practices 15 Collibra 2017
Objectives of EDM Project Lifecycle Integration Data Governance Data Quality Data Architecture Compliance Ensuring project alignment to data governance strategies. Ensuring that the project has a high level of confidence in data quality before go-live. Ensuring project alignment to data architecture strategies. Ensuring that NetApp complies with data controls. Deliverables and Steps: Deliverables and Steps: Deliverables and Steps: Deliverables and Steps: Data Requirements Data Quality Assessment EA Checklist Privacy Impact Assessment Data Definitions and Ownership Stakeholder Approvals Definitions in NetApp Business Glossary Data Quality Strategy Data Quality Work Plan Data Integration / Conversion Design Data Integration / Conversion Execution Summary Conceptual Solution Architecture Detailed Solution Architecture Data Elements Matrix User Profile Matrix 16 Collibra 2017
EDM Project Lifecycle Integration Framework Project lifecycle phases and checkpoints Concept Commit Execute Commit Request Lifecycle Plan Design Build Test Transition & Close Data Governance Data Definitions & Ownership Data requirements (part of BRD) Obtain Stakeholder Approvals Add Definitions to NetApp Business Glossary Track Data Gov. Issues and Resolutions Data Quality Data Quality Assessment Data Quality Strategy Data Quality Work Plan Data Integration / Conversion Design (as part of Technical Design) Data Quality Operational Process and Support Plan User Acceptance of Data Quality (Metrics, Questionnaire) Data Integration / Conversion Execution Summary Data Architecture Enterprise Architecture Alignment Checklist Conceptual Solution Architecture Detailed Solution Architecture Logical Data Model and Physical Database Design (part of Technical Design) Incorporate project Solution Architecture into Enterprise Architecture Diagrams Compliance Preliminary Privacy Impact Assessment (PIA) (If required) Data Elements Matrix (if required) User Profile Matrix (if required) 17 Collibra 2017
NetApp Business Glossary Collibra Data Governance Center building content since 2010 88 Business Domains 35 unique Executive Owners 70 unique Domain Stewards 100+ Business Stewards 3008 Business Terms 1726 Acronyms 796 Measures 49 Metrics 49 Hierarchies 37 Processes 21 Policies 49 Product Reporting Logic Business Rules 242 Reports 31 ebi dashboard reports 504 Systems / Business applications 3056 Data Attributes 391 Data Base Tables (EPIC) 4 Reference data domains 8424 Code Values Best Practices documentation Over 10,000 monthly page views We cannot fix what is not defined and measured 18 Collibra 2017 2017 NetApp, Inc. All rights reserved. --- NETAPP CONFIDENTIAL ---
NetApp Business Glossary Community and Domain Grouping 12 Communities 88 Business Domains Acronyms Corporate Product USPS Federal Company Account Competitor Partners Corporate Services Alternate Worker - AWF Business Apps & Systems Corporate Terms Data Privacy Employee Information Technology Legal Records Management Risk Management Safety & Security - SAS Staffing Travel & Corp Cards Workplace Resources -WPR Enterprise Reporting Reporting Services Business Terms (more domains being built) Finance Accounts Payable Backlog Cost Accounting External Reporting Financial Plan & Forecast General Accounting NetApp Capital Solutions - NCS Payroll Revenue SOX Stock Tax Treasury Marketing Campaigns & Leads Contact Content Discoverability Digital Marketing Globalization Product Data Security - Product Development and QA DRM Product Attributes EPIC Product Attributes Hardware Pricing Product Reporting Data Attributes Product Reporting Logic Product Structure Release Process Software Software Licensing & Entitlement Software Products Solutions Quote to Cash AR Credit & Collections Bookings and Sales Orders DCO Product Attributes Invoice QE Product Attributes Quote Product Setup- Templates and User Item Types Trade Compliance Sales Commissions OEM Business Opportunity Processes - Global Sales Operations Sales Booking Forecast Sales Territory Hierarchy US Public Sector - USPS Services - Customer AutoSupport - ASUP Customer Support Delivery -General Customer Support Delivery-Reporting Installed Base NetAppU Processes- Customer Services Programs Professional Services SAP Data Attributes Service Parts Support & Warranty Offerings Legend Green: Executive Owner secured Orange: replacement pending Blue: new domain - pending Stewardship & Data Governance Country Codes ERDM Metrics Glossary Data Type Classifications Governance & Stewardship Supply Chain Operations Demand Planning Inventory Management Logistics and Shipments Manufacturing Test & Quality Sales & Ops Planning (S&OP) Sourcing & Procurement Supply Chain Design & Mgmt Supply Planning Executive ownership secured 19 Collibra 2017 2017 NetApp, Inc. All rights reserved. --- NETAPP CONFIDENTIAL ---
Action Plan Activities Identify 10 Reports Perform data quality assessment Identify executive owners, stewards, and stakeholders Define exception review process Scale it for other asset types 20 Collibra 2017
THANK YOU Kash Mehdi ( e ) kashif.mehdi@collibra.com, ( t ) @kash_mehdi ( ln ) linkedin.com/in/kashmehdi 21 Collibra 2017 2017 Collibra Inc