Good analytics needs good data and that needs good metadata


1 Good analytics needs good data and that needs good metadata. 28th February 2018. Mandy Chessell CBE FREng CEng FBCS, Distinguished Engineer, Master Inventor, Analytics Chief Data Office

2 Introduction and background. Mandy Chessell CBE FREng, Distinguished Engineer, Master Inventor. Focused on Analytics, Open Metadata and Governance. 30 years at IBM working on exchange and integration of data between business systems.

3 Agenda Why is metadata so important today? What is the challenge? Building an open ecosystem Apache Atlas and the specifics ODPI Data Governance PMC Progress report and call to action

4 Data and analytics affect every aspect of modern life: Healthcare; Government; Manufacturing; Telecommunication; Insurance; Energy & Utilities; Financial Services; Retail; High Tech; Hospital; Oil & Gas; Travel & Hotel; Transportation; Multi-channel integration; Stock Market

5 Today's technology is inherently ethics-agnostic but it pushes the art of the possible to new limits: The availability of a wide range of data from many sources. The ability to cheaply process and link this data together to understand a bigger picture. The accuracy with which an individual can be identified and targeted. The ability to pinpoint location for contextual insight and surveillance. The application of this new insight to a wide range of activities and actions. The operation of the insight in real-time or near real-time. Increasing both the ability and the obligation to act.

6 Open Metadata and Governance: MANAGING DATA FOR DATA SCIENCE

7 The perils of reusing data. Callie Quartile, Data Scientist, uses (1) open data from the local government registrar and (2) data from the employee directory to (3) create a birthday card service for the company. [Figure labels: 1 Open Data Site; 2 Employee Directory; 3 Data Lake.]

8 The perils of reusing data. [Figure: a "Happy Birthday" card is answered with "But it's not my birthday". Labels: 1 Open Data Site; 2 Employee Directory; 3 Data Lake.] Unfortunately the obvious date in the registrar record was the registration-of-birth date, not the date of birth. Date of birth was not published in the open data. Callie needed better information about the open data to realise she had the wrong data.

9 Metadata should bring as much information about the data sets to Callie's data science as is known collectively by the organization.
Data Set Name: Employee Directory
Description: Core attributes describing all employees of OCO pharmaceuticals created from a daily extract from Kenexa.
Owner: Penny Payer
Classification Ranges: Confidentiality: Public, Confidential, Sensitive. Confidence: Authoritative. Retention: Indefinitely.
Status: Last accessed: 6th May 2016. Records: 3488. Last Update: 1st May 2016.
Contents (tabs: Structure, Contents, Lineage): Employee Directory with columns Name, Band, Job Title.
Column: Band (tabs: Description, Characteristics, Lineage). Position reference number for non-exempt employees. The value ranges from 01 to 06 where 01 is the most senior and 06 is the most junior. Type: String. Classification: Public.
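
The data set description on this slide maps naturally onto a small record structure. The Python sketch below is illustrative only: the class, field and method names (`DataSetMetadata`, `ColumnMetadata`, `describe`) are assumptions made for this example and are not part of any open metadata API. The values are copied from the slide.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class ColumnMetadata:
    """Description of one column, as shown on the slide (hypothetical structure)."""
    name: str
    description: str
    data_type: str
    classification: str


@dataclass
class DataSetMetadata:
    """Description of a data set, as shown on the slide (hypothetical structure)."""
    name: str
    description: str
    owner: str
    confidence: str
    retention: str
    records: int
    columns: List[ColumnMetadata] = field(default_factory=list)

    def describe(self, column_name: str) -> str:
        """Return the recorded description of a column, or a warning if none exists."""
        for column in self.columns:
            if column.name == column_name:
                return column.description
        return f"No description recorded for column '{column_name}'"


employee_directory = DataSetMetadata(
    name="Employee Directory",
    description=(
        "Core attributes describing all employees of OCO pharmaceuticals "
        "created from a daily extract from Kenexa."
    ),
    owner="Penny Payer",
    confidence="Authoritative",
    retention="Indefinitely",
    records=3488,
    columns=[
        ColumnMetadata(
            name="Band",
            description=(
                "Position reference number for non-exempt employees, "
                "from 01 (most senior) to 06 (most junior)."
            ),
            data_type="String",
            classification="Public",
        )
    ],
)

print(employee_directory.describe("Band"))
print(employee_directory.describe("DateOfBirth"))
```

A lookup like `describe` is the kind of check that would have told Callie what a date column actually meant before she built a service on it.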

10 Curation. [Figure: sample employee records for Lemmie Stage (DataStage Expert), Callie Quartile (Data Scientist) and Tanya Tidie (Data Steward), each showing New York, 4, 27, Code St, Harlem NY. One person asks "I wonder what this means"; another answers "I know".]

11 Scared to share. Faith Broker (Business Team) has been doing some simple analysis on the HR data of the company. She wants to share this data with Callie Quartile (Data Scientist) to do some detailed work. However, she does not want Callie to see the sensitive personal information in the record. [Figure: the employee records for Lemmie Stage, Callie Quartile and Tanya Tidie are shown twice, once in full and once with the sensitive fields masked as XXXXX XXX.]
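
The masking shown on this slide can be driven by classifications held in metadata. The following Python sketch shows the idea under stated assumptions: the column names, the classification assigned to each column and the `mask_record` function are hypothetical, since the slide does not name the columns or the mechanism used.

```python
from typing import Dict, Set

# Hypothetical column classifications; the slide does not name the columns.
COLUMN_CLASSIFICATIONS: Dict[str, str] = {
    "name": "Public",
    "location": "Public",
    "job_title": "Public",
    "salary": "Sensitive",
    "home_address": "Sensitive",
}

MASK = "XXXXX"


def mask_record(
    record: Dict[str, str],
    classifications: Dict[str, str],
    allowed: Set[str],
) -> Dict[str, str]:
    """Return a copy of the record with columns outside the allowed classifications masked."""
    masked = {}
    for column, value in record.items():
        # Columns with no recorded classification are treated as sensitive.
        level = classifications.get(column, "Sensitive")
        masked[column] = value if level in allowed else MASK
    return masked


record = {
    "name": "Callie Quartile",
    "location": "New York",
    "job_title": "Data Scientist",
    "salary": "(example value)",
    "home_address": "(example value)",
}

# Faith shares the record with a consumer who may only see Public columns.
shared = mask_record(record, COLUMN_CLASSIFICATIONS, allowed={"Public"})
print(shared)
```

Treating unclassified columns as sensitive is a design choice in this sketch: it means a gap in the metadata hides data rather than exposing it.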

12 Why do we need metadata? Metadata enables data to be used outside of the application that created it: analytics and decision making; new business applications; reporting and compliance. Metadata describes the format and content of data, allowing people to judge which data set to use for a new project: structure; meaning; origin; valid values and quality; usage and ownership; regulations and classifications that apply; <more>. Metadata describes the business context and classification of data, allowing automated governance processes to operate.

13 Open Metadata and Governance: WHAT METADATA DO WE NEED?

14 Scope of metadata for a data driven organization: Governance; Glossary; Collaboration; Models and Reference Data; Lineage; Base Types, Systems and Infrastructure; Data Assets; Metadata Discovery

15 Open Metadata and Governance: TODAY'S REALITY

16 It is a complex world: Many systems; Different design; Different technology; Different quality; Different aims

17 Data-centric Organisations: Systems of Record; Mobile Apps; Systems of Engagement; Databases; Files; Applications; Data Lake; External Data; Systems of Insight

18 Today's reality

19 What needs to change? Open and Unified Metadata

20 Automatic metadata capture

21 Manual metadata capture

22 A new manifesto for metadata and governance: Metadata management must be automated. Metadata management must become ubiquitous. Metadata must become open and remotely accessible. Metadata should be used to drive the governance of data. The discovery, maintenance and use of metadata has to be an integral part of all tools that access, change and move information.

23 Open metadata management ecosystem. [Figure labels: Analytics Platform Metadata; Collaboration Space Metadata; Application Metadata; Hadoop Platform Metadata; Cloud SaaS platform Metadata.] Peer-to-peer network of repositories. Metadata stored and managed close to its source. Each repository/tool brings unique value. Open, extensible metadata structures for metadata exchange and federation, extending coverage of the types of resources that need to be described. Open source infrastructure sharing cost of development and maintenance between vendors. Support for open standards where available.

24 Open Metadata and Governance: BUILDING THE OPEN ECOSYSTEM

25 Apache Atlas. Apache Atlas has just graduated to become a top-level project. It began as an incubator open source project on 5th May 2015 to deliver an open source governance capability focused primarily on the Hadoop platform. Apache Atlas is designed to localize operational governance to the operating data platform such as Hadoop. At its heart is a type-agnostic metadata store that can be accessed through RESTful interfaces. We see Apache Atlas as the reference implementation for open metadata and governance, for vendors to pick up and use, or to test their integration against. Being open source allows all vendors to enrich and enhance the standard.

26 Apache Atlas today

27 Updates to Apache Atlas. Automation: capture of metadata from data platforms, data movement engines and data protection engines; exception management and stewardship. Business Value: specialized services for key data roles such as CDO, Data Scientist, Developer, DevOps Operator, Asset Owner, Applications. Connectivity: Metadata Highway offering open metadata exchange, linking and federation between heterogeneous metadata repositories.

28 Metadata type system

29 Open metadata meta-types, types and instances. [Figure labels: Meta-Type Model (enumerations, structs, entities, classifications, relationships, ...); Type Model; Instance Model; «entity» DataStore linked to «entity» DataSet by «relationship» PhysicalStoreForDataSet (ends "datastore" and "dataset", multiplicity * at each end); «entity» Database; «entity» DeployedDatabaseSchema; «entity» Server; «classification» DatabaseServer.]
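
The three layers named on this slide can be sketched in a few lines of Python. This is a minimal illustration, not the Apache Atlas type system: `MetaType`, `TypeDef`, `EntityInstance` and `validate` are names invented for this example, and only the type names (`DataStore`, `DataSet`, `PhysicalStoreForDataSet`, `Server`, `DatabaseServer`) come from the slide.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Dict, List


class MetaType(Enum):
    """Meta-type layer: the kinds of type that can be defined."""
    ENTITY = "entity"
    RELATIONSHIP = "relationship"
    CLASSIFICATION = "classification"


@dataclass(frozen=True)
class TypeDef:
    """Type layer: a named type built from one meta-type."""
    name: str
    meta_type: MetaType


@dataclass
class EntityInstance:
    """Instance layer: a concrete entity with optional classifications."""
    type_name: str
    properties: Dict[str, str]
    classifications: List[str] = field(default_factory=list)


# Type names taken from the slide; the registry itself is hypothetical.
TYPES: Dict[str, TypeDef] = {
    t.name: t
    for t in [
        TypeDef("DataStore", MetaType.ENTITY),
        TypeDef("DataSet", MetaType.ENTITY),
        TypeDef("PhysicalStoreForDataSet", MetaType.RELATIONSHIP),
        TypeDef("Server", MetaType.ENTITY),
        TypeDef("DatabaseServer", MetaType.CLASSIFICATION),
    ]
}


def validate(instance: EntityInstance, types: Dict[str, TypeDef]) -> List[str]:
    """Check an instance against the type layer and return any problems found."""
    errors = []
    type_def = types.get(instance.type_name)
    if type_def is None or type_def.meta_type is not MetaType.ENTITY:
        errors.append(f"'{instance.type_name}' is not a known entity type")
    for name in instance.classifications:
        classification = types.get(name)
        if classification is None or classification.meta_type is not MetaType.CLASSIFICATION:
            errors.append(f"'{name}' is not a known classification type")
    return errors


server = EntityInstance("Server", {"name": "example-server"}, ["DatabaseServer"])
print(validate(server, TYPES))
```

Because the store only needs to understand the meta-types, new types can be registered without changing the validation logic, which is one way to read "type-agnostic".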

30 Open metadata type model summary. [Figure: the open metadata type model drawn as areas numbered 0 to 7. Labels: Policy Metadata (Principles, Regulations, Standards, Approaches, Rule Specifications, Roles and Metrics); Governance Actions and Processes; Strategy; Campaigns and Projects; Implementation; Instrument; Rollout; Rights Management; Information Process Instrumentation (design lineage); Basic Types, Infrastructure and Systems; Classification Schemes; Classification; Association; Access; Subject Area Definition; Business Objects and Relationships, Taxonomies and Ontologies; Organization; Business Attributes; Mapping; Asset Collections (Sets, Typed Sets, Type Organized Sets); Information Views; Physical Asset Descriptions (Data stores, APIs, models and components); Connectors; Augmentation; Teaming Metadata (people profiles, communities, projects, notebooks, ...); Feedback Metadata (tags, comments, ratings, ...); Models and Schemas; Reference Data; Discovery Metadata (profile data, technical classification, data classification, data quality assessment, ...).]

31 More detail here

32 Metadata and governance digital platform: Reporting Platform; Governance Platform; Analytics Platform; Open Metadata and Governance; Data Platform; ETL Platform; Virtualization Platform

33 Types of tools that may integrate with an open metadata repository
BI and visualization tools: locating data assets and related information about them; defining reports and publishing their metadata; viewing lineage.
Data Science tool: wanting to find out about data assets available and manage user lineage of transformations and analytics models; may also manage metadata for analytics models.
API developer tool: wanting to understand proper data structures and data meaning to use for APIs, plus additional governance requirements that need to be implemented by the API because of the data it exchanges.
Counter-fraud tools: ad hoc analysis of logs and error reports; setting up rules.
Curator/owner tool: for managing the curation of assets, providing access, verifying use of assets, reviewing discovery results and exceptions, approving change requests.
Glossary tool: for subject matter experts and information architects to share expertise about a particular subject area; may also define structures and related reference data.
Enterprise architect tools: defining the data landscape and related systems.
DevOps tools: conformance to policies and standards in development; metadata capture at deployment; validation of deployment platform requirements.
Data integration engine: locating appropriate data and component assets; logging design lineage; logging operational lineage.
Information Virtualisation tools: locating appropriate data assets; building views and publishing them; adding design lineage; logging operational lineage.
Governance tools: setting up and monitoring governance program, data quality.
Stewardship tools: reviewing assigned exceptions, making data changes and requesting approval.
Information security tools: setting up data access policies and enforcement.
Auditor tools: viewing compliance reports and validating policies and policy implementations.

34 Open Metadata Access Services: Asset Owner; Subject Area; Information View; Asset Catalog; Community Profile; Stewardship; Project Management; Governance Program; Data Science; Information Process; Developer; Information Protection; Information Landscape; Governance Engine; DevOps; Information Infrastructure; Data Platform; Connected Asset; Asset Consumer; Discovery

35 Inside the server: Open Metadata and Governance (OMAG) Server. [Figure labels: OMAS REST APIs and Topics; Server Configuration; Open Metadata Access Services (OMAS); OMAG Administration REST APIs; Open Metadata Repository Services (OMRS); OMRS Repository REST APIs; OMRS AuditLog Connector; OMRS Archive Connector; OMRS Cohort Registry Store Connector; OMRS Topic Connector; OMRS Repository Connector; OMRS Event Mapper Connector.]

36 Inside the server: Open Metadata and Governance (OMAG) Server. [Figure labels: OMAS REST APIs and Topics; Server Configuration; Open Metadata Access Services (OMAS); OMAG Administration REST APIs; Administration; Cohort Services; Enterprise Repository Services; Local Repository Services; OMRS Repository REST APIs; OMRS AuditLog Connector; OMRS Archive Connector; OMRS Cohort Registry Store Connector; OMRS Topic Connector; OMRS Repository Connector; OMRS Event Mapper Connector.]

37 Simple cohort: Data Lake; Cohort A; Chief Data Office; Systems of Record

38 Multiple cohorts: Marketing Data Lake; Data Lake; Cohort A; Cohort B; Chief Data Office; Systems of Record; Mobile Apps; Systems of Record

39 Integration patterns

40 Caller Pattern: A metadata tool or engine can access the consumer-specific APIs to work with metadata. The Access Layer handles the calls to metadata repositories connected to the metadata highway.
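
The caller pattern can be sketched as a thin access layer that hides how many repositories sit behind it. The Python below is a minimal illustration under stated assumptions: `MetadataRepository`, `InMemoryRepository`, `AssetCatalogAccess` and their methods are hypothetical names, not the open metadata access service APIs, and the repositories are in-memory stand-ins.

```python
from typing import Dict, List, Protocol


class MetadataRepository(Protocol):
    """Anything that can answer an asset search (hypothetical interface)."""

    def find_assets(self, search: str) -> List[Dict[str, str]]:
        ...


class InMemoryRepository:
    """Stand-in for a metadata repository connected to the metadata highway."""

    def __init__(self, name: str, assets: List[Dict[str, str]]) -> None:
        self.name = name
        self.assets = assets

    def find_assets(self, search: str) -> List[Dict[str, str]]:
        # Case-insensitive substring match on the asset name.
        return [
            dict(asset, repository=self.name)
            for asset in self.assets
            if search.lower() in asset["name"].lower()
        ]


class AssetCatalogAccess:
    """Consumer-specific API: the tool calls this and never sees the repositories."""

    def __init__(self, repositories: List[MetadataRepository]) -> None:
        self.repositories = repositories

    def search(self, text: str) -> List[Dict[str, str]]:
        results: List[Dict[str, str]] = []
        for repository in self.repositories:
            results.extend(repository.find_assets(text))
        return results


data_lake = InMemoryRepository(
    "Data Lake", [{"name": "Employee Directory"}, {"name": "Birth Registrations"}]
)
systems_of_record = InMemoryRepository(
    "Systems of Record", [{"name": "Employee Payroll"}]
)

catalog = AssetCatalogAccess([data_lake, systems_of_record])
results = catalog.search("employee")
print(results)
```

The calling tool depends only on `AssetCatalogAccess.search`, so repositories can be added or removed without changing the tool.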

41 Adapter Pattern: Simple components plug into a repository proxy to connect in an existing metadata repository.
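
The adapter pattern can be pictured as a proxy that exposes a common interface and delegates to a pluggable connector, which translates the existing repository's native format. The Python sketch below is an assumption-laden illustration: `LegacyCatalog`, `RepositoryConnector`, `LegacyCatalogConnector`, `RepositoryProxy` and the field names are all hypothetical and do not reproduce the actual OMRS connector interfaces.

```python
from abc import ABC, abstractmethod
from typing import Dict, Optional


class LegacyCatalog:
    """Hypothetical existing metadata repository with its own native format."""

    def __init__(self) -> None:
        self._rows = {"T1": {"TBL_NM": "EMPLOYEE_DIRECTORY", "OWNR": "Penny Payer"}}

    def lookup(self, key: str) -> Optional[Dict[str, str]]:
        return self._rows.get(key)


class RepositoryConnector(ABC):
    """Common interface the proxy expects from every plugged-in component."""

    @abstractmethod
    def get_entity(self, guid: str) -> Optional[Dict[str, str]]:
        ...


class LegacyCatalogConnector(RepositoryConnector):
    """Adapter: translates native rows into the common entity shape."""

    def __init__(self, catalog: LegacyCatalog) -> None:
        self.catalog = catalog

    def get_entity(self, guid: str) -> Optional[Dict[str, str]]:
        row = self.catalog.lookup(guid)
        if row is None:
            return None
        # Mapping every row to a "DataSet" is an assumption of this sketch.
        return {
            "guid": guid,
            "type": "DataSet",
            "name": row["TBL_NM"],
            "owner": row["OWNR"],
        }


class RepositoryProxy:
    """Proxy that joins the open metadata network on behalf of the existing repository."""

    def __init__(self, connector: RepositoryConnector) -> None:
        self.connector = connector

    def get_entity(self, guid: str) -> Optional[Dict[str, str]]:
        return self.connector.get_entity(guid)


proxy = RepositoryProxy(LegacyCatalogConnector(LegacyCatalog()))
print(proxy.get_entity("T1"))
```

Only the connector knows the native format, so supporting another existing repository means writing another connector rather than another proxy.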

42 Native Pattern: Native implementation of the open metadata governance APIs. Apache Atlas is a native implementation of the open metadata and governance APIs.

43 First server

44 Establishing contact

45 Federated queries

46 Caching metadata for availability and performance

47 @ODPiOrg ODPI - CO-CREATION WITH PRACTITIONERS: Compliance assistance and certification for vendors. Subject matter experts sharing best practices and co-creating content packs.

48 Call to action: how can you help? Direct code contribution to the Apache Atlas project. There are many features that still need to be coded. Collaboration through the ODPI Data Governance PMC. Encouraging your customers/partners and projects internal to your organization to embrace Apache Atlas and its standards to grow the ecosystem of data and processing that is assured by metadata and governance capability.

50 Questions?


More information

Canada Life Cyber Security Statement 2018

Canada Life Cyber Security Statement 2018 Canada Life Cyber Security Statement 2018 Governance Canada Life has implemented an Information Security framework which supports standards designed to establish a system of internal controls and accountability

More information

The Emerging Data Lake IT Strategy

The Emerging Data Lake IT Strategy The Emerging Data Lake IT Strategy An Evolving Approach for Dealing with Big Data & Changing Environments bit.ly/datalake SPEAKERS: Thomas Kelly, Practice Director Cognizant Technology Solutions Sean Martin,

More information

Transforming IT: From Silos To Services

Transforming IT: From Silos To Services Transforming IT: From Silos To Services Chuck Hollis Global Marketing CTO EMC Corporation http://chucksblog.emc.com @chuckhollis IT is being transformed. Our world is changing fast New Technologies New

More information

IBM Software IBM InfoSphere Information Server for Data Quality

IBM Software IBM InfoSphere Information Server for Data Quality IBM InfoSphere Information Server for Data Quality A component index Table of contents 3 6 9 9 InfoSphere QualityStage 10 InfoSphere Information Analyzer 12 InfoSphere Discovery 13 14 2 Do you have confidence

More information

INTELLIGENCE DRIVEN GRC FOR SECURITY

INTELLIGENCE DRIVEN GRC FOR SECURITY INTELLIGENCE DRIVEN GRC FOR SECURITY OVERVIEW Organizations today strive to keep their business and technology infrastructure organized, controllable, and understandable, not only to have the ability to

More information

The Modeling and Simulation Catalog for Discovery, Knowledge, and Reuse

The Modeling and Simulation Catalog for Discovery, Knowledge, and Reuse The Modeling and Simulation Catalog for Discovery, Knowledge, and Reuse Stephen Hunt OSD CAPE Joint Data Support (SAIC) Stephen.Hunt.ctr@osd.mil The DoD Office of Security Review has cleared this report

More information

Fujitsu World Tour 2018

Fujitsu World Tour 2018 Fujitsu World Tour 2018 May 30, 2018 #FujitsuWorldTour 1 Copyright 2018 FUJITSU Security and Privacy of Big Data A NIST Perspective Arnab Roy Fujitsu Laboratories of America Co-Chair, NIST Big Data WG:

More information

Advanced Solutions of Microsoft SharePoint Server 2013 Course Contact Hours

Advanced Solutions of Microsoft SharePoint Server 2013 Course Contact Hours Advanced Solutions of Microsoft SharePoint Server 2013 Course 20332 36 Contact Hours Course Overview This course examines how to plan, configure, and manage a Microsoft SharePoint Server 2013 environment.

More information

Advanced Solutions of Microsoft SharePoint 2013

Advanced Solutions of Microsoft SharePoint 2013 Course 20332A :Advanced Solutions of Microsoft SharePoint 2013 Page 1 of 9 Advanced Solutions of Microsoft SharePoint 2013 Course 20332A: 4 days; Instructor-Led About the Course This four-day course examines

More information

Metadata Management as a Key Component to Data Governance, Data Stewardship, and Data Quality Management. Wednesday, July 20 th 2016

Metadata Management as a Key Component to Data Governance, Data Stewardship, and Data Quality Management. Wednesday, July 20 th 2016 Metadata Management as a Key Component to Data Governance, Data Stewardship, and Data Quality Management Wednesday, July 20 th 2016 Confidential, Datasource Consulting, LLC 2 Multi-Domain Master Data Management

More information

April 17, Ronald Layne Manager, Data Quality and Data Governance

April 17, Ronald Layne Manager, Data Quality and Data Governance Ensuring the highest quality data is delivered throughout the university providing valuable information serving individual and organizational need April 17, 2015 Ronald Layne Manager, Data Quality and

More information

Advanced Solutions of Microsoft SharePoint Server 2013

Advanced Solutions of Microsoft SharePoint Server 2013 Course Duration: 4 Days + 1 day Self Study Course Pre-requisites: Before attending this course, students must have: Completed Course 20331: Core Solutions of Microsoft SharePoint Server 2013, successful

More information

Improving Data Governance in Your Organization. Faire Co Regional Manger, Information Management Software, ASEAN

Improving Data Governance in Your Organization. Faire Co Regional Manger, Information Management Software, ASEAN Improving Data Governance in Your Organization Faire Co Regional Manger, Information Management Software, ASEAN Topics The Innovation Imperative and Innovating with Information What Is Data Governance?

More information

Multicloud is the New Normal Cloud enables Digital Transformation (DX), but more clouds bring more challenges

Multicloud is the New Normal Cloud enables Digital Transformation (DX), but more clouds bring more challenges Multicloud is the New Normal Cloud enables Digital Transformation (DX), but more clouds bring more challenges An IDC InfoBrief, Sponsored by Cisco March 2018 EXECUTIVE SUMMARY Cloud is a key enabler of

More information

Best Practices in Enterprise Data Governance

Best Practices in Enterprise Data Governance Best Practices in Enterprise Data Governance Scott Gidley and Nancy Rausch, SAS WHITE PAPER SAS White Paper Table of Contents Introduction.... 1 Data Governance Use Case and Challenges.... 1 Collaboration

More information

EUROPEAN ICT PROFESSIONAL ROLE PROFILES VERSION 2 CWA 16458:2018 LOGFILE

EUROPEAN ICT PROFESSIONAL ROLE PROFILES VERSION 2 CWA 16458:2018 LOGFILE EUROPEAN ICT PROFESSIONAL ROLE PROFILES VERSION 2 CWA 16458:2018 LOGFILE Overview all ICT Profile changes in title, summary, mission and from version 1 to version 2 Versions Version 1 Version 2 Role Profile

More information

Microsoft Core Solutions of Microsoft SharePoint Server 2013

Microsoft Core Solutions of Microsoft SharePoint Server 2013 1800 ULEARN (853 276) www.ddls.com.au Microsoft 20331 - Core Solutions of Microsoft SharePoint Server 2013 Length 5 days Price $4290.00 (inc GST) Version B Overview This course will provide you with the

More information

Applied Data Governance - Part 3

Applied Data Governance - Part 3 Applied Data Governance - Part 3 Day in the Life of a Reference Data Steward Jesse Lambert and Jack Spivak, TopQuadrant Inc. May 17, 2018 Today s Program 1. Introduction: Benefits of Managing Reference

More information

XML Documentation for Adobe Experience Manager

XML Documentation for Adobe Experience Manager XML Documentation for Adobe Experience Manager Solution brief XML Documentation for Adobe Experience Manager An enterprise-class CCMS to manage documentation from creation to delivery It s a component

More information

First Look Showcase. Expanding our prevention, detection and response solutions. Sumedh Thakar Chief Product Officer, Qualys, Inc.

First Look Showcase. Expanding our prevention, detection and response solutions. Sumedh Thakar Chief Product Officer, Qualys, Inc. 18 QUALYS SECURITY CONFERENCE 2018 First Look Showcase Expanding our prevention, detection and response solutions Sumedh Thakar Chief Product Officer, Qualys, Inc. Secure Enterprise Mobility Identity (X.509,

More information

PROCEDURE POLICY DEFINITIONS AD DATA GOVERNANCE PROCEDURE. Administration (AD) APPROVED: President and CEO

PROCEDURE POLICY DEFINITIONS AD DATA GOVERNANCE PROCEDURE. Administration (AD) APPROVED: President and CEO Section: Subject: Administration (AD) Data Governance AD.3.3.1 DATA GOVERNANCE PROCEDURE Legislation: Alberta Evidence Act (RSA 2000 ca-18); Copyright Act, R.S.C., 1985, c.c-42; Electronic Transactions

More information

CLOUD SECURITY CRASH COURSE

CLOUD SECURITY CRASH COURSE CLOUD SECURITY CRASH COURSE ADDRESSING REAL WORLD CONCERNS Joel Friedman, CTSO ABOUT ME Name: Joel Friedman Title: Chief Technology & Security Officer of Datapipe Certifications: CISSP, CISA, CISM, CRISC,

More information

FIVE BEST PRACTICES FOR ENSURING A SUCCESSFUL SQL SERVER MIGRATION

FIVE BEST PRACTICES FOR ENSURING A SUCCESSFUL SQL SERVER MIGRATION FIVE BEST PRACTICES FOR ENSURING A SUCCESSFUL SQL SERVER MIGRATION The process of planning and executing SQL Server migrations can be complex and risk-prone. This is a case where the right approach and

More information

How to Become a DATA GOVERNANCE EXPERT

How to Become a DATA GOVERNANCE EXPERT How to Become a DATA GOVERNANCE EXPERT You re already a data expert. You ve been working with enterprise data for years. You ve seen the good, the bad, and the downright ugly. And you ve watched the business

More information

IOTA ARCHITECTURE: DATA VIRTUALIZATION AND PROCESSING MEDIUM DR. KONSTANTIN BOUDNIK DR. ALEXANDRE BOUDNIK

IOTA ARCHITECTURE: DATA VIRTUALIZATION AND PROCESSING MEDIUM DR. KONSTANTIN BOUDNIK DR. ALEXANDRE BOUDNIK IOTA ARCHITECTURE: DATA VIRTUALIZATION AND PROCESSING MEDIUM DR. KONSTANTIN BOUDNIK DR. ALEXANDRE BOUDNIK DR. KONSTANTIN BOUDNIK DR.KONSTANTIN BOUDNIK EPAM SYSTEMS CHIEF TECHNOLOGIST BIGDATA, OPEN SOURCE

More information

BEYOND AUTHENTICATION IDENTITY AND ACCESS MANAGEMENT FOR THE MODERN ENTERPRISE

BEYOND AUTHENTICATION IDENTITY AND ACCESS MANAGEMENT FOR THE MODERN ENTERPRISE BEYOND AUTHENTICATION IDENTITY AND ACCESS MANAGEMENT FOR THE MODERN ENTERPRISE OUR ORGANISATION AND SPECIALIST SKILLS Focused on delivery, integration and managed services around Identity and Access Management.

More information

Fast Innovation requires Fast IT

Fast Innovation requires Fast IT Fast Innovation requires Fast IT Cisco Data Virtualization Puneet Kumar Bhugra Business Solutions Manager 1 Challenge In Data, Big Data & Analytics Siloed, Multiple Sources Business Outcomes Business Opportunity:

More information

Ten Innovative Financial Services Applications Powered by Data Virtualization

Ten Innovative Financial Services Applications Powered by Data Virtualization Ten Innovative Financial Services Applications Powered by Data Virtualization DATA IS THE NEW ALPHA In an industry driven to deliver alpha, where might financial services firms find opportunities when

More information

CONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED DATA PLATFORM

CONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED DATA PLATFORM CONSOLIDATING RISK MANAGEMENT AND REGULATORY COMPLIANCE APPLICATIONS USING A UNIFIED PLATFORM Executive Summary Financial institutions have implemented and continue to implement many disparate applications

More information

DigitalPersona Altus. Solution Guide

DigitalPersona Altus. Solution Guide DigitalPersona Altus Solution Guide Contents DigitalPersona... 1 DigitalPersona Altus Solution... 4 MODULAR SOLUTION CREATE-CONFIRM-CONTROL... 4 EXPERT SERVICES ASSESS-DESIGN-DEPLOY-SUPPORT... 5 DigitalPersona

More information

The Need for Big Data Governance

The Need for Big Data Governance The Need for Big Data Governance A Whitepaper By Collibra and MapR Collibra Inc 25 Broadway, 9th Floor New York, NY 10004 USA ( t ) +1 646 963 6513 Contact@collibra.com MapR Technologies 350 Holger Way

More information

NCHRP Project 20-97: Improving Findability and Relevance of Transportation Information. Part I: Project Overview Gordon Kennedy, Washington State DOT

NCHRP Project 20-97: Improving Findability and Relevance of Transportation Information. Part I: Project Overview Gordon Kennedy, Washington State DOT NCHRP Project 20-97: Improving Findability and Relevance of Transportation Information Part I: Project Overview Gordon Kennedy, Washington State DOT November, 2017 NCHRP is a State-Driven Program Sponsored

More information

Micro Focus Partner Program. For Resellers

Micro Focus Partner Program. For Resellers Micro Focus Partner Program For Resellers Contents Micro Focus Today About Micro Focus Our solutions for digital transformation Products and Solutions Program Membership Tiers Become a Portfolio Expert

More information

Nomination Form Architecture Excellence Awards 2018

Nomination Form Architecture Excellence Awards 2018 Nomination Form Architecture Excellence Awards 2018 Presented by: [Presenter's name] Company Overview 1. Name of the Company: 2. Name of the CEO (Chief Executive Officer): 3. Name of the CIO (Chief Information

More information

From business need to implementation Design the right information solution

From business need to implementation Design the right information solution From business need to implementation Design the right information solution Davor Gornik (dgornik@us.ibm.com) Product Manager Agenda Relational design Integration design Summary Relational design Data modeling

More information

Developing a Research Data Policy

Developing a Research Data Policy Developing a Research Data Policy Core Elements of the Content of a Research Data Management Policy This document may be useful for defining research data, explaining what RDM is, illustrating workflows,

More information

Enabling efficiency through Data Governance: a phased approach

Enabling efficiency through Data Governance: a phased approach Enabling efficiency through Data Governance: a phased approach Transform your process efficiency, decision-making, and customer engagement by improving data accuracy An Experian white paper Enabling efficiency

More information

Going cloud-native with Kubernetes and Pivotal

Going cloud-native with Kubernetes and Pivotal Going cloud-native with Kubernetes and Pivotal A guide to Pivotal Container Service (PKS) by role Fast, low-risk enterprise-grade Kubernetes has arrived With Pivotal Container Service (PKS), organizations

More information

Securing Digital Transformation

Securing Digital Transformation September 4, 2017 Securing Digital Transformation DXC Security Andreas Wuchner, CTO Security Innovation Risk surface is evolving and increasingly complex The adversary is highly innovative and sophisticated

More information

First Look Showcase. Expanding our prevention, detection and response solutions. Marco Rottigni Chief Technical Security Officer, Qualys, Inc.

First Look Showcase. Expanding our prevention, detection and response solutions. Marco Rottigni Chief Technical Security Officer, Qualys, Inc. 18 QUALYS SECURITY CONFERENCE 2018 First Look Showcase Expanding our prevention, detection and response solutions Marco Rottigni Chief Technical Security Officer, Qualys, Inc. Secure Enterprise Mobility

More information

The Rules of Subsurface Analytics Jane McConnell, Practice Partner Oil and Gas, Teradata DEJ KL, 4 October 2017

The Rules of Subsurface Analytics Jane McConnell, Practice Partner Oil and Gas, Teradata DEJ KL, 4 October 2017 The Rules of Subsurface Analytics Jane McConnell, Practice Partner Oil and Gas, Teradata DEJ KL, 4 October 2017 Agenda Why subsurface analytics is different The Rules Rule 1: Right People Rule 2: Right

More information

To the Designer Where We Need Your Help

To the Designer Where We Need Your Help To the Designer Where We Need Your Help Slide 7 Can you provide a similar high-res image? Slide 15 Can you polish up the content so it s not an eye chart? Slide 21, 22, 23 Can you polish up the content

More information