GOVERNING HADOOP (AND THE DATA LAKE)

Size: px
Start display at page:

Download "GOVERNING HADOOP (AND THE DATA LAKE)"

Transcription

1 GOVERNING HADOOP (AND THE DATA LAKE) DAMA-RMC Discussion Lowell W. Fryman, CBIP-CDMP Practice Principle April 20, Collibra Inc

2 DAMA-RMC Discussion Agenda Do we need to Govern Hadoop Data Lakes Why, where is the Value & Risk What will be our Governance processes Where do I start What Communications & Education When are we done 2 Collibra 2017

3 Why do we build an Hadoop architecture Seems we come down to 3 reasons 1. IT needs to implement less costly infrastructure Hadoop is cheap 2. Variety of data exceeds capacity of our resource capabilities (Social media, website, other external data, internal application variety) 3. We can integrate that additional data Capturing the data is not really the issue, Using the data is the issue, So who is our audience, Where is the Risk and Value proposition It is with our Consumers of the data Should we pay the Data Scientist to do Governance and data wrangling 3 Collibra 2017

4 The Approach Bring all data in is the basic mantra of a Data Lake The challenge is that your Data Lake will slowly turn into a Data swamp if data governance is not in place In broad terms, data lakes are marketed as enterprise-wide data management platforms for analyzing disparate sources of data in its native format. The idea is simple: instead of placing data in a purpose-built data store, you move it into a data lake in its original format. This eliminates the upfront costs of data ingestion, like transformation. Once data is placed into the lake, it's available for analysis by everyone in the organization. Nick Heudecker, research director at Gartner 4 Collibra 2017

5 What we want to prevent Unsecured Data Lake - no clear roles - no access controls - will become a dump yard to fulfill individual needs of the users - multiple copies of the same data - Users won t easily be able to find trustable data 5 Collibra 2017

6 Data Scientists Can t Do Their Jobs Accessing data is difficult and time-consuming Time to impact is slow all the wrangling to do Identifying ownership across on-premises, cloud, and external solutions is challenging Fostering collaboration and reuse is challenging Multiple data sources are in use; some are outdated Many variants and versions of the same data exist The same Consumer questions exist Where is the data integration! 6 Collibra 2017

7 They Spend Most of Their Time on the Least Enjoyable Tasks Data scientists spend 60% of their time cleaning and organizing data But 57% of data scientists view this as the least enjoyable part of their job Source: CrowdFlower via Forbes 7 Collibra 2016

8 From TDWI 8 Collibra 2017

9 Organizations need to reduce the time to find data they can trust BI analytics & Big Data is the top priority for 78% of CFOs. Their needs are: more accurate reports save time and money need for company big picture reporting Create an effective environment to share relevant information Despite spending $2.6bn per year, CFOs report that most BI analytics projects are: too tactical focus too much on technology do not address the fundamental issue of data quality and consistency Governance reduces the time spent searching for data by as much as 40% Data Lineage and Quality measures are required to enhance the TRUST 9 Collibra 2017

10 What we want Secured Data Lake - Marked data (data categories) - Classification, Taxonomy - Controls and measures based as well as lineage - Scorecards indicating quality, controls & reliability - Accountability and responsibilities 10 Collibra 2017

11 So how do we Govern the Data Lake as needed Identify critical data elements (CDE) Ensure we have Business Terms for the CDE Maintain the Business Glossary & Catalog for the CDE What does this mean, values & rules, security, constraints for usage Communications between Owner and Consumer are necessary to support governance as you need it Data Integration is still necessary Maintain a Data Dictionary of the data sets in the Lake What is available and from which data set What is the integration & Lineage What is the data quality and controls 11 Collibra 2017

12 Architecture with Governance Raw Data Lake Integrated Clean Lake Liberated Self Service Governance As you need it Foundation Business Terms (Stewardship, Rules, Security, Authoritative Source, Data Quality) Lineage & Traceability (data flow, transparency) Integration (Technical & Business metadata) Data Catalog (Assets for Governance, Data, Reporting) Data Dictionary (technical metadata alignment) Access Controls and knowledge 12 Collibra 2017

13 Collibra Community Discussion Summary Govern the critical data needed by Data Scientists May require a different DG approach(as you need, know of it) Define critical Business Terms by the Variety of the data sourced into the Lake Technology dependent data integration techniques exist and integration is important for the Consumer Ask the Data Consumers (Scientists) what are you looking for Create a clean Lake for Consumers/Scientists Data Governance is a program, never done Stay calm & allow your data to prosper 13 Collibra 2017

14 QUESTIONS?

15 THANK YOU 2017 Collibra Inc

The Data Catalog The Key to Managing Data, Big and Small. April Reeve May

The Data Catalog The Key to Managing Data, Big and Small. April Reeve May The Data Catalog The Key to Managing Data, Big and Small April Reeve May 18 2017 April Reeve Thirty years doing data oriented stuff Data Management disciplines Data Integration, Data Governance, Data Modeling,

More information

Metadata and the Rise of Big Data Governance: Active Open Source Initiatives. October 23, 2018

Metadata and the Rise of Big Data Governance: Active Open Source Initiatives. October 23, 2018 Metadata and the Rise of Big Data Governance: Active Open Source Initiatives October 23, 2018 Today s speakers John Mertic, Director of Program Management, Linux Foundation David Radley, ODPi Egeria maintainer,

More information

Data Governance: Data Usage Labeling and Enforcement in Adobe Cloud Platform

Data Governance: Data Usage Labeling and Enforcement in Adobe Cloud Platform Data Governance: Data Usage Labeling and Enforcement in Adobe Cloud Platform Contents What is data governance? Why data governance? Data governance roles. The Adobe Cloud Platform advantage. A framework

More information

MAPR DATA GOVERNANCE WITHOUT COMPROMISE

MAPR DATA GOVERNANCE WITHOUT COMPROMISE MAPR TECHNOLOGIES, INC. WHITE PAPER JANUARY 2018 MAPR DATA GOVERNANCE TABLE OF CONTENTS EXECUTIVE SUMMARY 3 BACKGROUND 4 MAPR DATA GOVERNANCE 5 CONCLUSION 7 EXECUTIVE SUMMARY The MapR DataOps Governance

More information

Data Governance Data Usage Labeling and Enforcement in Adobe Experience Platform

Data Governance Data Usage Labeling and Enforcement in Adobe Experience Platform Contents What is data governance? Why data governance? Data governance roles The Adobe Experience Platform advantage A framework for data governance Data usage patterns Data governance in action Conclusion

More information

From Single Purpose to Multi Purpose Data Lakes. Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019

From Single Purpose to Multi Purpose Data Lakes. Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019 From Single Purpose to Multi Purpose Data Lakes Thomas Niewel Technical Sales Director DACH Denodo Technologies March, 2019 Agenda Data Lakes Multiple Purpose Data Lakes Customer Example Demo Takeaways

More information

The Value of Data Modeling for the Data-Driven Enterprise

The Value of Data Modeling for the Data-Driven Enterprise Solution Brief: erwin Data Modeler (DM) The Value of Data Modeling for the Data-Driven Enterprise Designing, documenting, standardizing and aligning any data from anywhere produces an enterprise data model

More information

Solving the Enterprise Data Dilemma

Solving the Enterprise Data Dilemma Solving the Enterprise Data Dilemma Harmonizing Data Management and Data Governance to Accelerate Actionable Insights Learn More at erwin.com Is Our Company Realizing Value from Our Data? If your business

More information

Metadata Management as a Key Component to Data Governance, Data Stewardship, and Data Quality Management. Wednesday, July 20 th 2016

Metadata Management as a Key Component to Data Governance, Data Stewardship, and Data Quality Management. Wednesday, July 20 th 2016 Metadata Management as a Key Component to Data Governance, Data Stewardship, and Data Quality Management Wednesday, July 20 th 2016 Confidential, Datasource Consulting, LLC 2 Multi-Domain Master Data Management

More information

April 17, Ronald Layne Manager, Data Quality and Data Governance

April 17, Ronald Layne Manager, Data Quality and Data Governance Ensuring the highest quality data is delivered throughout the university providing valuable information serving individual and organizational need April 17, 2015 Ronald Layne Manager, Data Quality and

More information

Improving Your Business with Oracle Data Integration See How Oracle Enterprise Metadata Management Can Help You

Improving Your Business with Oracle Data Integration See How Oracle Enterprise Metadata Management Can Help You Improving Your Business with Oracle Data Integration See How Oracle Enterprise Metadata Management Can Help You Özgür Yiğit Oracle Data Integration, Senior Manager, ECEMEA Safe Harbor Statement The following

More information

Data Governance Industrial Internet & Big Data

Data Governance Industrial Internet & Big Data Data Governance Kari Hiekkanen 29.3.2018 CS-E5340 Introduction to Industrial Internet Industrial Internet & Big Data (IDC Data Age 2025, April 2017) 1 Industrial Internet & Big Data (Statista, 2017) Data

More information

DATA GOVERNANCE LEADS TO DATA QUALITY

DATA GOVERNANCE LEADS TO DATA QUALITY DATA GOVERNANCE LEADS TO DATA QUALITY Trending. Kash Mehdi Senior Product Specialist and Instructor May 3, 2017 1 Collibra 2017 2017 Collibra Inc How Many of Your Reports Have Good Data Quality? What would

More information

ODPi and Data Governance Free Your MetaData! October 10, 2018

ODPi and Data Governance Free Your MetaData! October 10, 2018 ODPi and Data Governance Free Your MetaData! October 10, 2018 Today s reality @ODPiOrg Imagine An enterprise data catalogue that lists all of your data, where it is located, its origin (lineage), owner,

More information

Smart Data Catalog DATASHEET

Smart Data Catalog DATASHEET DATASHEET Smart Data Catalog There is so much data distributed across organizations that data and business professionals don t know what data is available or valuable. When it s time to create a new report

More information

TDWI Data Governance Fundamentals: Managing Data as an Asset

TDWI Data Governance Fundamentals: Managing Data as an Asset TDWI Data Governance Fundamentals: Managing Data as an Asset Training Details Training Time : 1 Day Capacity : 10 Prerequisites : There are no prerequisites for this course. About Training About Training

More information

Good analytics needs good data and that needs good metadata

Good analytics needs good data and that needs good metadata Good analytics needs good data and that needs good metadata 28 th February 2018 Mandy Chessell CBE FREng CEng FBCS Distinguished Engineer, Master Inventor Analytics Chief Data Office mandy_chessell@uk.ibm.com

More information

The Need for Big Data Governance

The Need for Big Data Governance The Need for Big Data Governance A Whitepaper By Collibra and MapR Collibra Inc 25 Broadway, 9th Floor New York, NY 10004 USA ( t ) +1 646 963 6513 Contact@collibra.com MapR Technologies 350 Holger Way

More information

Modern Data Warehouse The New Approach to Azure BI

Modern Data Warehouse The New Approach to Azure BI Modern Data Warehouse The New Approach to Azure BI History On-Premise SQL Server Big Data Solutions Technical Barriers Modern Analytics Platform On-Premise SQL Server Big Data Solutions Modern Analytics

More information

AVOIDING SILOED DATA AND SILOED DATA MANAGEMENT

AVOIDING SILOED DATA AND SILOED DATA MANAGEMENT AVOIDING SILOED DATA AND SILOED DATA MANAGEMENT Dalton Cervo Author, Consultant, Data Management Expert March 2016 This presentation contains extracts from books that are: Copyright 2011 John Wiley & Sons,

More information

WHITE PAPER: TOP 10 CAPABILITIES TO LOOK FOR IN A DATA CATALOG

WHITE PAPER: TOP 10 CAPABILITIES TO LOOK FOR IN A DATA CATALOG WHITE PAPER: TOP 10 CAPABILITIES TO LOOK FOR IN A DATA CATALOG The #1 Challenge in Successfully Deploying a Data Catalog The data cataloging space is relatively new. As a result, many organizations don

More information

Virtuoso Infotech Pvt. Ltd.

Virtuoso Infotech Pvt. Ltd. Virtuoso Infotech Pvt. Ltd. About Virtuoso Infotech Fastest growing IT firm; Offers the flexibility of a small firm and robustness of over 30 years experience collectively within the leadership team Technology

More information

Informatica Enterprise Information Catalog

Informatica Enterprise Information Catalog Data Sheet Informatica Enterprise Information Catalog Benefits Automatically catalog and classify all types of data across the enterprise using an AI-powered catalog Identify domains and entities with

More information

The Value of Data Governance for the Data-Driven Enterprise

The Value of Data Governance for the Data-Driven Enterprise Solution Brief: erwin Data governance (DG) The Value of Data Governance for the Data-Driven Enterprise Prepare for Data Governance 2.0 by bringing business teams into the effort to drive data opportunities

More information

Oracle Data Integration

Oracle Data Integration Oracle Data Integration The Essential Core of Data Governance with Oracle Enterprise Data Quality CON9539 Martin Boyd Senior Director Product Strategy, Oracle Brian Kleber Director Enterprise Data Management,

More information

I CAN T FIND THE #$%& DATA. Why You Need a Data Catalog

I CAN T FIND THE #$%& DATA. Why You Need a Data Catalog I CAN T FIND THE #$%& DATA Why You Need a Data Catalog Data is everywhere It s embedded in our social media, streaming across the Internet of Things, and stored in the cloud. The volume of data available

More information

Data Governance for the Connected Enterprise

Data Governance for the Connected Enterprise Data Governance for the Connected Enterprise Irene Polikoff and Jack Spivak, TopQuadrant Inc. November 3, 2016 Copyright 2016 TopQuadrant Inc. Slide 1 Data Governance for the Connected Enterprise Today

More information

The future of Subsurface Data Management? Building a Data Science Lab Data Lake Jane McConnell, Practice Partner Oil and Gas, Teradata DEJ KL, 3

The future of Subsurface Data Management? Building a Data Science Lab Data Lake Jane McConnell, Practice Partner Oil and Gas, Teradata DEJ KL, 3 The future of Subsurface Data Management? Building a Data Science Lab Data Lake Jane McConnell, Practice Partner Oil and Gas, Teradata DEJ KL, 3 October 2017 Analytics and AI is gaining ground in Subsurface

More information

55238 SharePoint Online for Administrators. Module 1: Introduction to Office 365 and SharePoint Online

55238 SharePoint Online for Administrators. Module 1: Introduction to Office 365 and SharePoint Online 55238 for Administrators 55238 For Administrators Duration: 3 Days Overview: This course will introduce the audience to Administration in Office 365 and explain and demonstrate the configuration options

More information

Advanced Solutions of Microsoft SharePoint Server 2013 Course Contact Hours

Advanced Solutions of Microsoft SharePoint Server 2013 Course Contact Hours Advanced Solutions of Microsoft SharePoint Server 2013 Course 20332 36 Contact Hours Course Overview This course examines how to plan, configure, and manage a Microsoft SharePoint Server 2013 environment.

More information

The Role of Metadata in a Data Governance Strategy

The Role of Metadata in a Data Governance Strategy The Role of Metadata in a Data Governance Strategy Prepared by: David Loshin President, Knowledge Integrity, Inc. (301) 754-6350 loshin@knowledge- integrity.com Sponsored by: Knowledge Integrity, Inc.

More information

REGULATORY REPORTING FOR FINANCIAL SERVICES

REGULATORY REPORTING FOR FINANCIAL SERVICES REGULATORY REPORTING FOR FINANCIAL SERVICES Gordon Hughes, Global Sales Director, Intel Corporation Sinan Baskan, Solutions Director, Financial Services, MarkLogic Corporation Many regulators and regulations

More information

Advanced Solutions of Microsoft SharePoint 2013

Advanced Solutions of Microsoft SharePoint 2013 Course 20332A :Advanced Solutions of Microsoft SharePoint 2013 Page 1 of 9 Advanced Solutions of Microsoft SharePoint 2013 Course 20332A: 4 days; Instructor-Led About the Course This four-day course examines

More information

Trillium Consulting. Data Governance. Optimizing Business Outcomes through Data and Information Assets

Trillium Consulting. Data Governance. Optimizing Business Outcomes through Data and Information Assets Trillium Consulting Data Governance Optimizing Business Outcomes through Data and Information Assets DAMA Indiana Winter Meeting Indianapolis, Indiana January 20, 2011 Jim Orr, Global Director Enterprise

More information

TDWI Data Modeling. Data Analysis and Design for BI and Data Warehousing Systems

TDWI Data Modeling. Data Analysis and Design for BI and Data Warehousing Systems Data Analysis and Design for BI and Data Warehousing Systems Previews of TDWI course books offer an opportunity to see the quality of our material and help you to select the courses that best fit your

More information

SAP Agile Data Preparation Simplify the Way You Shape Data PUBLIC

SAP Agile Data Preparation Simplify the Way You Shape Data PUBLIC SAP Agile Data Preparation Simplify the Way You Shape Data Introduction SAP Agile Data Preparation Overview Video SAP Agile Data Preparation is a self-service data preparation application providing data

More information

Hortonworks DataPlane Service

Hortonworks DataPlane Service Data Steward Studio Administration () docs.hortonworks.com : Data Steward Studio Administration Copyright 2016-2017 Hortonworks, Inc. All rights reserved. Please visit the Hortonworks Data Platform page

More information

Cloud Bursting: Top Reasons Your Organization will Benefit. Scott Jeschonek Director of Cloud Products Avere Systems

Cloud Bursting: Top Reasons Your Organization will Benefit. Scott Jeschonek Director of Cloud Products Avere Systems Cloud Bursting: Top Reasons Your Organization will Benefit Scott Jeschonek Director of Cloud Products Avere Systems Agenda Define Cloud Bursting Benefits of using Cloud Bursting Identify Cloud Bursting

More information

What is Gluent? The Gluent Data Platform

What is Gluent? The Gluent Data Platform What is Gluent? The Gluent Data Platform The Gluent Data Platform provides a transparent data virtualization layer between traditional databases and modern data storage platforms, such as Hadoop, in the

More information

Data Governance: Are Governance Models Keeping Up?

Data Governance: Are Governance Models Keeping Up? Data Governance: Are Governance Models Keeping Up? Jim Crompton and Paul Haines Noah Consulting Calgary Data Management Symposium Oct 2016 Copyright 2012 Noah Consulting LLC. All Rights Reserved. Page

More information

ITC Vision for Campus IT: February 28, 2018

ITC Vision for Campus IT: February 28, 2018 ITC Vision for Campus IT: 2018-2023 February 28, 2018 1 Content On-Going Strategic Initiatives, Processes ITC Vision for Campus IT Vision -2018-2023: Desired/Future State 2 Strategic Initiatives, Processes

More information

Data Management Glossary

Data Management Glossary Data Management Glossary A Access path: The route through a system by which data is found, accessed and retrieved Agile methodology: An approach to software development which takes incremental, iterative

More information

Implementing a Successful Data Governance Program

Implementing a Successful Data Governance Program Implementing a Successful Data Governance Program Mary Anne Hopper Data Management Consulting Manager SAS #AnalyticsX Data Stewardship #analyticsx SAS Data Management Framework BUSINESS DRIVERS DATA GOVERNANCE

More information

Advanced Solutions of Microsoft SharePoint Server 2013

Advanced Solutions of Microsoft SharePoint Server 2013 Course Duration: 4 Days + 1 day Self Study Course Pre-requisites: Before attending this course, students must have: Completed Course 20331: Core Solutions of Microsoft SharePoint Server 2013, successful

More information

WHAT CIOs NEED TO KNOW TO CAPITALIZE ON HYBRID CLOUD

WHAT CIOs NEED TO KNOW TO CAPITALIZE ON HYBRID CLOUD WHAT CIOs NEED TO KNOW TO CAPITALIZE ON HYBRID CLOUD 2 A CONVERSATION WITH DAVID GOULDEN Hybrid clouds are rapidly coming of age as the platforms for managing the extended computing environments of innovative

More information

Data Governance Toolkit

Data Governance Toolkit Data Governance Toolkit George Reynolds, MD, MMM, FAAP, CPHIMS, CHCIO President, HIMSS Nebraska Chapter Interim Vice President, Education. CHIME Principal, Reynolds Healthcare Advisers Agenda The Value

More information

Data governance and data quality: is it on your agenda or lurking in the shadows?

Data governance and data quality: is it on your agenda or lurking in the shadows? Data governance and data quality: is it on your agenda or lurking in the shadows? Associate Professor Anne Young Director Planning, Quality and Reporting The University of Newcastle Context Data governance

More information

How to Become a DATA GOVERNANCE EXPERT

How to Become a DATA GOVERNANCE EXPERT How to Become a DATA GOVERNANCE EXPERT You re already a data expert. You ve been working with enterprise data for years. You ve seen the good, the bad, and the downright ugly. And you ve watched the business

More information

Getting personal with your customers and GDPR

Getting personal with your customers and GDPR Getting personal with your customers and GDPR A practical approach to a secure, governed 360 degree customer view Darren Brunt Presales Director UK&I, Talend Colm Moynihan Partner Presales Manager EMEA,

More information

Best practices for building a Hadoop Data Lake Solution CHARLOTTE HADOOP USER GROUP

Best practices for building a Hadoop Data Lake Solution CHARLOTTE HADOOP USER GROUP Best practices for building a Hadoop Data Lake Solution CHARLOTTE HADOOP USER GROUP 07.29.2015 LANDING STAGING DW Let s start with something basic Is Data Lake a new concept? What is the closest we can

More information

Data Warehousing Fundamentals by Mark Peco

Data Warehousing Fundamentals by Mark Peco Data Warehousing Fundamentals by Mark Peco All rights reserved. Reproduction in whole or part prohibited except by written permission. Product and company names mentioned herein may be trademarks of their

More information

Microsoft SharePoint Online for Administrators

Microsoft SharePoint Online for Administrators 1800 ULEARN (853 276) www.ddls.com.au Microsoft 55238 - SharePoint Online for Administrators Length 3 days Price $2750.00 (inc GST) Version A Overview This course will introduce the audience to SharePoint

More information

IBM Industry Model support for a data lake architecture

IBM Industry Model support for a data lake architecture IBM Industry Models IBM Industry Model support for a data lake architecture Version 1.0 P a g e 1 Contents 1 Introduction... 3 1.1 About this document... 3 1.2 What this document means as a data lake...

More information

ETL is No Longer King, Long Live SDD

ETL is No Longer King, Long Live SDD ETL is No Longer King, Long Live SDD How to Close the Loop from Discovery to Information () to Insights (Analytics) to Outcomes (Business Processes) A presentation by Brian McCalley of DXC Technology,

More information

Data Governance. A short introduction. Gábor Gollnhofer DMS Consulting

Data Governance. A short introduction. Gábor Gollnhofer DMS Consulting Data Governance A short introduction Gábor Gollnhofer DMS Consulting 1 About DMS Consulting Ltd. Established in 2004 Data, Management, Systems, Consulting Mostly DW/BI, metadata management & data governance

More information

Supporting the Cloud Transformation of Agencies across the Public Sector

Supporting the Cloud Transformation of Agencies across the Public Sector SOLUTION SERVICES Supporting the Cloud Transformation of Agencies across the Public Sector BRIEF Digital transformation, aging IT infrastructure, the Modernizing Government Technology (MGT) Act, the Datacenter

More information

The Data Governance Journey at Principal

The Data Governance Journey at Principal The Data Governance Journey at Principal DAMA Iowa Meeting 9/20/2016 Andrea Jackson, IT Business Analyst, Sr. Sarah Playle, AD Data Quality & Governance Data governance anyone? Agenda Background Business

More information

Microsoft SharePoint Server 2013 Plan, Configure & Manage

Microsoft SharePoint Server 2013 Plan, Configure & Manage Microsoft SharePoint Server 2013 Plan, Configure & Manage Course 20331-20332B 5 Days Instructor-led, Hands on Course Information This five day instructor-led course omits the overlap and redundancy that

More information

Effective Risk Data Aggregation & Risk Reporting

Effective Risk Data Aggregation & Risk Reporting Effective Risk Data Aggregation & Risk Reporting Presented by: Ilia Bolotine Head, Adastra Business Consulting (Canada) 1 The Evolving Regulatory Landscape in Risk Management A significant lesson learned

More information

Applying Auto-Data Classification Techniques for Large Data Sets

Applying Auto-Data Classification Techniques for Large Data Sets SESSION ID: PDAC-W02 Applying Auto-Data Classification Techniques for Large Data Sets Anchit Arora Program Manager InfoSec, Cisco The proliferation of data and increase in complexity 1995 2006 2014 2020

More information

SharePoint Online for Administrators

SharePoint Online for Administrators Course 55238: SharePoint Online for Administrators Page 1 of 6 SharePoint Online for Administrators Course 55238: 2 days; Instructor-Led Introduction This course will introduce the audience to SharePoint

More information

ASG WHITE PAPER DATA INTELLIGENCE. ASG s Enterprise Data Intelligence Solutions: Data Lineage Diving Deeper

ASG WHITE PAPER DATA INTELLIGENCE. ASG s Enterprise Data Intelligence Solutions: Data Lineage Diving Deeper THE NEED Knowing where data came from, how it moves through systems, and how it changes, is the most critical and most difficult task in any data management project. If that process known as tracing data

More information

Informatica Data Quality Product Family

Informatica Data Quality Product Family Brochure Informatica Product Family Deliver the Right Capabilities at the Right Time to the Right Users Benefits Reduce risks by identifying, resolving, and preventing costly data problems Enhance IT productivity

More information

2 The IBM Data Governance Unified Process

2 The IBM Data Governance Unified Process 2 The IBM Data Governance Unified Process The benefits of a commitment to a comprehensive enterprise Data Governance initiative are many and varied, and so are the challenges to achieving strong Data Governance.

More information

Government IT Modernization and the Adoption of Hybrid Cloud

Government IT Modernization and the Adoption of Hybrid Cloud Government IT Modernization and the Adoption of Hybrid Cloud An IDC InfoBrief, Sponsored by VMware June 2018 Federal and National Governments Are at an Inflection Point Federal and national governments

More information

Are your data ready for GDPR Compliance?

Are your data ready for GDPR Compliance? Are your data ready for GDPR Compliance? USING A DATA HUB TO PROTECT PERSONAL DATA Track & Trace Capture & Connect Secure & Protect Certify & Curate Publish & Share 2017 Talend 1 Rémi Forest Solution Engineer

More information

IT Redefined. Hans Timmerman CTO EMC Nederland. Copyright 2015 EMC Corporation. All rights reserved.

IT Redefined. Hans Timmerman CTO EMC Nederland. Copyright 2015 EMC Corporation. All rights reserved. IT Redefined Hans Timmerman CTO EMC Nederland 1 INDUSTRIES FACE STRUCTURAL CHANGE 2 More CEOs See IT As Driver Of Digital Business Innovation And Growth IT has the highest business priority in more than

More information

Network Implications of Cloud Computing Presentation to Internet2 Meeting November 4, 2010

Network Implications of Cloud Computing Presentation to Internet2 Meeting November 4, 2010 Network Implications of Cloud Computing Presentation to Internet2 Meeting November 4, 2010 Lou Topfl Director, New Technology Product Development Engineering AT&T Agenda What is the Cloud? Types of Cloud

More information

Importance of the Data Management process in setting up the GDPR within a company CREOBIS

Importance of the Data Management process in setting up the GDPR within a company CREOBIS Importance of the Data Management process in setting up the GDPR within a company CREOBIS 1 Alain Cieslik Personal Data is the oil of the digital world 2 Alain Cieslik Personal information comes in different

More information

Datameer Big Data Governance. Bringing open-architected and forward-compatible governance controls to Hadoop analytics

Datameer Big Data Governance. Bringing open-architected and forward-compatible governance controls to Hadoop analytics Datameer Big Data Governance Bringing open-architected and forward-compatible governance controls to Hadoop analytics As big data moves toward greater mainstream adoption, its compliance with long-standing

More information

PERSPECTIVE. Effective Data Governance. Abstract

PERSPECTIVE. Effective Data Governance. Abstract PERSPECTIVE Effective Governance Abstract governance is no more just another item that is good to talk about and nice to have, for global data management organizations. This PoV looks into why data governance

More information

Lambda Architecture for Batch and Stream Processing. October 2018

Lambda Architecture for Batch and Stream Processing. October 2018 Lambda Architecture for Batch and Stream Processing October 2018 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Notices This document is provided for informational purposes only.

More information

Drawing the Big Picture

Drawing the Big Picture Drawing the Big Picture Multi-Platform Data Architectures, Queries, and Analytics Philip Russom TDWI Research Director for Data Management August 26, 2015 Sponsor 2 Speakers Philip Russom TDWI Research

More information

National Digital Engineering Working Group (NDEWG)

National Digital Engineering Working Group (NDEWG) NATSPEC Strategic Outcomes with BIM Seminar National Digital Engineering Working Group (NDEWG) Simon Vaux Chair, NDEWG Director Digital Engineering, TfNSW 3 rd August 2017 What is Digital Engineering?

More information

Universal Hybrid Cloud Connector. Veritas Technologies LLC

Universal Hybrid Cloud Connector. Veritas Technologies LLC Universal Hybrid Cloud Connector Pankaj Mansukhani Nalini Kumari Nallamalli Meghraj Bendre Veritas Technologies LLC 1 Agenda Hybrid Cloud Multi Volume File system SmartTier Architecture of Cloud Connector

More information

Informatica Data Lake Management on the AWS Cloud

Informatica Data Lake Management on the AWS Cloud Informatica Data Lake Management on the AWS Cloud Quick Start Reference Deployment January 2018 Informatica Big Data Team Vinod Shukla AWS Quick Start Reference Team Contents Overview... 2 Informatica

More information

The Emerging Data Lake IT Strategy

The Emerging Data Lake IT Strategy The Emerging Data Lake IT Strategy An Evolving Approach for Dealing with Big Data & Changing Environments bit.ly/datalake SPEAKERS: Thomas Kelly, Practice Director Cognizant Technology Solutions Sean Martin,

More information

Automated Netezza to Cloud Migration

Automated Netezza to Cloud Migration Automated Netezza to Cloud Migration CASE STUDY Client Overview Our client is a government-sponsored enterprise* that provides financial products and services to increase the availability and affordability

More information

Modern Database Architectures Demand Modern Data Security Measures

Modern Database Architectures Demand Modern Data Security Measures Forrester Opportunity Snapshot: A Custom Study Commissioned By Imperva January 2018 Modern Database Architectures Demand Modern Data Security Measures GET STARTED Introduction The fast-paced, ever-changing

More information

What s a BA to do with Data? Discover and define standard data elements in business terms

What s a BA to do with Data? Discover and define standard data elements in business terms What s a BA to do with Data? Discover and define standard data elements in business terms Susan Block, Lead Business Systems Analyst The Vanguard Group Discussion Points Discovering Business Data The Data

More information

Match data set availability to data resource requirements, including gap analysis and remediation assistance.

Match data set availability to data resource requirements, including gap analysis and remediation assistance. Discovering data/datasets Specify Data Requirements Identify Data Assets Assist customers with clarifying problem statements, use cases, high-level requirements (e.g. goals, objectives) and detailed requirements

More information

Storage as a Service From Hitachi Vantara

Storage as a Service From Hitachi Vantara Storage as a Service From Hitachi Vantara Cost-Effectively Manage Your Storage Needs SOLUTION PROFILE Companies are increasingly looking for ways to lower the cost of IT management. Capital expenditure

More information

Advanced Technologies of SharePoint 2016 ( )

Advanced Technologies of SharePoint 2016 ( ) SharePoint Course - 203392 Advanced Technologies of SharePoint 2016 (20339-2) Length 5 days Audience The course is targeted at experienced IT Professionals who are interested in learning how to install,

More information

University of Texas Arlington Data Governance Program Charter

University of Texas Arlington Data Governance Program Charter University of Texas Arlington Data Governance Program Charter Document Version: 1.0 Version/Published Date: 11/2016 Table of Contents 1 INTRODUCTION... 3 1.1 PURPOSE OF THIS DOCUMENT... 3 1.2 SCOPE...

More information

SOLUTION OVERVIEW: DATA CATALOGS FOR RISK AND COMPLIANCE

SOLUTION OVERVIEW: DATA CATALOGS FOR RISK AND COMPLIANCE SOLUTION OVERVIEW: DATA CATALOGS FOR RISK AND COMPLIANCE Introduction As governments increasingly recognize the importance of data and the potential for its misuse, the amount of compliance rules and regulations

More information

How Insurers are Realising the Promise of Big Data

How Insurers are Realising the Promise of Big Data How Insurers are Realising the Promise of Big Data Jason Hunter, CTO Asia-Pacific, MarkLogic A Big Data Challenge: Pushing the Limits of What's Possible The Art of the Possible Multiple Government Agencies

More information

2018 Edition. Security and Compliance for Office 365

2018 Edition. Security and Compliance for Office 365 2018 Edition Security and Compliance for Office 365 [Proofpoint has] given us our time back to focus on the really evil stuff. CISO, Global 500 Manufacturer Like millions of businesses around the world,

More information

Data Modeling Whitepaper DATA MODELING IS A FORM OF DATA GOVERNANCE BY ROBERT S. SEINER

Data Modeling Whitepaper DATA MODELING IS A FORM OF DATA GOVERNANCE BY ROBERT S. SEINER Data Modeling Whitepaper DATA MODELING IS A FORM OF DATA GOVERNANCE BY ROBERT S. SEINER TABLE OF CONTENTS 3 Introduction 4 Three Actions of Governing Data 4 Governing the Action of Defining Data 5 Relating

More information

Cloud Going Mainstream All Are Trying, Some Are Benefiting; Few Are Maximizing Value

Cloud Going Mainstream All Are Trying, Some Are Benefiting; Few Are Maximizing Value All Are Trying, Some Are Benefiting; Few Are Maximizing Value Germany Findings September 2016 Executive Summary Cloud adoption has increased 70% from last year, with 71% of companies in Germany pursuing

More information

with Advanced Protection

with Advanced  Protection with Advanced Email Protection OVERVIEW Today s sophisticated threats are changing. They re multiplying. They re morphing into new variants. And they re targeting people, not just technology. As organizations

More information

Cloud Going Mainstream All Are Trying, Some Are Benefiting; Few Are Maximizing Value

Cloud Going Mainstream All Are Trying, Some Are Benefiting; Few Are Maximizing Value All Are Trying, Some Are Benefiting; Few Are Maximizing Value Latin America Findings September 2016 Executive Summary Cloud adoption has increased 49% from last year, with 78% of companies in Latin America

More information

Data Governance & Privacy Compliance: 16 Practical Steps towards GDPR Compliance with Talend

Data Governance & Privacy Compliance: 16 Practical Steps towards GDPR Compliance with Talend White Paper Data Governance & Privacy Compliance: 16 Practical Steps towards GDPR Compliance with Talend By Sunil Soares and Jean-Michel Franco July 8, 2017 The European Union (EU) published the General

More information

Fast Innovation requires Fast IT

Fast Innovation requires Fast IT Fast Innovation requires Fast IT Cisco Data Virtualization Puneet Kumar Bhugra Business Solutions Manager 1 Challenge In Data, Big Data & Analytics Siloed, Multiple Sources Business Outcomes Business Opportunity:

More information

Cloud Bursting Jim Thompson Avere Systems

Cloud Bursting Jim Thompson Avere Systems Cloud Bursting Jim Thompson Avere Systems Why Use Someone Else s Cloud? Significantly reduce infrastructure management costs both in money and time Maintain operational flexibility during scale-out jobs

More information

Smart Data Center From Hitachi Vantara: Transform to an Agile, Learning Data Center

Smart Data Center From Hitachi Vantara: Transform to an Agile, Learning Data Center Smart Data Center From Hitachi Vantara: Transform to an Agile, Learning Data Center Leverage Analytics To Protect and Optimize Your Business Infrastructure SOLUTION PROFILE Managing a data center and the

More information

NCHRP Project 20-97: Improving Findability and Relevance of Transportation Information. Part I: Project Overview Gordon Kennedy, Washington State DOT

NCHRP Project 20-97: Improving Findability and Relevance of Transportation Information. Part I: Project Overview Gordon Kennedy, Washington State DOT NCHRP Project 20-97: Improving Findability and Relevance of Transportation Information Part I: Project Overview Gordon Kennedy, Washington State DOT November, 2017 NCHRP is a State-Driven Program Sponsored

More information

IBM Information Governance Catalog (IGC) Partner Application Validation Quick Guide

IBM Information Governance Catalog (IGC) Partner Application Validation Quick Guide IBM Information Governance Catalog (IGC) Partner Application Validation Quick Guide VERSION: 2.0 DATE: Feb 15, 2018 EDITOR: D. Rangarao Table of Contents 1 Overview of the Application Validation Process...

More information

Information empowerment for your evolving data ecosystem

Information empowerment for your evolving data ecosystem Information empowerment for your evolving data ecosystem Highlights Enables better results for critical projects and key analytics initiatives Ensures the information is trusted, consistent and governed

More information

Luncheon Webinar Series April 25th, Governance for ETL Presented by Beate Porst Sponsored By:

Luncheon Webinar Series April 25th, Governance for ETL Presented by Beate Porst Sponsored By: Luncheon Webinar Series April 25th, 2014 Governance for ETL Presented by Beate Porst Sponsored By: 1 Governance for ETL Questions and suggestions regarding presentation topics? - send to editor@dsxchange.com

More information

Data Governance Quick Start

Data Governance Quick Start Service Offering Data Governance Quick Start Congratulations! You ve been named the Data Governance Leader Now What? Benefits Accelerate the initiation of your Data Governance program with an industry

More information