Data Quality Architecture and Options

Size: px
Start display at page:

Download "Data Quality Architecture and Options"

Transcription

1 Data Quality Architecture and Options Nita Khare Alliances & Technology Team - Solution Architect nita.khare@tcs.com * IBM IM Champion 2013 * December 3,

2 Agenda Pain Areas / Challenges of DQ Solution DQ Solution DQ Architecture Options Near Real-time/Inline DQ management Solution Standardization process Other Important DQ Processes Benefits 1

3 Why is DQ important? A Simple Pizza & Beer Order Receipt Date format not known This can mislead while calculating the sales numbers. Suffix Spring Ale added to the beer description by default - This data quality issue impacts the store s inventory and procurement systems. Look at the age It s a default nos. Not capturing the correct age range of the buyer During analysis, we will get results depending on the current year only. i.e. say if default nos. mentioned is 11/22/1990 and if we do analysis today i.e. 11/22/2013, we will think that the buyers are of age somewhere around 23 years which is not at all true. Currency format is not mentioned Sales Nos. can go wrong. Source : 2

4 Pain Areas of DQ Solution Providing high performance for ad-hoc queries Identification of Business Areas Managing Large Volumes of Data Resolving data quality issues & survival policy decisions Nos. and types of Technology Involved Handling complex relationship in data Large Nos. of Unmanaged Data sources Data Availability & Cleansing 3

5 DQ Solution DQ Management Approach DMAIC is a 5 step iterative approach to data quality improvement. It comprises of continuous analysis, observation, and improvement of underlying data leading to overall improvement in the quality of information across the organization 4

6 DQ Solution A typical DQ Lifecycle The ultimate goal of DQ management should be to move from reactive mode of data quality management to proactively control and manage the data quality so that the data imperfections in the systems are limited. 5

7 DQ Solution Factors Influencing DQ All the below mentioned factors (LUCAS) need to be addressed in order to ensure quality data available for analysis to the end users. 6

8 DQ Solution DQ Reference Architecture 7

9 DQ Architecture Options 8

10 DQ Architecture Options Pros & Cons Consideration Points Option 1: Source System Option 2: ETL Layer Option 3: Target Layer Data cleansing effort and cost Ensures quality data is available at the place where it is captured and hence minimal data quality impact on downstream applications. Cost and effort of DQ exercise increases as we move away from the source system. More Expensive Data Load The data load may be delayed as the DQ checks needs to be applied in the source system before the data is ready to be loaded into target layers. The more the DQ checks, the more would be the impact on the data load. But if designed optimally, there might not be much impact. The data load is very quick as the DQ checks are applied after the loads into DWH. Impact on Source System DQ processes may become an overhead to the operational system. Less impact to source operational system compared to option 1 Minimal impact to source operational system performance Heterogeneous source systems An additional overhead of implementing data quality processes and procedures in multiple platforms Less impact to source operational system compared to option 1. Minimal impact to source operational system performance 9

11 Near Real-time/Inline DQ management Solution Option 1: Data Quality Management using Hadoop Option 2: Data Quality Management using Database resources Consideration Points Option 1: Managing DQ solution using Big Data technology Option 2: Managing Data Quality Solutions using Database Resources Volume Best suited for very high volume data. It works well with low to medium data volumes. Update Frequency Efficiently handles frequent changed records Source data quality / Very efficiently handles multiple data No. of DQ Rules quality checks and controls during data load process. Data Load SLA Capable of loading high volume data in stipulated time. Cost Cost of implementation is cheaper compared to Option 2. Maintenance / Support As big data technologies, especially requires coding, debugging and applying fixes are more time consuming and costly. Capable of handling frequently changed records. Performance bottleneck possible with volume growth and more number of DQ checks. Capable of loading high volume data in stipulated time but can crumble with data growth. Cost of implementation is higher compared to Option 1 Maintainability, debugging and fixes are quicker compared to Option 1. Expert availability As big data technologies are still emerging, there could be difficulty in getting big data skilled associate. ETL experts are easily available. 10

12 DQ Solution Data Governance Council 11

13 Standardization process Cleansing and standardization of data is achieved by set of transformations, where organizations data process through each of these stages for better data cleansing and standardization. 12

14 Standardization process Technical Integration 13

15 Other Important DQ Processes Error Handling This is one of the ways of implementing Error handling solution in any ETL architecture One common error table can be created to capture and store exceptions while loading data in downstream systems. When records are rejected due to data quality issues (validation errors) they will be logged in the exception database. In case, there is agreed default value provided by Business for the source columns not holding valid data, the same will be loaded into the target table. Performance overhead will be minimal as exception records will be low in volume in incremental scenario and there would be mostly all inserts in the exceptions table. 14

16 Sample Error Handling Dashboard 15

17 Reconciliation Data reconciliation is performed to verify the integrity of the data loaded into the warehouse. One of the major reasons of information loss is loading failures or errors during loading. Such errors can occur due to several reasons: Inconsistent or non coherent data from source Non-integrating data among different sources Unclean/ non-profiled data Un-handled exceptions Constraint violations Logical issues/ Inherent flaws in program Technical failures like loss of connectivity, loss over network, space issue etc. Reconciliation process will only indicate whether or not the data is correct. It will not indicate why the data is not correct. Reconciliation process answers what part of the question, not why part of the question. Typically they are implemented to ensure that: SUM/COUNT (Input) = SUM/COUNT (Output) + SUM/COUNT (Captured Reject) 16

18 Types Of Reconciliation Transactional Reconciliation Matching the number of records in source and in target. If these counts are equal, it can be safely assumed that records were not left out due to an error during the ETL or simple load process. This can be further verified by the lack of errors (not necessarily warnings) in the exception reporting by the ETL tool. Financial Reconciliation This checks on the data content in source and target. E.g. computing the sum of Amount column in all records at source and target and matching the same. Financial reconciliation will be performed before the data load starts into EDW. Financial reconciliation, if required, can be implemented at specific job level after identifying columns to be reconciled for a source system batch run. 17

19 Benefits Central repository of enterprise data and single version of truth across the enterprise providing Unified Information Delivery platform Data is complete, accurate and consistent in the target system enabling better confidence in decision making. Consolidation of business logic at enterprise level, to remove discrepancies and standardize the data set. Provide business users & data stewards a clear picture of their data quality, monitor, track and govern information over time. Improves data standardization through common data governance framework at the enterprise level. Provides architectural options for implementing DQ solution. The cost of DQM increases as we move from Source to Target. Hence it is advisable to apply DQM solution in the source or nearer to the source in order to reduce the DQM effort. 18

20 THANK YOU 19

Luncheon Webinar Series January 13th, Free is Better Presented by Tony Curcio and Beate Porst Sponsored By:

Luncheon Webinar Series January 13th, Free is Better Presented by Tony Curcio and Beate Porst Sponsored By: Luncheon Webinar Series January 13th, 2014 Free is Better Presented by Tony Curcio and Beate Porst Sponsored By: 1 Free is Better Questions and suggestions regarding presentation topics? - send to editor@dsxchange.com

More information

Fast Innovation requires Fast IT

Fast Innovation requires Fast IT Fast Innovation requires Fast IT Cisco Data Virtualization Puneet Kumar Bhugra Business Solutions Manager 1 Challenge In Data, Big Data & Analytics Siloed, Multiple Sources Business Outcomes Business Opportunity:

More information

Full file at

Full file at Chapter 2 Data Warehousing True-False Questions 1. A real-time, enterprise-level data warehouse combined with a strategy for its use in decision support can leverage data to provide massive financial benefits

More information

Work Breakdown Structure

Work Breakdown Structure MossAtre.book Page 491 Sunday, February 9, 2003 7:01 PM APPENDIX Work Breakdown Structure The work breakdown structure in this appendix reflects the contents of the enclosed CD-ROM. TASK_DATA 491 1 Your

More information

Data Mining & Data Warehouse

Data Mining & Data Warehouse Data Mining & Data Warehouse Asso. Profe. Dr. Raed Ibraheem Hamed University of Human Development, College of Science and Technology Department of Information Technology 2016 2017 (1) Points to Cover Problem:

More information

Data Mining. Asso. Profe. Dr. Raed Ibraheem Hamed. University of Human Development, College of Science and Technology Department of CS (1)

Data Mining. Asso. Profe. Dr. Raed Ibraheem Hamed. University of Human Development, College of Science and Technology Department of CS (1) Data Mining Asso. Profe. Dr. Raed Ibraheem Hamed University of Human Development, College of Science and Technology Department of CS 2016 2017 (1) Points to Cover Problem: Heterogeneous Information Sources

More information

Data Vault Brisbane User Group

Data Vault Brisbane User Group Data Vault Brisbane User Group 26-02-2013 Agenda Introductions A brief introduction to Data Vault Creating a Data Vault based Data Warehouse Comparisons with 3NF/Kimball When is it good for you? Examples

More information

DEVELOPING DECISION SUPPORT SYSTEMS A MODERN APPROACH

DEVELOPING DECISION SUPPORT SYSTEMS A MODERN APPROACH DEVELOPING DECISION SUPPORT SYSTEMS A MODERN APPROACH Ion Lungu PhD, Vlad Diaconiţa PhD Candidate Department of Economic Informatics Academy of Economic Studies Bucharest In today s economy access to quality

More information

AVOIDING SILOED DATA AND SILOED DATA MANAGEMENT

AVOIDING SILOED DATA AND SILOED DATA MANAGEMENT AVOIDING SILOED DATA AND SILOED DATA MANAGEMENT Dalton Cervo Author, Consultant, Data Management Expert March 2016 This presentation contains extracts from books that are: Copyright 2011 John Wiley & Sons,

More information

Data Warehousing in the Age of In-Memory Computing and Real-Time Analytics. Erich Schneider, Daniel Rutschmann June 2014

Data Warehousing in the Age of In-Memory Computing and Real-Time Analytics. Erich Schneider, Daniel Rutschmann June 2014 Data Warehousing in the Age of In-Memory Computing and Real-Time Analytics Erich Schneider, Daniel Rutschmann June 2014 Disclaimer This presentation outlines our general product direction and should not

More information

MDM Partner Summit 2015 Oracle Enterprise Data Quality Overview & Roadmap

MDM Partner Summit 2015 Oracle Enterprise Data Quality Overview & Roadmap MDM Partner Summit 2015 Oracle Enterprise Data Quality Overview & Roadmap Steve Tuck Senior Director, Product Strategy Todd Blackmon Senior Director, Sales Consulting David Gengenbach Sales Consultant

More information

DATAWAREHOUSING AND ETL PROCESSES: An Explanatory Research

DATAWAREHOUSING AND ETL PROCESSES: An Explanatory Research DATAWAREHOUSING AND ETL PROCESSES: An Explanatory Research Priyanshu Gupta ETL Software Developer United Health Group Abstract- In this paper, the author has focused on explaining Data Warehousing and

More information

Low Friction Data Warehousing WITH PERSPECTIVE ILM DATA GOVERNOR

Low Friction Data Warehousing WITH PERSPECTIVE ILM DATA GOVERNOR Low Friction Data Warehousing WITH PERSPECTIVE ILM DATA GOVERNOR Table of Contents Foreword... 2 New Era of Rapid Data Warehousing... 3 Eliminating Slow Reporting and Analytics Pains... 3 Applying 20 Years

More information

Data Virtualization Implementation Methodology and Best Practices

Data Virtualization Implementation Methodology and Best Practices White Paper Data Virtualization Implementation Methodology and Best Practices INTRODUCTION Cisco s proven Data Virtualization Implementation Methodology and Best Practices is compiled from our successful

More information

The Data Catalog The Key to Managing Data, Big and Small. April Reeve May

The Data Catalog The Key to Managing Data, Big and Small. April Reeve May The Data Catalog The Key to Managing Data, Big and Small April Reeve May 18 2017 April Reeve Thirty years doing data oriented stuff Data Management disciplines Data Integration, Data Governance, Data Modeling,

More information

@Pentaho #BigDataWebSeries

@Pentaho #BigDataWebSeries Enterprise Data Warehouse Optimization with Hadoop Big Data @Pentaho #BigDataWebSeries Your Hosts Today Dave Henry SVP Enterprise Solutions Davy Nys VP EMEA & APAC 2 Source/copyright: The Human Face of

More information

Data Management Glossary

Data Management Glossary Data Management Glossary A Access path: The route through a system by which data is found, accessed and retrieved Agile methodology: An approach to software development which takes incremental, iterative

More information

Metadata Management as a Key Component to Data Governance, Data Stewardship, and Data Quality Management. Wednesday, July 20 th 2016

Metadata Management as a Key Component to Data Governance, Data Stewardship, and Data Quality Management. Wednesday, July 20 th 2016 Metadata Management as a Key Component to Data Governance, Data Stewardship, and Data Quality Management Wednesday, July 20 th 2016 Confidential, Datasource Consulting, LLC 2 Multi-Domain Master Data Management

More information

Revolutionize the Way You Work With IMS Applications Using IBM UrbanCode Deploy Evgeni Liakhovich, IMS Developer

Revolutionize the Way You Work With IMS Applications Using IBM UrbanCode Deploy Evgeni Liakhovich, IMS Developer Revolutionize the Way You Work With IMS Applications Using IBM UrbanCode Deploy Evgeni Liakhovich, IMS Developer evgueni@us.ibm.com * 2016 IBM Corporation Trademarks, copyrights, disclaimers IBM, the IBM

More information

Introduction to Data Science

Introduction to Data Science UNIT I INTRODUCTION TO DATA SCIENCE Syllabus Introduction of Data Science Basic Data Analytics using R R Graphical User Interfaces Data Import and Export Attribute and Data Types Descriptive Statistics

More information

Data Warehouses and Deployment

Data Warehouses and Deployment Data Warehouses and Deployment This document contains the notes about data warehouses and lifecycle for data warehouse deployment project. This can be useful for students or working professionals to gain

More information

A Single Source of Truth

A Single Source of Truth A Single Source of Truth is it the mythical creature of data management? In the world of data management, a single source of truth is a fully trusted data source the ultimate authority for the particular

More information

Building a Data Warehouse: Data Quality is key for BI. Werner Daehn

Building a Data Warehouse: Data Quality is key for BI. Werner Daehn [ Building a Data Warehouse: Data Quality is key for BI Werner Daehn [ Learning Points A DWH project is about discovering new information Not having a good quality counterfeits that purpose Actually it

More information

TDWI Data Modeling. Data Analysis and Design for BI and Data Warehousing Systems

TDWI Data Modeling. Data Analysis and Design for BI and Data Warehousing Systems Data Analysis and Design for BI and Data Warehousing Systems Previews of TDWI course books offer an opportunity to see the quality of our material and help you to select the courses that best fit your

More information

What s the Value of Your Data? The Agile Advantage

What s the Value of Your Data? The Agile Advantage What s the Value of Your Data? The Agile Advantage by Jan Paul Fillie and Werner de Jong In a world of big data, advanced analytics, in-memory data warehousing, and real-time business intelligence (BI),

More information

Implementing a Data Warehouse with Microsoft SQL Server 2014 (20463D)

Implementing a Data Warehouse with Microsoft SQL Server 2014 (20463D) Implementing a Data Warehouse with Microsoft SQL Server 2014 (20463D) Overview This course describes how to implement a data warehouse platform to support a BI solution. Students will learn how to create

More information

Musewerx support for Application Maintenance in Software AG NATURAL and ADABAS TM environment

Musewerx support for Application Maintenance in Software AG NATURAL and ADABAS TM environment Musewerx support for Application Maintenance in Software AG NATURAL and ADABAS TM environment Musewerx provides Application Maintenance Services for your applications written in NATURAL and ADABAS environment.

More information

TECHNOLOGY BRIEF: CA ERWIN DATA PROFILER. Combining Data Profiling and Data Modeling for Better Data Quality

TECHNOLOGY BRIEF: CA ERWIN DATA PROFILER. Combining Data Profiling and Data Modeling for Better Data Quality TECHNOLOGY BRIEF: CA ERWIN DATA PROFILER Combining Data Profiling and Data Modeling for Better Data Quality Table of Contents Executive Summary SECTION 1: CHALLENGE 2 Reducing the Cost and Risk of Data

More information

Deploying IMS Applications with IBM UrbanCode Deploy

Deploying IMS Applications with IBM UrbanCode Deploy Deploying IMS Applications with IBM UrbanCode Deploy Evgeni Liakhovich, IMS Develper evgueni@us.ibm.com * IMS Technical Symposium 2015 Trademarks, copyrights, disclaimers IBM, the IBM logo, and ibm.com

More information

IBM Software IBM InfoSphere Information Server for Data Quality

IBM Software IBM InfoSphere Information Server for Data Quality IBM InfoSphere Information Server for Data Quality A component index Table of contents 3 6 9 9 InfoSphere QualityStage 10 InfoSphere Information Analyzer 12 InfoSphere Discovery 13 14 2 Do you have confidence

More information

Data Modeling Whitepaper DATA MODELING IS A FORM OF DATA GOVERNANCE BY ROBERT S. SEINER

Data Modeling Whitepaper DATA MODELING IS A FORM OF DATA GOVERNANCE BY ROBERT S. SEINER Data Modeling Whitepaper DATA MODELING IS A FORM OF DATA GOVERNANCE BY ROBERT S. SEINER TABLE OF CONTENTS 3 Introduction 4 Three Actions of Governing Data 4 Governing the Action of Defining Data 5 Relating

More information

Introduction 1.1 SERVER-CENTRIC IT ARCHITECTURE AND ITS LIMITATIONS

Introduction 1.1 SERVER-CENTRIC IT ARCHITECTURE AND ITS LIMITATIONS 1 Introduction The purpose of this chapter is to convey the basic idea underlying this book. To this end we will first describe conventional server-centric IT architecture and sketch out its limitations

More information

The Six Principles of BW Data Validation

The Six Principles of BW Data Validation The Problem The Six Principles of BW Data Validation Users do not trust the data in your BW system. The Cause By their nature, data warehouses store large volumes of data. For analytical purposes, the

More information

Realizing the Full Potential of MDM 1

Realizing the Full Potential of MDM 1 Realizing the Full Potential of MDM SOLUTION MDM Augmented with Data Virtualization INDUSTRY Applicable to all Industries EBSITE www.denodo.com PRODUCT OVERVIE The Denodo Platform offers the broadest access

More information

Enterprise Data-warehouse (EDW) In Easy Steps

Enterprise Data-warehouse (EDW) In Easy Steps Enterprise Data-warehouse (EDW) In Easy Steps Data-warehouses (DW) are centralised data repositories that integrate data from various transactional, legacy, or external systems, applications, and sources.

More information

Three requirements for reducing performance issues and unplanned downtime in any data center

Three requirements for reducing performance issues and unplanned downtime in any data center Three requirements for reducing performance issues and unplanned downtime in any data center DARRYL FUJITA TECHNICAL SOFTWARE SOLUTIONS SPECIALIST HITACHI DATA SYSTEMS How Big Is The Cost Of Unplanned

More information

Microsoft Implementing a Data Warehouse with Microsoft SQL Server 2014

Microsoft Implementing a Data Warehouse with Microsoft SQL Server 2014 1800 ULEARN (853 276) www.ddls.com.au Microsoft 20463 - Implementing a Data Warehouse with Microsoft SQL Server 2014 Length 5 days Price $4290.00 (inc GST) Version D Overview Please note: Microsoft have

More information

Data Management Framework

Data Management Framework The Organization Management Framework Created and Presented By Copyright 2018 Management Is part of the Manage Knowledge, Improvement and Change process of the APQC Process Classification Framework (wwwapqcorg)

More information

Making the Impossible Possible

Making the Impossible Possible Making the Impossible Possible Find and Eliminate Data Errors with Automated Discovery and Data Lineage Introduction Organizations have long struggled to identify and take advantage of opportunities for

More information

Database Architectures

Database Architectures Database Architectures CPS352: Database Systems Simon Miner Gordon College Last Revised: 4/15/15 Agenda Check-in Parallelism and Distributed Databases Technology Research Project Introduction to NoSQL

More information

Modernizing Business Intelligence and Analytics

Modernizing Business Intelligence and Analytics Modernizing Business Intelligence and Analytics Justin Erickson Senior Director, Product Management 1 Agenda What benefits can I achieve from modernizing my analytic DB? When and how do I migrate from

More information

Data Mining: Approach Towards The Accuracy Using Teradata!

Data Mining: Approach Towards The Accuracy Using Teradata! Data Mining: Approach Towards The Accuracy Using Teradata! Shubhangi Pharande Department of MCA NBNSSOCS,Sinhgad Institute Simantini Nalawade Department of MCA NBNSSOCS,Sinhgad Institute Ajay Nalawade

More information

Effective Risk Data Aggregation & Risk Reporting

Effective Risk Data Aggregation & Risk Reporting Effective Risk Data Aggregation & Risk Reporting Presented by: Ilia Bolotine Head, Adastra Business Consulting (Canada) 1 The Evolving Regulatory Landscape in Risk Management A significant lesson learned

More information

DATACENTER SERVICES DATACENTER

DATACENTER SERVICES DATACENTER SERVICES SOLUTION SUMMARY ALL CHANGE React, grow and innovate faster with Computacenter s agile infrastructure services Customers expect an always-on, superfast response. Businesses need to release new

More information

Lambda Architecture for Batch and Stream Processing. October 2018

Lambda Architecture for Batch and Stream Processing. October 2018 Lambda Architecture for Batch and Stream Processing October 2018 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Notices This document is provided for informational purposes only.

More information

Data Migration Platform

Data Migration Platform appmigrate TM Data Migration Platform QUALITY MANAGEMENT RECONCILIATION MAINTENANCE MIGRATION PROFILING COEXISTENCE CUTOVER PLANNING AND EXECUTION ? Problem Data migration done in the traditional way,

More information

Decision Support. Go-Live Update

Decision Support. Go-Live Update Decision Support Go-Live Update UNLV Decision Support Purpose is to provide knowledgeable campus users with access to data for decision making. Principles embraced: Involvement of decision makers and

More information

Automated Testing of Tableau Dashboards

Automated Testing of Tableau Dashboards Kinesis Technical Whitepapers April 2018 Kinesis CI Automated Testing of Tableau Dashboards Abstract Companies make business critical decisions every day, based on data from their business intelligence

More information

Cisco EnergyWise: Power Management Without Borders

Cisco EnergyWise: Power Management Without Borders Cisco EnergyWise: Power Management Without Borders Introduction In response to energy costs, environmental concerns, and government directives, there is an increased need for sustainable and green business

More information

IBM InfoSphere Information Analyzer

IBM InfoSphere Information Analyzer IBM InfoSphere Information Analyzer Understand, analyze and monitor your data Highlights Develop a greater understanding of data source structure, content and quality Leverage data quality rules continuously

More information

The Data Organization

The Data Organization C V I T F E P A O TM The Organization 1251 Yosemite Way Hayward, CA 94545 (510) 303-8868 rschoenrank@computerorg / Warehouse Strategies By Rainer Schoenrank Warehouse Consultant January 2018 / Warehouse

More information

IT Monitoring Tool Gaps are Impacting the Business A survey of IT Professionals and Executives

IT Monitoring Tool Gaps are Impacting the Business A survey of IT Professionals and Executives IT Monitoring Tool Gaps are Impacting the Business A survey of IT Professionals and Executives June 2018 1 Executive Summary This research finds that large enterprise customers and employees endure a substantial

More information

How to integrate data into Tableau

How to integrate data into Tableau 1 How to integrate data into Tableau a comparison of 3 approaches: ETL, Tableau self-service and WHITE PAPER WHITE PAPER 2 data How to integrate data into Tableau a comparison of 3 es: ETL, Tableau self-service

More information

Improving Data Governance in Your Organization. Faire Co Regional Manger, Information Management Software, ASEAN

Improving Data Governance in Your Organization. Faire Co Regional Manger, Information Management Software, ASEAN Improving Data Governance in Your Organization Faire Co Regional Manger, Information Management Software, ASEAN Topics The Innovation Imperative and Innovating with Information What Is Data Governance?

More information

ETL AUTO RECONCILIATION

ETL AUTO RECONCILIATION ISSN 0975-3303 Mapana J Sci, 9, 2 (2010), 35-46 https://doi.org/10.12725/mjs.17.5 ETL AUTO RECONCILIATION Jibrael Jos* and Brogodishworon U** ABSTRACT Extraction, tro.nsformation and Loading (ETL) is the

More information

IBM DB2 Analytics Accelerator use cases

IBM DB2 Analytics Accelerator use cases IBM DB2 Analytics Accelerator use cases Ciro Puglisi Netezza Europe +41 79 770 5713 cpug@ch.ibm.com 1 Traditional systems landscape Applications OLTP Staging Area ODS EDW Data Marts ETL ETL ETL ETL Historical

More information

The EDW: An Overview For Foothill-De Anza Community College District

The EDW: An Overview For Foothill-De Anza Community College District The EDW: An Overview For Foothill-De Anza Community College District R. Joanne Keys, SunGard Higher Education October, 2009 1 The Agenda Objective: Set the stage for a successful implementation of the

More information

The Data Organization

The Data Organization C V I T F E P A O TM The Data Organization 1251 Yosemite Way Hayward, CA 94545 (510) 303-8868 rschoenrank@computer.org Business Intelligence Process Architecture By Rainer Schoenrank Data Warehouse Consultant

More information

Demystifying GRC. Abstract

Demystifying GRC. Abstract White Paper Demystifying GRC Abstract Executives globally are highly focused on initiatives around Governance, Risk and Compliance (GRC), to improve upon risk management and regulatory compliances. Over

More information

Association for International PMOs. Expert. Practitioner. Foundation PMO. Learning.

Association for International PMOs. Expert. Practitioner. Foundation PMO. Learning. AIPM Association for International PMOs Expert Practitioner Foundation www.pmolearning.co.uk PMO The Leading Standard and Certification for PMO Professionals Today Understand the Value of High-Performing

More information

Improving Your Business with Oracle Data Integration See How Oracle Enterprise Metadata Management Can Help You

Improving Your Business with Oracle Data Integration See How Oracle Enterprise Metadata Management Can Help You Improving Your Business with Oracle Data Integration See How Oracle Enterprise Metadata Management Can Help You Özgür Yiğit Oracle Data Integration, Senior Manager, ECEMEA Safe Harbor Statement The following

More information

Data Warehouse Testing. By: Rakesh Kumar Sharma

Data Warehouse Testing. By: Rakesh Kumar Sharma Data Warehouse Testing By: Rakesh Kumar Sharma Index...2 Introduction...3 About Data Warehouse...3 Data Warehouse definition...3 Testing Process for Data warehouse:...3 Requirements Testing :...3 Unit

More information

UKEF UK Export Finance. Transformation with the Microsoft Cloud

UKEF UK Export Finance. Transformation with the Microsoft Cloud UKEF UK Export Finance Transformation with the Microsoft Cloud the customer overview Customer UKEF Website www.gov.uk/uk-export-finance Number of employees UKEF s fixed deadline had major financial implications,

More information

MERAKI SERVICE DESCRIPTION

MERAKI SERVICE DESCRIPTION MERAKI SERVICE DESCRIPTION Document Control Purpose of this Document The purpose of this document is to provide clear guidance on what the Meraki service will deliver to the end client. Document Contributors

More information

Virtuoso Infotech Pvt. Ltd.

Virtuoso Infotech Pvt. Ltd. Virtuoso Infotech Pvt. Ltd. About Virtuoso Infotech Fastest growing IT firm; Offers the flexibility of a small firm and robustness of over 30 years experience collectively within the leadership team Technology

More information

IOTA ARCHITECTURE: DATA VIRTUALIZATION AND PROCESSING MEDIUM DR. KONSTANTIN BOUDNIK DR. ALEXANDRE BOUDNIK

IOTA ARCHITECTURE: DATA VIRTUALIZATION AND PROCESSING MEDIUM DR. KONSTANTIN BOUDNIK DR. ALEXANDRE BOUDNIK IOTA ARCHITECTURE: DATA VIRTUALIZATION AND PROCESSING MEDIUM DR. KONSTANTIN BOUDNIK DR. ALEXANDRE BOUDNIK DR. KONSTANTIN BOUDNIK DR.KONSTANTIN BOUDNIK EPAM SYSTEMS CHIEF TECHNOLOGIST BIGDATA, OPEN SOURCE

More information

A Layered Architecture for Enterprise Data Warehouse Systems

A Layered Architecture for Enterprise Data Warehouse Systems A Layered Architecture for Enterprise Warehouse Systems Thorsten Winsemann 1,2, Veit Köppen 2, and Gunter Saake 2 1 SAP Deutschland AG & Co. KG, Großer Grasbrook 17, 22047 Hamburg, Germany Thorsten.Winsemann@t-online.de

More information

Migrate from Netezza Workload Migration

Migrate from Netezza Workload Migration Migrate from Netezza Automated Big Data Open Netezza Source Workload Migration CASE SOLUTION STUDY BRIEF Automated Netezza Workload Migration To achieve greater scalability and tighter integration with

More information

SAP Agile Data Preparation Simplify the Way You Shape Data PUBLIC

SAP Agile Data Preparation Simplify the Way You Shape Data PUBLIC SAP Agile Data Preparation Simplify the Way You Shape Data Introduction SAP Agile Data Preparation Overview Video SAP Agile Data Preparation is a self-service data preparation application providing data

More information

JAVASCRIPT CHARTING. Scaling for the Enterprise with Metric Insights Copyright Metric insights, Inc.

JAVASCRIPT CHARTING. Scaling for the Enterprise with Metric Insights Copyright Metric insights, Inc. JAVASCRIPT CHARTING Scaling for the Enterprise with Metric Insights 2013 Copyright Metric insights, Inc. A REVOLUTION IS HAPPENING... 3! Challenges... 3! Borrowing From The Enterprise BI Stack... 4! Visualization

More information

Implement a Data Warehouse with Microsoft SQL Server

Implement a Data Warehouse with Microsoft SQL Server Implement a Data Warehouse with Microsoft SQL Server 20463D; 5 days, Instructor-led Course Description This course describes how to implement a data warehouse platform to support a BI solution. Students

More information

Optimized Data Integration for the MSO Market

Optimized Data Integration for the MSO Market Optimized Data Integration for the MSO Market Actions at the speed of data For Real-time Decisioning and Big Data Problems VelociData for FinTech and the Enterprise VelociData s technology has been providing

More information

COGNOS DYNAMIC CUBES: SET TO RETIRE TRANSFORMER? Update: Pros & Cons

COGNOS DYNAMIC CUBES: SET TO RETIRE TRANSFORMER? Update: Pros & Cons COGNOS DYNAMIC CUBES: SET TO RETIRE TRANSFORMER? 10.2.2 Update: Pros & Cons GoToWebinar Control Panel Submit questions here Click arrow to restore full control panel Copyright 2015 Senturus, Inc. All Rights

More information

BPS Suite and the OCEG Capability Model. Mapping the OCEG Capability Model to the BPS Suite s product capability.

BPS Suite and the OCEG Capability Model. Mapping the OCEG Capability Model to the BPS Suite s product capability. BPS Suite and the OCEG Capability Model Mapping the OCEG Capability Model to the BPS Suite s product capability. BPS Contents Introduction... 2 GRC activities... 2 BPS and the Capability Model for GRC...

More information

20463C-Implementing a Data Warehouse with Microsoft SQL Server. Course Content. Course ID#: W 35 Hrs. Course Description: Audience Profile

20463C-Implementing a Data Warehouse with Microsoft SQL Server. Course Content. Course ID#: W 35 Hrs. Course Description: Audience Profile Course Content Course Description: This course describes how to implement a data warehouse platform to support a BI solution. Students will learn how to create a data warehouse 2014, implement ETL with

More information

The Long and Winding Road from CDI to Data Governance

The Long and Winding Road from CDI to Data Governance The Long and Winding Road from CDI to Data Governance Thomas V. Carlock VP Corporate Data Management Tuesday, November 27, 2007 Agenda Introduction About CIT The Road to CDI Using Data Governance Light

More information

DATA WAREHOUSE PART LX: PROJECT MANAGEMENT ANDREAS BUCKENHOFER, DAIMLER TSS

DATA WAREHOUSE PART LX: PROJECT MANAGEMENT ANDREAS BUCKENHOFER, DAIMLER TSS A company of Daimler AG LECTURE @DHBW: DATA WAREHOUSE PART LX: PROJECT MANAGEMENT ANDREAS BUCKENHOFER, DAIMLER TSS ABOUT ME Andreas Buckenhofer Senior DB Professional andreas.buckenhofer@daimler.com Since

More information

Session 4112 BW NLS Data Archiving: Keeping BW in Tip-Top Shape for SAP HANA. Sandy Speizer, PSEG SAP Principal Architect

Session 4112 BW NLS Data Archiving: Keeping BW in Tip-Top Shape for SAP HANA. Sandy Speizer, PSEG SAP Principal Architect Session 4112 BW NLS Data Archiving: Keeping BW in Tip-Top Shape for SAP HANA Sandy Speizer, PSEG SAP Principal Architect Public Service Enterprise Group PSEG SAP ECC (R/3) Core Implementation SAP BW Implementation

More information

Randy House Vice President of Health Informatics. Saint Luke s Health System. Lynsey McNeal Director Data of Governance. Saint Luke s Health System

Randy House Vice President of Health Informatics. Saint Luke s Health System. Lynsey McNeal Director Data of Governance. Saint Luke s Health System Randy House Vice President of Health Informatics Saint Luke s Health System Lynsey McNeal Director Data of Governance Saint Luke s Health System Advancing along the Information Governance Maturity Curve

More information

PERSPECTIVE. Data Virtualization A Potential Antidote for Big Data Growing Pains. Abstract

PERSPECTIVE. Data Virtualization A Potential Antidote for Big Data Growing Pains. Abstract PERSPECTIVE Data Virtualization A Potential Antidote for Big Data Growing Pains Abstract Enterprises are already facing challenges around data consolidation, heterogeneity, quality, and value. Now they

More information

Welcome. Lyubomira Mihaylova Business Development Manager. M.: October 2012

Welcome. Lyubomira Mihaylova Business Development Manager. M.: October 2012 Welcome Lyubomira Mihaylova Business Development Manager lyubomira@scalefocus.com M.: +359 885 635 887 17 October 2012 Copyright 2012, Scale Focus AD, www.scalefocus.com About ScaleFocus Fastest growing

More information

Implementing a Data Warehouse with Microsoft SQL Server

Implementing a Data Warehouse with Microsoft SQL Server Course 20463C: Implementing a Data Warehouse with Microsoft SQL Server Page 1 of 6 Implementing a Data Warehouse with Microsoft SQL Server Course 20463C: 4 days; Instructor-Led Introduction This course

More information

Automating IT Asset Visualisation

Automating IT Asset Visualisation P a g e 1 It s common sense to know what IT assets you have and to manage them through their lifecycle as part of the IT environment. In practice, asset management is often separate to the planning, operations

More information

Evolution of Big Data Facebook. Architecture Summit, Shenzhen, August 2012 Ashish Thusoo

Evolution of Big Data Facebook. Architecture Summit, Shenzhen, August 2012 Ashish Thusoo Evolution of Big Data Architectures@ Facebook Architecture Summit, Shenzhen, August 2012 Ashish Thusoo About Me Currently Co-founder/CEO of Qubole Ran the Data Infrastructure Team at Facebook till 2011

More information

Data ownership within governance: getting it right

Data ownership within governance: getting it right Data ownership within governance: getting it right Control your data An Experian white paper Data Ownership within Governance : Getting it right - 1 Table of contents 1. Introduction 03 2. Why is data

More information

Data Analytics at Logitech Snowflake + Tableau = #Winning

Data Analytics at Logitech Snowflake + Tableau = #Winning Welcome # T C 1 8 Data Analytics at Logitech Snowflake + Tableau = #Winning Avinash Deshpande I am a futurist, scientist, engineer, designer, data evangelist at heart Find me at Avinash Deshpande Chief

More information

Virtualizing the SAP Infrastructure through Grid Technology. WHITE PAPER March 2007

Virtualizing the SAP Infrastructure through Grid Technology. WHITE PAPER March 2007 Virtualizing the SAP Infrastructure through Grid Technology WHITE PAPER March 2007 TABLE OF CONTENTS TABLE OF CONTENTS 2 Introduction 3 The Complexity of the SAP Landscape 3 Specific Pain Areas 4 Virtualizing

More information

iway Software: Information Management and roadmap Transforming data into business value

iway Software: Information Management and roadmap Transforming data into business value iway Software: Information Management and roadmap Transforming data into business value Fateh NAILI Enterprise Solutions Manager August, 23 rd 2018 Agenda Introduction and Context Information Builders

More information

The #1 Key to Removing the Chaos. in Modern Analytical Environments

The #1 Key to Removing the Chaos. in Modern Analytical Environments October/2018 Advanced Data Lineage: The #1 Key to Removing the Chaos in Modern Analytical Environments Claudia Imhoff, Ph.D. Sponsored By: Table of Contents Executive Summary... 1 Data Lineage Introduction...

More information

#mstrworld. Analyzing Multiple Data Sources with Multisource Data Federation and In-Memory Data Blending. Presented by: Trishla Maru.

#mstrworld. Analyzing Multiple Data Sources with Multisource Data Federation and In-Memory Data Blending. Presented by: Trishla Maru. Analyzing Multiple Data Sources with Multisource Data Federation and In-Memory Data Blending Presented by: Trishla Maru Agenda Overview MultiSource Data Federation Use Cases Design Considerations Data

More information

Modern Data Warehouse The New Approach to Azure BI

Modern Data Warehouse The New Approach to Azure BI Modern Data Warehouse The New Approach to Azure BI History On-Premise SQL Server Big Data Solutions Technical Barriers Modern Analytics Platform On-Premise SQL Server Big Data Solutions Modern Analytics

More information

Is Your Project in Trouble on System Performance?

Is Your Project in Trouble on System Performance? Is Your Project in Trouble on System Performance? Charles Chow May 2017 Is SATURN Your Project 2017 in Trouble - Is Your on System Project Performance? in Trouble on System Performance? May 2017 1 4, [Copyright

More information

Informatica Data Quality Product Family

Informatica Data Quality Product Family Brochure Informatica Product Family Deliver the Right Capabilities at the Right Time to the Right Users Benefits Reduce risks by identifying, resolving, and preventing costly data problems Enhance IT productivity

More information

Solving the Enterprise Data Dilemma

Solving the Enterprise Data Dilemma Solving the Enterprise Data Dilemma Harmonizing Data Management and Data Governance to Accelerate Actionable Insights Learn More at erwin.com Is Our Company Realizing Value from Our Data? If your business

More information

Bull Fast Track/PDW and Big Data

Bull Fast Track/PDW and Big Data Bull Fast Track/PDW and Big Data Add High Performance BI to your Big Data Roger Van Unen Expert Microsoft / BI roger.van-unen@bull.net http://www.bull.fr/bi/fastrack.html Michael Schmitter BI Sales Germany

More information

Data Quality for PowerCenter Users: Expanding Beyond ETL. Marina Grebenkova Principal Product Manager Informatica

Data Quality for PowerCenter Users: Expanding Beyond ETL. Marina Grebenkova Principal Product Manager Informatica Data Quality for PowerCenter Users: Expanding Beyond ETL Marina Grebenkova Principal Product Manager Informatica 2 Agenda Do you trust your data? What is Data Quality? Data Quality process How it complements

More information

1 DATAWAREHOUSING QUESTIONS by Mausami Sawarkar

1 DATAWAREHOUSING QUESTIONS by Mausami Sawarkar 1 DATAWAREHOUSING QUESTIONS by Mausami Sawarkar 1) What does the term 'Ad-hoc Analysis' mean? Choice 1 Business analysts use a subset of the data for analysis. Choice 2: Business analysts access the Data

More information

Introduction to DWML. Christian Thomsen, Aalborg University. Slides adapted from Torben Bach Pedersen and Man Lung Yiu

Introduction to DWML. Christian Thomsen, Aalborg University. Slides adapted from Torben Bach Pedersen and Man Lung Yiu Introduction to DWML Christian Thomsen, Aalborg University Slides adapted from Torben Bach Pedersen and Man Lung Yiu Course Structure Business intelligence Extract knowledge from large amounts of data

More information

The Truth About Test Data Management & Its Impact on Agile Development

The Truth About Test Data Management & Its Impact on Agile Development The Truth About Test Data Management & Its Impact on Agile Development The Truth About Test Data Management and its Impact on Agile Development Despite the agile methods and automated functionality you

More information

End to End Analysis on System z IBM Transaction Analysis Workbench for z/os. James Martin IBM Tools Product SME August 10, 2015

End to End Analysis on System z IBM Transaction Analysis Workbench for z/os. James Martin IBM Tools Product SME August 10, 2015 End to End Analysis on System z IBM Transaction Analysis Workbench for z/os James Martin IBM Tools Product SME August 10, 2015 Please note IBM s statements regarding its plans, directions, and intent are

More information