Data Quality Architecture and Options
|
|
- Liliana Gallagher
- 5 years ago
- Views:
Transcription
1 Data Quality Architecture and Options Nita Khare Alliances & Technology Team - Solution Architect nita.khare@tcs.com * IBM IM Champion 2013 * December 3,
2 Agenda Pain Areas / Challenges of DQ Solution DQ Solution DQ Architecture Options Near Real-time/Inline DQ management Solution Standardization process Other Important DQ Processes Benefits 1
3 Why is DQ important? A Simple Pizza & Beer Order Receipt Date format not known This can mislead while calculating the sales numbers. Suffix Spring Ale added to the beer description by default - This data quality issue impacts the store s inventory and procurement systems. Look at the age It s a default nos. Not capturing the correct age range of the buyer During analysis, we will get results depending on the current year only. i.e. say if default nos. mentioned is 11/22/1990 and if we do analysis today i.e. 11/22/2013, we will think that the buyers are of age somewhere around 23 years which is not at all true. Currency format is not mentioned Sales Nos. can go wrong. Source : 2
4 Pain Areas of DQ Solution Providing high performance for ad-hoc queries Identification of Business Areas Managing Large Volumes of Data Resolving data quality issues & survival policy decisions Nos. and types of Technology Involved Handling complex relationship in data Large Nos. of Unmanaged Data sources Data Availability & Cleansing 3
5 DQ Solution DQ Management Approach DMAIC is a 5 step iterative approach to data quality improvement. It comprises of continuous analysis, observation, and improvement of underlying data leading to overall improvement in the quality of information across the organization 4
6 DQ Solution A typical DQ Lifecycle The ultimate goal of DQ management should be to move from reactive mode of data quality management to proactively control and manage the data quality so that the data imperfections in the systems are limited. 5
7 DQ Solution Factors Influencing DQ All the below mentioned factors (LUCAS) need to be addressed in order to ensure quality data available for analysis to the end users. 6
8 DQ Solution DQ Reference Architecture 7
9 DQ Architecture Options 8
10 DQ Architecture Options Pros & Cons Consideration Points Option 1: Source System Option 2: ETL Layer Option 3: Target Layer Data cleansing effort and cost Ensures quality data is available at the place where it is captured and hence minimal data quality impact on downstream applications. Cost and effort of DQ exercise increases as we move away from the source system. More Expensive Data Load The data load may be delayed as the DQ checks needs to be applied in the source system before the data is ready to be loaded into target layers. The more the DQ checks, the more would be the impact on the data load. But if designed optimally, there might not be much impact. The data load is very quick as the DQ checks are applied after the loads into DWH. Impact on Source System DQ processes may become an overhead to the operational system. Less impact to source operational system compared to option 1 Minimal impact to source operational system performance Heterogeneous source systems An additional overhead of implementing data quality processes and procedures in multiple platforms Less impact to source operational system compared to option 1. Minimal impact to source operational system performance 9
11 Near Real-time/Inline DQ management Solution Option 1: Data Quality Management using Hadoop Option 2: Data Quality Management using Database resources Consideration Points Option 1: Managing DQ solution using Big Data technology Option 2: Managing Data Quality Solutions using Database Resources Volume Best suited for very high volume data. It works well with low to medium data volumes. Update Frequency Efficiently handles frequent changed records Source data quality / Very efficiently handles multiple data No. of DQ Rules quality checks and controls during data load process. Data Load SLA Capable of loading high volume data in stipulated time. Cost Cost of implementation is cheaper compared to Option 2. Maintenance / Support As big data technologies, especially requires coding, debugging and applying fixes are more time consuming and costly. Capable of handling frequently changed records. Performance bottleneck possible with volume growth and more number of DQ checks. Capable of loading high volume data in stipulated time but can crumble with data growth. Cost of implementation is higher compared to Option 1 Maintainability, debugging and fixes are quicker compared to Option 1. Expert availability As big data technologies are still emerging, there could be difficulty in getting big data skilled associate. ETL experts are easily available. 10
12 DQ Solution Data Governance Council 11
13 Standardization process Cleansing and standardization of data is achieved by set of transformations, where organizations data process through each of these stages for better data cleansing and standardization. 12
14 Standardization process Technical Integration 13
15 Other Important DQ Processes Error Handling This is one of the ways of implementing Error handling solution in any ETL architecture One common error table can be created to capture and store exceptions while loading data in downstream systems. When records are rejected due to data quality issues (validation errors) they will be logged in the exception database. In case, there is agreed default value provided by Business for the source columns not holding valid data, the same will be loaded into the target table. Performance overhead will be minimal as exception records will be low in volume in incremental scenario and there would be mostly all inserts in the exceptions table. 14
16 Sample Error Handling Dashboard 15
17 Reconciliation Data reconciliation is performed to verify the integrity of the data loaded into the warehouse. One of the major reasons of information loss is loading failures or errors during loading. Such errors can occur due to several reasons: Inconsistent or non coherent data from source Non-integrating data among different sources Unclean/ non-profiled data Un-handled exceptions Constraint violations Logical issues/ Inherent flaws in program Technical failures like loss of connectivity, loss over network, space issue etc. Reconciliation process will only indicate whether or not the data is correct. It will not indicate why the data is not correct. Reconciliation process answers what part of the question, not why part of the question. Typically they are implemented to ensure that: SUM/COUNT (Input) = SUM/COUNT (Output) + SUM/COUNT (Captured Reject) 16
18 Types Of Reconciliation Transactional Reconciliation Matching the number of records in source and in target. If these counts are equal, it can be safely assumed that records were not left out due to an error during the ETL or simple load process. This can be further verified by the lack of errors (not necessarily warnings) in the exception reporting by the ETL tool. Financial Reconciliation This checks on the data content in source and target. E.g. computing the sum of Amount column in all records at source and target and matching the same. Financial reconciliation will be performed before the data load starts into EDW. Financial reconciliation, if required, can be implemented at specific job level after identifying columns to be reconciled for a source system batch run. 17
19 Benefits Central repository of enterprise data and single version of truth across the enterprise providing Unified Information Delivery platform Data is complete, accurate and consistent in the target system enabling better confidence in decision making. Consolidation of business logic at enterprise level, to remove discrepancies and standardize the data set. Provide business users & data stewards a clear picture of their data quality, monitor, track and govern information over time. Improves data standardization through common data governance framework at the enterprise level. Provides architectural options for implementing DQ solution. The cost of DQM increases as we move from Source to Target. Hence it is advisable to apply DQM solution in the source or nearer to the source in order to reduce the DQM effort. 18
20 THANK YOU 19
Luncheon Webinar Series January 13th, Free is Better Presented by Tony Curcio and Beate Porst Sponsored By:
Luncheon Webinar Series January 13th, 2014 Free is Better Presented by Tony Curcio and Beate Porst Sponsored By: 1 Free is Better Questions and suggestions regarding presentation topics? - send to editor@dsxchange.com
More informationFast Innovation requires Fast IT
Fast Innovation requires Fast IT Cisco Data Virtualization Puneet Kumar Bhugra Business Solutions Manager 1 Challenge In Data, Big Data & Analytics Siloed, Multiple Sources Business Outcomes Business Opportunity:
More informationFull file at
Chapter 2 Data Warehousing True-False Questions 1. A real-time, enterprise-level data warehouse combined with a strategy for its use in decision support can leverage data to provide massive financial benefits
More informationWork Breakdown Structure
MossAtre.book Page 491 Sunday, February 9, 2003 7:01 PM APPENDIX Work Breakdown Structure The work breakdown structure in this appendix reflects the contents of the enclosed CD-ROM. TASK_DATA 491 1 Your
More informationData Mining & Data Warehouse
Data Mining & Data Warehouse Asso. Profe. Dr. Raed Ibraheem Hamed University of Human Development, College of Science and Technology Department of Information Technology 2016 2017 (1) Points to Cover Problem:
More informationData Mining. Asso. Profe. Dr. Raed Ibraheem Hamed. University of Human Development, College of Science and Technology Department of CS (1)
Data Mining Asso. Profe. Dr. Raed Ibraheem Hamed University of Human Development, College of Science and Technology Department of CS 2016 2017 (1) Points to Cover Problem: Heterogeneous Information Sources
More informationData Vault Brisbane User Group
Data Vault Brisbane User Group 26-02-2013 Agenda Introductions A brief introduction to Data Vault Creating a Data Vault based Data Warehouse Comparisons with 3NF/Kimball When is it good for you? Examples
More informationDEVELOPING DECISION SUPPORT SYSTEMS A MODERN APPROACH
DEVELOPING DECISION SUPPORT SYSTEMS A MODERN APPROACH Ion Lungu PhD, Vlad Diaconiţa PhD Candidate Department of Economic Informatics Academy of Economic Studies Bucharest In today s economy access to quality
More informationAVOIDING SILOED DATA AND SILOED DATA MANAGEMENT
AVOIDING SILOED DATA AND SILOED DATA MANAGEMENT Dalton Cervo Author, Consultant, Data Management Expert March 2016 This presentation contains extracts from books that are: Copyright 2011 John Wiley & Sons,
More informationData Warehousing in the Age of In-Memory Computing and Real-Time Analytics. Erich Schneider, Daniel Rutschmann June 2014
Data Warehousing in the Age of In-Memory Computing and Real-Time Analytics Erich Schneider, Daniel Rutschmann June 2014 Disclaimer This presentation outlines our general product direction and should not
More informationMDM Partner Summit 2015 Oracle Enterprise Data Quality Overview & Roadmap
MDM Partner Summit 2015 Oracle Enterprise Data Quality Overview & Roadmap Steve Tuck Senior Director, Product Strategy Todd Blackmon Senior Director, Sales Consulting David Gengenbach Sales Consultant
More informationDATAWAREHOUSING AND ETL PROCESSES: An Explanatory Research
DATAWAREHOUSING AND ETL PROCESSES: An Explanatory Research Priyanshu Gupta ETL Software Developer United Health Group Abstract- In this paper, the author has focused on explaining Data Warehousing and
More informationLow Friction Data Warehousing WITH PERSPECTIVE ILM DATA GOVERNOR
Low Friction Data Warehousing WITH PERSPECTIVE ILM DATA GOVERNOR Table of Contents Foreword... 2 New Era of Rapid Data Warehousing... 3 Eliminating Slow Reporting and Analytics Pains... 3 Applying 20 Years
More informationData Virtualization Implementation Methodology and Best Practices
White Paper Data Virtualization Implementation Methodology and Best Practices INTRODUCTION Cisco s proven Data Virtualization Implementation Methodology and Best Practices is compiled from our successful
More informationThe Data Catalog The Key to Managing Data, Big and Small. April Reeve May
The Data Catalog The Key to Managing Data, Big and Small April Reeve May 18 2017 April Reeve Thirty years doing data oriented stuff Data Management disciplines Data Integration, Data Governance, Data Modeling,
More information@Pentaho #BigDataWebSeries
Enterprise Data Warehouse Optimization with Hadoop Big Data @Pentaho #BigDataWebSeries Your Hosts Today Dave Henry SVP Enterprise Solutions Davy Nys VP EMEA & APAC 2 Source/copyright: The Human Face of
More informationData Management Glossary
Data Management Glossary A Access path: The route through a system by which data is found, accessed and retrieved Agile methodology: An approach to software development which takes incremental, iterative
More informationMetadata Management as a Key Component to Data Governance, Data Stewardship, and Data Quality Management. Wednesday, July 20 th 2016
Metadata Management as a Key Component to Data Governance, Data Stewardship, and Data Quality Management Wednesday, July 20 th 2016 Confidential, Datasource Consulting, LLC 2 Multi-Domain Master Data Management
More informationRevolutionize the Way You Work With IMS Applications Using IBM UrbanCode Deploy Evgeni Liakhovich, IMS Developer
Revolutionize the Way You Work With IMS Applications Using IBM UrbanCode Deploy Evgeni Liakhovich, IMS Developer evgueni@us.ibm.com * 2016 IBM Corporation Trademarks, copyrights, disclaimers IBM, the IBM
More informationIntroduction to Data Science
UNIT I INTRODUCTION TO DATA SCIENCE Syllabus Introduction of Data Science Basic Data Analytics using R R Graphical User Interfaces Data Import and Export Attribute and Data Types Descriptive Statistics
More informationData Warehouses and Deployment
Data Warehouses and Deployment This document contains the notes about data warehouses and lifecycle for data warehouse deployment project. This can be useful for students or working professionals to gain
More informationA Single Source of Truth
A Single Source of Truth is it the mythical creature of data management? In the world of data management, a single source of truth is a fully trusted data source the ultimate authority for the particular
More informationBuilding a Data Warehouse: Data Quality is key for BI. Werner Daehn
[ Building a Data Warehouse: Data Quality is key for BI Werner Daehn [ Learning Points A DWH project is about discovering new information Not having a good quality counterfeits that purpose Actually it
More informationTDWI Data Modeling. Data Analysis and Design for BI and Data Warehousing Systems
Data Analysis and Design for BI and Data Warehousing Systems Previews of TDWI course books offer an opportunity to see the quality of our material and help you to select the courses that best fit your
More informationWhat s the Value of Your Data? The Agile Advantage
What s the Value of Your Data? The Agile Advantage by Jan Paul Fillie and Werner de Jong In a world of big data, advanced analytics, in-memory data warehousing, and real-time business intelligence (BI),
More informationImplementing a Data Warehouse with Microsoft SQL Server 2014 (20463D)
Implementing a Data Warehouse with Microsoft SQL Server 2014 (20463D) Overview This course describes how to implement a data warehouse platform to support a BI solution. Students will learn how to create
More informationMusewerx support for Application Maintenance in Software AG NATURAL and ADABAS TM environment
Musewerx support for Application Maintenance in Software AG NATURAL and ADABAS TM environment Musewerx provides Application Maintenance Services for your applications written in NATURAL and ADABAS environment.
More informationTECHNOLOGY BRIEF: CA ERWIN DATA PROFILER. Combining Data Profiling and Data Modeling for Better Data Quality
TECHNOLOGY BRIEF: CA ERWIN DATA PROFILER Combining Data Profiling and Data Modeling for Better Data Quality Table of Contents Executive Summary SECTION 1: CHALLENGE 2 Reducing the Cost and Risk of Data
More informationDeploying IMS Applications with IBM UrbanCode Deploy
Deploying IMS Applications with IBM UrbanCode Deploy Evgeni Liakhovich, IMS Develper evgueni@us.ibm.com * IMS Technical Symposium 2015 Trademarks, copyrights, disclaimers IBM, the IBM logo, and ibm.com
More informationIBM Software IBM InfoSphere Information Server for Data Quality
IBM InfoSphere Information Server for Data Quality A component index Table of contents 3 6 9 9 InfoSphere QualityStage 10 InfoSphere Information Analyzer 12 InfoSphere Discovery 13 14 2 Do you have confidence
More informationData Modeling Whitepaper DATA MODELING IS A FORM OF DATA GOVERNANCE BY ROBERT S. SEINER
Data Modeling Whitepaper DATA MODELING IS A FORM OF DATA GOVERNANCE BY ROBERT S. SEINER TABLE OF CONTENTS 3 Introduction 4 Three Actions of Governing Data 4 Governing the Action of Defining Data 5 Relating
More informationIntroduction 1.1 SERVER-CENTRIC IT ARCHITECTURE AND ITS LIMITATIONS
1 Introduction The purpose of this chapter is to convey the basic idea underlying this book. To this end we will first describe conventional server-centric IT architecture and sketch out its limitations
More informationThe Six Principles of BW Data Validation
The Problem The Six Principles of BW Data Validation Users do not trust the data in your BW system. The Cause By their nature, data warehouses store large volumes of data. For analytical purposes, the
More informationRealizing the Full Potential of MDM 1
Realizing the Full Potential of MDM SOLUTION MDM Augmented with Data Virtualization INDUSTRY Applicable to all Industries EBSITE www.denodo.com PRODUCT OVERVIE The Denodo Platform offers the broadest access
More informationEnterprise Data-warehouse (EDW) In Easy Steps
Enterprise Data-warehouse (EDW) In Easy Steps Data-warehouses (DW) are centralised data repositories that integrate data from various transactional, legacy, or external systems, applications, and sources.
More informationThree requirements for reducing performance issues and unplanned downtime in any data center
Three requirements for reducing performance issues and unplanned downtime in any data center DARRYL FUJITA TECHNICAL SOFTWARE SOLUTIONS SPECIALIST HITACHI DATA SYSTEMS How Big Is The Cost Of Unplanned
More informationMicrosoft Implementing a Data Warehouse with Microsoft SQL Server 2014
1800 ULEARN (853 276) www.ddls.com.au Microsoft 20463 - Implementing a Data Warehouse with Microsoft SQL Server 2014 Length 5 days Price $4290.00 (inc GST) Version D Overview Please note: Microsoft have
More informationData Management Framework
The Organization Management Framework Created and Presented By Copyright 2018 Management Is part of the Manage Knowledge, Improvement and Change process of the APQC Process Classification Framework (wwwapqcorg)
More informationMaking the Impossible Possible
Making the Impossible Possible Find and Eliminate Data Errors with Automated Discovery and Data Lineage Introduction Organizations have long struggled to identify and take advantage of opportunities for
More informationDatabase Architectures
Database Architectures CPS352: Database Systems Simon Miner Gordon College Last Revised: 4/15/15 Agenda Check-in Parallelism and Distributed Databases Technology Research Project Introduction to NoSQL
More informationModernizing Business Intelligence and Analytics
Modernizing Business Intelligence and Analytics Justin Erickson Senior Director, Product Management 1 Agenda What benefits can I achieve from modernizing my analytic DB? When and how do I migrate from
More informationData Mining: Approach Towards The Accuracy Using Teradata!
Data Mining: Approach Towards The Accuracy Using Teradata! Shubhangi Pharande Department of MCA NBNSSOCS,Sinhgad Institute Simantini Nalawade Department of MCA NBNSSOCS,Sinhgad Institute Ajay Nalawade
More informationEffective Risk Data Aggregation & Risk Reporting
Effective Risk Data Aggregation & Risk Reporting Presented by: Ilia Bolotine Head, Adastra Business Consulting (Canada) 1 The Evolving Regulatory Landscape in Risk Management A significant lesson learned
More informationDATACENTER SERVICES DATACENTER
SERVICES SOLUTION SUMMARY ALL CHANGE React, grow and innovate faster with Computacenter s agile infrastructure services Customers expect an always-on, superfast response. Businesses need to release new
More informationLambda Architecture for Batch and Stream Processing. October 2018
Lambda Architecture for Batch and Stream Processing October 2018 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Notices This document is provided for informational purposes only.
More informationData Migration Platform
appmigrate TM Data Migration Platform QUALITY MANAGEMENT RECONCILIATION MAINTENANCE MIGRATION PROFILING COEXISTENCE CUTOVER PLANNING AND EXECUTION ? Problem Data migration done in the traditional way,
More informationDecision Support. Go-Live Update
Decision Support Go-Live Update UNLV Decision Support Purpose is to provide knowledgeable campus users with access to data for decision making. Principles embraced: Involvement of decision makers and
More informationAutomated Testing of Tableau Dashboards
Kinesis Technical Whitepapers April 2018 Kinesis CI Automated Testing of Tableau Dashboards Abstract Companies make business critical decisions every day, based on data from their business intelligence
More informationCisco EnergyWise: Power Management Without Borders
Cisco EnergyWise: Power Management Without Borders Introduction In response to energy costs, environmental concerns, and government directives, there is an increased need for sustainable and green business
More informationIBM InfoSphere Information Analyzer
IBM InfoSphere Information Analyzer Understand, analyze and monitor your data Highlights Develop a greater understanding of data source structure, content and quality Leverage data quality rules continuously
More informationThe Data Organization
C V I T F E P A O TM The Organization 1251 Yosemite Way Hayward, CA 94545 (510) 303-8868 rschoenrank@computerorg / Warehouse Strategies By Rainer Schoenrank Warehouse Consultant January 2018 / Warehouse
More informationIT Monitoring Tool Gaps are Impacting the Business A survey of IT Professionals and Executives
IT Monitoring Tool Gaps are Impacting the Business A survey of IT Professionals and Executives June 2018 1 Executive Summary This research finds that large enterprise customers and employees endure a substantial
More informationHow to integrate data into Tableau
1 How to integrate data into Tableau a comparison of 3 approaches: ETL, Tableau self-service and WHITE PAPER WHITE PAPER 2 data How to integrate data into Tableau a comparison of 3 es: ETL, Tableau self-service
More informationImproving Data Governance in Your Organization. Faire Co Regional Manger, Information Management Software, ASEAN
Improving Data Governance in Your Organization Faire Co Regional Manger, Information Management Software, ASEAN Topics The Innovation Imperative and Innovating with Information What Is Data Governance?
More informationETL AUTO RECONCILIATION
ISSN 0975-3303 Mapana J Sci, 9, 2 (2010), 35-46 https://doi.org/10.12725/mjs.17.5 ETL AUTO RECONCILIATION Jibrael Jos* and Brogodishworon U** ABSTRACT Extraction, tro.nsformation and Loading (ETL) is the
More informationIBM DB2 Analytics Accelerator use cases
IBM DB2 Analytics Accelerator use cases Ciro Puglisi Netezza Europe +41 79 770 5713 cpug@ch.ibm.com 1 Traditional systems landscape Applications OLTP Staging Area ODS EDW Data Marts ETL ETL ETL ETL Historical
More informationThe EDW: An Overview For Foothill-De Anza Community College District
The EDW: An Overview For Foothill-De Anza Community College District R. Joanne Keys, SunGard Higher Education October, 2009 1 The Agenda Objective: Set the stage for a successful implementation of the
More informationThe Data Organization
C V I T F E P A O TM The Data Organization 1251 Yosemite Way Hayward, CA 94545 (510) 303-8868 rschoenrank@computer.org Business Intelligence Process Architecture By Rainer Schoenrank Data Warehouse Consultant
More informationDemystifying GRC. Abstract
White Paper Demystifying GRC Abstract Executives globally are highly focused on initiatives around Governance, Risk and Compliance (GRC), to improve upon risk management and regulatory compliances. Over
More informationAssociation for International PMOs. Expert. Practitioner. Foundation PMO. Learning.
AIPM Association for International PMOs Expert Practitioner Foundation www.pmolearning.co.uk PMO The Leading Standard and Certification for PMO Professionals Today Understand the Value of High-Performing
More informationImproving Your Business with Oracle Data Integration See How Oracle Enterprise Metadata Management Can Help You
Improving Your Business with Oracle Data Integration See How Oracle Enterprise Metadata Management Can Help You Özgür Yiğit Oracle Data Integration, Senior Manager, ECEMEA Safe Harbor Statement The following
More informationData Warehouse Testing. By: Rakesh Kumar Sharma
Data Warehouse Testing By: Rakesh Kumar Sharma Index...2 Introduction...3 About Data Warehouse...3 Data Warehouse definition...3 Testing Process for Data warehouse:...3 Requirements Testing :...3 Unit
More informationUKEF UK Export Finance. Transformation with the Microsoft Cloud
UKEF UK Export Finance Transformation with the Microsoft Cloud the customer overview Customer UKEF Website www.gov.uk/uk-export-finance Number of employees UKEF s fixed deadline had major financial implications,
More informationMERAKI SERVICE DESCRIPTION
MERAKI SERVICE DESCRIPTION Document Control Purpose of this Document The purpose of this document is to provide clear guidance on what the Meraki service will deliver to the end client. Document Contributors
More informationVirtuoso Infotech Pvt. Ltd.
Virtuoso Infotech Pvt. Ltd. About Virtuoso Infotech Fastest growing IT firm; Offers the flexibility of a small firm and robustness of over 30 years experience collectively within the leadership team Technology
More informationIOTA ARCHITECTURE: DATA VIRTUALIZATION AND PROCESSING MEDIUM DR. KONSTANTIN BOUDNIK DR. ALEXANDRE BOUDNIK
IOTA ARCHITECTURE: DATA VIRTUALIZATION AND PROCESSING MEDIUM DR. KONSTANTIN BOUDNIK DR. ALEXANDRE BOUDNIK DR. KONSTANTIN BOUDNIK DR.KONSTANTIN BOUDNIK EPAM SYSTEMS CHIEF TECHNOLOGIST BIGDATA, OPEN SOURCE
More informationA Layered Architecture for Enterprise Data Warehouse Systems
A Layered Architecture for Enterprise Warehouse Systems Thorsten Winsemann 1,2, Veit Köppen 2, and Gunter Saake 2 1 SAP Deutschland AG & Co. KG, Großer Grasbrook 17, 22047 Hamburg, Germany Thorsten.Winsemann@t-online.de
More informationMigrate from Netezza Workload Migration
Migrate from Netezza Automated Big Data Open Netezza Source Workload Migration CASE SOLUTION STUDY BRIEF Automated Netezza Workload Migration To achieve greater scalability and tighter integration with
More informationSAP Agile Data Preparation Simplify the Way You Shape Data PUBLIC
SAP Agile Data Preparation Simplify the Way You Shape Data Introduction SAP Agile Data Preparation Overview Video SAP Agile Data Preparation is a self-service data preparation application providing data
More informationJAVASCRIPT CHARTING. Scaling for the Enterprise with Metric Insights Copyright Metric insights, Inc.
JAVASCRIPT CHARTING Scaling for the Enterprise with Metric Insights 2013 Copyright Metric insights, Inc. A REVOLUTION IS HAPPENING... 3! Challenges... 3! Borrowing From The Enterprise BI Stack... 4! Visualization
More informationImplement a Data Warehouse with Microsoft SQL Server
Implement a Data Warehouse with Microsoft SQL Server 20463D; 5 days, Instructor-led Course Description This course describes how to implement a data warehouse platform to support a BI solution. Students
More informationOptimized Data Integration for the MSO Market
Optimized Data Integration for the MSO Market Actions at the speed of data For Real-time Decisioning and Big Data Problems VelociData for FinTech and the Enterprise VelociData s technology has been providing
More informationCOGNOS DYNAMIC CUBES: SET TO RETIRE TRANSFORMER? Update: Pros & Cons
COGNOS DYNAMIC CUBES: SET TO RETIRE TRANSFORMER? 10.2.2 Update: Pros & Cons GoToWebinar Control Panel Submit questions here Click arrow to restore full control panel Copyright 2015 Senturus, Inc. All Rights
More informationBPS Suite and the OCEG Capability Model. Mapping the OCEG Capability Model to the BPS Suite s product capability.
BPS Suite and the OCEG Capability Model Mapping the OCEG Capability Model to the BPS Suite s product capability. BPS Contents Introduction... 2 GRC activities... 2 BPS and the Capability Model for GRC...
More information20463C-Implementing a Data Warehouse with Microsoft SQL Server. Course Content. Course ID#: W 35 Hrs. Course Description: Audience Profile
Course Content Course Description: This course describes how to implement a data warehouse platform to support a BI solution. Students will learn how to create a data warehouse 2014, implement ETL with
More informationThe Long and Winding Road from CDI to Data Governance
The Long and Winding Road from CDI to Data Governance Thomas V. Carlock VP Corporate Data Management Tuesday, November 27, 2007 Agenda Introduction About CIT The Road to CDI Using Data Governance Light
More informationDATA WAREHOUSE PART LX: PROJECT MANAGEMENT ANDREAS BUCKENHOFER, DAIMLER TSS
A company of Daimler AG LECTURE @DHBW: DATA WAREHOUSE PART LX: PROJECT MANAGEMENT ANDREAS BUCKENHOFER, DAIMLER TSS ABOUT ME Andreas Buckenhofer Senior DB Professional andreas.buckenhofer@daimler.com Since
More informationSession 4112 BW NLS Data Archiving: Keeping BW in Tip-Top Shape for SAP HANA. Sandy Speizer, PSEG SAP Principal Architect
Session 4112 BW NLS Data Archiving: Keeping BW in Tip-Top Shape for SAP HANA Sandy Speizer, PSEG SAP Principal Architect Public Service Enterprise Group PSEG SAP ECC (R/3) Core Implementation SAP BW Implementation
More informationRandy House Vice President of Health Informatics. Saint Luke s Health System. Lynsey McNeal Director Data of Governance. Saint Luke s Health System
Randy House Vice President of Health Informatics Saint Luke s Health System Lynsey McNeal Director Data of Governance Saint Luke s Health System Advancing along the Information Governance Maturity Curve
More informationPERSPECTIVE. Data Virtualization A Potential Antidote for Big Data Growing Pains. Abstract
PERSPECTIVE Data Virtualization A Potential Antidote for Big Data Growing Pains Abstract Enterprises are already facing challenges around data consolidation, heterogeneity, quality, and value. Now they
More informationWelcome. Lyubomira Mihaylova Business Development Manager. M.: October 2012
Welcome Lyubomira Mihaylova Business Development Manager lyubomira@scalefocus.com M.: +359 885 635 887 17 October 2012 Copyright 2012, Scale Focus AD, www.scalefocus.com About ScaleFocus Fastest growing
More informationImplementing a Data Warehouse with Microsoft SQL Server
Course 20463C: Implementing a Data Warehouse with Microsoft SQL Server Page 1 of 6 Implementing a Data Warehouse with Microsoft SQL Server Course 20463C: 4 days; Instructor-Led Introduction This course
More informationAutomating IT Asset Visualisation
P a g e 1 It s common sense to know what IT assets you have and to manage them through their lifecycle as part of the IT environment. In practice, asset management is often separate to the planning, operations
More informationEvolution of Big Data Facebook. Architecture Summit, Shenzhen, August 2012 Ashish Thusoo
Evolution of Big Data Architectures@ Facebook Architecture Summit, Shenzhen, August 2012 Ashish Thusoo About Me Currently Co-founder/CEO of Qubole Ran the Data Infrastructure Team at Facebook till 2011
More informationData ownership within governance: getting it right
Data ownership within governance: getting it right Control your data An Experian white paper Data Ownership within Governance : Getting it right - 1 Table of contents 1. Introduction 03 2. Why is data
More informationData Analytics at Logitech Snowflake + Tableau = #Winning
Welcome # T C 1 8 Data Analytics at Logitech Snowflake + Tableau = #Winning Avinash Deshpande I am a futurist, scientist, engineer, designer, data evangelist at heart Find me at Avinash Deshpande Chief
More informationVirtualizing the SAP Infrastructure through Grid Technology. WHITE PAPER March 2007
Virtualizing the SAP Infrastructure through Grid Technology WHITE PAPER March 2007 TABLE OF CONTENTS TABLE OF CONTENTS 2 Introduction 3 The Complexity of the SAP Landscape 3 Specific Pain Areas 4 Virtualizing
More informationiway Software: Information Management and roadmap Transforming data into business value
iway Software: Information Management and roadmap Transforming data into business value Fateh NAILI Enterprise Solutions Manager August, 23 rd 2018 Agenda Introduction and Context Information Builders
More informationThe #1 Key to Removing the Chaos. in Modern Analytical Environments
October/2018 Advanced Data Lineage: The #1 Key to Removing the Chaos in Modern Analytical Environments Claudia Imhoff, Ph.D. Sponsored By: Table of Contents Executive Summary... 1 Data Lineage Introduction...
More information#mstrworld. Analyzing Multiple Data Sources with Multisource Data Federation and In-Memory Data Blending. Presented by: Trishla Maru.
Analyzing Multiple Data Sources with Multisource Data Federation and In-Memory Data Blending Presented by: Trishla Maru Agenda Overview MultiSource Data Federation Use Cases Design Considerations Data
More informationModern Data Warehouse The New Approach to Azure BI
Modern Data Warehouse The New Approach to Azure BI History On-Premise SQL Server Big Data Solutions Technical Barriers Modern Analytics Platform On-Premise SQL Server Big Data Solutions Modern Analytics
More informationIs Your Project in Trouble on System Performance?
Is Your Project in Trouble on System Performance? Charles Chow May 2017 Is SATURN Your Project 2017 in Trouble - Is Your on System Project Performance? in Trouble on System Performance? May 2017 1 4, [Copyright
More informationInformatica Data Quality Product Family
Brochure Informatica Product Family Deliver the Right Capabilities at the Right Time to the Right Users Benefits Reduce risks by identifying, resolving, and preventing costly data problems Enhance IT productivity
More informationSolving the Enterprise Data Dilemma
Solving the Enterprise Data Dilemma Harmonizing Data Management and Data Governance to Accelerate Actionable Insights Learn More at erwin.com Is Our Company Realizing Value from Our Data? If your business
More informationBull Fast Track/PDW and Big Data
Bull Fast Track/PDW and Big Data Add High Performance BI to your Big Data Roger Van Unen Expert Microsoft / BI roger.van-unen@bull.net http://www.bull.fr/bi/fastrack.html Michael Schmitter BI Sales Germany
More informationData Quality for PowerCenter Users: Expanding Beyond ETL. Marina Grebenkova Principal Product Manager Informatica
Data Quality for PowerCenter Users: Expanding Beyond ETL Marina Grebenkova Principal Product Manager Informatica 2 Agenda Do you trust your data? What is Data Quality? Data Quality process How it complements
More information1 DATAWAREHOUSING QUESTIONS by Mausami Sawarkar
1 DATAWAREHOUSING QUESTIONS by Mausami Sawarkar 1) What does the term 'Ad-hoc Analysis' mean? Choice 1 Business analysts use a subset of the data for analysis. Choice 2: Business analysts access the Data
More informationIntroduction to DWML. Christian Thomsen, Aalborg University. Slides adapted from Torben Bach Pedersen and Man Lung Yiu
Introduction to DWML Christian Thomsen, Aalborg University Slides adapted from Torben Bach Pedersen and Man Lung Yiu Course Structure Business intelligence Extract knowledge from large amounts of data
More informationThe Truth About Test Data Management & Its Impact on Agile Development
The Truth About Test Data Management & Its Impact on Agile Development The Truth About Test Data Management and its Impact on Agile Development Despite the agile methods and automated functionality you
More informationEnd to End Analysis on System z IBM Transaction Analysis Workbench for z/os. James Martin IBM Tools Product SME August 10, 2015
End to End Analysis on System z IBM Transaction Analysis Workbench for z/os James Martin IBM Tools Product SME August 10, 2015 Please note IBM s statements regarding its plans, directions, and intent are
More information