Semantics, Metadata and Identifying Master Data

Size: px
Start display at page:

Download "Semantics, Metadata and Identifying Master Data"

Transcription

1 Semantics, Metadata and Identifying Master Data A DataFlux White Paper Prepared by: David Loshin, President, Knowledge Integrity, Inc.

2 Once you have determined that your organization can achieve the benefits of integrating data quality and data governance through introducing a master data management (MDM) program, some typical early questions emerge, such as What architectural approaches will we take to deploy our MDM solution? or What are the business approaches for acquiring the appropriate tools and technologies required for MDM success? These are good questions, but they are often asked prematurely. Even before determining how to manage the enterprise master data asset, there are more fundamental questions that need to be asked and comprehensively explored, such as: What data elements constitute our master data? How do we locate and isolate master data objects that exist within the enterprise? How do we assess the variances between the different representations in order to consolidate instances into a single view? Because of the ways that diffused application architectures have evolved within different organizations, it is likely that while there are a relatively small number of core master objects used, there are many different ways that these objects are modeled, represented and stored. For example, any application that must manage contact information for individual customers will rely on a data model that maintains the customer s name. Yet one application will track an individual s full name, while others will break up the name into its first, middle and last parts. And even for those that track the given and family names of a customer will do it differently perform a quick scan of the data sets within your own organization and you are likely to find LAST_NAME attributes with a wide range of field lengths. Figure 1: Isolating master data from different data sets.

3 The challenges are not limited to determining what master objects are used. Indeed, the core requirement is to find where master objects are used and to chart a strategy for standardizing, harmonizing and consolidating them into a master repository or registry. When the intention is to create an organizational asset that is not just another data silo, it is imperative that your organization provide the means for both the consolidation and integration of master data and facilitate the most effective and appropriate sharing of that master data. What is Master Data? What are the characteristics of master data? So far, the industry has been better at describing master data but less adept at actually defining what master data is. As a description, master data objects are those core business objects that are used in the different applications across the organization, along with their associated metadata, attributes, definitions, roles, connections, and taxonomies. Master data objects are those things that we care about the things that are logged in our transaction systems, measured and reported on in our reporting systems, and analyzed in our analytical systems. Common examples of master data include: Customers Suppliers Parts Products Locations Contact mechanisms For example, consider the following transaction: David Loshin purchased seat 15B on flight 238 from BWI to SFO on July 20, Some of the master data elements in this example and their types are shown in Table 1. Master Data Object Customer Product Value Flight 238 Location Location David Loshin Seat 15B BWI SFO Table 1: Master data elements for a typical airline reservation. Aside from the above description, master data objects share certain characteristics: The real-world objects modeled within the environment as master data objects tend to be referenced in multiple business areas. For example, the concept of a vendor may exist in the finance application at the same time as in the procurements application.

4 Master data objects are referenced in both transaction and analytic system records. While the sales system may log and process the transactions initiated by a customer, those same activities may be analyzed for the purposes of segmentation and marketing. Master data objects may be classified within a semantic hierarchy, with different levels of classification, attribution and specialization applied depending on the application. For example, we may have a master data category of party, which in turn is comprised of individuals or organizations. Those parties may also be classified based on their roles, such as prospect, customer, supplier, vendor, or employee. Master data objects may require specialized application functions to create new instances, as well as manage the updating and removal of instance records. Each application that involves supplier interaction may have a function enabling the creation of a new supplier record. They are likely to have models reflected across multiple applications, possibly embedded in legacy data structure models. While we may see a natural hierarchy across one dimension, the taxonomies that are applied to our data instances may actually cross multiple hierarchies. For example, a party may be an individual, a customer and an employee simultaneously. In turn, the same master data categories and their related taxonomies would be used for transactions, analysis and reporting. For example, the headers in a monthly sales report may be derived from the master data categories (sales by customer by region by time period). Enabling the transactional systems to refer to the same data objects as the subsequent reporting systems ensures that the analysis reports are consistent with the transaction systems. Centralizing Semantic Metadata Master data may be sprinkled across the application environment. The objective of a master data management program is to facilitate the effective management of the set of master data instances as a single centralized master resource. But before we can materialize a single master record for any entity, we must be able to: 1. Discover which data resources may contain entity information 2. Understand which attributes carry identifying information 3. Extract identifying information from the data resource 4. Transform the identifying information into a standardized or canonical form 5. Establish similarity to other standardized records This entails cataloging the data sets, their attributes, formats, data domains, definitions, contexts and semantics, not just as an operational resource, but rather in a way that can be used to automate master data consolidation as well as governing the ongoing application interactions with the master repository. In other words, to be able to manage the master data, one must first be able to manage the master metadata. But as there is a need to resolve multiple variant models into a single view, the interaction with the master metadata must facilitate the resolution of three critical aspects:

5 Format at the element level Structure at the instance level Semantics across all levels. Figure 2: Preparation for a master data integration process must resolve the differences between the syntax, structure, and semantics of different source data sets. These are effectively three levels of integration that need to dovetail as a prelude to any kind of enterprise-wide integration, and introduces three corresponding challenges for master metadata management: 1. Collecting and analyzing master metadata 2. Resolving similarity in structure 3. Understanding and unifying master data semantics Challenge 1: Consolidating and Analyzing Master Metadata One approach is to analyze and document the metadata associated with all data objects across the enterprise and use that information to guide analysts seeking out master data. Many of the data sets may have documented some of the necessary metadata. For example, relational database systems allow for querying table structure and data element types, and COBOL copybooks reveal some structure and potentially even some alias data. Some of the data may have little or no documented metadata, such as fixed-format or character-separated files.

6 If the objective is to collect comprehensive and consistent metadata, as well as ensure that the data appropriately correlates to its documented metadata, we can use data profiling as the tool of choice. Because of its ability to apply both statistical and analytical algorithms for characterizing data sets, data profiling can drive the empirical assessment of structure and format metadata while simultaneously exposing of embedded data models and dependencies. Our consolidated metadata repository will eventually enumerate the relevant characteristics associated with each data set in a standardized way, including the data set name, its type (e.g., RDBMS table, VSAM file, CSV file) and the characteristics of each of its columns/attributes (e.g., length, data type or format pattern). At the end of this process, we will have more than simply a comprehensive catalog of all data sets. We will also be able to review the frequency of meta-model characteristics, such as frequently-used names, field sizes, and data types. Capturing these values with a standard representation allows the metadata characteristics themselves to be subjected to the kinds of statistical analysis that data profiling provides. For example, we can assess the dependencies between common attribute names (e.g., CUSTOMER ) and their assigned data types (e.g., VARCHAR[20]) to identify (and potentially standardize against) commonly-used types, sizes and formats. Challenge 2: Resolving Similarity in Structure Despite the expectations that there are many variant forms and structures for your organization s master data, the different underlying models of each master data object are bound to share many commonalities. For example, the structure for practically any residential customer table will contain a name, an address and a telephone number. On the other hand, almost any vendor table will probably also contain a name, an address and a telephone number. A closer look might suggest considering an underlying model concept of a party, used as the basis for both customer and vendor. In turn, the analyst might review any model that contains those same identifying attributes as a structure type that can be derived or is related to a party type. There are two aspects to structure similarity for the purpose of tracking down master data instances. The first is seeking out overlapping structures, in which the core attributes determined to carry identifying information for one data object overlap with a similar set of attributes in another data object. The second is identifying derived structures, in which one object s set of attributes are completely embedded within other data objects. Both cases indicate a structural relationship, and when related attributes carry identifying information, the analyst should review those objects to determine if they indeed represent master objects. Challenge 3: Unifying Semantics The third challenge focuses the qualitative difference between pure syntactic or structural metadata (as we can discover through the profiling process), and the underlying semantic metadata. This involves more than just analyzing structure similarity. It involves understanding what the data means, how that meaning is conveyed, how that meaning connects data sets across the enterprise, and approaches to capturing semantics as an attribute of your metadata framework. As a data set s metadata is collected, the semantic analyst must approach the business client to understand that data object s business meaning. One step in this process involves reviewing the degree of semantic consistency in data element naming is related to overlapping data types, sizes and structures. The next step is to

7 document the business meanings assumed for each of the data objects, which involves asking questions like: What are the definitions for the data elements? Or for the data sets themselves? Are there authoritative sources for the definitions? Do similar objects have different business meanings? The answers to these questions not only help in determining which data sets truly refer to the same underlying real-world objects, they also contribute to an organizational resource that can be used to standardize a representation for each data object as its definition is approved through the data governance process. Managing that semantic metadata as a central asset enables the metadata repository to grow in value as it consolidates semantics from different enterprise data collections. Identifying and Qualifying Master Data Once the semantic metadata has been collected and centralized, the analyst s task of identifying master data should be simplified. As more metadata representations of similar objects and entities populate the repository, the frequency of specific models will provide a basis for assessing whether the attributes of a represented object qualify the data elements represented by the model as master data. By adding additional characterization data for each data set s metadata profile, we can add more knowledge to the process of determining sources that can feed a master data repository, which will help in the analyst s task. One approach is to characterize the value set associated with each column in each table. At the conceptual level, designating a value set using a simplified classification scheme reduces the level of complexity associated with data variance, and allows for loosening the constraints when comparing multiple metadata instances. For example, we can limit ourselves to six data value classes, such as these: 1. Boolean or Flag There are only two valid values, one representing true and one representing false. 2. Time/Date Stamp A value that represents a point in time. 3. Magnitude A numeric value on a continuous range, such as a quantity or an amount. 4. Code Enumeration A small set of values, either used directly (e.g., using the colors red and blue ) or mapped as a numeric enumeration (e.g., 1 = red, 2 = blue ). 5. Handle A character string with limited duplication across the set may be used as part of an object description (e.g., name or address_line_1 fields contain handle information). 6. Cross-Reference An identifier that either is uniquely assigned to the record or provides a reference to that identifier in another dataset.

8 The Fractal Nature of Metadata Profiling At this point, each data attribute can be summarized in terms of a small number of descriptive characteristics: data type, length, data value, class, etc. In turn, each data set can be described as a collection of its component attributes. Because we are looking for similar data sets with similar structures, formats and semantics, our job is to assess each data set s identifying attribution, try to find the collections of data sets that share similar characteristics, and determine if they represent the same objects. Let s summarize: We are using our tools to assess data element structure We are collecting this information into a metadata repository We use our tools to look for data attributes that share similar characteristics We use our tools to seek out attributes with similar names We analyze the data value sets and assign them into value classes We use our tools to detect similarities between representative data metamodels In essence, the techniques and tools we can use for determining the sources of master data objects are the same types of tools we use for consolidating the data into a master repository! Using data profiling, parsing, standardization and matching, we can facilitate the process of identifying which data sets (tables, files, spreadsheets, etc.) represent which master data objects. Standardizing the Representation The analyst now has a collection of master object representations. But as a prelude to developing the consolidation road map, decisions must be made as part of the organization s governance process. To consolidate the variety of diverse master object representations into a single repository, the relevant stakeholders need to agree on a common representation as well as the underlying semantics for that representation. It is critical that a standard representation be defined and agreed to so that the participants expecting to benefit from the data in the master repository can effectively share the data. And because MDM is a solution that integrates tools with policies and procedures for data governance, there should be a process for defining and agreeing to data standards. Summary: Metadata Profiling Drives the Process In effect, we have described a process for analyzing similarity of syntax, structure and semantics as a prelude to identifying enterprise sources of master data. And since the objective in identifying and consolidating master data representations requires empirical analysis and similarity assessment as part of the resolution process, it is comforting to know that the same kinds of tools and techniques that will subsequently be used to facilitate data integration can also isolate and catalog organizational master data.

Data Quality in the MDM Ecosystem

Data Quality in the MDM Ecosystem Solution Guide Data Quality in the MDM Ecosystem What is MDM? The premise of Master Data Management (MDM) is to create, maintain, and deliver the most complete and comprehensive view possible from disparate

More information

Data Governance. Data Governance, Data Architecture, and Metadata Essentials Enabling Data Reuse Across the Enterprise

Data Governance. Data Governance, Data Architecture, and Metadata Essentials Enabling Data Reuse Across the Enterprise Data Governance Data Governance, Data Architecture, and Metadata Essentials Enabling Data Reuse Across the Enterprise 2 Table of Contents 4 Why Business Success Requires Data Governance Data Repurposing

More information

Considering a Services Approach for Data Quality

Considering a Services Approach for Data Quality Solutions for Customer Intelligence, Communications and Care. Considering a Services Approach for Data Quality Standardize Data Quality Capabilities for Increased Efficiency and Lower Overall Cost W H

More information

STEP Data Governance: At a Glance

STEP Data Governance: At a Glance STEP Data Governance: At a Glance Master data is the heart of business optimization and refers to organizational data, such as product, asset, location, supplier and customer information. Companies today

More information

Copyright 2016 Datalynx Pty Ltd. All rights reserved. Datalynx Enterprise Data Management Solution Catalogue

Copyright 2016 Datalynx Pty Ltd. All rights reserved. Datalynx Enterprise Data Management Solution Catalogue Datalynx Enterprise Data Management Solution Catalogue About Datalynx Vendor of the world s most versatile Enterprise Data Management software Licence our software to clients & partners Partner-based sales

More information

How Turner Broadcasting can avoid the Seven Deadly Sins That. Can Cause a Data Warehouse Project to Fail. Robert Milton Underwood, Jr.

How Turner Broadcasting can avoid the Seven Deadly Sins That. Can Cause a Data Warehouse Project to Fail. Robert Milton Underwood, Jr. How Turner Broadcasting can avoid the Seven Deadly Sins That Can Cause a Data Warehouse Project to Fail Robert Milton Underwood, Jr. 2000 Robert Milton Underwood, Jr. Page 2 2000 Table of Contents Section

More information

Challenges in the Effective Use of Master Data Management Techniques WHITE PAPER

Challenges in the Effective Use of Master Data Management Techniques WHITE PAPER Challenges in the Effective Use of Master Management Techniques WHITE PAPER SAS White Paper Table of Contents Introduction.... 1 Consolidation: The Typical Approach to Master Management. 2 Why Consolidation

More information

2 The IBM Data Governance Unified Process

2 The IBM Data Governance Unified Process 2 The IBM Data Governance Unified Process The benefits of a commitment to a comprehensive enterprise Data Governance initiative are many and varied, and so are the challenges to achieving strong Data Governance.

More information

Making the Impossible Possible

Making the Impossible Possible Making the Impossible Possible Find and Eliminate Data Errors with Automated Discovery and Data Lineage Introduction Organizations have long struggled to identify and take advantage of opportunities for

More information

TECHNOLOGY BRIEF: CA ERWIN DATA PROFILER. Combining Data Profiling and Data Modeling for Better Data Quality

TECHNOLOGY BRIEF: CA ERWIN DATA PROFILER. Combining Data Profiling and Data Modeling for Better Data Quality TECHNOLOGY BRIEF: CA ERWIN DATA PROFILER Combining Data Profiling and Data Modeling for Better Data Quality Table of Contents Executive Summary SECTION 1: CHALLENGE 2 Reducing the Cost and Risk of Data

More information

Enterprise Data Catalog for Microsoft Azure Tutorial

Enterprise Data Catalog for Microsoft Azure Tutorial Enterprise Data Catalog for Microsoft Azure Tutorial VERSION 10.2 JANUARY 2018 Page 1 of 45 Contents Tutorial Objectives... 4 Enterprise Data Catalog Overview... 5 Overview... 5 Objectives... 5 Enterprise

More information

Databases and Database Systems

Databases and Database Systems Page 1 of 6 Databases and Database Systems 9.1 INTRODUCTION: A database can be summarily described as a repository for data. This makes clear that building databases is really a continuation of a human

More information

EDQ Product Data Extensions Essentials

EDQ Product Data Extensions Essentials EDQ Product Data Extensions Essentials January, 2015 Contents Part 1 Product Data and EDQ s Product Data Extensions: a Conceptual Overview Part 2 Create a Data Lens Using the Knowledge Studio Part 3 Develop

More information

A WHITE PAPER By Silwood Technology Limited

A WHITE PAPER By Silwood Technology Limited A WHITE PAPER By Silwood Technology Limited Delivering metadata transparency for Enterprise Application packages Executive Summary Enterprise Resource Planning (ERP) packages such as SAP, J.D. Edwards

More information

FIBO Operational Ontologies Briefing for the Object Management Group

FIBO Operational Ontologies Briefing for the Object Management Group FIBO Operational Ontologies Briefing for the Object Management Group March 20, 2013, Reston, VA David Newman Strategic Planning Manager, Senior Vice President, Enterprise Architecture Chair, Semantic Technology

More information

Vendor: The Open Group. Exam Code: OG Exam Name: TOGAF 9 Part 1. Version: Demo

Vendor: The Open Group. Exam Code: OG Exam Name: TOGAF 9 Part 1. Version: Demo Vendor: The Open Group Exam Code: OG0-091 Exam Name: TOGAF 9 Part 1 Version: Demo QUESTION 1 According to TOGAF, Which of the following are the architecture domains that are commonly accepted subsets of

More information

Informatica Data Quality Product Family

Informatica Data Quality Product Family Brochure Informatica Product Family Deliver the Right Capabilities at the Right Time to the Right Users Benefits Reduce risks by identifying, resolving, and preventing costly data problems Enhance IT productivity

More information

Business Impacts of Poor Data Quality: Building the Business Case

Business Impacts of Poor Data Quality: Building the Business Case Business Impacts of Poor Data Quality: Building the Business Case David Loshin Knowledge Integrity, Inc. 1 Data Quality Challenges 2 Addressing the Problem To effectively ultimately address data quality,

More information

TDWI Data Modeling. Data Analysis and Design for BI and Data Warehousing Systems

TDWI Data Modeling. Data Analysis and Design for BI and Data Warehousing Systems Data Analysis and Design for BI and Data Warehousing Systems Previews of TDWI course books offer an opportunity to see the quality of our material and help you to select the courses that best fit your

More information

Test bank for accounting information systems 1st edition by richardson chang and smith

Test bank for accounting information systems 1st edition by richardson chang and smith Test bank for accounting information systems 1st edition by richardson chang and smith Chapter 04 Relational Databases and Enterprise Systems True / False Questions 1. Three types of data models used today

More information

April 17, Ronald Layne Manager, Data Quality and Data Governance

April 17, Ronald Layne Manager, Data Quality and Data Governance Ensuring the highest quality data is delivered throughout the university providing valuable information serving individual and organizational need April 17, 2015 Ronald Layne Manager, Data Quality and

More information

IBM InfoSphere Master Data Management Version 11 Release 5. Overview IBM SC

IBM InfoSphere Master Data Management Version 11 Release 5. Overview IBM SC IBM InfoSphere Master Data Management Version 11 Release 5 Overview IBM SC27-6718-01 IBM InfoSphere Master Data Management Version 11 Release 5 Overview IBM SC27-6718-01 Note Before using this information

More information

Data Governance for Master Data Management and Beyond

Data Governance for Master Data Management and Beyond Data Governance for Master Data Management and Beyond A White Paper by David Loshin WHITE PAPER SAS White Paper Table of Contents Aligning Information Objectives with the Business Strategy.... 1 Clarifying

More information

ISO/IEC TR TECHNICAL REPORT. Information technology Procedures for achieving metadata registry (MDR) content consistency Part 1: Data elements

ISO/IEC TR TECHNICAL REPORT. Information technology Procedures for achieving metadata registry (MDR) content consistency Part 1: Data elements TECHNICAL REPORT ISO/IEC TR 20943-1 First edition 2003-08-01 Information technology Procedures for achieving metadata registry (MDR) content consistency Part 1: Data elements Technologies de l'information

More information

SAP Security Remediation: Three Steps for Success Using SAP GRC

SAP Security Remediation: Three Steps for Success Using SAP GRC SAP Security Remediation: Three Steps for Success Using SAP GRC All companies need strong application security environments as part of a successful overall risk management strategy. Strong risk-oriented

More information

Vocabulary-Driven Enterprise Architecture Development Guidelines for DoDAF AV-2: Design and Development of the Integrated Dictionary

Vocabulary-Driven Enterprise Architecture Development Guidelines for DoDAF AV-2: Design and Development of the Integrated Dictionary Vocabulary-Driven Enterprise Architecture Development Guidelines for DoDAF AV-2: Design and Development of the Integrated Dictionary December 17, 2009 Version History Version Publication Date Author Description

More information

Full file at

Full file at Chapter 2 Data Warehousing True-False Questions 1. A real-time, enterprise-level data warehouse combined with a strategy for its use in decision support can leverage data to provide massive financial benefits

More information

The Data Organization

The Data Organization C V I T F E P A O TM The Data Organization Best Practices Metadata Dictionary Application Architecture Prepared by Rainer Schoenrank January 2017 Table of Contents 1. INTRODUCTION... 3 1.1 PURPOSE OF THE

More information

The Emerging Data Lake IT Strategy

The Emerging Data Lake IT Strategy The Emerging Data Lake IT Strategy An Evolving Approach for Dealing with Big Data & Changing Environments bit.ly/datalake SPEAKERS: Thomas Kelly, Practice Director Cognizant Technology Solutions Sean Martin,

More information

Data Virtualization Implementation Methodology and Best Practices

Data Virtualization Implementation Methodology and Best Practices White Paper Data Virtualization Implementation Methodology and Best Practices INTRODUCTION Cisco s proven Data Virtualization Implementation Methodology and Best Practices is compiled from our successful

More information

Data Governance and Data Quality: Applying GS1 Best Practices to USAID/GHSC-PSM

Data Governance and Data Quality: Applying GS1 Best Practices to USAID/GHSC-PSM Data Governance and Data Quality: Applying GS1 Best Practices to USAID/GHSC-PSM February 2017 Prepared by Beth Anne Cusack, Senior Consultant, RC Partners LLC Agenda Introduction and Approach to Today

More information

On the Design and Implementation of a Generalized Process for Business Statistics

On the Design and Implementation of a Generalized Process for Business Statistics On the Design and Implementation of a Generalized Process for Business Statistics M. Bruno, D. Infante, G. Ruocco, M. Scannapieco 1. INTRODUCTION Since the second half of 2014, Istat has been involved

More information

Meaning & Concepts of Databases

Meaning & Concepts of Databases 27 th August 2015 Unit 1 Objective Meaning & Concepts of Databases Learning outcome Students will appreciate conceptual development of Databases Section 1: What is a Database & Applications Section 2:

More information

Managing the Emerging Semantic Risks

Managing the Emerging Semantic Risks The New Information Security Agenda: Managing the Emerging Semantic Risks Dr Robert Garigue Vice President for information integrity and Chief Security Executive Bell Canada Page 1 Abstract Today all modern

More information

Solving the Enterprise Data Dilemma

Solving the Enterprise Data Dilemma Solving the Enterprise Data Dilemma Harmonizing Data Management and Data Governance to Accelerate Actionable Insights Learn More at erwin.com Is Our Company Realizing Value from Our Data? If your business

More information

DATA MINING TRANSACTION

DATA MINING TRANSACTION DATA MINING Data Mining is the process of extracting patterns from data. Data mining is seen as an increasingly important tool by modern business to transform data into an informational advantage. It is

More information

CA Asset Portfolio Management

CA Asset Portfolio Management CA Asset Portfolio Management User Guide Release 12.9.00 This Documentation, which includes embedded help systems and electronically distributed materials, (hereinafter referred to as the Documentation

More information

SAP Security Remediation: Three Steps for Success Using SAP GRC

SAP Security Remediation: Three Steps for Success Using SAP GRC SAP Security Remediation: Three Steps for Success Using SAP GRC All companies need strong application security environments as part of a successful overall risk management strategy. Strong risk-oriented

More information

Teiid Designer User Guide 7.5.0

Teiid Designer User Guide 7.5.0 Teiid Designer User Guide 1 7.5.0 1. Introduction... 1 1.1. What is Teiid Designer?... 1 1.2. Why Use Teiid Designer?... 2 1.3. Metadata Overview... 2 1.3.1. What is Metadata... 2 1.3.2. Editing Metadata

More information

DATA Act Information Model Schema (DAIMS) Architecture. U.S. Department of the Treasury

DATA Act Information Model Schema (DAIMS) Architecture. U.S. Department of the Treasury DATA Act Information Model Schema (DAIMS) Architecture U.S. Department of the Treasury September 22, 2017 Table of Contents 1. Introduction... 1 2. Conceptual Information Model... 2 3. Metadata... 4 4.

More information

BPS Suite and the OCEG Capability Model. Mapping the OCEG Capability Model to the BPS Suite s product capability.

BPS Suite and the OCEG Capability Model. Mapping the OCEG Capability Model to the BPS Suite s product capability. BPS Suite and the OCEG Capability Model Mapping the OCEG Capability Model to the BPS Suite s product capability. BPS Contents Introduction... 2 GRC activities... 2 BPS and the Capability Model for GRC...

More information

The 360 Solution. July 24, 2014

The 360 Solution. July 24, 2014 The 360 Solution July 24, 2014 Most successful large organizations are organized by lines of businesses (LOBs). This has been a very successful way to organize for the accountability of profit and loss.

More information

turning data into dollars

turning data into dollars turning data into dollars Tom s Ten Data Tips July 2010 Meta Data Meta data are data about data. A more descriptive definition says meta data is everything you need to know in order to interpret information

More information

ER/Studio Enterprise Portal Evaluation Guide. Published: March 6, 2009

ER/Studio Enterprise Portal Evaluation Guide. Published: March 6, 2009 ER/Studio Enterprise Portal 1.0.3 Evaluation Guide Published: March 6, 2009 Contents Introduction to ER/Studio Enterprise Portal 1.0... 3 Session 1: Getting Started with the Enterprise Portal... 4 Session

More information

Microsoft SharePoint Server 2013 Plan, Configure & Manage

Microsoft SharePoint Server 2013 Plan, Configure & Manage Microsoft SharePoint Server 2013 Plan, Configure & Manage Course 20331-20332B 5 Days Instructor-led, Hands on Course Information This five day instructor-led course omits the overlap and redundancy that

More information

CHAPTER 3 Implementation of Data warehouse in Data Mining

CHAPTER 3 Implementation of Data warehouse in Data Mining CHAPTER 3 Implementation of Data warehouse in Data Mining 3.1 Introduction to Data Warehousing A data warehouse is storage of convenient, consistent, complete and consolidated data, which is collected

More information

BIG DATA DOCUMENT SOLUTIONS INSTANT INSIGHTS IN YOUR TECHNICAL DOCUMENTATION

BIG DATA DOCUMENT SOLUTIONS INSTANT INSIGHTS IN YOUR TECHNICAL DOCUMENTATION BIG DATA DOCUMENT SOLUTIONS INSTANT INSIGHTS IN YOUR TECHNICAL DOCUMENTATION LEGACY DOCUMENTS PROBLEM DOCUMENT CONTROL MAZE Keeping track and control on sometimes decades of historic documents of complex

More information

CA ERwin Data Profiler

CA ERwin Data Profiler PRODUCT BRIEF: CA ERWIN DATA PROFILER CA ERwin Data Profiler CA ERWIN DATA PROFILER HELPS ORGANIZATIONS LOWER THE COSTS AND RISK ASSOCIATED WITH DATA INTEGRATION BY PROVIDING REUSABLE, AUTOMATED, CROSS-DATA-SOURCE

More information

Tamr Technical Whitepaper

Tamr Technical Whitepaper Tamr Technical Whitepaper 1. Executive Summary Tamr was founded to tackle large-scale data management challenges in organizations where extreme data volume and variety require an approach different from

More information

Not All Data Are Created Equal - Taxonomic Data and Data Governance

Not All Data Are Created Equal - Taxonomic Data and Data Governance Not All Data Are Created Equal - Taxonomic Data and Data Governance ABSTRACT Business value lost due to poor data quality has lead organizations to look for Data Governance. The assumption is that having

More information

INTELLIGENCE DRIVEN GRC FOR SECURITY

INTELLIGENCE DRIVEN GRC FOR SECURITY INTELLIGENCE DRIVEN GRC FOR SECURITY OVERVIEW Organizations today strive to keep their business and technology infrastructure organized, controllable, and understandable, not only to have the ability to

More information

The Data Organization

The Data Organization C V I T F E P A O TM The Data Organization 1251 Yosemite Way Hayward, CA 94545 (510) 303-8868 rschoenrank@computer.org Business Intelligence Process Architecture By Rainer Schoenrank Data Warehouse Consultant

More information

Core Data Services: Basic Components for Establishing Business Value

Core Data Services: Basic Components for Establishing Business Value Core Data Services: Basic Components for Establishing Business Value W H I T E PA P E R : DATA QUALITY David Loshin WHITE PAPER: DATA QUALITY Core Data Services: Basic Components for Establishing Business

More information

1. Inroduction to Data Mininig

1. Inroduction to Data Mininig 1. Inroduction to Data Mininig 1.1 Introduction Universe of Data Information Technology has grown in various directions in the recent years. One natural evolutionary path has been the development of the

More information

What s New in Ideagen Pentana 4.2

What s New in Ideagen Pentana 4.2 --- What s New in Ideagen Pentana 4.2 Table of Contents 1 Introduction... 2 2 Major New Features... 3 2.1 Key Issues/Strategic Risks... 3 2.2 Definition Sign Offs... 3 2.3 Sampling... 4 2.4 Audit Problems...

More information

Analytics: Server Architect (Siebel 7.7)

Analytics: Server Architect (Siebel 7.7) Analytics: Server Architect (Siebel 7.7) Student Guide June 2005 Part # 10PO2-ASAS-07710 D44608GC10 Edition 1.0 D44917 Copyright 2005, 2006, Oracle. All rights reserved. Disclaimer This document contains

More information

Chapter 6 Architectural Design. Chapter 6 Architectural design

Chapter 6 Architectural Design. Chapter 6 Architectural design Chapter 6 Architectural Design 1 Topics covered Architectural design decisions Architectural views Architectural patterns Application architectures 2 Software architecture The design process for identifying

More information

Risk: Security s New Compliance. Torsten George VP Worldwide Marketing and Products, Agiliance Professional Strategies - S23

Risk: Security s New Compliance. Torsten George VP Worldwide Marketing and Products, Agiliance Professional Strategies - S23 Risk: Security s New Compliance Torsten George VP Worldwide Marketing and Products, Agiliance Professional Strategies - S23 Agenda Market Dynamics Organizational Challenges Risk: Security s New Compliance

More information

DC Area Business Objects Crystal User Group (DCABOCUG) Data Warehouse Architectures for Business Intelligence Reporting.

DC Area Business Objects Crystal User Group (DCABOCUG) Data Warehouse Architectures for Business Intelligence Reporting. DC Area Business Objects Crystal User Group (DCABOCUG) Data Warehouse Architectures for Business Intelligence Reporting April 14, 2009 Whitemarsh Information Systems Corporation 2008 Althea Lane Bowie,

More information

What s a BA to do with Data? Discover and define standard data elements in business terms

What s a BA to do with Data? Discover and define standard data elements in business terms What s a BA to do with Data? Discover and define standard data elements in business terms Susan Block, Lead Business Systems Analyst The Vanguard Group Discussion Points Discovering Business Data The Data

More information

lnteroperability of Standards to Support Application Integration

lnteroperability of Standards to Support Application Integration lnteroperability of Standards to Support Application Integration Em delahostria Rockwell Automation, USA, em.delahostria@ra.rockwell.com Abstract: One of the key challenges in the design, implementation,

More information

IBM InfoSphere Information Server Version 8 Release 7. Reporting Guide SC

IBM InfoSphere Information Server Version 8 Release 7. Reporting Guide SC IBM InfoSphere Server Version 8 Release 7 Reporting Guide SC19-3472-00 IBM InfoSphere Server Version 8 Release 7 Reporting Guide SC19-3472-00 Note Before using this information and the product that it

More information

I CAN T FIND THE #$%& DATA. Why You Need a Data Catalog

I CAN T FIND THE #$%& DATA. Why You Need a Data Catalog I CAN T FIND THE #$%& DATA Why You Need a Data Catalog Data is everywhere It s embedded in our social media, streaming across the Internet of Things, and stored in the cloud. The volume of data available

More information

The Evolution of Data Warehousing. Data Warehousing Concepts. The Evolution of Data Warehousing. The Evolution of Data Warehousing

The Evolution of Data Warehousing. Data Warehousing Concepts. The Evolution of Data Warehousing. The Evolution of Data Warehousing The Evolution of Data Warehousing Data Warehousing Concepts Since 1970s, organizations gained competitive advantage through systems that automate business processes to offer more efficient and cost-effective

More information

Software Engineering

Software Engineering Software Engineering chap 4. Software Reuse 1 SuJin Choi, PhD. Sogang University Email: sujinchoi@sogang.ac.kr Slides modified, based on original slides by Ian Sommerville (Software Engineering 10 th Edition)

More information

NOTES ON OBJECT-ORIENTED MODELING AND DESIGN

NOTES ON OBJECT-ORIENTED MODELING AND DESIGN NOTES ON OBJECT-ORIENTED MODELING AND DESIGN Stephen W. Clyde Brigham Young University Provo, UT 86402 Abstract: A review of the Object Modeling Technique (OMT) is presented. OMT is an object-oriented

More information

OLAP Introduction and Overview

OLAP Introduction and Overview 1 CHAPTER 1 OLAP Introduction and Overview What Is OLAP? 1 Data Storage and Access 1 Benefits of OLAP 2 What Is a Cube? 2 Understanding the Cube Structure 3 What Is SAS OLAP Server? 3 About Cube Metadata

More information

Chapter 4. Fundamental Concepts and Models

Chapter 4. Fundamental Concepts and Models Chapter 4. Fundamental Concepts and Models 4.1 Roles and Boundaries 4.2 Cloud Characteristics 4.3 Cloud Delivery Models 4.4 Cloud Deployment Models The upcoming sections cover introductory topic areas

More information

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved.

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. 1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. Integrating Complex Financial Workflows in Oracle Database Xavier Lopez Seamus Hayes Oracle PolarLake, LTD 2 Copyright 2011, Oracle

More information

Data Governance Central to Data Management Success

Data Governance Central to Data Management Success Data Governance Central to Data Success International Anne Marie Smith, Ph.D. DAMA International DMBOK Editorial Review Board Primary Contributor EWSolutions, Inc Principal Consultant and Director of Education

More information

Realizing the Full Potential of MDM 1

Realizing the Full Potential of MDM 1 Realizing the Full Potential of MDM SOLUTION MDM Augmented with Data Virtualization INDUSTRY Applicable to all Industries EBSITE www.denodo.com PRODUCT OVERVIE The Denodo Platform offers the broadest access

More information

Chapter 6 Architectural Design

Chapter 6 Architectural Design Chapter 6 Architectural Design Chapter 6 Architectural Design Slide 1 Topics covered The WHAT and WHY of architectural design Architectural design decisions Architectural views/perspectives Architectural

More information

Question Bank. 4) It is the source of information later delivered to data marts.

Question Bank. 4) It is the source of information later delivered to data marts. Question Bank Year: 2016-2017 Subject Dept: CS Semester: First Subject Name: Data Mining. Q1) What is data warehouse? ANS. A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile

More information

SUGI 29 Data Warehousing, Management and Quality

SUGI 29 Data Warehousing, Management and Quality Building a Purchasing Data Warehouse for SRM from Disparate Procurement Systems Zeph Stemle, Qualex Consulting Services, Inc., Union, KY ABSTRACT SAS Supplier Relationship Management (SRM) solution offers

More information

Data Warehousing and OLAP Technologies for Decision-Making Process

Data Warehousing and OLAP Technologies for Decision-Making Process Data Warehousing and OLAP Technologies for Decision-Making Process Hiren H Darji Asst. Prof in Anand Institute of Information Science,Anand Abstract Data warehousing and on-line analytical processing (OLAP)

More information

Teiid Designer User Guide 7.7.0

Teiid Designer User Guide 7.7.0 Teiid Designer User Guide 1 7.7.0 1. Introduction... 1 1.1. What is Teiid Designer?... 1 1.2. Why Use Teiid Designer?... 2 1.3. Metadata Overview... 2 1.3.1. What is Metadata... 2 1.3.2. Editing Metadata

More information

Integration With the Business Modeler

Integration With the Business Modeler Decision Framework, J. Duggan Research Note 11 September 2003 Evaluating OOA&D Functionality Criteria Looking at nine criteria will help you evaluate the functionality of object-oriented analysis and design

More information

Federal Agency Firewall Management with SolarWinds Network Configuration Manager & Firewall Security Manager. Follow SolarWinds:

Federal Agency Firewall Management with SolarWinds Network Configuration Manager & Firewall Security Manager. Follow SolarWinds: Federal Agency Firewall Management with SolarWinds Network Configuration Manager & Firewall Security Manager Introduction What s different about Federal Government Firewalls? The United States Federal

More information

How to Accelerate Merger and Acquisition Synergies

How to Accelerate Merger and Acquisition Synergies How to Accelerate Merger and Acquisition Synergies MERGER AND ACQUISITION CHALLENGES Mergers and acquisitions (M&A) occur frequently in today s business environment; $3 trillion in 2017 alone. 1 M&A enables

More information

How Security Policy Orchestration Extends to Hybrid Cloud Platforms

How Security Policy Orchestration Extends to Hybrid Cloud Platforms How Security Policy Orchestration Extends to Hybrid Cloud Platforms Reducing complexity also improves visibility when managing multi vendor, multi technology heterogeneous IT environments www.tufin.com

More information

DATA VAULT MODELING GUIDE

DATA VAULT MODELING GUIDE DATA VAULT MODELING GUIDE Introductory Guide to Data Vault Modeling GENESEE ACADEMY, LLC 2012 Authored by: Hans Hultgren DATA VAULT MODELING GUIDE Introductory Guide to Data Vault Modeling Forward Data

More information

Emerging Technologies in Knowledge Management By Ramana Rao, CTO of Inxight Software, Inc.

Emerging Technologies in Knowledge Management By Ramana Rao, CTO of Inxight Software, Inc. Emerging Technologies in Knowledge Management By Ramana Rao, CTO of Inxight Software, Inc. This paper provides an overview of a presentation at the Internet Librarian International conference in London

More information

Event Metamodel and Profile (EMP) Proposed RFP Updated Sept, 2007

Event Metamodel and Profile (EMP) Proposed RFP Updated Sept, 2007 Event Metamodel and Profile (EMP) Proposed RFP Updated Sept, 2007 Robert Covington, CTO 8425 woodfield crossing boulevard suite 345 indianapolis in 46240 317.252.2636 Motivation for this proposed RFP 1.

More information

OG0-091 Q&As TOGAF 9 Part 1

OG0-091 Q&As TOGAF 9 Part 1 CertBus.com OG0-091 Q&As TOGAF 9 Part 1 Pass The Open Group OG0-091 Exam with 100% Guarantee Free Download Real Questions & Answers PDF and VCE file from: 100% Passing Guarantee 100% Money Back Assurance

More information

Effective Risk Data Aggregation & Risk Reporting

Effective Risk Data Aggregation & Risk Reporting Effective Risk Data Aggregation & Risk Reporting Presented by: Ilia Bolotine Head, Adastra Business Consulting (Canada) 1 The Evolving Regulatory Landscape in Risk Management A significant lesson learned

More information

SAS Data Integration Studio 3.3. User s Guide

SAS Data Integration Studio 3.3. User s Guide SAS Data Integration Studio 3.3 User s Guide The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2006. SAS Data Integration Studio 3.3: User s Guide. Cary, NC: SAS Institute

More information

Chapter 6 Architectural Design. Lecture 1. Chapter 6 Architectural design

Chapter 6 Architectural Design. Lecture 1. Chapter 6 Architectural design Chapter 6 Architectural Design Lecture 1 1 Topics covered ² Architectural design decisions ² Architectural views ² Architectural patterns ² Application architectures 2 Software architecture ² The design

More information

WHAT S CHANGING IN PROCURE-TO-PAY

WHAT S CHANGING IN PROCURE-TO-PAY epayment Forum WHAT S CHANGING IN PROCURE-TO-PAY August 2017 Session Objectives During this session, we will discuss: Overview of the University s Preferred Purchasing Methods Changes to the check request

More information

Veritas NetBackup Appliance Family OVERVIEW BROCHURE

Veritas NetBackup Appliance Family OVERVIEW BROCHURE Veritas NetBackup Appliance Family OVERVIEW BROCHURE Veritas NETBACKUP APPLIANCES Veritas understands the shifting needs of the data center and offers NetBackup Appliances as a way for customers to simplify

More information

TDWI strives to provide course books that are contentrich and that serve as useful reference documents after a class has ended.

TDWI strives to provide course books that are contentrich and that serve as useful reference documents after a class has ended. Previews of TDWI course books offer an opportunity to see the quality of our material and help you to select the courses that best fit your needs. The previews cannot be printed. TDWI strives to provide

More information

A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective

A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective B.Manivannan Research Scholar, Dept. Computer Science, Dravidian University, Kuppam, Andhra Pradesh, India

More information

Query-Time JOIN for Active Intelligence Engine (AIE)

Query-Time JOIN for Active Intelligence Engine (AIE) Query-Time JOIN for Active Intelligence Engine (AIE) Ad hoc JOINing of Structured Data and Unstructured Content: An Attivio-Patented Breakthrough in Information- Centered Business Agility An Attivio Technology

More information

PALANTIR CYBERMESH INTRODUCTION

PALANTIR CYBERMESH INTRODUCTION 100 Hamilton Avenue Palo Alto, California 94301 PALANTIR CYBERMESH INTRODUCTION Cyber attacks expose organizations to significant security, regulatory, and reputational risks, including the potential for

More information

SOME TYPES AND USES OF DATA MODELS

SOME TYPES AND USES OF DATA MODELS 3 SOME TYPES AND USES OF DATA MODELS CHAPTER OUTLINE 3.1 Different Types of Data Models 23 3.1.1 Physical Data Model 24 3.1.2 Logical Data Model 24 3.1.3 Conceptual Data Model 25 3.1.4 Canonical Data Model

More information

Chapter. Relational Database Concepts COPYRIGHTED MATERIAL

Chapter. Relational Database Concepts COPYRIGHTED MATERIAL Chapter Relational Database Concepts 1 COPYRIGHTED MATERIAL Every organization has data that needs to be collected, managed, and analyzed. A relational database fulfills these needs. Along with the powerful

More information

1 Executive Overview The Benefits and Objectives of BPDM

1 Executive Overview The Benefits and Objectives of BPDM 1 Executive Overview The Benefits and Objectives of BPDM This is an excerpt from the Final Submission BPDM document posted to OMG members on November 13 th 2006. The full version of the specification will

More information

TEL2813/IS2621 Security Management

TEL2813/IS2621 Security Management TEL2813/IS2621 Security Management James Joshi Associate Professor Lecture 4 + Feb 12, 2014 NIST Risk Management Risk management concept Goal to establish a relationship between aggregated risks from information

More information

The Modeling and Simulation Catalog for Discovery, Knowledge, and Reuse

The Modeling and Simulation Catalog for Discovery, Knowledge, and Reuse The Modeling and Simulation Catalog for Discovery, Knowledge, and Reuse Stephen Hunt OSD CAPE Joint Data Support (SAIC) Stephen.Hunt.ctr@osd.mil The DoD Office of Security Review has cleared this report

More information

Lecture 1. Chapter 6 Architectural design

Lecture 1. Chapter 6 Architectural design Chapter 6 Architectural Design Lecture 1 1 Topics covered Architectural design decisions Architectural views Architectural patterns Application architectures 2 Software architecture The design process

More information

Database Environment. Pearson Education 2009

Database Environment. Pearson Education 2009 Chapter 2 Database Environment 1 Chapter 2 - Objectives Purpose of three-level database architecture. Contents of external, conceptual, and internal levels. Purpose of external/conceptual and conceptual/internal

More information