Summary. The Dimensional Fact Model. Goals and benefits Basic and advanced constructs. Logical design with the DFM Best practices for design

Size: px
Start display at page:

Download "Summary. The Dimensional Fact Model. Goals and benefits Basic and advanced constructs. Logical design with the DFM Best practices for design"

Transcription

1 Boosting the Data Warehouse Life-Cycle Through Conceptual Design Stefano Rizzi DISI University of Bologna Summary Methodological frameworks Prescriptive design Agile design The Dimensional Fact Model Goals and benefits Basic and advanced constructs Logical design with the DFM Best practices for design Knowledge documentation with the DFM Testing with the DFM Functional and usability tests A case study 2

2 Methodological framework Why? Designing a data warehouse is a long and complex project that does not always meet the needs of business users It is often perceived as too rigid and IT-centric and has become more complex with big data It requires an accurate planning aimed at devising satisfactory answers to organizational and architectural questions The risk of getting an unsatisfactory result in data warehousing projects is particularly high because of high user expectations Risks related to project management Risks associated with technology Risks related to data and design Risks related to the organization Methodologies are created by closely studying similar experiences and minimizing the risks for failure by founding new approaches on a constructive analysis of the mistakes made previously 4

3 A prescriptive view Bottom-up approach DWs are incrementally built and several data marts are iteratively created. Each data mart is based on a set of facts that are linked to a specific department and that can be interesting for a user group Leads to concrete results in a short time Does not require huge investments Enables designers to investigate one area at a time Gives managers a quick feedback about the actual benefits of the system being built Keeps the interest for the project constantly high May determine a partial vision of the business domain 5 Data mart design db administrator Source Analysis and Integration Front-end Programming Requirement Analysis Physical Design distinguishing between a phase of conceptual design and one of logical design Testing brings advantages to both designers and end-users end-user Conceptual Design ETL Design Data Volume and Workload designer Logical Design 6

4 Methodological scenarios Supply-driven approach data marts are designed based on a close operational data source analysis user requirements show designers which groups of data, relevant for decision-making processes, should be selected and how to define data group structures based on the multidimensional model Demand-driven approach it begins with the definition of information requirements of data mart users the problem of how to map those requirements into existing data sources is addressed at a later stage, when ETL procedures are implemented 7 Supply-driven approach glossary REQUIREMENT ANALYSIS interviews reconciled schema CONCEPTUAL DESIGN WORKLOAD AND DATA VOLUME data source schemata SOURCE ANALYSIS AND INTEGRATION fact schema logical model LOGICAL DESIGN workload data volume DBMS reconciled schema logical schema ETL DESIGN PHYSICAL DESIGN ETL schema physical schema 8

5 Supply-driven approach Pros an initial conceptual schema for data marts can be automatically derived from the reconciled layer that is, it strictly depends on data source structures ETL design is extremely streamlined because every single information piece stored in a data mart is directly associated with one or more source attributes the resulting data marts are quite stable in time, because they are rooted in source schemata that change less frequently than the requirements expressed by end users 9 Demand-driven approach glossary REQUIREMENT ANALYSIS interviews CONCEPTUAL DESIGN WORKLOAD AND DATA VOLUME data source schemata SOURCE ANALYSIS AND INTEGRATION fact schema logical model LOGICAL DESIGN workload data volume DBMS reconciled schema logical schema ETL DESIGN PHYSICAL DESIGN ETL schema physical schema 10

6 Demand-driven approach Pros users wishes play a leading role Cons designers are required to have strong leadership and meeting facilitation qualities to properly grab and integrate the different points of view designers make great efforts in the data-staging design phase facts, measures, and hierarchies are drawn directly from the specifications provided by users, and only at a later stage can designers check for the information required to be actually available in source databases this may undermine customers confidence in designers and in the advantage gained by data marts on the whole 11 An agile view Agile methodologies can be applied to data warehouse projects only partially Highlight on incremental delivery of valuable components for users......but several DW components are hardly perceived as valuable by users Strong project segmentation based on user stories: high-level functional requirements that can be implemented in a few days......but founding design on detailed functionalities required by users does not allow the multidimensional structure of data to be correctly determined However, some founding principles of agile methodologies can be reused Incrementality and risk-based iteration Prototyping User involvement Light and formal documentation Component reuse Automated schema generation 12

7 4WD (Four-Wheel-Drive) A methodology arising from the application of agile principles to real data warehouse projects: Fact-based iteration cycles Build a conceptual schema before implementation User involvement in testing activities Possibility of changing fact and data mart priorities 13 4WD (Four-Wheel-Drive) 14

8 Conceptual modeling Which formalism? While it is now universally recognized that a data mart is based on a multidimensional view of data, there is still no agreement on how to implement its conceptual design Use of the Entity-Relationship model is quite widespread throughout companies as a conceptual tool for standard documentation and design of relational databases, but it cannot be used to model DWs Designers often base their data marts design on the logical level that is, they directly define star schemata. But a star schema is nothing but a denormalized relational schema; it contains only the definition of a set of relations and integrity constraints! 16

9 The Dimensional Fact Model The DFM is a graphical model devised to: 1. lend effective support to conceptual design of data marts it is implementation-independent it is expressive it is non-ambiguous it is formally sound (based on FD theory) it can be automatically translated into a logical schema 2. be easily understood by both designers and end-users 3. provide clear and expressive design documentation It has been successfully experimented over the last 20 years in both the academic and industrial worlds The conceptual representation generated by the DFM consists of a set of fact schemata that model facts, measures, dimensions, and hierarchies 17 The Dimensional Fact Model Advantages: 1. it gives designers and end-users a platform-independent, nonambiguous, comprehensive picture of the DW content 2. it is 100% independent of the OLAP multidimensional engine chosen for deployment and of the target logical model 3. it enables effective communication between designers and endusers with the goal of more accurately formalizing requirement specifications 4. it decreases the overall complexity of design by breaking it into two distinct but inter-related phases 5. it streamlines the DW life-cycle by enabling logical design to be automated based on widely recognized best practices 18

10 The Dimensional Fact Model Advantages: 6. it provides clear and expressive design documentation, readable by both ICT and business people, which improves the overall system maintainability 7. it effectively integrates user-oriented business glossaries 8. it enables early testing of requirements based on the core workload expressed by users, thus reducing the probability of errors and misunderstanding 9. it encourages self-service BI by helping business users understand the information content of their DW 10. it enables computation of metrics to effectively assess the quality of design 19 DFM: basic concepts A fact is a concept relevant to decision-making processes. It typically models a set of events taking place within a company (e.g., sales, shipments, purchases,...). A fact has dynamic properties and evolves in some way over time A measure is a numerical property of a fact and describes a quantitative fact aspect that is relevant to analysis (e.g., every sale is quantified by its receipts) A dimension is a fact property with a finite domain and describes an analysis coordinate of the fact. Typical dimensions for the sales fact are products, stores, and dates fact dimension A fact expresses a many-to-many relationship between its dimensions measure 20

11 DFM: basic concepts Dimensional attributes are the dimensions and other possible attributes, always with discrete values, that describe them (e.g., a product is described by its type, by the category to which it belongs, by its brand, and by the department in which it is sold) A hierarchy is a directed tree whose nodes are dimensional attributes and whose arcs model many-to-one associations between dimensional attribute pairs. It includes a dimension, positioned at the tree s root, and all of the dimensional attributes that describe it dimensional attribute hierarchy 21 DFM vs. ERM 22

12 DFM vs. star schema DateId Date Week Month Quarter Year ProductID Product Type Category Brand MarketingGroup Department StoreID DateID ProductID NumberOfUnits TotalReceipts UnitPrice StoreID Store City State SalesManager SalesDistrict 23 DFM: advanced concepts A descriptive attribute stores additional information about a dimensional attribute but is not used for aggregation Some arcs in a fact schema can be optional descriptive attribute optional arc 24

13 DFM: advanced concepts A shared hierarchy is a shorthand to denote that a part of a hierarchy is replicated two or more times in a fact schema shared hierarchy 25 DFM: advanced concepts In a convergence, two dimensional attributes are connected by two or more distinct directed paths, and each of them still represents a functional dependency convergence 26

14 DFM: advanced concepts A cross-dimensional attribute is one whose value is defined by the combination of two or more dimensional attributes cross-dimensional attribute 27 DFM: advanced concepts A multiple arc models a many-to-many association between two dimensional attributes multiple arc 28

15 DFM: advanced concepts An incomplete hierarchy is a hierarchy where, for some instances, one ore more aggregation levels are missing (because they are unknown or undefined) incomplete hierarchy 29 DFM: advanced concepts In recursive hierarchies the father-child relationships between aggregation levels are consistent, but instances can have different lengths recursive hierarchy 30

16 DFM: advanced concepts Additivity shows how measures can be aggregated additivity matrix 31 The DFM in action Budget of the United States Government for Fiscal Year

17 The DFM in action University degrees 33 The DFM in action Retail orders & invoices 34

18 The DFM in action Activity analysis for a consulting company 35 Logical design with the DFM

19 Logical design It aims at determining a logical schema for the data mart starting from a conceptual schema Translation of conceptual schemata Optimization (view materialization, fragmentation) The principles it is based on are different from those used in operational databases data redundancy denormalization of relations conceptual schema workload data volume other constraints Logical design logical schema 37 Stars & Snowflakes WeekID Week Month ProductID Product Type Category Supplier StoreID WeekID ProductID Quantity Receipts StoreID Store City Country SalesManager WeekID Week Month ProductID Product Supplier TypeID TypeID Type Category StoreID WeekID ProductID Quantity Receipts StoreID Store CityID SalesManager CityID City Country 38

20 Translating conceptual schemata The basic rule for translating a fact schema into a star schema is: Create a fact table including all measures; for each hierarchy, create a dimension table including all attributes Besides this obvious rule, specific solutions must be taken for different constructs Some examples Shared hierarchies If a hierarchy is shared, there are two choices: I. Introduce redundancy by duplicating common attributes II. Snowflake on the first common attribute SaleFT StoreID ProductID DateID NumberOfUnits UnitPrice TotalReceipts StoreDT StoreID Store CityID... ProductDT ProductId Product Supplier CityID... CityDT CityID City State Country 40

21 Cross-dimensional attributes A cross-dimensional attribute b defines a many-to-many association between two or more dimensional attributes a 1..., a m Translating it to the star schema requires to create a bridge table including b and having a 1..., a m as the key SaleFT StoreID WeekID ProductID Quantity Receipts StoreDT StoreID Store State Country ProductDT ProductID Product Type Category Supplier Brand VatBT StoreState ProductCategory Vat 41 Multiple arcs When a hierarchy includes a many-to-many association, the best solution is to create an additional table (bridge table) to model the multiple arc: The key of the bridge table is made of the two attributes connected by the multiple arc A weight attribute allows to give different relevance to tuples Both weighted and impact queries are supported SaleFT BookID DateID Number Receipts BookDT BookID Book Genre AuthorDT AuthorID Author BT BookID AuthorID Weight 42

22 Degenerate dimensions This term refers to a hierarchy including only one attribute If the attribute is not too long, its values can be directly included in the FT Alternatively, we may use one DT to model several degenerate dimensions (junk dimension) Since there is no functional dependency between attributes, this solution is feasible only if the number of distinct values for the attributes involved is small OrderLineFT OrderID ProductID McsID Quantity Amount OrderDT OrderID Order Customer City McsDT McsID ShipmentMode ReturnCode LineStatus 43 Recursive hierarchies In recursive hierarchies, the number of aggregation levels vary from instance to instance A self-referencing foreign key can be used to model the recursion ActivityFT EmployeeID ProjectID DateID ActivityTypeID HoursWorked EmployeeDT EmployeeID Employee Role ManagerID A more powerful solution is to flatten out a hierarchy and set out all the links induced by it in a navigation table ActivityFT EmployeeID ProjectID DateID ActivityTypeID HoursWorked EmployeeDT EmployeeID Employee Role Navigation Table ParentID ChildID Level Leaf 44

23 Knowledge documentation with the DFM Data warehouse view 46

24 Data warehouse view 47 Data mart view 48

25 Data mart view 49 Data mart view 50

26 Fact view 51 Fact view 52

27 Testing with the DFM A winning combination The earlier an error is detected in the software life-cycle, the cheapest correcting that error is Multidimensional schemata are typically the first design artifact that is created and can be tested, so they have a primary role in ensuring early discovery of errors All DW design methodologies entail a multidimensional modeling phase, that is carried out either at the conceptual level or at the logical level A conceptual representation is by far more expressive and can more easily be understood by non-expert users Adopting a methodology that entails a conceptual design phase has a positive impact on the effectiveness of testing, which in turn brings several advantages in terms of design quality and accuracy and in terms of maintainability and reuse 54

28 What vs. how in testing multidimensional schema functional ETL physical schema front-end usability performance stress recovery security maintainability 55 Functional test of DFM schemata It aims at verifying that the multidimensional schemata produced for the data mart effectively supports user requirements The workload test verifies that the workload preliminarily expressed by users during requirement analysis is actually supported by the multidimensional schema Check, for each workload query, that the required measures have been included in the fact schema and that the required aggregation level can be expressed as a valid grouping set on the fact schema Should the workload be too large to be comprehensively tested, tests should be made on a sample of queries only 56

29 Functional test of DFM schemata It aims at verifying that the multidimensional schemata produced for the data mart effectively supports user requirements The hierarchy test verifies that the functional dependencies represented by hierarchies in the multidimensional schema are actually valid on source data Detect two types of errors: if a hierarchy includes a functional dependency that is contradicted by source data, then either (1) a modeling error has been done and the functional dependency should be removed from the multidimensional schema, or (2) source data are faulty, which should be taken care of by ETL Even when no such errors are detected, this test can lead designers to discover denormalization issues they were not aware of in source data, which has a significant impact on ETL design 57 Functional test of DFM schemata It aims at verifying that the multidimensional schemata produced for the data mart effectively supports user requirements The conformity test is aimed at assessing how well conformed hierarchies have been designed by evaluating the sparseness of the bus matrix If the bus matrix is very sparse, the designer probably failed to recognize the semantic and structural similarities between apparently different hierarchies (creating a conformed dimension to be shared by multiple facts would be preferable maybe) If the bus matrix is very dense, the designer probably failed to recognize the semantic and structural similarities between apparently different facts (it could be worth merging these facts into a single fact including the union of their measures) Dimension TICKET RET. ORDER STORE MOVEM. accounting type admin. block reason box type constraint reason counter currency currency type discount range discount reason entity environment exchange flag completed order flag item block reason 58

30 Functional test of DFM schemata It aims at verifying that the multidimensional schemata produced for the data mart effectively supports user requirements The similarity factor aids designers in evaluating whether the facts in a single data mart have been designed minimally or redundantly, and is computed for each ordered pair of facts Depending on the values of the similarity factors for two facts, the designer can infer that these facts are mainly overlapping, or that one of them is mostly included in the other by aggregation The similarity factor is particularly useful when assessing an existing data warehouse to quickly identify possibly redundant facts When conceptual design is query-driven, several quite similar facts may be erroneously designed, which is easily captured by the similarity factor RETAIL TICKETS ORDERS STORE MOVEMENTS RETAIL TICKETS ORDERS STORE MOVEMENTS Usability test of DFM schemata The size metrics counts the total number of attributes in each hierarchy/fact/data mart The complexity metrics counts the number of advanced DFM constructs (e.g., multiple arcs, recursive hierarchies, cross-dimensional attributes) Higher values suggest lower understandability Hierarchies with size above 25 and complexity above 3 may be difficult to understand for users, while hierarchies with size above 50 and complexity above 3 will most probably create problems with usability and maintainability 60

31 Usability test of DFM schemata The roll-up factor is specifically oriented to hierarchies and aims at evaluating their OLAP navigability in terms of the roll-up paths they express The roll-up factor of a hierarchy depends on its width and depth: a high value denotes a deep hierarchy, while a low value denotes a wide hierarchy The roll-up factor makes the designer aware that some hierarchies carry low roll-up expressiveness (i.e., they are characterized by short roll-up paths), so that he can check with the user if some rollup relationships have been forgotten during requirement analysis and conceptual design Missing existing roll-up relationships is one of the most common mistakes in multidimensional modeling, and leads to a proliferation of degenerate dimensions 61 A case study

32 Italian Shoes A leading brand in the international market that produces shoes, sells them through its own sales point network, and distributes them to third-party stores worldwide Development team: 1 project manager 1 application domain expert (requirement analysis and relations with business users) 2 designers (one for ETL, one for the front-end) 2 programmers (implementation) 1 Oracle expert (physical tuning) 2 testers Duration: 9 months 63 Testing before the DFM At designers and programmers full discretion (on ETL and front-end) Each team members tests her own work Only documentation: user-acceptance tests Precise functional analysis missing no clear specification of how to move from micro-analysis to logical design difficulties in documenting business rules for instance: 1. how is this KPI exactly defined? 2. on which group-by s is it valid? 3. for which business areas is it valid? 64

33 Testing after the DFM multidimensional schema functional ETL physical schema front-end usability performance stress recovery security maintainability 65 Conceptual schema Three data flows: Orders, Retail Tickets, Store Movements

34 Workload test - 1 Goal: check that the conceptual schema meets the preliminary workload all attributes and measures are present......at the right detail Retail Tickets Orders Store Movements % unsolved user queries 0% 0% 70% Retail Tickets and Orders were already partially implemented, so 0% was expected Store Movements was still under development (the Warehouse hierarchy was missing...) 67 Workload test - 2 Goal: share, explain, and discuss the three fact schemata with users 24 degenerate dimensions added 7 dimensions deleted 7 attributes added 32 attributes deleted 10 parent-child relationships in hierarchies changed 68

35 Similarity test Retail Tickets Orders Store Movements Retail Tickets Orders Store Movements a store movement can in principle be viewed as a particular type of order, but most user queries are related to either movements or orders After 2 months: Retail Tickets Orders Store Movements Retail Tickets Orders Store Movements Usability test Presence of 9 cross-dimensional attributes in the conceptual schema after an iteration, thanks to a more careful analysis and conceptual design, all these constructs could be avoided Low roll-up factor (0.21) for the product hierarchy a discussion with users revealed that the company lacks a shared view of the product hierarchy at this stage designers agreed to leave that hierarchy untouched this issue was listed for perfective maintenance Large number of large dimension tables for Orders (#DTs = 21, #attributes = 410) creation of some junk tables to reduce the number of degenerate dimension tables some snowflaking to reduce the number of attributes per dimension table 70

36 Total effort breakdown ETL 20% 24% Front-end 7% 11% 18% 20% Requirement Analysis Multidimensional design Database Testing 71 Testing effort breakdown 5% 10% MD schema 32% ETL 53% Front-end Physical schema 72

37 Conclusions Conceptual modeling brings advantages to... business analysts & data scientists (better understanding/ exploration of data, self-service BI) designers (streamlining of design activities, early validation of requirements, effective testing) BI architects (comprehensive documentation, control over evolution) The DFM... can be coupled with both prescriptive and agile design methodologies is supported by a CASE tool 73 Conclusions The DFM is already used in several projects in public and private companies (e.g., FIAT, Italian Ministry of Justice, University of Bologna, Yoox Group, GEOX, Health Authorities, etc.) Today, in big data contexts where companies store large quantities of unstructured data in data lakes, the DFM has a key role in ensuring that this huge wealth of information can be effectively explored and analyzed 74

38 References M. Golfarelli, S. Rizzi. Data Warehouse Testing: A Prototype- Based Methodology. Information and Software Technology, 53(11): , 2011 M. Golfarelli, S. Rizzi, E. Turricchia. Modern Software Engineering Methodologies Meet Data Warehouse Design: 4WD. Proceedings 13th International Conference on Data Warehousing and Knowledge Discovery, Toulouse, France, pp , 2011 M. Golfarelli, S. Rizzi. Data Warehouse Design: Modern Principles and Methodologies. McGraw-Hill, 2009 S. Rizzi. Conceptual modeling solutions for the data warehouse. In Data Warehousing and Mining: Concepts, Methodologies, Tools, and Applications, J. Wang (Ed.), Information Science Reference, pp , Thank you for your attention Questions?

Modern Software Engineering Methodologies Meet Data Warehouse Design: 4WD

Modern Software Engineering Methodologies Meet Data Warehouse Design: 4WD Modern Software Engineering Methodologies Meet Data Warehouse Design: 4WD Matteo Golfarelli Stefano Rizzi Elisa Turricchia University of Bologna - Italy 13th International Conference on Data Warehousing

More information

Data warehouse design

Data warehouse design Database and data mining group, Data warehouse design DATA WAREHOUSE: DESIGN - Risk factors Database and data mining group, High user expectation the data warehouse is the solution of the company s problems

More information

Data Warehouse Logical Design. Letizia Tanca Politecnico di Milano (with the kind support of Rosalba Rossato)

Data Warehouse Logical Design. Letizia Tanca Politecnico di Milano (with the kind support of Rosalba Rossato) Data Warehouse Logical Design Letizia Tanca Politecnico di Milano (with the kind support of Rosalba Rossato) Data Mart logical models MOLAP (Multidimensional On-Line Analytical Processing) stores data

More information

Data warehouse design

Data warehouse design DataBase and Data Mining Group of Database and data mining group, D M B G Data warehouse design DATA WAREHOUSE: DESIGN - 1 DataBase and Data Mining Group of Risk factors Database and data mining group,

More information

Logical design DATA WAREHOUSE: DESIGN Logical design. We address the relational model (ROLAP)

Logical design DATA WAREHOUSE: DESIGN Logical design. We address the relational model (ROLAP) atabase and ata Mining Group of atabase and ata Mining Group of B MG ata warehouse design atabase and ata Mining Group of atabase and data mining group, M B G Logical design ATA WAREHOUSE: ESIGN - 37 Logical

More information

Seminars of Software and Services for the Information Society. Data Warehousing Design Issues

Seminars of Software and Services for the Information Society. Data Warehousing Design Issues DIPARTIMENTO DI INGEGNERIA INFORMATICA AUTOMATICA E GESTIONALE ANTONIO RUBERTI Master of Science in Engineering in Computer Science (MSE-CS) Seminars in Software and Services for the Information Society

More information

Decision Support Systems aka Analytical Systems

Decision Support Systems aka Analytical Systems Decision Support Systems aka Analytical Systems Decision Support Systems Systems that are used to transform data into information, to manage the organization: OLAP vs OLTP OLTP vs OLAP Transactions Analysis

More information

Chapter 3. The Multidimensional Model: Basic Concepts. Introduction. The multidimensional model. The multidimensional model

Chapter 3. The Multidimensional Model: Basic Concepts. Introduction. The multidimensional model. The multidimensional model Chapter 3 The Multidimensional Model: Basic Concepts Introduction Multidimensional Model Multidimensional concepts Star Schema Representation Conceptual modeling using ER, UML Conceptual modeling using

More information

QM Chapter 1 Database Fundamentals Version 10 th Ed. Prepared by Dr Kamel Rouibah / Dept QM & IS

QM Chapter 1 Database Fundamentals Version 10 th Ed. Prepared by Dr Kamel Rouibah / Dept QM & IS QM 433 - Chapter 1 Database Fundamentals Version 10 th Ed Prepared by Dr Kamel Rouibah / Dept QM & IS www.cba.edu.kw/krouibah Dr K. Rouibah / dept QM & IS Chapter 1 (433) Database fundamentals 1 Objectives

More information

A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective

A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective B.Manivannan Research Scholar, Dept. Computer Science, Dravidian University, Kuppam, Andhra Pradesh, India

More information

Data Warehouse Design. Letizia Tanca Politecnico di Milano (with the kind support of Rosalba Rossato)

Data Warehouse Design. Letizia Tanca Politecnico di Milano (with the kind support of Rosalba Rossato) Data Warehouse Design Letizia Tanca Politecnico di Milano (with the kind support of Rosalba Rossato) Data Warehouse Design User requirements Internal DBs Further info sources Source selection Analysis

More information

Full file at

Full file at Chapter 2 Data Warehousing True-False Questions 1. A real-time, enterprise-level data warehouse combined with a strategy for its use in decision support can leverage data to provide massive financial benefits

More information

A Methodology for Integrating XML Data into Data Warehouses

A Methodology for Integrating XML Data into Data Warehouses A Methodology for Integrating XML Data into Data Warehouses Boris Vrdoljak, Marko Banek, Zoran Skočir University of Zagreb Faculty of Electrical Engineering and Computing Address: Unska 3, HR-10000 Zagreb,

More information

Basics of Dimensional Modeling

Basics of Dimensional Modeling Basics of Dimensional Modeling Data warehouse and OLAP tools are based on a dimensional data model. A dimensional model is based on dimensions, facts, cubes, and schemas such as star and snowflake. Dimension

More information

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing.

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing. About the Tutorial A data warehouse is constructed by integrating data from multiple heterogeneous sources. It supports analytical reporting, structured and/or ad hoc queries and decision making. This

More information

Overview. Introduction to Data Warehousing and Business Intelligence. BI Is Important. What is Business Intelligence (BI)?

Overview. Introduction to Data Warehousing and Business Intelligence. BI Is Important. What is Business Intelligence (BI)? Introduction to Data Warehousing and Business Intelligence Overview Why Business Intelligence? Data analysis problems Data Warehouse (DW) introduction A tour of the coming DW lectures DW Applications Loosely

More information

FROM A RELATIONAL TO A MULTI-DIMENSIONAL DATA BASE

FROM A RELATIONAL TO A MULTI-DIMENSIONAL DATA BASE FROM A RELATIONAL TO A MULTI-DIMENSIONAL DATA BASE David C. Hay Essential Strategies, Inc In the buzzword sweepstakes of 1997, the clear winner has to be Data Warehouse. A host of technologies and techniques

More information

Managing Data Resources

Managing Data Resources Chapter 7 OBJECTIVES Describe basic file organization concepts and the problems of managing data resources in a traditional file environment Managing Data Resources Describe how a database management system

More information

Guide Users along Information Pathways and Surf through the Data

Guide Users along Information Pathways and Surf through the Data Guide Users along Information Pathways and Surf through the Data Stephen Overton, Overton Technologies, LLC, Raleigh, NC ABSTRACT Business information can be consumed many ways using the SAS Enterprise

More information

Question Bank. 4) It is the source of information later delivered to data marts.

Question Bank. 4) It is the source of information later delivered to data marts. Question Bank Year: 2016-2017 Subject Dept: CS Semester: First Subject Name: Data Mining. Q1) What is data warehouse? ANS. A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile

More information

Managing Data Resources

Managing Data Resources Chapter 7 Managing Data Resources 7.1 2006 by Prentice Hall OBJECTIVES Describe basic file organization concepts and the problems of managing data resources in a traditional file environment Describe how

More information

This module presents the star schema, an alternative to 3NF schemas intended for analytical databases.

This module presents the star schema, an alternative to 3NF schemas intended for analytical databases. Topic 3.3: Star Schema Design This module presents the star schema, an alternative to 3NF schemas intended for analytical databases. Star Schema Overview The star schema is a simple database architecture

More information

A Comprehensive Approach to Data Warehouse Testing

A Comprehensive Approach to Data Warehouse Testing A Comprehensive Approach to Data Warehouse Testing Matteo Golfarelli DEIS - University of Bologna Via Sacchi, 3 Cesena, Italy matteo.golfarelli@unibo.it Stefano Rizzi DEIS - University of Bologna VIale

More information

OBJECTIVES DEFINITIONS CHAPTER 1: THE DATABASE ENVIRONMENT AND DEVELOPMENT PROCESS. Figure 1-1a Data in context

OBJECTIVES DEFINITIONS CHAPTER 1: THE DATABASE ENVIRONMENT AND DEVELOPMENT PROCESS. Figure 1-1a Data in context OBJECTIVES CHAPTER 1: THE DATABASE ENVIRONMENT AND DEVELOPMENT PROCESS Modern Database Management 11 th Edition Jeffrey A. Hoffer, V. Ramesh, Heikki Topi! Define terms! Name limitations of conventional

More information

Advanced Data Management Technologies Written Exam

Advanced Data Management Technologies Written Exam Advanced Data Management Technologies Written Exam 02.02.2016 First name Student number Last name Signature Instructions for Students Write your name, student number, and signature on the exam sheet. This

More information

Data Strategies for Efficiency and Growth

Data Strategies for Efficiency and Growth Data Strategies for Efficiency and Growth Date Dimension Date key (PK) Date Day of week Calendar month Calendar year Holiday Channel Dimension Channel ID (PK) Channel name Channel description Channel type

More information

Data Mining Concepts & Techniques

Data Mining Concepts & Techniques Data Mining Concepts & Techniques Lecture No. 01 Databases, Data warehouse Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology Jamshoro

More information

The strategic advantage of OLAP and multidimensional analysis

The strategic advantage of OLAP and multidimensional analysis IBM Software Business Analytics Cognos Enterprise The strategic advantage of OLAP and multidimensional analysis 2 The strategic advantage of OLAP and multidimensional analysis Overview Online analytical

More information

TDWI strives to provide course books that are contentrich and that serve as useful reference documents after a class has ended.

TDWI strives to provide course books that are contentrich and that serve as useful reference documents after a class has ended. Previews of TDWI course books offer an opportunity to see the quality of our material and help you to select the courses that best fit your needs. The previews cannot be printed. TDWI strives to provide

More information

Meta-Stars: Multidimensional Modeling for Social Business Intelligence

Meta-Stars: Multidimensional Modeling for Social Business Intelligence Meta-Stars: Multidimensional Modeling for Social Business Intelligence Enrico Gallinucci - Matteo Golfarelli - Stefano Rizzi University of Bologna - Italy Summary Introduction: Social BI Topic hierarchy

More information

2 The IBM Data Governance Unified Process

2 The IBM Data Governance Unified Process 2 The IBM Data Governance Unified Process The benefits of a commitment to a comprehensive enterprise Data Governance initiative are many and varied, and so are the challenges to achieving strong Data Governance.

More information

Data Warehousing. Overview

Data Warehousing. Overview Data Warehousing Overview Basic Definitions Normalization Entity Relationship Diagrams (ERDs) Normal Forms Many to Many relationships Warehouse Considerations Dimension Tables Fact Tables Star Schema Snowflake

More information

Proceedings of the IE 2014 International Conference AGILE DATA MODELS

Proceedings of the IE 2014 International Conference  AGILE DATA MODELS AGILE DATA MODELS Mihaela MUNTEAN Academy of Economic Studies, Bucharest mun61mih@yahoo.co.uk, Mihaela.Muntean@ie.ase.ro Abstract. In last years, one of the most popular subjects related to the field of

More information

ALTERNATE SCHEMA DIAGRAMMING METHODS DECISION SUPPORT SYSTEMS. CS121: Relational Databases Fall 2017 Lecture 22

ALTERNATE SCHEMA DIAGRAMMING METHODS DECISION SUPPORT SYSTEMS. CS121: Relational Databases Fall 2017 Lecture 22 ALTERNATE SCHEMA DIAGRAMMING METHODS DECISION SUPPORT SYSTEMS CS121: Relational Databases Fall 2017 Lecture 22 E-R Diagramming 2 E-R diagramming techniques used in book are similar to ones used in industry

More information

Advanced Data Management Technologies

Advanced Data Management Technologies ADMT 2018/19 Unit 5 J. Gamper 1/48 Advanced Data Management Technologies Unit 5 Logical Design and DW Applications J. Gamper Free University of Bozen-Bolzano Faculty of Computer Science IDSE Acknowledgements:

More information

CHAPTER 3 Implementation of Data warehouse in Data Mining

CHAPTER 3 Implementation of Data warehouse in Data Mining CHAPTER 3 Implementation of Data warehouse in Data Mining 3.1 Introduction to Data Warehousing A data warehouse is storage of convenient, consistent, complete and consolidated data, which is collected

More information

Working with the Business to Build Effective Dimensional Models

Working with the Business to Build Effective Dimensional Models Working with the Business to Build Effective Dimensional Models Laura L. Reeves Co-Founder & Principal April, 2009 Copyright 2009 StarSoft Solutions, Inc. Slide 1 Instructor Information: Laura L. Reeves,

More information

Data Mining: Approach Towards The Accuracy Using Teradata!

Data Mining: Approach Towards The Accuracy Using Teradata! Data Mining: Approach Towards The Accuracy Using Teradata! Shubhangi Pharande Department of MCA NBNSSOCS,Sinhgad Institute Simantini Nalawade Department of MCA NBNSSOCS,Sinhgad Institute Ajay Nalawade

More information

Chapter 6. Foundations of Business Intelligence: Databases and Information Management VIDEO CASES

Chapter 6. Foundations of Business Intelligence: Databases and Information Management VIDEO CASES Chapter 6 Foundations of Business Intelligence: Databases and Information Management VIDEO CASES Case 1a: City of Dubuque Uses Cloud Computing and Sensors to Build a Smarter, Sustainable City Case 1b:

More information

1. Analytical queries on the dimensionally modeled database can be significantly simpler to create than on the equivalent nondimensional database.

1. Analytical queries on the dimensionally modeled database can be significantly simpler to create than on the equivalent nondimensional database. 1. Creating a data warehouse involves using the functionalities of database management software to implement the data warehouse model as a collection of physically created and mutually connected database

More information

DC Area Business Objects Crystal User Group (DCABOCUG) Data Warehouse Architectures for Business Intelligence Reporting.

DC Area Business Objects Crystal User Group (DCABOCUG) Data Warehouse Architectures for Business Intelligence Reporting. DC Area Business Objects Crystal User Group (DCABOCUG) Data Warehouse Architectures for Business Intelligence Reporting April 14, 2009 Whitemarsh Information Systems Corporation 2008 Althea Lane Bowie,

More information

Data Warehousing ETL. Esteban Zimányi Slides by Toon Calders

Data Warehousing ETL. Esteban Zimányi Slides by Toon Calders Data Warehousing ETL Esteban Zimányi ezimanyi@ulb.ac.be Slides by Toon Calders 1 Overview Picture other sources Metadata Monitor & Integrator OLAP Server Analysis Operational DBs Extract Transform Load

More information

Agile Accessibility. Presenters: Ensuring accessibility throughout the Agile development process

Agile Accessibility. Presenters: Ensuring accessibility throughout the Agile development process Agile Accessibility Ensuring accessibility throughout the Agile development process Presenters: Andrew Nielson, CSM, PMP, MPA Ann Marie Davis, CSM, PMP, M. Ed. Cammie Truesdell, M. Ed. Overview What is

More information

The Data Organization

The Data Organization C V I T F E P A O TM The Data Organization 1251 Yosemite Way Hayward, CA 94545 (510) 303-8868 rschoenrank@computer.org Business Intelligence Process Architecture By Rainer Schoenrank Data Warehouse Consultant

More information

Advanced Data Management Technologies

Advanced Data Management Technologies ADMT 2017/18 Unit 2 J. Gamper 1/44 Advanced Data Management Technologies Unit 2 Basic Concepts of BI and DW J. Gamper Free University of Bozen-Bolzano Faculty of Computer Science IDSE Acknowledgements:

More information

Aggregating Knowledge in a Data Warehouse and Multidimensional Analysis

Aggregating Knowledge in a Data Warehouse and Multidimensional Analysis Aggregating Knowledge in a Data Warehouse and Multidimensional Analysis Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com Objectives Explain the basics of: 1. Data

More information

This tutorial has been prepared for computer science graduates to help them understand the basic-to-advanced concepts related to data mining.

This tutorial has been prepared for computer science graduates to help them understand the basic-to-advanced concepts related to data mining. About the Tutorial Data Mining is defined as the procedure of extracting information from huge sets of data. In other words, we can say that data mining is mining knowledge from data. The tutorial starts

More information

Acknowledgment. MTAT Data Mining. Week 7: Online Analytical Processing and Data Warehouses. Typical Data Analysis Process.

Acknowledgment. MTAT Data Mining. Week 7: Online Analytical Processing and Data Warehouses. Typical Data Analysis Process. MTAT.03.183 Data Mining Week 7: Online Analytical Processing and Data Warehouses Marlon Dumas marlon.dumas ät ut. ee Acknowledgment This slide deck is a mashup of the following publicly available slide

More information

Rocky Mountain Technology Ventures

Rocky Mountain Technology Ventures Rocky Mountain Technology Ventures Comparing and Contrasting Online Analytical Processing (OLAP) and Online Transactional Processing (OLTP) Architectures 3/19/2006 Introduction One of the most important

More information

collection of data that is used primarily in organizational decision making.

collection of data that is used primarily in organizational decision making. Data Warehousing A data warehouse is a special purpose database. Classic databases are generally used to model some enterprise. Most often they are used to support transactions, a process that is referred

More information

An Overview of Data Warehousing and OLAP Technology

An Overview of Data Warehousing and OLAP Technology An Overview of Data Warehousing and OLAP Technology CMPT 843 Karanjit Singh Tiwana 1 Intro and Architecture 2 What is Data Warehouse? Subject-oriented, integrated, time varying, non-volatile collection

More information

Star Schema Design (Additonal Material; Partly Covered in Chapter 8) Class 04: Star Schema Design 1

Star Schema Design (Additonal Material; Partly Covered in Chapter 8) Class 04: Star Schema Design 1 Star Schema Design (Additonal Material; Partly Covered in Chapter 8) Class 04: Star Schema Design 1 Star Schema Overview Star Schema: A simple database architecture used extensively in analytical applications,

More information

Partner Presentation Faster and Smarter Data Warehouses with Oracle OLAP 11g

Partner Presentation Faster and Smarter Data Warehouses with Oracle OLAP 11g Partner Presentation Faster and Smarter Data Warehouses with Oracle OLAP 11g Vlamis Software Solutions, Inc. Founded in 1992 in Kansas City, Missouri Oracle Partner and reseller since 1995 Specializes

More information

Data Warehouse Testing. By: Rakesh Kumar Sharma

Data Warehouse Testing. By: Rakesh Kumar Sharma Data Warehouse Testing By: Rakesh Kumar Sharma Index...2 Introduction...3 About Data Warehouse...3 Data Warehouse definition...3 Testing Process for Data warehouse:...3 Requirements Testing :...3 Unit

More information

Fig 1.2: Relationship between DW, ODS and OLTP Systems

Fig 1.2: Relationship between DW, ODS and OLTP Systems 1.4 DATA WAREHOUSES Data warehousing is a process for assembling and managing data from various sources for the purpose of gaining a single detailed view of an enterprise. Although there are several definitions

More information

Data warehouse architecture consists of the following interconnected layers:

Data warehouse architecture consists of the following interconnected layers: Architecture, in the Data warehousing world, is the concept and design of the data base and technologies that are used to load the data. A good architecture will enable scalability, high performance and

More information

The Seven Steps to Implement DataOps

The Seven Steps to Implement DataOps The Seven Steps to Implement Ops ABSTRACT analytics teams challenged by inflexibility and poor quality have found that Ops can address these and many other obstacles. Ops includes tools and process improvements

More information

Software Development Methodologies

Software Development Methodologies Software Development Methodologies Lecturer: Raman Ramsin Lecture 8 Agile Methodologies: XP 1 extreme Programming (XP) Developed by Beck in 1996. The first authentic XP book appeared in 1999, with a revised

More information

CSE 544 Principles of Database Management Systems. Alvin Cheung Fall 2015 Lecture 8 - Data Warehousing and Column Stores

CSE 544 Principles of Database Management Systems. Alvin Cheung Fall 2015 Lecture 8 - Data Warehousing and Column Stores CSE 544 Principles of Database Management Systems Alvin Cheung Fall 2015 Lecture 8 - Data Warehousing and Column Stores Announcements Shumo office hours change See website for details HW2 due next Thurs

More information

DEVELOPING DECISION SUPPORT SYSTEMS A MODERN APPROACH

DEVELOPING DECISION SUPPORT SYSTEMS A MODERN APPROACH DEVELOPING DECISION SUPPORT SYSTEMS A MODERN APPROACH Ion Lungu PhD, Vlad Diaconiţa PhD Candidate Department of Economic Informatics Academy of Economic Studies Bucharest In today s economy access to quality

More information

Data Warehouses. Yanlei Diao. Slides Courtesy of R. Ramakrishnan and J. Gehrke

Data Warehouses. Yanlei Diao. Slides Courtesy of R. Ramakrishnan and J. Gehrke Data Warehouses Yanlei Diao Slides Courtesy of R. Ramakrishnan and J. Gehrke Introduction v In the late 80s and early 90s, companies began to use their DBMSs for complex, interactive, exploratory analysis

More information

Management Information Systems MANAGING THE DIGITAL FIRM, 12 TH EDITION FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT

Management Information Systems MANAGING THE DIGITAL FIRM, 12 TH EDITION FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT MANAGING THE DIGITAL FIRM, 12 TH EDITION Chapter 6 FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT VIDEO CASES Case 1: Maruti Suzuki Business Intelligence and Enterprise Databases

More information

DATA MINING AND WAREHOUSING

DATA MINING AND WAREHOUSING DATA MINING AND WAREHOUSING Qno Question Answer 1 Define data warehouse? Data warehouse is a subject oriented, integrated, time-variant, and nonvolatile collection of data that supports management's decision-making

More information

Advanced Modeling and Design

Advanced Modeling and Design Advanced Modeling and Design 1. Advanced Multidimensional Modeling Handling changes in dimensions Large-scale dimensional modeling 2. Design Methodologies 3. Project Management Acknowledgements: I am indebted

More information

QUALITY MONITORING AND

QUALITY MONITORING AND BUSINESS INTELLIGENCE FOR CMS DATA QUALITY MONITORING AND DATA CERTIFICATION. Author: Daina Dirmaite Supervisor: Broen van Besien CERN&Vilnius University 2016/08/16 WHAT IS BI? Business intelligence is

More information

Requirements and Design Overview

Requirements and Design Overview Requirements and Design Overview Robert B. France Colorado State University Robert B. France O-1 Why do we model? Enhance understanding and communication Provide structure for problem solving Furnish abstractions

More information

Chapter 6 VIDEO CASES

Chapter 6 VIDEO CASES Chapter 6 Foundations of Business Intelligence: Databases and Information Management VIDEO CASES Case 1a: City of Dubuque Uses Cloud Computing and Sensors to Build a Smarter, Sustainable City Case 1b:

More information

MIS2502: Data Analytics Dimensional Data Modeling. Jing Gong

MIS2502: Data Analytics Dimensional Data Modeling. Jing Gong MIS2502: Data Analytics Dimensional Data Modeling Jing Gong gong@temple.edu http://community.mis.temple.edu/gong Where we are Now we re here Data entry Transactional Database Data extraction Analytical

More information

Data Governance Central to Data Management Success

Data Governance Central to Data Management Success Data Governance Central to Data Success International Anne Marie Smith, Ph.D. DAMA International DMBOK Editorial Review Board Primary Contributor EWSolutions, Inc Principal Consultant and Director of Education

More information

CIS 330: Web-driven Web Applications. Lecture 2: Introduction to ER Modeling

CIS 330: Web-driven Web Applications. Lecture 2: Introduction to ER Modeling CIS 330: Web-driven Web Applications Lecture 2: Introduction to ER Modeling 1 Goals of This Lecture Understand ER modeling 2 Last Lecture Why Store Data in a DBMS? Transactions (concurrent data access,

More information

Data Vault Brisbane User Group

Data Vault Brisbane User Group Data Vault Brisbane User Group 26-02-2013 Agenda Introductions A brief introduction to Data Vault Creating a Data Vault based Data Warehouse Comparisons with 3NF/Kimball When is it good for you? Examples

More information

Sql Fact Constellation Schema In Data Warehouse With Example

Sql Fact Constellation Schema In Data Warehouse With Example Sql Fact Constellation Schema In Data Warehouse With Example Data Warehouse OLAP - Learn Data Warehouse in simple and easy steps using Multidimensional OLAP (MOLAP), Hybrid OLAP (HOLAP), Specialized SQL

More information

After completing this course, participants will be able to:

After completing this course, participants will be able to: Designing a Business Intelligence Solution by Using Microsoft SQL Server 2008 T h i s f i v e - d a y i n s t r u c t o r - l e d c o u r s e p r o v i d e s i n - d e p t h k n o w l e d g e o n d e s

More information

IDU0010 ERP,CRM ja DW süsteemid Loeng 5 DW concepts. Enn Õunapuu

IDU0010 ERP,CRM ja DW süsteemid Loeng 5 DW concepts. Enn Õunapuu IDU0010 ERP,CRM ja DW süsteemid Loeng 5 DW concepts Enn Õunapuu enn.ounapuu@ttu.ee Content Oveall approach Dimensional model Tabular model Overall approach Data modeling is a discipline that has been practiced

More information

Accounting Information Systems, 2e (Kay/Ovlia) Chapter 2 Accounting Databases. Objective 1

Accounting Information Systems, 2e (Kay/Ovlia) Chapter 2 Accounting Databases. Objective 1 Accounting Information Systems, 2e (Kay/Ovlia) Chapter 2 Accounting Databases Objective 1 1) One of the disadvantages of a relational database is that we can enter data once into the database, and then

More information

TDWI Data Modeling. Data Analysis and Design for BI and Data Warehousing Systems

TDWI Data Modeling. Data Analysis and Design for BI and Data Warehousing Systems Data Analysis and Design for BI and Data Warehousing Systems Previews of TDWI course books offer an opportunity to see the quality of our material and help you to select the courses that best fit your

More information

by Prentice Hall

by Prentice Hall Chapter 6 Foundations of Business Intelligence: Databases and Information Management 6.1 2010 by Prentice Hall Organizing Data in a Traditional File Environment File organization concepts Computer system

More information

Business Intelligence and Decision Support Systems

Business Intelligence and Decision Support Systems Business Intelligence and Decision Support Systems (9 th Ed., Prentice Hall) Chapter 8: Data Warehousing Learning Objectives Understand the basic definitions and concepts of data warehouses Learn different

More information

Q1) Describe business intelligence system development phases? (6 marks)

Q1) Describe business intelligence system development phases? (6 marks) BUISINESS ANALYTICS AND INTELLIGENCE SOLVED QUESTIONS Q1) Describe business intelligence system development phases? (6 marks) The 4 phases of BI system development are as follow: Analysis phase Design

More information

WKU-MIS-B10 Data Management: Warehousing, Analyzing, Mining, and Visualization. Management Information Systems

WKU-MIS-B10 Data Management: Warehousing, Analyzing, Mining, and Visualization. Management Information Systems Management Information Systems Management Information Systems B10. Data Management: Warehousing, Analyzing, Mining, and Visualization Code: 166137-01+02 Course: Management Information Systems Period: Spring

More information

Getting Started with Rational Team Concert

Getting Started with Rational Team Concert Getting Started with Rational Team Concert or RTC in 16 Steps Kai-Uwe Maetzel IBM Rational Software kai-uwe_maetzel@us.ibm.com SDP 20 2009 IBM Corporation This Presentation is Good for You if You know

More information

Chapter 3. Foundations of Business Intelligence: Databases and Information Management

Chapter 3. Foundations of Business Intelligence: Databases and Information Management Chapter 3 Foundations of Business Intelligence: Databases and Information Management THE DATA HIERARCHY TRADITIONAL FILE PROCESSING Organizing Data in a Traditional File Environment Problems with the traditional

More information

Data Warehousing. Seminar report. Submitted in partial fulfillment of the requirement for the award of degree Of Computer Science

Data Warehousing. Seminar report.  Submitted in partial fulfillment of the requirement for the award of degree Of Computer Science A Seminar report On Data Warehousing Submitted in partial fulfillment of the requirement for the award of degree Of Computer Science SUBMITTED TO: SUBMITTED BY: www.studymafia.org www.studymafia.org Preface

More information

Data Warehouse and Data Mining

Data Warehouse and Data Mining Data Warehouse and Data Mining Lecture No. 02 Lifecycle of Data warehouse Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology Jamshoro

More information

Data Management Glossary

Data Management Glossary Data Management Glossary A Access path: The route through a system by which data is found, accessed and retrieved Agile methodology: An approach to software development which takes incremental, iterative

More information

5-1McGraw-Hill/Irwin. Copyright 2007 by The McGraw-Hill Companies, Inc. All rights reserved.

5-1McGraw-Hill/Irwin. Copyright 2007 by The McGraw-Hill Companies, Inc. All rights reserved. 5-1McGraw-Hill/Irwin Copyright 2007 by The McGraw-Hill Companies, Inc. All rights reserved. 5 hapter Data Resource Management Data Concepts Database Management Types of Databases McGraw-Hill/Irwin Copyright

More information

What s a BA to do with Data? Discover and define standard data elements in business terms

What s a BA to do with Data? Discover and define standard data elements in business terms What s a BA to do with Data? Discover and define standard data elements in business terms Susan Block, Lead Business Systems Analyst The Vanguard Group Discussion Points Discovering Business Data The Data

More information

Data Warehousing and Decision Support (mostly using Relational Databases) CS634 Class 20

Data Warehousing and Decision Support (mostly using Relational Databases) CS634 Class 20 Data Warehousing and Decision Support (mostly using Relational Databases) CS634 Class 20 Slides based on Database Management Systems 3 rd ed, Ramakrishnan and Gehrke, Chapter 25 Introduction Increasingly,

More information

The Business Value of Metadata for Data Governance: The Challenge of Integrating Packaged Applications

The Business Value of Metadata for Data Governance: The Challenge of Integrating Packaged Applications The Business Value of Metadata for Data Governance: The Challenge of Integrating Packaged Applications By Donna Burbank Managing Director, Global Data Strategy, Ltd www.globaldatastrategy.com Sponsored

More information

Top of Minds Report series Data Warehouse The six levels of integration

Top of Minds Report series Data Warehouse The six levels of integration Top of Minds Report series Data Warehouse The six levels of integration Recommended reading Before reading this report it is recommended to read ToM Report Series on Data Warehouse Definitions for Integration

More information

TECHNOLOGY BRIEF: CA ERWIN DATA PROFILER. Combining Data Profiling and Data Modeling for Better Data Quality

TECHNOLOGY BRIEF: CA ERWIN DATA PROFILER. Combining Data Profiling and Data Modeling for Better Data Quality TECHNOLOGY BRIEF: CA ERWIN DATA PROFILER Combining Data Profiling and Data Modeling for Better Data Quality Table of Contents Executive Summary SECTION 1: CHALLENGE 2 Reducing the Cost and Risk of Data

More information

CT75 DATA WAREHOUSING AND DATA MINING DEC 2015

CT75 DATA WAREHOUSING AND DATA MINING DEC 2015 Q.1 a. Briefly explain data granularity with the help of example Data Granularity: The single most important aspect and issue of the design of the data warehouse is the issue of granularity. It refers

More information

Variety-Aware OLAP of Document-Oriented Databases

Variety-Aware OLAP of Document-Oriented Databases Variety-Aware OLAP of Document-Oriented Databases Enrico Gallinucci, Matteo Golfarelli, Stefano Rizzi DISI University of Bologna, Italy Introduction In recent years, schemalessdbmss have progressively

More information

The power management skills gap

The power management skills gap The power management skills gap Do you have the knowledge and expertise to keep energy flowing around your datacentre environment? A recent survey by Freeform Dynamics of 320 senior data centre professionals

More information

Business Intelligence. You can t manage what you can t measure. You can t measure what you can t describe. Ahsan Kabir

Business Intelligence. You can t manage what you can t measure. You can t measure what you can t describe. Ahsan Kabir Business Intelligence You can t manage what you can t measure. You can t measure what you can t describe Ahsan Kabir A broad category of applications and technologies for gathering, storing, analyzing,

More information

Proposal of a new Data Warehouse Architecture Reference Model

Proposal of a new Data Warehouse Architecture Reference Model Proposal of a new Data Warehouse Architecture Reference Model Dariusz Dymek Wojciech Komnata Piotr Szwed AGH University of Science and Technology Department of Applied Computer Science e-mail: eidymek@kinga.cyf-kr.edu.pl,

More information

CS614 - Data Warehousing - Midterm Papers Solved MCQ(S) (1 TO 22 Lectures)

CS614 - Data Warehousing - Midterm Papers Solved MCQ(S) (1 TO 22 Lectures) CS614- Data Warehousing Solved MCQ(S) From Midterm Papers (1 TO 22 Lectures) BY Arslan Arshad Nov 21,2016 BS110401050 BS110401050@vu.edu.pk Arslan.arshad01@gmail.com AKMP01 CS614 - Data Warehousing - Midterm

More information

Course on Database Design Carlo Batini University of Milano Bicocca

Course on Database Design Carlo Batini University of Milano Bicocca Course on Database Design Carlo Batini University of Milano Bicocca 1 Carlo Batini, 2015 This work is licensed under the Creative Commons Attribution NonCommercial NoDerivatives 4.0 International License.

More information

Data Warehouse and Data Mining

Data Warehouse and Data Mining Data Warehouse and Data Mining Lecture No. 04-06 Data Warehouse Architecture Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology

More information

A Systems Approach to Dimensional Modeling in Data Marts. Joseph M. Firestone, Ph.D. White Paper No. One. March 12, 1997

A Systems Approach to Dimensional Modeling in Data Marts. Joseph M. Firestone, Ph.D. White Paper No. One. March 12, 1997 1 of 8 5/24/02 4:43 PM A Systems Approach to Dimensional Modeling in Data Marts By Joseph M. Firestone, Ph.D. White Paper No. One March 12, 1997 OLAP s Purposes And Dimensional Data Modeling Dimensional

More information