Proposal of a new Data Warehouse Architecture Reference Model

Size: px
Start display at page:

Download "Proposal of a new Data Warehouse Architecture Reference Model"

Transcription

1 Proposal of a new Data Warehouse Architecture Reference Model Dariusz Dymek Wojciech Komnata Piotr Szwed AGH University of Science and Technology Department of Applied Computer Science eidymek@kinga.cyf-kr.edu.pl, {wkomnata,pszwed}@agh.edu.pl BDAS 2015 Ustroń, Poland, May 26-29, 2015

2 Agenda 1. Motivation 2. Overview of DW architecturs 3. Data Warehouse Architecture Reference Model: description 4. Mapping and examples 5. Conclusions 2

3 Data Warehouse Architecture Reference Model - definition Software Architecture Definitions The software architecture of a program or computer system is the structure or structures of the system, which comprise software components, the externally visible properties of those components, and the relationships among them [Bass et al. CMU/SEI-2006-TR-012 ESC-TR ]. Architecture is the fundamental organization of a system embodied in its components, their relationships to each other and to the environment and the principles guiding its design and evolution. [ISO 2000 standard] Data Warehouse Architecture Reference Model (DWARM) description of components appearing in DW architectures (and their properties) guidelines defining how to use them 3

4 Motivation 1 Long experience with industrial Data Warehouse (DW) implementations Scientific projects, which used DW components for data analysis Design of BPWIP platform data integration and exchange between security agencies. (DW collected facts related to communication events) 4

5 Motivation 2 DW architectures are presented in various ways: layered models five basic architecture patterns Problems with mixed architectures: evolution of DWs integration of existent systems (company mergers and acquisitions) Data Warehouse Architecture Reference Model (DWARM) goals: a systematic description of DW components and architectural patterns, which are developed to meet very diverse technical and business needs comprehensive set of components allowing to model standard architectures guidelines on how to arrange them to form hybrid architectures 5

6 Overview layered classification 1 Single-layer [Golfarelli, Rizzi, Data Warehouse Design, Modern Principles and Methodologies, 2009] Operational data Source layer DW is a multidimensional view of operational data created by middleware Middleware Data Warehouse Analysis Reporting tools OLAP tools 6

7 Overview layered classification 2 Two-layer [Golfarelli, Rizzi, Data Warehouse Design, Modern Principles and Methodologies, 2009] Operational data ETL tools External data Source data Data Staging Data stored in central repository Data marts replicate it contents (selection of data relevant for business area, department or user category) Data Warehouse Metadata Data Warehouse layer Data Marts Analysis Reporting tools OLAP tools Data Mining tools What-if analysis 7

8 Overview layered classification 3 Operational data ETL tools External data Source layer Data staging Three-layer [Golfarelli, Rizzi, Data Warehouse Design, Modern Principles and Methodologies, 2009] Reconciled data Metadata Reconciled layer Reconciled layer materializes operational data after clensing and integrating. ETL tools Loading introduces additional redundancy sometimes used for daily reports Data Warehouse Data Warehouse layer Data Marts Analysis Reporting tools OLAP tools Data Mining tools What if analysis tools 8

9 Functional classification Five architectures identified based on roles of central Data Warehouse and Data Marts [Watson, Ariyachandra, 2005; Connolly, Begg, 2005; Golfarelli, Rizzi, 2009; Ponniah, 2010] Data Data Data Storage Data & Information Source Staging Delivery Centralized Independent Data Marts Federated Hub-and- Spoke Data-Mart Bus 9

10 Five architecture models 1 Centralized (Enterprise DW) Data Staging Data Storage Data & Information Delivery Central data repository: fine-granular data in 3NF materialized aggregates logical dimensional views No data marts. Hub and Spoke (Inmon) Data Source Data Staging Data Storage Data & Information Delivery Central data repository: fine-granular data in 3NF Dependent data marts fed from CDR: summarized data selected elementary data no constraints on data models: normalized, denormalized, multidimensional 10

11 Five architecture models 2 Independent Data Marts Data Source Data Mart Bus (Kimball) Data Source Data Staging Data Staging Data Storage Data Storage Data & Information Delivery Data & Information Delivery No Central data repository Data marts: fed separetely independent data models numerous versions of truth hinders making enterprise-wide queries No Central data repository Data marts: conformed enterprise-wide dimensional model provided by a supermart star (snowflake) data models used in marts arranged into a constelation by the supermart model logical data integration and entrprise-wide view 11

12 Five architecture models 3 Federated Data Source Data Staging Data Storage Data & Information Delivery Integration of legacy systems No changes in flows from source systems to data storages (multiple CDRs and DMs) Data integrated logically based on shared keys, metadata and distributed queries Materialized or virtual 12

13 Popularity of architectures in Poland Centralized Based on: Moh d Alsqour, Kamal Matouk Mieczysław L. Owoc A survey of data warehouse architectures - preliminary results, Proceedings of the Federated Conference on Computer Science and Information Systems, 2013, pp

14 Architecture selection factors Central Hub and Spoke Data Mart Bus Independent Data Mart Information inter-dependence (data exchange level) Senior management informational needs Urgency Task routiness Resource constraints (availability) Strategic view of the data warehouse (DM as enabler for strategic transformations within an enterprise) Experts opinions Compatibility with running systems Perceived ability (IT in-house staff skills) Sponsorship level (influence of the sponsor) Technical issues According to [Watson, Ariyachandra, 2005/2010]. Data collected from 400 large organizations, mean revenue 660M$, mean number of emplyees correlation between a factor and architectures no influence 14

15 Data Warehouse Reference Model 11. Consumer Applications 10. Data Delivery App Data Analysis & Visualization Simple language borrowed from Structured Anaysis. 9. Federated Data Repository 8. Data Integration FDR 11 layers comprising data stores and processes 7. Data Mart 6. Data Mart Feeding 5. Central Data Repository DM CDR Granularity level: tradeof between generality and flexibility at describing various components and connections 4. Loading 3. Temporary Staging Area TSA ETL Transformation 2. Extraction 1. Source Src Data Collection 15

16 DWARM: Source layer 11. Consumer Applications 10. Data Delivery 9. Federated Data Repository App FDR Data Analysis & Visualization Source - entities that can deliver data to the data warehouse: enterprise information systems registering devices. 8. Data Integration 7. Data Mart 6. Data Mart Feeding DM Data: structured or unstructured synchronous or asynchronous 5. Central Data Repository CDR 4. Loading 3. Temporary Staging Area TSA ETL Transformation 2. Extraction 1. Source Src Data Collection 16

17 DWARM: Extraction layer 11. Consumer Applications 10. Data Delivery 9. Federated Data Repository 8. Data Integration App FDR Data Analysis & Visualization Extraction processes data acquisition, transforming to transport formats sending them to a data warehouse (DW) 7. Data Mart 6. Data Mart Feeding 5. Central Data Repository 4. Loading DM CDR Processes: Triggered manually or automatically Partly or fully controlled 3. Temporary Staging Area 2. Extraction 1. Source TSA Src ETL Transformation Data Collection Data: complete or incremental metadata (timestamps, source identification, checksums) 17

18 DWARM: Temporary Staging Area (TSA) 11. Consumer Applications App Data Analysis & Visualization Data storage and transformation before loading. 10. Data Delivery 9. Federated Data Repository FDR Fully controlled by DW. 8. Data Integration 7. Data Mart 6. Data Mart Feeding 5. Central Data Repository 4. Loading DM CDR Metadata describing: data transformations (model, structure, values) quality assurance (completness, consistency, compliance with control rules) 3. Temporary Staging Area TSA ETL Transformation 2. Extraction 1. Source Src Data Collection 18

19 DWARM: Loading layer 11. Consumer Applications 10. Data Delivery 9. Federated Data Repository 8. Data Integration App FDR Data Analysis & Visualization Loading processes Transferring data to Central Data Repository (CDR) Data Marts Automatic data processing (integration, aggregation) 7. Data Mart 6. Data Mart Feeding DM Processes started automatically (periodically). 5. Central Data Repository 4. Loading CDR Primary data after loading are immutable 3. Temporary Staging Area TSA ETL Transformation 2. Extraction 1. Source Src Data Collection 19

20 DWARM: Central Data Repository 11. Consumer Applications App Data Analysis & Visualization Appears in Centralized and Hub and Spoke architectures. 10. Data Delivery 9. Federated Data Repository 8. Data Integration 7. Data Mart FDR DM Data structured according to common schema. Depending on approach: multi dimensional models or 3NF. 6. Data Mart Feeding 5. Central Data Repository CDR Implementations: typically, relational, rarely flattened models. 4. Loading 3. Temporary Staging Area TSA ETL Transformation 2. Extraction 1. Source Src Data Collection 20

21 DWARM: Data Mart feeding layer 11. Consumer Applications App Data Analysis & Visualization Appears in Hub and Spoke architecture. 10. Data Delivery 9. Federated Data Repository 8. Data Integration 7. Data Mart 6. Data Mart Feeding FDR DM Processes selecting data from CDR applying transformations and loading data to separate Data Marts 5. Central Data Repository CDR 4. Loading 3. Temporary Staging Area TSA ETL Transformation 2. Extraction 1. Source Src Data Collection 21

22 DWARM: Data Mart layer 11. Consumer Applications 10. Data Delivery 9. Federated Data Repository 8. Data Integration App FDR Data Analysis & Visualization Primary data storage in: Independed Data Mart Data Mart bus Local data storage (materialized view) for Hub and Spoke. 7. Data Mart 6. Data Mart Feeding 5. Central Data Repository 4. Loading 3. Temporary Staging Area DM CDR TSA ETL Transformation Data models similar to CDR. Smaller data volumes and subject orientation favor redundancy. Often multidimensional DB (data cubes) used for storing data. 2. Extraction 1. Source Src Data Collection 22

23 DWARM: Data integration layer 11. Consumer Applications 10. Data Delivery 9. Federated Data Repository 8. Data Integration App FDR Data Analysis & Visualization Processes responsible for loading data originating from different data marts / data warehouses into Federated Data Repository. 7. Data Mart 6. Data Mart Feeding 5. Central Data Repository 4. Loading DM CDR Two cases: shared common data model: joins from databases different data models: similar to ETL processes 3. Temporary Staging Area TSA ETL Transformation 2. Extraction 1. Source Src Data Collection 23

24 DWARM: Federated Data Repository layer 11. Consumer Applications 10. Data Delivery App Data Analysis & Visualization Additional source of data satisfying various needs of Consumer Applications. 9. Federated Data Repository 8. Data Integration 7. Data Mart 6. Data Mart Feeding FDR DM Implementations: virtual (middleware, closely connected to data integration) materialized 5. Central Data Repository 4. Loading CDR Materialized FDR shares characteristics with CDR (although does not contain primary data). 3. Temporary Staging Area TSA ETL Transformation 2. Extraction 1. Source Src Data Collection 24

25 DWARM: Data Delivery 11. Consumer Applications 10. Data Delivery App Data Analysis & Visualization A link between Consumer Applications and all needed sources of data (CDR, DM, FDR). 9. Federated Data Repository 8. Data Integration FDR Main goal: delivering demanded data to consumer applications. 7. Data Mart 6. Data Mart Feeding 5. Central Data Repository DM CDR Auxiliary functions: security and access control logging 4. Loading 3. Temporary Staging Area 2. Extraction 1. Source TSA Src ETL Transformation Data Collection Data delivery processes synchronized with processes of other layers (e.g. mutual exclusion with loading and aggregation). 25

26 DWARM: Customer Applications 11. Consumer Applications 10. Data Delivery 9. Federated Data Repository 8. Data Integration 7. Data Mart 6. Data Mart Feeding App FDR DM Data Analysis & Visualization End-user applications: controlled by metadata, e.g. reporting systems with specfied data formats, periodicity and recipients. not described by metadata (e.g. Excel spreadsheet using exported data). 5. Central Data Repository CDR 4. Loading 3. Temporary Staging Area TSA ETL Transformation 2. Extraction 1. Source Src Data Collection 26

27 Mapping between DWARM and five architectures 1 exactly one layer many several layer instances (including data stores and processes) 27

28 Examples Consumer applications Data delivery Consumer applications Data delivery Consumer applications Data delivery Consumer applications Data delivery Consumer applications Data delivery Mixed architecture: Hub and Spoke Data Mart Bus 7 DM 7 DM 7 DM 7 DM Data Mart Data Mart Data Mart Data Mart 6 Data Mart Loading 5 CDR Central Data Repository Loading Loading Loading 3 TSA 3 TSA 3 TSA Temporary Staging Area Temporary Staging Area Temporary Staging Area Extraction Extraction Extraction 1 Source 28

29 Examples 2 11 Aplikacje analityczno-prezentacyjne 10 Udostępnianie danych 11 Aplikacje analityczno-prezentacyjne 10 Udostępnianie danych 9 FRD 11 Aplikacje analityczno-prezentacyjne 10 Udostępnianie danych Aplikacje analityczno-prezentacyjne 10 Udostępnianie danych Three architectures: Centralized Independent Data Mart Federated Federacyjne Repozytorium Danych 8 Integracja danych 5 CRD Centralne Repozytorium Danych 7 HT Hurtownia Tematyczna... 7 HT Hurtownia Tematyczna Ładowanie danych Ładowanie danych Ładowanie danych 3 DT 3 DT Dane Tymczasowe Dane Tymczasowe 2 2 Pozyskanie danych Pozyskanie danych 1 1 Dane źródłowe Dane źródłowe Organization 1 Ogranization 2 29

30 Conclusions The goal of our work was to define a comprehensive model that can be used to express various data warehouse architectures define components and guidelines how to arrange them DWARM Model: 11 layers (elements having common properties and roles) processes containers (data stores) and data An instance of data warehouse architecture model can be obtained by: tailoring (removing selected elements or layers) and creating instances of appropriately connected components Architectures of desired complexity can be obtained by multiplying layers and their elements. 30

31 Further information Discussion on DW achitectures, data modeling, development process and metadata. Presentation of the reference model including characterisctics of data and processes. Ontological model (DWARM). Concise information on BPWIP (secure platform for data integration) Application of graph grammars to data transformations. 31

32 Thank you 32

Business Intelligence and Decision Support Systems

Business Intelligence and Decision Support Systems Business Intelligence and Decision Support Systems (9 th Ed., Prentice Hall) Chapter 8: Data Warehousing Learning Objectives Understand the basic definitions and concepts of data warehouses Learn different

More information

Full file at

Full file at Chapter 2 Data Warehousing True-False Questions 1. A real-time, enterprise-level data warehouse combined with a strategy for its use in decision support can leverage data to provide massive financial benefits

More information

DATA WAREHOUSE EGCO321 DATABASE SYSTEMS KANAT POOLSAWASD DEPARTMENT OF COMPUTER ENGINEERING MAHIDOL UNIVERSITY

DATA WAREHOUSE EGCO321 DATABASE SYSTEMS KANAT POOLSAWASD DEPARTMENT OF COMPUTER ENGINEERING MAHIDOL UNIVERSITY DATA WAREHOUSE EGCO321 DATABASE SYSTEMS KANAT POOLSAWASD DEPARTMENT OF COMPUTER ENGINEERING MAHIDOL UNIVERSITY CHARACTERISTICS Data warehouse is a central repository for summarized and integrated data

More information

CHAPTER 8 DECISION SUPPORT V2 ADVANCED DATABASE SYSTEMS. Assist. Prof. Dr. Volkan TUNALI

CHAPTER 8 DECISION SUPPORT V2 ADVANCED DATABASE SYSTEMS. Assist. Prof. Dr. Volkan TUNALI CHAPTER 8 DECISION SUPPORT V2 ADVANCED DATABASE SYSTEMS Assist. Prof. Dr. Volkan TUNALI Topics 2 Business Intelligence (BI) Decision Support System (DSS) Data Warehouse Online Analytical Processing (OLAP)

More information

Fig 1.2: Relationship between DW, ODS and OLTP Systems

Fig 1.2: Relationship between DW, ODS and OLTP Systems 1.4 DATA WAREHOUSES Data warehousing is a process for assembling and managing data from various sources for the purpose of gaining a single detailed view of an enterprise. Although there are several definitions

More information

Call: SAS BI Course Content:35-40hours

Call: SAS BI Course Content:35-40hours SAS BI Course Content:35-40hours Course Outline SAS Data Integration Studio 4.2 Introduction * to SAS DIS Studio Features of SAS DIS Studio Tasks performed by SAS DIS Studio Navigation to SAS DIS Studio

More information

Data warehouse architecture consists of the following interconnected layers:

Data warehouse architecture consists of the following interconnected layers: Architecture, in the Data warehousing world, is the concept and design of the data base and technologies that are used to load the data. A good architecture will enable scalability, high performance and

More information

Data Warehouse and Data Mining

Data Warehouse and Data Mining Data Warehouse and Data Mining Lecture No. 02 Introduction to Data Warehouse Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology

More information

A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective

A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective B.Manivannan Research Scholar, Dept. Computer Science, Dravidian University, Kuppam, Andhra Pradesh, India

More information

Improving the Data Warehouse Architecture Using Design Patterns

Improving the Data Warehouse Architecture Using Design Patterns Association for Information Systems AIS Electronic Library (AISeL) MWAIS 2011 Proceedings Midwest (MWAIS) 5-20-2011 Improving the Data Warehouse Architecture Using Design Patterns Weiwen Yang Colorado

More information

STRATEGIC INFORMATION SYSTEMS IV STV401T / B BTIP05 / BTIX05 - BTECH DEPARTMENT OF INFORMATICS. By: Dr. Tendani J. Lavhengwa

STRATEGIC INFORMATION SYSTEMS IV STV401T / B BTIP05 / BTIX05 - BTECH DEPARTMENT OF INFORMATICS. By: Dr. Tendani J. Lavhengwa STRATEGIC INFORMATION SYSTEMS IV STV401T / B BTIP05 / BTIX05 - BTECH DEPARTMENT OF INFORMATICS LECTURE: 05 (A) DATA WAREHOUSING (DW) By: Dr. Tendani J. Lavhengwa lavhengwatj@tut.ac.za 1 My personal quote:

More information

IMPLEMENTING STATISTICAL DOMAIN DATABASES IN POLAND. OPPORTUNITIES AND THREATS. Central Statistical Office in Poland

IMPLEMENTING STATISTICAL DOMAIN DATABASES IN POLAND. OPPORTUNITIES AND THREATS. Central Statistical Office in Poland IMPLEMENTING STATISTICAL DOMAIN DATABASES IN POLAND. OPPORTUNITIES AND THREATS. Central Statistical Office in Poland Agenda 2 Background Current state The goal of the SDD Architecture Technologies Data

More information

Data Warehousing ETL. Esteban Zimányi Slides by Toon Calders

Data Warehousing ETL. Esteban Zimányi Slides by Toon Calders Data Warehousing ETL Esteban Zimányi ezimanyi@ulb.ac.be Slides by Toon Calders 1 Overview Picture other sources Metadata Monitor & Integrator OLAP Server Analysis Operational DBs Extract Transform Load

More information

Data Warehouse and Mining

Data Warehouse and Mining Data Warehouse and Mining 1. is a subject-oriented, integrated, time-variant, nonvolatile collection of data in support of management decisions. A. Data Mining. B. Data Warehousing. C. Web Mining. D. Text

More information

DC Area Business Objects Crystal User Group (DCABOCUG) Data Warehouse Architectures for Business Intelligence Reporting.

DC Area Business Objects Crystal User Group (DCABOCUG) Data Warehouse Architectures for Business Intelligence Reporting. DC Area Business Objects Crystal User Group (DCABOCUG) Data Warehouse Architectures for Business Intelligence Reporting April 14, 2009 Whitemarsh Information Systems Corporation 2008 Althea Lane Bowie,

More information

CHAPTER 3 Implementation of Data warehouse in Data Mining

CHAPTER 3 Implementation of Data warehouse in Data Mining CHAPTER 3 Implementation of Data warehouse in Data Mining 3.1 Introduction to Data Warehousing A data warehouse is storage of convenient, consistent, complete and consolidated data, which is collected

More information

Data Warehouse and Data Mining

Data Warehouse and Data Mining Data Warehouse and Data Mining Lecture No. 04-06 Data Warehouse Architecture Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology

More information

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing.

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing. About the Tutorial A data warehouse is constructed by integrating data from multiple heterogeneous sources. It supports analytical reporting, structured and/or ad hoc queries and decision making. This

More information

Overview of Reporting in the Business Information Warehouse

Overview of Reporting in the Business Information Warehouse Overview of Reporting in the Business Information Warehouse Contents What Is the Business Information Warehouse?...2 Business Information Warehouse Architecture: An Overview...2 Business Information Warehouse

More information

Data Warehouse. Asst.Prof.Dr. Pattarachai Lalitrojwong

Data Warehouse. Asst.Prof.Dr. Pattarachai Lalitrojwong Data Warehouse Asst.Prof.Dr. Pattarachai Lalitrojwong Faculty of Information Technology King Mongkut s Institute of Technology Ladkrabang Bangkok 10520 pattarachai@it.kmitl.ac.th The Evolution of Data

More information

Designing Data Warehouses. Data Warehousing Design. Designing Data Warehouses. Designing Data Warehouses

Designing Data Warehouses. Data Warehousing Design. Designing Data Warehouses. Designing Data Warehouses Designing Data Warehouses To begin a data warehouse project, need to find answers for questions such as: Data Warehousing Design Which user requirements are most important and which data should be considered

More information

OLAP Introduction and Overview

OLAP Introduction and Overview 1 CHAPTER 1 OLAP Introduction and Overview What Is OLAP? 1 Data Storage and Access 1 Benefits of OLAP 2 What Is a Cube? 2 Understanding the Cube Structure 3 What Is SAS OLAP Server? 3 About Cube Metadata

More information

Data Warehouse and Data Mining

Data Warehouse and Data Mining Data Warehouse and Data Mining Lecture No. 03 Architecture of DW Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology Jamshoro Basic

More information

Data Mining Concepts & Techniques

Data Mining Concepts & Techniques Data Mining Concepts & Techniques Lecture No. 01 Databases, Data warehouse Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology Jamshoro

More information

Modern Software Engineering Methodologies Meet Data Warehouse Design: 4WD

Modern Software Engineering Methodologies Meet Data Warehouse Design: 4WD Modern Software Engineering Methodologies Meet Data Warehouse Design: 4WD Matteo Golfarelli Stefano Rizzi Elisa Turricchia University of Bologna - Italy 13th International Conference on Data Warehousing

More information

02 Hr/week. Theory Marks. Internal assessment. Avg. of 2 Tests

02 Hr/week. Theory Marks. Internal assessment. Avg. of 2 Tests Course Code Course Name Teaching Scheme Credits Assigned Theory Practical Tutorial Theory Practical/Oral Tutorial Total TEITC504 Database Management Systems 04 Hr/week 02 Hr/week --- 04 01 --- 05 Examination

More information

COURSE 20466D: IMPLEMENTING DATA MODELS AND REPORTS WITH MICROSOFT SQL SERVER

COURSE 20466D: IMPLEMENTING DATA MODELS AND REPORTS WITH MICROSOFT SQL SERVER ABOUT THIS COURSE The focus of this five-day instructor-led course is on creating managed enterprise BI solutions. It describes how to implement multidimensional and tabular data models, deliver reports

More information

Decision Support Systems aka Analytical Systems

Decision Support Systems aka Analytical Systems Decision Support Systems aka Analytical Systems Decision Support Systems Systems that are used to transform data into information, to manage the organization: OLAP vs OLTP OLTP vs OLAP Transactions Analysis

More information

Seminars of Software and Services for the Information Society. Data Warehousing Design Issues

Seminars of Software and Services for the Information Society. Data Warehousing Design Issues DIPARTIMENTO DI INGEGNERIA INFORMATICA AUTOMATICA E GESTIONALE ANTONIO RUBERTI Master of Science in Engineering in Computer Science (MSE-CS) Seminars in Software and Services for the Information Society

More information

DATA MINING AND WAREHOUSING

DATA MINING AND WAREHOUSING DATA MINING AND WAREHOUSING Qno Question Answer 1 Define data warehouse? Data warehouse is a subject oriented, integrated, time-variant, and nonvolatile collection of data that supports management's decision-making

More information

Data Warehousing Concepts

Data Warehousing Concepts Data Warehousing Concepts Data Warehousing Definition Basic Data Warehousing Architecture Transaction & Transactional Data OLTP / Operational System / Transactional System OLAP / Data Warehouse / Decision

More information

Oracle Database 11g: Data Warehousing Fundamentals

Oracle Database 11g: Data Warehousing Fundamentals Oracle Database 11g: Data Warehousing Fundamentals Duration: 3 Days What you will learn This Oracle Database 11g: Data Warehousing Fundamentals training will teach you about the basic concepts of a data

More information

Introduction to DWH / BI Concepts

Introduction to DWH / BI Concepts SAS INTELLIGENCE PLATFORM CURRICULUM SAS INTELLIGENCE PLATFORM BI TOOLS 4.2 VERSION SAS BUSINESS INTELLIGENCE TOOLS - COURSE OUTLINE Practical Project Based Training & Implementation on all the BI Tools

More information

Pro Tech protechtraining.com

Pro Tech protechtraining.com Course Summary Description This course provides students with the skills necessary to plan, design, build, and run the ETL processes which are needed to build and maintain a data warehouse. It is based

More information

Sql Fact Constellation Schema In Data Warehouse With Example

Sql Fact Constellation Schema In Data Warehouse With Example Sql Fact Constellation Schema In Data Warehouse With Example Data Warehouse OLAP - Learn Data Warehouse in simple and easy steps using Multidimensional OLAP (MOLAP), Hybrid OLAP (HOLAP), Specialized SQL

More information

Data Warehousing Fundamentals by Mark Peco

Data Warehousing Fundamentals by Mark Peco Data Warehousing Fundamentals by Mark Peco All rights reserved. Reproduction in whole or part prohibited except by written permission. Product and company names mentioned herein may be trademarks of their

More information

DATA WAREHOUING UNIT I

DATA WAREHOUING UNIT I BHARATHIDASAN ENGINEERING COLLEGE NATTRAMAPALLI DEPARTMENT OF COMPUTER SCIENCE SUB CODE & NAME: IT6702/DWDM DEPT: IT Staff Name : N.RAMESH DATA WAREHOUING UNIT I 1. Define data warehouse? NOV/DEC 2009

More information

ETL and OLAP Systems

ETL and OLAP Systems ETL and OLAP Systems Krzysztof Dembczyński Intelligent Decision Support Systems Laboratory (IDSS) Poznań University of Technology, Poland Software Development Technologies Master studies, first semester

More information

Data Warehouses Chapter 12. Class 10: Data Warehouses 1

Data Warehouses Chapter 12. Class 10: Data Warehouses 1 Data Warehouses Chapter 12 Class 10: Data Warehouses 1 OLTP vs OLAP Operational Database: a database designed to support the day today transactions of an organization Data Warehouse: historical data is

More information

Chapter 3: Data Warehousing

Chapter 3: Data Warehousing Solution Manual Business Intelligence and Analytics Systems for Decision Support 10th Edition Sharda Instant download and all chapters Solution Manual Business Intelligence and Analytics Systems for Decision

More information

1. Analytical queries on the dimensionally modeled database can be significantly simpler to create than on the equivalent nondimensional database.

1. Analytical queries on the dimensionally modeled database can be significantly simpler to create than on the equivalent nondimensional database. 1. Creating a data warehouse involves using the functionalities of database management software to implement the data warehouse model as a collection of physically created and mutually connected database

More information

Building a Data Warehouse step by step

Building a Data Warehouse step by step Informatica Economică, nr. 2 (42)/2007 83 Building a Data Warehouse step by step Manole VELICANU, Academy of Economic Studies, Bucharest Gheorghe MATEI, Romanian Commercial Bank Data warehouses have been

More information

1 Dulcian, Inc., 2001 All rights reserved. Oracle9i Data Warehouse Review. Agenda

1 Dulcian, Inc., 2001 All rights reserved. Oracle9i Data Warehouse Review. Agenda Agenda Oracle9i Warehouse Review Dulcian, Inc. Oracle9i Server OLAP Server Analytical SQL Mining ETL Infrastructure 9i Warehouse Builder Oracle 9i Server Overview E-Business Intelligence Platform 9i Server:

More information

After completing this course, participants will be able to:

After completing this course, participants will be able to: Designing a Business Intelligence Solution by Using Microsoft SQL Server 2008 T h i s f i v e - d a y i n s t r u c t o r - l e d c o u r s e p r o v i d e s i n - d e p t h k n o w l e d g e o n d e s

More information

Information Management course

Information Management course Università degli Studi di Milano Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 05(b) : 23/10/2012 Data Mining: Concepts and Techniques (3 rd ed.) Chapter

More information

Data Warehousing. Adopted from Dr. Sanjay Gunasekaran

Data Warehousing. Adopted from Dr. Sanjay Gunasekaran Data Warehousing Adopted from Dr. Sanjay Gunasekaran Main Topics Overview of Data Warehouse Concept of Data Conversion Importance of Data conversion and the steps involved Common Industry Methodology Outline

More information

Question Bank. 4) It is the source of information later delivered to data marts.

Question Bank. 4) It is the source of information later delivered to data marts. Question Bank Year: 2016-2017 Subject Dept: CS Semester: First Subject Name: Data Mining. Q1) What is data warehouse? ANS. A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile

More information

Table Of Contents: xix Foreword to Second Edition

Table Of Contents: xix Foreword to Second Edition Data Mining : Concepts and Techniques Table Of Contents: Foreword xix Foreword to Second Edition xxi Preface xxiii Acknowledgments xxxi About the Authors xxxv Chapter 1 Introduction 1 (38) 1.1 Why Data

More information

Business Intelligence An Overview. Zahra Mansoori

Business Intelligence An Overview. Zahra Mansoori Business Intelligence An Overview Zahra Mansoori Contents 1. Preference 2. History 3. Inmon Model - Inmonities 4. Kimball Model - Kimballities 5. Inmon vs. Kimball 6. Reporting 7. BI Algorithms 8. Summary

More information

IJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 06, 2016 ISSN (online):

IJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 06, 2016 ISSN (online): IJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 06, 2016 ISSN (online): 2321-0613 Tanzeela Khanam 1 Pravin S.Metkewar 2 1 Student 2 Associate Professor 1,2 SICSR, affiliated

More information

Chapter 13 Business Intelligence and Data Warehouses The Need for Data Analysis Business Intelligence. Objectives

Chapter 13 Business Intelligence and Data Warehouses The Need for Data Analysis Business Intelligence. Objectives Chapter 13 Business Intelligence and Data Warehouses Objectives In this chapter, you will learn: How business intelligence is a comprehensive framework to support business decision making How operational

More information

Generating Cross level Rules: An automated approach

Generating Cross level Rules: An automated approach Generating Cross level Rules: An automated approach Ashok 1, Sonika Dhingra 1 1HOD, Dept of Software Engg.,Bhiwani Institute of Technology, Bhiwani, India 1M.Tech Student, Dept of Software Engg.,Bhiwani

More information

Data Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1394

Data Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1394 Data Mining Data warehousing Hamid Beigy Sharif University of Technology Fall 1394 Hamid Beigy (Sharif University of Technology) Data Mining Fall 1394 1 / 22 Table of contents 1 Introduction 2 Data warehousing

More information

Data Warehouse and Data Mining

Data Warehouse and Data Mining Data Warehouse and Data Mining Lecture No. 05 Data Modeling Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology Jamshoro Data Modeling

More information

Implementing Data Models and Reports with SQL Server 2014

Implementing Data Models and Reports with SQL Server 2014 Course 20466D: Implementing Data Models and Reports with SQL Server 2014 Page 1 of 6 Implementing Data Models and Reports with SQL Server 2014 Course 20466D: 4 days; Instructor-Led Introduction The focus

More information

Data Management Glossary

Data Management Glossary Data Management Glossary A Access path: The route through a system by which data is found, accessed and retrieved Agile methodology: An approach to software development which takes incremental, iterative

More information

DATA MINING TRANSACTION

DATA MINING TRANSACTION DATA MINING Data Mining is the process of extracting patterns from data. Data mining is seen as an increasingly important tool by modern business to transform data into an informational advantage. It is

More information

Research Article ISSN:

Research Article ISSN: Research Article [Srivastava,1(4): Jun., 2012] IJESRT INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY An Optimized algorithm to select the appropriate Schema in Data Warehouses Rahul

More information

Q1) Describe business intelligence system development phases? (6 marks)

Q1) Describe business intelligence system development phases? (6 marks) BUISINESS ANALYTICS AND INTELLIGENCE SOLVED QUESTIONS Q1) Describe business intelligence system development phases? (6 marks) The 4 phases of BI system development are as follow: Analysis phase Design

More information

Microsoft SQL Server Training Course Catalogue. Learning Solutions

Microsoft SQL Server Training Course Catalogue. Learning Solutions Training Course Catalogue Learning Solutions Querying SQL Server 2000 with Transact-SQL Course No: MS2071 Two days Instructor-led-Classroom 2000 The goal of this course is to provide students with the

More information

Aggregating Knowledge in a Data Warehouse and Multidimensional Analysis

Aggregating Knowledge in a Data Warehouse and Multidimensional Analysis Aggregating Knowledge in a Data Warehouse and Multidimensional Analysis Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com Objectives Explain the basics of: 1. Data

More information

Physical Modeling of Data Warehouses using UML

Physical Modeling of Data Warehouses using UML Department of Software and Computing Systems Physical Modeling of Data Warehouses using UML Sergio Luján-Mora Juan Trujillo DOLAP 2004 Contents Motivation UML extension mechanisms DW design framework DW

More information

CT75 DATA WAREHOUSING AND DATA MINING DEC 2015

CT75 DATA WAREHOUSING AND DATA MINING DEC 2015 Q.1 a. Briefly explain data granularity with the help of example Data Granularity: The single most important aspect and issue of the design of the data warehouse is the issue of granularity. It refers

More information

Overview. Introduction to Data Warehousing and Business Intelligence. BI Is Important. What is Business Intelligence (BI)?

Overview. Introduction to Data Warehousing and Business Intelligence. BI Is Important. What is Business Intelligence (BI)? Introduction to Data Warehousing and Business Intelligence Overview Why Business Intelligence? Data analysis problems Data Warehouse (DW) introduction A tour of the coming DW lectures DW Applications Loosely

More information

CS377: Database Systems Data Warehouse and Data Mining. Li Xiong Department of Mathematics and Computer Science Emory University

CS377: Database Systems Data Warehouse and Data Mining. Li Xiong Department of Mathematics and Computer Science Emory University CS377: Database Systems Data Warehouse and Data Mining Li Xiong Department of Mathematics and Computer Science Emory University 1 1960s: Evolution of Database Technology Data collection, database creation,

More information

#mstrworld. Analyzing Multiple Data Sources with Multisource Data Federation and In-Memory Data Blending. Presented by: Trishla Maru.

#mstrworld. Analyzing Multiple Data Sources with Multisource Data Federation and In-Memory Data Blending. Presented by: Trishla Maru. Analyzing Multiple Data Sources with Multisource Data Federation and In-Memory Data Blending Presented by: Trishla Maru Agenda Overview MultiSource Data Federation Use Cases Design Considerations Data

More information

Data Warehouse Logical Design. Letizia Tanca Politecnico di Milano (with the kind support of Rosalba Rossato)

Data Warehouse Logical Design. Letizia Tanca Politecnico di Milano (with the kind support of Rosalba Rossato) Data Warehouse Logical Design Letizia Tanca Politecnico di Milano (with the kind support of Rosalba Rossato) Data Mart logical models MOLAP (Multidimensional On-Line Analytical Processing) stores data

More information

A Star Schema Has One To Many Relationship Between A Dimension And Fact Table

A Star Schema Has One To Many Relationship Between A Dimension And Fact Table A Star Schema Has One To Many Relationship Between A Dimension And Fact Table Many organizations implement star and snowflake schema data warehouse The fact table has foreign key relationships to one or

More information

Dr.G.R.Damodaran College of Science

Dr.G.R.Damodaran College of Science 1 of 20 8/28/2017 2:13 PM Dr.G.R.Damodaran College of Science (Autonomous, affiliated to the Bharathiar University, recognized by the UGC)Reaccredited at the 'A' Grade Level by the NAAC and ISO 9001:2008

More information

DSS based on Data Warehouse

DSS based on Data Warehouse DSS based on Data Warehouse C_13 / 19.01.2017 Decision support system is a complex system engineering. At the same time, research DW composition, DW structure and DSS Architecture based on DW, puts forward

More information

Advanced Data Management Technologies Written Exam

Advanced Data Management Technologies Written Exam Advanced Data Management Technologies Written Exam 02.02.2016 First name Student number Last name Signature Instructions for Students Write your name, student number, and signature on the exam sheet. This

More information

Data Warehouse Design Using Row and Column Data Distribution

Data Warehouse Design Using Row and Column Data Distribution Int'l Conf. Information and Knowledge Engineering IKE'15 55 Data Warehouse Design Using Row and Column Data Distribution Behrooz Seyed-Abbassi and Vivekanand Madesi School of Computing, University of North

More information

Lecture2: Database Environment

Lecture2: Database Environment College of Computer and Information Sciences - Information Systems Dept. Lecture2: Database Environment 1 IS220 : D a t a b a s e F u n d a m e n t a l s Topics Covered Data abstraction Schemas and Instances

More information

International Journal of Computer Engineering and Applications, REQUIREMENT GATHERING FOR MODEL DRIVEN DESIGN OF DATAWAREHOUSE

International Journal of Computer Engineering and Applications, REQUIREMENT GATHERING FOR MODEL DRIVEN DESIGN OF DATAWAREHOUSE International Journal of Computer Engineering and Applications, Volume XII, Issue I, Jan. 18, www.ijcea.com ISSN 2321-3469 REQUIREMENT GATHERING FOR MODEL DRIVEN DESIGN OF DATAWAREHOUSE Kuldeep Deshpande

More information

Data Warehousing Methods and its Applications

Data Warehousing Methods and its Applications International Journal of Engineering Science Invention (IJESI) ISSN (Online): 2319 6734, ISSN (Print): 2319 6726 www.ijesi.org PP. 12-19 Data Warehousing Methods and its Applications 1 Dr. C. Suba 1 (Department

More information

The Evolution of Data Warehousing. Data Warehousing Concepts. The Evolution of Data Warehousing. The Evolution of Data Warehousing

The Evolution of Data Warehousing. Data Warehousing Concepts. The Evolution of Data Warehousing. The Evolution of Data Warehousing The Evolution of Data Warehousing Data Warehousing Concepts Since 1970s, organizations gained competitive advantage through systems that automate business processes to offer more efficient and cost-effective

More information

DHANALAKSHMI COLLEGE OF ENGINEERING, CHENNAI

DHANALAKSHMI COLLEGE OF ENGINEERING, CHENNAI DHANALAKSHMI COLLEGE OF ENGINEERING, CHENNAI Department of Information Technology IT6702 Data Warehousing & Data Mining Anna University 2 & 16 Mark Questions & Answers Year / Semester: IV / VII Regulation:

More information

Complete. The. Reference. Christopher Adamson. Mc Grauu. LlLIJBB. New York Chicago. San Francisco Lisbon London Madrid Mexico City

Complete. The. Reference. Christopher Adamson. Mc Grauu. LlLIJBB. New York Chicago. San Francisco Lisbon London Madrid Mexico City The Complete Reference Christopher Adamson Mc Grauu LlLIJBB New York Chicago San Francisco Lisbon London Madrid Mexico City Milan New Delhi San Juan Seoul Singapore Sydney Toronto Contents Acknowledgments

More information

Data Warehousing. Overview

Data Warehousing. Overview Data Warehousing Overview Basic Definitions Normalization Entity Relationship Diagrams (ERDs) Normal Forms Many to Many relationships Warehouse Considerations Dimension Tables Fact Tables Star Schema Snowflake

More information

Data transfer, storage and analysis for data mart enlargement

Data transfer, storage and analysis for data mart enlargement Data transfer, storage and analysis for data mart enlargement PROKOPOVA ZDENKA, SILHAVY PETR, SILHAVY RADEK Department of Computer and Communication Systems Faculty of Applied Informatics Tomas Bata University

More information

Measuring the functional size of a data warehouse application using COSMIC-FFP

Measuring the functional size of a data warehouse application using COSMIC-FFP Measuring the functional size of a data warehouse application using COSMIC-FFP Harold van Heeringen Abstract A data warehouse system is not the kind of traditional system that is easily sized with FPA,

More information

Database design View Access patterns Need for separate data warehouse:- A multidimensional data model:-

Database design View Access patterns Need for separate data warehouse:- A multidimensional data model:- UNIT III: Data Warehouse and OLAP Technology: An Overview : What Is a Data Warehouse? A Multidimensional Data Model, Data Warehouse Architecture, Data Warehouse Implementation, From Data Warehousing to

More information

Drawing the Big Picture

Drawing the Big Picture Drawing the Big Picture Multi-Platform Data Architectures, Queries, and Analytics Philip Russom TDWI Research Director for Data Management August 26, 2015 Sponsor 2 Speakers Philip Russom TDWI Research

More information

A Data Warehouse Implementation Using the Star Schema. For an outpatient hospital information system

A Data Warehouse Implementation Using the Star Schema. For an outpatient hospital information system A Data Warehouse Implementation Using the Star Schema For an outpatient hospital information system GurvinderKaurJosan Master of Computer Application,YMT College of Management Kharghar, Navi Mumbai ---------------------------------------------------------------------***----------------------------------------------------------------

More information

Learning Alliance Corporation, Inc. For more info: go to

Learning Alliance Corporation, Inc. For more info: go to Writing Queries Using Microsoft SQL Server Transact-SQL Length: 3 Day(s) Language(s): English Audience(s): IT Professionals Level: 200 Technology: Microsoft SQL Server Type: Course Delivery Method: Instructor-led

More information

Informatica Power Center 10.1 Developer Training

Informatica Power Center 10.1 Developer Training Informatica Power Center 10.1 Developer Training Course Overview An introduction to Informatica Power Center 10.x which is comprised of a server and client workbench tools that Developers use to create,

More information

The COSMIC Functional Size Measurement Method Version 4.0.1

The COSMIC Functional Size Measurement Method Version 4.0.1 The COSMIC Functional Size Measurement Method Version 4.0.1 Guideline for sizing Data Warehouse Application Software Version 1.1 April 2015 Acknowledgements Reviewers of v1.1 (alphabetical order) Diana

More information

Data-Driven Driven Business Intelligence Systems: Parts I. Lecture Outline. Learning Objectives

Data-Driven Driven Business Intelligence Systems: Parts I. Lecture Outline. Learning Objectives Data-Driven Driven Business Intelligence Systems: Parts I Week 5 Dr. Jocelyn San Pedro School of Information Management & Systems Monash University IMS3001 BUSINESS INTELLIGENCE SYSTEMS SEM 1, 2004 Lecture

More information

The Use of Soft Systems Methodology for the Development of Data Warehouses

The Use of Soft Systems Methodology for the Development of Data Warehouses The Use of Soft Systems Methodology for the Development of Data Warehouses Roelien Goede School of Information Technology, North-West University Vanderbijlpark, 1900, South Africa ABSTRACT When making

More information

The Data Organization

The Data Organization C V I T F E P A O TM The Data Organization Best Practices Metadata Dictionary Application Architecture Prepared by Rainer Schoenrank January 2017 Table of Contents 1. INTRODUCTION... 3 1.1 PURPOSE OF THE

More information

Teradata Aggregate Designer

Teradata Aggregate Designer Data Warehousing Teradata Aggregate Designer By: Sam Tawfik Product Marketing Manager Teradata Corporation Table of Contents Executive Summary 2 Introduction 3 Problem Statement 3 Implications of MOLAP

More information

Ontology based Model and Procedure Creation for Topic Analysis in Chinese Language

Ontology based Model and Procedure Creation for Topic Analysis in Chinese Language Ontology based Model and Procedure Creation for Topic Analysis in Chinese Language Dong Han and Kilian Stoffel Information Management Institute, University of Neuchâtel Pierre-à-Mazel 7, CH-2000 Neuchâtel,

More information

Data Warehouse and Data Mining

Data Warehouse and Data Mining Data Warehouse and Data Mining Lecture No. 07 Terminologies Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology Jamshoro Database

More information

Benefits of Automating Data Warehousing

Benefits of Automating Data Warehousing Benefits of Automating Data Warehousing Introduction Data warehousing can be defined as: A copy of data specifically structured for querying and reporting. In most cases, the data is transactional data

More information

IT DATA WAREHOUSING AND DATA MINING UNIT-2 BUSINESS ANALYSIS

IT DATA WAREHOUSING AND DATA MINING UNIT-2 BUSINESS ANALYSIS PART A 1. What are production reporting tools? Give examples. (May/June 2013) Production reporting tools will let companies generate regular operational reports or support high-volume batch jobs. Such

More information

DATA WAREHOUSE- MODEL QUESTIONS

DATA WAREHOUSE- MODEL QUESTIONS DATA WAREHOUSE- MODEL QUESTIONS 1. The generic two-level data warehouse architecture includes which of the following? a. At least one data mart b. Data that can extracted from numerous internal and external

More information

Table of Contents. Knowledge Management Data Warehouses and Data Mining. Introduction and Motivation

Table of Contents. Knowledge Management Data Warehouses and Data Mining. Introduction and Motivation Table of Contents Knowledge Management Data Warehouses and Data Mining Dr. Michael Hahsler Dept. of Information Processing Vienna Univ. of Economics and BA 11. December 2001

More information

Knowledge Management Data Warehouses and Data Mining

Knowledge Management Data Warehouses and Data Mining Knowledge Management Data Warehouses and Data Mining Dr. Michael Hahsler Dept. of Information Processing Vienna Univ. of Economics and BA 11. December 2001 1 Table of Contents

More information

Advanced Modeling and Design

Advanced Modeling and Design Advanced Modeling and Design 1. Advanced Multidimensional Modeling Handling changes in dimensions Large-scale dimensional modeling 2. Design Methodologies 3. Project Management Acknowledgements: I am indebted

More information

Application software office packets, databases and data warehouses.

Application software office packets, databases and data warehouses. Introduction to Computer Systems (9) Application software office packets, databases and data warehouses. Piotr Mielecki Ph. D. http://www.wssk.wroc.pl/~mielecki piotr.mielecki@pwr.edu.pl pmielecki@gmail.com

More information