Decision Support Systems aka Analytical Systems

Similar documents
Data Mining Concepts & Techniques

An Overview of Data Warehousing and OLAP Technology

DATA WAREHOUSE EGCO321 DATABASE SYSTEMS KANAT POOLSAWASD DEPARTMENT OF COMPUTER ENGINEERING MAHIDOL UNIVERSITY

Data Warehouses. Yanlei Diao. Slides Courtesy of R. Ramakrishnan and J. Gehrke

OPEN LAB: HOSPITAL. An hospital needs a DM to extract information from their operational database with information about inpatients treatments.

Basics of Dimensional Modeling

CHAPTER 8 DECISION SUPPORT V2 ADVANCED DATABASE SYSTEMS. Assist. Prof. Dr. Volkan TUNALI

DATA WAREHOUING UNIT I

Decision Support, Data Warehousing, and OLAP

IT DATA WAREHOUSING AND DATA MINING UNIT-2 BUSINESS ANALYSIS

Data Warehousing ETL. Esteban Zimányi Slides by Toon Calders

MIS2502: Data Analytics Dimensional Data Modeling. Jing Gong

OLAP2 outline. Multi Dimensional Data Model. A Sample Data Cube

Evolution of Database Systems

Data Warehousing & OLAP

Data Warehousing and OLAP

Data Warehouses Chapter 12. Class 10: Data Warehouses 1

Summary of Last Chapter. Course Content. Chapter 2 Objectives. Data Warehouse and OLAP Outline. Incentive for a Data Warehouse

Fig 1.2: Relationship between DW, ODS and OLTP Systems

Data Warehouse. Asst.Prof.Dr. Pattarachai Lalitrojwong

CHAPTER 8: ONLINE ANALYTICAL PROCESSING(OLAP)

Adnan YAZICI Computer Engineering Department

collection of data that is used primarily in organizational decision making.

Syllabus. Syllabus. Motivation Decision Support. Syllabus

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing.

What is a Data Warehouse?

STRATEGIC INFORMATION SYSTEMS IV STV401T / B BTIP05 / BTIX05 - BTECH DEPARTMENT OF INFORMATICS. By: Dr. Tendani J. Lavhengwa

On-Line Analytical Processing (OLAP) Traditional OLTP

Data Warehousing and Decision Support. Introduction. Three Complementary Trends. [R&G] Chapter 23, Part A

A Multi-Dimensional Data Model

Aggregating Knowledge in a Data Warehouse and Multidimensional Analysis

Data Warehouses and OLAP. Database and Information Systems. Data Warehouses and OLAP. Data Warehouses and OLAP

Database design View Access patterns Need for separate data warehouse:- A multidimensional data model:-

Data Warehouse and Data Mining

Data warehouses Decision support The multidimensional model OLAP queries

A Star Schema Has One To Many Relationship Between A Dimension And Fact Table

Data Warehousing and Decision Support

CSPP 53017: Data Warehousing Winter 2013! Lecture 7! Svetlozar Nestorov! Class News!

Data Warehousing and Decision Support

Chapter 4, Data Warehouse and OLAP Operations

Data Mining & Data Warehouse

DATA MINING AND WAREHOUSING

Chapter 3. The Multidimensional Model: Basic Concepts. Introduction. The multidimensional model. The multidimensional model

Data Warehousing and Decision Support (mostly using Relational Databases) CS634 Class 20

Data Warehouses. Vera Goebel. Fall Department of Informatics, University of Oslo

Sql Fact Constellation Schema In Data Warehouse With Example

QUALITY MONITORING AND

CSE 544 Principles of Database Management Systems. Alvin Cheung Fall 2015 Lecture 8 - Data Warehousing and Column Stores

Lectures for the course: Data Warehousing and Data Mining (IT 60107)

Information Management course

Tribhuvan University Institute of Science and Technology MODEL QUESTION

Introduction to Data Warehousing

Question Bank. 4) It is the source of information later delivered to data marts.

Information Management course

CS 245: Database System Principles. Warehousing. Outline. What is a Warehouse? What is a Warehouse? Notes 13: Data Warehousing

CS 1655 / Spring 2013! Secure Data Management and Web Applications

Data Mining. Associate Professor Dr. Raed Ibraheem Hamed. University of Human Development, College of Science and Technology

BUSINESS INTELLIGENCE. SSAS - SQL Server Analysis Services. Business Informatics Degree

The Evolution of Data Warehousing. Data Warehousing Concepts. The Evolution of Data Warehousing. The Evolution of Data Warehousing

Dta Mining and Data Warehousing

MIS2502: Data Analytics Dimensional Data Modeling. Jing Gong

Data Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1396

Data Warehousing & OLAP

ETL and OLAP Systems

Business Intelligence An Overview. Zahra Mansoori

Managing Information Resources

Data Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1394

Unit 7: Basics in MS Power BI for Excel 2013 M7-5: OLAP

Create Cube From Star Schema Grouping Framework Manager

1. Analytical queries on the dimensionally modeled database can be significantly simpler to create than on the equivalent nondimensional database.

Data Warehousing & On-Line Analytical Processing

CS377: Database Systems Data Warehouse and Data Mining. Li Xiong Department of Mathematics and Computer Science Emory University

Rocky Mountain Technology Ventures

Data Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1394

Data warehouse design

REPORTING AND QUERY TOOLS AND APPLICATIONS

Chapter 18: Data Analysis and Mining

The University of Iowa Intelligent Systems Laboratory The University of Iowa Intelligent Systems Laboratory

Acknowledgment. MTAT Data Mining. Week 7: Online Analytical Processing and Data Warehouses. Typical Data Analysis Process.

Pentaho BI Suite. Basic notions of OLAP cubes and their implementation with Schema Workbench edited by Vladan Mijatovic

The strategic advantage of OLAP and multidimensional analysis

Analytical data bases Database lectures for math

ALTERNATE SCHEMA DIAGRAMMING METHODS DECISION SUPPORT SYSTEMS. CS121: Relational Databases Fall 2017 Lecture 22

CSE 544 Principles of Database Management Systems. Fall 2016 Lecture 14 - Data Warehousing and Column Stores

On-Line Application Processing

Information Management course

Course Number : SEWI ZG514 Course Title : Data Warehousing Type of Exam : Open Book Weightage : 60 % Duration : 180 Minutes

~ Ian Hunneybell: DWDM Revision Notes (31/05/2006) ~

Data Warehousing 2. ICS 421 Spring Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa

Data warehouse architecture consists of the following interconnected layers:

DHANALAKSHMI COLLEGE OF ENGINEERING, CHENNAI

Seminars of Software and Services for the Information Society. Data Warehousing Design Issues

Chapter 13 Business Intelligence and Data Warehouses The Need for Data Analysis Business Intelligence. Objectives

Improving the Performance of OLAP Queries Using Families of Statistics Trees

International Journal of Scientific & Engineering Research, Volume 7, Issue 11, November ISSN

Data Warehousing. Overview

1 DATAWAREHOUSING QUESTIONS by Mausami Sawarkar

Data Warehousing & On-line Analytical Processing

A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing.

Transcription:

Decision Support Systems aka Analytical Systems

Decision Support Systems Systems that are used to transform data into information, to manage the organization: OLAP vs OLTP

OLTP vs OLAP Transactions Analysis Objectives Operations Decisions Users Operatives Managers and analyists Detail Analytic May be aggregated Data origin Internal Internal and external Applications Known a priori Ad hoc Data items Few (tens) Many (millions) Per operation Updates Frequent, Small Rare, Mostly massive State of data Current data Historic Stored in Data Base Data Warehouse / DSS

Operational systems vs DSS Operational systems: Data are organized in a DB. Data are managed by a traditional DBMS. Applications used to perform structured operational activities Decision Support Systems: Data are organized in a separate specialized DB (Data Warehouse (DW)). Data are managed by a specialized DBMS. Business Intelligence applications used to analyze data

Data Warehouse The first definition of data warehouse was provided by William Inmon in 1990. A DW is a specialized database static (non volatile), with integrated data from different data sources organized to analyze subjects of interest, with historical data, used to produce summarized data to support decisionmaking processes

Building the data warehouse DSS + Data Visualization Tools

DW design The creation of a data warehouse takes place gradually, at different levels of abstraction: a conceptual model, a logical model and physical model. DW Design

Data models for DW The Dimensional Fact Model (DFM) is a graphical conceptual model. The Relational Data Model, as a logical model The Multidimensional Model (called Cube), useful to understand OLAP operations DW Design

Design of a Data Mart Number of items ordered, by product, by customer, by month Total revenue by product category, by customer, by year Total revenue from Italian customers by customer city, by year, by quarter DATA BASE BUSINESS QUESTIONS DW Design

A data model for conceptual design Object data model: Classes with Attributes Data warehouse data model Facts tables with Measures Dimensions DW Design

The facts Facts are typically transactions / events Sales, clicks, complains, visits Periodic facts One fact for a group of transactions made over a period of time. Example: the monthly balance of all monthly transactions Accumulating fact One fact for the entire lifetime of an evolving event that has a duration and change over time Example: the life cycle of a mortgage application DW Design

The measures Measures: quantative information whose aggregation is of interest Aggregation is often Sum, but not always (count, average ) DW Design

Dimensions Dimensions relate to who, what, why, when, and where: Who is involved? What is about? When happens? Where occurs? Dimensions: the variables that influence the measures and indicate possible intervention levers How does a measure depend on a dimension? DW Design

Dimensions have attributes and hierarchies Without hierarchies With hierarchies DW Design

Case study An hospital needs a DM to extract information from their operational database with information about inpatients treatments. DW Design 1. Total billed amount for hospitalizations, by diagnosis code and description, by month (year). 2. Total number of hospitalizations and billed amount, by ward, by patient gender (age at date of admission, city, region). 3. Total billed amount, average length of stay and average waiting time, by diagnosis code and description, by name (specialization) of the physician who has admitted the patient. 4. Total billed amount, and average waiting time of admission, by patient age (region), by treatment code (description).

Requirements specification DW Design

Requirements specification DW Design

Data mart conceptual schema DATA BASE DATA MART DW Design

Multidimensional model (cube) The cube model helps understanding interactive data analysis ProductId * D3 D2 D1 DateId * P3 P2 P1 S1 S2 S3 * StoreId

2-D Cube Fact Table 2-D Cube Cross tabulation

3-D Cube Fact Table 3-D Cube

Cube operator: Slice Sales SLICE FOR DateId = D1 ; SLICE

Cube operator: Dice Sales DICE FOR DateId = D1 StoreId IN ( S1, S2 ); DICE

Cube operator: Pivot PIVOT (Sales SLICE FOR DateId = D1 ); SLICE PIVOT Data cube

Cube operators: Roll-up Data cube

Cube operators: Roll-up and Drill-down Roll-up aggregates data by dimension reduction or by navigating dimension hierarchy Drill-down: the reverse SALES ROLL-UP ON DateId Data cube

Extended cube Extend each dimension with *, mapped to the sum Data cube

Extended cube The cube is now a set of cuboids white cells are the data cube gray cells are rolledup by a dimension dark gray cells are rolled-up by two dimensions black cells are rolledup by all dimensions. * D3 D2 D1 * P3 P2 P1 S1 S2 S3 * Data cube

8.30 DW Lattice: a lattice of cuboids On the set of cuboids is defined the following partial order relation: C1 C2 if C1 can be computed by C2 Data cube

Cuboid materialization If the materialization is partial, which cuboids do we store? Data cube

Relational representation Relational OLAP systems are relational DBMS extended with specific features to support business intelligence analysis A DW is represented with a special kind of relational schema: star schema snowflake schema constellation schema

Star schema A fact table with a foreign key for each dimension Each dimension has one table with an artificial key

Snowflake schema Dimension tables are normalized Not very popular DW Design

Constellation schema DW Design

Data Warehouse Management System Interface: what to change? Engine: what to change?