ECT7110 Introduction to Data Warehousing

Similar documents
ECLT 5810 Introduction to Data Warehousing

What is a Data Warehouse?

Information Management course

Information Management course

Data Warehousing & OLAP

Data Mining. Associate Professor Dr. Raed Ibraheem Hamed. University of Human Development, College of Science and Technology

Analyse des Données. Master 2 IMAFA. Andrea G. B. Tettamanzi

Data Mining & Data Warehouse

Chapter 4, Data Warehouse and OLAP Operations

Dta Mining and Data Warehousing

Jarek Szlichta Acknowledgments: Jiawei Han, Micheline Kamber and Jian Pei, Data Mining - Concepts and Techniques

Data Warehousing (1)

Data Warehousing & On-Line Analytical Processing

Data Warehousing & On-line Analytical Processing

Information Management course

CS490D: Introduction to Data Mining Chris Clifton

Data Warehouse. Concepts and Techniques. Chapter 3. SS Chung. April 5, 2013 Data Mining: Concepts and Techniques 1

CS 412 Intro. to Data Mining

A Multi-Dimensional Data Model

Introduction to Data Warehousing

Data mining: Hmm, what is it?

Data Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1394

Data Warehousing & OLAP

Data Warehousing. Wolf-Tilo Balke Silviu Homoceanu Institut für Informationssysteme Technische Universität Braunschweig

A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective

Database design View Access patterns Need for separate data warehouse:- A multidimensional data model:-

MSCIT 5210/MSCBD 5002: Knowledge Discovery and Data Mining

CS490D: Introduction to Data Mining Prof. Chris Clifton

Data Cube Technology

Data Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1396

DATA WAREHOUSING & DATA MINING. by: Prof. Asha Ambhaikar

UNIT 4. DATA WAREHOUSING

Data Cube Technology. Chapter 5: Data Cube Technology. Data Cube: A Lattice of Cuboids. Data Cube: A Lattice of Cuboids

Data Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1394

Question Bank. 4) It is the source of information later delivered to data marts.

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing.

Business Intelligence

Chapter 5, Data Cube Computation

By Mahesh R. Sanghavi Associate professor, SNJB s KBJ CoE, Chandwad

Summary of Last Chapter. Course Content. Chapter 2 Objectives. Data Warehouse and OLAP Outline. Incentive for a Data Warehouse

OLAP2 outline. Multi Dimensional Data Model. A Sample Data Cube

Managing Information Resources

CS490D: Introduction to Data Mining Prof. Chris Clifton. Project Presentations

Outline. Managing Information Resources. Concepts and Definitions. Introduction. Chapter 7

MSCIT 5210/MSCBD 5002: Knowledge Discovery and Data Mining

Data Warehousing 2. ICS 421 Spring Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa

DATA WAREHOUSE AND DATA MINING

Basics of Dimensional Modeling

Data Mining Concepts & Techniques

Evolution of Database Systems

Acknowledgment. MTAT Data Mining. Week 7: Online Analytical Processing and Data Warehouses. Typical Data Analysis Process.

Decision Support Systems aka Analytical Systems

Reminds on Data Warehousing

Fig 1.2: Relationship between DW, ODS and OLTP Systems

ETL and OLAP Systems

Data Warehousing & OLAP

On-Line Analytical Processing (OLAP) Traditional OLTP

Full file at

Data Mining: Concepts and Techniques. (3 rd ed.) Chapter 5

Reporting and Query tools and Applications

Data Warehousing Introduction. Toon Calders

An Overview of Data Warehousing and OLAP Technology

Decision Support Systems

Table of Contents. Rajesh Pandey Page 1

Data Warehousing and Decision Support. Introduction. Three Complementary Trends. [R&G] Chapter 23, Part A

collection of data that is used primarily in organizational decision making.

Data Warehousing and OLAP Technologies for Decision-Making Process

QUALITY MONITORING AND

Syllabus. Syllabus. Motivation Decision Support. Syllabus

The University of Iowa Intelligent Systems Laboratory The University of Iowa Intelligent Systems Laboratory

CS377: Database Systems Data Warehouse and Data Mining. Li Xiong Department of Mathematics and Computer Science Emory University

Data warehouse architecture consists of the following interconnected layers:

Data Warehousing and Decision Support

Decision Support, Data Warehousing, and OLAP

Data Warehousing and Decision Support

Cognos also provides you an option to export the report in XML or PDF format or you can view the reports in XML format.

An Overview of Data Warehousing and OLAP Technology

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing.

WKU-MIS-B10 Data Management: Warehousing, Analyzing, Mining, and Visualization. Management Information Systems

Data Warehouses Chapter 12. Class 10: Data Warehouses 1

MOLAP Data Warehouse of a Software Products Servicing Call Center

This tutorial has been prepared for computer science graduates to help them understand the basic-to-advanced concepts related to data mining.

DHANALAKSHMI COLLEGE OF ENGINEERING, CHENNAI

DW Performance Optimization (II)

Data Warehouse and Data Mining

Lecture 2 and 3 - Dimensional Modelling

Overview. Introduction to Data Warehousing and Business Intelligence. BI Is Important. What is Business Intelligence (BI)?

CHAPTER 8 DECISION SUPPORT V2 ADVANCED DATABASE SYSTEMS. Assist. Prof. Dr. Volkan TUNALI

Enterprise Informatization LECTURE

Adnan YAZICI Computer Engineering Department

Decision Support Systems

Data Warehouse and Data Mining

TIM 50 - Business Information Systems

Data Warehouses. Yanlei Diao. Slides Courtesy of R. Ramakrishnan and J. Gehrke

Database Vs. Data Warehouse

Data Mining: Concepts and Techniques. Example of Star Schema. Multidimensional Data. A Short Recap on Data Cubes Data Cube: A Lattice of Cuboids

IT DATA WAREHOUSING AND DATA MINING UNIT-2 BUSINESS ANALYSIS

CHAPTER 3 Implementation of Data warehouse in Data Mining

What is Data Warehouse?

Course Objective. Data Warehousing & Mining. Course Outline. Course Outline

Transcription:

ECT7110 Introduction to Data Warehousing Prof. Wai Lam ECT7110 Introduction to Data Warehousing 1

What is Data Warehouse? Defined in many different ways, but not rigorously. A decision support database that is maintained separately from the organization s operational database Support information processing by providing a solid platform of consolidated, historical data for analysis. Provides tools for business executives to systematically organize and use the data to make strategic decisions A data warehouse is a subject-oriented,integrated, time-variant, and nonvolatile collection of data in support of management s decision-making process. W. H. Inmon Data warehousing: The process of constructing and using data warehouses ECT7110 Introduction to Data Warehousing 2

Data Warehouse Subject-Oriented Organized around major subjects, such as customer, product, sales. Focusing on the modeling and analysis of data for decision makers, not on daily operations or transaction processing. Provide a simple and concise view around particular subject issues by excluding data that are not useful in the decision support process. ECT7110 Introduction to Data Warehousing 3

Data Warehouse Integrated Constructed by integrating multiple, heterogeneous data sources relational databases, flat files, on-line transaction records Data cleaning and data integration techniques are applied. Ensure consistency in naming conventions, encoding structures, attribute measures, etc. among different data sources E.g., Hotel price: currency, tax, breakfast covered, etc. When data is moved to the warehouse, it is converted. ECT7110 Introduction to Data Warehousing 4

Data Warehouse Time Variant The time horizon for the data warehouse is significantly longer than that of operational systems. Operational database: current value data. Data warehouse data: provide information from a historical perspective (e.g., past 5-10 years) Every key structure in the data warehouse Contains an element of time, explicitly or implicitly But the key of operational data may or may not contain time element. ECT7110 Introduction to Data Warehousing 5

Data Warehouse Non-Volatile A physically separate store of data transformed from the operational environment. Operational update of data does not occur in the data warehouse environment. Does not require transaction processing, recovery, and concurrency control mechanisms Requires only two operations in data accessing: initial loading of data and access of data. ECT7110 Introduction to Data Warehousing 6

From Tables and Spreadsheets to Data Cubes A data warehouse is based on a multidimensional data model which views data in the form of a data cube. A data cube, such as sales, allows data to be modeled and viewed in multiple dimensions It is defined by dimensions and facts Dimension tables, such as item (item_name, brand, type), or time(day, week, month, quarter, year) Fact table contains measures (such as dollars_sold) and keys to each of the related dimension table ECT7110 Introduction to Data Warehousing 7

Multidimensional Data Sales volume as a function of product, month, and region Region Dimensions: Product, Location, Time Hierarchical summarization paths Industry Region Year Category Country Quarter Product Product City Month Week Office Day Month ECT7110 Introduction to Data Warehousing 8

4-D Cube (Supplier dimension) ECT7110 Introduction to Data Warehousing 9

A Sample Data Cube TV PC VCR sum Product Date 1Qtr 2Qtr 3Qtr 4Qtr sum Total annual sales of TV in U.S.A. U.S.A Canada Mexico Country sum ECT7110 Introduction to Data Warehousing 10

Cuboids In data warehousing literature, an n-d base cube is called a base cuboid. The top most 0-D cuboid, which holds the highest-level of summarization, is called the apex cuboid. The lattice of cuboids forms a data cube. ECT7110 Introduction to Data Warehousing 11

Cuboids Corresponding to the Cube all product date country product,date product,country date, country 0-D(apex) cuboid 1-D cuboids 2-D cuboids product, date, country 3-D(base) cuboid ECT7110 Introduction to Data Warehousing 12

Browsing a Data Cube Visualization OLAP capabilities Interactive manipulation ECT7110 Introduction to Data Warehousing 13

Cube: A Lattice of Cuboids all time item location supplier 0-D(apex) cuboid 1-D cuboids time,item time,location item,location time,supplier location,supplier item,supplier time,location,supplier 2-D cuboids 3-D cuboids time,item,location time,item,supplier item,location,supplier time, item, location, supplier 4-D(base) cuboid ECT7110 Introduction to Data Warehousing 14