Data Mining & Data Warehouse

Similar documents
Data Mining. Associate Professor Dr. Raed Ibraheem Hamed. University of Human Development, College of Science and Technology

Data Warehousing (1)

Information Management course

Question Bank. 4) It is the source of information later delivered to data marts.

Data Warehouse and Data Mining

A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective

ECT7110 Introduction to Data Warehousing

ECLT 5810 Introduction to Data Warehousing

Managing Information Resources

Outline. Managing Information Resources. Concepts and Definitions. Introduction. Chapter 7

Data Mining Concepts & Techniques

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing.

Evolution of Database Systems

What is a Data Warehouse?

Introduction to Data Warehousing

Syllabus. Syllabus. Motivation Decision Support. Syllabus

CHAPTER 3 Implementation of Data warehouse in Data Mining

Summary of Last Chapter. Course Content. Chapter 2 Objectives. Data Warehouse and OLAP Outline. Incentive for a Data Warehouse

Information Management course

Data Mining & Analytics Data Mining Reference Model Data Warehouse Legal and Ethical Issues. Slides by Michael Hahsler

Decision Support, Data Warehousing, and OLAP

Data Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1394

On-Line Analytical Processing (OLAP) Traditional OLTP

Data Mining & Data Warehouse

CS377: Database Systems Data Warehouse and Data Mining. Li Xiong Department of Mathematics and Computer Science Emory University

Data Warehousing & OLAP

Databases and Data Warehouses

Data Warehousing and OLAP Technologies for Decision-Making Process

Data Mining. Asso. Profe. Dr. Raed Ibraheem Hamed. University of Human Development, College of Science and Technology Department of CS (1)

Chapter 4, Data Warehouse and OLAP Operations

Knowledge Discovery & Data Mining

Fig 1.2: Relationship between DW, ODS and OLTP Systems

TIM 50 - Business Information Systems

IT DATA WAREHOUSING AND DATA MINING UNIT-2 BUSINESS ANALYSIS

Introduction to Databases

ISM 50 - Business Information Systems

Data Warehouse and Data Mining

DATA MINING TRANSACTION

Data Warehouse and Data Mining

Data Warehouses Chapter 12. Class 10: Data Warehouses 1

Full file at

Data Warehousing and OLAP

TIM 50 - Business Information Systems

Data Warehousing. Data Warehousing and Mining. Lecture 8. by Hossen Asiful Mustafa

Introduction to Databases

STRATEGIC INFORMATION SYSTEMS IV STV401T / B BTIP05 / BTIX05 - BTECH DEPARTMENT OF INFORMATICS. By: Dr. Tendani J. Lavhengwa

Jarek Szlichta Acknowledgments: Jiawei Han, Micheline Kamber and Jian Pei, Data Mining - Concepts and Techniques

Decision Support Systems aka Analytical Systems

Database Vs. Data Warehouse

Data Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1396

CS614 - Data Warehousing - Midterm Papers Solved MCQ(S) (1 TO 22 Lectures)

Database design View Access patterns Need for separate data warehouse:- A multidimensional data model:-

Data Warehousing. Ritham Vashisht, Sukhdeep Kaur and Shobti Saini

Data Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1394

Data Warehouse. Asst.Prof.Dr. Pattarachai Lalitrojwong

Rocky Mountain Technology Ventures

CHAPTER 8 DECISION SUPPORT V2 ADVANCED DATABASE SYSTEMS. Assist. Prof. Dr. Volkan TUNALI

Chapter 3. Databases and Data Warehouses: Building Business Intelligence

Testing Masters Technologies

Course Objective. Data Warehousing & Mining. Course Outline. Course Outline

Overview. Introduction to Data Warehousing and Business Intelligence. BI Is Important. What is Business Intelligence (BI)?

Data Warehouse and Mining

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing.

Data Warehousing Introduction. Toon Calders

DHANALAKSHMI COLLEGE OF ENGINEERING, CHENNAI

Data Warehouse and Data Mining

Data Warehousing & Mining. Data integration. OLTP versus OLAP. CPS 116 Introduction to Database Systems

Data Mining and Warehousing

Improving Resource Management And Solving Scheduling Problem In Dataware House Using OLAP AND OLTP Authors Seenu Kohar 1, Surender Singh 2

Analyse des Données. Master 2 IMAFA. Andrea G. B. Tettamanzi

Data Warehousing & Mining Techniques

DATA WAREHOUSING IN LIBRARIES FOR MANAGING DATABASE

Chapter 13 Business Intelligence and Data Warehouses The Need for Data Analysis Business Intelligence. Objectives

Data warehousing in telecom Industry

Q1) Describe business intelligence system development phases? (6 marks)

Partner Presentation Faster and Smarter Data Warehouses with Oracle OLAP 11g

2. Summary. 2.1 Basic Architecture. 2. Architecture. 2.1 Staging Area. 2.1 Operational Data Store. Last week: Architecture and Data model

Evolving To The Big Data Warehouse

Advanced Applications of Data Warehousing Using 3-tier Architecture

Data Warehousing & Mining Techniques

Knowledge Modelling and Management. Part B (9)

Handout 12 Data Warehousing and Analytics.

1/12/2018. APPA Institute Dallas, TX Feb DATA INTEGRATION PURPOSE OF TODAY S PRESENTATION

Data Warehouses. Vera Goebel. Fall Department of Informatics, University of Oslo

Data Warehousing and Data Mining. Announcements (December 1) Data integration. CPS 116 Introduction to Database Systems

CHAPTER 8: ONLINE ANALYTICAL PROCESSING(OLAP)

KORA. Business Intelligence An Introduction

DATA MINING AND WAREHOUSING

Data Management Framework

Data Warehousing. Adopted from Dr. Sanjay Gunasekaran

The Evolution of Data Warehousing. Data Warehousing Concepts. The Evolution of Data Warehousing. The Evolution of Data Warehousing

Data Warehousing & OLAP

Data-Driven Driven Business Intelligence Systems: Parts I. Lecture Outline. Learning Objectives

WKU-MIS-B10 Data Management: Warehousing, Analyzing, Mining, and Visualization. Management Information Systems

Cognos also provides you an option to export the report in XML or PDF format or you can view the reports in XML format.

A Data Warehouse Implementation Using the Star Schema. For an outpatient hospital information system

CHAPTER 3 BUILDING ARCHITECTURAL DATA WAREHOUSE FOR CANCER DISEASE

Data Warehousing & On-Line Analytical Processing

Unit 1.

IT1105 Information Systems and Technology. BIT 1 ST YEAR SEMESTER 1 University of Colombo School of Computing. Student Manual

Transcription:

Data Mining & Data Warehouse Associate Professor Dr. Raed Ibraheem Hamed University of Human Development, College of Science and Technology (1) 2016 2017 1

Points to Cover Why Do We Need Data Warehouses? Operational System What is Data Warehouse? Data Warehouse Subject-Oriented Data Warehouse Integrated Data Warehouse Time Variant Data Warehouse Non-Volatile Data Warehouse vs. Operational DBMS OLTP vs. OLAP Why Separate Data Warehouse? Data Warehouse Architecture: Basic (2) 2

Why Do We Need Data Warehouses? 1. Unification of information resources. Improved query performance Separate research and decision support functions from the operational systems. 2. The data stored in the warehouse is uploaded from the operational systems. The data may pass through an operational data store for additional operations before it is used in the DW for reporting. (3) 3

Operational System An operational system is a term used in data warehousing to refer to a system that is used to process the day-to-day transactions of an organization. These systems are designed in a manner that processing of day-to-day transactions is performed efficiently and the integrity of the transactional data is preserved. Sales Purchases Expenses (4) 4

What is Data Warehouse? Defined in many different ways, but not rigorously:- 1. A decision support database that is maintained separately from the organization s operational database. 2. A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile collection of data in support of management s decision-making process. by William H. Inmon 5

Data Warehouse Subject-Oriented Organized around major subjects, such as customer, product, sales. Focusing on the modeling and analysis of data for decision makers, not on daily operations or transaction processing. Provide a simple and concise view around particular subject issues by excluding data that are not useful in the decision support process. 6

Data Warehouse Integrated 1. Constructed by integrating multiple, heterogeneous data sources relational databases, flat files, on-line transaction records 2. Data cleaning and data integration techniques are applied. Ensure consistency in naming conventions, encoding structures, attribute measures, etc. among different data sources E.g., Hotel price: currency, tax, breakfast covered, etc. When data is moved to the warehouse, it is converted. 7

Data Warehouse Time Variant The time horizon for the data warehouse is significantly longer than that of operational systems. Operational database: current value data. Data warehouse data: provide information from a historical perspective (e.g., past 5-10 years) Every key structure in the data warehouse Contains an element of time, explicitly or implicitly But the key of operational data may or may not contain time element. 8

Data Warehouse Non-Volatile A physically separate store of data transformed from the operational environment. Operational update of data does not occur in the data warehouse environment. Does not require transaction processing, recovery, and concurrency control mechanisms Requires only two operations in data accessing: initial loading of data and access of data. 9

Data Warehouse Versus Operational DBMS Operational DBMS 10

Data Warehouse Versus Operational DBMS OLTP (on-line transaction processing) 1. Major task of traditional relational DBMS 2. Day-to-day operations: purchasing, inventory, banking, manufacturing, payroll, registration, accounting, etc. OLAP (on-line analytical processing) 1. Major task of data warehouse system 2. Data analysis and decision making 11

Difference Between OLTP and OLAP OLTP OLAP users Writer, IT professional knowledge worker function day to day operations decision support DB design application-oriented subject-oriented data access current, up-to-date detailed, flat relational isolated read/write index/hash on primary key unit of work short, simple transaction complex query # records accessed tens millions #users thousands hundreds DB size 100MB-GB 100GB-TB historical, summarized, multidimensional integrated, consolidated lots of scans 12

Why Separate Data Warehouse? High performance for both systems Different functions and different data 13

Why Separate Data Warehouse (1)? 1. High performance for both systems DBMS tuned for OLTP: access methods, indexing, concurrency control, recovery Warehouse tuned for OLAP: complex OLAP queries, multidimensional view, consolidation. 14

Why Separate Data Warehouse(2)? 2. Different functions and different data: 15

Why Separate Data Warehouse(2)? 2. Different functions and different data: missing data: Decision support requires historical data which operational DBs do not typically maintain data consolidation: DS requires consolidation (aggregation, summarization) of data from heterogeneous sources data quality: different sources typically use inconsistent data representations, codes and formats which have to be suitable. 16

Data Warehouse Architecture: Basic shows a simple architecture for a data warehouse. End users directly access data derived from several source systems through the data warehouse. 17

Data Warehouse Architecture: with a Staging Area 18

Data Warehouse Architecture: with a Staging Area and Data Marts 19

20 Data Marts is quite common, you may want to customize your warehouse's architecture for different groups within your organization. You can do this by adding data marts, which are systems designed for a particular line of business. This figure illustrates an example where purchasing, sales, and inventories are separated. In this example, a financial analyst might want to analyze historical data for purchases and sales or mine historical data to make predictions about customer behavior.

(21) 21