Lectures for the course: Data Warehousing and Data Mining (IT 60107)

Similar documents
DATA WAREHOUING UNIT I

Tribhuvan University Institute of Science and Technology MODEL QUESTION

IT DATA WAREHOUSING AND DATA MINING UNIT-2 BUSINESS ANALYSIS

An Overview of Data Warehousing and OLAP Technology

GUJARAT TECHNOLOGICAL UNIVERSITY MASTER OF COMPUTER APPLICATIONS (MCA) Semester: IV

DATA WAREHOUSE EGCO321 DATABASE SYSTEMS KANAT POOLSAWASD DEPARTMENT OF COMPUTER ENGINEERING MAHIDOL UNIVERSITY

Data Warehouse and Data Mining

SCHEME OF COURSE WORK. Data Warehousing and Data mining

Table Of Contents: xix Foreword to Second Edition

Extended TDWI Data Modeling: An In-Depth Tutorial on Data Warehouse Design & Analysis Techniques

1. SQL Server Integration Services. What Is Microsoft BI? Core concept BI Introduction to SQL Server Integration Services

Call: SAS BI Course Content:35-40hours

CT75 DATA WAREHOUSING AND DATA MINING DEC 2015

Contents. Foreword to Second Edition. Acknowledgments About the Authors

Database design View Access patterns Need for separate data warehouse:- A multidimensional data model:-

ETL and OLAP Systems

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing.

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

PESIT- Bangalore South Campus Hosur Road (1km Before Electronic city) Bangalore

CHAPTER 8 DECISION SUPPORT V2 ADVANCED DATABASE SYSTEMS. Assist. Prof. Dr. Volkan TUNALI

UNIT -1 UNIT -II. Q. 4 Why is entity-relationship modeling technique not suitable for the data warehouse? How is dimensional modeling different?

Complete. The. Reference. Christopher Adamson. Mc Grauu. LlLIJBB. New York Chicago. San Francisco Lisbon London Madrid Mexico City

Data Warehouse. Asst.Prof.Dr. Pattarachai Lalitrojwong

02 Hr/week. Theory Marks. Internal assessment. Avg. of 2 Tests

Deccansoft Software Services Microsoft Silver Learning Partner. SSAS Syllabus

Create Cube From Star Schema Grouping Framework Manager

CT75 (ALCCS) DATA WAREHOUSING AND DATA MINING JUN

CHAKRA IT SOLUTIONS TO LEARN ABOUT OUR UNIQUE TRAINING PROCESS:

MSBI (SSIS, SSRS, SSAS) Course Content

Syllabus. Syllabus. Motivation Decision Support. Syllabus

MICROSOFT BUSINESS INTELLIGENCE (MSBI: SSIS, SSRS and SSAS)

CSPP 53017: Data Warehousing Winter 2013! Lecture 7! Svetlozar Nestorov! Class News!


Improving the Performance of OLAP Queries Using Families of Statistics Trees

Decision Support Systems aka Analytical Systems

SIDDHARTH GROUP OF INSTITUTIONS :: PUTTUR Siddharth Nagar, Narayanavanam Road QUESTION BANK (DESCRIPTIVE)

Data warehouses Decision support The multidimensional model OLAP queries

Sql Fact Constellation Schema In Data Warehouse With Example

Chapter 18: Data Analysis and Mining

Exam Questions

UNIT -1 UNIT -II. Q. 4 Why is entity-relationship modeling technique not suitable for the data warehouse? How is dimensional modeling different?

Data Warehouses and OLAP. Database and Information Systems. Data Warehouses and OLAP. Data Warehouses and OLAP

On-Line Analytical Processing (OLAP) Traditional OLTP

MICROSOFT BUSINESS INTELLIGENCE

DATA MINING AND WAREHOUSING

COMPUTER SCIENCE AND ENGINEERING TUTORIAL QUESTION BANK

Foundations of SQL Server 2008 R2 Business. Intelligence. Second Edition. Guy Fouche. Lynn Lang it. Apress*

Data Warehouses. Yanlei Diao. Slides Courtesy of R. Ramakrishnan and J. Gehrke

Chapter 18: Data Analysis and Mining

Data Warehouse and Data Mining

Chapter 13 Business Intelligence and Data Warehouses The Need for Data Analysis Business Intelligence. Objectives

Data Mining Concepts & Techniques

A Multi-Dimensional Data Model

Information Integration

Course Number : SEWI ZG514 Course Title : Data Warehousing Type of Exam : Open Book Weightage : 60 % Duration : 180 Minutes

Evolution of Database Systems

Unit 7: Basics in MS Power BI for Excel 2013 M7-5: OLAP

INSTITUTE OF AERONAUTICAL ENGINEERING (Autonomous) Dundigal, Hyderabad

On-Line Application Processing

Warehousing. Data Mining

Introduction to DWH / BI Concepts

Data Warehousing and OLAP

~ Ian Hunneybell: DWDM Revision Notes (31/05/2006) ~

Carnegie Mellon Univ. Dept. of Computer Science /615 DB Applications. Data mining - detailed outline. Problem

INSTITUTE OF AERONAUTICAL ENGINEERING (Autonomous) Dundigal, Hyderabad

Data Warehousing Conclusion. Esteban Zimányi Slides by Toon Calders

IDU0010 ERP,CRM ja DW süsteemid Loeng 5 DW concepts. Enn Õunapuu

Chapter 4, Data Warehouse and OLAP Operations

Adnan YAZICI Computer Engineering Department

Advanced Data Management Technologies Written Exam

SAP CERTIFIED APPLICATION ASSOCIATE - SAP HANA 2.0 (SPS01)

1. Attempt any two of the following: 10 a. State and justify the characteristics of a Data Warehouse with suitable examples.

Data Modeling and Databases Ch 7: Schemas. Gustavo Alonso, Ce Zhang Systems Group Department of Computer Science ETH Zürich

DATA MINING TRANSACTION

Data Warehouse and Data Mining

Answer All Questions. All Questions Carry Equal Marks. Time: 20 Min. Marks: 10.

Time: 3 hours. Full Marks: 70. The figures in the margin indicate full marks. Answers from all the Groups as directed. Group A.

Data Warehousing and Decision Support

Data Warehousing & Mining. Data integration. OLTP versus OLAP. CPS 116 Introduction to Database Systems

Dta Mining and Data Warehousing

REPORTING AND QUERY TOOLS AND APPLICATIONS

Data Warehouse and Data Mining

Data mining - detailed outline. Carnegie Mellon Univ. Dept. of Computer Science /615 DB Applications. Problem.

Course no: CSC- 451 Full Marks: Credit hours: 3 Pass Marks: Nature of course: Theory (3 Hrs.) + Lab (3 Hrs.)

Information Management course

Interdepartmental Programme of Postgraduate Studies in Information Systems, University of Macedonia MASTER THESIS

CS 1655 / Spring 2013! Secure Data Management and Web Applications

Data Warehousing and Data Mining. Announcements (December 1) Data integration. CPS 116 Introduction to Database Systems

Data Warehousing & OLAP

Oracle Database 11g: Data Warehousing Fundamentals

CHAPTER 8: ONLINE ANALYTICAL PROCESSING(OLAP)

A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective

Data Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1394

Data Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1396

Basics of Dimensional Modeling

CS377: Database Systems Data Warehouse and Data Mining. Li Xiong Department of Mathematics and Computer Science Emory University

CS614 - Data Warehousing - Midterm Papers Solved MCQ(S) (1 TO 22 Lectures)

MSBI( SSAS, SSIS, SSRS) Course Content:35-40hours

Question Bank. 4) It is the source of information later delivered to data marts.

Data Warehousing and Decision Support

Transcription:

Lectures for the course: Data Warehousing and Data Mining (IT 60107) Week 1 Lecture 1 21/07/2011 Introduction to the course Pre-requisite Expectations Evaluation Guideline Term Paper and Term Project Guideline Week 2 Lecture 2 25/07/2011 What is a Data warehouse Properties of a Data warehouse Lecture 3 28/07/2011 Data Warehousing defined Data warehouse architecture Difference between OTLP and OLAP Need for a separate database Week 3 Lecture 4 01/08/2011 Multi-dimensional data model Data cube as a lattice of cuboids Number of cuboids in a data cube Dimension hierarchy Lecture 5+6 03/08/2011 OLAP operations Slicing, Dicing, Roll-up and Drill-down Roll-up and Drill-down in the presence of dimension hierarchy Term Paper team formation Term paper preparation explained

Getting data from relational database tables for generating multi-dimensional view Lecture 7 04/08/2011 Roll up and drill down using relational database tables Tables for representing dimension hierarchy Materialized views. Week 4 Lecture 8 08/08/2011 Formal definition of data cube as a lattice Correspondence between data cube, multi-dimensional view and group by queries ROLAP, MOLAP and HOLAP servers Lecture 9+10 10/08/2011 Lattice in the presence of dimension hierarchy Greedy algorithm for view materialization Effect of greedy choice on optimality Lecture 11 11/08/2011 Multi-way array aggregation Week 5 Lecture 12+13 24/08/2011 Class Test held Lecture 14 25/08/2011 Dimensional modeling introduction Star schema Basic retail sales fact and dimension Snowflake schema Steps in dimensional modeling Lecture 15 26/08/2011 Retail sales business process Fact constellation schema

Granularity at the level of POS transactions Promotion dimension Degenerate dimension Sample SQLs for getting information out of star schema Lecture 16 27/08/2011 Factless fact table promotion coverage Additivity of facts Additive, semi-additive and non-additive facts Discussion on date dimension Extensibility of star schema addition of new dimensions without changing granularity Time dimension, clerk dimension and customer dimension Week 6 Lecture 17 30/08/2011 Class Test scripts shown and feedback given Lecture 18 01/09/2011 Extensibility of star schema change in granularity, addition of new attributes in dimension tables, addition of new facts Inventory management business process Periodic snapshot schema Semi additive facts Periodic snapshot fact table vs. retail sales fact table at the same granularity level Week 7 Lecture 19 05/09/2011 Further facts in periodic snapshot schema Inventory transactions schema Similarity with banking operations Lecture 20+21 07/09/2011 Inventory accumulating snapshot Handling slowly changing dimensions Type 1, Type 2 and Type 3 changes

Lecture 22 08/09/2011 Hybrid methods for handling changes Predictable and unpredictable changes Week 8 Lecture 23 12/09/2011 Rapidly changing dimensions Mini dimensions Dimension outriggers Bitmap indexing Lecture 24+25 14/09/2011 Join indexing Summary of topics covered under Data Warehousing Introduction to Data Mining Data Mining and SQL Data Mining and Machine Learning Introduction to Association Rule Mining Support and Confidence Lecture 26 15/09/2011 Support and Confidence Itemsets Properties of frequent itemsets closure properties Minimum support and confidence A priori algorithm Week 9 Lecture 27 19/09/2011 Generating association rules from large itemsets Dynamic itemset counting algorithm Lecture 28 21/09/2011 Dynamic itemset counting contd.. Generating association rules from the large itemsets obtained using DIC Partitioning algorithm

Week 10 Lecture 29 10/10/2011 Mining frequent patterns without candidate generation FP-Tree Construction Mining FP Trees Lecture 30+31 12/10/2011 FP-Tree Construction - revisited Mining FP Trees revisited Examples worked out from Paper and Book Mid-sem scripts shown and feedback given Lecture 32 13/10/2011 Sequential pattern mining Sequence, Maximal sequence, large customer sequence Week 11 Lecture 33 17/10/2011 Sequential pattern mining Phases in sequential pattern mining Lecture 34+35 19/10/2011 Sequential pattern mining Maximal Phase Example worked out by students Lecture 36 20/10/2011 Introduction to Clustering Hierarchical and Partitioning approaches Definition of centroid and error measure K-means clustering Week 12 Lecture 37 24/10/2011

K-mediod clustering PAM CLARA Lecture 38 27/10/2011 CLARANS Week 13 Lecture 39 31/10/2011 Problem on CLARANS worked out Introduction to hierarchical clustering Agglomerative hierarchical clustering Dendrogram Distance metrics Inter-cluster distance measures Lecture 40+41 02/11/2011 Agglomerative hierarchical clustering Algorithm CF Vector CF Tree Lecture 42 03/10/2011 Insertion in CF Tree BIRCH Introduction to Classification Week 14 Lecture 43+44 09/11/2011 Quantifying classifier performance MLP as classifier Back Propagation algorithm Introduction to Decision trees Lecture 45+46 10/11/2011 Decision tree construction algorithm Problems on decision tree, MLP, BIRCH

Week 15 Lecture 47 16/11/2011 Clarification on all topics covered during the semester Summary of topics covered in the course