Data Warehousing & OLAP
|
|
- Tamsin Barrett
- 5 years ago
- Views:
Transcription
1 CMPUT 391 Database Management Systems Data Warehousing & OLAP Textbook: (first edition: ) Based on slides by Lewis, Bernstein and Kifer and other sources University of Alberta 1
2 Why Data Warehouses Businesses have a lot of data, operational data and facts, stored in heterogeneous and distributed databases. in different databases in different physical locations in different formats Decision makers need fast access to this information in a summarized form, with a focus often on historical data University of Alberta 2
3 What Is Data Warehouse? consolidates the information from different data sources, enabling OLAP (online analytical processing), to help decision support. is maintained separately from an operational database (which is used for OLTP online transaction processing). Option 1: Consolidate Data Marts Corporate Data Warehouse Data Mart Data Mart Data Mart Data Mart Option 2: Build from scratch Corporate data University of Alberta 3
4 OLTP Compared With OLAP On Line Transaction Processing -- OLTP Maintain a database that is an accurate model of some real-world enterprise Short simple transactions Relatively frequent updates Transactions access only a small fraction of the database On Line Analytic Processing -- OLAP Use information in database to guide strategic decisions Complex aggregation queries Infrequent updates Transactions access a large fraction of the database University of Alberta 4
5 Why Do We Separate DWs From Operational DBs? Performance reasons: OLAP necessitates special data organization that supports multidimensional views. OLAP queries would degrade operational DB. OLAP is read only. No concurrency control and recovery. Decision support requires historical data. Decision support requires consolidated data. University of Alberta 5
6 Fact Tables Many OLAP applications are based on a fact table For example, a supermarket application might be based on a table Sales (Market_Id, Product_Id, Time_Id, Sales_Amt) The table can be viewed as a multidimensional data cube The first three columns are the dimensions representing specific supermarkets products time intervals The fourth column, the Sales_Amt, is a function of the other three, called a measure University of Alberta 6
7 Dimension Tables The dimensions of the fact table can be further described with dimension tables Fact table Sales (Market_id, Product_Id, Time_Id, Sales_Amt) Dimension Tables Market (Market_Id, City, Province, Region) Product (Product_Id, Name, Category, Price) Time (Time_Id, Week, Month, Quarter) University of Alberta 7
8 Star Schema The fact and dimension relations can be displayed in an E-R diagram, which suggests a star and is called a star schema University of Alberta 8
9 Table View of a Star Schema Time TimeId Day Month Year Store StoreID City Province Country Region Sales Fact Table Time Product Store Customer unit_sales dollar_sales Product ProductNo ProdName ProdDesc Category Cust CustId CustName CustCity CustCountry Two different measures (Source: JH) University of Alberta 9
10 Aggregation Many OLAP queries involve aggregation of the data in the fact table For example, to find the total sales (over time) of each product in each market, we might use SELECT S.Market_Id, S.Product_Id, SUM (S.Sales_Amt) FROM Sales S GROUP BY S.Market_Id, S.Product_Id The aggregation is over the entire time dimension and thus produces a two-dimensional view of the data University of Alberta 10
11 Aggregation over Time The output of the previous query Product_Id Market_Id M1 M2 M3 M4 SUM(Sales_Amt) P P P P P5 University of Alberta 11
12 Concept-Hierarchies Many dimensions form an aggregation hierarchy (total or partial orders) Examples: Markets(Market_Id City Province Country Region) Time(year quarter week month day) University of Alberta 12
13 Drilling Down and Rolling Up Executing a series of queries that moves down a hierarchy (e.g., from aggregation over regions to that over provinces) is called drilling down Requires the use of the fact table or information more specific than the requested aggregation (e.g., cities) Executing a series of queries that moves up the hierarchy (e.g., from provinces to regions) is called rolling up Note: In a rollup, coarser aggregations can be computed using prior queries for finer aggregations University of Alberta 13
14 Drilling Down Drilling down on market: from Region to Province Sales (Market_Id, Product_Id, Time_Id, Sales_Amt) Market (Market_Id, City, Province, Region) 1. SELECT S.Product_Id, M.Region, SUM (S.Sales_Amt) FROM Sales S, Market M WHERE M.Market_Id = S.Market_Id GROUP BY S.Product_Id, M.Region 2. SELECT S.Product_Id, M.Province, SUM (S.Sales_Amt) FROM Sales S, Market M WHERE M.Market_Id = S.Market_Id GROUP BY S.Product_Id, M.Province, University of Alberta 14
15 Rolling Up Rolling up on market, from Province to Region If we have already created a table, Province_Sales, using 1. SELECT S.Product_Id, M.Province, SUM (S.Sales_Amt) INTO FROM WHERE Province_Sales Sales S, Market M M.Market_Id = S.Market_Id GROUP BY S.Product_Id, M.Province then we can roll up from there to: 2. SELECT T.Product_Id, M.Region, SUM (T.Sales_Amt) FROM Province_Sales T, Market M WHERE M.Province = T.Province GROUP BY T.Product_Id, M.Region University of Alberta 15
16 Pivoting When we view the data as a multi-dimensional cube and group on a subset of the axes, we are said to be performing a pivot on those axes Pivoting on dimensions D 1,,D k in a data cube D 1,,D k,d k+1,,d n means that we use GROUP BY A 1,,A k and aggregate over A k+1, A n, where A i is an attribute of the dimension D i Example: Pivoting on Product and Time corresponds to grouping on Product_id and Quarter and aggregating Sales_Amt over Market_id: SELECT S.Product_Id, T.Quarter, SUM (S.Sales_Amt) FROM Sales S, Time T WHERE T.Time_Id = S.Time_Id GROUP BY S.Product_Id, T.Quarter Pivot University of Alberta 16
17 Dicing When we use GROUP BY to specify part of a hierarchy, we are performing a range selection called a dice Dicing Sales in the time dimension: total sales for each product in each quarter. SELECT FROM WHERE GROUP BY S.Product_Id, T.Quarter, SUM (Sales_Amt) Sales S, Time T T.Time_Id = S.Time_Id T.Quarter, S.Product_Id Dice University of Alberta 17
18 Slicing When we use WHERE to specify a particular value for an axis (or several axes), we are performing a slice Slicing the data cube in the Time dimension (choosing sales only in week 12) then pivoting to Product_id (aggregating over Market_id) SELECT FROM WHERE S.Product_Id, SUM (Sales_Amt) Sales S, Time T T.Time_Id = S.Time_Id AND T.Week = Wk-12 GROUP BY S. Product_Id Slice University of Alberta 18
19 Slicing-and-Dicing Typically slicing and dicing involves several queries to find the right slice. For instance, change the slice and the axes: Slicing on Time and Market dimensions then pivoting to Product_id and Week (in the time dimension) SELECT S.Product_Id, T.Week, SUM (Sales_Amt) FROM Sales S, Time T WHERE T.Time_Id = S.Time_Id AND T.Quarter = 4 AND S.Market_id = M1 GROUP BY S.Product_Id, T.Week Pivot Slice University of Alberta 19
20 The extended Multi-dimensional Data Cube/Fact Table City Edmonton Calgary Lethbridge Sum Year Sum All Years Drama, Edmonton Drama Comedy... Category Sum Contains all possible aggregates in addition to the facts in the fact table University of Alberta 20
21 Product_Id The CUBE Operator To construct the following table, would take 4 queries (next slide) Market_Id M1 M2 M3 Total SUM(Sales_Amt) P P P P Total University of Alberta 21
22 The Four Queries For the table entries, without the totals (aggregation on time) SELECT S.Market_Id, S.Product_Id, SUM (S.Sales_Amt) FROM Sales S GROUP BY S.Market_Id, S.Product_Id For the row totals (aggregation on time and supermarkets) SELECT S.Product_Id, SUM (S.Sales_Amt) FROM Sales S GROUP BY S.Product_Id For the column totals (aggregation on time and products) SELECT S.Market_Id, SUM (S.Sales) FROM Sales S GROUP BY S.Market_Id For global total: SELECT SUM (S.Sales) FROM Sales S University of Alberta 22
23 Definition of the CUBE Operator Doing these four queries is wasteful The first does much of the work of the other three: if we could save that result and aggregate over Market_Id and Product_Id, we could compute the other queries more efficiently The CUBE clause is part of SQL:1999 GROUP BY CUBE(v1, v2,, vn) Equivalent to a collection of GROUP BYs, one for each of the 2 n subsets of v1, v2,, vn University of Alberta 23
24 Example of CUBE Operator The following query returns all the information needed to obtain the previous products/markets table: SELECT S.Market_Id, S.Product_Id, SUM (S.Sales_Amt) FROM Sales S GROUP BY CUBE (S.Market_Id, S.Product_Id) University of Alberta 24
25 ROLLUP ROLLUPis similar to CUBE except that instead of aggregating over all subsets of the arguments, it creates subsets moving from right to left GROUP BY ROLLUP (A 1,A 2,,A n ) is a series of these aggregations: GROUP BY A 1,, A n-1,a n GROUP BY A 1,, A n-1 GROUP BY A 1, A 2 GROUP BY A 1 No GROUP BY ROLLUPis also in SQL:1999 University of Alberta 25
26 Example of ROLLUP Operator SELECT S.Market_Id, S.Product_Id, SUM (S.Sales_Amt) FROM Sales S GROUP BY ROLLUP (S.Market_Id, S. Product_Id) first aggregates with the finest granularity: GROUP BY S.Market_Id, S.Product_Id then with the next level of granularity: GROUP BY S.Market_Id then the grand total is computed with no GROUP BY clause University of Alberta 26
27 ROLLUP vs. CUBE The same query with CUBE: - first aggregates with the finest granularity: GROUP BY S.Market_Id, S.Product_Id - then with the next level of granularity (both subsets): GROUP BY GROUP BY S.Market_Id S.Product_Id - then the grand total with no GROUP BY University of Alberta 27
28 Materialized Views The CUBE operator is often used to precompute aggregations on all dimensions of a fact table and then save them as a materialized views to speed up future queries University of Alberta 28
29 ROLAP and MOLAP Relational OLAP: ROLAP OLAP data is stored in a relational database as previously described. Data cube is a way to think about a fact table. Multidimensional OLAP: MOLAP Vendor provides an OLAP server that implements a fact table as a data cube using some multi-dimensional (non-relational) implementation. provide proprietary, perhaps visual, languages that allow unsophisticated users to make queries that involve pivots, drilling down, or rolling up University of Alberta 29
30 Implementation Issues OLAP applications are characterized by a very large amount of data that is relatively static, with infrequent updates Thus, various aggregations can be precomputed and stored in the database Star joins, join indices, and bitmap indices can be used to improve efficiency Since updates are infrequent, the inefficiencies associated with updates are minimized University of Alberta 30
Data Warehousing and Decision Support. Introduction. Three Complementary Trends. [R&G] Chapter 23, Part A
Data Warehousing and Decision Support [R&G] Chapter 23, Part A CS 432 1 Introduction Increasingly, organizations are analyzing current and historical data to identify useful patterns and support business
More informationCOMP33111 Lecture 3. Plan. Aims COMP33111, 2012/ Introduction to data analytics and on-line analytical processing (OLAP)
COMP33111 Lecture 3 Introduction to data analytics and on-line analytical processing (OLAP) Goran Nenadic School of Computer Science 1 Plan Lecture today: Data analytics and OLAP Tutorial 3: Understanding
More informationData Warehousing and Decision Support
Data Warehousing and Decision Support Chapter 23, Part A Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke 1 Introduction Increasingly, organizations are analyzing current and historical
More informationData Warehousing and Decision Support
Data Warehousing and Decision Support [R&G] Chapter 23, Part A CS 4320 1 Introduction Increasingly, organizations are analyzing current and historical data to identify useful patterns and support business
More informationSummary of Last Chapter. Course Content. Chapter 2 Objectives. Data Warehouse and OLAP Outline. Incentive for a Data Warehouse
Principles of Knowledge Discovery in bases Fall 1999 Chapter 2: Warehousing and Dr. Osmar R. Zaïane University of Alberta Dr. Osmar R. Zaïane, 1999 Principles of Knowledge Discovery in bases University
More informationData Warehouses. Yanlei Diao. Slides Courtesy of R. Ramakrishnan and J. Gehrke
Data Warehouses Yanlei Diao Slides Courtesy of R. Ramakrishnan and J. Gehrke Introduction v In the late 80s and early 90s, companies began to use their DBMSs for complex, interactive, exploratory analysis
More informationAn Overview of Data Warehousing and OLAP Technology
An Overview of Data Warehousing and OLAP Technology CMPT 843 Karanjit Singh Tiwana 1 Intro and Architecture 2 What is Data Warehouse? Subject-oriented, integrated, time varying, non-volatile collection
More informationData Mining Concepts & Techniques
Data Mining Concepts & Techniques Lecture No. 01 Databases, Data warehouse Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology Jamshoro
More informationCSE 544 Principles of Database Management Systems. Alvin Cheung Fall 2015 Lecture 8 - Data Warehousing and Column Stores
CSE 544 Principles of Database Management Systems Alvin Cheung Fall 2015 Lecture 8 - Data Warehousing and Column Stores Announcements Shumo office hours change See website for details HW2 due next Thurs
More informationData warehouses Decision support The multidimensional model OLAP queries
Data warehouses Decision support The multidimensional model OLAP queries Traditional DBMSs are used by organizations for maintaining data to record day to day operations On-line Transaction Processing
More informationSyllabus. Syllabus. Motivation Decision Support. Syllabus
Presentation: Sophia Discussion: Tianyu Metadata Requirements and Conclusion 3 4 Decision Support Decision Making: Everyday, Everywhere Decision Support System: a class of computerized information systems
More informationCSPP 53017: Data Warehousing Winter 2013! Lecture 7! Svetlozar Nestorov! Class News!
CSPP 53017: Data Warehousing Winter 2013! Lecture 7! Svetlozar Nestorov! Class News! Make-up class on Saturday, Mar 9 in Gleacher 203 10:30am 1:30pm.! Last 15 minute in-class quiz (6:30pm) on Mar 5.! Covers
More informationData Warehousing and Decision Support (mostly using Relational Databases) CS634 Class 20
Data Warehousing and Decision Support (mostly using Relational Databases) CS634 Class 20 Slides based on Database Management Systems 3 rd ed, Ramakrishnan and Gehrke, Chapter 25 Introduction Increasingly,
More informationDecision Support Systems aka Analytical Systems
Decision Support Systems aka Analytical Systems Decision Support Systems Systems that are used to transform data into information, to manage the organization: OLAP vs OLTP OLTP vs OLAP Transactions Analysis
More informationData Warehouse and Data Mining
Data Warehouse and Data Mining Lecture No. 04-06 Data Warehouse Architecture Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology
More informationIntroduction to Data Warehousing
ICS 321 Spring 2012 Introduction to Data Warehousing Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 4/23/2012 Lipyeow Lim -- University of Hawaii at Manoa
More informationChapter 18: Data Analysis and Mining
Chapter 18: Data Analysis and Mining Database System Concepts See www.db-book.com for conditions on re-use Chapter 18: Data Analysis and Mining Decision Support Systems Data Analysis and OLAP 18.2 Decision
More informationDatabase design View Access patterns Need for separate data warehouse:- A multidimensional data model:-
UNIT III: Data Warehouse and OLAP Technology: An Overview : What Is a Data Warehouse? A Multidimensional Data Model, Data Warehouse Architecture, Data Warehouse Implementation, From Data Warehousing to
More informationCHAPTER 8 DECISION SUPPORT V2 ADVANCED DATABASE SYSTEMS. Assist. Prof. Dr. Volkan TUNALI
CHAPTER 8 DECISION SUPPORT V2 ADVANCED DATABASE SYSTEMS Assist. Prof. Dr. Volkan TUNALI Topics 2 Business Intelligence (BI) Decision Support System (DSS) Data Warehouse Online Analytical Processing (OLAP)
More informationCHAPTER 8: ONLINE ANALYTICAL PROCESSING(OLAP)
CHAPTER 8: ONLINE ANALYTICAL PROCESSING(OLAP) INTRODUCTION A dimension is an attribute within a multidimensional model consisting of a list of values (called members). A fact is defined by a combination
More informationCSE 544 Principles of Database Management Systems. Fall 2016 Lecture 14 - Data Warehousing and Column Stores
CSE 544 Principles of Database Management Systems Fall 2016 Lecture 14 - Data Warehousing and Column Stores References Data Cube: A Relational Aggregation Operator Generalizing Group By, Cross-Tab, and
More informationData Warehousing and OLAP
Data Warehousing and OLAP INFO 330 Slides courtesy of Mirek Riedewald Motivation Large retailer Several databases: inventory, personnel, sales etc. High volume of updates Management requirements Efficient
More informationData Warehouses and OLAP. Database and Information Systems. Data Warehouses and OLAP. Data Warehouses and OLAP
Database and Information Systems 11. Deductive Databases 12. Data Warehouses and OLAP 13. Index Structures for Similarity Queries 14. Data Mining 15. Semi-Structured Data 16. Document Retrieval 17. Web
More informationBasics of Dimensional Modeling
Basics of Dimensional Modeling Data warehouse and OLAP tools are based on a dimensional data model. A dimensional model is based on dimensions, facts, cubes, and schemas such as star and snowflake. Dimension
More informationDecision Support. Chapter 25. CS 286, UC Berkeley, Spring 2007, R. Ramakrishnan 1
Decision Support Chapter 25 CS 286, UC Berkeley, Spring 2007, R. Ramakrishnan 1 Introduction Increasingly, organizations are analyzing current and historical data to identify useful patterns and support
More informationData Warehousing 2. ICS 421 Spring Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa
ICS 421 Spring 2010 Data Warehousing 2 Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 3/30/2010 Lipyeow Lim -- University of Hawaii at Manoa 1 Data Warehousing
More informationOLAP2 outline. Multi Dimensional Data Model. A Sample Data Cube
OLAP2 outline Multi Dimensional Data Model Need for Multi Dimensional Analysis OLAP Operators Data Cube Demonstration Using SQL Multi Dimensional Data Model Multi dimensional analysis is a popular approach
More informationcollection of data that is used primarily in organizational decision making.
Data Warehousing A data warehouse is a special purpose database. Classic databases are generally used to model some enterprise. Most often they are used to support transactions, a process that is referred
More informationMIS2502: Data Analytics Dimensional Data Modeling. Jing Gong
MIS2502: Data Analytics Dimensional Data Modeling Jing Gong gong@temple.edu http://community.mis.temple.edu/gong Where we are Now we re here Data entry Transactional Database Data extraction Analytical
More informationCS 1655 / Spring 2013! Secure Data Management and Web Applications
CS 1655 / Spring 2013 Secure Data Management and Web Applications 03 Data Warehousing Alexandros Labrinidis University of Pittsburgh What is a Data Warehouse A data warehouse: archives information gathered
More informationDATA WAREHOUSE EGCO321 DATABASE SYSTEMS KANAT POOLSAWASD DEPARTMENT OF COMPUTER ENGINEERING MAHIDOL UNIVERSITY
DATA WAREHOUSE EGCO321 DATABASE SYSTEMS KANAT POOLSAWASD DEPARTMENT OF COMPUTER ENGINEERING MAHIDOL UNIVERSITY CHARACTERISTICS Data warehouse is a central repository for summarized and integrated data
More informationAnnouncements. Course Outline. CS/INFO 330 Data Warehousing and OLAP. Mirek Riedewald
CS/INFO 330 Data Warehousing and OLAP Mirek Riedewald mirek@cs.cornell.edu Announcements Don t forget to let me know about the demo sessions next Monday Who does not have a laptop for the demo? CS/INFO
More informationThis tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing.
About the Tutorial A data warehouse is constructed by integrating data from multiple heterogeneous sources. It supports analytical reporting, structured and/or ad hoc queries and decision making. This
More informationUnit 7: Basics in MS Power BI for Excel 2013 M7-5: OLAP
Unit 7: Basics in MS Power BI for Excel M7-5: OLAP Outline: Introduction Learning Objectives Content Exercise What is an OLAP Table Operations: Drill Down Operations: Roll Up Operations: Slice Operations:
More informationImproving the Performance of OLAP Queries Using Families of Statistics Trees
Improving the Performance of OLAP Queries Using Families of Statistics Trees Joachim Hammer Dept. of Computer and Information Science University of Florida Lixin Fu Dept. of Mathematical Sciences University
More informationETL and OLAP Systems
ETL and OLAP Systems Krzysztof Dembczyński Intelligent Decision Support Systems Laboratory (IDSS) Poznań University of Technology, Poland Software Development Technologies Master studies, first semester
More informationIT DATA WAREHOUSING AND DATA MINING UNIT-2 BUSINESS ANALYSIS
PART A 1. What are production reporting tools? Give examples. (May/June 2013) Production reporting tools will let companies generate regular operational reports or support high-volume batch jobs. Such
More informationData Warehouse and Data Mining
Data Warehouse and Data Mining Lecture No. 03 Architecture of DW Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology Jamshoro Basic
More informationAdnan YAZICI Computer Engineering Department
Data Warehouse Adnan YAZICI Computer Engineering Department Middle East Technical University, A.Yazici, 2010 Definition A data warehouse is a subject-oriented integrated time-variant nonvolatile collection
More informationREPORTING AND QUERY TOOLS AND APPLICATIONS
Tool Categories: REPORTING AND QUERY TOOLS AND APPLICATIONS There are five categories of decision support tools Reporting Managed query Executive information system OLAP Data Mining Reporting Tools Production
More informationCS 245: Database System Principles. Warehousing. Outline. What is a Warehouse? What is a Warehouse? Notes 13: Data Warehousing
Recall : Database System Principles Notes 3: Data Warehousing Three approaches to information integration: Federated databases did teaser Data warehousing next Mediation Hector Garcia-Molina (Some modifications
More informationEvolution of Database Systems
Evolution of Database Systems Krzysztof Dembczyński Intelligent Decision Support Systems Laboratory (IDSS) Poznań University of Technology, Poland Intelligent Decision Support Systems Master studies, second
More informationQUALITY MONITORING AND
BUSINESS INTELLIGENCE FOR CMS DATA QUALITY MONITORING AND DATA CERTIFICATION. Author: Daina Dirmaite Supervisor: Broen van Besien CERN&Vilnius University 2016/08/16 WHAT IS BI? Business intelligence is
More informationRocky Mountain Technology Ventures
Rocky Mountain Technology Ventures Comparing and Contrasting Online Analytical Processing (OLAP) and Online Transactional Processing (OLTP) Architectures 3/19/2006 Introduction One of the most important
More informationData Warehouses. Vera Goebel. Fall Department of Informatics, University of Oslo
Data Warehouses Vera Goebel Department of Informatics, University of Oslo Fall 2014 1! Warehousing: History & Economics 1990 IBM, Business Intelligence : process of collecting and analyzing 1993 Bill Inmon,
More informationCT75 DATA WAREHOUSING AND DATA MINING DEC 2015
Q.1 a. Briefly explain data granularity with the help of example Data Granularity: The single most important aspect and issue of the design of the data warehouse is the issue of granularity. It refers
More informationData Warehousing and OLAP Technologies for Decision-Making Process
Data Warehousing and OLAP Technologies for Decision-Making Process Hiren H Darji Asst. Prof in Anand Institute of Information Science,Anand Abstract Data warehousing and on-line analytical processing (OLAP)
More informationChapter 13 Business Intelligence and Data Warehouses The Need for Data Analysis Business Intelligence. Objectives
Chapter 13 Business Intelligence and Data Warehouses Objectives In this chapter, you will learn: How business intelligence is a comprehensive framework to support business decision making How operational
More informationDATA WAREHOUING UNIT I
BHARATHIDASAN ENGINEERING COLLEGE NATTRAMAPALLI DEPARTMENT OF COMPUTER SCIENCE SUB CODE & NAME: IT6702/DWDM DEPT: IT Staff Name : N.RAMESH DATA WAREHOUING UNIT I 1. Define data warehouse? NOV/DEC 2009
More informationFig 1.2: Relationship between DW, ODS and OLTP Systems
1.4 DATA WAREHOUSES Data warehousing is a process for assembling and managing data from various sources for the purpose of gaining a single detailed view of an enterprise. Although there are several definitions
More informationData Warehousing. Overview
Data Warehousing Overview Basic Definitions Normalization Entity Relationship Diagrams (ERDs) Normal Forms Many to Many relationships Warehouse Considerations Dimension Tables Fact Tables Star Schema Snowflake
More informationOn-Line Application Processing
On-Line Application Processing WAREHOUSING DATA CUBES DATA MINING 1 Overview Traditional database systems are tuned to many, small, simple queries. Some new applications use fewer, more time-consuming,
More informationLectures for the course: Data Warehousing and Data Mining (IT 60107)
Lectures for the course: Data Warehousing and Data Mining (IT 60107) Week 1 Lecture 1 21/07/2011 Introduction to the course Pre-requisite Expectations Evaluation Guideline Term Paper and Term Project Guideline
More informationDATA MINING TRANSACTION
DATA MINING Data Mining is the process of extracting patterns from data. Data mining is seen as an increasingly important tool by modern business to transform data into an informational advantage. It is
More informationIDU0010 ERP,CRM ja DW süsteemid Loeng 5 DW concepts. Enn Õunapuu
IDU0010 ERP,CRM ja DW süsteemid Loeng 5 DW concepts Enn Õunapuu enn.ounapuu@ttu.ee Content Oveall approach Dimensional model Tabular model Overall approach Data modeling is a discipline that has been practiced
More informationData Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1396
Data Mining Data warehousing Hamid Beigy Sharif University of Technology Fall 1396 Hamid Beigy (Sharif University of Technology) Data Mining Fall 1396 1 / 31 Table of contents 1 Introduction 2 Data warehousing
More informationData Mining. Data warehousing. Hamid Beigy. Sharif University of Technology. Fall 1394
Data Mining Data warehousing Hamid Beigy Sharif University of Technology Fall 1394 Hamid Beigy (Sharif University of Technology) Data Mining Fall 1394 1 / 31 Table of contents 1 Introduction 2 Data warehousing
More informationAcknowledgment. MTAT Data Mining. Week 7: Online Analytical Processing and Data Warehouses. Typical Data Analysis Process.
MTAT.03.183 Data Mining Week 7: Online Analytical Processing and Data Warehouses Marlon Dumas marlon.dumas ät ut. ee Acknowledgment This slide deck is a mashup of the following publicly available slide
More informationBUSINESS INTELLIGENCE. SSAS - SQL Server Analysis Services. Business Informatics Degree
BUSINESS INTELLIGENCE SSAS - SQL Server Analysis Services Business Informatics Degree 2 BI Architecture SSAS: SQL Server Analysis Services 3 It is both an OLAP Server and a Data Mining Server Distinct
More informationOn-Line Analytical Processing (OLAP) Traditional OLTP
On-Line Analytical Processing (OLAP) CSE 6331 / CSE 6362 Data Mining Fall 1999 Diane J. Cook Traditional OLTP DBMS used for on-line transaction processing (OLTP) order entry: pull up order xx-yy-zz and
More informationData Mining & Data Warehouse
Data Mining & Data Warehouse Associate Professor Dr. Raed Ibraheem Hamed University of Human Development, College of Science and Technology (1) 2016 2017 1 Points to Cover Why Do We Need Data Warehouses?
More informationSql Fact Constellation Schema In Data Warehouse With Example
Sql Fact Constellation Schema In Data Warehouse With Example Data Warehouse OLAP - Learn Data Warehouse in simple and easy steps using Multidimensional OLAP (MOLAP), Hybrid OLAP (HOLAP), Specialized SQL
More informationA Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective
A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective B.Manivannan Research Scholar, Dept. Computer Science, Dravidian University, Kuppam, Andhra Pradesh, India
More informationA Multi-Dimensional Data Model
A Multi-Dimensional Data Model A Data Warehouse is based on a Multidimensional data model which views data in the form of a data cube A data cube, such as sales, allows data to be modeled and viewed in
More informationInformation Management course
Università degli Studi di Milano Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 05(b) : 23/10/2012 Data Mining: Concepts and Techniques (3 rd ed.) Chapter
More informationChapter 4, Data Warehouse and OLAP Operations
CSI 4352, Introduction to Data Mining Chapter 4, Data Warehouse and OLAP Operations Young-Rae Cho Associate Professor Department of Computer Science Baylor University CSI 4352, Introduction to Data Mining
More informationData Warehousing & OLAP
Data Warehousing & OLAP Data Mining: Concepts and Techniques Chapter 3 Jiawei Han and An Introduction to Database Systems C.J.Date, Eighth Eddition, Addidon Wesley, 4 1 What is Data Warehousing? What is
More informationDta Mining and Data Warehousing
CSCI6405 Fall 2003 Dta Mining and Data Warehousing Instructor: Qigang Gao, Office: CS219, Tel:494-3356, Email: q.gao@dal.ca Teaching Assistant: Christopher Jordan, Email: cjordan@cs.dal.ca Office Hours:
More informationOverview. Introduction to Data Warehousing and Business Intelligence. BI Is Important. What is Business Intelligence (BI)?
Introduction to Data Warehousing and Business Intelligence Overview Why Business Intelligence? Data analysis problems Data Warehouse (DW) introduction A tour of the coming DW lectures DW Applications Loosely
More informationWhat is a Data Warehouse?
What is a Data Warehouse? COMP 465 Data Mining Data Warehousing Slides Adapted From : Jiawei Han, Micheline Kamber & Jian Pei Data Mining: Concepts and Techniques, 3 rd ed. Defined in many different ways,
More informationOLAP Introduction and Overview
1 CHAPTER 1 OLAP Introduction and Overview What Is OLAP? 1 Data Storage and Access 1 Benefits of OLAP 2 What Is a Cube? 2 Understanding the Cube Structure 3 What Is SAS OLAP Server? 3 About Cube Metadata
More informationDecision Support, Data Warehousing, and OLAP
Decision Support, Data Warehousing, and OLAP : Contents Terminology : OLAP vs. OLTP Data Warehousing Architecture Technologies References 1 Decision Support and OLAP Information technology to help knowledge
More informationAdvanced Data Management Technologies
ADMT 2017/18 Unit 10 J. Gamper 1/37 Advanced Data Management Technologies Unit 10 SQL GROUP BY Extensions J. Gamper Free University of Bozen-Bolzano Faculty of Computer Science IDSE Acknowledgements: I
More informationCS614 - Data Warehousing - Midterm Papers Solved MCQ(S) (1 TO 22 Lectures)
CS614- Data Warehousing Solved MCQ(S) From Midterm Papers (1 TO 22 Lectures) BY Arslan Arshad Nov 21,2016 BS110401050 BS110401050@vu.edu.pk Arslan.arshad01@gmail.com AKMP01 CS614 - Data Warehousing - Midterm
More informationAggregating Knowledge in a Data Warehouse and Multidimensional Analysis
Aggregating Knowledge in a Data Warehouse and Multidimensional Analysis Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com Objectives Explain the basics of: 1. Data
More informationData Warehousing & On-Line Analytical Processing
Data Warehousing & On-Line Analytical Processing Erwin M. Bakker & Stefan Manegold https://homepages.cwi.nl/~manegold/dbdm/ http://liacs.leidenuniv.nl/~bakkerem2/dbdm/ s.manegold@liacs.leidenuniv.nl e.m.bakker@liacs.leidenuniv.nl
More informationData Warehouses Chapter 12. Class 10: Data Warehouses 1
Data Warehouses Chapter 12 Class 10: Data Warehouses 1 OLTP vs OLAP Operational Database: a database designed to support the day today transactions of an organization Data Warehouse: historical data is
More informationData Warehousing & Mining. Data integration. OLTP versus OLAP. CPS 116 Introduction to Database Systems
Data Warehousing & Mining CPS 116 Introduction to Database Systems Data integration 2 Data resides in many distributed, heterogeneous OLTP (On-Line Transaction Processing) sources Sales, inventory, customer,
More informationDeccansoft Software Services Microsoft Silver Learning Partner. SSAS Syllabus
Overview: Analysis Services enables you to analyze large quantities of data. With it, you can design, create, and manage multidimensional structures that contain detail and aggregated data from multiple
More informationWarehousing. Data Mining
On Line Application Processing Warehousing Data Cubes Data Mining 1 Overview Traditional database systems are tuned to many, small, simple queries. Some new applications use fewer, more timeconsuming,
More informationMIS2502: Data Analytics Dimensional Data Modeling. Jing Gong
MIS2502: Data Analytics Dimensional Data Modeling Jing Gong gong@temple.edu http://community.mis.temple.edu/gong Where we are Now we re here Data entry Transactional Database Data extraction Analytical
More informationCHAPTER 3 Implementation of Data warehouse in Data Mining
CHAPTER 3 Implementation of Data warehouse in Data Mining 3.1 Introduction to Data Warehousing A data warehouse is storage of convenient, consistent, complete and consolidated data, which is collected
More informationDATA MINING AND WAREHOUSING
DATA MINING AND WAREHOUSING Qno Question Answer 1 Define data warehouse? Data warehouse is a subject oriented, integrated, time-variant, and nonvolatile collection of data that supports management's decision-making
More informationMultidimensional Queries
Multidimensional Queries Krzysztof Dembczyński Intelligent Decision Support Systems Laboratory (IDSS) Poznań University of Technology, Poland Software Development Technologies Master studies, first semester
More information53 Multidimensional Databases and OLAP
53 Multidimensional Databases and OLAP Christian S. Jensen Aalborg University Torben Bach Pedersen Aalborg University 53.1 Introduction............................................ 53-1 53.2 Background............................................
More informationData Warehouse. Asst.Prof.Dr. Pattarachai Lalitrojwong
Data Warehouse Asst.Prof.Dr. Pattarachai Lalitrojwong Faculty of Information Technology King Mongkut s Institute of Technology Ladkrabang Bangkok 10520 pattarachai@it.kmitl.ac.th The Evolution of Data
More informationData Warehousing and Decision Support, part 2
Data Warehousing and Decision Support, part 2 CS634 Class 23, Apr 27, 2016 Slides based on Database Management Systems 3 rd ed, Ramakrishnan and Gehrke, Chapter 25 pid 11 12 13 pid timeid locid sales Multidimensional
More informationData Warehousing Conclusion. Esteban Zimányi Slides by Toon Calders
Data Warehousing Conclusion Esteban Zimányi ezimanyi@ulb.ac.be Slides by Toon Calders Motivation for the Course Database = a piece of software to handle data: Store, maintain, and query Most ideal system
More informationALTERNATE SCHEMA DIAGRAMMING METHODS DECISION SUPPORT SYSTEMS. CS121: Relational Databases Fall 2017 Lecture 22
ALTERNATE SCHEMA DIAGRAMMING METHODS DECISION SUPPORT SYSTEMS CS121: Relational Databases Fall 2017 Lecture 22 E-R Diagramming 2 E-R diagramming techniques used in book are similar to ones used in industry
More informationData Warehouses These slides are a modified version of the slides of the book Database System Concepts (Chapter 18), 5th Ed McGraw-Hill by
Data Warehouses These slides are a modified version of the slides of the book Database System Concepts (Chapter 18), 5th Ed., McGraw-Hill, by Silberschatz, Korth and Sudarshan. Original slides are available
More informationThis tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing.
About the Tutorial A data warehouse is constructed by integrating data from multiple heterogeneous sources. It supports analytical reporting, structured and/or ad hoc queries and decision making. This
More informationBig Data 13. Data Warehousing
Ghislain Fourny Big Data 13. Data Warehousing fotoreactor / 123RF Stock Photo The road to analytics Aurelio Scetta / 123RF Stock Photo Another history of data management (T. Hofmann) 1970s 2000s Age of
More informationAnalytical data bases Database lectures for math
Analytical data bases Database lectures for mathematics students May 14, 2017 Decision support systems From the perspective of the time span all decisions in the organization could be divided into three
More informationSQL Server Analysis Services
DataBase and Data Mining Group of DataBase and Data Mining Group of Database and data mining group, SQL Server 2005 Analysis Services SQL Server 2005 Analysis Services - 1 Analysis Services Database and
More information02 Hr/week. Theory Marks. Internal assessment. Avg. of 2 Tests
Course Code Course Name Teaching Scheme Credits Assigned Theory Practical Tutorial Theory Practical/Oral Tutorial Total TEITC504 Database Management Systems 04 Hr/week 02 Hr/week --- 04 01 --- 05 Examination
More informationData Warehouse and Data Mining
Data Warehouse and Data Mining Lecture No. 07 Terminologies Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology Jamshoro Database
More informationChapter 3. The Multidimensional Model: Basic Concepts. Introduction. The multidimensional model. The multidimensional model
Chapter 3 The Multidimensional Model: Basic Concepts Introduction Multidimensional Model Multidimensional concepts Star Schema Representation Conceptual modeling using ER, UML Conceptual modeling using
More informationData Warehouses for Decision Support
Data Warehouses for Decision Support Vera Goebel Department of Informatics, University of Oslo 2010 1 What and Why of Data Warehousing Database Database Datastore Database System Database System Data System
More informationMost database operations involve On- Line Transaction Processing (OTLP).
Data Warehouse 1 Data Warehouse Most common form of data integration. Copy data from one or more sources into a single DB (warehouse) Update: periodic reconstruction of the warehouse, perhaps overnight.
More informationData Mining. Associate Professor Dr. Raed Ibraheem Hamed. University of Human Development, College of Science and Technology
Data Mining Associate Professor Dr. Raed Ibraheem Hamed University of Human Development, College of Science and Technology (1) 2016 2017 Department of CS- DM - UHD 1 Points to Cover Why Do We Need Data
More information