DATA WAREHOUSE MANAGEMENT SYSTEM A CASE STUDY

Size: px
Start display at page:

Download "DATA WAREHOUSE MANAGEMENT SYSTEM A CASE STUDY"

Transcription

1 DATA WAREHOUSE MANAGEMENT SYSTEM A CASE STUDY DARKO KRULJ Trizon Group, Belgrade, Serbia and Montenegro. MILUTIN CUPIC MILAN MARTIC MILIJA SUKNOVIC Faculty of Organizational Science, University of Belgrade, Serbia and Montenegro Abstract In the last decade data warehouses have become the basis for support in business decision making. This paper will show our framework for the data warehouse management system as well as advantages and disadvantages of this approach in relation to the other data warehouse management systems. Many approaches to the development of data warehouse emphasize the importance of meta data yet none of them sticks to that principle. The basic characteristic of our approach is that it is fully based on XML meta data. We have also implemented significant changes in dimensional model based upon flexible dimensional hierarchies which enabled us to use special methods of creating aggregations and important advantages in data visualization. What makes this approach different is the concept of intelligent creation of aggregates based on observation of history of users queries and the use of modified data mining algorithm APRIORI. Keywords: data warehouse, meta data, dimensional modeling, data mining 1. INTRODUCTION Occurrence of the very large databases and diverting the focus of attention from collecting data to data processing caused development of data warehouse (DW). According to [1] DW is defined as overall consistence warehouse, got from different data sources, accessible to the final users, and in comprehensive way usable for work. Reaching consistence and overall of data in information systems is very complex task, especially when it is comes to big companies information systems. DW can be conceived as a possibility to transcend the problems of integrations of different information systems. In [6] Kimball gave general definition of DW as copy of transactional data specially designed for queries and analyze. In [5] Inmon defined DW as subject oriented, integrated, stable and non volatile collection of data special organized as support to businesses requests. According to this definition it can be concluded that the DW mainly represent process then specific kind of data base. DW represent technique which provides correct collecting and managing large amount of data from different sources, in the purpose of finding right answer for business tasks and support in making decisions which cannot be obtained from operational databases. Based on our experience in realization of dozens of projects of design and developing of DW we have developed our own framework, database management system for DW realization and appropriate tools for data analysis and data warehouse modeling. In this paper we will present advantages of our framework and software tools compared to Microsoft method. 2. FRAMEWORK

2 Our framework for DW realization is shown on Figure 1. Figure 1 Framework for DW realization The first stage of developing DW is defining requirements. In this stage the aim is to collect, as much as possible, information about the DW that has to be realized. Interviewing of all stake holders interested in development of DW (analysts, managers, IT departments) is of large importance. Next stage is feasibility study, in which the collected data from previous stage have to be analyzed for evaluation of the possibility to realize the DW. Often there is a need go back to the previous stage for solving some inconsistencies or to collect some more information about the realization. After collection of necessary information about the DW development project, there has to be found a suitable way of realization. If the evaluation of DW shows that it can be realized successfully, the possible ways of doing it could be: realization though DW based on materialized queries which we showed in [8], or development based on DW modeler and DW analyst represented in [10]. If the realization goes through DW modeler tools, next step is modeling of DW. After creation models have to be tested, and if they are valid, the application can be delivered to end users after the training. The basic difference between the suggested model above and the most frequently used frameworks presented in [12] is the stage in which the software revision of model is performed. There has to be pointed out that many implemented models, beside the basic purpose, can be significantly improved. In this stage, on the basis of historical users queries software for managing DW gives the suggestion for model improvement. Actual results of using improving models are shown in Table 2. The main criteria, on the basis of which the suggestions are made is frequency of dimension and dimension level usage. The experience has shown the existence of some dimensions and levels that are rarely in use. As we can see on Figure 2, our solution and Microsoft solution are the same in the first three steps: choosing of data sources, process of extraction, transformation, loading and creating of staging base. In the next few phase we will see the main difference: Microsoft solution are using Analysis server for modeling DW and MS Excel or MS Data Analyst for presenting data to clients, while our solution is using DW modeler application for modeling DW and DW analyst application for presentation multidimensional (MD) data.

3 Figure 2 Architecture of the our solution compared with the Microsoft architecture DW modeler and DW analyst applications completely rely on meta data which are located in XML formats, that provides easy manipulation of meta data. Each development approach provides ease of creation of DW, nevertheless, our approach gives several advantages. First advantage is usage of the FIRST and the LAST functions, which aren t provided in the other tools, made for developing of DW. In our opinion, limited number of functions for data aggregation is the biggest disadvantage for creating DW solutions. Precisely the main motivation for introduction of the new functions was this one. The second advantage is based on complete reliance on the XML meta data. Complete separation of the meta data provides more efficient conduction of changing structure of MD cube. Maybe the best example is that any kind of change made in structure of MD cube of Microsoft's solution would require new processing of it, even thou the change was connected to display data format, which didn't affected on context of data in MD cube at all. In our approach the changes, which have been made, are even more significant, such as changing the name of dimension or dimensional level that didn't require new processing of MD cube. By that it gains significant advantage in testing phase and adjusting of DW to users, the main reason for that is there is no waste of time on unnecessary processing any more, which are one of the main required operations in DW. In addition, our model is simpler and adaptable for comprehension of MD data models. Our approach is based on MD model that is suggested in [10] and is something that makes our model special and different from others. Usage of the MD model has fulfilled the essential assumption for creation of new algorithm for processing MD cube, which is based on history of user queries. The algorithm is, in fact, very similar to data mining algorithm APRIORI, which is according to [3] and [10] one of the most famous and mostly used data-mining algorithm. Differences are based on input data format, which uses frequencies for queries aggregation, and because of that all of candidates for aggregation are known in advance. Additionally frequencies of user queries can be pondered with average data size for the purpose of better space usage. Based on modified algorithm we could generate frequent candidates for creation of aggregation. Table 1 shows the comparison between our algorithm and algorithms that are mainly used for DW realization. Method of creation aggregation Common algorithm eg. (BUPS, BPS) Algorithm input Dimensions and cardinality Choosing an aggregate Based on optimal ratio between number of aggregates and memory space usage Result The greater number of aggregations at given limitations Algorithm asumptotion Each dimension has a same usage frequency

4 Method of creation aggregation Algorithm input Choosing an aggregate Result Algorithm asumptotion Algorithm based on query history Dimensions, cardinality and query history Based on usage frequency and average data size the lesser number of the more frequent aggregation that take up as little space as possible Each query has different appeara nce rate and after several created queries Table 1 Comparison of algorithms based on query history and the algorithms which are based on optimization ratio of number of aggregated data and necessary memory space. Applying of a proposed algorithm based on history of queries makes possible the improvement of developed DW by removing dimensions and dimension levels which are rarely used in queries. Concrete results are shown in table 2 on example of five developed DW. Data warehouse Rarely used dimension levels Frequency Total number of queries Aggregation Windows logs [7] Time(minute) Don t create Faculty of organizational science student data service [11] and [13] Statute(name) PlaceOfHabitation(Name) PlaceOfBirth(Name) Don t create Don t create Don t create FOREX [9] Time(Quartal) Time(Second) Don t create Financial accounting Merchandise Accounting Table 2 Improving realization of DW using a query history Based on a result from Table 2, we conclude that in three DWs (Windows Logs, FOS student data service and FOREX) there are some dimensions and dimensional levels that have been rarely used. Algorithms based on history of queries would not generate aggregation for those dimensional levels, whereas usual algorithms will start from cardinal dimensional levels and in many cases will generate exactly those aggregations which users wouldn't use in their analysis. E.g. In ten cases of processing MD cube based on Microsoft solution (with different requests for taking place in workspace and performances), dimension statute from DW for student data service of FOS was in the aggregations. According to the Table 2 it is easy to notice that in data processing that dimension aren't used at all. Table 3 represents basic results of algorithm based on history of users queries in relation to algorithms based on optimization of relationship between number of aggregation and needed memory space. Element Algorithms based on optimization of ratio between number of aggregation and memory space usage Number of aggregations Bigger Smaller Number of queries successfully resolved from aggregations Smaller Algorithms based on query history Bigger Memory space usage Bigger Smaller Choosing an aggregate For dimension levels with small cardinality For most frequent dimension levels

5 Precondition Nothing Query history Table 3 Comparison of algorithm performance Hereafter, there will be described application that we develop for the purpose of modeling DW and data analysis. 3. SOFTWARE SOLUTION Figure 3 shows simplified scheme DW Analyst application. Figure 3 Simplified scheme of DW Analyst application User makes graphic queries in DW analyst application; it is formed with help of object model represented in the Figure 4b and meta data which was represented on the right side of Figure 3. In object model represented on Figure 4a (Microsoft solution), main object for connecting user application and MD cube is Connection in which is defined location of the cube (client or server), name of the cube and appropriate way for assessing it. Connection object contains catalogue where definitions of the cube are placed. In cubes definitions there are collections for dimensions, hierarchies and levels. Object Cellset represent the query result and it consists of cells, axes, positions and members that are needed to be represented.

6 Figure 4a ADO MD object model according to [41] and 4b our object model from [10] Figure 4b presents an object model that is described in [10]. Primary objects in the model are MD cube, Connection, Fact table, dimension and aggregation. ADO MD model on which Microsoft's solution is based, and the other solutions which are using the same standard are more general, but semantically substantially poor. Basic advantages of the approach that we suggested in [10] are in the use of greater amount of objects which respond to composed elements of DW and usage of aggregations as a separated element, while in the Microsoft approach they implicitly consist of objects cellset and cells. ADO MD mainly serves as a model for proceeding and working out of queries, and also for taking-over and presentation of the results. Microsoft's solution is also using Microsoft Decision Support Objects (MS DSO), which has a task to manage the DM. Object model that was suggested in [10] involves good qualities of both of models implemented in original way. MS-DSO, ADO-MD and represented object model are conceptual and they contain all necessary elements of overall MD object model. Suggested model in [10] is logically clearer and much more simple for understandings well as concrete use. Advantage of the ADO MD model is that the MDX query language is implemented on its base, while the model suggested in the [10] still doesn't have precise formulation of MD query language. Query resolution in model [10] is performed with methods contained in the MD cube. Users interface is adapted to final users and it is very intuitive. Most of operations are performed with the help of wizard. Because of limited space we will represent only some of the most important screen forms. We will take FOREX DW as an example. Figure 5 shows wizard for creation of meta database for relations.

7 Figure 5 Wizard for creation of meta database for relations Figure 5 shows data model for realization of FOREX DW. Model is graphically created. Objects are dragged (drag & drop) from the list placed on the left side of the screen, and then dropped on the window for drawing on the right side of the screen. Dragging one object to another creates a connection between the objects, while doing it, it is necessary to select certain columns through which they will be connected. Figure 6 shows report from DW analyst application with complex filter structure based on hourly data. It is the report that shows paralleled movements of two cross rate foreign exchange course of March 28th in period from 10-16h. Figure 6. Example of report with functions FIRST and LAST and complex filter Some of the previous examples show simplicity and flexibility of DW modeler and DW analyst. It is necessary to mention that the DW for Foreign Exchange was implemented also on the Microsoft platform which is described in details in [9] and it is much more complicated versus to the given solution. 4. COMPARISON In this paper we represented DW modeler and DW analyst application. Tables 4, 5 and

8 6 show comparative characteristics of our solution compared to the Microsoft solution. Elements of technology Microsoft solution Our solution Simple cubes, More cubes in one base, virtual cubes SUPPORT SUPPORT Simple and calculated measurements SUPPORT SUPPORT Participation of the cube SUPPORT NOT SUPPORTED Processing of the cube SUPPORT SUPPORT Aggregations based upon first and last functions NOT SUPPORTED SUPPORT Automatic improving of the model NOT SUPPORTED SUPPORT Observation of substantial changes in meta data to avoid processing of MD cube NOT SUPPORTED SUPPORT OLAP operations SUPPORT SUPPORT Dimensional modeling SUPPORT SUPPORT Meta data SUPPORT SUPPORT ROLAP SUPPORT SUPPORT HOLAP SUPPORT NOT SUPPORTED MOLAP SUPPORT NOT SUPPORTED Table 4 Comparison of the main elements of Microsoft solution and our solution Characteristics Microsoft solution Our solution DW based upon materialized queries Algorithm for picking of the aggregate Modification of BUPS and BPS algorithms Dimensional modeling Strict dimensional hierarchies Security Procedural and declaration Introduction of new operators and functions for aggregating of data Algorithm based upon query history Flexible hierarchies Aggregates all queries from users application dimensional Don't dimensional modeling Based on virtual cubes and filtering of meta data use Same as OLTP application Unknown Practicable Not applicable Meta data Binary XML format Do not use Observation of substantial changes in meta data to avoid processing of MD cube Taking restricted action so it would not be create illogical queries Can not be performed It is not secured Helped with XML meta data Built-in model, by using concept of analysis constraint Table 5 Comparative analysis of Microsoft and our solution Do not use It is not secured Element Microsoft Our Solution Dimensions More time dimensions(strict One time hierarchy) dimension(flexible hierarchy) Measures For every dimension level in One set of measures FIRST

9 time dimensions, one set of redundant measures OPEN and CLOSE (because solution don t have FIRST and LAST function) and LAST(implemented in model) Possibility of meaningless Very emphasized Not possible(analysis queries constraint concept) Database size Larger(redundant measures Smaller(don t use and unnecessary redundant measures and aggregations) unnecessary aggregations) Improvement of model Not applicable Included in model Table 6 Comparative analysis of DW solution for FOREX in Microsoft technology and in our modeler According to Table 4, 5 and 6 it can be concluded that suggested solution has numerous advantages than Microsoft s has, first of all in the access to the development of the DW, processing algorithm, guiding of users through the data analysis procedure, usage of XML meta data, and many other advantages. The greatest disadvantages of the suggested solution is the lack of cube partitioning and limitation only to ROLAP solutions. 5. CONCLUSION This paper represents MD data model and suitable MD object model, and DW modeler and DW analyst tools. Certain advantages have been achieved by using special layer with meta data. Special importance is attributed to introduction of the approach which is based on analysis constraint with dimension levels and measurements that prevent user to create meaningless queries, that was disregarded in other methodologies. Applying of our approach, six DWs have been realized, by now. Comparing the characteristics and performances of realized DW against the developed ones in Microsoft platform, it can be concluded that represented solution shows many advantages (shown in tables 4, 5 and 6). The possibilities of improvement-represented application are significant and they can be performed in many different directions, like: 1) Development of data mining algorithm and it s implementation in MD model 2) Implementation of the dynamic security concept, for which the directions are given 3) Creating of the ETL tool which will unable the forming solution process to be more independent against some other tools and technologies 4) Improving the algorithm based on query history 5) Integration with the external systems 6) Other In last decade DWs have become one of the most important types of (DSS) for enterprises. But, there are still numerous problems in realization, such as first of all: problems based on data quality, defection of relation models, different systems integration problems, new function introduction, data types which can be used as measures, etc. This paper shows some solutions, which can simplify the development of DW, and it also points out to a different edge of DW development. We are certain that DW will still have intensive development, and that will soon appear a new generation of DW, which will be significantly integrated with data mining techniques, expert systems and other methodologies for knowledge detecting. BIBLIOGRAPHY [1] Barry.D. (1997) Data warehouse from architecture to implementation, Addison - Wesley, 1997;

10 [2] Craig R.S., Vivona J. A. i Bercovitch D, (1999), Microsoft data warehousing, Wiley; [3] Devedžić V., (2000), Inteligent information systems, Digit; [4] Group of authors, (1998), Microsoft SQL Server 7.0 data warehousing training kit, Microsoft Press; [5] Inmon, W.H., (1996), Building the data warehouse, New York, John Wiley & Sons; [6] Kimball R., (1996), The data warehouse toolkit: practical techniques for building dimensional data warehouse, John Willy & Sons; [7] Krulj D., Čupić M, Martić M i Suknović M., (2003), Design and implementation of data warehouse for windows logs analysis, INFOFEST, Budva, Serbia and Montenegro, 2003; [8] Krulj D., Čupić M., Suknović M. i Martić M., (2005), Data warehouse based on materialized queries, YUINFO, Kopaonik, Serbia and Montenegro, 2005; [9] Krulj D., Čupić M., Suknović M. i Martić M., (2005), Data warehouse for foreign exchange market, JISA, Herceg Novi, Serbia And Montenegro 2005; [10] Krulj D (2005) Design of decision support systems based od data warehouse, Phd Disertation, Faculty of Organizational Sciences, Belgrade, Serbia and Montenegro; [11] Krulj D., Suknović M., Čupić M., Martić M. i Vujnović T., (2002), Design and implementation of olap system for student data service of faculty of organizational sciences, INFOFEST, Budva, Serbia and Montenegro; [12] Krulj D., Cupic M., and Suknovic M. (2005), Managing projects of data warehouse development, JUPMA, Zlatibor, Serbia and Montenegro; [13] Suknović M., Čupić M., Martić M. i Krulj D., (2005), Data warehouse and data mining a case study, YUJOR, No. 15 Vol. I, pg

MOLAP Data Warehouse of a Software Products Servicing Call Center

MOLAP Data Warehouse of a Software Products Servicing Call Center MOLAP Data Warehouse of a Software Products Servicing Call Center Z. Kazi, B. Radulovic, D. Radovanovic and Lj. Kazi Technical faculty "Mihajlo Pupin" University of Novi Sad Complete Address: Technical

More information

DATAWAREHOUSING AND ETL PROCESSES: An Explanatory Research

DATAWAREHOUSING AND ETL PROCESSES: An Explanatory Research DATAWAREHOUSING AND ETL PROCESSES: An Explanatory Research Priyanshu Gupta ETL Software Developer United Health Group Abstract- In this paper, the author has focused on explaining Data Warehousing and

More information

CS614 - Data Warehousing - Midterm Papers Solved MCQ(S) (1 TO 22 Lectures)

CS614 - Data Warehousing - Midterm Papers Solved MCQ(S) (1 TO 22 Lectures) CS614- Data Warehousing Solved MCQ(S) From Midterm Papers (1 TO 22 Lectures) BY Arslan Arshad Nov 21,2016 BS110401050 BS110401050@vu.edu.pk Arslan.arshad01@gmail.com AKMP01 CS614 - Data Warehousing - Midterm

More information

Data Warehousing and OLAP Technologies for Decision-Making Process

Data Warehousing and OLAP Technologies for Decision-Making Process Data Warehousing and OLAP Technologies for Decision-Making Process Hiren H Darji Asst. Prof in Anand Institute of Information Science,Anand Abstract Data warehousing and on-line analytical processing (OLAP)

More information

Evolution of Database Systems

Evolution of Database Systems Evolution of Database Systems Krzysztof Dembczyński Intelligent Decision Support Systems Laboratory (IDSS) Poznań University of Technology, Poland Intelligent Decision Support Systems Master studies, second

More information

Question Bank. 4) It is the source of information later delivered to data marts.

Question Bank. 4) It is the source of information later delivered to data marts. Question Bank Year: 2016-2017 Subject Dept: CS Semester: First Subject Name: Data Mining. Q1) What is data warehouse? ANS. A data warehouse is a subject-oriented, integrated, time-variant, and nonvolatile

More information

Data warehouse architecture consists of the following interconnected layers:

Data warehouse architecture consists of the following interconnected layers: Architecture, in the Data warehousing world, is the concept and design of the data base and technologies that are used to load the data. A good architecture will enable scalability, high performance and

More information

Fig 1.2: Relationship between DW, ODS and OLTP Systems

Fig 1.2: Relationship between DW, ODS and OLTP Systems 1.4 DATA WAREHOUSES Data warehousing is a process for assembling and managing data from various sources for the purpose of gaining a single detailed view of an enterprise. Although there are several definitions

More information

Data Warehousing. Ritham Vashisht, Sukhdeep Kaur and Shobti Saini

Data Warehousing. Ritham Vashisht, Sukhdeep Kaur and Shobti Saini Advance in Electronic and Electric Engineering. ISSN 2231-1297, Volume 3, Number 6 (2013), pp. 669-674 Research India Publications http://www.ripublication.com/aeee.htm Data Warehousing Ritham Vashisht,

More information

Deccansoft Software Services Microsoft Silver Learning Partner. SSAS Syllabus

Deccansoft Software Services Microsoft Silver Learning Partner. SSAS Syllabus Overview: Analysis Services enables you to analyze large quantities of data. With it, you can design, create, and manage multidimensional structures that contain detail and aggregated data from multiple

More information

Data Warehouse and Data Mining

Data Warehouse and Data Mining Data Warehouse and Data Mining Lecture No. 03 Architecture of DW Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology Jamshoro Basic

More information

SQL Server Analysis Services

SQL Server Analysis Services DataBase and Data Mining Group of DataBase and Data Mining Group of Database and data mining group, SQL Server 2005 Analysis Services SQL Server 2005 Analysis Services - 1 Analysis Services Database and

More information

Data Mining Concepts & Techniques

Data Mining Concepts & Techniques Data Mining Concepts & Techniques Lecture No. 01 Databases, Data warehouse Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology Jamshoro

More information

1. Attempt any two of the following: 10 a. State and justify the characteristics of a Data Warehouse with suitable examples.

1. Attempt any two of the following: 10 a. State and justify the characteristics of a Data Warehouse with suitable examples. Instructions to the Examiners: 1. May the Examiners not look for exact words from the text book in the Answers. 2. May any valid example be accepted - example may or may not be from the text book 1. Attempt

More information

A Systems Approach to Dimensional Modeling in Data Marts. Joseph M. Firestone, Ph.D. White Paper No. One. March 12, 1997

A Systems Approach to Dimensional Modeling in Data Marts. Joseph M. Firestone, Ph.D. White Paper No. One. March 12, 1997 1 of 8 5/24/02 4:43 PM A Systems Approach to Dimensional Modeling in Data Marts By Joseph M. Firestone, Ph.D. White Paper No. One March 12, 1997 OLAP s Purposes And Dimensional Data Modeling Dimensional

More information

CHAPTER 8: ONLINE ANALYTICAL PROCESSING(OLAP)

CHAPTER 8: ONLINE ANALYTICAL PROCESSING(OLAP) CHAPTER 8: ONLINE ANALYTICAL PROCESSING(OLAP) INTRODUCTION A dimension is an attribute within a multidimensional model consisting of a list of values (called members). A fact is defined by a combination

More information

Data Warehouse and Data Mining

Data Warehouse and Data Mining Data Warehouse and Data Mining Lecture No. 04-06 Data Warehouse Architecture Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology

More information

TDWI strives to provide course books that are contentrich and that serve as useful reference documents after a class has ended.

TDWI strives to provide course books that are contentrich and that serve as useful reference documents after a class has ended. Previews of TDWI course books offer an opportunity to see the quality of our material and help you to select the courses that best fit your needs. The previews cannot be printed. TDWI strives to provide

More information

IJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 06, 2016 ISSN (online):

IJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 06, 2016 ISSN (online): IJSRD - International Journal for Scientific Research & Development Vol. 4, Issue 06, 2016 ISSN (online): 2321-0613 Tanzeela Khanam 1 Pravin S.Metkewar 2 1 Student 2 Associate Professor 1,2 SICSR, affiliated

More information

Proceedings of the IE 2014 International Conference AGILE DATA MODELS

Proceedings of the IE 2014 International Conference  AGILE DATA MODELS AGILE DATA MODELS Mihaela MUNTEAN Academy of Economic Studies, Bucharest mun61mih@yahoo.co.uk, Mihaela.Muntean@ie.ase.ro Abstract. In last years, one of the most popular subjects related to the field of

More information

DATA WAREHOUING UNIT I

DATA WAREHOUING UNIT I BHARATHIDASAN ENGINEERING COLLEGE NATTRAMAPALLI DEPARTMENT OF COMPUTER SCIENCE SUB CODE & NAME: IT6702/DWDM DEPT: IT Staff Name : N.RAMESH DATA WAREHOUING UNIT I 1. Define data warehouse? NOV/DEC 2009

More information

REPORTING AND QUERY TOOLS AND APPLICATIONS

REPORTING AND QUERY TOOLS AND APPLICATIONS Tool Categories: REPORTING AND QUERY TOOLS AND APPLICATIONS There are five categories of decision support tools Reporting Managed query Executive information system OLAP Data Mining Reporting Tools Production

More information

CHAPTER 3 Implementation of Data warehouse in Data Mining

CHAPTER 3 Implementation of Data warehouse in Data Mining CHAPTER 3 Implementation of Data warehouse in Data Mining 3.1 Introduction to Data Warehousing A data warehouse is storage of convenient, consistent, complete and consolidated data, which is collected

More information

Development of an interface that allows MDX based data warehouse queries by less experienced users

Development of an interface that allows MDX based data warehouse queries by less experienced users Development of an interface that allows MDX based data warehouse queries by less experienced users Mariana Duprat André Monat Escola Superior de Desenho Industrial 400 Introduction Data analysis is a fundamental

More information

Data Warehouse Design Using Row and Column Data Distribution

Data Warehouse Design Using Row and Column Data Distribution Int'l Conf. Information and Knowledge Engineering IKE'15 55 Data Warehouse Design Using Row and Column Data Distribution Behrooz Seyed-Abbassi and Vivekanand Madesi School of Computing, University of North

More information

OLAP Introduction and Overview

OLAP Introduction and Overview 1 CHAPTER 1 OLAP Introduction and Overview What Is OLAP? 1 Data Storage and Access 1 Benefits of OLAP 2 What Is a Cube? 2 Understanding the Cube Structure 3 What Is SAS OLAP Server? 3 About Cube Metadata

More information

Managing Data Resources

Managing Data Resources Chapter 7 OBJECTIVES Describe basic file organization concepts and the problems of managing data resources in a traditional file environment Managing Data Resources Describe how a database management system

More information

Decision Support Systems aka Analytical Systems

Decision Support Systems aka Analytical Systems Decision Support Systems aka Analytical Systems Decision Support Systems Systems that are used to transform data into information, to manage the organization: OLAP vs OLTP OLTP vs OLAP Transactions Analysis

More information

: How does DSS data differ from operational data?

: How does DSS data differ from operational data? by Daniel J Power Editor, DSSResources.com Decision support data used for analytics and data-driven DSS is related to past actions and intentions. The data is a historical record and the scale of data

More information

BUSINESS INTELLIGENCE AND OLAP

BUSINESS INTELLIGENCE AND OLAP Volume 10, No. 3, pp. 68 76, 2018 Pro Universitaria BUSINESS INTELLIGENCE AND OLAP Dimitrie Cantemir Christian University Knowledge Horizons - Economics Volume 10, No. 3, pp. 68-76 P-ISSN: 2069-0932, E-ISSN:

More information

After completing this course, participants will be able to:

After completing this course, participants will be able to: Designing a Business Intelligence Solution by Using Microsoft SQL Server 2008 T h i s f i v e - d a y i n s t r u c t o r - l e d c o u r s e p r o v i d e s i n - d e p t h k n o w l e d g e o n d e s

More information

Management Information Systems MANAGING THE DIGITAL FIRM, 12 TH EDITION FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT

Management Information Systems MANAGING THE DIGITAL FIRM, 12 TH EDITION FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT MANAGING THE DIGITAL FIRM, 12 TH EDITION Chapter 6 FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT VIDEO CASES Case 1: Maruti Suzuki Business Intelligence and Enterprise Databases

More information

1 DATAWAREHOUSING QUESTIONS by Mausami Sawarkar

1 DATAWAREHOUSING QUESTIONS by Mausami Sawarkar 1 DATAWAREHOUSING QUESTIONS by Mausami Sawarkar 1) What does the term 'Ad-hoc Analysis' mean? Choice 1 Business analysts use a subset of the data for analysis. Choice 2: Business analysts access the Data

More information

A Data Warehouse Implementation Using the Star Schema. For an outpatient hospital information system

A Data Warehouse Implementation Using the Star Schema. For an outpatient hospital information system A Data Warehouse Implementation Using the Star Schema For an outpatient hospital information system GurvinderKaurJosan Master of Computer Application,YMT College of Management Kharghar, Navi Mumbai ---------------------------------------------------------------------***----------------------------------------------------------------

More information

Generating Cross level Rules: An automated approach

Generating Cross level Rules: An automated approach Generating Cross level Rules: An automated approach Ashok 1, Sonika Dhingra 1 1HOD, Dept of Software Engg.,Bhiwani Institute of Technology, Bhiwani, India 1M.Tech Student, Dept of Software Engg.,Bhiwani

More information

Tribhuvan University Institute of Science and Technology MODEL QUESTION

Tribhuvan University Institute of Science and Technology MODEL QUESTION MODEL QUESTION 1. Suppose that a data warehouse for Big University consists of four dimensions: student, course, semester, and instructor, and two measures count and avg-grade. When at the lowest conceptual

More information

UNIT -1 UNIT -II. Q. 4 Why is entity-relationship modeling technique not suitable for the data warehouse? How is dimensional modeling different?

UNIT -1 UNIT -II. Q. 4 Why is entity-relationship modeling technique not suitable for the data warehouse? How is dimensional modeling different? (Please write your Roll No. immediately) End-Term Examination Fourth Semester [MCA] MAY-JUNE 2006 Roll No. Paper Code: MCA-202 (ID -44202) Subject: Data Warehousing & Data Mining Note: Question no. 1 is

More information

Building a Data Warehouse step by step

Building a Data Warehouse step by step Informatica Economică, nr. 2 (42)/2007 83 Building a Data Warehouse step by step Manole VELICANU, Academy of Economic Studies, Bucharest Gheorghe MATEI, Romanian Commercial Bank Data warehouses have been

More information

Data Warehousing and OLAP Technology for Primary Industry

Data Warehousing and OLAP Technology for Primary Industry Data Warehousing and OLAP Technology for Primary Industry Taehan Kim 1), Sang Chan Park 2) 1) Department of Industrial Engineering, KAIST (taehan@kaist.ac.kr) 2) Department of Industrial Engineering, KAIST

More information

A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective

A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective A Novel Approach of Data Warehouse OLTP and OLAP Technology for Supporting Management prospective B.Manivannan Research Scholar, Dept. Computer Science, Dravidian University, Kuppam, Andhra Pradesh, India

More information

Q1) Describe business intelligence system development phases? (6 marks)

Q1) Describe business intelligence system development phases? (6 marks) BUISINESS ANALYTICS AND INTELLIGENCE SOLVED QUESTIONS Q1) Describe business intelligence system development phases? (6 marks) The 4 phases of BI system development are as follow: Analysis phase Design

More information

The Data Organization

The Data Organization C V I T F E P A O TM The Data Organization Best Practices Metadata Dictionary Application Architecture Prepared by Rainer Schoenrank January 2017 Table of Contents 1. INTRODUCTION... 3 1.1 PURPOSE OF THE

More information

1. Analytical queries on the dimensionally modeled database can be significantly simpler to create than on the equivalent nondimensional database.

1. Analytical queries on the dimensionally modeled database can be significantly simpler to create than on the equivalent nondimensional database. 1. Creating a data warehouse involves using the functionalities of database management software to implement the data warehouse model as a collection of physically created and mutually connected database

More information

Appliances and DW Architecture. John O Brien President and Executive Architect Zukeran Technologies 1

Appliances and DW Architecture. John O Brien President and Executive Architect Zukeran Technologies 1 Appliances and DW Architecture John O Brien President and Executive Architect Zukeran Technologies 1 OBJECTIVES To define an appliance Understand critical components of a DW appliance Learn how DW appliances

More information

Data Mining & Data Warehouse

Data Mining & Data Warehouse Data Mining & Data Warehouse Associate Professor Dr. Raed Ibraheem Hamed University of Human Development, College of Science and Technology (1) 2016 2017 1 Points to Cover Why Do We Need Data Warehouses?

More information

DATA WAREHOUSE EGCO321 DATABASE SYSTEMS KANAT POOLSAWASD DEPARTMENT OF COMPUTER ENGINEERING MAHIDOL UNIVERSITY

DATA WAREHOUSE EGCO321 DATABASE SYSTEMS KANAT POOLSAWASD DEPARTMENT OF COMPUTER ENGINEERING MAHIDOL UNIVERSITY DATA WAREHOUSE EGCO321 DATABASE SYSTEMS KANAT POOLSAWASD DEPARTMENT OF COMPUTER ENGINEERING MAHIDOL UNIVERSITY CHARACTERISTICS Data warehouse is a central repository for summarized and integrated data

More information

Data Warehousing Introduction. Toon Calders

Data Warehousing Introduction. Toon Calders Data Warehousing Introduction Toon Calders toon.calders@ulb.ac.be Course Organization Lectures on Tuesday 14:00 and Friday 16:00 Check http://gehol.ulb.ac.be/ for room Most exercises in computer class

More information

Novel Materialized View Selection in a Multidimensional Database

Novel Materialized View Selection in a Multidimensional Database Graphic Era University From the SelectedWorks of vijay singh Winter February 10, 2009 Novel Materialized View Selection in a Multidimensional Database vijay singh Available at: https://works.bepress.com/vijaysingh/5/

More information

Index. Symbols = (equal) operator, 87

Index. Symbols = (equal) operator, 87 riordan.book Page 343 Thursday, December 16, 2004 2:23 PM Index Symbols = (equal) operator, 87 A abstract entities, 14 abstract relations, 51 accelerator keys, 321 322 Access (application), 7 access keys,

More information

SQL Server Business Intelligence 20768: Developing SQL Server 2016 Data Models in SSAS. Upcoming Dates. Course Description.

SQL Server Business Intelligence 20768: Developing SQL Server 2016 Data Models in SSAS. Upcoming Dates. Course Description. SQL Server Business Intelligence 20768: Developing SQL Server 2016 Data Models in SSAS Get the skills needed to successfully create multidimensional databases using Microsoft SQL Server Analysis Services

More information

The Data Organization

The Data Organization C V I T F E P A O TM The Data Organization 1251 Yosemite Way Hayward, CA 94545 (510) 303-8868 rschoenrank@computer.org Business Intelligence Process Architecture By Rainer Schoenrank Data Warehouse Consultant

More information

GUJARAT TECHNOLOGICAL UNIVERSITY MASTER OF COMPUTER APPLICATIONS (MCA) Semester: IV

GUJARAT TECHNOLOGICAL UNIVERSITY MASTER OF COMPUTER APPLICATIONS (MCA) Semester: IV GUJARAT TECHNOLOGICAL UNIVERSITY MASTER OF COMPUTER APPLICATIONS (MCA) Semester: IV Subject Name: Elective I Data Warehousing & Data Mining (DWDM) Subject Code: 2640005 Learning Objectives: To understand

More information

Oracle Database 11g: Data Warehousing Fundamentals

Oracle Database 11g: Data Warehousing Fundamentals Oracle Database 11g: Data Warehousing Fundamentals Duration: 3 Days What you will learn This Oracle Database 11g: Data Warehousing Fundamentals training will teach you about the basic concepts of a data

More information

Overview. Introduction to Data Warehousing and Business Intelligence. BI Is Important. What is Business Intelligence (BI)?

Overview. Introduction to Data Warehousing and Business Intelligence. BI Is Important. What is Business Intelligence (BI)? Introduction to Data Warehousing and Business Intelligence Overview Why Business Intelligence? Data analysis problems Data Warehouse (DW) introduction A tour of the coming DW lectures DW Applications Loosely

More information

Hierarchies in a multidimensional model: From conceptual modeling to logical representation

Hierarchies in a multidimensional model: From conceptual modeling to logical representation Data & Knowledge Engineering 59 (2006) 348 377 www.elsevier.com/locate/datak Hierarchies in a multidimensional model: From conceptual modeling to logical representation E. Malinowski *, E. Zimányi Department

More information

Full file at

Full file at Chapter 2 Data Warehousing True-False Questions 1. A real-time, enterprise-level data warehouse combined with a strategy for its use in decision support can leverage data to provide massive financial benefits

More information

Data Warehousing and OLAP

Data Warehousing and OLAP Data Warehousing and OLAP INFO 330 Slides courtesy of Mirek Riedewald Motivation Large retailer Several databases: inventory, personnel, sales etc. High volume of updates Management requirements Efficient

More information

A Step towards Centralized Data Warehousing Process: A Quality Aware Data Warehouse Architecture

A Step towards Centralized Data Warehousing Process: A Quality Aware Data Warehouse Architecture A Step towards Centralized Data Warehousing Process: A Quality Aware Data Warehouse Architecture Maqbool-uddin-Shaikh Comsats Institute of Information Technology Islamabad maqboolshaikh@comsats.edu.pk

More information

Data Warehouse. Asst.Prof.Dr. Pattarachai Lalitrojwong

Data Warehouse. Asst.Prof.Dr. Pattarachai Lalitrojwong Data Warehouse Asst.Prof.Dr. Pattarachai Lalitrojwong Faculty of Information Technology King Mongkut s Institute of Technology Ladkrabang Bangkok 10520 pattarachai@it.kmitl.ac.th The Evolution of Data

More information

A Methodology for Integrating XML Data into Data Warehouses

A Methodology for Integrating XML Data into Data Warehouses A Methodology for Integrating XML Data into Data Warehouses Boris Vrdoljak, Marko Banek, Zoran Skočir University of Zagreb Faculty of Electrical Engineering and Computing Address: Unska 3, HR-10000 Zagreb,

More information

Sql Fact Constellation Schema In Data Warehouse With Example

Sql Fact Constellation Schema In Data Warehouse With Example Sql Fact Constellation Schema In Data Warehouse With Example Data Warehouse OLAP - Learn Data Warehouse in simple and easy steps using Multidimensional OLAP (MOLAP), Hybrid OLAP (HOLAP), Specialized SQL

More information

MICROSOFT? OLAP SOLUTIONS BY ERIK THOMSEN, GEORGE SPOFFORD, DICK CHASE

MICROSOFT? OLAP SOLUTIONS BY ERIK THOMSEN, GEORGE SPOFFORD, DICK CHASE MICROSOFT? OLAP SOLUTIONS BY ERIK THOMSEN, GEORGE SPOFFORD, DICK CHASE DOWNLOAD EBOOK : MICROSOFT? OLAP SOLUTIONS BY ERIK THOMSEN, GEORGE SPOFFORD, DICK CHASE PDF Click link bellow and free register to

More information

ETL and OLAP Systems

ETL and OLAP Systems ETL and OLAP Systems Krzysztof Dembczyński Intelligent Decision Support Systems Laboratory (IDSS) Poznań University of Technology, Poland Software Development Technologies Master studies, first semester

More information

Implementing and Maintaining Microsoft SQL Server 2008 Analysis Services

Implementing and Maintaining Microsoft SQL Server 2008 Analysis Services Course 6234A: Implementing and Maintaining Microsoft SQL Server 2008 Analysis Services Course Details Course Outline Module 1: Introduction to Microsoft SQL Server Analysis Services This module introduces

More information

Data Modeling Online Training

Data Modeling Online Training Data Modeling Online Training IQ Online training facility offers Data Modeling online training by trainers who have expert knowledge in the Data Modeling and proven record of training hundreds of students.

More information

Rocky Mountain Technology Ventures

Rocky Mountain Technology Ventures Rocky Mountain Technology Ventures Comparing and Contrasting Online Analytical Processing (OLAP) and Online Transactional Processing (OLTP) Architectures 3/19/2006 Introduction One of the most important

More information

Designing Data Warehouses. Data Warehousing Design. Designing Data Warehouses. Designing Data Warehouses

Designing Data Warehouses. Data Warehousing Design. Designing Data Warehouses. Designing Data Warehouses Designing Data Warehouses To begin a data warehouse project, need to find answers for questions such as: Data Warehousing Design Which user requirements are most important and which data should be considered

More information

COURSE 20466D: IMPLEMENTING DATA MODELS AND REPORTS WITH MICROSOFT SQL SERVER

COURSE 20466D: IMPLEMENTING DATA MODELS AND REPORTS WITH MICROSOFT SQL SERVER ABOUT THIS COURSE The focus of this five-day instructor-led course is on creating managed enterprise BI solutions. It describes how to implement multidimensional and tabular data models, deliver reports

More information

Teradata Aggregate Designer

Teradata Aggregate Designer Data Warehousing Teradata Aggregate Designer By: Sam Tawfik Product Marketing Manager Teradata Corporation Table of Contents Executive Summary 2 Introduction 3 Problem Statement 3 Implications of MOLAP

More information

Data Warehouses Chapter 12. Class 10: Data Warehouses 1

Data Warehouses Chapter 12. Class 10: Data Warehouses 1 Data Warehouses Chapter 12 Class 10: Data Warehouses 1 OLTP vs OLAP Operational Database: a database designed to support the day today transactions of an organization Data Warehouse: historical data is

More information

Chapter 3. Architecture and Design

Chapter 3. Architecture and Design Chapter 3. Architecture and Design Design decisions and functional architecture of the Semi automatic generation of warehouse schema has been explained in this section. 3.1. Technical Architecture System

More information

ENTITIES IN THE OBJECT-ORIENTED DESIGN PROCESS MODEL

ENTITIES IN THE OBJECT-ORIENTED DESIGN PROCESS MODEL INTERNATIONAL DESIGN CONFERENCE - DESIGN 2000 Dubrovnik, May 23-26, 2000. ENTITIES IN THE OBJECT-ORIENTED DESIGN PROCESS MODEL N. Pavković, D. Marjanović Keywords: object oriented methodology, design process

More information

DB2: Data Warehousing. by Andrea Piermarteri & Matteo Micheletti

DB2: Data Warehousing. by Andrea Piermarteri & Matteo Micheletti DB2: Data Warehousing by Andrea Piermarteri & Matteo Micheletti Introduction to Data Warehousing What is Data Warehousing? A collection of methods, technologies and tools to assist the knowledge workers

More information

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing.

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing. About the Tutorial A data warehouse is constructed by integrating data from multiple heterogeneous sources. It supports analytical reporting, structured and/or ad hoc queries and decision making. This

More information

IT DATA WAREHOUSING AND DATA MINING UNIT-2 BUSINESS ANALYSIS

IT DATA WAREHOUSING AND DATA MINING UNIT-2 BUSINESS ANALYSIS PART A 1. What are production reporting tools? Give examples. (May/June 2013) Production reporting tools will let companies generate regular operational reports or support high-volume batch jobs. Such

More information

INTRODUCTION. Chris Claterbos, Vlamis Software Solutions, Inc. REVIEW OF ARCHITECTURE

INTRODUCTION. Chris Claterbos, Vlamis Software Solutions, Inc. REVIEW OF ARCHITECTURE BUILDING AN END TO END OLAP SOLUTION USING ORACLE BUSINESS INTELLIGENCE Chris Claterbos, Vlamis Software Solutions, Inc. claterbos@vlamis.com INTRODUCTION Using Oracle 10g R2 and Oracle Business Intelligence

More information

Call: SAS BI Course Content:35-40hours

Call: SAS BI Course Content:35-40hours SAS BI Course Content:35-40hours Course Outline SAS Data Integration Studio 4.2 Introduction * to SAS DIS Studio Features of SAS DIS Studio Tasks performed by SAS DIS Studio Navigation to SAS DIS Studio

More information

Syllabus. Syllabus. Motivation Decision Support. Syllabus

Syllabus. Syllabus. Motivation Decision Support. Syllabus Presentation: Sophia Discussion: Tianyu Metadata Requirements and Conclusion 3 4 Decision Support Decision Making: Everyday, Everywhere Decision Support System: a class of computerized information systems

More information

The University of Iowa Intelligent Systems Laboratory The University of Iowa Intelligent Systems Laboratory

The University of Iowa Intelligent Systems Laboratory The University of Iowa Intelligent Systems Laboratory Warehousing Outline Andrew Kusiak 2139 Seamans Center Iowa City, IA 52242-1527 andrew-kusiak@uiowa.edu http://www.icaen.uiowa.edu/~ankusiak Tel. 319-335 5934 Introduction warehousing concepts Relationship

More information

Developing SQL Data Models

Developing SQL Data Models Developing SQL Data Models 20768B; 3 Days; Instructor-led Course Description The focus of this 3-day instructor-led course is on creating managed enterprise BI solutions. It describes how to implement

More information

Managing Data Resources

Managing Data Resources Chapter 7 Managing Data Resources 7.1 2006 by Prentice Hall OBJECTIVES Describe basic file organization concepts and the problems of managing data resources in a traditional file environment Describe how

More information

turning data into dollars

turning data into dollars turning data into dollars Tom s Ten Data Tips December 2008 ETL ETL stands for Extract, Transform, Load. This process merges and integrates information from source systems in the data warehouse (DWH).

More information

Advanced Data Management Technologies

Advanced Data Management Technologies ADMT 2017/18 Unit 2 J. Gamper 1/44 Advanced Data Management Technologies Unit 2 Basic Concepts of BI and DW J. Gamper Free University of Bozen-Bolzano Faculty of Computer Science IDSE Acknowledgements:

More information

6234A - Implementing and Maintaining Microsoft SQL Server 2008 Analysis Services

6234A - Implementing and Maintaining Microsoft SQL Server 2008 Analysis Services 6234A - Implementing and Maintaining Microsoft SQL Server 2008 Analysis Services Course Number: 6234A Course Length: 3 Days Course Overview This instructor-led course teaches students how to implement

More information

Database Vs. Data Warehouse

Database Vs. Data Warehouse Database Vs. Data Warehouse Similarities and differences Databases and data warehouses are used to generate different types of information. Information generated by both are used for different purposes.

More information

Using Microsoft Analysis Service to analyze graduates performances and working conditions

Using Microsoft Analysis Service to analyze graduates performances and working conditions Using Microsoft Analysis Service to analyze graduates performances and working conditions Alberto Leone 1, Loris Cancellieri 2, Angelo Guerriero 3 and Andrea Cammelli 4 Consorzio Interuniversitario AlmaLaurea,

More information

A MAS Based ETL Approach for Complex Data

A MAS Based ETL Approach for Complex Data A MAS Based ETL Approach for Complex Data O. Boussaid, F. Bentayeb, J. Darmont Abstract : In a data warehousing process, the phase of data integration is crucial. Many methods for data integration have

More information

DSS based on Data Warehouse

DSS based on Data Warehouse DSS based on Data Warehouse C_13 / 19.01.2017 Decision support system is a complex system engineering. At the same time, research DW composition, DW structure and DSS Architecture based on DW, puts forward

More information

The Use of Soft Systems Methodology for the Development of Data Warehouses

The Use of Soft Systems Methodology for the Development of Data Warehouses The Use of Soft Systems Methodology for the Development of Data Warehouses Roelien Goede School of Information Technology, North-West University Vanderbijlpark, 1900, South Africa ABSTRACT When making

More information

International Journal of Scientific & Engineering Research, Volume 7, Issue 11, November ISSN

International Journal of Scientific & Engineering Research, Volume 7, Issue 11, November ISSN International Journal of Scientific & Engineering Research, Volume 7, Issue 11, November-2016 5 An Embedded Car Parts Sale of Mercedes Benz with OLAP Implementation (Case Study: PT. Mass Sarana Motorama)

More information

Using SLE for creation of Data Warehouses

Using SLE for creation of Data Warehouses Using SLE for creation of Data Warehouses Yvette Teiken OFFIS, Institute for Information Technology, Germany teiken@offis.de Abstract. This paper describes how software language engineering is applied

More information

Power Distribution Analysis For Electrical Usage In Province Area Using Olap (Online Analytical Processing)

Power Distribution Analysis For Electrical Usage In Province Area Using Olap (Online Analytical Processing) Power Distribution Analysis For Electrical Usage In Province Area Using Olap (Online Analytical Processing) Riza Samsinar 1,*, Jatmiko Endro Suseno 2, and Catur Edi Widodo 3 1 Master Program of Information

More information

Unit 7: Basics in MS Power BI for Excel 2013 M7-5: OLAP

Unit 7: Basics in MS Power BI for Excel 2013 M7-5: OLAP Unit 7: Basics in MS Power BI for Excel M7-5: OLAP Outline: Introduction Learning Objectives Content Exercise What is an OLAP Table Operations: Drill Down Operations: Roll Up Operations: Slice Operations:

More information

Multidimensional modeling using MIDEA

Multidimensional modeling using MIDEA Multidimensional modeling using MIDEA JOSÉ MARÍA CAVERO 1, MARIO PIATTINI 2, ESPERANZA MARCOS 1 1 Kybele Research Group, Escuela Superior de Ciencias Experimentales y Tecnología Universidad Rey Juan Carlos

More information

DATA MINING TRANSACTION

DATA MINING TRANSACTION DATA MINING Data Mining is the process of extracting patterns from data. Data mining is seen as an increasingly important tool by modern business to transform data into an informational advantage. It is

More information

Data Warehousing. Overview

Data Warehousing. Overview Data Warehousing Overview Basic Definitions Normalization Entity Relationship Diagrams (ERDs) Normal Forms Many to Many relationships Warehouse Considerations Dimension Tables Fact Tables Star Schema Snowflake

More information

Introduction to DWML. Christian Thomsen, Aalborg University. Slides adapted from Torben Bach Pedersen and Man Lung Yiu

Introduction to DWML. Christian Thomsen, Aalborg University. Slides adapted from Torben Bach Pedersen and Man Lung Yiu Introduction to DWML Christian Thomsen, Aalborg University Slides adapted from Torben Bach Pedersen and Man Lung Yiu Course Structure Business intelligence Extract knowledge from large amounts of data

More information

WKU-MIS-B10 Data Management: Warehousing, Analyzing, Mining, and Visualization. Management Information Systems

WKU-MIS-B10 Data Management: Warehousing, Analyzing, Mining, and Visualization. Management Information Systems Management Information Systems Management Information Systems B10. Data Management: Warehousing, Analyzing, Mining, and Visualization Code: 166137-01+02 Course: Management Information Systems Period: Spring

More information

Aggregating Knowledge in a Data Warehouse and Multidimensional Analysis

Aggregating Knowledge in a Data Warehouse and Multidimensional Analysis Aggregating Knowledge in a Data Warehouse and Multidimensional Analysis Rafal Lukawiecki Strategic Consultant, Project Botticelli Ltd rafal@projectbotticelli.com Objectives Explain the basics of: 1. Data

More information

Design patterns of database models as storage systems for experimental information in solving research problems

Design patterns of database models as storage systems for experimental information in solving research problems Design patterns of database models as storage systems for experimental information in solving research problems D.E. Yablokov 1 1 Samara National Research University, 34 Moskovskoe Shosse, 443086, Samara,

More information