A Proposal of Integrating Data Mining and On-Line Analytical Processing in Data Warehouse
Zhen LU
Faculty of Human Environment, Nagasaki Institute of Applied Science, 536 Aba-machi, Nagasaki, Japan

Minyi GUO
Department of Computer Software, The University of Aizu, Aizu-Wakamatsu City, Fukushima 965-8580, Japan

Abstract

As analysis tools, On-Line Analytical Processing (OLAP) and data mining each have their own strengths and weaknesses. Integrating data mining with OLAP offsets their respective weaknesses and is an effective way to enhance the power and flexibility of data mining in data warehouses and large databases. In this paper, a proposal for integrating data mining and OLAP is put forward. The mechanism of on-line analytical data mining in a data warehouse is described. Aspects that are necessary for developing a successful on-line analytical data mining system are also discussed.

Key words: Data Mining, On-Line Analytical Processing, Data Warehouse

1. Introduction

In recent years, with the development of the technologies of Data Warehouse, Data Mining and On-Line Analytical Processing (OLAP), Decision Support System (DSS) research and implementation have entered a completely new stage [1][2]. OLAP and data mining have become integral parts of the decision support process.

Although data mining and OLAP are both analytical tools, there are obvious differences between them. The analysis process of data mining is completed automatically: it extracts hidden patterns and predicts future trends and behaviors without requiring the user to give an exact query, which is of benefit in finding unknown facts. OLAP, in contrast, depends on the user's queries and propositions to complete the analysis process. This restricts the scope of the queries and propositions and affects the final results. From the viewpoint of data analysis, OLAP lies at a lower level, while data mining can find more complex and detailed information that cannot be found by OLAP.
On the other hand, to date, most OLAP systems have focused on providing access to multi-dimensional data, while data mining systems have dealt with influence analysis of data along a single dimension.

In applying data mining in a data warehouse, the problem is that data mining is difficult to realize. A data warehouse holds enormous amounts of data with hundreds or thousands of attributes. Because the mining analysis proceeds automatically, and the user only specifies mining tasks without providing a search path, the search space becomes too large and too many patterns are generated. Moreover, most of the generated patterns may be common knowledge or meaningless. OLAP, for its part, can provide views from different viewpoints and at different levels of abstraction, but because the user's query cannot be completely understood beforehand, a view may lack dimensions that should be included, and the results obtained from different views may not be consistent. Therefore, it is easy to be misled. Built on the data warehouse, an integration of data mining with OLAP will improve the efficiency and effectiveness of data warehouse based DSS.

In this paper, an approach to integrating OLAP and data mining in one architecture is put forward. It integrates data mining with on-line analytical processing organically to accomplish knowledge discovery in the data warehouse. The mechanism of on-line analytical data mining in the data warehouse is described.

2. Data Warehouse, Data Mining and OLAP

2.1 Data Warehouse

A Data Warehouse has been defined by W. H. Inmon as a subject oriented, time-variant, nonvolatile collection of data in support of management's decision needs [3][4]. It provides tools to satisfy the information needs of users at all organizational levels, not just for complex data queries, but as a general facility for getting quick, accurate, and often insightful information. A Data Warehouse is designed so that its users can recognize the information they want and access that information using simple tools. A Data Warehouse is physically separated from operational systems, and holds both aggregated and atomic data for management, separate from the databases used for on-line transaction processing (OLTP).

A Data Warehouse should have two basic functions. One is that it should provide subject-oriented summarized data for supporting decisions. The data warehouse provides a clear and unambiguous definition of every key data entity, describing the way each is used, as well as defining derivation formulas, aggregation categories, and refreshment time periods. The other function is that it should take on the task of capturing data from operational databases and other external data sources. The schematic drawing of the data warehouse is shown in Figure 1.

In order to obtain consistent data for mining, data mining tools often require the raw data to be integrated and cleaned first. This requires costly preprocessing steps of data cleaning, data transformation, and data integration. Since a data warehouse normally goes through these preprocessing steps for OLAP operations, it serves as a valuable data source for data mining.

2.2 Knowledge Discovery in Database and Data Mining

Knowledge Discovery in Database (KDD) is a process of applying technologies, mostly from artificial intelligence, to discover new information from a data warehouse [5][6].
It is a nontrivial process of identifying valid, novel, potentially useful, and ultimately understandable patterns in data. There are five steps in the KDD process (Figure 2): selection; preprocessing; transformation; data mining; interpretation/evaluation.
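The five KDD steps listed above can be pictured as a chain of functions, each feeding the next. The following is only a minimal sketch under stated assumptions: the records, the price-band rule, and every function name are hypothetical and chosen for illustration, and the "mining" step is a plain frequency count rather than any algorithm from the paper.

```python
from collections import Counter

# Hypothetical raw records: (region, product, amount); None marks a missing value.
RAW = [
    ("north", "tea", 12.0),
    ("north", "tea", None),
    ("south", "coffee", 30.0),
    ("south", "tea", 8.0),
    ("north", "coffee", 25.0),
]

def select(records, region):
    """Selection: keep only the records relevant to the task."""
    return [r for r in records if r[0] == region]

def preprocess(records):
    """Preprocessing: drop records whose amount is missing."""
    return [r for r in records if r[2] is not None]

def transform(records):
    """Transformation: reduce each record to (product, price band)."""
    return [(product, "high" if amount >= 20 else "low")
            for _, product, amount in records]

def mine(items):
    """Data mining: count how often each (product, band) pattern occurs."""
    return Counter(items)

def interpret(patterns, min_count=1):
    """Interpretation/evaluation: keep patterns that meet a count threshold."""
    return {p: c for p, c in patterns.items() if c >= min_count}

def kdd(records, region):
    """Run the five steps in order on the given records."""
    return interpret(mine(transform(preprocess(select(records, region)))))

patterns_north = kdd(RAW, "north")
```

In a data warehouse setting, the first two functions would largely be covered by the warehouse's own integration and cleaning, which is why the sketch keeps them trivial.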
Figure 2. Outline of the Steps of the KDD Process

Data mining is one step of the KDD process. It extracts hidden predictive information from a data warehouse or large database. Data mining tools predict future trends and behaviors, allowing businesses to make proactive, knowledge-driven decisions. The automated, prospective analyses offered by data mining move beyond the analyses of past events provided by the retrospective tools typical of DSS.

2.3 On-Line Analytical Processing

On-line analytical processing was introduced in 1993 by E. F. Codd [7][8], the father of relational databases, in a major article in Computerworld. Codd came to the conclusion that relational databases (for transaction processing) had reached the maximum of their capabilities in terms of the views of the data they provided the user. The problem stemmed principally from the massive computing required when relational databases were asked to answer relatively simple SQL queries. He also came to the view that operational data are not adequate for answering managerial questions. He therefore advocated the use of multidimensional databases. His conversion to the DSS/EIS viewpoint gave legitimacy to the data warehouse based concepts.

The basic idea in OLAP is that managers should be able to manipulate enterprise data across many dimensions to understand the changes that are occurring. Because it offers powerful multidimensional analysis for the data warehouse, it is necessary to adopt on-line analytical processing technology in data warehouses and large databases. OLAP provides such facilities as drilling, pivoting, filtering, dicing and slicing, so that the user can traverse the data flexibly, define the set of relevant data, analyze data at different granularities, and visualize the results in different forms. These operations can also be applied to data mining to make it an exploratory and effective process.
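To make the OLAP operations named above concrete, here is a small sketch on an in-memory fact table. It is an illustration only: the fact rows, the dimension names, and the helper functions `aggregate`, `slice_` and `dice` are assumptions of this example, not the interface of any OLAP product.

```python
from collections import defaultdict

# Hypothetical fact table: three dimensions (year, region, product), one measure (sales).
FACTS = [
    {"year": 2000, "region": "north", "product": "tea", "sales": 10},
    {"year": 2000, "region": "south", "product": "tea", "sales": 5},
    {"year": 2000, "region": "north", "product": "coffee", "sales": 7},
    {"year": 2001, "region": "north", "product": "tea", "sales": 12},
    {"year": 2001, "region": "south", "product": "coffee", "sales": 9},
]

def aggregate(facts, dims, measure="sales"):
    """Group by the given dimensions and sum the measure (one view of the cube)."""
    totals = defaultdict(int)
    for row in facts:
        totals[tuple(row[d] for d in dims)] += row[measure]
    return dict(totals)

def slice_(facts, dim, value):
    """Slice: fix a single dimension to one value (named slice_ to avoid the builtin)."""
    return [r for r in facts if r[dim] == value]

def dice(facts, **allowed):
    """Dice: restrict several dimensions to sets of allowed values."""
    return [r for r in facts if all(r[d] in vals for d, vals in allowed.items())]

by_year = aggregate(FACTS, ["year"])                    # coarse, rolled-up view
by_year_region = aggregate(FACTS, ["year", "region"])   # drill down: add a dimension
by_region_year = aggregate(FACTS, ["region", "year"])   # pivot: swap the axes
north_tea = aggregate(dice(FACTS, region={"north"}, product={"tea"}), ["year"])
products_2001 = aggregate(slice_(FACTS, "year", 2001), ["product"])
```

Each result is a much smaller data set than the fact table it came from, which is the property the text relies on when it argues that mining after OLAP operations can respond faster than mining raw data.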
Together with OLAP, data mining functions can provide an overview of the discovered knowledge so that the user can investigate further any interesting patterns or anomalies. Because the data set is relatively more compact after OLAP operations, mining integrated with OLAP technology can ensure faster response than mining the raw data directly.

3. On-Line Analytical Data Mining

3.1 The Architecture

The suggested architecture for integrated data mining and on-line analytical processing is shown in Figure 3. It mainly consists of 7 components.

(1) Data Warehouse: the platform of the on-line analytical data mining;
(2) Data Mining Agent: performing analytical mining in data cubes, aided by the OLAP engine;
(3) OLAP Engine: providing fast access to summarized data along multiple dimensions;
(4) Applications Programming Interface: the collection of instructions, functions, regulations and rules for on-line data mining on the data warehouse platform;
(5) Data Cube: aggregation of data warehouse information;
(6) Meta Data: data for managing and controlling data warehouse creation and maintenance.
Figure 3. The Integrated Data Mining and OLAP Architecture

3.2 The Mechanism

The data cube is the core of on-line analytical data mining. It provides aggregated information that can be used to analyze the contents of databases and data warehouses. It is constructed from a subset of the attributes in the databases and data warehouses. The data mining agent performs analytical mining in data cubes with the aid of the OLAP engine. The data mining agent and the OLAP engine both accept the user's on-line queries through the user interface and work with the data cube through the applications programming interface during the analysis. Furthermore, the data mining agent may perform multiple data mining tasks, such as concept description, association, classification, prediction, clustering, time-series analysis, etc. Therefore, the data mining agent is more sophisticated than the OLAP engine, since it usually consists of multiple mining modules which may interact with each other for effective mining.

Since some requirements of the data mining agent, such as the construction of numerical dimensions, may not be readily available in commercial OLAP products, particular mining modules should be built into the model base. With many OLAP products available on the market, it is important to develop on-line analytical mining mechanisms directly on top of constructed data cubes and OLAP engines. Although data mining agent analysis may often involve a large number of dimensions at finer granularities, and thus require more powerful data cube construction and accessing tools than OLAP analysis, there is no fundamental difference between the data cube required for the OLAP engine and that for the data mining agent.
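The division of labor described above (a data cube at the core, an OLAP engine that gives access to it, and a data mining agent made of pluggable mining modules) might be sketched as follows. This is a hedged illustration, not the authors' design: the class names, method names, sample rows, and the toy `top_products` module are all hypothetical.

```python
from collections import Counter, defaultdict

class DataCube:
    """Holds fact rows; stands in for the aggregated warehouse data."""
    def __init__(self, rows, measure):
        self.rows = rows
        self.measure = measure

class OlapEngine:
    """Gives access to the cube along chosen dimensions."""
    def __init__(self, cube):
        self.cube = cube

    def select(self, **fixed):
        """Return the rows whose dimensions match the fixed values."""
        return [r for r in self.cube.rows
                if all(r[d] == v for d, v in fixed.items())]

    def rollup(self, dims):
        """Sum the measure grouped by the given dimensions."""
        totals = defaultdict(int)
        for r in self.cube.rows:
            totals[tuple(r[d] for d in dims)] += r[self.cube.measure]
        return dict(totals)

class MiningAgent:
    """Runs registered mining modules on cube portions chosen via the OLAP engine."""
    def __init__(self, engine):
        self.engine = engine
        self.modules = {}

    def register(self, task, func):
        self.modules[task] = func

    def mine(self, task, **fixed):
        return self.modules[task](self.engine.select(**fixed))

def top_products(rows):
    """Toy 'concept description' module: total quantity per product, largest first."""
    totals = Counter()
    for r in rows:
        totals[r["product"]] += r["qty"]
    return totals.most_common()

ROWS = [
    {"region": "north", "product": "tea", "qty": 3},
    {"region": "north", "product": "coffee", "qty": 5},
    {"region": "south", "product": "tea", "qty": 4},
    {"region": "north", "product": "tea", "qty": 1},
]
engine = OlapEngine(DataCube(ROWS, measure="qty"))
agent = MiningAgent(engine)
agent.register("describe", top_products)
north_summary = agent.mine("describe", region="north")
```

Keeping the modules behind a `register` call is one way to let new mining tasks be added without touching the engine or the cube, which matches the text's point that the agent and the OLAP engine can share the same data cube.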
Since the data mining agent is constructed either on customized data cubes, which often work with relational database systems, or on top of data cubes provided by OLAP products, it is suggested to build on-line analytical mining systems on top of the existing OLAP and relational database systems, rather than from the ground up.

As the data warehouse provides subject oriented summarized data (Figure 1), using data warehouse data improves the efficiency of data mining. When data mining is carried out on a data warehouse, the first two steps, selection and preprocessing, are already roughly completed. The KDD process then begins from data transformation, whose purpose is to determine the usable dimensions for the specific data mining problem and the particular algorithm used to carry out the mining. In data transformation, particular analysis algorithms are used to determine the dimensions that are useful to, or greatly affect, the specific data mining task.

As data mining functions usually cost more than simple OLAP operations, efficient implementation and fast response are the keys to realizing on-line analytical data mining. It is very important to develop, test, and share on-line analytical data mining modules that follow a modular design and a standard application programming interface. In order to achieve fast response for data mining queries, efficient and constraint-based data mining algorithms need to be applied.

4. Discussion

There have been many studies on on-line analytical data mining recently [9][10]. A thoughtful design will help the systematic development of on-line analytical data mining mechanisms in the data warehouse. However, there are many difficulties in developing such a system in practice.

Efficient support of multi-feature cubes and of cubes with complex dimensions and measures is necessary. Many data mining tasks need discovery-driven exploration of multi-feature cubes, which are complex subqueries involving multiple dependent queries at multiple granularities. Moreover, traditional data cubes support only dimensions of categorical data and measures of numerical data. In practice, the dimensions of a data cube can be of numerical, spatial and multimedia data. The measures of a cube can also be spatial and multimedia aggregations, or collections of pointers to such objects. Support of such non-traditional data cubes will enhance the power of data mining.

On-line analytical data mining must have the ability to mine anywhere. With a multidimensional database and an OLAP engine, it is easy to carve out portions of data sets at multiple levels of abstraction using OLAP operations, such as drilling, dicing/slicing, pivoting, filtering, etc. This greatly facilitates the on-line analytical data mining process, since such a process should be exploratory in nature; that is, mining should be performed on different portions of data at multiple levels of abstraction. By interacting with OLAP operations, one can perform drilling, dicing/slicing, and pivoting during data mining as well. Moreover, some data mining processes may need to explore at least some of the data in great detail.
An OLAP engine often provides facilities to drill through the data cube down to the primitive, low-level data stored in the database. The interaction of multiple data mining modules with an OLAP engine will ensure that mining can be easily performed anywhere in a data warehouse.

On-line analytical data mining must have the ability to let multiple data mining functions interact. The strength of on-line analytical data mining should lie not only in the selection of a set of data mining functions but also in the interaction among multiple data mining and OLAP functions.

Fast response and high performance mining are necessary. It is highly desirable and productive to interact with the mining process and dynamically explore data spaces. However, fast response is critical for interactive mining. Sometimes one may even prefer to trade mining accuracy for fast response, since interactive mining may progressively lead miners to narrow the search space and find more and more important patterns. Once a user has identified a small search space, more sophisticated but slower mining algorithms can be called up for careful examination.

An on-line analytical data mining system is a system which communicates with users and knowledge visualization packages at the top and with data cubes/databases at the bottom. Thus, it should be highly modularized, with careful design and systematic development. Moreover, an on-line analytical data mining system should be designed with extensibility in mind, since it will be expected to be integrated with many subsystems or extended in many ways. For example, an OLAP data mining system may be integrated with a statistical data analysis package, or be extended for spatial data mining, text mining, financial data analysis, multimedia data mining, Web mining, and so on. A modularized design may lead to easy extension towards new domains.
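The idea of trading accuracy for response time, then calling a slower method once the search space is small, can be illustrated with a two-phase count of co-occurring item pairs. This is a sketch under stated assumptions, not a method from the paper: the transactions are made up, the quick phase simply scans every `step`-th transaction, and the relaxed threshold (half of `min_support`) is an arbitrary choice that can still miss true patterns.

```python
from collections import Counter
from itertools import combinations

def pair_counts(transactions, candidates=None):
    """Count item pairs per transaction, optionally restricted to candidate pairs."""
    counts = Counter()
    for t in transactions:
        for pair in combinations(sorted(set(t)), 2):
            if candidates is None or pair in candidates:
                counts[pair] += 1
    return counts

def quick_then_careful(transactions, min_support, step=2):
    """Two-phase mining: approximate pass on a sample, exact pass on the survivors."""
    # Phase 1 (fast, approximate): scan only every `step`-th transaction.
    sample = transactions[::step]
    approx = pair_counts(sample)
    # Keep pairs whose sample support reaches half the threshold (assumed margin).
    candidates = {p for p, c in approx.items()
                  if c / len(sample) >= min_support / 2}
    # Phase 2 (slower, exact): full scan, but only for the narrowed candidate set.
    exact = pair_counts(transactions, candidates)
    n = len(transactions)
    return {p: c / n for p, c in exact.items() if c / n >= min_support}

# Hypothetical market-basket transactions.
BASKETS = [
    ["bread", "milk"],
    ["bread", "butter"],
    ["bread", "milk", "butter"],
    ["milk"],
    ["bread", "milk"],
    ["butter", "jam"],
]
frequent_pairs = quick_then_careful(BASKETS, min_support=0.5)
```

In an interactive session the first phase would give the user something to react to quickly, and the second phase would run only on the portion of the search space the user (or the threshold) has singled out.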
To develop a successful on-line data mining system, visualization tools are indispensable. Since an OLAP data mining system will integrate OLAP and data mining and mine various kinds of knowledge from the data warehouse, it is important to develop a variety of knowledge and data visualization tools. Charts, curves, decision trees, rule graphs, cube views, boxplot graphs, etc. are effective tools to describe data mining results, and they help users monitor the process of data mining and interact with it.
5. References

[1] S. Anahory and D. Murray, Data Warehousing in the Real World: A Practical Guide for Building Decision Support Systems, Harlow, UK: Addison Wesley Longman.
[2] P. Gray and H. J. Watson, Decision Support in the Data Warehouse, Upper Saddle River, NJ: Prentice Hall PTR.
[3] W. H. Inmon, Building the Data Warehouse, New York: John Wiley & Sons.
[4] W. H. Inmon, R. H. Terdeman, and Claudia Imhoff, Exploration Warehousing: Turning Business Information into Business Opportunity, Wiley, 2000.
[5] U. M. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy, Advances in Knowledge Discovery and Data Mining, AAAI/MIT Press, 1996.
[6] M. S. Chen, J. Han, and P. S. Yu, Data mining: An overview from a database perspective, IEEE Transactions on Knowledge and Data Engineering.
[7] E. F. Codd, E. S. Codd and C. T. Salley, Beyond Decision Support, Computerworld, Vol. 27, No. 30, July.
[8] Qing Chen, Mining Exceptions and Quantitative Association Rules in OLAP Data Cube, IEEE Transactions on Knowledge and Data Engineering.
[9] J. W. Han, Towards On-Line Analytical Mining in Large Databases, ACM SIGMOD Record.
[10] K. Parsaye, OLAP and Data Mining: Bridging the Gap, Database Programming and Design.
Chapter 3 Foundations of Business Intelligence: Databases and Information Management THE DATA HIERARCHY TRADITIONAL FILE PROCESSING Organizing Data in a Traditional File Environment Problems with the traditional
More informationImplementing and Maintaining Microsoft SQL Server 2008 Analysis Services
Course 6234A: Implementing and Maintaining Microsoft SQL Server 2008 Analysis Services Course Details Course Outline Module 1: Introduction to Microsoft SQL Server Analysis Services This module introduces
More informationChapter 1, Introduction
CSI 4352, Introduction to Data Mining Chapter 1, Introduction Young-Rae Cho Associate Professor Department of Computer Science Baylor University What is Data Mining? Definition Knowledge Discovery from
More informationViságe.BIT. An OLAP/Data Warehouse solution for multi-valued databases
Viságe.BIT An OLAP/Data Warehouse solution for multi-valued databases Abstract : Viságe.BIT provides data warehouse/business intelligence/olap facilities to the multi-valued database environment. Boasting
More informationData Mining Technology Based on Bayesian Network Structure Applied in Learning
, pp.67-71 http://dx.doi.org/10.14257/astl.2016.137.12 Data Mining Technology Based on Bayesian Network Structure Applied in Learning Chunhua Wang, Dong Han College of Information Engineering, Huanghuai
More informationData Warehouse and Data Mining
Data Warehouse and Data Mining Lecture No. 02 Introduction to Data Warehouse Naeem Ahmed Email: naeemmahoto@gmail.com Department of Software Engineering Mehran Univeristy of Engineering and Technology
More informationUsing SLE for creation of Data Warehouses
Using SLE for creation of Data Warehouses Yvette Teiken OFFIS, Institute for Information Technology, Germany teiken@offis.de Abstract. This paper describes how software language engineering is applied
More informationWith data-based models and design of experiments towards successful products - Concept of the product design workbench
European Symposium on Computer Arded Aided Process Engineering 15 L. Puigjaner and A. Espuña (Editors) 2005 Elsevier Science B.V. All rights reserved. With data-based models and design of experiments towards
More informationData Warehousing and OLAP
Data Warehousing and OLAP INFO 330 Slides courtesy of Mirek Riedewald Motivation Large retailer Several databases: inventory, personnel, sales etc. High volume of updates Management requirements Efficient
More informationDr.G.R.Damodaran College of Science
1 of 20 8/28/2017 2:13 PM Dr.G.R.Damodaran College of Science (Autonomous, affiliated to the Bharathiar University, recognized by the UGC)Reaccredited at the 'A' Grade Level by the NAAC and ISO 9001:2008
More informationDatabase and Knowledge-Base Systems: Data Mining. Martin Ester
Database and Knowledge-Base Systems: Data Mining Martin Ester Simon Fraser University School of Computing Science Graduate Course Spring 2006 CMPT 843, SFU, Martin Ester, 1-06 1 Introduction [Fayyad, Piatetsky-Shapiro
More informationQ1) Describe business intelligence system development phases? (6 marks)
BUISINESS ANALYTICS AND INTELLIGENCE SOLVED QUESTIONS Q1) Describe business intelligence system development phases? (6 marks) The 4 phases of BI system development are as follow: Analysis phase Design
More informationR07. FirstRanker. 7. a) What is text mining? Describe about basic measures for text retrieval. b) Briefly describe document cluster analysis.
www..com www..com Set No.1 1. a) What is data mining? Briefly explain the Knowledge discovery process. b) Explain the three-tier data warehouse architecture. 2. a) With an example, describe any two schema
More informationBUSINESS INTELLIGENCE AND OLAP
Volume 10, No. 3, pp. 68 76, 2018 Pro Universitaria BUSINESS INTELLIGENCE AND OLAP Dimitrie Cantemir Christian University Knowledge Horizons - Economics Volume 10, No. 3, pp. 68-76 P-ISSN: 2069-0932, E-ISSN:
More informationUNIT -1 UNIT -II. Q. 4 Why is entity-relationship modeling technique not suitable for the data warehouse? How is dimensional modeling different?
(Please write your Roll No. immediately) End-Term Examination Fourth Semester [MCA] MAY-JUNE 2006 Roll No. Paper Code: MCA-202 (ID -44202) Subject: Data Warehousing & Data Mining Note: Question no. 1 is
More informationINSTITUTE OF AERONAUTICAL ENGINEERING (Autonomous) Dundigal, Hyderabad
INSTITUTE OF AERONAUTICAL ENGINEERING (Autonomous) Dundigal, Hyderabad - 500 043 INFORMATION TECHNOLOGY DEFINITIONS AND TERMINOLOGY Course Name : DATA WAREHOUSING AND DATA MINING Course Code : AIT006 Program
More informationCourse no: CSC- 451 Full Marks: Credit hours: 3 Pass Marks: Nature of course: Theory (3 Hrs.) + Lab (3 Hrs.)
CSC-451 Data Warehousing and Data Mining Course no: CSC- 451 Full Marks: 60+20+20 Credit hours: 3 Pass Marks: 24+8+8 Nature of course: Theory (3 Hrs.) + Lab (3 Hrs.) Course Synopsis: Analysis of advanced
More informationOn-Line Analytical Processing (OLAP) Traditional OLTP
On-Line Analytical Processing (OLAP) CSE 6331 / CSE 6362 Data Mining Fall 1999 Diane J. Cook Traditional OLTP DBMS used for on-line transaction processing (OLTP) order entry: pull up order xx-yy-zz and
More informationPartner Presentation Faster and Smarter Data Warehouses with Oracle OLAP 11g
Partner Presentation Faster and Smarter Data Warehouses with Oracle OLAP 11g Vlamis Software Solutions, Inc. Founded in 1992 in Kansas City, Missouri Oracle Partner and reseller since 1995 Specializes
More information20466C - Version: 1. Implementing Data Models and Reports with Microsoft SQL Server
20466C - Version: 1 Implementing Data Models and Reports with Microsoft SQL Server Implementing Data Models and Reports with Microsoft SQL Server 20466C - Version: 1 5 days Course Description: The focus
More informationData Warehousing and Decision Support. Introduction. Three Complementary Trends. [R&G] Chapter 23, Part A
Data Warehousing and Decision Support [R&G] Chapter 23, Part A CS 432 1 Introduction Increasingly, organizations are analyzing current and historical data to identify useful patterns and support business
More informationChapter 13 Business Intelligence and Data Warehouses The Need for Data Analysis Business Intelligence. Objectives
Chapter 13 Business Intelligence and Data Warehouses Objectives In this chapter, you will learn: How business intelligence is a comprehensive framework to support business decision making How operational
More informationAn Improved Apriori Algorithm for Association Rules
Research article An Improved Apriori Algorithm for Association Rules Hassan M. Najadat 1, Mohammed Al-Maolegi 2, Bassam Arkok 3 Computer Science, Jordan University of Science and Technology, Irbid, Jordan
More informationUbiquitous Computing and Communication Journal (ISSN )
A STRATEGY TO COMPROMISE HANDWRITTEN DOCUMENTS PROCESSING AND RETRIEVING USING ASSOCIATION RULES MINING Prof. Dr. Alaa H. AL-Hamami, Amman Arab University for Graduate Studies, Amman, Jordan, 2011. Alaa_hamami@yahoo.com
More informationEnhancing Preprocessing in Data-Intensive Domains using Online-Analytical Processing
Enhancing Preprocessing in Data-Intensive Domains using Online-Analytical Processing Alexander Maedche 1, Andreas Hotho 1, and Markus Wiese 2 1 Institute AIFB, Karlsruhe University, D-76128 Karlsruhe,
More informationData Management Glossary
Data Management Glossary A Access path: The route through a system by which data is found, accessed and retrieved Agile methodology: An approach to software development which takes incremental, iterative
More informationKnowledge Modelling and Management. Part B (9)
Knowledge Modelling and Management Part B (9) Yun-Heh Chen-Burger http://www.aiai.ed.ac.uk/~jessicac/project/kmm 1 A Brief Introduction to Business Intelligence 2 What is Business Intelligence? Business
More informationChapter 6. Foundations of Business Intelligence: Databases and Information Management VIDEO CASES
Chapter 6 Foundations of Business Intelligence: Databases and Information Management VIDEO CASES Case 1a: City of Dubuque Uses Cloud Computing and Sensors to Build a Smarter, Sustainable City Case 1b:
More informationData Science. Data Analyst. Data Scientist. Data Architect
Data Science Data Analyst Data Analysis in Excel Programming in R Introduction to Python/SQL/Tableau Data Visualization in R / Tableau Exploratory Data Analysis Data Scientist Inferential Statistics &
More informationFull file at
Chapter 2 Data Warehousing True-False Questions 1. A real-time, enterprise-level data warehouse combined with a strategy for its use in decision support can leverage data to provide massive financial benefits
More informationTime: 3 hours. Full Marks: 70. The figures in the margin indicate full marks. Answers from all the Groups as directed. Group A.
COPYRIGHT RESERVED End Sem (V) MCA (XXVIII) 2017 Time: 3 hours Full Marks: 70 Candidates are required to give their answers in their own words as far as practicable. The figures in the margin indicate
More information1. INTRODUCTION 2. THE COMPONENTS OFDECISION SUPPORT SYSTEMS
DECISION SUPPORT SYSTEMS PRESENT AND PERSPECTIVE Stanciu Cristina Ofelia Tibiscus University of Timisoara, Faculty of Economics, 1/A Daliei Street, 300558, Timisoara, Romania, Phone: +40-256-202931, E-mail:
More informationAfter completing this course, participants will be able to:
Designing a Business Intelligence Solution by Using Microsoft SQL Server 2008 T h i s f i v e - d a y i n s t r u c t o r - l e d c o u r s e p r o v i d e s i n - d e p t h k n o w l e d g e o n d e s
More informationDepartment of Industrial Engineering. Sharif University of Technology. Operational and enterprises systems. Exciting directions in systems
Department of Industrial Engineering Sharif University of Technology Session# 9 Contents: The role of managers in Information Technology (IT) Organizational Issues Information Technology Operational and
More information9. Conclusions. 9.1 Definition KDD
9. Conclusions Contents of this Chapter 9.1 Course review 9.2 State-of-the-art in KDD 9.3 KDD challenges SFU, CMPT 740, 03-3, Martin Ester 419 9.1 Definition KDD [Fayyad, Piatetsky-Shapiro & Smyth 96]
More informationMOLAP Data Warehouse of a Software Products Servicing Call Center
MOLAP Data Warehouse of a Software Products Servicing Call Center Z. Kazi, B. Radulovic, D. Radovanovic and Lj. Kazi Technical faculty "Mihajlo Pupin" University of Novi Sad Complete Address: Technical
More informationData Warehousing and Decision Support
Data Warehousing and Decision Support Chapter 23, Part A Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke 1 Introduction Increasingly, organizations are analyzing current and historical
More informationDecision Support Systems aka Analytical Systems
Decision Support Systems aka Analytical Systems Decision Support Systems Systems that are used to transform data into information, to manage the organization: OLAP vs OLTP OLTP vs OLAP Transactions Analysis
More informationUML-Based Conceptual Modeling of Pattern-Bases
UML-Based Conceptual Modeling of Pattern-Bases Stefano Rizzi DEIS - University of Bologna Viale Risorgimento, 2 40136 Bologna - Italy srizzi@deis.unibo.it Abstract. The concept of pattern, meant as an
More informationKDD, SEMMA AND CRISP-DM: A PARALLEL OVERVIEW. Ana Azevedo and M.F. Santos
KDD, SEMMA AND CRISP-DM: A PARALLEL OVERVIEW Ana Azevedo and M.F. Santos ABSTRACT In the last years there has been a huge growth and consolidation of the Data Mining field. Some efforts are being done
More information1 DATAWAREHOUSING QUESTIONS by Mausami Sawarkar
1 DATAWAREHOUSING QUESTIONS by Mausami Sawarkar 1) What does the term 'Ad-hoc Analysis' mean? Choice 1 Business analysts use a subset of the data for analysis. Choice 2: Business analysts access the Data
More informationPerformance Analysis of Data Mining Classification Techniques
Performance Analysis of Data Mining Classification Techniques Tejas Mehta 1, Dr. Dhaval Kathiriya 2 Ph.D. Student, School of Computer Science, Dr. Babasaheb Ambedkar Open University, Gujarat, India 1 Principal
More informationData Warehousing. Data Warehousing and Mining. Lecture 8. by Hossen Asiful Mustafa
Data Warehousing Data Warehousing and Mining Lecture 8 by Hossen Asiful Mustafa Databases Databases are developed on the IDEA that DATA is one of the critical materials of the Information Age Information,
More information