DATA MINING


Data Mining software [...] Executive Information System (EIS) [...] Decision Support System [...]

Figure 1: DATA, KNOWLEDGE, DECISION, ACTION

1. [...] data mining [...] user
2. [...] data mining tools [...] user [...] model
3. [...] user

Figure labels: Data; Business knowledge; Business hypothesis; Business modeling (using data mining software); Data mining (analysis); Validation of hypothesis; Decision

[...] DBMS (Database Management System) [...] Data Mining [...] 30 [...] KDD (Knowledge Discovery in Database) [...] Data Mining [...] Philippe Nieuwbourg (CXP Information) [...] Data Mining [...]

1960: [...] file processing
1970: [...] (Relational Database System) [...] (Query Language)
1980: [...] hardware [...] 30 [...]
1990: [...]

[...] Data Warehouse [...] Data Cleansing, Data Integration, On-Line Analytical Processing (OLAP) [...] (data rich but information poor) [...] Data Mining

Data Collection (1960s and earlier)
- primitive file processing

Database management system (1970s)
- network and relational database management system
- data modeling tools, query language

Advanced database management system (1980s - present)
- advanced data model
- object-oriented database management system
- object relational database management system

Data Warehousing & Data mining (1990s - present)

Figure 2: [...]

4 4 Data Mining 1. ก ก ก ก ก ก ก ก ก ก ก ก ก ก 2. ก 3. ก ก ก ก ก ก ก ก ( Data Warehouse) Data Mining ก ก ก Data Mining ก ก ก ก Internet, ก, ก ก ก ก ( Decision Support System) ก ก ก ก ก ก ก ก ก ก ( Operational System ) ( Data Warehouse ) ก ก ก computer Data Mining ก Algorithm ก ก ก computer computer microcomputer ก ก ( PC Cluster ) computer

[...] Data Mining

- Relational Database: [...] entity-relationship (ER) model
- Data Warehouses: [...]
- Transactional Database: [...]
- Advanced Database: [...] object-oriented, text file, [...], web

[...] Data Mining [...] Database Management System (DBMS) [...] DBMS: Oracle, DB2, MS SQL, MS Access [...] Mining [...]

[...] Mining [...] Data Mining

1. Association rule Discovery

[...] Data Mining [...] Market Basket Analysis [...] (Association Rule) [...] Amazon [...]

buys(x, database) -> buys(x, data mining) [80%, 60%]

[...] database [...] data mining [...] 60% [...] database [...] data mining [...] 80% [...] (Virtual store) [...] web [...] (itemset) [...] 5 [...] (query) [...]
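The two percentages attached to an association rule such as buys(x, database) -> buys(x, data mining) are conventionally its support and confidence. The sketch below shows one way these measures could be computed. The function names and the sample baskets are hypothetical and are not taken from the source, and the sample data is not meant to reproduce the 80% and 60% figures.

```python
from typing import FrozenSet, Iterable, List


def support(transactions: List[FrozenSet[str]], itemset: Iterable[str]) -> float:
    """Fraction of transactions that contain every item in itemset."""
    wanted = frozenset(itemset)
    hits = sum(1 for t in transactions if wanted <= t)
    return hits / len(transactions)


def confidence(transactions: List[FrozenSet[str]],
               antecedent: Iterable[str],
               consequent: Iterable[str]) -> float:
    """Estimate P(consequent | antecedent) from the transactions."""
    antecedent = frozenset(antecedent)
    base = support(transactions, antecedent)
    if base == 0:
        return 0.0
    both = support(transactions, antecedent | frozenset(consequent))
    return both / base


# Hypothetical purchase baskets (illustration only).
baskets = [frozenset(b) for b in (
    {"database", "data mining"},
    {"database", "data mining"},
    {"database", "data mining"},
    {"database"},
    {"statistics"},
)]

rule_support = support(baskets, {"database", "data mining"})
rule_confidence = confidence(baskets, {"database"}, {"data mining"})
```

Here support measures how often both books are bought together across all baskets, while confidence measures how often buyers of the first book also buy the second.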

{[...], A}, {[...], B}, {[...], C}, {[...], B} [...] DBMS [...] query [...] SQL:

SELECT P.custid, P.item, SUM(P.qty)
FROM Purchases P
GROUP BY P.custid, P.item
HAVING SUM(P.qty) > 5

[...] DBMS [...] SQL [...] query [...] iceberg query [...]

2. Classification & Prediction

Classification [...] model [...] classification [...] 3 [...]

Figure 3: [...] Classification
- Training Data, Model Construction, Classifier Model
- Testing Data, Model Evaluation, Classifier Model
- Unseen Data, Classification
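The iceberg query above can be tried against a small in-memory table. This is a minimal sketch using Python's standard sqlite3 module. The table layout (custid, item, qty) follows the query, while the sample rows, the function name and the added ORDER BY clause (used only to make the output order predictable) are assumptions.

```python
import sqlite3


def iceberg_query(rows, threshold=5):
    """Return (custid, item, total_qty) groups whose total quantity exceeds threshold."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE Purchases (custid TEXT, item TEXT, qty INTEGER)")
    conn.executemany("INSERT INTO Purchases VALUES (?, ?, ?)", rows)
    result = conn.execute(
        """
        SELECT P.custid, P.item, SUM(P.qty)
        FROM Purchases P
        GROUP BY P.custid, P.item
        HAVING SUM(P.qty) > ?
        ORDER BY P.custid, P.item
        """,
        (threshold,),
    ).fetchall()
    conn.close()
    return result


# Hypothetical purchase records: (customer id, item, quantity).
sample_rows = [
    ("c1", "A", 4), ("c1", "A", 3), ("c1", "B", 1),
    ("c2", "B", 6), ("c2", "C", 2),
]
big_groups = iceberg_query(sample_rows)
```

Only the groups above the threshold are returned, which is why the result is usually much smaller than the number of (customer, item) groups the DBMS has to aggregate.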

- Model Construction (Learning): [...] model [...] (training data) [...] model
  1. [...] (Decision Tree)
  2. [...] (Neural Net)

1) Decision Tree

[...] attribute [...] 50 [...] (Branch), (Staff), (Property), (Client), (Relationship), (Customer_rental), (Rentals), (Sales) [...] 40% [...] 25 [...] 35% [...]

Figure 4: Decision Tree [...]

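Customer renting property > 2 years?
- No: Rent property
- Yes: Customer age > 25 years?
  - No: Rent property
  - Yes: Buy property

Figure 4: Decision Tree [...]

[...] Business_Info [...]

Business_Info
Age   | Rent_Period | Buy
[...] | [...]       | No
[...] | [...]       | No
[...] | [...]       | No
[...] | [...]       | Yes
[...] | [...]       | No
[...] | [...]       | Yes
[...] | [...]       | No
[...] | [...]       | Yes
[...] | [...]       | No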
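The tree in Figure 4 can be read as two nested tests. The following sketch encodes it as a plain function. The function and parameter names are hypothetical, and treating exactly 2 years or exactly 25 years as the "No" branch is an assumption based on the strict ">" in the figure.

```python
def recommend(rent_years: float, age: float) -> str:
    """Follow the Figure 4 decision tree from the root down to a leaf."""
    # Root node: has the customer rented property for more than 2 years?
    if rent_years <= 2:
        return "Rent property"
    # Child node: is the customer older than 25 years?
    if age <= 25:
        return "Rent property"
    return "Buy property"


# Hypothetical customers: (years renting, age).
examples = [(1, 40), (3, 22), (3, 30)]
predictions = [recommend(years, age) for years, age in examples]
```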

SQL [...] Decision Tree [...] 2 [...]

1. SQL [...] root node

SELECT B.Rent_Period, B.Buy, COUNT(*)
FROM Business_Info B
WHERE B.Rent_Period > 2
GROUP BY B.Rent_Period, B.Buy

SQL [...]: Rent_Period, Buy (Yes, No) [...]

SQL [...] node child [...] root

SELECT B.Age, B.Buy, COUNT(*)
FROM Business_Info B
WHERE B.Age > 25
GROUP BY B.Age, B.Buy

SQL [...]: Rent_Period, Buy (Yes, No) [...]

[...] Decision Tree [...] AVC set (Attribute Value, Class label) [...] 2 AVC sets [...]
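An AVC set counts the rows for each (attribute value, class label) pair, and it is the information a tree builder needs to evaluate a split on that attribute. The sketch below builds the AVC sets for Rent_Period and Age with sqlite3. It drops the WHERE filters of the queries above so that all attribute values are counted, adds an ORDER BY for predictable output, and uses made-up rows. All three choices are assumptions, and the helper name avc_set is hypothetical.

```python
import sqlite3

ALLOWED_ATTRIBUTES = ("Age", "Rent_Period")


def avc_set(conn, attribute):
    """Return (attribute value, Buy label, row count) triples for one attribute."""
    # The column name is interpolated into the SQL text, so restrict it
    # to a fixed whitelist instead of accepting arbitrary input.
    if attribute not in ALLOWED_ATTRIBUTES:
        raise ValueError(f"unknown attribute: {attribute}")
    sql = (
        f"SELECT B.{attribute}, B.Buy, COUNT(*) "
        f"FROM Business_Info B "
        f"GROUP BY B.{attribute}, B.Buy "
        f"ORDER BY B.{attribute}, B.Buy"
    )
    return conn.execute(sql).fetchall()


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Business_Info (Age INTEGER, Rent_Period INTEGER, Buy TEXT)")
# Hypothetical training rows: (Age, Rent_Period, Buy).
conn.executemany(
    "INSERT INTO Business_Info VALUES (?, ?, ?)",
    [(30, 3, "Yes"), (30, 3, "Yes"), (22, 3, "No"), (40, 1, "No"), (22, 1, "No")],
)
rent_avc = avc_set(conn, "Rent_Period")
age_avc = avc_set(conn, "Age")
```

Because each AVC set is a single GROUP BY query, the counting can stay inside the DBMS and only the small summary has to reach the mining tool.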

2) [...] (Neural Net)

[...] Artificial Intelligence (AI) [...] (Artificial Neural Networks, ANN) [...] (train) [...] (node) [...] Input [...] Output [...] input layer, output layer, hidden layers [...] layer [...]

Figure 5: [...] Input: "Customer renting property > 2 years?", "Customer age > 25 years?"; Hidden processing layer (0.4 [...]); Output: Class (Rent or buy property)

- Model Evaluation (Accuracy): [...] (testing data) [...] model [...]
- Model Usage (Classification): Model [...] (unseen data) [...] object [...]

Prediction [...]

3. Database clustering (Segmentation)

[...] 3 [...]
1. [...] (> $80,000)
2. [...] ($25,000 to $80,000)
3. [...] (less than $25,000)

[...]
- Have Children
- Married
- Last car is a used car
- Own cars

Figure 6: Clustering [...] Segments: First Segment (High income), Second Segment (Middle income), Third Segment (Low income). Attributes: Have Children, Married, Last car is a used car, Own cars.

4. Deviation Detection

[...] (Visualization) [...]
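The three income segments and their attribute profiles can be illustrated with a small sketch. Note that this simply applies the income thresholds listed above and then summarizes one attribute per segment. It does not run a clustering algorithm, and the customer records, field names and function names are hypothetical.

```python
from collections import defaultdict


def income_segment(income: float) -> str:
    """Assign a customer to one of the three income segments."""
    if income > 80_000:
        return "High income"
    if income >= 25_000:
        return "Middle income"
    return "Low income"


def segment_shares(customers, attribute):
    """For each segment, return the share of customers where attribute is true."""
    totals = defaultdict(int)
    hits = defaultdict(int)
    for customer in customers:
        segment = income_segment(customer["income"])
        totals[segment] += 1
        if customer[attribute]:
            hits[segment] += 1
    return {segment: hits[segment] / totals[segment] for segment in totals}


# Hypothetical customer records (illustration only).
customers = [
    {"income": 90_000, "married": True},
    {"income": 100_000, "married": False},
    {"income": 50_000, "married": True},
    {"income": 10_000, "married": False},
]
married_share = segment_shares(customers, "married")
```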

5. Link Analysis

Link Analysis [...] link [...] associations [...] record [...] link analysis [...] 3 [...]
- associations discovery
- sequential pattern discovery
- similar time sequence discovery

Data Mining [...] Data Warehouse

[...] Data Mining [...] mining [...] Data mining [...] Data warehouse [...] mining [...]
- Data warehouse [...] mining [...] predictive models
- Data warehouse [...] mining [...] Data warehouse [...]
- [...] record [...] fields [...] Data mining [...] query [...] Data warehouse
- [...] Data mining [...] Data warehouse [...]

[...] Data Mining

14 ก ก Data Mining ก ก ก ก Midas ก ก ก ก ก ก ก ก Bouygues Telecom ก ก ก ก ก ก ก ก ก ก ก ก ก ก France Telecom ก ก ก ก ก ก ก Data Mining ก ก - ก ก ก ก ก - ก ก - ก ก ก ก - ก ก ก ก - ก ก ก ก 6 ก ก - 70 % ก ก ก ก ก - ก - ก Text Mining ก Data Mining ก SDP Infoware ก ก ก e-commerce 14

15 15 - ก ก ก ก web ก - ก web site web ก กก ก ก Data Mining ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก (Mining Objective) ก กก ก ก ก ก ก ก 10 ก ก ก ก ก ก ก (knowledge) กก ก ก ก ก ก ก 2 ID Sex Address SchoolGPA Major GPA 1 ก 86/ /

16 16 ก 1 ก ก ก ก ID Subject Section Term Year Grade C D B+ 2 ก ก 2 ก ก ก ก ก ก 1. ก (data cleaning) ก ก ก ก ก ก ก ก ก 1 ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก 1 ก ก ก ก ก ก ก ก

[...] (NULL) [...] 2 [...] Grade [...] Bangkok, Non-Bangkok [...] (Binning Data) [...] 10 [...] {A, B+, B, C+, C, D+, D, F, W, I} [...] 3 [...]: {A, B+, B} = High, {C+, C} = Medium, {D+, D, F, W, I} = Low

[...] 1 [...]
- [...]
- [...] 2 [...] 6 [...] School = 0 [...] 6 [...] School = 1
- [...] 1 [...]

Table 3
ID    | Sex    | Term        | School | Major | GPA
1     | Female | Bangkok     | 1      | ELEC  | [...]
[...] | Male   | Non-Bangkok | 0      | CIVIL | 3.2
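The grade binning described above maps ten grade symbols onto three levels, and addresses onto Bangkok or Non-Bangkok. A minimal sketch of both mappings follows. The function names are hypothetical, and comparing the address against the literal string "bangkok" is an assumption about how the original address field is stored.

```python
# Bins taken from the text: ten grade symbols reduced to three levels.
GRADE_BINS = {
    "High": {"A", "B+", "B"},
    "Medium": {"C+", "C"},
    "Low": {"D+", "D", "F", "W", "I"},
}


def bin_grade(grade: str) -> str:
    """Map a letter grade to High, Medium or Low."""
    for label, grades in GRADE_BINS.items():
        if grade in grades:
            return label
    raise ValueError(f"unknown grade: {grade!r}")


def bin_address(province: str) -> str:
    """Collapse an address value into Bangkok or Non-Bangkok (assumed rule)."""
    return "Bangkok" if province.strip().lower() == "bangkok" else "Non-Bangkok"


# Hypothetical raw values (illustration only).
binned_grades = [bin_grade(g) for g in ("C", "D", "B+")]
binned_addresses = [bin_address(a) for a in ("Bangkok", "Chiang Mai")]
```

Reducing the number of distinct values in this way gives each remaining value more supporting records, which tends to make the mined patterns easier to read.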

18 18 ก 2 ก ก - ก ก ก - ก ก ก ก กก 2 4 ID Subject Term Year Grade Meduim Low High 4 ก 2. ก ก (Data Selection) ก - ก ก ก ก 10 ก ก ก ก ก ก ก ก - ก ก 6 ก ก ก ก ก ก ก ก ก ก ก 6 ก 6 ก ก ก ก ก ก ก ก ก 3. ก (Data Transformation)

19 19 ก 4 ก ก ก ก ก ก 3 4 ก ก ก 5 ID Sex Address Major GPA 1 Male Bangkok Meduim Low ELEC Female Non-Bangkok High High CIVIL ก 5 ก ก Web Mining ก ก Application Data Mining ก Web Mining Web Data Mining ก Data Mining e-commerce e-research ก ก ( Mass market technology ) Web mining ก ก ก Web Mining ก ก กก Data Mining ก ก ก Web Mining ก web ก ก ก Data Mining ก

20 ก ก web ก ก ก ก Web Mining ก ก ก ก web web ก ก ก ก ก ก Web Mining ก ก web ก Web Mining ก ก web ก ก ก ก (Text ) ก ก กก ก ก ก web ก ก Web Mining ก ก ก ก ก Links web web page ก ก ก Search Engines ก ก ก ก ก Link ก web ก ก ก ก ก ก ก ก ( Image ) ก ก Search Engines ก ก Web Mining ก ก ก e-commerce ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก กก ก log file ก ก web กก ก ก (user tracking ) web ก ก กก ก ก ก ก ก ก 3 ก 1. Demographics ก ก ก web ก ก 20

2. Psychographics [...] web
3. Technographics [...]

[...] 3 [...] Web Mining [...] Data Mining [...] Data Mining [...] (pattern recognition technology) [...] (machine learning & genetic algorithms) [...] (Neural & Polynomial networks) [...] Data Mining [...] (Who) [...] (What), (Where), (When) [...] (Why) [...] Data Mining [...] (Data warehouse), (Data marts), (Customer information file) [...]

22 ก ก (electronic commerce) ก ก Data Mining ก ก ก ก ก ก ก ก ก ก ก ก ก Data Mining ก ก ก ก ก ก ก ( ก ) ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก กก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก กก ก ก กก (Site cookie file) ก (Registration form) ก (Web site traffic report) ก ก ก ก ก ก ก ก ก ก Accure sight, DoubleClick NetCount ก ก web tracking software ก / ก ก ก ก ก ก ก ก ก IP address ก ก IP address ก ก ก ก ก ก ก IP address dynamic Ips ก ก ก proxy server IP address ก ก IP address ก กก 22

23 กก ก ก (web browser) ก ก ก ก ก กก (cookies) ก (serail number) ก ก ก ก กก ก ก ก กก ก ก กก ก ก ก ก ก ก ก ก กก ก ก ก ก ก! ก ก ก ก ก ก ก ก - กก ก ก ก ก ก กก ก ก ก ก ก กก ( ก ก Netcape 3.0 ก ก ก Option ก Protocol ก Networked Preferences ก ก Show An Alert Before Accepting A Cookie) ก ก กก ก ก ก กก ก ก ก ก ก ก dynamic Ips Proxy server ก กก ก ก ก ก (ISPs) ก ก ก ก กก proxy server ก.netcom.ibm.byu.aol ก ก ก ก ก (Registry form) ก ก ก ก ก ก ก ก ก ก ก ก ก 23

24 24 ก ก ก ก ก ก ก ก ก ก 5%-10% ก ก กก ก ก ก ก ก ก ก ก ก ก ก ก ก ก กก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก กก ก ก ก ก ก ( address) ก ก ก ก ก ก ก ก ก ก ก ก ก ( ก ก ก ก ) ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก Site-tracking software

25 ก ก ก ก กก ก ก ก ก (Log file) ก ก ก ก ก ก ก ก ก Internet Protocaol (IP) ก ก ก ก ก( ก ก IP address ก ก ก ) ก Accure Insight ก ก Accure ก ก (several databases) ก (single entiry report) ก ก ก ก ก ก ก ก ก ก ก กก ก ก ( ก ก Accure Insight ก hrad Count netcount ก ก ) ก ก ก Customer loyalty ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก 25

26 ก ก ก กก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก CACI Marketing System ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก กก ก ก ก (ZIP code) 55114, 01103, ก ก ก ก ก ก ก ก ก, ก ก ก ก ก, ก ก กก ก ก ก ก ก ก ก ก ก ก 30% ก ก 30% ก ก ก กก 20% ก ก ก ก ก ก ก ก ก (lifestyle) ก ก ก ก ก ก ก,,, ก ก ก ก ก ก Data Mining ก ก ก ก ก ก ก ก กก ก ก Data Mining ก ก ก ก (pattern) ก (cluster) ก ก ก ก ก ก ก ก ก ก 26

[...] (loyalty) [...] Data Mining [...] 2 [...] (statistics), (Artificial Intelligence) [...] (machine-learning) [...] Data Mining [...] database query program, report generators, statistical packages [...] Data Mining [...]:
[...] $1500-$3300 [...] $900 [...]

Data Mining [...] CART, CHAID, ID3, C4.5 [...] c [...] n [...] Data mining [...]

[...]

IF ZIP CODE ( ) AND REGISTRATION NO. ( ) AND USER ID NO. (4566jk dl01)
THEN /WEBSELL/PRODUCT8.HTM 87%

[...] Data Mining [...] (Reg NO) [...] 87% [...] Data mining [...] product8.htm [...] Data mining [...] product8.htm [...] product5.htm, order_form.htm [...] order_form.htm [...] product8.htm [...] 83% [...] order_form.htm:

IF ZIP CODE ( ) AND REGISTRATION NO. ( ) AND PRIOR PAGE /WEBSELL/PRODUCT8.HTM
THEN /WEBSELL/PRODUCT5.HTM 73%

IF ZIP CODE ( ) AND REGISTRATION NO. ( ) AND PRIOR PAGE /WEBSELL/PRODUCT8.HTM
THEN /WEBSELL/ORDER_FORM.HTM 83%

[...]
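Rules of the form "IF PRIOR PAGE X THEN page Y, n%" can be estimated from logged page sequences. The sketch below counts, for one prior page, how often each page is requested next. It ignores the ZIP code and registration conditions of the rules above, and the session data and function name are hypothetical, so it illustrates the idea rather than the tool's actual method.

```python
from collections import Counter


def next_page_confidence(sessions, prior_page):
    """Share of visits to prior_page that are immediately followed by each page."""
    follow = Counter()
    total = 0
    for pages in sessions:
        # Walk consecutive (current, next) page pairs within one session.
        for current, following in zip(pages, pages[1:]):
            if current == prior_page:
                follow[following] += 1
                total += 1
    if total == 0:
        return {}
    return {page: count / total for page, count in follow.items()}


# Hypothetical click sequences, one list per visitor session.
sessions = [
    ["/websell/product8.htm", "/websell/order_form.htm"],
    ["/websell/product8.htm", "/websell/product5.htm"],
    ["/websell/product8.htm", "/websell/order_form.htm"],
    ["/websell/index.htm", "/websell/product8.htm", "/websell/order_form.htm"],
]
rule_strength = next_page_confidence(sessions, "/websell/product8.htm")
```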

29 Data mining ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ( ก ก ก ก ) ก ก Rulebased tools ก Data mining ก ก ก ก ก ก ก,, ก ก, ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก ก Data mining กก ก ก ก ก ก ก ก ก ก ก ก ก Data Mining ก ก ก ก ก ก ก ก (Customer loyality) 29

Demo

Demo Software [...] Data Mining: Isoft Alice

Alice version 6 [...] Decision Tree [...] demo [...] field [...] field [...] Success

[...]
- Relational Database
- SPSS, SAS
- OLE DB and OLE DB for OLAP
- Microsoft Access
- Microsoft Excel

[...] field [...] Alice [...] decision tree

root node [...] 188,400 [...] 2 [...] Housing [...]


More information

Chapter 4 Data Mining A Short Introduction

Chapter 4 Data Mining A Short Introduction Chapter 4 Data Mining A Short Introduction Data Mining - 1 1 Today's Question 1. Data Mining Overview 2. Association Rule Mining 3. Clustering 4. Classification Data Mining - 2 2 1. Data Mining Overview

More information

Data Mining & Data Warehouse

Data Mining & Data Warehouse Data Mining & Data Warehouse Associate Professor Dr. Raed Ibraheem Hamed University of Human Development, College of Science and Technology (1) 2016 2017 1 Points to Cover Why Do We Need Data Warehouses?

More information

R07. FirstRanker. 7. a) What is text mining? Describe about basic measures for text retrieval. b) Briefly describe document cluster analysis.

R07. FirstRanker. 7. a) What is text mining? Describe about basic measures for text retrieval. b) Briefly describe document cluster analysis. www..com www..com Set No.1 1. a) What is data mining? Briefly explain the Knowledge discovery process. b) Explain the three-tier data warehouse architecture. 2. a) With an example, describe any two schema

More information

Web Usage Mining from Bing Liu. Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data, Springer Chapter written by Bamshad Mobasher

Web Usage Mining from Bing Liu. Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data, Springer Chapter written by Bamshad Mobasher Web Usage Mining from Bing Liu. Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data, Springer Chapter written by Bamshad Mobasher Many slides are from a tutorial given by B. Berendt, B. Mobasher,

More information

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing.

This tutorial will help computer science graduates to understand the basic-to-advanced concepts related to data warehousing. About the Tutorial A data warehouse is constructed by integrating data from multiple heterogeneous sources. It supports analytical reporting, structured and/or ad hoc queries and decision making. This

More information

DATA MINING - 1DL105, 1DL111

DATA MINING - 1DL105, 1DL111 1 DATA MINING - 1DL105, 1DL111 Fall 2007 An introductory class in data mining http://user.it.uu.se/~udbl/dut-ht2007/ alt. http://www.it.uu.se/edu/course/homepage/infoutv/ht07 Kjell Orsborn Uppsala Database

More information

Managing Data Resources

Managing Data Resources Chapter 7 Managing Data Resources 7.1 2006 by Prentice Hall OBJECTIVES Describe basic file organization concepts and the problems of managing data resources in a traditional file environment Describe how

More information

Introduction to Data Mining S L I D E S B Y : S H R E E J A S W A L

Introduction to Data Mining S L I D E S B Y : S H R E E J A S W A L Introduction to Data Mining S L I D E S B Y : S H R E E J A S W A L Books 2 Which Chapter from which Text Book? Chapter 1: Introduction from Han, Kamber, "Data Mining Concepts and Techniques", Morgan Kaufmann

More information

SIDDHARTH GROUP OF INSTITUTIONS :: PUTTUR Siddharth Nagar, Narayanavanam Road QUESTION BANK (DESCRIPTIVE)

SIDDHARTH GROUP OF INSTITUTIONS :: PUTTUR Siddharth Nagar, Narayanavanam Road QUESTION BANK (DESCRIPTIVE) SIDDHARTH GROUP OF INSTITUTIONS :: PUTTUR Siddharth Nagar, Narayanavanam Road 517583 QUESTION BANK (DESCRIPTIVE) Subject with Code : Data Warehousing and Mining (16MC815) Year & Sem: II-MCA & I-Sem Course

More information

IJMIE Volume 2, Issue 9 ISSN:

IJMIE Volume 2, Issue 9 ISSN: WEB USAGE MINING: LEARNER CENTRIC APPROACH FOR E-BUSINESS APPLICATIONS B. NAVEENA DEVI* Abstract Emerging of web has put forward a great deal of challenges to web researchers for web based information

More information

DATA MINING II - 1DL460

DATA MINING II - 1DL460 DATA MINING II - 1DL460 Spring 2016 A second course in data mining http://www.it.uu.se/edu/course/homepage/infoutv2/vt16 Kjell Orsborn Uppsala Database Laboratory Department of Information Technology,

More information

DR. JIVRAJ MEHTA INSTITUTE OF TECHNOLOGY

DR. JIVRAJ MEHTA INSTITUTE OF TECHNOLOGY DR. JIVRAJ MEHTA INSTITUTE OF TECHNOLOGY Subject Name: - DISTRIBUTED SYSTEMS Semester :- 8 th Subject Code: -180701 Branch :- Computer Science & Engineering Department :- Computer Science & Engineering

More information

emetrics Study Llew Mason, Zijian Zheng, Ron Kohavi, Brian Frasca Blue Martini Software {lmason, zijian, ronnyk,

emetrics Study Llew Mason, Zijian Zheng, Ron Kohavi, Brian Frasca Blue Martini Software {lmason, zijian, ronnyk, emetrics Study Llew Mason, Zijian Zheng, Ron Kohavi, Brian Frasca Blue Martini Software {lmason, zijian, ronnyk, brianf}@bluemartini.com December 5 th 2001 2001 Blue Martini Software 1. Introduction Managers

More information

Foundation of Data Mining: Introduction

Foundation of Data Mining: Introduction Foundation of Data Mining: Introduction Hillol Kargupta CSEE Department, UMBC hillol@cs.umbc.edu ITE 342, (410) 455-3972 www.cs.umbc.edu/~hillol Acknowledgement: Tan, Steinbach, and Kumar provided some

More information

Information Systems and Networks

Information Systems and Networks Information Systems and Networks by Samuel Rota Bulò Department of Management Università Ca' Foscari Venezia Lesson 5 Databased and Information Management Case study: RR Donnelley giant commercial printing

More information

Data Mining Techniques Methods Algorithms and Tools

Data Mining Techniques Methods Algorithms and Tools Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology ISSN 2320 088X IMPACT FACTOR: 6.017 IJCSMC,

More information

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING SHRI ANGALAMMAN COLLEGE OF ENGINEERING & TECHNOLOGY (An ISO 9001:2008 Certified Institution) SIRUGANOOR,TRICHY-621105. DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING Year / Semester: IV/VII CS1011-DATA

More information

A Review Paper on Web Usage Mining and Pattern Discovery

A Review Paper on Web Usage Mining and Pattern Discovery A Review Paper on Web Usage Mining and Pattern Discovery 1 RACHIT ADHVARYU 1 Student M.E CSE, B. H. Gardi Vidyapith, Rajkot, Gujarat, India. ABSTRACT: - Web Technology is evolving very fast and Internet

More information

What is Data Mining? Data Mining. Data Mining Architecture. Illustrative Applications. Pharmaceutical Industry. Pharmaceutical Industry

What is Data Mining? Data Mining. Data Mining Architecture. Illustrative Applications. Pharmaceutical Industry. Pharmaceutical Industry Data Mining Andrew Kusiak Intelligent Systems Laboratory 2139 Seamans Center The University of Iowa Iowa City, IA 52242-1527 andrew-kusiak@uiowa.edu http://www.icaen.uiowa.edu/~ankusiak Tel. 319-335 5934

More information

INTRODUCTION TO BIG DATA, DATA MINING, AND MACHINE LEARNING

INTRODUCTION TO BIG DATA, DATA MINING, AND MACHINE LEARNING CS 7265 BIG DATA ANALYTICS INTRODUCTION TO BIG DATA, DATA MINING, AND MACHINE LEARNING * Some contents are adapted from Dr. Hung Huang and Dr. Chengkai Li at UT Arlington Mingon Kang, PhD Computer Science,

More information

Department of Industrial Engineering. Sharif University of Technology. Operational and enterprises systems. Exciting directions in systems

Department of Industrial Engineering. Sharif University of Technology. Operational and enterprises systems. Exciting directions in systems Department of Industrial Engineering Sharif University of Technology Session# 9 Contents: The role of managers in Information Technology (IT) Organizational Issues Information Technology Operational and

More information

What is Data Mining? Data Mining. Data Mining Architecture. Illustrative Applications. Pharmaceutical Industry. Pharmaceutical Industry

What is Data Mining? Data Mining. Data Mining Architecture. Illustrative Applications. Pharmaceutical Industry. Pharmaceutical Industry Data Mining Andrew Kusiak Intelligent Systems Laboratory 2139 Seamans Center The University it of Iowa Iowa City, IA 52242-1527 andrew-kusiak@uiowa.edu http://www.icaen.uiowa.edu/~ankusiak Tel. 319-335

More information

Contents. Foreword to Second Edition. Acknowledgments About the Authors

Contents. Foreword to Second Edition. Acknowledgments About the Authors Contents Foreword xix Foreword to Second Edition xxi Preface xxiii Acknowledgments About the Authors xxxi xxxv Chapter 1 Introduction 1 1.1 Why Data Mining? 1 1.1.1 Moving toward the Information Age 1

More information

Data Set. What is Data Mining? Data Mining (Big Data Analytics) Illustrative Applications. What is Knowledge Discovery?

Data Set. What is Data Mining? Data Mining (Big Data Analytics) Illustrative Applications. What is Knowledge Discovery? Data Mining (Big Data Analytics) Andrew Kusiak Intelligent Systems Laboratory 2139 Seamans Center The University of Iowa Iowa City, IA 52242-1527 andrew-kusiak@uiowa.edu http://user.engineering.uiowa.edu/~ankusiak/

More information

Foundations of Business Intelligence: Databases and Information Management

Foundations of Business Intelligence: Databases and Information Management Foundations of Business Intelligence: Databases and Information Management TOPIC 1: Foundations of Business Intelligence: Databases and Information Management TOPIC 1: Foundations of Business Intelligence:

More information

Data Mining. Introduction. Hamid Beigy. Sharif University of Technology. Fall 1395

Data Mining. Introduction. Hamid Beigy. Sharif University of Technology. Fall 1395 Data Mining Introduction Hamid Beigy Sharif University of Technology Fall 1395 Hamid Beigy (Sharif University of Technology) Data Mining Fall 1395 1 / 21 Table of contents 1 Introduction 2 Data mining

More information

VALLIAMMAI ENGNIEERING COLLEGE SRM Nagar, Kattankulathur 603203. DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING Year & Semester : III & VI Section : CSE - 2 Subject Code : IT6702 Subject Name : Data warehousing

More information

WKU-MIS-B10 Data Management: Warehousing, Analyzing, Mining, and Visualization. Management Information Systems

WKU-MIS-B10 Data Management: Warehousing, Analyzing, Mining, and Visualization. Management Information Systems Management Information Systems Management Information Systems B10. Data Management: Warehousing, Analyzing, Mining, and Visualization Code: 166137-01+02 Course: Management Information Systems Period: Spring

More information

Business Intelligence Roadmap HDT923 Three Days

Business Intelligence Roadmap HDT923 Three Days Three Days Prerequisites Students should have experience with any relational database management system as well as experience with data warehouses and star schemas. It would be helpful if students are

More information

DATA MINING II - 1DL460. Spring 2014"

DATA MINING II - 1DL460. Spring 2014 DATA MINING II - 1DL460 Spring 2014" A second course in data mining http://www.it.uu.se/edu/course/homepage/infoutv2/vt14 Kjell Orsborn Uppsala Database Laboratory Department of Information Technology,

More information

Data Mining. Introduction. Hamid Beigy. Sharif University of Technology. Fall 1394

Data Mining. Introduction. Hamid Beigy. Sharif University of Technology. Fall 1394 Data Mining Introduction Hamid Beigy Sharif University of Technology Fall 1394 Hamid Beigy (Sharif University of Technology) Data Mining Fall 1394 1 / 20 Table of contents 1 Introduction 2 Data mining

More information

DATA WAREHOUSING AND MINING UNIT-V TWO MARK QUESTIONS WITH ANSWERS

DATA WAREHOUSING AND MINING UNIT-V TWO MARK QUESTIONS WITH ANSWERS DATA WAREHOUSING AND MINING UNIT-V TWO MARK QUESTIONS WITH ANSWERS 1. NAME SOME SPECIFIC APPLICATION ORIENTED DATABASES. Spatial databases, Time-series databases, Text databases and multimedia databases.

More information

Data Mining Concepts. Duen Horng (Polo) Chau Assistant Professor Associate Director, MS Analytics Georgia Tech

Data Mining Concepts. Duen Horng (Polo) Chau Assistant Professor Associate Director, MS Analytics Georgia Tech http://poloclub.gatech.edu/cse6242 CSE6242 / CX4242: Data & Visual Analytics Data Mining Concepts Duen Horng (Polo) Chau Assistant Professor Associate Director, MS Analytics Georgia Tech Partly based on

More information

DATA MINING II - 1DL460

DATA MINING II - 1DL460 DATA MINING II - 1DL460 Spring 2012 A second course in data mining!! http://www.it.uu.se/edu/course/homepage/infoutv2/vt12 Kjell Orsborn! Uppsala Database Laboratory! Department of Information Technology,

More information

CompSci 516 Data Intensive Computing Systems

CompSci 516 Data Intensive Computing Systems CompSci 516 Data Intensive Computing Systems Lecture 20 Data Mining and Mining Association Rules Instructor: Sudeepa Roy CompSci 516: Data Intensive Computing Systems 1 Reading Material Optional Reading:

More information

Efficient integration of data mining techniques in DBMSs

Efficient integration of data mining techniques in DBMSs Efficient integration of data mining techniques in DBMSs Fadila Bentayeb Jérôme Darmont Cédric Udréa ERIC, University of Lyon 2 5 avenue Pierre Mendès-France 69676 Bron Cedex, FRANCE {bentayeb jdarmont

More information

Data Mining & Machine Learning

Data Mining & Machine Learning Data Mining & Machine Learning Dino Pedreschi & Anna Monreale Dipartimento di Infomatica Tutor: Riccardo Guidotti, Dipartimento di Informatica DIPARTIMENTO DI INFORMATICA - Università di Pisa Data Mining

More information

DATA MINING TRANSACTION

DATA MINING TRANSACTION DATA MINING Data Mining is the process of extracting patterns from data. Data mining is seen as an increasingly important tool by modern business to transform data into an informational advantage. It is

More information

Managing Data Resources

Managing Data Resources Chapter 7 OBJECTIVES Describe basic file organization concepts and the problems of managing data resources in a traditional file environment Managing Data Resources Describe how a database management system

More information

Chapter 3 Process of Web Usage Mining

Chapter 3 Process of Web Usage Mining Chapter 3 Process of Web Usage Mining 3.1 Introduction Users interact frequently with different web sites and can access plenty of information on WWW. The World Wide Web is growing continuously and huge

More information