Log Information Mining Using Association Rules Technique: A Case Study Of Utusan Education Portal
|
|
- Elijah Dennis
- 5 years ago
- Views:
Transcription
1 Log Information Mining Using Association Rules Technique: A Case Study Of Utusan Education Portal Mohd Helmy Ab Wahab 1, Azizul Azhar Ramli 2, Nureize Arbaiy 3, Zurinah Suradi 4 1 Faculty of Electrical and Electronic Engineering Kolej Universiti Teknologi Tun Hussein Onn(KUiTTHO) Parit Raja, Batu Pahat, Johor. Tel: ext. 1230, Fax: , helmy@kuittho.edu.my, 2,3,4 Faculty of Information Technology and Multimedia Kolej Universiti Teknologi Tun Hussein Onn(KUiTTHO) Parit Raja, Batu Pahat, Johor. Tel: ext. 8056, Fax: , azizulr@kuittho.edu.my, nureize@kuittho.edu.m, zurinah@kuitthoedu.my Abstract The advent of the World Wide Web has caused an explosive growth in its size and usage. The web itself provides a rich and surprising data mining source. Hence, it made necessary for Web site administrators to track and analyze the navigation patterns of Web site visitors. However, data mining techniques are not easily apply to Web data due to problems both related with the technology underlying the Web and the lack of standards in the design and implementation of Web pages. Information collected by the Web servers are kept in the server log. This is the main source of data for analyzing user navigation patterns. Once logs have been pre-processed and sessions have been obtained, there are several kinds of access pattern mining that can be performed depending on the needs of the analyst. In this study, a Data Mining technique known as Association Rules was used in order to get some insights into website usage pattern. For the purpose of this study, server logs from Utusan Education portal were retrieved, pre-processed and analyzed. Keywords: Web Mining, Web Usage Mining, Generalized Association Rules, Tutor.com, Server Logs Introduction Finding useful patterns in data is known by different names including data mining. It is a process of discovering various models, summaries, and derived values from a collection of large data. Data mining is a process of extracting large data and used to determine relationships using Artificial Intelligence (AI) or statistical techniques [7]. With this useful information it can be utilized to guide professional in making decision and do scientific research [2]. Data mining also can be applied in the various field including computational linguistics, statistics, informatics, artificial intelligence and knowledge discovery. Moreover, data mining can also be used to discover and analyze useful information from the Web [9]. One of the data mining techniques is web mining. According to [8], Web Mining is the use of data mining techniques to automatically discover and extract information from Web documents. It consists of Web Content Mining, Web Structure Mining and Web Usage Mining [12]. Refer to Fig 1. Figure 1 Taxonomy of Web Mining Web Content Mining involves mining web data contents [10] ranging from the HTML based document and XML-based documents accumulated in the web servers. The goal of Web Content Mining is mainly to assist or to improve information finding or filtering the information. In contrast, Web Structure Mining tries to discover the model that based on the hyperlinks topology and document structure on the Web. Web Structure Mining aims to generate structural summary about web sites and web pages. In turn, we will discover the structure information of the Web. On the other hand, Web Usage Mining focuses on discovery of user access patterns from web server logs. Web server logs 712
2 generate log files which records web server activity. [3] suggested that web usage mining includes data related to the usage of the pages of a website such as IP address, page references and the date and time access. One of data mining techniques that are commonly used in web mining is Association Rules. In brief, an association rule is an expression X => Y, where X and Y are sets of items. The meaning of such rules is quite intuitive: Given a database D of transactions where each transaction T 'LVD set of items, X => Y expresses that whenever a transaction T contains X than T probably contains Y also. The probability or rule confidence is defined as the percentage of transactions containing Y in addition to X with regard to the overall number of transactions containing X. Since its introduction in 1993 [1] the task of Association Rule mining has received a great deal of attention. As we enter the age of Web technology, the amount of information available on the World Wide Web (WWW) has rapidly increased [11]. In generals, for a particular website, more than hundred thousand of users will be access it. This server transactions activity will be recorded in a server log files. Log files produced by the web servers are in the form of text files. This file consists of HTTP transactions that processed by hash coding URLs, IPs, client environment and cookies. The Web logs provide information to ensure adequate bandwidth and server capacity on their organizations website. Log file data can offers valuable insight of web site usage among users. It reflects actual usage in natural working condition, compared to the artificial setting of a usability lab. Log data file represents the activity of many users, over potentially long period of time, compared to a limited number of users for an hour or two each. The general objective of this project is to determine user browsing and access pattern using Association Rules. Specifically, the objectives of this project are 1. To determine and discover the user access pattern from Education portal of Utusan server. 2. To analyze the usage patterns output and user behaviours of Utusan Education portal from the Web usage mining implementation process. Method Figure 2 Flow Chart of Log File Creation Fig. 2 shows how a system administrator can gather information from the server logs. Basically, when a user sends queries to the server, the requested information will be retrieved from the database. At the same time, the user session including the URL, Client s IP address, accessing date and time, query stem will be recorded in the server logs. Thus, this server logs will be preprocessed and mined in order to get some insight into the usage of a server site as well as the users behaviour. In this study, the methodology used is adopted from [13]. Since the log files are continuously generated, it contains of large amounts of log data. Apart from that it also required large storage device. Therefore, analyzing and exploring regularities in web log records can enhance the quality and delivery of Internet information services to the end user, and improve Web server system performance as well. Web mining application in education field is not new. The previous research proposed the beyond usage mining to consider the content of the pages that have been visited.in the web-based learning environment, both learner s browsing behaviours and course content are important to derive learner s learning levels, intention, goals and interest. Incorporating course content can aid in an understanding of learners browsing habits. In particular, understanding the learners browsing behaviors can facilitate the course content personalization. Log File Figure 3 Association Rules The log file was retrieved from the portal server to be analyzed. However, one of the main problems encountered when dealing with the log files is the amount of data needs to be pre-processed. A sample of a single entry log file contains the following information: :00: CSLNTSVR GET /tutor/include/style03.css HTTP/1.1www.tutor.com.myMozilla/4.0+(compatible;+MSI 713
3 E+5.5;+Windows+98;+Win+9x+4.90)ASPSESSIONIDCST SBQDC=NBKBCPIBBJHCMMFIKMLNNKFD;+browser= done;+aspsessionidaqrrcqcc=lbdgbpibdfcok HMLHEHNKFBN For this particular session, there are 19 attributes identified from the log file. For this study, the attributes such as cookies, hostname, server IP will be removed. Pre-processing Pre-processing is a task of converting the usage, content, and structure information from the data sources into data abstractions necessary for pattern discovery. Before data mining algorithm can be applied, data pre-processing must be performed to convert the raw data into data abstraction necessary for the further processing say Pattern Analysis. This process involves data extraction and data cleaning tasks. Pattern Mining - Association Rule In the Web domain, the pages, which are most often referenced, can be put in one single server session by applying the association rule generation. Association Rule mining techniques can be used to discover unordered correlation between items found in a database of transactions [5]. [6] pointed that in the term of the Web usage mining, the association rules refer to sets of pages that are accessed together with a support value exceeding some specified threshold. The support is the percentage of the transactions that contain a given pattern. Results The Web Usage Mining task for Utusan Education Portal is divided into two main stages. Each stage has its own phases with certain sub activities or tasks. The first stage includes data retrieving and pre-processing. The second stage involves Pattern Discovery where Association Rules are applied. General Pattern Analysis Results (access pattern and users behaviors descriptive statistic) The Utusan Education portal has several options (menu, content, dunia, info, etuisyen, banksoalan and interaktif) that can be accessed by the users. Based on the Universal Resource Locator (URL) stem, the users accessed the portal options host named Figure 4 10 Most Requested Directories in Tutor.com Portal Association Rules Results (support and confidence of the different options) Fig. 4 shows that, /index path is the most requested page and it followed by the /content path. The /index path is the top level for Tutor.com portal and it displays the general information about the etuisyen, banksoalan, interaktif, bilikguru, komuniti and petalaman options. With /index path and other options path, user also can select other options that provided by Utusan Education portal. Association Rules is used to extract rules in the form of X Y (if X then Y) quantified with a confidence (proportion of occurrences that verifies Y among occurrences that verifies X) and a support (proportion of occurrences that verifies X and Y among all occurrences). Fig. 5, shows the Association Rules, including support and confidence by applying Apriori algorithm for identifying the patterns, defining a threshold of 15% for the minimum support and a threshold of 70% for the minimum confidence. /banksoalan <- /index /content (17.4%, 73.6%) /content <- /index /etuisyen (16.9%, 87.4%) /info <- /dunia /content (16.4%, 74.7%) /interaktif <- /banksoalan /index (17.8%, 85.0%) /etuisyen <- /banksoalan /content (17.0%, 99.0%) /info <- /contentrm /etuisyen1 (15.6%, 84.7%) /dunia <- /index /content (15.9%, 71.8%) /index <- /f_menu /dunia (19.6%, 75.3%) /banksoalan<- /etuisyen /content (17.6%, 83.8%) /interaktif <- /index /etuisyen1 (15.7%, 96.4%) Figure 5 Output for Tutor.com Portal Options Association Rules (Related Pages) Fig. 5 also shows the pages that are related to each other where the most frequent options were being selected during the certain options requested. 714
4 Based on the Fig. 5 above, it can be concluded that the rule with higher support (19.6%) means, if in a session the user selected the /f_menu and /dunia options path, user also selected the /index option path ; the rule with higher confidence (99%) says that if in a session, the user selected the /banksoalan and /content option path, user also selected the /etuisyen option path. Fig. 6 represents a graphical chart for the 6 most accepted rules for options relationship. Figure 8 The 6 Most Accepted Rules for Tutor.com Education Options Association Hyperedges (orderly archived) Discussion Figure 6 The 6 Most Accepted Rules for Tutor.com Education Portal (related options) Fig. 7 below shows the Association Hyperedges for Tutor.com Education Portal that illustrates the portal pages those orderly archived. A threshold of 10% for the minimum support and a threshold of 75% for the minimum confidence are being used. It shows that support of 13.7% is the high percentage of transactions that contain all items appearing in the hyperedge, that is in the /index /content /f_menu with the percentages of confidence is 87.5%. The confidence of 97.5% with 10.2% of support is on /banksoalan /interaktif /etuisyen /content is represent the highest of average confidence of all rules that can be formed using the items in the hyperedge. /banksoalan /interaktif /index (10.8%, 90.4%) /index /content /f_menu (13.7%, 87.5%) /etuisyen /banksoalan /content (10.6%, 92.8%) /interaktif /index /content (11.8%, 76.9%) /dunia /info /content (10.4%, 85.0%) /content /banksoalan /index (10.0%, 79.4%) /etuisyen /etuisyen1 /index (11.8%, 87.0%) /interaktif /banksoalan /etuisyen /f_menu (10.7%, 78.4%) /banksoalan /etuisyen /content1 /content (10.5%, 90.7%) /banksoalan /interaktif /etuisyen /content (10.2%, 97.5%) Figure 7 The Output for Tutor.com Education Portal Options Association Hyperedges (orderly archived) Furthermore, Fig. 8 shows the six most accepted Association Rules Hyperedge for Tutor.com Education server log files during selected period of time. At present, Web usage mining extract patterns in Web server logs. The findings from this study provide an overview of the usage pattern of Tutor.com Portal. The results from this study are useful for the Web administrator in order to improve Web services and performance of the Web sites, in terms of their contents, structure, presentation and delivery. With this information, web developer can design better Web Pages which will attract more users to visit the website. The study also demonstrates the use of Association Rules in Web Usage Mining by extracting rules from the log files. The outcome of this study can be used by the Tutor.com s System administrator as a guideline in enhancing the use of Tutor.com. Conclusion With the growth of Web-based applications, there is significant interest in analyzing Web usage data to better understand Web usage, and apply the knowledge to better serve users. This study use Association Rules to identify rules for Tutor.com Education Portal. From the findings, it showed that the etuisyen and banksoalan option has high access rates as compared to other options. These two options provide notes, exercises and trial examination questions that have been prepared by expert teachers. This education material is prepared according to the latest examination format and needs. For future study, maybe we should embed the web usage mining process in the web pages itself. With this, the system itself can automatically generate rules according to Association Rules. Therefore the system may perform and implement the Web usage mining phase including data selection, data preprocessing, pattern discovery and analysis. In future, another method for analyzing sparse data can also be used in the study of E- Learning Web log access for knowledge extraction from Web log data 715
5 References [1] Agrawal, R. and Srikant, R Fast Algorithms For Mining Association Rules. Proc. of the 20th VLDB Conference. pp [2] Chen, M.-S., Han, J., Yu, P.S Data Mining: An Overview from a Database Perspective. IEEE Transactions on Knowledge and Data Engineering, (8:6). Pp [3] Cooley, R., Mobasher, B., and Srivastava, J Web Mining: Information and Pattern Discovery on the World Wide Web. Technical Report, TR [4] Cooley, R., Tan, P.N., and Srivastava, J WebSIFT: The Web Site Information Filter System. Proceedings of the Web Usage Analysis and User Profiling Workshop (WebKDD 99). [5] Cooley, R.,,Mobasher, B., and Srivastava, J Data preparation for mining world wide Web browsing patterns. Knowledge and Information Systems. Vol 1. No.1. [6] Cooley, R Web Usage Mining: Discovery and Application of Interesting Patterns from Web data. PhD thesis. Dept. of Computer Science, University of Minnesota [7] Fayyad, U. M., Piatetski-Shapiro, G., Smith, P From Data Mining to Knowledge Discovery: An Overview. Advances in Knowledge Discovery and Data Mining. Pp [8] Kosala, R. and Blockeel, H Web Mining Research: A Survey. ACM SIGKDD. Vol. 2. Issue 1.pp [9] Lee, R. S. T and Liu, J. N. K ijade eminer: A Web-Based Mining Agent Based on Intelligent Java Agent Development Environment on Internet Shopping. PAKDD LNAI pp [10] Madria, S., Bhowmick, S. S., Ng, W. K., and Lim, E. P Research Issue in Web Data Mining. Data Warehousing and Knowledge Discovery. [11] Mohammadian, M Intelligent Data Mining and Information Retrieval from World Wide Web for E- Business Applications. URL [12] Pal, S. K., Talwar, V., and Mitra, P Web Mining in Soft Computing Framework: Relevance, State of the Art and Future Directions. IEEE [13] Xue, G. R., Zeng, H. J., Chen, Z., Ma, W. Y., and Lu, C. J Log Mining to Improve the performance of Site Search. Third Int. Conf. of WISEw
SEQUENTIAL PATTERN MINING FROM WEB LOG DATA
SEQUENTIAL PATTERN MINING FROM WEB LOG DATA Rajashree Shettar 1 1 Associate Professor, Department of Computer Science, R. V College of Engineering, Karnataka, India, rajashreeshettar@rvce.edu.in Abstract
More informationData Mining of Web Access Logs Using Classification Techniques
Data Mining of Web Logs Using Classification Techniques Md. Azam 1, Asst. Prof. Md. Tabrez Nafis 2 1 M.Tech Scholar, Department of Computer Science & Engineering, Al-Falah School of Engineering & Technology,
More informationAssociation Rule Mining among web pages for Discovering Usage Patterns in Web Log Data L.Mohan 1
Volume 4, No. 5, May 2013 (Special Issue) International Journal of Advanced Research in Computer Science RESEARCH PAPER Available Online at www.ijarcs.info Association Rule Mining among web pages for Discovering
More informationSupport System- Pioneering approach for Web Data Mining
Support System- Pioneering approach for Web Data Mining Geeta Kataria 1, Surbhi Kaushik 2, Nidhi Narang 3 and Sunny Dahiya 4 1,2,3,4 Computer Science Department Kurukshetra University Sonepat, India ABSTRACT
More informationWe are IntechOpen, the first native scientific publisher of Open Access books. International authors and editors. Our authors are among the TOP 1%
We are IntechOpen, the first native scientific publisher of Open Access books 3,350 108,000 1.7 M Open access books available International authors and editors Downloads Our authors are among the 151 Countries
More informationFuzzy Cognitive Maps application for Webmining
Fuzzy Cognitive Maps application for Webmining Andreas Kakolyris Dept. Computer Science, University of Ioannina Greece, csst9942@otenet.gr George Stylios Dept. of Communications, Informatics and Management,
More informationSurvey Paper on Web Usage Mining for Web Personalization
ISSN 2278 0211 (Online) Survey Paper on Web Usage Mining for Web Personalization Namdev Anwat Department of Computer Engineering Matoshri College of Engineering & Research Center, Eklahare, Nashik University
More informationA Survey on Web Personalization of Web Usage Mining
A Survey on Web Personalization of Web Usage Mining S.Jagan 1, Dr.S.P.Rajagopalan 2 1 Assistant Professor, Department of CSE, T.J. Institute of Technology, Tamilnadu, India 2 Professor, Department of CSE,
More informationEXTRACTION OF INTERESTING PATTERNS THROUGH ASSOCIATION RULE MINING FOR IMPROVEMENT OF WEBSITE USABILITY
ISTANBUL UNIVERSITY JOURNAL OF ELECTRICAL & ELECTRONICS ENGINEERING YEAR VOLUME NUMBER : 2009 : 9 : 2 (1037-1046) EXTRACTION OF INTERESTING PATTERNS THROUGH ASSOCIATION RULE MINING FOR IMPROVEMENT OF WEBSITE
More informationThe influence of caching on web usage mining
The influence of caching on web usage mining J. Huysmans 1, B. Baesens 1,2 & J. Vanthienen 1 1 Department of Applied Economic Sciences, K.U.Leuven, Belgium 2 School of Management, University of Southampton,
More informationWeb Page Classification using FP Growth Algorithm Akansha Garg,Computer Science Department Swami Vivekanad Subharti University,Meerut, India
Web Page Classification using FP Growth Algorithm Akansha Garg,Computer Science Department Swami Vivekanad Subharti University,Meerut, India Abstract - The primary goal of the web site is to provide the
More informationPattern Classification based on Web Usage Mining using Neural Network Technique
International Journal of Computer Applications (975 8887) Pattern Classification based on Web Usage Mining using Neural Network Technique Er. Romil V Patel PIET, VADODARA Dheeraj Kumar Singh, PIET, VADODARA
More informationChapter 2 BACKGROUND OF WEB MINING
Chapter 2 BACKGROUND OF WEB MINING Overview 2.1. Introduction to Data Mining Data mining is an important and fast developing area in web mining where already a lot of research has been done. Recently,
More informationKnowledge Discovery from Web Usage Data: Research and Development of Web Access Pattern Tree Based Sequential Pattern Mining Techniques: A Survey
Knowledge Discovery from Web Usage Data: Research and Development of Web Access Pattern Tree Based Sequential Pattern Mining Techniques: A Survey G. Shivaprasad, N. V. Subbareddy and U. Dinesh Acharya
More informationWeb Usage Mining for Comparing User Access Behaviour using Sequential Pattern
Web Usage Mining for Comparing User Access Behaviour using Sequential Pattern Amit Dipchandji Kasliwal #, Dr. Girish S. Katkar * # Malegaon, Nashik, Maharashtra, India * Dept. of Computer Science, Arts,
More informationTHE STUDY OF WEB MINING - A SURVEY
THE STUDY OF WEB MINING - A SURVEY Ashish Gupta, Anil Khandekar Abstract over the year s web mining is the very fast growing research field. Web mining contains two research areas: Data mining and World
More informationWeb page recommendation using a stochastic process model
Data Mining VII: Data, Text and Web Mining and their Business Applications 233 Web page recommendation using a stochastic process model B. J. Park 1, W. Choi 1 & S. H. Noh 2 1 Computer Science Department,
More informationA Framework for Personal Web Usage Mining
A Framework for Personal Web Usage Mining Yongjian Fu Ming-Yi Shih Department of Computer Science Department of Computer Science University of Missouri-Rolla University of Missouri-Rolla Rolla, MO 65409-0350
More informationIJMIE Volume 2, Issue 9 ISSN:
WEB USAGE MINING: LEARNER CENTRIC APPROACH FOR E-BUSINESS APPLICATIONS B. NAVEENA DEVI* Abstract Emerging of web has put forward a great deal of challenges to web researchers for web based information
More informationUNIT-V WEB MINING. 3/18/2012 Prof. Asha Ambhaikar, RCET Bhilai.
UNIT-V WEB MINING 1 Mining the World-Wide Web 2 What is Web Mining? Discovering useful information from the World-Wide Web and its usage patterns. 3 Web search engines Index-based: search the Web, index
More informationInternational Journal of Computer Engineering and Applications, ICCSTAR-2016, Special Issue, May.16
The Survey Of Data Mining And Warehousing Architha.S, A.Kishore Kumar Department of Computer Engineering Department of computer engineering city engineering college VTU Bangalore, India ABSTRACT: Data
More informationTo Enhance Projection Scalability of Item Transactions by Parallel and Partition Projection using Dynamic Data Set
To Enhance Scalability of Item Transactions by Parallel and Partition using Dynamic Data Set Priyanka Soni, Research Scholar (CSE), MTRI, Bhopal, priyanka.soni379@gmail.com Dhirendra Kumar Jha, MTRI, Bhopal,
More informationSemantic Web Mining and its application in Human Resource Management
International Journal of Computer Science & Management Studies, Vol. 11, Issue 02, August 2011 60 Semantic Web Mining and its application in Human Resource Management Ridhika Malik 1, Kunjana Vasudev 2
More informationImproving the prediction of next page request by a web user using Page Rank algorithm
Improving the prediction of next page request by a web user using Page Rank algorithm Claudia Elena Dinucă, Dumitru Ciobanu Faculty of Economics and Business Administration Cybernetics and statistics University
More informationChapter 1, Introduction
CSI 4352, Introduction to Data Mining Chapter 1, Introduction Young-Rae Cho Associate Professor Department of Computer Science Baylor University What is Data Mining? Definition Knowledge Discovery from
More informationAPD-A Tool for Identifying Behavioural Patterns Automatically from Clickstream Data
APD-A Tool for Identifying Behavioural Patterns Automatically from Clickstream Data I-Hsien Ting, Lillian Clark, Chris Kimble, Daniel Kudenko, and Peter Wright Department of Computer Science, The University
More informationAn Improved Apriori Algorithm for Association Rules
Research article An Improved Apriori Algorithm for Association Rules Hassan M. Najadat 1, Mohammed Al-Maolegi 2, Bassam Arkok 3 Computer Science, Jordan University of Science and Technology, Irbid, Jordan
More informationI. Introduction II. Keywords- Pre-processing, Cleaning, Null Values, Webmining, logs
ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: An Enhanced Pre-Processing Research Framework for Web Log Data
More informationCLASSIFICATION OF WEB LOG DATA TO IDENTIFY INTERESTED USERS USING DECISION TREES
CLASSIFICATION OF WEB LOG DATA TO IDENTIFY INTERESTED USERS USING DECISION TREES K. R. Suneetha, R. Krishnamoorthi Bharathidasan Institute of Technology, Anna University krs_mangalore@hotmail.com rkrish_26@hotmail.com
More informationContext-based Navigational Support in Hypermedia
Context-based Navigational Support in Hypermedia Sebastian Stober and Andreas Nürnberger Institut für Wissens- und Sprachverarbeitung, Fakultät für Informatik, Otto-von-Guericke-Universität Magdeburg,
More informationMining of Web Server Logs using Extended Apriori Algorithm
International Association of Scientific Innovation and Research (IASIR) (An Association Unifying the Sciences, Engineering, and Applied Research) International Journal of Emerging Technologies in Computational
More informationA Website Mining Model Centered on User Queries
A Website Mining Model Centered on User Queries Ricardo Baeza-Yates 1, 3, 2 and Barbara Poblete 2, 3 1 ICREA, Barcelona, Catalunya, Spain 2 Center for Web Research, CS Dept., University of Chile 3 Web
More informationWeb Data mining-a Research area in Web usage mining
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661, p- ISSN: 2278-8727Volume 13, Issue 1 (Jul. - Aug. 2013), PP 22-26 Web Data mining-a Research area in Web usage mining 1 V.S.Thiyagarajan,
More informationAssociation-Rules-Based Recommender System for Personalization in Adaptive Web-Based Applications
Association-Rules-Based Recommender System for Personalization in Adaptive Web-Based Applications Daniel Mican, Nicolae Tomai Babes-Bolyai University, Dept. of Business Information Systems, Str. Theodor
More informationResearch/Review Paper: Web Personalization Using Usage Based Clustering Author: Madhavi M.Mali,Sonal S.Jogdand, Deepali P. Shinde Paper ID: V1-I3-002
Journal) Volume1, Issue3, Nov-Dec, 2014.ISSN: 2349-7173(Online) International Journal of Advanced Research in Technology, Engineering and Science (A Bimonthly Open Access Online. Research/Review Paper:
More informationInternational Journal of Advanced Research in Computer Science and Software Engineering
Volume 2, Issue 9, September 2012 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Discovery
More informationInternational Journal of Scientific & Engineering Research, Volume 4, Issue 11, November ISSN
International Journal of Scientific & Engineering Research, Volume 4, Issue 11, November-2013 398 Web Usage Mining has Pattern Discovery DR.A.Venumadhav : venumadhavaka@yahoo.in/ akavenu17@rediffmail.com
More informationWeb Usage Mining: A Research Area in Web Mining
Web Usage Mining: A Research Area in Web Mining Rajni Pamnani, Pramila Chawan Department of computer technology, VJTI University, Mumbai Abstract Web usage mining is a main research area in Web mining
More informationCharacterizing Home Pages 1
Characterizing Home Pages 1 Xubin He and Qing Yang Dept. of Electrical and Computer Engineering University of Rhode Island Kingston, RI 881, USA Abstract Home pages are very important for any successful
More informationSemantic Clickstream Mining
Semantic Clickstream Mining Mehrdad Jalali 1, and Norwati Mustapha 2 1 Department of Software Engineering, Mashhad Branch, Islamic Azad University, Mashhad, Iran 2 Department of Computer Science, Universiti
More informationWeb Usage Data for Web Access Control (WUDWAC)
Web Usage Data for Web Access Control (WUDWAC) Dr. Selma Elsheikh* Abstract The development and the widespread use of the World Wide Web have made electronic data storage and data distribution possible
More informationIJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, ISSN:
IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, 20131 Improve Search Engine Relevance with Filter session Addlin Shinney R 1, Saravana Kumar T
More informationA recommendation engine by using association rules
Available online at www.sciencedirect.com Procedia - Social and Behavioral Sciences 62 ( 2012 ) 452 456 WCBEM 2012 A recommendation engine by using association rules Ozgur Cakir a 1, Murat Efe Aras b a
More informationKeywords: Figure 1: Web Log File. 2013, IJARCSSE All Rights Reserved Page 1167
Volume 3, Issue 12, December 2013 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com A Review on
More informationDeep Web Crawling and Mining for Building Advanced Search Application
Deep Web Crawling and Mining for Building Advanced Search Application Zhigang Hua, Dan Hou, Yu Liu, Xin Sun, Yanbing Yu {hua, houdan, yuliu, xinsun, yyu}@cc.gatech.edu College of computing, Georgia Tech
More informationARS: Web Page Recommendation System for Anonymous Users Based On Web Usage Mining
ARS: Web Page Recommendation System for Anonymous Users Based On Web Usage Mining Yahya AlMurtadha, MD. Nasir Bin Sulaiman, Norwati Mustapha, Nur Izura Udzir and Zaiton Muda University Putra Malaysia,
More informationKeywords Web Mining, Web Usage Mining, Web Structure Mining, Web Content Mining.
Volume 3, Issue 7, July 2013 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com A Framework to
More informationWeb Usage Mining: How to Efficiently Manage New Transactions and New Clients
Web Usage Mining: How to Efficiently Manage New Transactions and New Clients F. Masseglia 1,2, P. Poncelet 2, and M. Teisseire 2 1 Laboratoire PRiSM, Univ. de Versailles, 45 Avenue des Etats-Unis, 78035
More informationObtaining Language Models of Web Collections Using Query-Based Sampling Techniques
-7695-1435-9/2 $17. (c) 22 IEEE 1 Obtaining Language Models of Web Collections Using Query-Based Sampling Techniques Gary A. Monroe James C. French Allison L. Powell Department of Computer Science University
More informationOntology Generation from Session Data for Web Personalization
Int. J. of Advanced Networking and Application 241 Ontology Generation from Session Data for Web Personalization P.Arun Research Associate, Madurai Kamaraj University, Madurai 62 021, Tamil Nadu, India.
More informationKDD, SEMMA AND CRISP-DM: A PARALLEL OVERVIEW. Ana Azevedo and M.F. Santos
KDD, SEMMA AND CRISP-DM: A PARALLEL OVERVIEW Ana Azevedo and M.F. Santos ABSTRACT In the last years there has been a huge growth and consolidation of the Data Mining field. Some efforts are being done
More informationOverview of Web Mining Techniques and its Application towards Web
Overview of Web Mining Techniques and its Application towards Web *Prof.Pooja Mehta Abstract The World Wide Web (WWW) acts as an interactive and popular way to transfer information. Due to the enormous
More informationA Data Mining Framework for Extracting Product Sales Patterns in Retail Store Transactions Using Association Rules: A Case Study
A Data Mining Framework for Extracting Product Sales Patterns in Retail Store Transactions Using Association Rules: A Case Study Mirzaei.Afshin 1, Sheikh.Reza 2 1 Department of Industrial Engineering and
More informationDeep Web Content Mining
Deep Web Content Mining Shohreh Ajoudanian, and Mohammad Davarpanah Jazi Abstract The rapid expansion of the web is causing the constant growth of information, leading to several problems such as increased
More informationWeb Mining Using Cloud Computing Technology
International Journal of Scientific Research in Computer Science and Engineering Review Paper Volume-3, Issue-2 ISSN: 2320-7639 Web Mining Using Cloud Computing Technology Rajesh Shah 1 * and Suresh Jain
More informationPre-processing of Web Logs for Mining World Wide Web Browsing Patterns
Pre-processing of Web Logs for Mining World Wide Web Browsing Patterns # Yogish H K #1 Dr. G T Raju *2 Department of Computer Science and Engineering Bharathiar University Coimbatore, 641046, Tamilnadu
More informationA New Web Usage Mining Approach for Website Recommendations Using Concept Hierarchy and Website Graph
A New Web Usage Mining Approach for Website Recommendations Using Concept Hierarchy and Website Graph T. Vijaya Kumar, H. S. Guruprasad, Bharath Kumar K. M., Irfan Baig, and Kiran Babu S. Abstract To have
More informationAssociation Rules Mining using BOINC based Enterprise Desktop Grid
Association Rules Mining using BOINC based Enterprise Desktop Grid Evgeny Ivashko and Alexander Golovin Institute of Applied Mathematical Research, Karelian Research Centre of Russian Academy of Sciences,
More informationWeighted Page Rank Algorithm Based on Number of Visits of Links of Web Page
International Journal of Soft Computing and Engineering (IJSCE) ISSN: 31-307, Volume-, Issue-3, July 01 Weighted Page Rank Algorithm Based on Number of Visits of Links of Web Page Neelam Tyagi, Simple
More informationA Modified Apriori Algorithm for Fast and Accurate Generation of Frequent Item Sets
INTERNATIONAL JOURNAL OF SCIENTIFIC & TECHNOLOGY RESEARCH VOLUME 6, ISSUE 08, AUGUST 2017 ISSN 2277-8616 A Modified Apriori Algorithm for Fast and Accurate Generation of Frequent Item Sets K.A.Baffour,
More informationRaunak Rathi 1, Prof. A.V.Deorankar 2 1,2 Department of Computer Science and Engineering, Government College of Engineering Amravati
Analytical Representation on Secure Mining in Horizontally Distributed Database Raunak Rathi 1, Prof. A.V.Deorankar 2 1,2 Department of Computer Science and Engineering, Government College of Engineering
More informationAssociating Terms with Text Categories
Associating Terms with Text Categories Osmar R. Zaïane Department of Computing Science University of Alberta Edmonton, AB, Canada zaiane@cs.ualberta.ca Maria-Luiza Antonie Department of Computing Science
More informationWeb Mining and Knowledge Discovery of Usage Patterns
Web Mining and Knowledge Discovery of Usage Patterns CS 748T Project (Part I) Yan Wang February, 2000 1 Abstract Web mining is a very hot research topic which combines two of the activated research areas:
More informationInferring User Search for Feedback Sessions
Inferring User Search for Feedback Sessions Sharayu Kakade 1, Prof. Ranjana Barde 2 PG Student, Department of Computer Science, MIT Academy of Engineering, Pune, MH, India 1 Assistant Professor, Department
More informationKeywords Data alignment, Data annotation, Web database, Search Result Record
Volume 5, Issue 8, August 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Annotating Web
More informationLife Science Journal 2017;14(2) Optimized Web Content Mining
Optimized Web Content Mining * K. Thirugnana Sambanthan,** Dr. S.S. Dhenakaran, Professor * Research Scholar, Dept. Computer Science, Alagappa University, Karaikudi, E-mail: shivaperuman@gmail.com ** Dept.
More informationINTRODUCTION. Chapter GENERAL
Chapter 1 INTRODUCTION 1.1 GENERAL The World Wide Web (WWW) [1] is a system of interlinked hypertext documents accessed via the Internet. It is an interactive world of shared information through which
More informationA Navigation-log based Web Mining Application to Profile the Interests of Users Accessing the Web of Bidasoa Turismo
A Navigation-log based Web Mining Application to Profile the Interests of Users Accessing the Web of Bidasoa Turismo Olatz Arbelaitz, Ibai Gurrutxaga, Aizea Lojo, Javier Muguerza, Jesús M. Pérez and Iñigo
More informationA SURVEY- WEB MINING TOOLS AND TECHNIQUE
International Journal of Latest Trends in Engineering and Technology Vol.(7)Issue(4), pp.212-217 DOI: http://dx.doi.org/10.21172/1.74.028 e-issn:2278-621x A SURVEY- WEB MINING TOOLS AND TECHNIQUE Prof.
More informationDiscovering the Association Rules in OLAP Data Cube with Daily Downloads of Folklore Materials *
Discovering the Association Rules in OLAP Data Cube with Daily Downloads of Folklore Materials * Galina Bogdanova, Tsvetanka Georgieva Abstract: Association rules mining is one kind of data mining techniques
More informationResearch Article A Novel Web Classification Algorithm Using Fuzzy Weighted Association Rules
ISRN Artificial Intelligence Volume 213, Article ID 316913, 1 pages http://dx.doi.org/1.1155/213/316913 Research Article A Novel Web Classification Algorithm Using Fuzzy Weighted Association Rules Binu
More informationGraph based Approach for Mining Frequent Sequential Access Patterns of Web pages
Graph based Approach for Mining Frequent Sequential Access Patterns of Web pages Dheeraj Kumar Singh Varsha Sharma Sanjeev Sharma ABSTRACT The Internet has impacted almost every aspect of our society.
More informationA Framework for Securing Databases from Intrusion Threats
A Framework for Securing Databases from Intrusion Threats R. Prince Jeyaseelan James Department of Computer Applications, Valliammai Engineering College Affiliated to Anna University, Chennai, India Email:
More informationItem Set Extraction of Mining Association Rule
Item Set Extraction of Mining Association Rule Shabana Yasmeen, Prof. P.Pradeep Kumar, A.Ranjith Kumar Department CSE, Vivekananda Institute of Technology and Science, Karimnagar, A.P, India Abstract:
More informationMonotone Constraints in Frequent Tree Mining
Monotone Constraints in Frequent Tree Mining Jeroen De Knijf Ad Feelders Abstract Recent studies show that using constraints that can be pushed into the mining process, substantially improves the performance
More informationDMSA TECHNIQUE FOR FINDING SIGNIFICANT PATTERNS IN LARGE DATABASE
DMSA TECHNIQUE FOR FINDING SIGNIFICANT PATTERNS IN LARGE DATABASE Saravanan.Suba Assistant Professor of Computer Science Kamarajar Government Art & Science College Surandai, TN, India-627859 Email:saravanansuba@rediffmail.com
More informationINTERNATIONAL JOURNAL OF COMPUTER ENGINEERING & TECHNOLOGY (IJCET)
INTERNATIONAL JOURNAL OF COMPUTER ENGINEERING & TECHNOLOGY (IJCET) International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), ISSN 0976 6367(Print) ISSN 0976 6375(Online)
More informationWeb Mining Team 11 Professor Anita Wasilewska CSE 634 : Data Mining Concepts and Techniques
Web Mining Team 11 Professor Anita Wasilewska CSE 634 : Data Mining Concepts and Techniques Imgref: https://www.kdnuggets.com/2014/09/most-viewed-web-mining-lectures-videolectures.html Contents Introduction
More informationThe Comparison of CBA Algorithm and CBS Algorithm for Meteorological Data Classification Mohammad Iqbal, Imam Mukhlash, Hanim Maria Astuti
Information Systems International Conference (ISICO), 2 4 December 2013 The Comparison of CBA Algorithm and CBS Algorithm for Meteorological Data Classification Mohammad Iqbal, Imam Mukhlash, Hanim Maria
More informationAn Efficient Algorithm for finding high utility itemsets from online sell
An Efficient Algorithm for finding high utility itemsets from online sell Sarode Nutan S, Kothavle Suhas R 1 Department of Computer Engineering, ICOER, Maharashtra, India 2 Department of Computer Engineering,
More informationA System for Identifying Voyage Package Using Different Recommendations Techniques
GLOBAL IMPACT FACTOR 0.238 DIIF 0.876 A System for Identifying Voyage Package Using Different Recommendations Techniques 1 Gajjela.Sandeep, 2 R. Chandrashekar 1 M.Tech (CS),Department of Computer Science
More informationIntelligent management of on-line video learning resources supported by Web-mining technology based on the practical application of VOD
World Transactions on Engineering and Technology Education Vol.13, No.3, 2015 2015 WIETE Intelligent management of on-line video learning resources supported by Web-mining technology based on the practical
More informationChapter 3 Process of Web Usage Mining
Chapter 3 Process of Web Usage Mining 3.1 Introduction Users interact frequently with different web sites and can access plenty of information on WWW. The World Wide Web is growing continuously and huge
More informationA Comparative Study of Association Rules Mining Algorithms
A Comparative Study of Association Rules Mining Algorithms Cornelia Győrödi *, Robert Győrödi *, prof. dr. ing. Stefan Holban ** * Department of Computer Science, University of Oradea, Str. Armatei Romane
More informationIliya Mitov 1, Krassimira Ivanova 1, Benoit Depaire 2, Koen Vanhoof 2
Iliya Mitov 1, Krassimira Ivanova 1, Benoit Depaire 2, Koen Vanhoof 2 1: Institute of Mathematics and Informatics BAS, Sofia, Bulgaria 2: Hasselt University, Belgium 1 st Int. Conf. IMMM, 23-29.10.2011,
More informationUser Session Identification Using Enhanced Href Method
User Session Identification Using Enhanced Href Method Department of Computer Science, Constantine the Philosopher University in Nitra, Slovakia jkapusta@ukf.sk, psvec@ukf.sk, mmunk@ukf.sk, jskalka@ukf.sk
More informationA Hierarchical Document Clustering Approach with Frequent Itemsets
A Hierarchical Document Clustering Approach with Frequent Itemsets Cheng-Jhe Lee, Chiun-Chieh Hsu, and Da-Ren Chen Abstract In order to effectively retrieve required information from the large amount of
More informationKeywords: Web Mining, Web Usage Mining, Web Structure Mining, Web Content Mining, Graph theory.
Volume 5, Issue 9, September 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com A Survey on
More informationA Modified Apriori Algorithm
A Modified Apriori Algorithm K.A.Baffour, C.Osei-Bonsu, A.F. Adekoya Abstract: The Classical Apriori Algorithm (CAA), which is used for finding frequent itemsets in Association Rule Mining consists of
More informationNitin Cyriac et al, Int.J.Computer Technology & Applications,Vol 5 (1), WEB PERSONALIZATION
WEB PERSONALIZATION Mrs. M.Kiruthika 1, Nitin Cyriac 2, Aditya Mandhare 3, Soniya Nemade 4 DEPARTMENT OF COMPUTER ENGINEERING Fr. CONCEICAO RODRIGUES INSTITUTE OF TECHNOLOGY,VASHI Email- 1 venkatr20032002@gmail.com,
More informationTERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES
TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES Mu. Annalakshmi Research Scholar, Department of Computer Science, Alagappa University, Karaikudi. annalakshmi_mu@yahoo.co.in Dr. A.
More informationWeb crawlers Data Mining Techniques for Handling Big Data Analytics
Web crawlers Data Mining Techniques for Handling Big Data Analytics Mr.V.NarsingRao 2 Mr.K.Vijay Babu 1 Sphoorthy Engineering College, Nadergul,R.R.District CMR Engineering College, Medchal. Abstract:
More informationPattern Mining. Knowledge Discovery and Data Mining 1. Roman Kern KTI, TU Graz. Roman Kern (KTI, TU Graz) Pattern Mining / 42
Pattern Mining Knowledge Discovery and Data Mining 1 Roman Kern KTI, TU Graz 2016-01-14 Roman Kern (KTI, TU Graz) Pattern Mining 2016-01-14 1 / 42 Outline 1 Introduction 2 Apriori Algorithm 3 FP-Growth
More informationDomain-specific Concept-based Information Retrieval System
Domain-specific Concept-based Information Retrieval System L. Shen 1, Y. K. Lim 1, H. T. Loh 2 1 Design Technology Institute Ltd, National University of Singapore, Singapore 2 Department of Mechanical
More informationImproved Data Preparation Technique in Web Usage Mining
International Journal of Computer Networks and Communications Security VOL.1, NO.7, DECEMBER 2013, 284 291 Available online at: www.ijcncs.org ISSN 2308-9830 C N C S Improved Data Preparation Technique
More informationDATA MINING II - 1DL460. Spring 2014"
DATA MINING II - 1DL460 Spring 2014" A second course in data mining http://www.it.uu.se/edu/course/homepage/infoutv2/vt14 Kjell Orsborn Uppsala Database Laboratory Department of Information Technology,
More informationOverview. Data-mining. Commercial & Scientific Applications. Ongoing Research Activities. From Research to Technology Transfer
Data Mining George Karypis Department of Computer Science Digital Technology Center University of Minnesota, Minneapolis, USA. http://www.cs.umn.edu/~karypis karypis@cs.umn.edu Overview Data-mining What
More informationDENSITY BASED AND PARTITION BASED CLUSTERING OF UNCERTAIN DATA BASED ON KL-DIVERGENCE SIMILARITY MEASURE
DENSITY BASED AND PARTITION BASED CLUSTERING OF UNCERTAIN DATA BASED ON KL-DIVERGENCE SIMILARITY MEASURE Sinu T S 1, Mr.Joseph George 1,2 Computer Science and Engineering, Adi Shankara Institute of Engineering
More informationarxiv: v1 [cs.db] 7 Dec 2011
Using Taxonomies to Facilitate the Analysis of the Association Rules Marcos Aurélio Domingues 1 and Solange Oliveira Rezende 2 arxiv:1112.1734v1 [cs.db] 7 Dec 2011 1 LIACC-NIAAD Universidade do Porto Rua
More informationFUFM-High Utility Itemsets in Transactional Database
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 3, March 2014,
More information