IJMIE Volume 2, Issue 9 ISSN:

Similar documents
International Journal of Advance Engineering and Research Development. A Survey on Data Mining Methods and its Applications

Overview of Web Mining Techniques and its Application towards Web

A Framework for Personal Web Usage Mining

Correlation Based Feature Selection with Irrelevant Feature Removal

Support System- Pioneering approach for Web Data Mining

I. Introduction II. Keywords- Pre-processing, Cleaning, Null Values, Webmining, logs

ISSN: (Online) Volume 3, Issue 9, September 2015 International Journal of Advance Research in Computer Science and Management Studies

SK International Journal of Multidisciplinary Research Hub Research Article / Survey Paper / Case Study Published By: SK Publisher

THE STUDY OF WEB MINING - A SURVEY

Improving Stack Overflow Tag Prediction Using Eye Tracking Alina Lazar Youngstown State University Bonita Sharif, Youngstown State University

International Journal of Scientific Research & Engineering Trends Volume 4, Issue 6, Nov-Dec-2018, ISSN (Online): X

Performance Analysis of Data Mining Classification Techniques

Semantic Clickstream Mining

Bachelor of Science in Business Administration - Information Systems and Technology Major

Enhancing Forecasting Performance of Naïve-Bayes Classifiers with Discretization Techniques

Chapter 2 BACKGROUND OF WEB MINING

International Journal of Computer Engineering and Applications, Volume VIII, Issue III, Part I, December 14

Data Mining. Ryan Benton Center for Advanced Computer Studies University of Louisiana at Lafayette Lafayette, La., USA.

A Comparative Study of Data Mining Process Models (KDD, CRISP-DM and SEMMA)

Clustering of Data with Mixed Attributes based on Unified Similarity Metric

The influence of caching on web usage mining

Oracle9i Data Mining. An Oracle White Paper December 2001

Data Mining of Web Access Logs Using Classification Techniques

International Journal of Computer Engineering and Applications, ICCSTAR-2016, Special Issue, May.16

PREDICTING UPCOMING STUDENTS PERFORMANCE USING MINING TECHNIQUE

Research on Applications of Data Mining in Electronic Commerce. Xiuping YANG 1, a

CE4031 and CZ4031 Database System Principles

Pattern Classification based on Web Usage Mining using Neural Network Technique

Data Mining in the Application of E-Commerce Website

Oracle9i Data Mining. Data Sheet August 2002

CLASSIFICATION OF WEB LOG DATA TO IDENTIFY INTERESTED USERS USING DECISION TREES

CSE 626: Data mining. Instructor: Sargur N. Srihari. Phone: , ext. 113

DATA MINING II - 1DL460. Spring 2014"

Classifying Twitter Data in Multiple Classes Based On Sentiment Class Labels

Fig 1. Overview of IE-based text mining framework

Ajloun National University

Log Information Mining Using Association Rules Technique: A Case Study Of Utusan Education Portal

CE4031 and CZ4031 Database System Principles

IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, ISSN:

Survey Paper on Web Usage Mining for Web Personalization

An Integrated Framework to Enhance the Web Content Mining and Knowledge Discovery

A Survey on Web Personalization of Web Usage Mining

6 TOOLS FOR A COMPLETE MARKETING WORKFLOW

Web Mining Evolution & Comparative Study with Data Mining

Search Engine Optimization Specialized Studies

Web Page Classification using FP Growth Algorithm Akansha Garg,Computer Science Department Swami Vivekanad Subharti University,Meerut, India

A Lime Light on the Emerging Trends of Web Mining

WKU-MIS-B10 Data Management: Warehousing, Analyzing, Mining, and Visualization. Management Information Systems

TIM 50 - Business Information Systems

TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES

Web Data mining-a Research area in Web usage mining

An Efficient Approach for Color Pattern Matching Using Image Mining

Question Bank. 4) It is the source of information later delivered to data marts.

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

Efficient integration of data mining techniques in DBMSs

American International Journal of Research in Science, Technology, Engineering & Mathematics

Data Mining. Introduction. Hamid Beigy. Sharif University of Technology. Fall 1395

Data Mining Techniques Methods Algorithms and Tools

A REVIEW ON CLASSIFICATION TECHNIQUES OVER AGRICULTURAL DATA

An Overview of various methodologies used in Data set Preparation for Data mining Analysis

Best Customer Services among the E-Commerce Websites A Predictive Analysis

Similarity Matrix Based Session Clustering by Sequence Alignment Using Dynamic Programming

Web Usage Mining using ART Neural Network. Abstract

Keywords: Web Mining, Web Usage Mining, Web Structure Mining, Web Content Mining, Graph theory.

Accounting Ethics and Auditing

Outlier Detection Using Unsupervised and Semi-Supervised Technique on High Dimensional Data

Data Mining. Introduction. Hamid Beigy. Sharif University of Technology. Fall 1394

Web Usage Mining: A Research Area in Web Mining

Improved Frequent Pattern Mining Algorithm with Indexing

Competitive Intelligence and Web Mining:

E-COMMERCE WEBSITE DESIGN IMPROVEMENT BASED ON WEB USAGE MINING

KDD, SEMMA AND CRISP-DM: A PARALLEL OVERVIEW. Ana Azevedo and M.F. Santos

An Effective method for Web Log Preprocessing and Page Access Frequency using Web Usage Mining

SQA Advanced Unit specification. General information for centres. Unit title: Web Development Fundamentals. Unit code: HR7M 47

Dynamic Data in terms of Data Mining Streams

Analysis of Fragment Mining on Indian Financial Market

CSCI 6312 Advanced Internet Programming

National Unit Specification: general information. The Internet (Higher) NUMBER DM4F 12. Information Systems (Higher)

Web Mining Team 11 Professor Anita Wasilewska CSE 634 : Data Mining Concepts and Techniques

SIR C R REDDY COLLEGE OF ENGINEERING

Big Data Specialized Studies

Chapter 5: Summary and Conclusion CHAPTER 5 SUMMARY AND CONCLUSION. Chapter 1: Introduction

MASTER OF INFORMATION TECHNOLOGY (Structure B)

An Improved Document Clustering Approach Using Weighted K-Means Algorithm

ASSOCIATE DEGREE REQUIREMENTS

Taccumulation of the social network data has raised

A Review Paper on Web Usage Mining and Pattern Discovery

Association-Rules-Based Recommender System for Personalization in Adaptive Web-Based Applications

International Journal of Advanced Research in Computer Science and Software Engineering

Data Mining Technology Based on Bayesian Network Structure Applied in Learning

In this project, I examined methods to classify a corpus of s by their content in order to suggest text blocks for semi-automatic replies.

Web Mining Using Cloud Computing Technology

Intelligent management of on-line video learning resources supported by Web-mining technology based on the practical application of VOD

Chapter 12: Web Usage Mining

TIM 50 - Business Information Systems

RETRACTED ARTICLE. Web-Based Data Mining in System Design and Implementation. Open Access. Jianhu Gong 1* and Jianzhi Gong 2

CSI5387: Data Mining Project

A PROPOSED HYBRID BOOK RECOMMENDER SYSTEM

Recommendation on the Web Search by Using Co-Occurrence

WEB MINING: A KEY TO IMPROVE BUSINESS ON WEB

Transcription:

WEB USAGE MINING: LEARNER CENTRIC APPROACH FOR E-BUSINESS APPLICATIONS B. NAVEENA DEVI* Abstract Emerging of web has put forward a great deal of challenges to web researchers for web based information management and retrieval, especially online business competitiveness. Users find it very difficult to extract useful and relevant information from the huge amount of information particularly for business, education and e-learning. The problem can be solved by web usage mining with the experience using open web application programming interfaces(apis) which are available by major companies for example RapidMiner, Digg.com, Amazon, ebay are opened access to their services and data through APIs, and we can make use of their services for the development of web usage mining research applications. This paper presents an experiment with pattern classification for user behavior prediction for an e-commerce business application. The paper concludes that web usage mining can be an approach to build innovative business applications and has potential to help improve learning performance with the help of APIs with in short period of time. Keywords: Data Mining, Web Usage Mining, APIs, E-Learning. * Department of CSE, Mahatma Gandhi Institute of Technology, Hyderabad, Andhra Pradesh 500075, India. 479

1. Introduction The World Wide Web is a vast resource of multiple types of information in varied formats. Researchers are beginning to investigate human behavior in this distributed Web data warehouse and are trying to build models for understanding human behavior in virtual environments. Data mining, often called Web mining when applied to the Internet, is a process of extracting hidden predictive information and discovering meaningful patterns, profiles, and trends from large databases. Web research academia is requested to develop more efficient and effective techniques to satisfy the increasing demands of web users, such as retrieving the desirable and related information creating good quality web communities. In order to effectively utilize the power of the web, information technology professionals need to have sufficient knowledge and experience in various web technologies and applications. In recent years courses in Internet and web related topics have been offered in many universities to equip students with such knowledge. However, to built web usage mining applications from raw data not an easy task that every student can complete in a semester. Now a days many major internet companies providing their services and data in the form of application programming interfaces. Such type of real time data provides ideal playground for students to gain practical skills in business application development and experimenting with the various pattern analysis techniques. 2. Potential of Web usage Mining Web usage mining approaches can be applied to API-based learning and have promise for helping explain system usage. Web usage mining can be used to articulating and identifying patterns of use in learning systems may provide better understanding of how students undertake Web-based learning and guidance for better understanding of online business activities. There are many potential benefits of using Web mining for exploring learning behavior and patterns in APIlearning. However, there is scant literature on the use of Web mining with APIs. This article is a case report of how Web usage mining method, classification could be applied on APIs data sets. This case report shows how patterns emerge from the application of Web mining approaches. The key purpose of this report is to illustrate the potential of Web usage mining and to identify issues 480

in its application with currently available APIs. As a way of setting the context for the case report the following examples show how Web usage mining could benefit learning process. 2.1. Understand learner or customer behavior University administrators and instructors may be able to improve the implementation of business design systems by understanding the dynamic behavior of customers. 2.2.Determine e-learning or e-commerce system effectiveness Patterns of behavior may be associated with system performance and enable more customized system configuration. Administrators and instructors may be able to discover high and low use areas of the business system and adjust resources to optimize the technical performance of the system. 2.3.Measure the success of instructional efforts In API-learning systems, students use email, the Web forum, feedback forms, etc. to express their concerns and ask questions. These data are completely recorded in the learning system. Web usage mining can provide quantitative feedback to instructors about the outcomes of their activity. The above features show how Web usage mining could provide new insights about student activity in API based learning and to address information needs as well as suggest customization of approaches for implementing e-learning, e-business, education by APIs with experience of real time data sets broad and rich benefits may accrue from better understanding patterns of behavior. Table 1. Comparison of web usage mining process questions in business, education and e- learning. Business Education E-learning Who are my most profitable customers? Who are the students taking most credit hours? Who are the students with highest frequency of logging-in? 481

Who are my repeat Website Who are the ones likely to Who are the students most visitors? return for more classes? engaged on discussion boards? Who are my loyal customers? What clients are likely to defect to my rivals? Who are the persisters at our university, college? What type of courses can we offer to attract more students? What pages do the students access most? What kinds of students are likely to get a high score online? 3. Technical Preparation to Integrate APIs Recently there are a large number of web services that we can use and many of them are open source based. Web services are APIs that facilitate the communication between applications for example RapidMiner, Digg.com, Amazon, ebay are opened access to their services and data through APIs, and we can make use of their services for the development of web usage mining research applications. The concept of Web APIs enables direct access to the website functionalities in order to leverage third party efforts on value adding services. However, the number of companies, services or web sites that gather information about users increasing continuously. In recent years, due to the rapid development of web, more techniques related to internet and web applications have been introduced in many universities around the world to better equip students to effectively utilize the power of the web. <!DOCTYPE html -//W3C//DTD XHTML 1.0 Strict//EN http://www.w3.org/tr/xhtml1/dtd/xhtml1-strict.dtd > <html xmlns= http://www.w3.org/1999/xhtml > <head> <meta http-equiv= content- type content= text/html;charset=utf8 /> <script Src= httpmaps.google.com/maps?file=api&v=2& key=abcdefg type= text/javascript ></script> <script type= text/javascript/ > Function initialize() { If(GBrowserIsCompatible()) { Var map = new GMap2(document.getElementById ( map_canvas )); } } </script> </head> <body onload= initialize() onunload= GUnload() > <div id= map_canvas style= width:500px;height:300px ></div> </body> </html> 482

This example can be edited and play around with it, but we have to replace the key in the file with our own maps API key. Once register a key for a particular directory it works for all sub directories as well. Technical Preparation of developing e-business application using Open Web APIs: The Amazon e-commerce service example was implemented based on REST protocol. In order to illustrate the e-business application to integrate the web APIs into a specific website to perform selected business operations, depending on the system functionality. Many systems may need to store data such as Oracle, Microsoft SQL server or My SQL. This application is a Business Model constructed by using Amazon e-commerce service API for item listing data in the cellular phone category of auctions. The application then analyzes the captured data for market research and analysis purposes. Application provides tools such as statistical analysis, data analysis, and data visualization to aid sellers and buyers in achieving their business goals. We mined the Amazon e-commerce service API for all necessary product information, the use of the Amazon e-commerce service API is most crucial to the success of our Business Model. Through the Amazon e-commerce service API the data obtain information such as item,price, quantity,time and date of purchase. 4. Research Goals Given the limited use of Web mining in education and the potential benefits to online education of making it more student centric that might emerge from effective utilization of Web mining, we set two research goals for this project: (1) to see if patterns of behavior could be used to predict achievement in effective learning in applying various pattern analysis techniques in a set of API data, and (2) to develop a better understanding about the process of applying Web mining to business applications, as well as to minimize preprocessing work with existing current versions of real time data sets. While Web usage mining is a recognized approach for building knowledge and value in business and commercial information systems. In this sense, this research is primarily exploratory and while the objective is to build new insights about learning activity, an equally important objective is to examine the fit of Web mining approaches to e-commerce. What are the challenges in extracting data from APIs and applying Web mining approaches? What strategies for data formatting and data analysis are needed for building meaningful insights? What changes are needed in Web usage mining solutions to improve the yield of Web mining in E- 483

commerce systems? A key outcome of this research will be suggestions for understanding how best to utilize open resources for Web mining in business applications and as well as for education. 5. Process of Mining There are various data mining techniques such as statistics, classification, association rules, sequential patterns, and clustering which can apply to the Web domain. Classification is the form of data mining used in this study and is a technique that uses a set of pre-classified examples to develop a model that can classify the products of records. There are many algorithms for classification such as decision tree, neural network classification, etc. The classification algorithm starts with a training set of pre-defined example transactions. The classifier training algorithm uses these pre-defined examples to determine the set of parameters required for proper discrimination. The algorithm encodes these parameters into a model called a classifier. After an effective classifier is developed, it will be used in a predictive mode to classify new records into these same pre-defined classes. For instance, one classifier that is capable of identifying customer behavior performance could be used to help in the decision of whether to provide a specific recommendation to an individual customer. In this study, the decision tree software C4.5 (Quinlan, 1993) is used, which is shown in detail in the case report. C4.5 is an algorithm introduced by Quinlan for inducing Decision Trees from data. 6. Results and Discussion In this research, mobile information retrieved from Amazon e-commerce API. A binary decision tree was built to classify the access log file. The log file contains name of the product, visited time, price, and quantity. The log data were examined to identify which product having no demand by analyzing behavior of customer. This could predict grades. 484

Table.1. Sample log file from Amazon e-commerce API Table.1. shows browsed information. This kind of information could predict the users next action and offer personalized website content and service based on corresponding forecast. Fig.1. Graph showing the Frequency of browsing. Fig.1 showing the Frequency of browsing by the customers. It will help vendor hypothetically a new or customized recommendation could be made for future demand. 485

7. Conclusion In this research, we explained the use of Web usage mining approaches using APIs and identified some illustrative learning patterns that can be found by using Web-mining approaches. Although some interesting patterns were found, the exploratory state of Web Usage mining tools in education suggests replication and confirmation from other forms of research to build an innovative business application. The primary findings of this research are to suggest that Web usage mining can be an approach that educational researchers can use, and when combined with other forms of data collection has potential for adding to the way we build knowledge about pattern analysis. A second contribution of the current study is to draw implications for how to improve the process of Web Usage mining with API services. The current research has shown a innovative business web usage mining application by utilizing Amazon API with minimum time by avoiding preprocessing phase and main concentration on analysis has promise for identifying patterns within the large processes. This achievement might be improved by availing various APIs and by introducing new mining methods. These results and our ability to use Web usage mining in e-learning and e-commerce are quite preliminary, and there is a need for further exploration and possible adaptation of the forms and usage of web mining to best suit education and business. 486

References [1] Buchner, A. and Mulvenna, M.D. (1998). Discovering Internet Marketing Intelligence through Online Analytical Web Usage Mining. SIGMOD Record, 27(4):54-61 [2] Chakrabarti, S., Dom, B., and Indyk P. (1998). Enhanced hypertext categorization using hyperlinks. In Proc. of the ACM SIGMOD Conference on Management of Data, Seattle, Washington: ACM Press, pp. 307 318 [3] Cooley, R. Mobasher, B. Srivastava, J. (1997). Web Mining: Information and Pattern Discovery on the World Wide Web. In Proc. of IEEE International Conference on Tools with Artificial Intelligence, Newport Beach, CA, pp558~567 [4] Etzioni, O. (1996).The World-Wide Web: Quagmire or Goldmine? Communications of the ACM, vol. 39, no. 11, pp65-68. [5] Luan, J. (2002). Data Mining and Its Applications in Higher Education. New Directions for Institutional Research, 2002, 113 (Spring), pp17-36. [6] Padmanabhan, B. and Tuzhilin, A. (1998). A Belief-Driven Method for Discovering Unexpected Patterns. In Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining (KDD-98). AAAI, Newport Beach, California, USA, pp94--100. [7] Quinlan, J. (1993). C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, San Mateo, CA. 487