Personalized Information Retrieval by Using Adaptive User Profiling and Collaborative Filtering
|
|
- Allison Walker
- 6 years ago
- Views:
Transcription
1 Personalized Information Retrieval by Using Adaptive User Profiling and Collaborative Filtering Department of Computer Science & Engineering, Hanyang University doi: /aiss.vol2. issue4.14 Abstract Many search engines such as Yahoo, Google, MSN, and AltaVista, have been developed to meet various users search needs in real world. In general, because of the lack of the personal information such as hobby, preferences, and interests, these existing information retrieval systems are unsuitable to provide personalized search results to users. In this paper, we propose an adaptive user profiling method using dynamic updating policy considering the change of the users preferences over time and domain. Moreover, we employ collaborative filtering method to handle the situation that users preferences are frequently or continuously changed. Experimental results show that our method considerably improved personalized search performance for each user through automatic creation, maintenance, and personalization of user preference profiles that include search patterns of individual users. Keywords: Personalized Information Retrieval, Adaptive User Profiling, Collaborative Filtering 1. Introduction With the rapid growth of the World Wide Web, there is a vast amount of available data on the web and the number of the available information on the web rapidly increases. To overcome a flood of information, and provide personalized information to users is an important issue in information retrieval field. Many search engines such as Yahoo, Google, MSN, and AltaVista, have been developed to meet various users search needs in real world. In general, because of the lack of the personal information such as hobby, preferences, and interests, these existing information retrieval systems are unsuitable to provide personalized search results to users. As a solution to this problem, user preference profile approach has been employed to provide personalized search results to each user[5]. In this method, users preferences are compared with search results and then the matched results are served to the user. In general, user preference profiles consist of the results of lexical analysis, the input query, the documents that were clicked by the user, the queries that were used by the user in the past, and some weight values. However, a user preference profile that includes incorrect user preferences only gives a pain to users. Incorrect user preferences are generally obtained by the static profile approach. In this static profile approach, preferences or weight values are static and not changed once the user preference profile is created. Most portal systems use the static profile approach to provide personalized information. Because users preferences vary over time, place, context, or domain, the static profile approach has a high chance of having incorrect users preferences. To address this problem, various learning techniques, such as Bayesian classifiers, neural networks, and genetic algorithms (GAs), have been utilized for revising user profiles in several studies [8, 9, 10, 11], achieving various levels of improvement. However, these studies have a problem of overspecialization, i.e., users can only obtain the information indicated in their profiles and have no chance of exploring new information they might desire. Moreover, because of the complication of user profiles, the learning processes are always time consuming and are not appropriate if user preferences change rapidly and frequently. To solve the problem, in this paper, we propose an adaptive user profiling method using dynamic updating policy considering the change of the users preferences over time and domain. Moreover, we employ collaborative filtering method to deal with the situation that users preferences are frequently or continuously changed. Experimental results show that our method considerably improved personalized
2 Advances in Information Sciences and Service Sciences Volume 2, Number 4, December 2010 search performance for each user through automatic creation, maintenance, and personalization of user preference profiles that include search patterns of individual users. By using this user profile, our system can provide more personalized search results to users. The remainder of this paper is organized as follows. Section 2 describes related work of several user profile technologies for personalized information retrieval. Section 3 presents the architecture of our proposed system and its components. Section 4 describes some technologies to find similar users and experts, and explains the employed collaborative filtering. Section 5 shows the results of experiments about the satisfaction of participants. The last part of this paper summarizes the employed adaptive user profiling, and gives conclusions and discusses future researches. 2. Related Work In this section, we describe the current status of user profile-based personalized information retrieval systems with their problems. In general, most personalized information retrieval systems use both the user preference profile method and the filtering method that is commonly used in recommendation systems. We also employ both the user profile method and the collaborative filtering method, but our proposed system can provide more personalized and optimized search results to users by overcoming and improving the problems of existing systems Adaptive User Profiling In [1], the system uses a user profile approach to minimize the difference between users perception and the physical features of data in an information retrieval system. By using the user profile method, the system could provide more suitable search results to users. The approach in this system just learns the created user profiles using machine learning technology. Therefore, if the user profiles have incorrect information then the system gives irrelevant results to users. Users can only obtain the information indicated in their profiles and have no chance of exploring new information they might desire. Moreover, because of the complication of user profiles, the learning processes are always time consuming and are not appropriate if user preferences change rapidly and frequently. Compared to this system, our method utilizes the users relevance feedback to improve the profiles automatically using a genetic algorithm. A user sends feedbacks by clicking one of the search results, and then our system updates the user s profile by using the given feedbacks. The system describe in [5] focuses on the change of user preferences. The core problem of personalized recommendation is to model and track users' interests and their changes. To address this problem, both content-based filtering (CBF) and collaborative filtering (CF) have been explored in this system. User interests involve the interests on fixed categories and dynamic events, yet in the current CBF approaches, there is a lack of ability to model user's interests at the event level. So, the system in [5] proposed a novel approach to user profile modeling. In this model, user's interests are modeled by a multi-layer tree with a dynamically changeable structure, the top layers of which are used to model user interests on fixed categories, and the bottom layers are for dynamic events. Thus, this model can track the user's reading behaviors on both fixed categories and dynamic events, and consequently capture the interest changes. A modified CF algorithm based on the hierarchically structured profile model is also proposed. However, the proposed system in [5] is unsuitable in the situation that user preference information is frequently changed. In summary, our system employs user profile approach and collaborative filtering approach to deal a situation that user preferences are continuously and frequently changed Personalized Information Retrieval The study in [2] focuses on utilizing clickthrough data to improve the performance of Web search. Since millions of searches are conducted every day, a search engine accumulates a large volume of clickthrough data, which records who submits queries and which pages he/she clicks on. The clickthrough data is highly sparse and contains different types of objects (user, query and Web page), and the relationships among these objects are also very complicated
3 [2] attempts to discover Web users interests and the patterns that users locate information by analyzing these data. The clickthrough data is represented by a 3-order tensor, on which 3-mode analysis is performed using the higher-order singular value decomposition technique to automatically capture the latent factors that govern the relations among these multi-type objects: users, queries and Web pages. A tensor reconstructed based on the CubeSVD analysis reflects both the observed interactions among these objects and the implicit associations among them. Therefore, Web search activities can be carried out based on CubeSVD analysis. However, it is difficult to apply commonly used search engines, because analysis is very complex. [3] focuses on how to model the user and his/her context in an extensible way that can be interpreted and used for personalization. [3] describes the architecture that provides personalization facilities based on the contextual user model for tourism usage. User modeling begins with the creation of a user profile. In [3], each user profile is created based on the ontology, and is used to influence the current context of the user. The ontology includes pairs of context and user behaviors in the past, and influences users current context and behaviors. Therefore, the proposed system in [3] can provide personalized information by modeling based on the ontology. However, the abovementioned existing systems are all unsuitable in the situation that user preference information is continuously and frequently changed. [4] is working on a new system which learns to improve retrieval effectiveness by integrating the following factors: A. The user characteristics (user model or user profile). B. The characteristics in the interaction of the other users (social IR, stereotypes and collaborative information retrieval). C. The context of the research (context modeling). Such system may have the potential to overcome the current plateau in ad-hoc retrieval. [4] concerns on the first two elements: the user profile and the Collaborative Information Retrieval (CIR). CIR is an approach which learns to improve retrieval effectiveness from the interaction of different users with the retrieval system. Collaboration here assumes that users can benefit from search processes carried out at former times by other users although they may not know about the other users and their search processes. In other words, collaborative search records the fact that a result d has been selected for query q, and then reuses this information for similar queries in the future, by promoting results that were reliably selected in the past. However the goals and the characteristics of two users may be different so when they send the same query to a CIR system, they may be interested in two different lists of documents (known as personalization problem). Personalization is a common problem which the CIR researchers often encounter in constructing their systems. The proposed personalized system in [4] is the first attempt toward resolving the problem of personalization in the CIR systems by incorporating the user profiles. [4] uses three Profile Similarity (PS) calculation methods: query based PS, document based PS and query-document based PS. Query based PS approach merely considers the queries in the user profile. [4] believes that the user queries can partially represent the needs and the preferences of the users because the users express their requirements formally with the queries. Document based PS approach absolutely considers the documents that the user has studied or has marked as pertinent to his requirements. These marked documents lead the system to determine the users' needs. When a user reads a particular document it can be judged that the user's need is related to the content of this document. Thus the marked documents in a user profile can be useful to estimate the similarity between two profiles. This approach is very similar to the former approach (query based) except that instead of queries it deals with the documents the user has marked before. [4] uses both query based and document based approaches to partially capture the users' interests. However, the proposed system in [4] is also unsuitable in the situation that user preference information is continuously and frequently changed. 3. System
4 Advances in Information Sciences and Service Sciences Volume 2, Number 4, December 2010 In this section, we will introduce an adaptive user profiling model which can model user interests at both category (or domain) and event levels. After that, we will describe how it can be learned and utilized for prediction System Architecture As illustrated in Figure 1, our proposed system provides the search results that are retrieved from user profile search with weight value and from an existing search engine with weight value (1- ). This weight value is initialized to 0.5. This value is used to re-rank the search results, which means that from 10 results, our system selects top 5 results from user profile search and an existing search engine. This weight value is updated whenever users give feedback to the system. By increasing the value, the number of search results from user profile search increases. On the contrary, if the value decreases then the number of search results from user profile search decreases. Since user feedback is reflected on user profiles, the value is dwindled at the initial stage, and with more feedbacks, it is increased User Profile Figure 1. System Architecture As illustrated in Figure 2, a user profile in our system has a hierarchical multi-layer structure. Top layer is a domain layer that contains web search results that were selected by the user. Bottom layer is a search result layer that contains results that were selected by the user. Figure 2. User Profile Structure We define some parameters used in Figure 2 as follows :
5 U : a set of Domains, U = {Domain 1, Domain 2,, Domain n } Domain n : domain that includes the documents selected by the user. Domain n = <T 1 :WD n T 1, T 2 :WD n T 2,, T m :WD n T m > T : a term vector for the given user query T = {t 1, t 2, t 3,, t n } t = a term of the user query D : a set of documents selected by the user D = {D 1, D 2, D 3,, D n } D i = {t i1, t i2, t i3,, t in } t = an index term of the document selected by the user WD i T j : the probability(or weight) of the query using T j by the user in domain D i WT i D j : the probability(or weight) that the user will select document D j By maintaining this hierarchical structure, system can more effectively deal with continuously changed user interests. To address frequently changed user interests, system provides more personalized search results or information by using collaborative filtering method. Our system finds users who have similar interests or much information for a given query, and then returns filtered results. Also, our system can deal with two situations by using the search results from existing search engines. The first situation is where there are no search results, and the second situation is where user interests are suddenly changed Probabilistic approach for personalized information retrieval Once a user query is inputted, our system launches the search process for the given query in two phases. The first phase is the preprocessing phase for the given query. A query is represented as a set as follows. Q = {t 1, t 2, t 3,, t n } Q : a set of user query terms, t = a term of the query For each domain, our system selects similar queries that were used in the past by the user. The system takes the queries that have similarity values greater than threshold. The value is decided through experiments. We use a modified cosine similarity method as a similarity measurement. n Q Sim T, w ti w qi, wti tf WD jti, w qi tf 1 (1) i 1 The second phase is the searching phase for the given query. For each query that was used in the past, the system selects similar documents that were used in the past by the user. The system takes the documents that have similarity values (or expected values) greater than threshold. As before, we use a modified cosine similarity method as a similarity measurement, n Sim D, Q w di w qi, wdi tf WTjDi, i 1 1 w qi tf N (2) where N denotes the number of documents selected by the user Discussion In the engineering psychology, researchers represent the diagnostic stage of decision making as a process by which the decision maker is confronted by a series of cues or sources of information, as
6 Advances in Information Sciences and Service Sciences Volume 2, Number 4, December 2010 shown below, bearing on the true state of the world. The decision maker attends to some or all of these with the goal of using those cues to influence belief in one of several alternative hypotheses[6]. The process of decision making of human in the engineering psychology is as follows : A. Take hypotheses(true state of the world) B. Extract physical features from cues based on hypotheses C. Apply weight(or probability) to physical features based on experience D. Choose data(information) Therefore, using the past query and documents that were clicked by the user is very useful and efficient in personalized information retrieval. Our proposed system reflects this process through rerank factor and user profile architecture. 4. User Finding and Collaborative Filtering Our system finds similar users and expert users in the domain to provide more personalized results. By using the collaborative filtering method, the system can handle no results situation or unsatisfied situation where all the results have similarity values less than threshold value. These situations imply that user interests are very fast and suddenly changed Similar Users Equation (3) measures the distance between a term vector T in the profile of one user and a term vector T in the profile of another user. A shorter distance means more similar users. Finding similar users enables the handling of frequently and continuously changing user interests feasible through collaborative filtering Expert Users Dis T T t t n i , (3) An expert user is selected from the outside users who have not similar query patterns with the current user, but have much high quality information for a given query. In this paper, 5 experts are searched for each domain. The process of finding expert users proceeds off-line. 5. Experiments To evaluate the performance of the adaptive user profiling approach, we adopted a satisfaction degree measure that has been used in machine learning and classification society. In this paper, we measured the satisfaction for the adaptive user profiling using the following formula. 5 users participated in the experiments for 4 weeks. Number of Retrieved Satisfied Documents SAT (4) Number of Retrieved Documents(10) Equation (4) measures the satisfaction degree by calculating the ratio of the number of retrieved relevant documents over the retrieved top 10 documents. Each user carried out at least 2 queries everyday, and our system did not explicitly record the satisfaction for each result, instead we gathered the satisfaction data through the users implicit feedbacks by using the decision tree conditions of [12]. Figure 3 shows that the satisfaction degree of each user for the top 10 search results during the experiments that were carried out for 4 weeks. As shown in Figure 3, the number of relevant documents in top 10 for a given query is increased for most participants as time goes by. This result indicates that the re-access ratio for the same documents is increased for most participants. On the
7 contrary, the decrease of the number of satisfying documents over time implies the decrease of the reaccess ratio, and this situation happens occasionally including the case for user1 from the first to the second week. The cause of the decrease of the ratio is that a user uses a query using new terms, finds some new documents whenever doing search, or re-accesses the same documents by using queries with different terms. Figure 3. The satisfaction for top 10 search results Figure 4 show a comparison of the average number of the relevant documents in the top 10 results by using the adaptive user profiling method with the Google search results for the same queries. As represented in this figure, the adaptive user profiling approach outperforms Google s by 16.4%. Figure 4. A comparison of the satisfaction for the top 10 search results Figure 5 shows the reflection values of each user. A user who has a smaller reflection value implies that he/she used different queries whenever doing search, or employed different queries to re-access the same documents. On the other hand, a user who has a larger value implies that he/she has higher reaccess ratio for the same documents and frequently uses the same query
8 Advances in Information Sciences and Service Sciences Volume 2, Number 4, December 2010 Figure 5. Reflection value of each user 6. Conclusion We have proposed an adaptive user profiling method using dynamic updating policy considering the change of the user preferences over time and domain. Moreover, we used collaborative filtering method to deal with the situation that user preferences are frequently and continuously changed. Our proposed system reflects the process of decision making of human that was described in the engineering psychology through re-rank factor and user profile architecture. Our system eventually returns improved, personalized search results for each user through automatic creation, maintenance, and personalization of user preference profiles that include search patterns for each user. To improve the performance of personalized information retrieval, the various analysis information for web documents that were selected by users has need to utilize such as personal PageRank. 7. References [1] C. Shahabi, Y. Chen, Automatically Improving the Accuracy of User Profiles with Genetic Algorithms, In Proceedings of the Fourth Annual IASTED International Conference on Artificial Intelligence and Soft Computing(ASC2001), Cancun, Mexico, pp , [2] J. Sun, H. Zeng, H. Liu, Y. Lu, and Z. Chen, CubeSVD: A novel approach to personalized web search, In Proceedings of the Fourteenth International World Wide Web Conference, ACM Press, pp , [3] Z. Jrad, M. Aufaure, M. Hadjouni, A Contextual User Model for Web Personalization, In Proceedings of the Wise Workshops 2007, pp , [4] H. Naderi, and B. Rumpler, PERCIRS: a PERsonalized Collaborative. Information Retrieval System, In Proceedings of the INFORSID, pp , [5] J. Wang, Z. Li, J. Yao, Z. Sun, M. Li, and W. Ma, Adaptive User Profile Model and Collaborative Filtering for Personalized News, In Proceedings of the APWeb 2006, pp , [6] C.D. Wickens, and J.G. Hollands, Engineering Psychology and Human Performance, in Proceedings of the 3rd ed. Upper Saddle River, NJ: Prentice-Hall Inc, pp , [7] B. van Gils, H.A. Proper, P. van Bommel, and E.D. Schabell, Profile-based retrieval on the World Wide Web, In Proceedings of the Conferentie Informatiewetenschap(INFWET2003), pp , [8] M. Pazzani, and D. Billsus, Learning and revising user profiles: The identification of interesting web sites, In Proceedings of the Machine Learning, pp. 27: , [9] A. Tan and C. Teo, Learning user profiles for personalized information dissemination, In Proceedings of the International Joint Conference on Neural Network, pp ,
9 [10] W. Lam, S. Mukhopadhyay, J. Mostafa, and M. Palakal, Detection of shifts in user interests for personalized information filtering, In Proceedings of the 19th International ACMSIGIR Conference on Research and Development in Information Retrieval, pp , [11] A. Moukas, Amalthea: Information discovery and filtering using a multiagent evolving ecosystem, In Proceedings of the Conference on Practical Applications of Agents and Multiagent Technology (PAAM), vol. 11, pp , [12] S. Fox, K. Karnawat, M. Mydland, S. Dumais and T. White, "Evaluating implicit measures to improve web search", ACM Transactions on Information Systems, vol.23, no.2, pp. 147~168,
Automated Online News Classification with Personalization
Automated Online News Classification with Personalization Chee-Hong Chan Aixin Sun Ee-Peng Lim Center for Advanced Information Systems, Nanyang Technological University Nanyang Avenue, Singapore, 639798
More informationDomain Specific Search Engine for Students
Domain Specific Search Engine for Students Domain Specific Search Engine for Students Wai Yuen Tang The Department of Computer Science City University of Hong Kong, Hong Kong wytang@cs.cityu.edu.hk Lam
More informationKeywords APSE: Advanced Preferred Search Engine, Google Android Platform, Search Engine, Click-through data, Location and Content Concepts.
Volume 5, Issue 3, March 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Advanced Preferred
More informationUsing Text Learning to help Web browsing
Using Text Learning to help Web browsing Dunja Mladenić J.Stefan Institute, Ljubljana, Slovenia Carnegie Mellon University, Pittsburgh, PA, USA Dunja.Mladenic@{ijs.si, cs.cmu.edu} Abstract Web browsing
More informationEnhancing Cluster Quality by Using User Browsing Time
Enhancing Cluster Quality by Using User Browsing Time Rehab Duwairi Dept. of Computer Information Systems Jordan Univ. of Sc. and Technology Irbid, Jordan rehab@just.edu.jo Khaleifah Al.jada' Dept. of
More informationLetter Pair Similarity Classification and URL Ranking Based on Feedback Approach
Letter Pair Similarity Classification and URL Ranking Based on Feedback Approach P.T.Shijili 1 P.G Student, Department of CSE, Dr.Nallini Institute of Engineering & Technology, Dharapuram, Tamilnadu, India
More informationEnhancing Cluster Quality by Using User Browsing Time
Enhancing Cluster Quality by Using User Browsing Time Rehab M. Duwairi* and Khaleifah Al.jada'** * Department of Computer Information Systems, Jordan University of Science and Technology, Irbid 22110,
More informationContextual Search Using Ontology-Based User Profiles Susan Gauch EECS Department University of Kansas Lawrence, KS
Vishnu Challam Microsoft Corporation One Microsoft Way Redmond, WA 9802 vishnuc@microsoft.com Contextual Search Using Ontology-Based User s Susan Gauch EECS Department University of Kansas Lawrence, KS
More informationA Framework on Ontology Based Classification and Clustering for Grouping Research Proposals
A Framework on Ontology Based Classification and Clustering for Grouping Research Proposals KODAM ANUSHA* 1 PG Scholar, Dept of CSE, Kakatiya Institute of Technology and Science, Warangal. Abstract- The
More informationMining Web Data. Lijun Zhang
Mining Web Data Lijun Zhang zlj@nju.edu.cn http://cs.nju.edu.cn/zlj Outline Introduction Web Crawling and Resource Discovery Search Engine Indexing and Query Processing Ranking Algorithms Recommender Systems
More informationINTELLIGENT SYSTEMS OVER THE INTERNET
INTELLIGENT SYSTEMS OVER THE INTERNET Web-Based Intelligent Systems Intelligent systems use a Web-based architecture and friendly user interface Web-based intelligent systems: Use the Web as a platform
More informationInternational Journal of Innovative Research in Computer and Communication Engineering
Optimized Re-Ranking In Mobile Search Engine Using User Profiling A.VINCY 1, M.KALAIYARASI 2, C.KALAIYARASI 3 PG Student, Department of Computer Science, Arunai Engineering College, Tiruvannamalai, India
More informationA Vector Space Equalization Scheme for a Concept-based Collaborative Information Retrieval System
A Vector Space Equalization Scheme for a Concept-based Collaborative Information Retrieval System Takashi Yukawa Nagaoka University of Technology 1603-1 Kamitomioka-cho, Nagaoka-shi Niigata, 940-2188 JAPAN
More informationContext based Re-ranking of Web Documents (CReWD)
Context based Re-ranking of Web Documents (CReWD) Arijit Banerjee, Jagadish Venkatraman Graduate Students, Department of Computer Science, Stanford University arijitb@stanford.edu, jagadish@stanford.edu}
More informationMURDOCH RESEARCH REPOSITORY
MURDOCH RESEARCH REPOSITORY http://researchrepository.murdoch.edu.au/ This is the author s final version of the work, as accepted for publication following peer review but without the publisher s layout
More informationSuccess Index: Measuring the efficiency of search engines using implicit user feedback
Success Index: Measuring the efficiency of search engines using implicit user feedback Apostolos Kritikopoulos, Martha Sideri, Iraklis Varlamis Athens University of Economics and Business, Patision 76,
More informationA Hybrid Recommender System for Dynamic Web Users
A Hybrid Recommender System for Dynamic Web Users Shiva Nadi Department of Computer Engineering, Islamic Azad University of Najafabad Isfahan, Iran Mohammad Hossein Saraee Department of Electrical and
More informationNUS-I2R: Learning a Combined System for Entity Linking
NUS-I2R: Learning a Combined System for Entity Linking Wei Zhang Yan Chuan Sim Jian Su Chew Lim Tan School of Computing National University of Singapore {z-wei, tancl} @comp.nus.edu.sg Institute for Infocomm
More informationA Novel Categorized Search Strategy using Distributional Clustering Neenu Joseph. M 1, Sudheep Elayidom 2
A Novel Categorized Search Strategy using Distributional Clustering Neenu Joseph. M 1, Sudheep Elayidom 2 1 Student, M.E., (Computer science and Engineering) in M.G University, India, 2 Associate Professor
More informationMetaData for Database Mining
MetaData for Database Mining John Cleary, Geoffrey Holmes, Sally Jo Cunningham, and Ian H. Witten Department of Computer Science University of Waikato Hamilton, New Zealand. Abstract: At present, a machine
More informationIntelligent Risk Identification and Analysis in IT Network Systems
Intelligent Risk Identification and Analysis in IT Network Systems Masoud Mohammadian University of Canberra, Faculty of Information Sciences and Engineering, Canberra, ACT 2616, Australia masoud.mohammadian@canberra.edu.au
More informationA PERSONALIZED RECOMMENDER SYSTEM FOR TELECOM PRODUCTS AND SERVICES
A PERSONALIZED RECOMMENDER SYSTEM FOR TELECOM PRODUCTS AND SERVICES Zui Zhang, Kun Liu, William Wang, Tai Zhang and Jie Lu Decision Systems & e-service Intelligence Lab, Centre for Quantum Computation
More informationVISUAL RERANKING USING MULTIPLE SEARCH ENGINES
VISUAL RERANKING USING MULTIPLE SEARCH ENGINES By Dennis Lim Thye Loon A REPORT SUBMITTED TO Universiti Tunku Abdul Rahman in partial fulfillment of the requirements for the degree of Faculty of Information
More informationComment Extraction from Blog Posts and Its Applications to Opinion Mining
Comment Extraction from Blog Posts and Its Applications to Opinion Mining Huan-An Kao, Hsin-Hsi Chen Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan
More informationRecommendation System for Location-based Social Network CS224W Project Report
Recommendation System for Location-based Social Network CS224W Project Report Group 42, Yiying Cheng, Yangru Fang, Yongqing Yuan 1 Introduction With the rapid development of mobile devices and wireless
More informationAN ENHANCED ATTRIBUTE RERANKING DESIGN FOR WEB IMAGE SEARCH
AN ENHANCED ATTRIBUTE RERANKING DESIGN FOR WEB IMAGE SEARCH Sai Tejaswi Dasari #1 and G K Kishore Babu *2 # Student,Cse, CIET, Lam,Guntur, India * Assistant Professort,Cse, CIET, Lam,Guntur, India Abstract-
More informationContextual Information Retrieval Using Ontology-Based User Profiles
Contextual Information Retrieval Using Ontology-Based User Profiles Vishnu Kanth Reddy Challam Master s Thesis Defense Date: Jan 22 nd, 2004. Committee Dr. Susan Gauch(Chair) Dr.David Andrews Dr. Jerzy
More informationReview on Techniques of Collaborative Tagging
Review on Techniques of Collaborative Tagging Ms. Benazeer S. Inamdar 1, Mrs. Gyankamal J. Chhajed 2 1 Student, M. E. Computer Engineering, VPCOE Baramati, Savitribai Phule Pune University, India benazeer.inamdar@gmail.com
More informationA Constrained Spreading Activation Approach to Collaborative Filtering
A Constrained Spreading Activation Approach to Collaborative Filtering Josephine Griffith 1, Colm O Riordan 1, and Humphrey Sorensen 2 1 Dept. of Information Technology, National University of Ireland,
More informationInformation Retrieval
Information Retrieval CSC 375, Fall 2016 An information retrieval system will tend not to be used whenever it is more painful and troublesome for a customer to have information than for him not to have
More informationIJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, ISSN:
IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, 20131 Improve Search Engine Relevance with Filter session Addlin Shinney R 1, Saravana Kumar T
More informationsecond_language research_teaching sla vivian_cook language_department idl
Using Implicit Relevance Feedback in a Web Search Assistant Maria Fasli and Udo Kruschwitz Department of Computer Science, University of Essex, Wivenhoe Park, Colchester, CO4 3SQ, United Kingdom fmfasli
More informationThe Comparative Study of Machine Learning Algorithms in Text Data Classification*
The Comparative Study of Machine Learning Algorithms in Text Data Classification* Wang Xin School of Science, Beijing Information Science and Technology University Beijing, China Abstract Classification
More informationAdaptive Search Engines Learning Ranking Functions with SVMs
Adaptive Search Engines Learning Ranking Functions with SVMs CS478/578 Machine Learning Fall 24 Thorsten Joachims Cornell University T. Joachims, Optimizing Search Engines Using Clickthrough Data, Proceedings
More informationSuccess Index: Measuring the efficiency of search engines using implicit user feedback
Success Index: Measuring the efficiency of search engines using implicit user feedback Apostolos Kritikopoulos, Martha Sideri, Iraklis Varlamis Athens University of Economics and Business Patision 76,
More informationEvaluation of Meta-Search Engine Merge Algorithms
2008 International Conference on Internet Computing in Science and Engineering Evaluation of Meta-Search Engine Merge Algorithms Chunshuang Liu, Zhiqiang Zhang,2, Xiaoqin Xie 2, TingTing Liang School of
More informationDevelopment of an Ontology-Based Portal for Digital Archive Services
Development of an Ontology-Based Portal for Digital Archive Services Ching-Long Yeh Department of Computer Science and Engineering Tatung University 40 Chungshan N. Rd. 3rd Sec. Taipei, 104, Taiwan chingyeh@cse.ttu.edu.tw
More informationLearning to Match. Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li
Learning to Match Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li 1. Introduction The main tasks in many applications can be formalized as matching between heterogeneous objects, including search, recommendation,
More informationINTRODUCTION. Chapter GENERAL
Chapter 1 INTRODUCTION 1.1 GENERAL The World Wide Web (WWW) [1] is a system of interlinked hypertext documents accessed via the Internet. It is an interactive world of shared information through which
More informationMining Web Data. Lijun Zhang
Mining Web Data Lijun Zhang zlj@nju.edu.cn http://cs.nju.edu.cn/zlj Outline Introduction Web Crawling and Resource Discovery Search Engine Indexing and Query Processing Ranking Algorithms Recommender Systems
More informationA Network Intrusion Detection System Architecture Based on Snort and. Computational Intelligence
2nd International Conference on Electronics, Network and Computer Engineering (ICENCE 206) A Network Intrusion Detection System Architecture Based on Snort and Computational Intelligence Tao Liu, a, Da
More informationTERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES
TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES Mu. Annalakshmi Research Scholar, Department of Computer Science, Alagappa University, Karaikudi. annalakshmi_mu@yahoo.co.in Dr. A.
More informationA New Technique to Optimize User s Browsing Session using Data Mining
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 3, March 2015,
More informationLearning Ranking Functions with SVMs
Learning Ranking Functions with SVMs CS4780/5780 Machine Learning Fall 2014 Thorsten Joachims Cornell University T. Joachims, Optimizing Search Engines Using Clickthrough Data, Proceedings of the ACM Conference
More informationCHAPTER THREE INFORMATION RETRIEVAL SYSTEM
CHAPTER THREE INFORMATION RETRIEVAL SYSTEM 3.1 INTRODUCTION Search engine is one of the most effective and prominent method to find information online. It has become an essential part of life for almost
More informationR. R. Badre Associate Professor Department of Computer Engineering MIT Academy of Engineering, Pune, Maharashtra, India
Volume 7, Issue 4, April 2017 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Web Service Ranking
More informationII. RELATED WORK. A. Probabilistic roadmap path planner
Gaussian PRM Samplers for Dynamic Configuration Spaces Yu-Te Lin and Shih-Chia Cheng Computer Science Department Stanford University Stanford, CA 94305, USA {yutelin, sccheng}@cs.stanford.edu SUID: 05371954,
More informationCompetitive Intelligence and Web Mining:
Competitive Intelligence and Web Mining: Domain Specific Web Spiders American University in Cairo (AUC) CSCE 590: Seminar1 Report Dr. Ahmed Rafea 2 P age Khalid Magdy Salama 3 P age Table of Contents Introduction
More informationSemantic Clickstream Mining
Semantic Clickstream Mining Mehrdad Jalali 1, and Norwati Mustapha 2 1 Department of Software Engineering, Mashhad Branch, Islamic Azad University, Mashhad, Iran 2 Department of Computer Science, Universiti
More informationEstimating Credibility of User Clicks with Mouse Movement and Eye-tracking Information
Estimating Credibility of User Clicks with Mouse Movement and Eye-tracking Information Jiaxin Mao, Yiqun Liu, Min Zhang, Shaoping Ma Department of Computer Science and Technology, Tsinghua University Background
More informationAn Application of Genetic Algorithm for Auto-body Panel Die-design Case Library Based on Grid
An Application of Genetic Algorithm for Auto-body Panel Die-design Case Library Based on Grid Demin Wang 2, Hong Zhu 1, and Xin Liu 2 1 College of Computer Science and Technology, Jilin University, Changchun
More informationWeb Information Retrieval using WordNet
Web Information Retrieval using WordNet Jyotsna Gharat Asst. Professor, Xavier Institute of Engineering, Mumbai, India Jayant Gadge Asst. Professor, Thadomal Shahani Engineering College Mumbai, India ABSTRACT
More informationAutomated Cognitive Walkthrough for the Web (AutoCWW)
CHI 2002 Workshop: Automatically Evaluating the Usability of Web Sites Workshop Date: April 21-22, 2002 Automated Cognitive Walkthrough for the Web (AutoCWW) Position Paper by Marilyn Hughes Blackmon Marilyn
More informationCLUSTERING, TIERED INDEXES AND TERM PROXIMITY WEIGHTING IN TEXT-BASED RETRIEVAL
STUDIA UNIV. BABEŞ BOLYAI, INFORMATICA, Volume LVII, Number 4, 2012 CLUSTERING, TIERED INDEXES AND TERM PROXIMITY WEIGHTING IN TEXT-BASED RETRIEVAL IOAN BADARINZA AND ADRIAN STERCA Abstract. In this paper
More informationDeep Web Crawling and Mining for Building Advanced Search Application
Deep Web Crawling and Mining for Building Advanced Search Application Zhigang Hua, Dan Hou, Yu Liu, Xin Sun, Yanbing Yu {hua, houdan, yuliu, xinsun, yyu}@cc.gatech.edu College of computing, Georgia Tech
More informationIRCE at the NTCIR-12 IMine-2 Task
IRCE at the NTCIR-12 IMine-2 Task Ximei Song University of Tsukuba songximei@slis.tsukuba.ac.jp Yuka Egusa National Institute for Educational Policy Research yuka@nier.go.jp Masao Takaku University of
More informationihits: Extending HITS for Personal Interests Profiling
ihits: Extending HITS for Personal Interests Profiling Ziming Zhuang School of Information Sciences and Technology The Pennsylvania State University zzhuang@ist.psu.edu Abstract Ever since the boom of
More informationFinding Topic-centric Identified Experts based on Full Text Analysis
Finding Topic-centric Identified Experts based on Full Text Analysis Hanmin Jung, Mikyoung Lee, In-Su Kang, Seung-Woo Lee, Won-Kyung Sung Information Service Research Lab., KISTI, Korea jhm@kisti.re.kr
More informationA Metric for Inferring User Search Goals in Search Engines
International Journal of Engineering and Technical Research (IJETR) A Metric for Inferring User Search Goals in Search Engines M. Monika, N. Rajesh, K.Rameshbabu Abstract For a broad topic, different users
More informationLearning Ranking Functions with SVMs
Learning Ranking Functions with SVMs CS4780/5780 Machine Learning Fall 2012 Thorsten Joachims Cornell University T. Joachims, Optimizing Search Engines Using Clickthrough Data, Proceedings of the ACM Conference
More informationIMAGE CONTENT IN LOCATION-BASED SHOPPING RECOMMENDER SYSTEMS FOR MOBILE USERS
IMAGE CONTENT IN LOCATION-BASED SHOPPING RECOMMENDER SYSTEMS FOR MOBILE USERS Tranos Zuva 1, Oludayo O. Olugbara, Sunday O. Ojo 3 and Seleman M. Ngwira 4 1, 4 Department of Computer Engineering, Tshwane
More informationSentiment analysis under temporal shift
Sentiment analysis under temporal shift Jan Lukes and Anders Søgaard Dpt. of Computer Science University of Copenhagen Copenhagen, Denmark smx262@alumni.ku.dk Abstract Sentiment analysis models often rely
More informationCopyright 2011 please consult the authors
Alsaleh, Slah, Nayak, Richi, Xu, Yue, & Chen, Lin (2011) Improving matching process in social network using implicit and explicit user information. In: Proceedings of the Asia-Pacific Web Conference (APWeb
More informationA Survey on Postive and Unlabelled Learning
A Survey on Postive and Unlabelled Learning Gang Li Computer & Information Sciences University of Delaware ligang@udel.edu Abstract In this paper we survey the main algorithms used in positive and unlabeled
More informationSocial Behavior Prediction Through Reality Mining
Social Behavior Prediction Through Reality Mining Charlie Dagli, William Campbell, Clifford Weinstein Human Language Technology Group MIT Lincoln Laboratory This work was sponsored by the DDR&E / RRTO
More informationPersonalized Information Retrieval
Personalized Information Retrieval Shihn Yuarn Chen Traditional Information Retrieval Content based approaches Statistical and natural language techniques Results that contain a specific set of words or
More informationWKU-MIS-B10 Data Management: Warehousing, Analyzing, Mining, and Visualization. Management Information Systems
Management Information Systems Management Information Systems B10. Data Management: Warehousing, Analyzing, Mining, and Visualization Code: 166137-01+02 Course: Management Information Systems Period: Spring
More informationImproving Relevance Prediction for Focused Web Crawlers
2012 IEEE/ACIS 11th International Conference on Computer and Information Science Improving Relevance Prediction for Focused Web Crawlers Mejdl S. Safran 1,2, Abdullah Althagafi 1 and Dunren Che 1 Department
More informationFSRM Feedback Algorithm based on Learning Theory
Send Orders for Reprints to reprints@benthamscience.ae The Open Cybernetics & Systemics Journal, 2015, 9, 699-703 699 FSRM Feedback Algorithm based on Learning Theory Open Access Zhang Shui-Li *, Dong
More informationThe Application Research of Semantic Web Technology and Clickstream Data Mart in Tourism Electronic Commerce Website Bo Liu
International Conference on Education Technology, Management and Humanities Science (ETMHS 2015) The Application Research of Semantic Web Technology and Clickstream Data Mart in Tourism Electronic Commerce
More informationPatent Classification Using Ontology-Based Patent Network Analysis
Association for Information Systems AIS Electronic Library (AISeL) PACIS 2010 Proceedings Pacific Asia Conference on Information Systems (PACIS) 2010 Patent Classification Using Ontology-Based Patent Network
More informationChapter 27 Introduction to Information Retrieval and Web Search
Chapter 27 Introduction to Information Retrieval and Web Search Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 27 Outline Information Retrieval (IR) Concepts Retrieval
More informationAnalysis on the technology improvement of the library network information retrieval efficiency
Available online www.jocpr.com Journal of Chemical and Pharmaceutical Research, 2014, 6(6):2198-2202 Research Article ISSN : 0975-7384 CODEN(USA) : JCPRC5 Analysis on the technology improvement of the
More informationRetrieval Evaluation
Retrieval Evaluation - Reference Collections Berlin Chen Department of Computer Science & Information Engineering National Taiwan Normal University References: 1. Modern Information Retrieval, Chapter
More informationCombining Review Text Content and Reviewer-Item Rating Matrix to Predict Review Rating
Combining Review Text Content and Reviewer-Item Rating Matrix to Predict Review Rating Dipak J Kakade, Nilesh P Sable Department of Computer Engineering, JSPM S Imperial College of Engg. And Research,
More informationContent-based Dimensionality Reduction for Recommender Systems
Content-based Dimensionality Reduction for Recommender Systems Panagiotis Symeonidis Aristotle University, Department of Informatics, Thessaloniki 54124, Greece symeon@csd.auth.gr Abstract. Recommender
More informationAutomatically Generating Queries for Prior Art Search
Automatically Generating Queries for Prior Art Search Erik Graf, Leif Azzopardi, Keith van Rijsbergen University of Glasgow {graf,leif,keith}@dcs.gla.ac.uk Abstract This report outlines our participation
More informationImage Mining: frameworks and techniques
Image Mining: frameworks and techniques Madhumathi.k 1, Dr.Antony Selvadoss Thanamani 2 M.Phil, Department of computer science, NGM College, Pollachi, Coimbatore, India 1 HOD Department of Computer Science,
More informationKeywords Data alignment, Data annotation, Web database, Search Result Record
Volume 5, Issue 8, August 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Annotating Web
More informationA Novel Approach for Inferring and Analyzing User Search Goals
A Novel Approach for Inferring and Analyzing User Search Goals Y. Sai Krishna 1, N. Swapna Goud 2 1 MTech Student, Department of CSE, Anurag Group of Institutions, India 2 Associate Professor, Department
More informationA novel supervised learning algorithm and its use for Spam Detection in Social Bookmarking Systems
A novel supervised learning algorithm and its use for Spam Detection in Social Bookmarking Systems Anestis Gkanogiannis and Theodore Kalamboukis Department of Informatics Athens University of Economics
More informationMining Frequent Itemsets for data streams over Weighted Sliding Windows
Mining Frequent Itemsets for data streams over Weighted Sliding Windows Pauray S.M. Tsai Yao-Ming Chen Department of Computer Science and Information Engineering Minghsin University of Science and Technology
More informationFault Diagnosis of Wind Turbine Based on ELMD and FCM
Send Orders for Reprints to reprints@benthamscience.ae 76 The Open Mechanical Engineering Journal, 24, 8, 76-72 Fault Diagnosis of Wind Turbine Based on ELMD and FCM Open Access Xianjin Luo * and Xiumei
More informationSearch Evaluation. Tao Yang CS293S Slides partially based on text book [CMS] [MRS]
Search Evaluation Tao Yang CS293S Slides partially based on text book [CMS] [MRS] Table of Content Search Engine Evaluation Metrics for relevancy Precision/recall F-measure MAP NDCG Difficulties in Evaluating
More informationContent Based Image Retrieval system with a combination of Rough Set and Support Vector Machine
Shahabi Lotfabadi, M., Shiratuddin, M.F. and Wong, K.W. (2013) Content Based Image Retrieval system with a combination of rough set and support vector machine. In: 9th Annual International Joint Conferences
More informationA Mobile Web Focused Search Engine Using Implicit Feedback
A Mobile Web Focused Search Engine Using Implicit Feedback Malvika Pimple Department of Computer Science University of North Dakota Grand Forks, ND 58202 malvika.pimple@email.und.edu Naima Kaabouch Department
More informationTop-N Recommendations from Implicit Feedback Leveraging Linked Open Data
Top-N Recommendations from Implicit Feedback Leveraging Linked Open Data Vito Claudio Ostuni, Tommaso Di Noia, Roberto Mirizzi, Eugenio Di Sciascio Polytechnic University of Bari, Italy {ostuni,mirizzi}@deemail.poliba.it,
More informationA Constrained Spreading Activation Approach to Collaborative Filtering
A Constrained Spreading Activation Approach to Collaborative Filtering Josephine Griffith 1, Colm O Riordan 1, and Humphrey Sorensen 2 1 Dept. of Information Technology, National University of Ireland,
More informationA Semantic Model for Concept Based Clustering
A Semantic Model for Concept Based Clustering S.Saranya 1, S.Logeswari 2 PG Scholar, Dept. of CSE, Bannari Amman Institute of Technology, Sathyamangalam, Tamilnadu, India 1 Associate Professor, Dept. of
More informationInternational Journal of Advance Engineering and Research Development. A Review Paper On Various Web Page Ranking Algorithms In Web Mining
Scientific Journal of Impact Factor (SJIF): 4.14 International Journal of Advance Engineering and Research Development Volume 3, Issue 2, February -2016 e-issn (O): 2348-4470 p-issn (P): 2348-6406 A Review
More informationTaccumulation of the social network data has raised
International Journal of Advanced Research in Social Sciences, Environmental Studies & Technology Hard Print: 2536-6505 Online: 2536-6513 September, 2016 Vol. 2, No. 1 Review Social Network Analysis and
More informationA NEW CLUSTER MERGING ALGORITHM OF SUFFIX TREE CLUSTERING
A NEW CLUSTER MERGING ALGORITHM OF SUFFIX TREE CLUSTERING Jianhua Wang, Ruixu Li Computer Science Department, Yantai University, Yantai, Shandong, China Abstract: Key words: Document clustering methods
More informationQuery Likelihood with Negative Query Generation
Query Likelihood with Negative Query Generation Yuanhua Lv Department of Computer Science University of Illinois at Urbana-Champaign Urbana, IL 61801 ylv2@uiuc.edu ChengXiang Zhai Department of Computer
More informationAUTOMATIC VISUAL CONCEPT DETECTION IN VIDEOS
AUTOMATIC VISUAL CONCEPT DETECTION IN VIDEOS Nilam B. Lonkar 1, Dinesh B. Hanchate 2 Student of Computer Engineering, Pune University VPKBIET, Baramati, India Computer Engineering, Pune University VPKBIET,
More informationThe Open University s repository of research publications and other research outputs. Search Personalization with Embeddings
Open Research Online The Open University s repository of research publications and other research outputs Search Personalization with Embeddings Conference Item How to cite: Vu, Thanh; Nguyen, Dat Quoc;
More informationA probabilistic model to resolve diversity-accuracy challenge of recommendation systems
A probabilistic model to resolve diversity-accuracy challenge of recommendation systems AMIN JAVARI MAHDI JALILI 1 Received: 17 Mar 2013 / Revised: 19 May 2014 / Accepted: 30 Jun 2014 Recommendation systems
More informationINCORPORATING SYNONYMS INTO SNIPPET BASED QUERY RECOMMENDATION SYSTEM
INCORPORATING SYNONYMS INTO SNIPPET BASED QUERY RECOMMENDATION SYSTEM Megha R. Sisode and Ujwala M. Patil Department of Computer Engineering, R. C. Patel Institute of Technology, Shirpur, Maharashtra,
More informationAutomatic New Topic Identification in Search Engine Transaction Log Using Goal Programming
Proceedings of the 2012 International Conference on Industrial Engineering and Operations Management Istanbul, Turkey, July 3 6, 2012 Automatic New Topic Identification in Search Engine Transaction Log
More informationITERATIVE SEARCHING IN AN ONLINE DATABASE. Susan T. Dumais and Deborah G. Schmitt Cognitive Science Research Group Bellcore Morristown, NJ
- 1 - ITERATIVE SEARCHING IN AN ONLINE DATABASE Susan T. Dumais and Deborah G. Schmitt Cognitive Science Research Group Bellcore Morristown, NJ 07962-1910 ABSTRACT An experiment examined how people use
More informationMultimodal Information Spaces for Content-based Image Retrieval
Research Proposal Multimodal Information Spaces for Content-based Image Retrieval Abstract Currently, image retrieval by content is a research problem of great interest in academia and the industry, due
More information