International Journal of Computer Engineering and Applications, Volume IX, Issue X, Oct. 15 ISSN
|
|
- Darren Watkins
- 6 years ago
- Views:
Transcription
1 DIVERSIFIED DATASET EXPLORATION BASED ON USEFULNESS SCORE Geetanjali Mohite 1, Prof. Gauri Rao 2 1 Student, Department of Computer Engineering, B.V.D.U.C.O.E, Pune, Maharashtra, India 2 Associate Professor, Department of Computer Engineering, B.V.D.U.C.O.E, Pune, Maharashtra, India ABSTRACT: Data sharing is the key cause for the burst in web usage in today's context. Users have become large producers of diverse data which can be stored in data spaces distributed in different systems. Thus the data sharing and then searching for data amongst such diversified data distributed across many system has become difficult. In the scenario of huge distribution of users and system for the diversified data many approaches were proposed. In this paper we have discussed the Gossip Based recommendation approach for searching the useful data for the user. Further we have suggested enhancement in the approach for more relevant search result and with efficiency of the approach measured across the usefulness of the result with better performance. Keywords: Usefulness Score, Space Partitioning and Probing, Diversification, Distributed Environment [1] INTRODUCTION Data diversification has recently attained substantial attention because of increased user confidence in Recommender Systems (RS) due to which satisfaction has improved amongst users, as well as in online and backend search. The huge amount of information available on web creates the need for developing methods towards selecting and presenting to the user specific subgroups. Recently, data diversification has sought considerable attention as a way of increased user satisfaction. Data diversification has different forms consisting of selecting items so that their novelty, coverage, or content dissimilarity is maximized[6]. Existing approaches to data diversification is divided into two categories 1. Greedy heuristics 2. Interchange heuristics Greedy heuristics (e.g., [11]) creates a diverse dataset incrementally, considering one item at a time so that some distance function is maximized, whereas interchange heuristics (e.g., [10]) start from a random initial dataset and takes effort to improve it. Applying indexing to diversification is also an approach proposed by many researchers. One such approach is, a Dewey-based tree which is used for structure based diversity, which uses priorities of attribute and the second approach is 33
2 DIVERSIFIED DATASET EXPLORATION BASED ON USEFULNESS SCORE spatial indexing which exploits the location of nearest neighbors of an item that are the most far away to each other. In spite of the immense interest in diversification in recent years, most previous researches study and address the static nature of the problem, which is, the available items out of which a diverse subset is selected which do not change with time. As a solution to the above mentioned challenges a simple solution to data sharing is offered by distributed search and recommendation. To be specific it is gossip-based search and recommendation where every user constructs a cluster of "relevant" data that will be employed in the processing of queries. However, considering only usefulness introduces a significant amount of data duplicity among users. In the system when a query is submitted, As the user profiles in each user's cluster are quite similar, the probability of retrieving the same set of relevant items increases, and recall results are limited. Thus a modified version of the gossip based recommendation is discussed in this paper where for enhancing the relevance of the data retrieved the "usefulness" score is introduced and for performance efficiency the space partitioning and probing algorithm is used in conjunction. [2] EXISTING SYSTEM OVERVIEW Recommender Systems (RS) is considered as a reference to guide users in the task of speedily browse/explore large product space, assisting users to identify interesting products in an optimized way. However, common RS usually do not provide diverse results though it is considered that diversity is a required feature. The study of diversity aware RS has become an important research area in recent years, inspired from diversified solutions for Information Retrieval (IR). Diversity is a concept that has been applied in many fields; mostly with the goal of obtaining a set of objects that have a high level of dissimilarity between them, and that as a group, maximize a quality criterion. However, there is usually a trade-off between diversity and quality. Hence, the diversification problem is how to choose k elements from a set that maximizes diversity at a low quality sacrifice. Diversification approaches for both RS and IR can be classified as implicit or explicit. In IR, implicit approaches infer that by selecting dissimilar documents the diverse query aspects will be indirectly covered. The method MMR (Carbonell et al. 1998) is a classic example that aims to maximize relevant novelty : weighted linear combination 484 of relevance and novelty (novelty is defined as dissimilarity from previously selected documents). In contrast, explicit approaches directly attempt to cover different query aspects or sub-topics. IA-Select (Agrawal et al. 2009) and xquad (Santos et al. 2010) are examples of explicit approaches. In addition, (Zheng et al. 2012) proposed strategies to specify coverage functions of query sub-topics that serve as a basis for their diversification solution. [3] COMPAARATIVE ANALYSIS The comparison of different approaches has been done against the below parameters in this paper. (a) Greedy optimization means that the query should be result hungry and the extensive search should be performed for the expected result.
3 (b) Explicit approach should propose solution directly attempting to cover the diverse aspects of the query/user profile. (c) Implicit approach means the proposed solution explicitly prevent redundancy within the results. (d) Control of diversity vs. Relevance trade-off asks question that is there a control parameter that can tune the diversity vs. relevance trade-off. (e) Encourages discovery identifies that does the proposed approach not penalize novel/serendipitous items. (f) Control of exploitation vs. Exploration trade-off answers that is there a control parameter that can tune the exploitation vs. exploration trade-off. Below Table shows the comparison of the approaches proposed previously: Greedy Optimization Y Y Y Y Y Y Y N Explicit Approach N Y Y Y N N Y N Implicit Approach Y N N N Y Y N N Control of diversity vs. relevance trade-off Y N Y Y Y Y - - Encourages Discovery - N N N - - N - Control of exploitation vs. exploration trade-off N N N N N N N N 1 - (Carbonell et al. 1998) 2 - (Agrawal et al. 2009) 3 - (Santos et al. 2010) 4 - (Zheng et al. 2012) 5 - (Smyth et al. 2001) 6 - (Ziegler et al. 2005) 7 - (Vargas 2012) 8 - (Adomavicius et al. 2009) Table 1 Comparison of different approaches [4] OUR APPROACH - USEFULNESS SCORE BASED EXPLORATION OF DIVERSIFIED DATASET In the existing Web world there are numerous systems where user from diverse location and with diverse interest has facility to share data and the users have become heavily dependent on the Web for relevant information search. Introduction of cloud has added more distribution and diversification to the dataset the search engines has to navigate to extract relevant data as per the user search query. For the discussion of our modified approaches methodology we have considered the Real Estate Data set where the Diversification parameters in the consideration will be Cost, Area, Location and Property type. If Q is the set of all possible queries (all the combinations of terms), and the probability that a user v can return at least one relevant item given a random query q out of Q. In the following, we first define the coverage with respect to User Set. Then, based on coverage, we express the usefulness of a user v with respect to the other users in the user set. 35
4 DIVERSIFIED DATASET EXPLORATION BASED ON USEFULNESS SCORE For gossip based recommendation approach we need to have a set of registered users say U- Set. The user profiles should be such that the coverage probability is maximized. Thus a strategy for maximized coverage probability will be devised. For the usefulness score Given u's from U-Set, the usefulness of a user profile v is the probability that it can return relevant items for a random query q, that could not be returned by other users in u's U-Net. The usefulness score should also consider relevance. The usefulness score will be provided by the user Then the useful U-Set Clustering should happen for which we are using Space Partitioning and Probing mechanism where, Bounded diversification with sorted access methods is introduced for the first time and defined formally. The Pull/Bound Maximum Marginal Relevance (PBMMR) family of algorithms will be used, which exploits spatial probing locations and the adaptive alternation of usefulness score-based and distance-based access to reduce the number of fetched objects. An instance of PBMMR, called Space Partitioning and Probing (SPP), is presented, whose pulling strategy uses a tight upper bound. SPP is shown to attain the same diversification quality and exactly the same output as MMR, the most popular result diversification algorithm, but accessing only a fraction of the objects. Data Owne 1. Divers ified Datas Submits Data To be available for Search With Web Server 1. Searc h Engin U s e Search Results Provided Search User Search Query is formed and Pushed to Modified Usefulness Score Sent Figure. 1 System architecture The architecture used to demonstrate the working of our system is distributed web based i.e. the diversified dataset will reside on the centralized web server with the web based application also hosted on the cloud web server. The user will be able to access the hosted site from any cloud enabled environment. The search query will be submitted by user and the Query engine will interpret the query and based on logged in users group and parameters like income and area the usefulness score will be derived and search results will be provided to users. for the search results
5 the user will have the facility to specify the usefulness score and thus the usefulness score will be recomputed and persisted in database for next search query by other users in the same group. REFERENCES [1] Adomavicius, G., & Kwon, Y., Toward more diverse recommendations: Item re-ranking methods for recommender systems. InWorkshop on Information Technologies and Systems. (2009, December). [2] Agrawal, R., Gollapudi, S., Halverson, A., & Ieong, S. Diversifying search results. In Proceedings of the Second ACM International Conference on Web Search and Data Mining (pp. 5-14). ACM, (2009, February) [3] Carbonell, J., & Goldstein, J., The use of MMR, diversity-based reranking for reordering documents and producing summaries. In Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval (pp ). ACM, (1998, August). [4] Drosou, M., & Pitoura, E. Search result diversification. ACM SIGMOD Record, 39(1), 41-47, (2010) [5] Haritsa, J. R., The KNDN Problem: A Quest for Unity in Diversity. IEEE Data Eng. Bull., 32(4), 15-22, (2009). [6] Santos, R. L., Macdonald, C., & Ounis, I., Exploiting query reformulations for web search result diversification. In Proceedings of the 19th international conference on World wide web (pp ). ACM, (2010, April) [7] Smyth, B., & McClave, P. Similarity vs. diversity. In Case-Based Reasoning Research and Development (pp ). Springer Berlin Heidelberg, (2001). [8] Vargas, S., Novelty and Diversity Enhancement and Evaluation in Recommender Systems. MSc. diss., Department of Ingeniería Informática, Universidad Autónoma de Madrid, Spain. (2012). [9] Vee, E., Srivastava, U., Shanmugasundaram, J., Bhat, P., & Yahia, S. A., Efficient computation of diverse query results. In Data Engineering, ICDE IEEE 24th International Conference on (pp ). IEEE. (2008, April). [10] Yu, C., Lakshmanan, L., & Amer-Yahia, S. It takes variety to make a world: diversification in recommender systems. In Proceedings of the 12th international conference on extending database technology: Advances in database technology (pp ). ACM. (2009, March). [11] Ziegler, C. N., McNee, S. M., Konstan, J. A., & Lausen, G. Improving recommendation lists through topic diversification. In Proceedings of the 14th international conference on World Wide Web (pp ). ACM. (2005, May) [12] Zheng, W., Wang, X., Fang, H., & Cheng, H., Coverage-based search result diversification. Information Retrieval, 15(5), , (2012). 37
Proposed System. Start. Search parameter definition. User search criteria (input) usefulness score > 0.5. Retrieve results
, Impact Factor- 5.343 Hybrid Approach For Efficient Diversification on Cloud Stored Large Dataset Geetanjali Mohite 1, Prof. Gauri Rao 2 1 Student, Department of Computer Engineering, B.V.D.U.C.O.E, Pune,
More informationSearch Result Diversification
Search Result Diversification Marina Drosou Dept. of Computer Science University of Ioannina, Greece mdrosou@cs.uoi.gr Evaggelia Pitoura Dept. of Computer Science University of Ioannina, Greece pitoura@cs.uoi.gr
More informationINTERSOCIAL: Unleashing the Power of Social Networks for Regional SMEs
INTERSOCIAL I1 1.2, Subsidy Contract No. , MIS Nr 902010 European Territorial Cooperation Programme Greece-Italy 2007-2013 INTERSOCIAL: Unleashing the Power of Social Networks for Regional SMEs
More informationUniversity of Delaware at Diversity Task of Web Track 2010
University of Delaware at Diversity Task of Web Track 2010 Wei Zheng 1, Xuanhui Wang 2, and Hui Fang 1 1 Department of ECE, University of Delaware 2 Yahoo! Abstract We report our systems and experiments
More informationExploiting the Diversity of User Preferences for Recommendation
Exploiting the Diversity of User Preferences for Recommendation Saúl Vargas and Pablo Castells Escuela Politécnica Superior Universidad Autónoma de Madrid 28049 Madrid, Spain {saul.vargas,pablo.castells}@uam.es
More informationA Study of Pattern-based Subtopic Discovery and Integration in the Web Track
A Study of Pattern-based Subtopic Discovery and Integration in the Web Track Wei Zheng and Hui Fang Department of ECE, University of Delaware Abstract We report our systems and experiments in the diversity
More informationEfficient Diversification of Web Search Results
Efficient Diversification of Web Search Results G. Capannini, F. M. Nardini, R. Perego, and F. Silvestri ISTI-CNR, Pisa, Italy Laboratory Web Search Results Diversification Query: Vinci, what is the user
More information5. Novelty & Diversity
5. Novelty & Diversity Outline 5.1. Why Novelty & Diversity? 5.2. Probability Ranking Principled Revisited 5.3. Implicit Diversification 5.4. Explicit Diversification 5.5. Evaluating Novelty & Diversity
More informationNTU Approaches to Subtopic Mining and Document Ranking at NTCIR-9 Intent Task
NTU Approaches to Subtopic Mining and Document Ranking at NTCIR-9 Intent Task Chieh-Jen Wang, Yung-Wei Lin, *Ming-Feng Tsai and Hsin-Hsi Chen Department of Computer Science and Information Engineering,
More informationMicrosoft Research Asia at the Web Track of TREC 2009
Microsoft Research Asia at the Web Track of TREC 2009 Zhicheng Dou, Kun Chen, Ruihua Song, Yunxiao Ma, Shuming Shi, and Ji-Rong Wen Microsoft Research Asia, Xi an Jiongtong University {zhichdou, rsong,
More informationScalable Diversified Ranking on Large Graphs
211 11th IEEE International Conference on Data Mining Scalable Diversified Ranking on Large Graphs Rong-Hua Li Jeffrey Xu Yu The Chinese University of Hong ong, Hong ong {rhli,yu}@se.cuhk.edu.hk Abstract
More informationCurrent Approaches to Search Result Diversication
Current Approaches to Search Result Diversication Enrico Minack, Gianluca Demartini, and Wolfgang Nejdl L3S Research Center, Leibniz Universität Hannover, 30167 Hannover, Germany, {lastname}@l3s.de Abstract
More informationA Survey On Diversification Techniques For Unabmiguous But Under- Specified Queries
J. Appl. Environ. Biol. Sci., 4(7S)271-276, 2014 2014, TextRoad Publication ISSN: 2090-4274 Journal of Applied Environmental and Biological Sciences www.textroad.com A Survey On Diversification Techniques
More informationGrOnto: a GRanular ONTOlogy for Diversifying Search Results
GrOnto: a GRanular ONTOlogy for Diversifying Search Results Silvia Calegari Gabriella Pasi University of Milano-Bicocca V.le Sarca 6/4, 6 Milano, Italy {calegari,pasi}@disco.unimib.it ABSTRACT Results
More informationThe 1st International Workshop on Diversity in Document Retrieval
WORKSHOP REPORT The 1st International Workshop on Diversity in Document Retrieval Craig Macdonald School of Computing Science University of Glasgow UK craig.macdonald@glasgow.ac.uk Jun Wang Department
More informationInferring User Search for Feedback Sessions
Inferring User Search for Feedback Sessions Sharayu Kakade 1, Prof. Ranjana Barde 2 PG Student, Department of Computer Science, MIT Academy of Engineering, Pune, MH, India 1 Assistant Professor, Department
More informationAdvances in Natural and Applied Sciences. Information Retrieval Using Collaborative Filtering and Item Based Recommendation
AENSI Journals Advances in Natural and Applied Sciences ISSN:1995-0772 EISSN: 1998-1090 Journal home page: www.aensiweb.com/anas Information Retrieval Using Collaborative Filtering and Item Based Recommendation
More informationAn Investigation of Basic Retrieval Models for the Dynamic Domain Task
An Investigation of Basic Retrieval Models for the Dynamic Domain Task Razieh Rahimi and Grace Hui Yang Department of Computer Science, Georgetown University rr1042@georgetown.edu, huiyang@cs.georgetown.edu
More informationmnir: Diversifying Search Results based on a Mixture of Novelty, Intention and Relevance
mnir: Diversifying Search Results based on a Mixture of Novelty, Intention and Relevance Reza Taghizadeh Hemayati, Laleh Jafarian Dehordi, Weiyi Meng Department of Computer Science, Binghamton University
More informationA Survey on k-means Clustering Algorithm Using Different Ranking Methods in Data Mining
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 2, Issue. 4, April 2013,
More informationInverted Index for Fast Nearest Neighbour
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology ISSN 2320 088X IMPACT FACTOR: 5.258 IJCSMC,
More informationA Class of Submodular Functions for Document Summarization
A Class of Submodular Functions for Document Summarization Hui Lin, Jeff Bilmes University of Washington, Seattle Dept. of Electrical Engineering June 20, 2011 Lin and Bilmes Submodular Summarization June
More informationAn Analysis of NP-Completeness in Novelty and Diversity Ranking
An Analysis of NP-Completeness in Novelty and Diversity Ranking Ben Carterette (carteret@cis.udel.edu) Dept. of Computer and Info. Sciences, University of Delaware, Newark, DE, USA Abstract. A useful ability
More informationTopic Diversity Method for Image Re-Ranking
Topic Diversity Method for Image Re-Ranking D.Ashwini 1, P.Jerlin Jeba 2, D.Vanitha 3 M.E, P.Veeralakshmi M.E., Ph.D 4 1,2 Student, 3 Assistant Professor, 4 Associate Professor 1,2,3,4 Department of Information
More informationConcept Based Tie-breaking and Maximal Marginal Relevance Retrieval in Microblog Retrieval
Concept Based Tie-breaking and Maximal Marginal Relevance Retrieval in Microblog Retrieval Kuang Lu Department of Electrical and Computer Engineering University of Delaware Newark, Delaware, 19716 lukuang@udel.edu
More informationSearch Engine Architecture. Hongning Wang
Search Engine Architecture Hongning Wang CS@UVa CS@UVa CS4501: Information Retrieval 2 Document Analyzer Classical search engine architecture The Anatomy of a Large-Scale Hypertextual Web Search Engine
More informationFlight Recommendation System based on user feedback, weighting technique and context aware recommendation system
www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 5 Issue 09 September 2016 Page No.17973-17978 Flight Recommendation System based on user feedback, weighting
More informationResult Diversification For Tweet Search
Result Diversification For Tweet Search Makbule Gulcin Ozsoy, Kezban Dilek Onal, and Ismail Sengor Altingovde Middle East Technical University, Ankara, Turkey {makbule.ozsoy, dilek, altingovde}@ceng.metu.edu.tr
More informationDivQ: Diversification for Keyword Search over Structured Databases
DivQ: Diversification for Keyword Search over Structured Databases Elena Demidova, Peter Fankhauser, 2, Xuan Zhou 3 and Wolfgang Nejdl L3S Research Center, Hannover, Germany 2 Fraunhofer IPSI, Darmstadt
More informationDiversification of Query Interpretations and Search Results
Diversification of Query Interpretations and Search Results Advanced Methods of IR Elena Demidova Materials used in the slides: Charles L.A. Clarke, Maheedhar Kolla, Gordon V. Cormack, Olga Vechtomova,
More informationAN IMPROVISED FREQUENT PATTERN TREE BASED ASSOCIATION RULE MINING TECHNIQUE WITH MINING FREQUENT ITEM SETS ALGORITHM AND A MODIFIED HEADER TABLE
AN IMPROVISED FREQUENT PATTERN TREE BASED ASSOCIATION RULE MINING TECHNIQUE WITH MINING FREQUENT ITEM SETS ALGORITHM AND A MODIFIED HEADER TABLE Vandit Agarwal 1, Mandhani Kushal 2 and Preetham Kumar 3
More informationA Survey on Keyword Diversification Over XML Data
ISSN (Online) : 2319-8753 ISSN (Print) : 2347-6710 International Journal of Innovative Research in Science, Engineering and Technology An ISO 3297: 2007 Certified Organization Volume 6, Special Issue 5,
More informationpresent the results in the best way to users. These challenges reflect an issue that has been presented in different works which is: diversity of quer
An Approach to Diversify Entity Search Results Imène Saidi University of Oran, LITIO Laboratory BP 1524, El-M Naouer, 31000 Oran, Algeria saidi.imene@univ-oran.dz Sihem Amer-Yahia Safia Nait Bahloul CNRS,
More informationQuery Subtopic Mining Exploiting Word Embedding for Search Result Diversification
Query Subtopic Mining Exploiting Word Embedding for Search Result Diversification Md Zia Ullah, Md Shajalal, Abu Nowshed Chy, and Masaki Aono Department of Computer Science and Engineering, Toyohashi University
More informationSpatial Index Keyword Search in Multi- Dimensional Database
Spatial Index Keyword Search in Multi- Dimensional Database Sushma Ahirrao M. E Student, Department of Computer Engineering, GHRIEM, Jalgaon, India ABSTRACT: Nearest neighbor search in multimedia databases
More information{david.vallet,
Personalized Diversification of Search Results David Vallet and Pablo Castells Universidad Autónoma de Madrid Escuela Politécnica Superior, Departamento de Ingeniería Informática {david.vallet, pablo.castells}@uam.es
More informationA New Technique to Optimize User s Browsing Session using Data Mining
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 3, March 2015,
More informationAn Efficient Methodology for Image Rich Information Retrieval
An Efficient Methodology for Image Rich Information Retrieval 56 Ashwini Jaid, 2 Komal Savant, 3 Sonali Varma, 4 Pushpa Jat, 5 Prof. Sushama Shinde,2,3,4 Computer Department, Siddhant College of Engineering,
More informationA Novel Categorized Search Strategy using Distributional Clustering Neenu Joseph. M 1, Sudheep Elayidom 2
A Novel Categorized Search Strategy using Distributional Clustering Neenu Joseph. M 1, Sudheep Elayidom 2 1 Student, M.E., (Computer science and Engineering) in M.G University, India, 2 Associate Professor
More informationFinal Exam Search Engines ( / ) December 8, 2014
Student Name: Andrew ID: Seat Number: Final Exam Search Engines (11-442 / 11-642) December 8, 2014 Answer all of the following questions. Each answer should be thorough, complete, and relevant. Points
More informationLearning Socially Optimal Information Systems from Egoistic Users
Learning Socially Optimal Information Systems from Egoistic Users Karthik Raman Thorsten Joachims Department of Computer Science Cornell University, Ithaca NY karthik@cs.cornell.edu www.cs.cornell.edu/
More informationTowards Rule Learning Approaches to Instance-based Ontology Matching
Towards Rule Learning Approaches to Instance-based Ontology Matching Frederik Janssen 1, Faraz Fallahi 2 Jan Noessner 3, and Heiko Paulheim 1 1 Knowledge Engineering Group, TU Darmstadt, Hochschulstrasse
More informationA PROPOSED HYBRID BOOK RECOMMENDER SYSTEM
A PROPOSED HYBRID BOOK RECOMMENDER SYSTEM SUHAS PATIL [M.Tech Scholar, Department Of Computer Science &Engineering, RKDF IST, Bhopal, RGPV University, India] Dr.Varsha Namdeo [Assistant Professor, Department
More informationCase-based Recommendation. Peter Brusilovsky with slides of Danielle Lee
Case-based Recommendation Peter Brusilovsky with slides of Danielle Lee Where we are? Search Navigation Recommendation Content-based Semantics / Metadata Social Modern E-Commerce Site The Power of Metadata
More informationPromoting Ranking Diversity for Biomedical Information Retrieval based on LDA
Promoting Ranking Diversity for Biomedical Information Retrieval based on LDA Yan Chen, Xiaoshi Yin, Zhoujun Li, Xiaohua Hu and Jimmy Huang State Key Laboratory of Software Development Environment, Beihang
More informationRisk Minimization and Language Modeling in Text Retrieval Thesis Summary
Risk Minimization and Language Modeling in Text Retrieval Thesis Summary ChengXiang Zhai Language Technologies Institute School of Computer Science Carnegie Mellon University July 21, 2002 Abstract This
More informationHierarchical Online Mining for Associative Rules
Hierarchical Online Mining for Associative Rules Naresh Jotwani Dhirubhai Ambani Institute of Information & Communication Technology Gandhinagar 382009 INDIA naresh_jotwani@da-iict.org Abstract Mining
More informationAddressing the Challenges of Underspecification in Web Search. Michael Welch
Addressing the Challenges of Underspecification in Web Search Michael Welch mjwelch@cs.ucla.edu Why study Web search?!! Search engines have enormous reach!! Nearly 1 billion queries globally each day!!
More informationTag Based Image Search by Social Re-ranking
Tag Based Image Search by Social Re-ranking Vilas Dilip Mane, Prof.Nilesh P. Sable Student, Department of Computer Engineering, Imperial College of Engineering & Research, Wagholi, Pune, Savitribai Phule
More informationIJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, ISSN:
IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, 20131 Improve Search Engine Relevance with Filter session Addlin Shinney R 1, Saravana Kumar T
More informationExploiting the Diversity of User Preferences for Recommendation. Saúl Vargas and Pablo Castells {saul.vargas,
Exploiting the Diversity of User Preferences for Recommendation Saúl Vargas and Pablo Castells {saul.vargas, pablo.castells}@uam.es Item Recommendation User A B C E F D G I H User profile You may also
More informationLetter Pair Similarity Classification and URL Ranking Based on Feedback Approach
Letter Pair Similarity Classification and URL Ranking Based on Feedback Approach P.T.Shijili 1 P.G Student, Department of CSE, Dr.Nallini Institute of Engineering & Technology, Dharapuram, Tamilnadu, India
More informationMeasuring Diversity of a Domain-Specic Crawl
Measuring Diversity of a Domain-Specic Crawl Pattisapu Nikhil Priyatam, Ajay Dubey, Krish Perumal, Dharmesh Kakadia, and Vasudeva Varma Search and Information Extraction Lab, IIIT-Hyderabad, India {nikhil.pattisapu,
More informationWEB PAGE RE-RANKING TECHNIQUE IN SEARCH ENGINE
WEB PAGE RE-RANKING TECHNIQUE IN SEARCH ENGINE Ms.S.Muthukakshmi 1, R. Surya 2, M. Umira Taj 3 Assistant Professor, Department of Information Technology, Sri Krishna College of Technology, Kovaipudur,
More informationI. INTRODUCTION. Fig Taxonomy of approaches to build specialized search engines, as shown in [80].
Focus: Accustom To Crawl Web-Based Forums M.Nikhil 1, Mrs. A.Phani Sheetal 2 1 Student, Department of Computer Science, GITAM University, Hyderabad. 2 Assistant Professor, Department of Computer Science,
More informationSupporting Fuzzy Keyword Search in Databases
I J C T A, 9(24), 2016, pp. 385-391 International Science Press Supporting Fuzzy Keyword Search in Databases Jayavarthini C.* and Priya S. ABSTRACT An efficient keyword search system computes answers as
More informationMicrosoft Research Asia at the NTCIR-10 Intent Task
Microsoft Research Asia at the NTCIR-0 Intent Task Kosetsu Tsukuda Kyoto University tsukuda@dl.kuis.kyotou.ac.jp Zhicheng Dou Microsoft Research Asia zhichdou@microsoft.com Tetsuya Sakai Microsoft Research
More informationTREC 2017 Dynamic Domain Track Overview
TREC 2017 Dynamic Domain Track Overview Grace Hui Yang Zhiwen Tang Ian Soboroff Georgetown University Georgetown University NIST huiyang@cs.georgetown.edu zt79@georgetown.edu ian.soboroff@nist.gov 1. Introduction
More informationKeywords Data alignment, Data annotation, Web database, Search Result Record
Volume 5, Issue 8, August 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Annotating Web
More informationIJESRT. Scientific Journal Impact Factor: (ISRA), Impact Factor: [35] [Rana, 3(12): December, 2014] ISSN:
IJESRT INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY A Brief Survey on Frequent Patterns Mining of Uncertain Data Purvi Y. Rana*, Prof. Pragna Makwana, Prof. Kishori Shekokar *Student,
More informationNDoT: Nearest Neighbor Distance Based Outlier Detection Technique
NDoT: Nearest Neighbor Distance Based Outlier Detection Technique Neminath Hubballi 1, Bidyut Kr. Patra 2, and Sukumar Nandi 1 1 Department of Computer Science & Engineering, Indian Institute of Technology
More informationA Comparative Analysis of Cascade Measures for Novelty and Diversity
A Comparative Analysis of Cascade Measures for Novelty and Diversity Charles Clarke, University of Waterloo Nick Craswell, Microsoft Ian Soboroff, NIST Azin Ashkan, University of Waterloo Background Measuring
More informationClustering Based Diversity Improvement in Top-N Recommendation
Journal of Intelligent Information Systems manuscript No. (will be inserted by the editor) Clustering Based Diversity Improvement in Top-N Recommendation Tevfik Aytekin Mahmut Özge Karakaya Received: date
More informationCapturing User Interests by Both Exploitation and Exploration
Capturing User Interests by Both Exploitation and Exploration Ka Cheung Sia 1, Shenghuo Zhu 2, Yun Chi 2, Koji Hino 2, and Belle L. Tseng 2 1 University of California, Los Angeles, CA 90095, USA kcsia@cs.ucla.edu
More informationSocial Data Exploration
Social Data Exploration Sihem Amer-Yahia DR CNRS @ LIG Sihem.Amer-Yahia@imag.fr Big Data & Optimization Workshop 12ème Séminaire POC LIP6 Dec 5 th, 2014 Collaborative data model User space (with attributes)
More informationINTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK REAL TIME DATA SEARCH OPTIMIZATION: AN OVERVIEW MS. DEEPASHRI S. KHAWASE 1, PROF.
More informationR. R. Badre Associate Professor Department of Computer Engineering MIT Academy of Engineering, Pune, Maharashtra, India
Volume 7, Issue 4, April 2017 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Web Service Ranking
More informationGRID SIMULATION FOR DYNAMIC LOAD BALANCING
GRID SIMULATION FOR DYNAMIC LOAD BALANCING Kapil B. Morey 1, Prof. A. S. Kapse 2, Prof. Y. B. Jadhao 3 1 Research Scholar, Computer Engineering Dept., Padm. Dr. V. B. Kolte College of Engineering, Malkapur,
More informationImproving the Efficiency of Fast Using Semantic Similarity Algorithm
International Journal of Scientific and Research Publications, Volume 4, Issue 1, January 2014 1 Improving the Efficiency of Fast Using Semantic Similarity Algorithm D.KARTHIKA 1, S. DIVAKAR 2 Final year
More informationDynamic Optimization of Generalized SQL Queries with Horizontal Aggregations Using K-Means Clustering
Dynamic Optimization of Generalized SQL Queries with Horizontal Aggregations Using K-Means Clustering Abstract Mrs. C. Poongodi 1, Ms. R. Kalaivani 2 1 PG Student, 2 Assistant Professor, Department of
More informationSEQUENTIAL PATTERN MINING FROM WEB LOG DATA
SEQUENTIAL PATTERN MINING FROM WEB LOG DATA Rajashree Shettar 1 1 Associate Professor, Department of Computer Science, R. V College of Engineering, Karnataka, India, rajashreeshettar@rvce.edu.in Abstract
More informationBuilding Rich User Profiles for Personalized News Recommendation
Building Rich User Profiles for Personalized News Recommendation Youssef Meguebli 1, Mouna Kacimi 2, Bich-liên Doan 1, and Fabrice Popineau 1 1 SUPELEC Systems Sciences (E3S), Gif sur Yvette, France, {youssef.meguebli,bich-lien.doan,fabrice.popineau}@supelec.fr
More informationOntology-Based Web Query Classification for Research Paper Searching
Ontology-Based Web Query Classification for Research Paper Searching MyoMyo ThanNaing University of Technology(Yatanarpon Cyber City) Mandalay,Myanmar Abstract- In web search engines, the retrieval of
More informationDiversity based Relevance Feedback for Time Series Search
Diversity based Relevance Feedback for Time Series Search ABSTRACT Bahaeddin Eravci Department of Computer Engineering Bilkent University Ankara, Turkey beravci@gmail.com We propose a diversity based relevance
More informationIn the recent past, the World Wide Web has been witnessing an. explosive growth. All the leading web search engines, namely, Google,
1 1.1 Introduction In the recent past, the World Wide Web has been witnessing an explosive growth. All the leading web search engines, namely, Google, Yahoo, Askjeeves, etc. are vying with each other to
More informationContent Based Smart Crawler For Efficiently Harvesting Deep Web Interface
Content Based Smart Crawler For Efficiently Harvesting Deep Web Interface Prof. T.P.Aher(ME), Ms.Rupal R.Boob, Ms.Saburi V.Dhole, Ms.Dipika B.Avhad, Ms.Suvarna S.Burkul 1 Assistant Professor, Computer
More informationAdaptive and Personalized System for Semantic Web Mining
Journal of Computational Intelligence in Bioinformatics ISSN 0973-385X Volume 10, Number 1 (2017) pp. 15-22 Research Foundation http://www.rfgindia.com Adaptive and Personalized System for Semantic Web
More informationA Survey on improving performance of Information Retrieval System using Adaptive Genetic Algorithm
A Survey on improving performance of Information Retrieval System using Adaptive Genetic Algorithm Prajakta Mitkal 1, Prof. Ms. D.V. Gore 2 1 Modern College of Engineering Shivajinagar, Pune 2 Modern College
More informationA FRAMEWORK FOR EFFICIENT DATA SEARCH THROUGH XML TREE PATTERNS
A FRAMEWORK FOR EFFICIENT DATA SEARCH THROUGH XML TREE PATTERNS SRIVANI SARIKONDA 1 PG Scholar Department of CSE P.SANDEEP REDDY 2 Associate professor Department of CSE DR.M.V.SIVA PRASAD 3 Principal Abstract:
More informationImproving Aggregated Search Coherence
Improving Aggregated Search Coherence Jaime Arguello University of North Carolina at Chapel Hill jarguello@unc.edu Abstract. Aggregated search is that task of blending results from different search services,
More informationDaund, Pune, India I. INTRODUCTION
GLOBAL JOURNAL OF ENGINEERING SCIENCE AND RESEARCHES DYNAMICALLY RANKING THE DATABASE QUERY FORM Vinayak Venakt Jadhav * 1 & Prof. Amrit Priyadarshi 2 * 1 M.E Student, Dattakala Group of Institute, Faculty
More informationKDD 10 Tutorial: Recommender Problems for Web Applications. Deepak Agarwal and Bee-Chung Chen Yahoo! Research
KDD 10 Tutorial: Recommender Problems for Web Applications Deepak Agarwal and Bee-Chung Chen Yahoo! Research Agenda Focus: Recommender problems for dynamic, time-sensitive applications Content Optimization
More informationShrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India
International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2018 IJSRCSEIT Volume 3 Issue 3 ISSN : 2456-3307 Some Issues in Application of NLP to Intelligent
More informationStructure of Association Rule Classifiers: a Review
Structure of Association Rule Classifiers: a Review Koen Vanhoof Benoît Depaire Transportation Research Institute (IMOB), University Hasselt 3590 Diepenbeek, Belgium koen.vanhoof@uhasselt.be benoit.depaire@uhasselt.be
More informationInternational Journal of Data Mining & Knowledge Management Process (IJDKP) Vol.7, No.3, May Dr.Zakea Il-Agure and Mr.Hicham Noureddine Itani
LINK MINING PROCESS Dr.Zakea Il-Agure and Mr.Hicham Noureddine Itani Higher Colleges of Technology, United Arab Emirates ABSTRACT Many data mining and knowledge discovery methodologies and process models
More informationCATEGORIZATION OF THE DOCUMENTS BY USING MACHINE LEARNING
CATEGORIZATION OF THE DOCUMENTS BY USING MACHINE LEARNING Amol Jagtap ME Computer Engineering, AISSMS COE Pune, India Email: 1 amol.jagtap55@gmail.com Abstract Machine learning is a scientific discipline
More informationA Joint Optimization Approach for Personalized Recommendation Diversification
A Joint Optimization Approach for Personalized Recommendation Diversification Xiaojie Wang 1, Jianzhong Qi 2, Kotagiri Ramamohanarao 2, Yu Sun 3, Bo Li 4, and Rui Zhang 2. 1,2 The University of Melbourne,
More informationQUERY RECOMMENDATION SYSTEM USING USERS QUERYING BEHAVIOR
International Journal of Emerging Technology and Innovative Engineering QUERY RECOMMENDATION SYSTEM USING USERS QUERYING BEHAVIOR V.Megha Dept of Computer science and Engineering College Of Engineering
More informationIRCE at the NTCIR-12 IMine-2 Task
IRCE at the NTCIR-12 IMine-2 Task Ximei Song University of Tsukuba songximei@slis.tsukuba.ac.jp Yuka Egusa National Institute for Educational Policy Research yuka@nier.go.jp Masao Takaku University of
More informationINTRODUCTION. Chapter GENERAL
Chapter 1 INTRODUCTION 1.1 GENERAL The World Wide Web (WWW) [1] is a system of interlinked hypertext documents accessed via the Internet. It is an interactive world of shared information through which
More informationAvailable online at ScienceDirect. Procedia Computer Science 89 (2016 )
Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 89 (2016 ) 341 348 Twelfth International Multi-Conference on Information Processing-2016 (IMCIP-2016) Parallel Approach
More informationExtracting Summary from Documents Using K-Mean Clustering Algorithm
Extracting Summary from Documents Using K-Mean Clustering Algorithm Manjula.K.S 1, Sarvar Begum 2, D. Venkata Swetha Ramana 3 Student, CSE, RYMEC, Bellary, India 1 Student, CSE, RYMEC, Bellary, India 2
More informationDiversity in Recommender Systems Week 2: The Problems. Toni Mikkola, Andy Valjakka, Heng Gui, Wilson Poon
Diversity in Recommender Systems Week 2: The Problems Toni Mikkola, Andy Valjakka, Heng Gui, Wilson Poon Review diversification happens by searching from further away balancing diversity and relevance
More informationREDUNDANCY REMOVAL IN WEB SEARCH RESULTS USING RECURSIVE DUPLICATION CHECK ALGORITHM. Pudukkottai, Tamil Nadu, India
REDUNDANCY REMOVAL IN WEB SEARCH RESULTS USING RECURSIVE DUPLICATION CHECK ALGORITHM Dr. S. RAVICHANDRAN 1 E.ELAKKIYA 2 1 Head, Dept. of Computer Science, H. H. The Rajah s College, Pudukkottai, Tamil
More informationCHAPTER 3 A FAST K-MODES CLUSTERING ALGORITHM TO WAREHOUSE VERY LARGE HETEROGENEOUS MEDICAL DATABASES
70 CHAPTER 3 A FAST K-MODES CLUSTERING ALGORITHM TO WAREHOUSE VERY LARGE HETEROGENEOUS MEDICAL DATABASES 3.1 INTRODUCTION In medical science, effective tools are essential to categorize and systematically
More informationDATA MINING II - 1DL460. Spring 2014"
DATA MINING II - 1DL460 Spring 2014" A second course in data mining http://www.it.uu.se/edu/course/homepage/infoutv2/vt14 Kjell Orsborn Uppsala Database Laboratory Department of Information Technology,
More informationAssociation-Rules-Based Recommender System for Personalization in Adaptive Web-Based Applications
Association-Rules-Based Recommender System for Personalization in Adaptive Web-Based Applications Daniel Mican, Nicolae Tomai Babes-Bolyai University, Dept. of Business Information Systems, Str. Theodor
More informationTERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES
TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES Mu. Annalakshmi Research Scholar, Department of Computer Science, Alagappa University, Karaikudi. annalakshmi_mu@yahoo.co.in Dr. A.
More informationRecommender Systems: User Experience and System Issues
Recommender Systems: User Experience and System ssues Joseph A. Konstan University of Minnesota konstan@cs.umn.edu http://www.grouplens.org Summer 2005 1 About me Professor of Computer Science & Engineering,
More informationChapter 5: Summary and Conclusion CHAPTER 5 SUMMARY AND CONCLUSION. Chapter 1: Introduction
CHAPTER 5 SUMMARY AND CONCLUSION Chapter 1: Introduction Data mining is used to extract the hidden, potential, useful and valuable information from very large amount of data. Data mining tools can handle
More information