Learning Ontology-Based User Profiles: A Semantic Approach to Personalized Web Search
|
|
- Shonda Watkins
- 6 years ago
- Views:
Transcription
1 1 / 33 Learning Ontology-Based User Profiles: A Semantic Approach to Personalized Web Search Bernd Wittefeld Supervisor Markus Löckelt 20. July 2012
2 2 / 33 Teaser - Google Web History Web history associated with Google account Keeps track of search queries and visited websites Personalized search results displayed to the user Can be switched on, paused and off Can be deleted Cookie usage for users not signed in
3 Teaser - Google Web History 3 / 33
4 4 / 33 Introduction Outline 1 Introduction 2 Ontologies for Web Personalization 3 Search Personalization 4 Experimental Evaluation 5 Conclusion
5 5 / 33 Introduction Motivation
6 6 / 33 Introduction Motivation Tailor search results to users interest Maintain and update interest scores in a user profile
7 6 / 33 Introduction Motivation Tailor search results to users interest Maintain and update interest scores in a user profile Three crucial elements: User s short term information need Semantic knowledge about current domain User profile capturing long-term interests
8 7 / 33 Introduction Motivation Approach taken: Interest scores to existing concepts in a domain ontology Spreading activation algorithm to maintain scores
9 8 / 33 Ontologies for Web Personalization Outline 1 Introduction 2 Ontologies for Web Personalization 3 Search Personalization 4 Experimental Evaluation 5 Conclusion
10 9 / 33 Ontologies for Web Personalization Ontologies
11 Ontologies for Web Personalization Spreading Activation Algorithms 10 / 33
12 11 / 33 Ontologies for Web Personalization Ontological User Profiles Unified context model Instance of pre-existing reference domain ontology Concepts annotated with interest scores Updated based on user s behavior
13 12 / 33 Ontologies for Web Personalization Ontological User Profiles Information gathered implicitly through observing browsing behavior over time collecting interesting sites
14 12 / 33 Ontologies for Web Personalization Ontological User Profiles Information gathered implicitly through observing browsing behavior over time collecting interesting sites Factors for ranking can be frequency of visits time spent on the page bookmarking
15 13 / 33 Ontologies for Web Personalization Context Model Portion of an ontological user profile representing node Music Nodes represented as pair (C j, IS(C j )) C j concept in reference ontology IS(C j ) Interest score
16 14 / 33 Ontologies for Web Personalization Representation of Reference Ontology Open Directory Project (ODP) - Community driven web-classification
17 15 / 33 Ontologies for Web Personalization Representation of Reference Ontology Compute a term vector n containing most frequent terms for concept n and its subconcepts with corresponding weight Stop list contains semantically non relevant high frequency words Porter stemming to reduce words to their stems
18 16 / 33 Search Personalization Outline 1 Introduction 2 Ontologies for Web Personalization 3 Search Personalization 4 Experimental Evaluation 5 Conclusion
19 17 / 33 Search Personalization Re-ranking search results A set of query results is obtained from a search engine For each document in result a term vector r is computed as before Similarity of document and query is computed Compute similarity of document and each concept Rank-score for document is product of interest score for concept, similarity of document to query and similarity of specific concept to query
20 18 / 33 Experimental Evaluation Outline 1 Introduction 2 Ontologies for Web Personalization 3 Search Personalization 4 Experimental Evaluation 5 Conclusion
21 19 / 33 Experimental Evaluation Experimental Data Sets Open Directory contained concepts as of December 2006 Test-Set: Branching factor of four, six levels deep = 563 concepts and documents
22 19 / 33 Experimental Evaluation Experimental Data Sets Open Directory contained concepts as of December 2006 Test-Set: Branching factor of four, six levels deep = 563 concepts and documents pre-processed into training set - representation of reference ontology, 5041 documents test set - document collection for searching, 3067 documents profile set - spreading activation, 2118 documents
23 20 / 33 Experimental Evaluation User Profile Convergence Do the interest scores for individual concepts in the ontological profile converge?
24 20 / 33 Experimental Evaluation User Profile Convergence Do the interest scores for individual concepts in the ontological profile converge? Single profile document for every concept from profile set 25 rounds in spreading activation algorithm = signal concept Repeated experiment 50 times with distinct signal concepts Arithmetic mean and variance
25 21 / 33 Experimental Evaluation User Profile Convergence Arithmetic mean and variance of interest scores
26 22 / 33 Experimental Evaluation User Profile Accuracy Do the changes in interest scores accurately reflect user interest in specific topics?
27 22 / 33 Experimental Evaluation User Profile Accuracy Do the changes in interest scores accurately reflect user interest in specific topics? One signal concept from training set Single document belonging to signal concept from profile set Interest scores for all concepts monitored 30 repetitions of the experiment for the same signal concept Arithmetic mean and variance
28 23 / 33 Experimental Evaluation User Profile Accuracy
29 24 / 33 Experimental Evaluation User Profile Accuracy Two signal concepts from training set Single document belonging to signal concept from profile set 30 repetitions of the experiment alternating the signal concept every 5 iterations
30 25 / 33 Experimental Evaluation User Profile Accuracy
31 26 / 33 Experimental Evaluation Re-ranking Web Search Results Can the semantic evidence provided by the ontological profiles be used to effectively re-rank Web search results to present the user with a personalized view?
32 26 / 33 Experimental Evaluation Re-ranking Web Search Results Can the semantic evidence provided by the ontological profiles be used to effectively re-rank Web search results to present the user with a personalized view? Standard search on test set with queries Documents from profile set used to simulate user interest in concepts search results re-sorted based on ranking from user profile
33 26 / 33 Experimental Evaluation Re-ranking Web Search Results Can the semantic evidence provided by the ontological profiles be used to effectively re-rank Web search results to present the user with a personalized view? Standard search on test set with queries Documents from profile set used to simulate user interest in concepts search results re-sorted based on ranking from user profile effectiveness of re-ranking in terms of Top-n Recall = Top-n Precision = with n {100, 90, 80,..., 10} of relevant retrieved within n total of relevant documents of relevant retrieved within n n
34 Experimental Evaluation Re-ranking Web Search Results Keyword queries extracted from concept term vectors in training set Query of Terms Criteria Set 1 1 highest weighted term in concept term vector Set 2 2 two highest weighted terms in concept term vector Set 3 3 three highest weighted terms in concept term vector Set 4 2 or more overlapping terms within highest weighted 10 terms 27 / 33
35 28 / 33 Experimental Evaluation Re-ranking Web Search Results
36 29 / 33 Experimental Evaluation Re-ranking Web Search Results
37 30 / 33 Experimental Evaluation Discussion of Experimental Results Queries intentionally short Significant improvement for single word queries Query terms disambiguated by semantic evidence in ontological user profiles Stability of the approach was shown
38 31 / 33 Conclusion Outline 1 Introduction 2 Ontologies for Web Personalization 3 Search Personalization 4 Experimental Evaluation 5 Conclusion
39 32 / 33 Conclusion Framework for contextual information access using ontologies Semantic knowledge in ontologies combined with long-term user profiles effectively tailor search results based on user s interest Stability and adaptation to user s interest over long-term needs to be analyzed further
40 33 / 33 Conclusion Questions?
41 34 / 33 Representation of Reference Ontology Document d from training data represented as d = (w1, w 2,..., w k ) w i = tf i log( N n i ) tf i being the frequency of term i in document d N the total number of documents in the training set n i the number of documents containing term i. Vectors are normalized to a unit length.
42 35 / 33 Representation of Reference Ontology Aggregate representation of the concept hierarchy: S(n) the set of sub-concepts under concept n {d1 n, d 2 n,..., d k n } the individual documents indexed under concept n Docs(n) = [ n S(n)Docs(n ) ] {d n 1, d n 2,..., d n k } n = d Docs(n) d Docs(n)
43 36 / 33 Context Model Portion of an ontological user profile representing node Music Nodes represented as pair (C j, IS(C j )) C j concept in reference ontology IS(C j ) Interest score
Web Search Personalization with Ontological User Profiles
Web Search Personalization with Ontological User Profiles Ahu Sieg asieg@cti.depaul.edu Bamshad Mobasher mobasher@cti.depaul.edu Robin Burke rburke@cti.depaul.edu Center for Web Intelligence School of
More informationTansu Alpcan C. Bauckhage S. Agarwal
1 / 16 C. Bauckhage S. Agarwal Deutsche Telekom Laboratories GBR 2007 2 / 16 Outline 3 / 16 Overview A novel expert peering system for community-based information exchange A graph-based scheme consisting
More informationQuery Phrase Expansion using Wikipedia for Patent Class Search
Query Phrase Expansion using Wikipedia for Patent Class Search 1 Bashar Al-Shboul, Sung-Hyon Myaeng Korea Advanced Institute of Science and Technology (KAIST) December 19 th, 2011 AIRS 11, Dubai, UAE OUTLINE
More informationChapter 27 Introduction to Information Retrieval and Web Search
Chapter 27 Introduction to Information Retrieval and Web Search Copyright 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 27 Outline Information Retrieval (IR) Concepts Retrieval
More informationMining Web Data. Lijun Zhang
Mining Web Data Lijun Zhang zlj@nju.edu.cn http://cs.nju.edu.cn/zlj Outline Introduction Web Crawling and Resource Discovery Search Engine Indexing and Query Processing Ranking Algorithms Recommender Systems
More informationA Personalized Approach for Re-ranking Search Results Using User Preferences
Journal of Universal Computer Science, vol. 20, no. 9 (2014), 1232-1258 submitted: 17/2/14, accepted: 29/8/14, appeared: 1/9/14 J.UCS A Personalized Approach for Re-ranking Search Results Using User Preferences
More informationMining Web Data. Lijun Zhang
Mining Web Data Lijun Zhang zlj@nju.edu.cn http://cs.nju.edu.cn/zlj Outline Introduction Web Crawling and Resource Discovery Search Engine Indexing and Query Processing Ranking Algorithms Recommender Systems
More informationCHAPTER 3 ASSOCIATON RULE BASED CLUSTERING
41 CHAPTER 3 ASSOCIATON RULE BASED CLUSTERING 3.1 INTRODUCTION This chapter describes the clustering process based on association rule mining. As discussed in the introduction, clustering algorithms have
More informationTERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES
TERM BASED WEIGHT MEASURE FOR INFORMATION FILTERING IN SEARCH ENGINES Mu. Annalakshmi Research Scholar, Department of Computer Science, Alagappa University, Karaikudi. annalakshmi_mu@yahoo.co.in Dr. A.
More informationSemantic Web. Ontology Alignment. Morteza Amini. Sharif University of Technology Fall 95-96
ه عا ی Semantic Web Ontology Alignment Morteza Amini Sharif University of Technology Fall 95-96 Outline The Problem of Ontologies Ontology Heterogeneity Ontology Alignment Overall Process Similarity (Matching)
More informationSearching. Outline. Copyright 2006 Haim Levkowitz. Copyright 2006 Haim Levkowitz
Searching 1 Outline Goals and Objectives Topic Headlines Introduction Directories Open Directory Project Search Engines Metasearch Engines Search techniques Intelligent Agents Invisible Web Summary 2 1
More informationSearching the Deep Web
Searching the Deep Web 1 What is Deep Web? Information accessed only through HTML form pages database queries results embedded in HTML pages Also can included other information on Web can t directly index
More informationCHAPTER THREE INFORMATION RETRIEVAL SYSTEM
CHAPTER THREE INFORMATION RETRIEVAL SYSTEM 3.1 INTRODUCTION Search engine is one of the most effective and prominent method to find information online. It has become an essential part of life for almost
More informationAn Adaptive Agent for Web Exploration Based on Concept Hierarchies
An Adaptive Agent for Web Exploration Based on Concept Hierarchies Scott Parent, Bamshad Mobasher, Steve Lytinen School of Computer Science, Telecommunication and Information Systems DePaul University
More informationChrome based Keyword Visualizer (under sparse text constraint) SANGHO SUH MOONSHIK KANG HOONHEE CHO
Chrome based Keyword Visualizer (under sparse text constraint) SANGHO SUH MOONSHIK KANG HOONHEE CHO INDEX Proposal Recap Implementation Evaluation Future Works Proposal Recap Keyword Visualizer (chrome
More informationShrey Patel B.E. Computer Engineering, Gujarat Technological University, Ahmedabad, Gujarat, India
International Journal of Scientific Research in Computer Science, Engineering and Information Technology 2018 IJSRCSEIT Volume 3 Issue 3 ISSN : 2456-3307 Some Issues in Application of NLP to Intelligent
More informationSemantic Search in s
Semantic Search in Emails Navneet Kapur, Mustafa Safdari, Rahul Sharma December 10, 2010 Abstract Web search technology is abound with techniques to tap into the semantics of information. For email search,
More informationAutomated Online News Classification with Personalization
Automated Online News Classification with Personalization Chee-Hong Chan Aixin Sun Ee-Peng Lim Center for Advanced Information Systems, Nanyang Technological University Nanyang Avenue, Singapore, 639798
More informationDomain-specific Concept-based Information Retrieval System
Domain-specific Concept-based Information Retrieval System L. Shen 1, Y. K. Lim 1, H. T. Loh 2 1 Design Technology Institute Ltd, National University of Singapore, Singapore 2 Department of Mechanical
More informationChapter 6: Information Retrieval and Web Search. An introduction
Chapter 6: Information Retrieval and Web Search An introduction Introduction n Text mining refers to data mining using text documents as data. n Most text mining tasks use Information Retrieval (IR) methods
More informationSemantic Web. Ontology Alignment. Morteza Amini. Sharif University of Technology Fall 94-95
ه عا ی Semantic Web Ontology Alignment Morteza Amini Sharif University of Technology Fall 94-95 Outline The Problem of Ontologies Ontology Heterogeneity Ontology Alignment Overall Process Similarity Methods
More informationContextual Information Retrieval Using Ontology-Based User Profiles
Contextual Information Retrieval Using Ontology-Based User Profiles Vishnu Kanth Reddy Challam Master s Thesis Defense Date: Jan 22 nd, 2004. Committee Dr. Susan Gauch(Chair) Dr.David Andrews Dr. Jerzy
More informationBasic Tokenizing, Indexing, and Implementation of Vector-Space Retrieval
Basic Tokenizing, Indexing, and Implementation of Vector-Space Retrieval 1 Naïve Implementation Convert all documents in collection D to tf-idf weighted vectors, d j, for keyword vocabulary V. Convert
More informationWEIGHTING QUERY TERMS USING WORDNET ONTOLOGY
IJCSNS International Journal of Computer Science and Network Security, VOL.9 No.4, April 2009 349 WEIGHTING QUERY TERMS USING WORDNET ONTOLOGY Mohammed M. Sakre Mohammed M. Kouta Ali M. N. Allam Al Shorouk
More informationInformation Retrieval
Information Retrieval CSC 375, Fall 2016 An information retrieval system will tend not to be used whenever it is more painful and troublesome for a customer to have information than for him not to have
More informationDisambiguating Search by Leveraging a Social Context Based on the Stream of User s Activity
Disambiguating Search by Leveraging a Social Context Based on the Stream of User s Activity Tomáš Kramár, Michal Barla and Mária Bieliková Faculty of Informatics and Information Technology Slovak University
More informationSearching the Deep Web
Searching the Deep Web 1 What is Deep Web? Information accessed only through HTML form pages database queries results embedded in HTML pages Also can included other information on Web can t directly index
More informationKNOWLEDGE GRAPH: FROM METADATA TO INFORMATION VISUALIZATION AND BACK. Xia Lin College of Computing and Informatics Drexel University Philadelphia, PA
KNOWLEDGE GRAPH: FROM METADATA TO INFORMATION VISUALIZATION AND BACK Xia Lin College of Computing and Informatics Drexel University Philadelphia, PA 1 A little background of me Teach at Drexel University
More informationWeb Information Retrieval using WordNet
Web Information Retrieval using WordNet Jyotsna Gharat Asst. Professor, Xavier Institute of Engineering, Mumbai, India Jayant Gadge Asst. Professor, Thadomal Shahani Engineering College Mumbai, India ABSTRACT
More informationA Document-centered Approach to a Natural Language Music Search Engine
A Document-centered Approach to a Natural Language Music Search Engine Peter Knees, Tim Pohle, Markus Schedl, Dominik Schnitzer, and Klaus Seyerlehner Dept. of Computational Perception, Johannes Kepler
More informationJianyong Wang Department of Computer Science and Technology Tsinghua University
Jianyong Wang Department of Computer Science and Technology Tsinghua University jianyong@tsinghua.edu.cn Joint work with Wei Shen (Tsinghua), Ping Luo (HP), and Min Wang (HP) Outline Introduction to entity
More informationSYSTEMS FOR NON STRUCTURED INFORMATION MANAGEMENT
SYSTEMS FOR NON STRUCTURED INFORMATION MANAGEMENT Prof. Dipartimento di Elettronica e Informazione Politecnico di Milano INFORMATION SEARCH AND RETRIEVAL Inf. retrieval 1 PRESENTATION SCHEMA GOALS AND
More informationTag-based Social Interest Discovery
Tag-based Social Interest Discovery Xin Li / Lei Guo / Yihong (Eric) Zhao Yahoo!Inc 2008 Presented by: Tuan Anh Le (aletuan@vub.ac.be) 1 Outline Introduction Data set collection & Pre-processing Architecture
More informationIJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, 2013 ISSN:
Semi Automatic Annotation Exploitation Similarity of Pics in i Personal Photo Albums P. Subashree Kasi Thangam 1 and R. Rosy Angel 2 1 Assistant Professor, Department of Computer Science Engineering College,
More informationInformation Retrieval. (M&S Ch 15)
Information Retrieval (M&S Ch 15) 1 Retrieval Models A retrieval model specifies the details of: Document representation Query representation Retrieval function Determines a notion of relevance. Notion
More informationPERSONAL WEB REVISITATION BY CONTEXT AND CONTENT KEYWORDS WITH RELEVANCE FEEDBACK
PERSONAL WEB REVISITATION BY CONTEXT AND CONTENT KEYWORDS WITH RELEVANCE FEEDBACK Getting back to previously viewed web pages is a common yet uneasy task for users due to the large volume of personally
More informationRyen W. White, Peter Bailey, Liwei Chen Microsoft Corporation
Ryen W. White, Peter Bailey, Liwei Chen Microsoft Corporation Motivation Information behavior is embedded in external context Context motivates the problem, influences interaction IR community theorized
More informationToday s topic CS347. Results list clustering example. Why cluster documents. Clustering documents. Lecture 8 May 7, 2001 Prabhakar Raghavan
Today s topic CS347 Clustering documents Lecture 8 May 7, 2001 Prabhakar Raghavan Why cluster documents Given a corpus, partition it into groups of related docs Recursively, can induce a tree of topics
More informationCS490W. Text Clustering. Luo Si. Department of Computer Science Purdue University
CS490W Text Clustering Luo Si Department of Computer Science Purdue University [Borrows slides from Chris Manning, Ray Mooney and Soumen Chakrabarti] Clustering Document clustering Motivations Document
More informationExploring Ancient Texts with a User Driven Concept Search
Exploring Ancient Texts with a User Driven Concept Search Muhammad Faisal Cheema, Stefan Jänicke, Christoph Weilbach, Judith Blumenstein, Gerik Scheuermann Leipzig University, Germany exchange: Exploring
More informationCADIAL Search Engine at INEX
CADIAL Search Engine at INEX Jure Mijić 1, Marie-Francine Moens 2, and Bojana Dalbelo Bašić 1 1 Faculty of Electrical Engineering and Computing, University of Zagreb, Unska 3, 10000 Zagreb, Croatia {jure.mijic,bojana.dalbelo}@fer.hr
More informationCMPSCI 646, Information Retrieval (Fall 2003)
CMPSCI 646, Information Retrieval (Fall 2003) Midterm exam solutions Problem CO (compression) 1. The problem of text classification can be described as follows. Given a set of classes, C = {C i }, where
More informationOntology-Based Web Query Classification for Research Paper Searching
Ontology-Based Web Query Classification for Research Paper Searching MyoMyo ThanNaing University of Technology(Yatanarpon Cyber City) Mandalay,Myanmar Abstract- In web search engines, the retrieval of
More informationConceptual document indexing using a large scale semantic dictionary providing a concept hierarchy
Conceptual document indexing using a large scale semantic dictionary providing a concept hierarchy Martin Rajman, Pierre Andrews, María del Mar Pérez Almenta, and Florian Seydoux Artificial Intelligence
More informationihits: Extending HITS for Personal Interests Profiling
ihits: Extending HITS for Personal Interests Profiling Ziming Zhuang School of Information Sciences and Technology The Pennsylvania State University zzhuang@ist.psu.edu Abstract Ever since the boom of
More informationHome Page. Title Page. Page 1 of 14. Go Back. Full Screen. Close. Quit
Page 1 of 14 Retrieving Information from the Web Database and Information Retrieval (IR) Systems both manage data! The data of an IR system is a collection of documents (or pages) User tasks: Browsing
More informationInformation Retrieval
Introduction Information Retrieval Information retrieval is a field concerned with the structure, analysis, organization, storage, searching and retrieval of information Gerard Salton, 1968 J. Pei: Information
More informationInformation Retrieval
Multimedia Computing: Algorithms, Systems, and Applications: Information Retrieval and Search Engine By Dr. Yu Cao Department of Computer Science The University of Massachusetts Lowell Lowell, MA 01854,
More informationDeveloping Focused Crawlers for Genre Specific Search Engines
Developing Focused Crawlers for Genre Specific Search Engines Nikhil Priyatam Thesis Advisor: Prof. Vasudeva Varma IIIT Hyderabad July 7, 2014 Examples of Genre Specific Search Engines MedlinePlus Naukri.com
More informationMetadata Topic Harmonization and Semantic Search for Linked-Data-Driven Geoportals -- A Case Study Using ArcGIS Online
Metadata Topic Harmonization and Semantic Search for Linked-Data-Driven Geoportals -- A Case Study Using ArcGIS Online Yingjie Hu 1, Krzysztof Janowicz 1, Sathya Prasad 2, and Song Gao 1 1 STKO Lab, Department
More informationLIST OF ACRONYMS & ABBREVIATIONS
LIST OF ACRONYMS & ABBREVIATIONS ARPA CBFSE CBR CS CSE FiPRA GUI HITS HTML HTTP HyPRA NoRPRA ODP PR RBSE RS SE TF-IDF UI URI URL W3 W3C WePRA WP WWW Alpha Page Rank Algorithm Context based Focused Search
More informationCS377: Database Systems Text data and information. Li Xiong Department of Mathematics and Computer Science Emory University
CS377: Database Systems Text data and information retrieval Li Xiong Department of Mathematics and Computer Science Emory University Outline Information Retrieval (IR) Concepts Text Preprocessing Inverted
More informationText Mining. Representation of Text Documents
Data Mining is typically concerned with the detection of patterns in numeric data, but very often important (e.g., critical to business) information is stored in the form of text. Unlike numeric data,
More informationDocument Clustering for Mediated Information Access The WebCluster Project
Document Clustering for Mediated Information Access The WebCluster Project School of Communication, Information and Library Sciences Rutgers University The original WebCluster project was conducted at
More informationA Survey On Diversification Techniques For Unabmiguous But Under- Specified Queries
J. Appl. Environ. Biol. Sci., 4(7S)271-276, 2014 2014, TextRoad Publication ISSN: 2090-4274 Journal of Applied Environmental and Biological Sciences www.textroad.com A Survey On Diversification Techniques
More informationDATA MINING II - 1DL460. Spring 2014"
DATA MINING II - 1DL460 Spring 2014" A second course in data mining http://www.it.uu.se/edu/course/homepage/infoutv2/vt14 Kjell Orsborn Uppsala Database Laboratory Department of Information Technology,
More informationCS 6320 Natural Language Processing
CS 6320 Natural Language Processing Information Retrieval Yang Liu Slides modified from Ray Mooney s (http://www.cs.utexas.edu/users/mooney/ir-course/slides/) 1 Introduction of IR System components, basic
More informationSearch Results Clustering in Polish: Evaluation of Carrot
Search Results Clustering in Polish: Evaluation of Carrot DAWID WEISS JERZY STEFANOWSKI Institute of Computing Science Poznań University of Technology Introduction search engines tools of everyday use
More informationLecture 8 May 7, Prabhakar Raghavan
Lecture 8 May 7, 2001 Prabhakar Raghavan Clustering documents Given a corpus, partition it into groups of related docs Recursively, can induce a tree of topics Given the set of docs from the results of
More informationInformation Retrieval and Web Search Engines
Information Retrieval and Web Search Engines Lecture 7: Document Clustering May 25, 2011 Wolf-Tilo Balke and Joachim Selke Institut für Informationssysteme Technische Universität Braunschweig Homework
More informationContextual Search using Cognitive Discovery Capabilities
Contextual Search using Cognitive Discovery Capabilities In this exercise, you will work with a sample application that uses the Watson Discovery service API s for cognitive search use cases. Discovery
More informationUNIT-V WEB MINING. 3/18/2012 Prof. Asha Ambhaikar, RCET Bhilai.
UNIT-V WEB MINING 1 Mining the World-Wide Web 2 What is Web Mining? Discovering useful information from the World-Wide Web and its usage patterns. 3 Web search engines Index-based: search the Web, index
More informationA B2B Search Engine. Abstract. Motivation. Challenges. Technical Report
Technical Report A B2B Search Engine Abstract In this report, we describe a business-to-business search engine that allows searching for potential customers with highly-specific queries. Currently over
More informationEMC GREENPLUM MANAGEMENT ENABLED BY AGINITY WORKBENCH
White Paper EMC GREENPLUM MANAGEMENT ENABLED BY AGINITY WORKBENCH A Detailed Review EMC SOLUTIONS GROUP Abstract This white paper discusses the features, benefits, and use of Aginity Workbench for EMC
More informationUnderstanding the Query: THCIB and THUIS at NTCIR-10 Intent Task. Junjun Wang 2013/4/22
Understanding the Query: THCIB and THUIS at NTCIR-10 Intent Task Junjun Wang 2013/4/22 Outline Introduction Related Word System Overview Subtopic Candidate Mining Subtopic Ranking Results and Discussion
More informationA Linguistic Approach for Semantic Web Service Discovery
A Linguistic Approach for Semantic Web Service Discovery Jordy Sangers 307370js jordysangers@hotmail.com Bachelor Thesis Economics and Informatics Erasmus School of Economics Erasmus University Rotterdam
More informationTelling Experts from Spammers Expertise Ranking in Folksonomies
32 nd Annual ACM SIGIR 09 Boston, USA, Jul 19-23 2009 Telling Experts from Spammers Expertise Ranking in Folksonomies Michael G. Noll (Albert) Ching-Man Au Yeung Christoph Meinel Nicholas Gibbins Nigel
More informationText Analytics (Text Mining)
CSE 6242 / CX 4242 Apr 1, 2014 Text Analytics (Text Mining) Concepts and Algorithms Duen Horng (Polo) Chau Georgia Tech Some lectures are partly based on materials by Professors Guy Lebanon, Jeffrey Heer,
More informationLarge Scale Chinese News Categorization. Peng Wang. Joint work with H. Zhang, B. Xu, H.W. Hao
Large Scale Chinese News Categorization --based on Improved Feature Selection Method Peng Wang Joint work with H. Zhang, B. Xu, H.W. Hao Computational-Brain Research Center Institute of Automation, Chinese
More informationEQUELLA. Searching User Guide. Version 6.2
EQUELLA Searching User Guide Version 6.2 Document History Document No. Reviewed Finalised Published 1 5/12/2013 6/12/2013 6/12/2013 December 2013 edition. Information in this document may change without
More informationStudying the Impact of Text Summarization on Contextual Advertising
Studying the Impact of Text Summarization on Contextual Advertising G. Armano, A. Giuliani, and E. Vargiu Intelligent Agents and Soft-Computing Group Dept. of Electrical and Electronic Engineering University
More informationIntroduction to Information Retrieval
Introduction to Information Retrieval Mohsen Kamyar چهارمین کارگاه ساالنه آزمایشگاه فناوری و وب بهمن ماه 1391 Outline Outline in classic categorization Information vs. Data Retrieval IR Models Evaluation
More informationShort Text Categorization Exploiting Contextual Enrichment and External Knowledge
Short Text Categorization Exploiting Contextual Enrichment and External Knowledge Stefano Mizzaro, Marco Pavan, Ivan Scagnetto, Martino Valenti!! University of Udine, Italy 1 Disclaimer Keep it simple,
More informationExtracting Rankings for Spatial Keyword Queries from GPS Data
Extracting Rankings for Spatial Keyword Queries from GPS Data Ilkcan Keles Christian S. Jensen Simonas Saltenis Aalborg University Outline Introduction Motivation Problem Definition Proposed Method Overview
More informationA Content Based Image Retrieval System Based on Color Features
A Content Based Image Retrieval System Based on Features Irena Valova, University of Rousse Angel Kanchev, Department of Computer Systems and Technologies, Rousse, Bulgaria, Irena@ecs.ru.acad.bg Boris
More informationUser guide for GEM-TREND
User guide for GEM-TREND 1. Requirements for Using GEM-TREND GEM-TREND is implemented as a java applet which can be run in most common browsers and has been test with Internet Explorer 7.0, Internet Explorer
More informationCustomer Clustering using RFM analysis
Customer Clustering using RFM analysis VASILIS AGGELIS WINBANK PIRAEUS BANK Athens GREECE AggelisV@winbank.gr DIMITRIS CHRISTODOULAKIS Computer Engineering and Informatics Department University of Patras
More informationTowards Efficient and Effective Semantic Table Interpretation Ziqi Zhang
Towards Efficient and Effective Semantic Table Interpretation Ziqi Zhang Department of Computer Science, University of Sheffield Outline Define semantic table interpretation State-of-the-art and motivation
More informationIdentifying Web Spam With User Behavior Analysis
Identifying Web Spam With User Behavior Analysis Yiqun Liu, Rongwei Cen, Min Zhang, Shaoping Ma, Liyun Ru State Key Lab of Intelligent Tech. & Sys. Tsinghua University 2008/04/23 Introduction simple math
More informationClustering Results. Result List Example. Clustering Results. Information Retrieval
Information Retrieval INFO 4300 / CS 4300! Presenting Results Clustering Clustering Results! Result lists often contain documents related to different aspects of the query topic! Clustering is used to
More informationClustering (COSC 488) Nazli Goharian. Document Clustering.
Clustering (COSC 488) Nazli Goharian nazli@ir.cs.georgetown.edu 1 Document Clustering. Cluster Hypothesis : By clustering, documents relevant to the same topics tend to be grouped together. C. J. van Rijsbergen,
More informationSearch Engines. Information Retrieval in Practice
Search Engines Information Retrieval in Practice All slides Addison Wesley, 2008 Classification and Clustering Classification and clustering are classical pattern recognition / machine learning problems
More informationInferring User s Information Context: Integrating User Profiles and Concept Hierarchies
Inferring User s Information Context: Integrating User Profiles and Concept Hierarchies Ahu Sieg, Bamshad Mobasher, and Robin Burke DePaul University School of Computer Science, Telecommunication and Information
More informationModule 1: Internet Basics for Web Development (II)
INTERNET & WEB APPLICATION DEVELOPMENT SWE 444 Fall Semester 2008-2009 (081) Module 1: Internet Basics for Web Development (II) Dr. El-Sayed El-Alfy Computer Science Department King Fahd University of
More informationInformation Retrieval and Web Search Engines
Information Retrieval and Web Search Engines Lecture 7: Document Clustering December 4th, 2014 Wolf-Tilo Balke and José Pinto Institut für Informationssysteme Technische Universität Braunschweig The Cluster
More informationSearch Engine Architecture II
Search Engine Architecture II Primary Goals of Search Engines Effectiveness (quality): to retrieve the most relevant set of documents for a query Process text and store text statistics to improve relevance
More informationCS473: Course Review CS-473. Luo Si Department of Computer Science Purdue University
CS473: CS-473 Course Review Luo Si Department of Computer Science Purdue University Basic Concepts of IR: Outline Basic Concepts of Information Retrieval: Task definition of Ad-hoc IR Terminologies and
More informationCHAPTER 5 OPTIMAL CLUSTER-BASED RETRIEVAL
85 CHAPTER 5 OPTIMAL CLUSTER-BASED RETRIEVAL 5.1 INTRODUCTION Document clustering can be applied to improve the retrieval process. Fast and high quality document clustering algorithms play an important
More informationAPPROACHES TO IMPLEMENT SEMANTIC SEARCH. Johannes Peter Product Owner / Architect for Search
APPROACHES TO IMPLEMENT SEMANTIC SEARCH Johannes Peter Product Owner / Architect for Search 1 WHAT IS SEMANTIC SEARCH? 2 Success of search Interface of shops to brains of customers Wide range of usage
More informationGoNTogle: A Tool for Semantic Annotation and Search
GoNTogle: A Tool for Semantic Annotation and Search Giorgos Giannopoulos 1,2, Nikos Bikakis 1,2, Theodore Dalamagas 2, and Timos Sellis 1,2 1 KDBSL Lab, School of ECE, Nat. Tech. Univ. of Athens, Greece
More informationGoNTogle: A Tool for Semantic Annotation and Search
GoNTogle: A Tool for Semantic Annotation and Search Giorgos Giannopoulos 1,2, Nikos Bikakis 1, Theodore Dalamagas 2 and Timos Sellis 1,2 1 KDBS Lab, School of ECE, NTU Athens, Greece. {giann@dblab.ntua.gr,
More informationConcept-Based Document Similarity Based on Suffix Tree Document
Concept-Based Document Similarity Based on Suffix Tree Document *P.Perumal Sri Ramakrishna Engineering College Associate Professor Department of CSE, Coimbatore perumalsrec@gmail.com R. Nedunchezhian Sri
More informationClustering (COSC 416) Nazli Goharian. Document Clustering.
Clustering (COSC 416) Nazli Goharian nazli@cs.georgetown.edu 1 Document Clustering. Cluster Hypothesis : By clustering, documents relevant to the same topics tend to be grouped together. C. J. van Rijsbergen,
More informationwhite paper 4 Steps to Better Keyword Grouping Strategies for More Effective & Profitable Keyword Segmentation
white paper 4 Steps to Better Keyword Grouping Strategies for More Effective & Profitable Keyword Segmentation 2009, WordStream, Inc. All rights reserved. WordStream technologies are protected by pending
More informationSearching the Web What is this Page Known for? Luis De Alba
Searching the Web What is this Page Known for? Luis De Alba ldealbar@cc.hut.fi Searching the Web Arasu, Cho, Garcia-Molina, Paepcke, Raghavan August, 2001. Stanford University Introduction People browse
More informationSemantically Driven Snippet Selection for Supporting Focused Web Searches
Semantically Driven Snippet Selection for Supporting Focused Web Searches IRAKLIS VARLAMIS Harokopio University of Athens Department of Informatics and Telematics, 89, Harokopou Street, 176 71, Athens,
More informationTowards Summarizing the Web of Entities
Towards Summarizing the Web of Entities contributors: August 15, 2012 Thomas Hofmann Director of Engineering Search Ads Quality Zurich, Google Switzerland thofmann@google.com Enrique Alfonseca Yasemin
More informationFACULTY OF ENGINEERING B.E. 4/4 (CSE) II Semester (Old) Examination, June Subject : Information Retrieval Systems (Elective III) Estelar
B.E. 4/4 (CSE) II Semester (Old) Examination, June 2014 Subject : Information Retrieval Systems Code No. 6306 / O 1 Define Information retrieval systems. 3 2 What is precision and recall? 3 3 List the
More informationNews Article Matcher. Team: Rohan Sehgal, Arnold Kao, Nithin Kunala
News Article Matcher Team: Rohan Sehgal, Arnold Kao, Nithin Kunala Abstract: The news article matcher is a search engine that allows you to input an entire news article and it returns articles that are
More informationSystem Requirement Specification
Fair Hearing Resource Center System Requirement Specification State University of New York at Buffalo Team 20 (Amit Shrikant Kulkarni) (Chirag Todarka) (Satish Saley) (Uday Deep Singh) Table of Contents
More information