An adaptable search system for collection of partially structured documents
|
|
- Blaze Robertson
- 5 years ago
- Views:
Transcription
1 Samantha Riccadonna An adaptable search system for collection of partially structured documents by Udo Kruschwitz Web Information Retrieval Course A.Y
2 Outline Search system overview Few concepts and observations: Adaptive (W)IR Searching Web Documents The domain model The general framework Conclusions References 2
3 Search system overview Generic framework: online search system accessing standard search engine automatically constructed domain model built using implicit knowledge (markup) dialogue system: interacting with user by offering options to refine or relax the query 3
4 Adaptive (W)IR adaptive techniques improve their performance over time, in response to feedback they receive on prior performance [Belew,, 2000] Relevance Feedback Techniques [Gudivada et al.,1997] 4
5 Searching Web documents queries are normally very short (average 2.3 words) search engine will retrieve numerous docs most of users do not perform any query modification [Silverstein et al.,1998] OBS: system that applies a domain model to propose query refinements must perform very well for the user to accept it 5
6 Searching Web documents interactive query expansion process improves search process inexperienced user do not make good term selections from potentially relevant term list significant improvement for experienced user [Margennis and van Rijsbergen,1997] advice to help users reformulate queries improves relevance of retrieved documents [Bruza et al.,2000] 6
7 The Domain Model (DM) Domain model is automatically acquired by exploiting the markup of documents Main issues: 1. extracting the concepts 2. constructing the domain model (organize concepts as a set of simple hierarchies) 3. applying the model as part of the search system 7
8 DM: extracting the concepts CONCEPT DOMAIN-INDEPENDENT An index term c is a concept term of type n for document d, if c occurs in at least n different markup contexts Partially structured documents documents can be divided into indifferent markup context (explicit or implicit structure) [HTML docs] doc titles, bold text, underlined text, [articles] article headings, captions, summaries, 8
9 DM: extracting the concepts CONCEPT An index term c is a concept term of type n for document d, if c occurs in at least n different markup contexts document type a concept of type n is a term found in at least n markup context OBS: - type 2 and type 3 concepts - higher type concepts are more reliable, but more sparse 9
10 DM: extracting the concepts Partially structured documents documents can be divided into indifferent markup context (explicit or implicit structure) use the structure to extract significant terms not rely on a single type of information: might be used infrequently could be used for spamming (HTML metatags) no assumptions about semantic interpretation of particular type of information 10
11 DM: concept hierarchies world model set of simple concept hierarchies AIM: RELATED CONCEPT establish that there is some relation (will be used to guide search process), no interest in semantic inter-concept relations Two concepts c 1,c 2 are related concepts of type n for document d, if c 1 and c 2 are concepts of type n for document d. 11
12 DM: concept hierarchies world model set of simple concept hierarchies AIM: establish that there is some relation (will be used to guide search process), no interest in semantic inter-concept relations relation construction process: - automatic - offline 12
13 DM: concept hierarchies concepts identified in the indexing process are likely to be among user queries keywords/terms (log analysis) model construction: iterative process each concept is a potential query ( appropriate concept hierarchy) (sequence of) user request simulation, using the concepts identified in the documents (no live user queries) 13
14 DM: concept hierarchies model construction: query modification step: exploration of all related concepts following relations detected in the source data (extended by single concept) new node added if there are documents matching the refined query limit to the number of branches originating in a node ( usable model) arc weights=number of matches that a node represent 14
15 DM: applying the model application of the model as part of the search system to help the user navigate the documents retrieved assist user in ad hoc search task support browse process (vs search process): user can get a perception of actually available data 15
16 DM: applying the model Dialogue manager: central part of the search system controls access to the domain model and search engine 16
17 The Domain Model (DM) usually not available can be constructed by hand-coding (using existing domain-independent knowledge sources) by processing available data into a concept model offline (by analyzing the entire document collection) online (by investigating docs retrieved for query) 17
18 The general framework Domain Model: automatically acquired by exploiting the markup of documents built offline by analyzing the entire document collection no assumptions about the domain type or semantic content 18
19 The general framework combines UKsearch system and std search engine: query submitted to both merge of the result 19
20 The general framework ADVANTAGES: initial response time depends on search engine technology conceptual model needed only if std search is unsatisfactory 20
21 Conclusions automatically created domain model relations not present in general knowledge sources (world knowledge vs. linguistic information) changing domain suited for document collection limited in size or domain-specific portable system (could be run on completely different collections) simple domain-independent dialogue manager assisting users (interactive search task) 21
22 References [Kruschwitz,, 2003] U. Kruschwitz, An Adaptable Search System for Collections of Partially Structured Documents, Intelligent Systems, IEEE, 18 (4), Jul-Aug 2003, pp [Belew,, 2000] R. K. Belew, Finding Out About, Cambridge University Press, Cambridge, [Gudivada et al, 1998] V. N. Gudivada, V.V. Raghavan, W. I. Grosky, R. Kasanagottu, Information Retrieval on the World Wide Web, Internet Computing, IEEE, 1 (5), Sep/Oct 1997, pp [Silverstein et al, 1998] [Silverstein et al, 1998] C. Silverstein, M. Henzinger, and H. Marais, Analysis of a Very Large alta Vista Query Log, Digital SRC Technical Note ,
23 References [Margennis and van Rijsbergen,1997] M. Margennis and C.J. van Rijsbergen, The Potential and Actual Effectiveness of Interactive Query Expansion, Proc. 20th Ann. Int l ACM SIGIR Conf. Research and Development in Information Retrieval,ACM Press,1997, pp [Bruza et al., 2000] P. Bruza, R. McArthur, and S. Dennis, Interactive Internet Search: Keyword, Directory, and Query Reformulation Mechanisms Compared, Proc. 23rd Ann. Int l ACM SIGIR Conf. Research and Development in Information Retrieval, ACM Press, 2000, pp
24 24
second_language research_teaching sla vivian_cook language_department idl
Using Implicit Relevance Feedback in a Web Search Assistant Maria Fasli and Udo Kruschwitz Department of Computer Science, University of Essex, Wivenhoe Park, Colchester, CO4 3SQ, United Kingdom fmfasli
More informationInformation Retrieval
Multimedia Computing: Algorithms, Systems, and Applications: Information Retrieval and Search Engine By Dr. Yu Cao Department of Computer Science The University of Massachusetts Lowell Lowell, MA 01854,
More informationConcept-Based Document Similarity Based on Suffix Tree Document
Concept-Based Document Similarity Based on Suffix Tree Document *P.Perumal Sri Ramakrishna Engineering College Associate Professor Department of CSE, Coimbatore perumalsrec@gmail.com R. Nedunchezhian Sri
More informationA New Technique to Optimize User s Browsing Session using Data Mining
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 3, March 2015,
More informationThis document is downloaded from DR-NTU, Nanyang Technological University Library, Singapore.
This document is downloaded from DR-NTU, Nanyang Technological University Library, Singapore. Title Query formulation with a search assistant. Author(s) Citation Fu, Lin.; Goh, Dion Hoe-Lian.; Foo, Schubert
More informationFinding Relevant Documents using Top Ranking Sentences: An Evaluation of Two Alternative Schemes
Finding Relevant Documents using Top Ranking Sentences: An Evaluation of Two Alternative Schemes Ryen W. White Department of Computing Science University of Glasgow Glasgow. G12 8QQ whiter@dcs.gla.ac.uk
More informationLecture 7: Relevance Feedback and Query Expansion
Lecture 7: Relevance Feedback and Query Expansion Information Retrieval Computer Science Tripos Part II Ronan Cummins Natural Language and Information Processing (NLIP) Group ronan.cummins@cl.cam.ac.uk
More informationImproving Adaptive Hypermedia by Adding Semantics
Improving Adaptive Hypermedia by Adding Semantics Anton ANDREJKO Slovak University of Technology Faculty of Informatics and Information Technologies Ilkovičova 3, 842 16 Bratislava, Slovak republic andrejko@fiit.stuba.sk
More informationJan Pedersen 22 July 2010
Jan Pedersen 22 July 2010 Outline Problem Statement Best effort retrieval vs automated reformulation Query Evaluation Architecture Query Understanding Models Data Sources Standard IR Assumptions Queries
More informationKnowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A.
Knowledge Retrieval Franz J. Kurfess Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. 1 Acknowledgements This lecture series has been sponsored by the European
More informationA World Wide Web-based HCI-library Designed for Interaction Studies
A World Wide Web-based HCI-library Designed for Interaction Studies Ketil Perstrup, Erik Frøkjær, Maria Konstantinovitz, Thorbjørn Konstantinovitz, Flemming S. Sørensen, Jytte Varming Department of Computing,
More information21. Search Models and UIs for IR
21. Search Models and UIs for IR INFO 202-10 November 2008 Bob Glushko Plan for Today's Lecture The "Classical" Model of Search and the "Classical" UI for IR Web-based Search Best practices for UIs in
More informationUsing Clusters on the Vivisimo Web Search Engine
Using Clusters on the Vivisimo Web Search Engine Sherry Koshman and Amanda Spink School of Information Sciences University of Pittsburgh 135 N. Bellefield Ave., Pittsburgh, PA 15237 skoshman@sis.pitt.edu,
More informationijade Reporter An Intelligent Multi-agent Based Context Aware News Reporting System
ijade Reporter An Intelligent Multi-agent Based Context Aware Reporting System Eddie C.L. Chan and Raymond S.T. Lee The Department of Computing, The Hong Kong Polytechnic University, Hung Hong, Kowloon,
More informationEnhancing Cluster Quality by Using User Browsing Time
Enhancing Cluster Quality by Using User Browsing Time Rehab M. Duwairi* and Khaleifah Al.jada'** * Department of Computer Information Systems, Jordan University of Science and Technology, Irbid 22110,
More informationDesigning for Web Using Markup Language and Style Sheets
Module Presenter s Manual Designing for Web Using Markup Language and Style Sheets Effective from: July 2014 Ver. 1.0 Amendment Record Version No. Effective Date Change Replaced Pages 1.0 July 2014 New
More informationEnhancing Cluster Quality by Using User Browsing Time
Enhancing Cluster Quality by Using User Browsing Time Rehab Duwairi Dept. of Computer Information Systems Jordan Univ. of Sc. and Technology Irbid, Jordan rehab@just.edu.jo Khaleifah Al.jada' Dept. of
More informationI&R SYSTEMS ON THE INTERNET/INTRANET CITES AS THE TOOL FOR DISTANCE LEARNING. Andrii Donchenko
International Journal "Information Technologies and Knowledge" Vol.1 / 2007 293 I&R SYSTEMS ON THE INTERNET/INTRANET CITES AS THE TOOL FOR DISTANCE LEARNING Andrii Donchenko Abstract: This article considers
More informationInformation Retrieval
Natural Language Processing SoSe 2015 Information Retrieval Dr. Mariana Neves June 22nd, 2015 (based on the slides of Dr. Saeedeh Momtazi) Outline Introduction Indexing Block 2 Document Crawling Text Processing
More informationDesigning for Multimedia
1 ing for Multi Phil Gray Outline What s Special about Multi? A Method Based on the City Method Developed by Alistair Sutcliffe and Stephanie Wilson Evaluation 2 What s Special About Multi? Rich forms
More informationNatural Language Processing
Natural Language Processing Information Retrieval Potsdam, 14 June 2012 Saeedeh Momtazi Information Systems Group based on the slides of the course book Outline 2 1 Introduction 2 Indexing Block Document
More informationDocument Clustering for Mediated Information Access The WebCluster Project
Document Clustering for Mediated Information Access The WebCluster Project School of Communication, Information and Library Sciences Rutgers University The original WebCluster project was conducted at
More informationKDD 10 Tutorial: Recommender Problems for Web Applications. Deepak Agarwal and Bee-Chung Chen Yahoo! Research
KDD 10 Tutorial: Recommender Problems for Web Applications Deepak Agarwal and Bee-Chung Chen Yahoo! Research Agenda Focus: Recommender problems for dynamic, time-sensitive applications Content Optimization
More informationEleven+ Views of Semantic Search
Eleven+ Views of Semantic Search Denise A. D. Bedford, Ph.d. Goodyear Professor of Knowledge Management Information Architecture and Knowledge Management Kent State University Presentation Focus Long-Term
More informationTREC 2016 Dynamic Domain Track: Exploiting Passage Representation for Retrieval and Relevance Feedback
RMIT @ TREC 2016 Dynamic Domain Track: Exploiting Passage Representation for Retrieval and Relevance Feedback Ameer Albahem ameer.albahem@rmit.edu.au Lawrence Cavedon lawrence.cavedon@rmit.edu.au Damiano
More informationISSN: , (2015): DOI:
www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 6 Issue 6 June 2017, Page No. 21737-21742 Index Copernicus value (2015): 58.10 DOI: 10.18535/ijecs/v6i6.31 A
More informationHuman-Computer Information Retrieval
Human-Computer Information Retrieval Gary Marchionini University of North Carolina at Chapel Hill march@ils.unc.edu CSAIL MIT November 12, 2004 Message IR and HCI are related fields that have strong (staid?)
More informationBitmap Indexing-based Clustering and Retrieval of XML Documents
Bitmap Indexing-based Clustering and Retrieval of XML Documents Jong P. Yoon Ctr. for Advanced Computer Studies University of Louisiana Lafayette, LA 754-433 jyoon@cacs.louisiana.edu Vijay Raghavan Ctr.
More informationIJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, ISSN:
IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, 20131 Improve Search Engine Relevance with Filter session Addlin Shinney R 1, Saravana Kumar T
More informationLearning Ranking Functions with SVMs
Learning Ranking Functions with SVMs CS4780/5780 Machine Learning Fall 2012 Thorsten Joachims Cornell University T. Joachims, Optimizing Search Engines Using Clickthrough Data, Proceedings of the ACM Conference
More informationInformation Retrieval
Natural Language Processing SoSe 2014 Information Retrieval Dr. Mariana Neves June 18th, 2014 (based on the slides of Dr. Saeedeh Momtazi) Outline Introduction Indexing Block 2 Document Crawling Text Processing
More informationSession 3 Presentation and Dialogue Design. Presentation Techniques. Design for Attention. Planning thematic threads through several media
Session 3 Presentation and Dialogue Design Planning thematic threads through several media Presentation layout: sequential or concurrent Drawing attention to key facts especially in image Specifying explicit
More informationSearch Engine Architecture. Hongning Wang
Search Engine Architecture Hongning Wang CS@UVa CS@UVa CS4501: Information Retrieval 2 Document Analyzer Classical search engine architecture The Anatomy of a Large-Scale Hypertextual Web Search Engine
More information@dvanced document management Holstraat 19, B-9000 Gent, tel: 09/ , fax: 09/
IKEM -toolkit Gives support for managing your electronic documents on the basis of a standardised knowledgebase. This is a semantic network of concepts that are inter-related according to ISO-norm thesaurus
More informationA Patent Retrieval Method Using a Hierarchy of Clusters at TUT
A Patent Retrieval Method Using a Hierarchy of Clusters at TUT Hironori Doi Yohei Seki Masaki Aono Toyohashi University of Technology 1-1 Hibarigaoka, Tenpaku-cho, Toyohashi-shi, Aichi 441-8580, Japan
More informationQuery Modifications Patterns During Web Searching
Bernard J. Jansen The Pennsylvania State University jjansen@ist.psu.edu Query Modifications Patterns During Web Searching Amanda Spink Queensland University of Technology ah.spink@qut.edu.au Bhuva Narayan
More informationBitCube: A Three-Dimensional Bitmap Indexing for XML Documents
BitCube: A Three-Dimensional Bitmap Indexing for XML Documents Jong P. Yoon Ctr. for Advanced Computer Studies University of Louisiana Lafayette, LA 754-433 jyoon@cacs.louisiana.edu Vijay Raghavan Ctr.
More informationPrivacy Protection in Personalized Web Search with User Profile
Privacy Protection in Personalized Web Search with User Profile Prateek C. Shukla 1,Tekchand D. Patil 2, Yogeshwar J. Shirsath 3,Dnyaneshwar N. Rasal 4 1,2,3,4, (I.T. Dept.,B.V.C.O.E.&R.I. Anjaneri,university.Pune,
More information5 Choosing keywords Initially choosing keywords Frequent and rare keywords Evaluating the competition rates of search
Seo tutorial Seo tutorial Introduction to seo... 4 1. General seo information... 5 1.1 History of search engines... 5 1.2 Common search engine principles... 6 2. Internal ranking factors... 8 2.1 Web page
More informationOntology Extraction from Heterogeneous Documents
Vol.3, Issue.2, March-April. 2013 pp-985-989 ISSN: 2249-6645 Ontology Extraction from Heterogeneous Documents Kirankumar Kataraki, 1 Sumana M 2 1 IV sem M.Tech/ Department of Information Science & Engg
More informationThis literature review provides an overview of the various topics related to using implicit
Vijay Deepak Dollu. Implicit Feedback in Information Retrieval: A Literature Analysis. A Master s Paper for the M.S. in I.S. degree. April 2005. 56 pages. Advisor: Stephanie W. Haas This literature review
More informationDomain Specific Search Engine for Students
Domain Specific Search Engine for Students Domain Specific Search Engine for Students Wai Yuen Tang The Department of Computer Science City University of Hong Kong, Hong Kong wytang@cs.cityu.edu.hk Lam
More informationarxiv:cs/ v1 [cs.ir] 26 Apr 2002
Navigating the Small World Web by Textual Cues arxiv:cs/0204054v1 [cs.ir] 26 Apr 2002 Filippo Menczer Department of Management Sciences The University of Iowa Iowa City, IA 52242 Phone: (319) 335-0884
More informationInformation Retrieval CS Lecture 01. Razvan C. Bunescu School of Electrical Engineering and Computer Science
Information Retrieval CS 6900 Razvan C. Bunescu School of Electrical Engineering and Computer Science bunescu@ohio.edu Information Retrieval Information Retrieval (IR) is finding material of an unstructured
More informationInformation Retrieval
Introduction Information Retrieval Information retrieval is a field concerned with the structure, analysis, organization, storage, searching and retrieval of information Gerard Salton, 1968 J. Pei: Information
More informationOntology Based Personalized Search Engine
Grand Valley State University ScholarWorks@GVSU Technical Library School of Computing and Information Systems 2014 Ontology Based Personalized Search Engine Divya Vemula Grand Valley State University Follow
More informationA JAVA-BASED SYSTEM FOR XML DATA PROTECTION* E. Bertino, M. Braun, S. Castano, E. Ferrari, M. Mesiti
CHAPTER 2 Author- A JAVA-BASED SYSTEM FOR XML DATA PROTECTION* E. Bertino, M. Braun, S. Castano, E. Ferrari, M. Mesiti Abstract Author- is a Java-based system for access control to XML documents. Author-
More informationStep 1 - Learning & Discovery
OUR PROCESS Our goal as a integrated marketing agency is to provide clients with the most creative and effective solutions for their business. Reaching that goal begins with our Learning and Discovery
More informationThe Use of Domain Modelling to Improve Performance Over a Query Session
The Use of Domain Modelling to Improve Performance Over a Query Session Deirdre Lungley Dyaa Albakour Udo Kruschwitz Apr 18, 2011 Table of contents 1 The AutoAdapt Project 2 3 4 5 The AutoAdapt Project
More informationIMAGE RETRIEVAL SYSTEM: BASED ON USER REQUIREMENT AND INFERRING ANALYSIS TROUGH FEEDBACK
IMAGE RETRIEVAL SYSTEM: BASED ON USER REQUIREMENT AND INFERRING ANALYSIS TROUGH FEEDBACK 1 Mount Steffi Varish.C, 2 Guru Rama SenthilVel Abstract - Image Mining is a recent trended approach enveloped in
More informationIntroduction to Text Mining. Hongning Wang
Introduction to Text Mining Hongning Wang CS@UVa Who Am I? Hongning Wang Assistant professor in CS@UVa since August 2014 Research areas Information retrieval Data mining Machine learning CS@UVa CS6501:
More informationChapter 2. Architecture of a Search Engine
Chapter 2 Architecture of a Search Engine Search Engine Architecture A software architecture consists of software components, the interfaces provided by those components and the relationships between them
More informationA Novel Approach for Restructuring Web Search Results by Feedback Sessions Using Fuzzy clustering
A Novel Approach for Restructuring Web Search Results by Feedback Sessions Using Fuzzy clustering R.Dhivya 1, R.Rajavignesh 2 (M.E CSE), Department of CSE, Arasu Engineering College, kumbakonam 1 Asst.
More informationGlossary. advance: to move forward
Computer Computer Skills Glossary Skills Glossary advance: to move forward alignment tab: the tab in the Format Cells dialog box that allows you to choose how the data in the cells will be aligned (left,
More informationCLUSTERING, TIERED INDEXES AND TERM PROXIMITY WEIGHTING IN TEXT-BASED RETRIEVAL
STUDIA UNIV. BABEŞ BOLYAI, INFORMATICA, Volume LVII, Number 4, 2012 CLUSTERING, TIERED INDEXES AND TERM PROXIMITY WEIGHTING IN TEXT-BASED RETRIEVAL IOAN BADARINZA AND ADRIAN STERCA Abstract. In this paper
More informationEffective Structured Query Formulation for Session Search
Effective Structured Query Formulation for Session Search Dongyi Guan Hui Yang Nazli Goharian Department of Computer Science Georgetown University 37 th and O Street, NW, Washington, DC, 20057 dg372@georgetown.edu,
More informationAn Efficient Technique for Tag Extraction and Content Retrieval from Web Pages
An Efficient Technique for Tag Extraction and Content Retrieval from Web Pages S.Sathya M.Sc 1, Dr. B.Srinivasan M.C.A., M.Phil, M.B.A., Ph.D., 2 1 Mphil Scholar, Department of Computer Science, Gobi Arts
More informationSupporting Fuzzy Keyword Search in Databases
I J C T A, 9(24), 2016, pp. 385-391 International Science Press Supporting Fuzzy Keyword Search in Databases Jayavarthini C.* and Priya S. ABSTRACT An efficient keyword search system computes answers as
More informationTip. Tip. Tip. Tap on the BoardEffect icon to open the App: Tap the green Get Started button in the lower right corner to begin.
Tap on the BoardEffect icon to open the App: Tap the green Get Started button in the lower right corner to begin. To skip this welcome screen in the future, click the button in the bottom left corner.
More informationLetter Pair Similarity Classification and URL Ranking Based on Feedback Approach
Letter Pair Similarity Classification and URL Ranking Based on Feedback Approach P.T.Shijili 1 P.G Student, Department of CSE, Dr.Nallini Institute of Engineering & Technology, Dharapuram, Tamilnadu, India
More informationSearch Costs vs. User Satisfaction on Mobile
Search Costs vs. User Satisfaction on Mobile Manisha Verma, Emine Yilmaz University College London mverma@cs.ucl.ac.uk, emine.yilmaz@ucl.ac.uk Abstract. Information seeking is an interactive process where
More information<is web> Information Systems & Semantic Web University of Koblenz Landau, Germany
Information Systems & University of Koblenz Landau, Germany Semantic Search examples: Swoogle and Watson Steffen Staad credit: Tim Finin (swoogle), Mathieu d Aquin (watson) and their groups 2009-07-17
More informationAN ENHANCED ATTRIBUTE RERANKING DESIGN FOR WEB IMAGE SEARCH
AN ENHANCED ATTRIBUTE RERANKING DESIGN FOR WEB IMAGE SEARCH Sai Tejaswi Dasari #1 and G K Kishore Babu *2 # Student,Cse, CIET, Lam,Guntur, India * Assistant Professort,Cse, CIET, Lam,Guntur, India Abstract-
More informationIJESRT. Scientific Journal Impact Factor: (ISRA), Impact Factor: 2.114
[Saranya, 4(3): March, 2015] ISSN: 2277-9655 IJESRT INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY A SURVEY ON KEYWORD QUERY ROUTING IN DATABASES N.Saranya*, R.Rajeshkumar, S.Saranya
More informationWEIGHTING QUERY TERMS USING WORDNET ONTOLOGY
IJCSNS International Journal of Computer Science and Network Security, VOL.9 No.4, April 2009 349 WEIGHTING QUERY TERMS USING WORDNET ONTOLOGY Mohammed M. Sakre Mohammed M. Kouta Ali M. N. Allam Al Shorouk
More informationClustering (COSC 488) Nazli Goharian. Document Clustering.
Clustering (COSC 488) Nazli Goharian nazli@ir.cs.georgetown.edu 1 Document Clustering. Cluster Hypothesis : By clustering, documents relevant to the same topics tend to be grouped together. C. J. van Rijsbergen,
More informationImgSeek: Capturing User s Intent For Internet Image Search
ImgSeek: Capturing User s Intent For Internet Image Search Abstract - Internet image search engines (e.g. Bing Image Search) frequently lean on adjacent text features. It is difficult for them to illustrate
More informationTREC-10 Web Track Experiments at MSRA
TREC-10 Web Track Experiments at MSRA Jianfeng Gao*, Guihong Cao #, Hongzhao He #, Min Zhang ##, Jian-Yun Nie**, Stephen Walker*, Stephen Robertson* * Microsoft Research, {jfgao,sw,ser}@microsoft.com **
More informationChapter XXI Using Action-Object Pairs as a Conceptual Framework for Transaction Log Analysis
414 Chapter XXI Using Action-Object Pairs as a Conceptual Framework for Transaction Log Analysis Mimi Zhang Pennsylvania State University, USA Bernard J. Jansen Pennsylvania State University, USA Abstract
More informationText Document Clustering Using DPM with Concept and Feature Analysis
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 2, Issue. 10, October 2013,
More informationAutomated Online News Classification with Personalization
Automated Online News Classification with Personalization Chee-Hong Chan Aixin Sun Ee-Peng Lim Center for Advanced Information Systems, Nanyang Technological University Nanyang Avenue, Singapore, 639798
More informationSemantic-Based Information Retrieval for Java Learning Management System
AENSI Journals Australian Journal of Basic and Applied Sciences Journal home page: www.ajbasweb.com Semantic-Based Information Retrieval for Java Learning Management System Nurul Shahida Tukiman and Amirah
More informationAutomatic Identification of User Goals in Web Search [WWW 05]
Automatic Identification of User Goals in Web Search [WWW 05] UichinLee @ UCLA ZhenyuLiu @ UCLA JunghooCho @ UCLA Presenter: Emiran Curtmola@ UC San Diego CSE 291 4/29/2008 Need to improve the quality
More informationUnobtrusive Data Collection for Web-Based Social Navigation
Unobtrusive Data Collection for Web-Based Social Navigation Katja Hofmann 1, Catherine Reed 2, Hilary Holz 2 California State University, East Bay 25800 Carlos Bee Boulevard Hayward, CA 94542 1 katja.hofmann@gmail.com
More informationChapter 6. Queries and Interfaces
Chapter 6 Queries and Interfaces Keyword Queries Simple, natural language queries were designed to enable everyone to search Current search engines do not perform well (in general) with natural language
More informationAdaptive Search Engines Learning Ranking Functions with SVMs
Adaptive Search Engines Learning Ranking Functions with SVMs CS478/578 Machine Learning Fall 24 Thorsten Joachims Cornell University T. Joachims, Optimizing Search Engines Using Clickthrough Data, Proceedings
More informationTheme Identification in RDF Graphs
Theme Identification in RDF Graphs Hanane Ouksili PRiSM, Univ. Versailles St Quentin, UMR CNRS 8144, Versailles France hanane.ouksili@prism.uvsq.fr Abstract. An increasing number of RDF datasets is published
More informationSTRUCTURE-BASED QUERY EXPANSION FOR XML SEARCH ENGINE
STRUCTURE-BASED QUERY EXPANSION FOR XML SEARCH ENGINE Wei-ning Qian, Hai-lei Qian, Li Wei, Yan Wang and Ao-ying Zhou Computer Science Department Fudan University Shanghai 200433 E-mail: wnqian@fudan.edu.cn
More informationBridging Versioning and Adaptive Hypermedia in the Dynamic Web
Bridging Versioning and Adaptive Hypermedia in the Dynamic Web Evgeny Knutov, Mykola Pechenizkiy, Paul De Bra Eindhoven University of Technology, Department of Computer Science PO Box 513, NL 5600 MB Eindhoven,
More informationEffective Latent Space Graph-based Re-ranking Model with Global Consistency
Effective Latent Space Graph-based Re-ranking Model with Global Consistency Feb. 12, 2009 1 Outline Introduction Related work Methodology Graph-based re-ranking model Learning a latent space graph A case
More informationCase-based Recommendation. Peter Brusilovsky with slides of Danielle Lee
Case-based Recommendation Peter Brusilovsky with slides of Danielle Lee Where we are? Search Navigation Recommendation Content-based Semantics / Metadata Social Modern E-Commerce Site The Power of Metadata
More informationExploiting Index Pruning Methods for Clustering XML Collections
Exploiting Index Pruning Methods for Clustering XML Collections Ismail Sengor Altingovde, Duygu Atilgan and Özgür Ulusoy Department of Computer Engineering, Bilkent University, Ankara, Turkey {ismaila,
More informationJianyong Wang Department of Computer Science and Technology Tsinghua University
Jianyong Wang Department of Computer Science and Technology Tsinghua University jianyong@tsinghua.edu.cn Joint work with Wei Shen (Tsinghua), Ping Luo (HP), and Min Wang (HP) Outline Introduction to entity
More informationCS506/606 - Topics in Information Retrieval
CS506/606 - Topics in Information Retrieval Instructors: Class time: Steven Bedrick, Brian Roark, Emily Prud hommeaux Tu/Th 11:00 a.m. - 12:30 p.m. September 25 - December 6, 2012 Class location: WCC 403
More informationIJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, 2013 ISSN:
Semi Automatic Annotation Exploitation Similarity of Pics in i Personal Photo Albums P. Subashree Kasi Thangam 1 and R. Rosy Angel 2 1 Assistant Professor, Department of Computer Science Engineering College,
More informationDevelopment of an Ontology-Based Portal for Digital Archive Services
Development of an Ontology-Based Portal for Digital Archive Services Ching-Long Yeh Department of Computer Science and Engineering Tatung University 40 Chungshan N. Rd. 3rd Sec. Taipei, 104, Taiwan chingyeh@cse.ttu.edu.tw
More informationAll Adobe Digital Design Vocabulary Absolute Div Tag Allows you to place any page element exactly where you want it Absolute Link Includes the
All Adobe Digital Design Vocabulary Absolute Div Tag Allows you to place any page element exactly where you want it Absolute Link Includes the complete URL of the linked document, including the domain
More informationClustering Results. Result List Example. Clustering Results. Information Retrieval
Information Retrieval INFO 4300 / CS 4300! Presenting Results Clustering Clustering Results! Result lists often contain documents related to different aspects of the query topic! Clustering is used to
More informationImage retrieval based on bag of images
University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2009 Image retrieval based on bag of images Jun Zhang University of Wollongong
More informationImage Similarity Measurements Using Hmok- Simrank
Image Similarity Measurements Using Hmok- Simrank A.Vijay Department of computer science and Engineering Selvam College of Technology, Namakkal, Tamilnadu,india. k.jayarajan M.E (Ph.D) Assistant Professor,
More informationFormulating XML-IR Queries
Alan Woodley Faculty of Information Technology, Queensland University of Technology PO Box 2434. Brisbane Q 4001, Australia ap.woodley@student.qut.edu.au Abstract: XML information retrieval systems differ
More informationSecured implementation of personalized web search with UPS
Secured implementation of personalized web search with UPS Mrs.P.Nagajothi, Assistant Professor, Department of CSE, K.S.Rangasamy College of Technology, Tiruchengodu, Namakkal (DT). S.Vinothini, M.E-II
More informationDIONE. (DAML Integrated Ontology Evolution Tools) Ontology Versioning in Semantic Web Applications. ISX Corporation Lehigh University
(DAML Integrated Evolution Tools) Versioning in Semantic Web Applications ISX Corporation Lehigh University Dr. Brian Kettler, ISX bkettler@isx.com Prof. Jeff Heflin & Zhengxiang Pan, Lehigh heflin@cse.lehigh.edu
More informationLearning Ranking Functions with SVMs
Learning Ranking Functions with SVMs CS4780/5780 Machine Learning Fall 2014 Thorsten Joachims Cornell University T. Joachims, Optimizing Search Engines Using Clickthrough Data, Proceedings of the ACM Conference
More informationAdaptive Search at Essex
Adaptive Search at Essex Udo Kruschwitz School of Computer Science and Electronic Engineering University of Essex udo@essex.ac.uk 7th October 2011 Adaptive Search - LAC Day - 7 October 2011 1 (Source:
More informationWeb Search. Lecture Objectives. Text Technologies for Data Science INFR Learn about: 11/14/2017. Instructor: Walid Magdy
Text Technologies for Data Science INFR11145 Web Search Instructor: Walid Magdy 14-Nov-2017 Lecture Objectives Learn about: Working with Massive data Link analysis (PageRank) Anchor text 2 1 The Web Document
More informationExtracting Conceptual Relationships from Specialized Documents
Extracting Conceptual Relationships from Specialized Documents Bowen Hui Dept of Computer Science University of Toronto Eric Yu Fac of Information Studies University of Toronto Motivation much of human
More informationHTML OBJECTIVES WHAT IS HTML? BY FAITH BRENNER AN INTRODUCTION
HTML AN INTRODUCTION BY FAITH BRENNER 1 OBJECTIVES BY THE END OF THIS LESSON YOU WILL: UNDERSTAND HTML BASICS AND WHAT YOU CAN DO WITH IT BE ABLE TO USE BASIC HTML TAGS BE ABLE TO USE SOME BASIC FORMATTING
More informationInformation Discovery, Extraction and Integration for the Hidden Web
Information Discovery, Extraction and Integration for the Hidden Web Jiying Wang Department of Computer Science University of Science and Technology Clear Water Bay, Kowloon Hong Kong cswangjy@cs.ust.hk
More informationA Survey on Keyword Diversification Over XML Data
ISSN (Online) : 2319-8753 ISSN (Print) : 2347-6710 International Journal of Innovative Research in Science, Engineering and Technology An ISO 3297: 2007 Certified Organization Volume 6, Special Issue 5,
More information