Information Retrieval
|
|
- Randolph Cross
- 6 years ago
- Views:
Transcription
1 Information Retrieval Test Collections Gintarė Grigonytė Department of Linguistics and Philology Uppsala University Slides based on previous IR course given by K.F. Heppin and Introduction to Information Retrieval slides Gintarė Grigonytė 1/65
2 Overview User queries The Cranfield paradigm Test collections Indri query language Gintarė Grigonytė 2/65
3 Information need An information need is the underlying cause of the query that a person submits to a search engine Can be categorized using variety of dimensions: type of information needed domain > subject question that needs to be answered level of expertise: professional layperson type of task that led to the requirement for information: are you writing a paper, are you preparing for a meeting or for an exam Gintarė Grigonytė 3/65
4 Topic - formal information need <top> <num> C208 </num> <EN-title> "Sophie s World" </EN-title> <EN-desc> Find documents about the editorial success of the book "Sophie s World" by Jostein Gaarder. </EN-desc> <EN-narr> Relevant documents should describe the topic of "Sophie s World", and should mention its sales success. </EN-narr> </top> Gintarė Grigonytė 4/65
5 Queries A query is the formulation of a user information need put to the system. Keyword based queries are popular, since they are intuitive, easy to express, and allow for fast ranking. However, a query can also be a more complex combination of operations using different kinds of operators. Gintarė Grigonytė 5/65
6 Information need vs. queries Gintarė Grigonytė 6/65
7 User queries The term query is used both for the input the user gives to the system and for the modified version of this which the system uses for the matching with the index terms. In Web search, the user writes a key word based query in the field for search keys. Which the system translates into a more complex form for the matching process. Gintarė Grigonytė 7/65
8 Intermediary queries Query languages in the past were designed for professional searchers intermediaries. The user would input a natural language query to the intermediary, who translated it to a query the system could interpret. Gintarė Grigonytė 8/65
9 Keyword queries The result of key word queries for most retrieval models is the set of documents containing at least one of the words of the query The resulting documents are ranked according to the degree of similarity with respect to the query How the ranking is done depends on the retrieval model. Gintarė Grigonytė 9/65
10 Query in Indri query language Gintarė Grigonytė 10/65
11 Query modification Gintarė Grigonytė 11/65
12 Query expansion Gintarė Grigonytė 12/65
13 Facets Gintarė Grigonytė 13/65
14 Visualizing facets with Boolean syntax Gintarė Grigonytė 14/65
15 Luhn s Significant terms H. P. Luhn, The automatic creation of literature abstracts, IBM Journal of Research and Development, v.2 n.2, p , April 1958 [doi> /rd ] Gintarė Grigonytė 15/65
16 How do we know which searches are good? Gintarė Grigonytė 16/65
17 Use test collection Gintarė Grigonytė 17/65
18 Feedback The user indicates, consciously or unconsciously, which documents are relevant to their query or to indicate which terms extracted from those documents are relevant. The user or the system then constructs a new query from this information by: Boosting weights of terms from relevant documents Adding terms from relevant documents to the query Idea: you may not know what you re looking for, but you ll know when you see it Gintarė Grigonytė 18/65
19 Test collection environment To test and compare search strategies you need a laboratory environment that doesn t change a test collection determine how well IR systems perform compare the performance of the IR system with that of other systems compare search alghoritms compare search strategies Gintarė Grigonytė 19/65
20 The Cranfield Paradigm Evaluation of IR systems is the result of early experimentation initiated by Cyril Cleverdon Cleverdon started a series of projects, called the Cranfield projects, in 1957 that lasted for about 10 years in which he and his colleagues set the stage for information retrieval research. In the Cranfield project, retrieval experiments were conducted on test databases in a controlled, laboratorylike setting. The Cranfield projects provided a foundation for the evaluation of IR systems Gintarė Grigonytė 20/65
21 The laboratory environment Experiments at the Cranfield College of Aeronautics The objective was to study what kinds of indexing languages were most effective. At this time documents were indexed manually with a few keywords from controlled vocabularies/thesauri documents of research in metallurgy Small enough to have every document assessed for relevance to every topic A database with, for the time, a large set of documents A set of information needs expressed in plain text A relevance judgement for every document in relation to every information need Gintarė Grigonytė 21/65
22 Recall, precision Gintarė Grigonytė 22/65
23 Implications for evaluation Model a real user application, with realistic information needs. Collect enough documents and create enough topics to allow significant testing on results. Make relevance judgments before the experiments, which prevents human bias and enables re-usability. Run strategy A and B Evaluate A and B using appropriate metrics Compare A with B statistically State whether A works better than B, A and B are equivalent, or B works better than A Gintarė Grigonytė 23/65
24 Test collection To test and compare strategies a test collection is needed. A test collection is a laboratory testbed representing the real world. A test collection consists of: A static set of documents A set of information needs/topics A set of known relevant documents for each of the information needs Gintarė Grigonytė 24/65
25 Test collection based IR evaluation System function separate relevant from non-relevant documents rank relevant above non-relevant documents rank highly relevant above less relevant documents Pupose of evaluation decide how well a system performs the function above determine the best system/queries/algorithms Gintarė Grigonytė 25/65
26 Why not any other way? Should users be involved? Why? Why not to use web search engine? Gintarė Grigonytė 26/65
27 Gintarė Grigonytė 27/65
28 Gintarė Grigonytė 28/65
29 Gintarė Grigonytė 29/65
30 Gintarė Grigonytė 30/65
31 Gintarė Grigonytė 31/65
32 Gintarė Grigonytė 32/65
33 Gintarė Grigonytė 33/65
34 Gintarė Grigonytė 34/65
35 Gintarė Grigonytė 35/65
36 Gintarė Grigonytė 36/65
37 Gintarė Grigonytė 37/65
38 Gintarė Grigonytė 38/65
39 Gintarė Grigonytė 39/65
40 Gintarė Grigonytė 40/65
41 Gintarė Grigonytė 41/65
42 Precision/Recall Gintarė Grigonytė 42/65
43 Indri query language terms field restrictions numeric combining beliefs field/passage retrieval filters Quick referece: Reference: Gintarė Grigonytė 43/65
44 Gintarė Grigonytė 44/65
45 Gintarė Grigonytė 45/65
46 Gintarė Grigonytė 46/65
47 Gintarė Grigonytė 47/65
48 Gintarė Grigonytė 48/65
49 Gintarė Grigonytė 49/65
50 Gintarė Grigonytė 50/65
51 Gintarė Grigonytė 51/65
52 Gintarė Grigonytė 52/65
53 Gintarė Grigonytė 53/65
54 Gintarė Grigonytė 54/65
55 Gintarė Grigonytė 55/65
56 Gintarė Grigonytė 56/65
57 Gintarė Grigonytė 57/65
58 Gintarė Grigonytė 58/65
59 Gintarė Grigonytė 59/65
60 Gintarė Grigonytė 60/65
61 Gintarė Grigonytė 61/65
62 Gintarė Grigonytė 62/65
63 Gintarė Grigonytė 63/65
64 Gintarė Grigonytė 64/65
65 Next Lab 2 building index from document collection querying with Indri query language evaluation Gintarė Grigonytė 65/65
Information Retrieval
Multimedia Computing: Algorithms, Systems, and Applications: Information Retrieval and Search Engine By Dr. Yu Cao Department of Computer Science The University of Massachusetts Lowell Lowell, MA 01854,
More information21. Search Models and UIs for IR
21. Search Models and UIs for IR INFO 202-10 November 2008 Bob Glushko Plan for Today's Lecture The "Classical" Model of Search and the "Classical" UI for IR Web-based Search Best practices for UIs in
More informationSearch Engines Chapter 8 Evaluating Search Engines Felix Naumann
Search Engines Chapter 8 Evaluating Search Engines 9.7.2009 Felix Naumann Evaluation 2 Evaluation is key to building effective and efficient search engines. Drives advancement of search engines When intuition
More informationSearch Evaluation. Tao Yang CS293S Slides partially based on text book [CMS] [MRS]
Search Evaluation Tao Yang CS293S Slides partially based on text book [CMS] [MRS] Table of Content Search Engine Evaluation Metrics for relevancy Precision/recall F-measure MAP NDCG Difficulties in Evaluating
More informationAgile Tester Foundation E-learning Course Outline
Foundation E-learning Course Outline General Description This course provides testers and test managers with an understanding of the fundamentals of testing on agile projects. Attendees will learn how
More informationInformation Retrieval
Information Retrieval Dictionaries & Tolerant Retrieval Gintarė Grigonytė gintare@ling.su.se Department of Linguistics and Philology Uppsala University Slides based on previous IR course given by Jörg
More informationRetrieval Evaluation. Hongning Wang
Retrieval Evaluation Hongning Wang CS@UVa What we have learned so far Indexed corpus Crawler Ranking procedure Research attention Doc Analyzer Doc Rep (Index) Query Rep Feedback (Query) Evaluation User
More informationBuilding Test Collections. Donna Harman National Institute of Standards and Technology
Building Test Collections Donna Harman National Institute of Standards and Technology Cranfield 2 (1962-1966) Goal: learn what makes a good indexing descriptor (4 different types tested at 3 levels of
More informationInformation Retrieval
Introduction Information Retrieval Information retrieval is a field concerned with the structure, analysis, organization, storage, searching and retrieval of information Gerard Salton, 1968 J. Pei: Information
More informationInformation Retrieval
Information Retrieval Lecture 7 - Evaluation in Information Retrieval Seminar für Sprachwissenschaft International Studies in Computational Linguistics Wintersemester 2007 1/ 29 Introduction Framework
More informationInformation Retrieval. Lecture 7 - Evaluation in Information Retrieval. Introduction. Overview. Standard test collection. Wintersemester 2007
Information Retrieval Lecture 7 - Evaluation in Information Retrieval Seminar für Sprachwissenschaft International Studies in Computational Linguistics Wintersemester 2007 1 / 29 Introduction Framework
More informationCS6200 Information Retrieval. Jesse Anderton College of Computer and Information Science Northeastern University
CS6200 Information Retrieval Jesse Anderton College of Computer and Information Science Northeastern University Major Contributors Gerard Salton! Vector Space Model Indexing Relevance Feedback SMART Karen
More informationAdvanced Search Techniques for Large Scale Data Analytics Pavel Zezula and Jan Sedmidubsky Masaryk University
Advanced Search Techniques for Large Scale Data Analytics Pavel Zezula and Jan Sedmidubsky Masaryk University http://disa.fi.muni.cz The Cranfield Paradigm Retrieval Performance Evaluation Evaluation Using
More informationInformativeness for Adhoc IR Evaluation:
Informativeness for Adhoc IR Evaluation: A measure that prevents assessing individual documents Romain Deveaud 1, Véronique Moriceau 2, Josiane Mothe 3, and Eric SanJuan 1 1 LIA, Univ. Avignon, France,
More informationThe IR Black Box. Anomalous State of Knowledge. The Information Retrieval Cycle. Different Types of Interactions. Upcoming Topics.
The IR Black Bo LBSC 796/INFM 718R: Week 8 Relevance Feedback Query Search Ranked List Jimmy Lin College of Information Studies University of Maryland Monday, March 27, 2006 Anomalous State of Knowledge
More informationINFS 427: AUTOMATED INFORMATION RETRIEVAL (1 st Semester, 2018/2019)
INFS 427: AUTOMATED INFORMATION RETRIEVAL (1 st Semester, 2018/2019) Session 01 Introduction to Information Retrieval Lecturer: Mrs. Florence O. Entsua-Mensah, DIS Contact Information: fentsua-mensah@ug.edu.gh
More informationA new interaction evaluation framework for digital libraries
A new interaction evaluation framework for digital libraries G. Tsakonas, S. Kapidakis, C. Papatheodorou {gtsak, sarantos, papatheodor} @ionio.gr DELOS Workshop on the Evaluation of Digital Libraries Department
More informationDocument Clustering for Mediated Information Access The WebCluster Project
Document Clustering for Mediated Information Access The WebCluster Project School of Communication, Information and Library Sciences Rutgers University The original WebCluster project was conducted at
More informationSearch Engine Architecture. Hongning Wang
Search Engine Architecture Hongning Wang CS@UVa CS@UVa CS4501: Information Retrieval 2 Document Analyzer Classical search engine architecture The Anatomy of a Large-Scale Hypertextual Web Search Engine
More informationCLEF-IP 2009: Exploring Standard IR Techniques on Patent Retrieval
DCU @ CLEF-IP 2009: Exploring Standard IR Techniques on Patent Retrieval Walid Magdy, Johannes Leveling, Gareth J.F. Jones Centre for Next Generation Localization School of Computing Dublin City University,
More informationTREC OpenSearch Planning Session
TREC OpenSearch Planning Session Anne Schuth, Krisztian Balog TREC OpenSearch OpenSearch is a new evaluation paradigm for IR. The experimentation platform is an existing search engine. Researchers have
More informationHOW BUILDING OUR OWN E-COMMERCE SEARCH CHANGED OUR APPROACH TO SEARCH QUALITY. 2018// Berlin
HOW BUILDING OUR OWN E-COMMERCE SEARCH CHANGED OUR APPROACH TO SEARCH QUALITY 13.06.18 1 Our Search Team @otto.de Search Team in 2017 Christine Bellstedt Business Designer Search www.otto.de Search Quality
More informationDeveloping a Test Collection for the Evaluation of Integrated Search Lykke, Marianne; Larsen, Birger; Lund, Haakon; Ingwersen, Peter
university of copenhagen Københavns Universitet Developing a Test Collection for the Evaluation of Integrated Search Lykke, Marianne; Larsen, Birger; Lund, Haakon; Ingwersen, Peter Published in: Advances
More informationASLIB CRANFIELD RESEARCH PROJECT REPORT ON THE TESTING AND ANALYSIS OF AN INVESTIGATION INTO THE COMPARATIVE EFFICIENCY OF INDEXING SYSTEMS
ASLIB CRANFIELD RESEARCH PROJECT REPORT ON THE TESTING AND ANALYSIS OF AN INVESTIGATION INTO THE COMPARATIVE EFFICIENCY OF INDEXING SYSTEMS by Cyril W. Gleverdon An investigation supported by a grant from
More informationCS506/606 - Topics in Information Retrieval
CS506/606 - Topics in Information Retrieval Instructors: Class time: Steven Bedrick, Brian Roark, Emily Prud hommeaux Tu/Th 11:00 a.m. - 12:30 p.m. September 25 - December 6, 2012 Class location: WCC 403
More informationHow Primo Works VE. 1.1 Welcome. Notes: Published by Articulate Storyline Welcome to how Primo works.
How Primo Works VE 1.1 Welcome Welcome to how Primo works. 1.2 Objectives By the end of this session, you will know - What discovery, delivery, and optimization are - How the library s collections and
More informationOrganizing Information. Organizing information is at the heart of information science and is important in many other
Dagobert Soergel College of Library and Information Services University of Maryland College Park, MD 20742 Organizing Information Organizing information is at the heart of information science and is important
More informationEnhanced retrieval using semantic technologies:
Enhanced retrieval using semantic technologies: Ontology based retrieval as a new search paradigm? - Considerations based on new projects at the Bavarian State Library Dr. Berthold Gillitzer 28. Mai 2008
More informationCS47300: Web Information Search and Management
CS47300: Web Information Search and Management Prof. Chris Clifton 27 August 2018 Material adapted from course created by Dr. Luo Si, now leading Alibaba research group 1 AD-hoc IR: Basic Process Information
More informationIntroduction to Information Retrieval. Lecture Outline
Introduction to Information Retrieval Lecture 1 CS 410/510 Information Retrieval on the Internet Lecture Outline IR systems Overview IR systems vs. DBMS Types, facets of interest User tasks Document representations
More informationWEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS
1 WEB SEARCH, FILTERING, AND TEXT MINING: TECHNOLOGY FOR A NEW ERA OF INFORMATION ACCESS BRUCE CROFT NSF Center for Intelligent Information Retrieval, Computer Science Department, University of Massachusetts,
More informationModern Information Retrieval
Modern Information Retrieval Chapter 3 Retrieval Evaluation Retrieval Performance Evaluation Reference Collections CFC: The Cystic Fibrosis Collection Retrieval Evaluation, Modern Information Retrieval,
More informationEvaluation. David Kauchak cs160 Fall 2009 adapted from:
Evaluation David Kauchak cs160 Fall 2009 adapted from: http://www.stanford.edu/class/cs276/handouts/lecture8-evaluation.ppt Administrative How are things going? Slides Points Zipf s law IR Evaluation For
More informationModern Information Retrieval
Modern Information Retrieval Appendix A Open Source Search Engines with Christian Middleton Introduction Search Engines Comparison Methodology Experimental Results Open Source Search Engines, Modern Information
More informationOverview. Lecture 6: Evaluation. Summary: Ranked retrieval. Overview. Information Retrieval Computer Science Tripos Part II.
Overview Lecture 6: Evaluation Information Retrieval Computer Science Tripos Part II Recap/Catchup 2 Introduction Ronan Cummins 3 Unranked evaluation Natural Language and Information Processing (NLIP)
More informationInformation Retrieval and Knowledge Organisation
Information Retrieval and Knowledge Organisation Knut Hinkelmann Content Information Retrieval Indexing (string search and computer-linguistic aproach) Classical Information Retrieval: Boolean, vector
More informationChapter 6: Information Retrieval and Web Search. An introduction
Chapter 6: Information Retrieval and Web Search An introduction Introduction n Text mining refers to data mining using text documents as data. n Most text mining tasks use Information Retrieval (IR) methods
More informationDatabases and Information Retrieval Integration TIETS42. Kostas Stefanidis Autumn 2016
+ Databases and Information Retrieval Integration TIETS42 Autumn 2016 Kostas Stefanidis kostas.stefanidis@uta.fi http://www.uta.fi/sis/tie/dbir/index.html http://people.uta.fi/~kostas.stefanidis/dbir16/dbir16-main.html
More informationCSCI 5417 Information Retrieval Systems. Jim Martin!
CSCI 5417 Information Retrieval Systems Jim Martin! Lecture 7 9/13/2011 Today Review Efficient scoring schemes Approximate scoring Evaluating IR systems 1 Normal Cosine Scoring Speedups... Compute the
More informationAC : NEW KNOVEL INTERFACE
AC 2010-1938: NEW KNOVEL INTERFACE Sasha Gurke, Knovel Corporation Sasha Gurke is Sr. Vice President of Knovel Corp. He was one of the co-founders of Knovel in 1999, having joined a predecessor company
More informationSEARCH TECHNIQUES: BASIC AND ADVANCED
17 SEARCH TECHNIQUES: BASIC AND ADVANCED 17.1 INTRODUCTION Searching is the activity of looking thoroughly in order to find something. In library and information science, searching refers to looking through
More informationExpert Reviews (1) Lecture 5-2: Usability Methods II. Usability Inspection Methods. Expert Reviews (2)
: Usability Methods II Heuristic Analysis Heuristics versus Testing Debate Some Common Heuristics Heuristic Evaluation Expert Reviews (1) Nielsen & Molich (1990) CHI Proceedings Based upon empirical article
More informationWrapper: An Application for Evaluating Exploratory Searching Outside of the Lab
Wrapper: An Application for Evaluating Exploratory Searching Outside of the Lab Bernard J Jansen College of Information Sciences and Technology The Pennsylvania State University University Park PA 16802
More informationChapter 8. Evaluating Search Engine
Chapter 8 Evaluating Search Engine Evaluation Evaluation is key to building effective and efficient search engines Measurement usually carried out in controlled laboratory experiments Online testing can
More informationInformation Retrieval (Part 1)
Information Retrieval (Part 1) Fabio Aiolli http://www.math.unipd.it/~aiolli Dipartimento di Matematica Università di Padova Anno Accademico 2008/2009 1 Bibliographic References Copies of slides Selected
More informationTerminologies Services Strawman
Terminologies Services Strawman Background This document was drafted for discussion for a meeting at the Metropolitan Museum of Art on September 12, 2007. This document was not intended to represent a
More informationInformation Retrieval CS Lecture 06. Razvan C. Bunescu School of Electrical Engineering and Computer Science
Information Retrieval CS 6900 Lecture 06 Razvan C. Bunescu School of Electrical Engineering and Computer Science bunescu@ohio.edu Boolean Retrieval vs. Ranked Retrieval Many users (professionals) prefer
More informationExpert Test Manager: Operational Module Course Outline
Expert Test Manager: Operational Module Course Outline General Description A truly successful test organization not only has solid, relevant test objectives and a test strategy, but it also has the means
More informationInformation Retrieval. (M&S Ch 15)
Information Retrieval (M&S Ch 15) 1 Retrieval Models A retrieval model specifies the details of: Document representation Query representation Retrieval function Determines a notion of relevance. Notion
More informationCLARIT Compound Queries and Constraint-Controlled Feedback in TREC-5 Ad-Hoc Experiments
CLARIT Compound Queries and Constraint-Controlled Feedback in TREC-5 Ad-Hoc Experiments Natasa Milic-Frayling 1, Xiang Tong 2, Chengxiang Zhai 2, David A. Evans 1 1 CLARITECH Corporation 2 Laboratory for
More informationIntroduction to Information Retrieval
Introduction to Information Retrieval Mohsen Kamyar چهارمین کارگاه ساالنه آزمایشگاه فناوری و وب بهمن ماه 1391 Outline Outline in classic categorization Information vs. Data Retrieval IR Models Evaluation
More informationIntroduction to Information Retrieval. Hongning Wang
Introduction to Information Retrieval Hongning Wang CS@UVa What is information retrieval? 2 Why information retrieval Information overload It refers to the difficulty a person can have understanding an
More informationTilburg University. Authoritative re-ranking of search results Bogers, A.M.; van den Bosch, A. Published in: Advances in Information Retrieval
Tilburg University Authoritative re-ranking of search results Bogers, A.M.; van den Bosch, A. Published in: Advances in Information Retrieval Publication date: 2006 Link to publication Citation for published
More informationWeb document summarisation: a task-oriented evaluation
Web document summarisation: a task-oriented evaluation Ryen White whiter@dcs.gla.ac.uk Ian Ruthven igr@dcs.gla.ac.uk Joemon M. Jose jj@dcs.gla.ac.uk Abstract In this paper we present a query-biased summarisation
More informationA RECOMMENDER SYSTEM FOR SOCIAL BOOK SEARCH
A RECOMMENDER SYSTEM FOR SOCIAL BOOK SEARCH A thesis Submitted to the faculty of the graduate school of the University of Minnesota by Vamshi Krishna Thotempudi In partial fulfillment of the requirements
More informationCS54701: Information Retrieval
CS54701: Information Retrieval Basic Concepts 19 January 2016 Prof. Chris Clifton 1 Text Representation: Process of Indexing Remove Stopword, Stemming, Phrase Extraction etc Document Parser Extract useful
More informationRetrieval Evaluation
Retrieval Evaluation - Reference Collections Berlin Chen Department of Computer Science & Information Engineering National Taiwan Normal University References: 1. Modern Information Retrieval, Chapter
More informationNatural Language Processing
Natural Language Processing Information Retrieval Potsdam, 14 June 2012 Saeedeh Momtazi Information Systems Group based on the slides of the course book Outline 2 1 Introduction 2 Indexing Block Document
More informationApplying the KISS Principle for the CLEF- IP 2010 Prior Art Candidate Patent Search Task
Applying the KISS Principle for the CLEF- IP 2010 Prior Art Candidate Patent Search Task Walid Magdy, Gareth J.F. Jones Centre for Next Generation Localisation School of Computing Dublin City University,
More informationDELOS WP7: Evaluation
DELOS WP7: Evaluation Claus-Peter Klas Univ. of Duisburg-Essen, Germany (WP leader: Norbert Fuhr) WP Objectives Enable communication between evaluation experts and DL researchers/developers Continue existing
More informationMultilingual Image Search from a user s perspective
Multilingual Image Search from a user s perspective Julio Gonzalo, Paul Clough, Jussi Karlgren QUAERO-Image CLEF workshop, 16/09/08 Finding is a matter of two fast stupid smart slow great potential for
More informationInformation Retrieval and Organisation
Information Retrieval and Organisation Dell Zhang Birkbeck, University of London 2016/17 IR Chapter 00 Motivation What is Information Retrieval? The meaning of the term Information Retrieval (IR) can be
More informationTHIS LECTURE. How do we know if our results are any good? Results summaries: Evaluating a search engine. Making our good results usable to a user
EVALUATION Sec. 6.2 THIS LECTURE How do we know if our results are any good? Evaluating a search engine Benchmarks Precision and recall Results summaries: Making our good results usable to a user 2 3 EVALUATING
More informationReducing Redundancy with Anchor Text and Spam Priors
Reducing Redundancy with Anchor Text and Spam Priors Marijn Koolen 1 Jaap Kamps 1,2 1 Archives and Information Studies, Faculty of Humanities, University of Amsterdam 2 ISLA, Informatics Institute, University
More informationInformation Retrieval and Web Search
Information Retrieval and Web Search Introduction to IR models and methods Rada Mihalcea (Some of the slides in this slide set come from IR courses taught at UT Austin and Stanford) Information Retrieval
More informationInformation Retrieval
Natural Language Processing SoSe 2014 Information Retrieval Dr. Mariana Neves June 18th, 2014 (based on the slides of Dr. Saeedeh Momtazi) Outline Introduction Indexing Block 2 Document Crawling Text Processing
More informationDiscounted Cumulated Gain based Evaluation of Multiple Query IR Sessions
Preprint from: Järvelin, K. & Price, S. & Delcambre, L. & Nielsen, M. (2008). Discounted Cumulated Gain based Evaluation of Multiple Query IR Sessions. In: Ruthven, I. & al. (Eds.), Proc. of the 30th European
More informationMultimedia Information Extraction and Retrieval Term Frequency Inverse Document Frequency
Multimedia Information Extraction and Retrieval Term Frequency Inverse Document Frequency Ralf Moeller Hamburg Univ. of Technology Acknowledgement Slides taken from presentation material for the following
More informationAccessibility/Usability Through Advanced Interactive Devices
Accessibility/Usability Through Advanced Interactive Devices Moderator Leslie Miller, Iowa State University, USA Panelists Silvana Roncagliolo, Pontificia Universidad Catolica de Valparaiso, Chile Alma
More informationInformation Retrieval. Information Retrieval and Web Search
Information Retrieval and Web Search Introduction to IR models and methods Information Retrieval The indexing and retrieval of textual documents. Searching for pages on the World Wide Web is the most recent
More informationExam IST 441 Spring 2011
Exam IST 441 Spring 2011 Last name: Student ID: First name: I acknowledge and accept the University Policies and the Course Policies on Academic Integrity This 100 point exam determines 30% of your grade.
More informationLecture 5: Information Retrieval using the Vector Space Model
Lecture 5: Information Retrieval using the Vector Space Model Trevor Cohn (tcohn@unimelb.edu.au) Slide credits: William Webber COMP90042, 2015, Semester 1 What we ll learn today How to take a user query
More informationPrior Art Retrieval Using Various Patent Document Fields Contents
Prior Art Retrieval Using Various Patent Document Fields Contents Metti Zakaria Wanagiri and Mirna Adriani Fakultas Ilmu Komputer, Universitas Indonesia Depok 16424, Indonesia metti.zakaria@ui.edu, mirna@cs.ui.ac.id
More informationSearch Engines Considered Harmful In Search of an Unbiased Web Ranking
Search Engines Considered Harmful In Search of an Unbiased Web Ranking Junghoo John Cho cho@cs.ucla.edu UCLA Search Engines Considered Harmful Junghoo John Cho 1/38 Motivation If you are not indexed by
More informationProcessing Structural Constraints
SYNONYMS None Processing Structural Constraints Andrew Trotman Department of Computer Science University of Otago Dunedin New Zealand DEFINITION When searching unstructured plain-text the user is limited
More informationINFORMATION RETRIEVAL SYSTEM: CONCEPT AND SCOPE
15 : CONCEPT AND SCOPE 15.1 INTRODUCTION Information is communicated or received knowledge concerning a particular fact or circumstance. Retrieval refers to searching through stored information to find
More informationWebBiblio Subject Gateway System:
WebBiblio Subject Gateway System: An Open Source Solution for Internet Resources Management 1. Introduction Jack Eapen C. 1 With the advent of the Internet, the rate of information explosion increased
More informationBetter Contextual Suggestions in ClueWeb12 Using Domain Knowledge Inferred from The Open Web
Better Contextual Suggestions in ClueWeb12 Using Domain Knowledge Inferred from The Open Web Thaer Samar 1, Alejandro Bellogín 2, and Arjen P. de Vries 1 1 Centrum Wiskunde & Informatica, {samar,arjen}@cwi.nl
More informationLaHC at CLEF 2015 SBS Lab
LaHC at CLEF 2015 SBS Lab Nawal Ould-Amer, Mathias Géry To cite this version: Nawal Ould-Amer, Mathias Géry. LaHC at CLEF 2015 SBS Lab. Conference and Labs of the Evaluation Forum, Sep 2015, Toulouse,
More informationCSCI 599: Applications of Natural Language Processing Information Retrieval Evaluation"
CSCI 599: Applications of Natural Language Processing Information Retrieval Evaluation" All slides Addison Wesley, Donald Metzler, and Anton Leuski, 2008, 2012! Evaluation" Evaluation is key to building
More informationNUSIS at TREC 2011 Microblog Track: Refining Query Results with Hashtags
NUSIS at TREC 2011 Microblog Track: Refining Query Results with Hashtags Hadi Amiri 1,, Yang Bao 2,, Anqi Cui 3,,*, Anindya Datta 2,, Fang Fang 2,, Xiaoying Xu 2, 1 Department of Computer Science, School
More information11/2/2012. Database Queries. Peeking into Computer Science. Jalal Kawash Mandatory: Chapter 4 Sections 4.6 & 4.7. Reading Assignment
Database Queries Mandatory: Chapter 4 Sections 4.6 & 4.7 Reading Assignment 2 1 Can be found on: http://pages.cpsc.ucalgary.ca/~kawash/peeking.html Includes all examples in the book Numbered by exercise
More informationAutomatic Detection of Section Membership for SAS Conference Paper Abstract Submissions: A Case Study
1746-2014 Automatic Detection of Section Membership for SAS Conference Paper Abstract Submissions: A Case Study Dr. Goutam Chakraborty, Professor, Department of Marketing, Spears School of Business, Oklahoma
More informationChapter 1. Web-Mining and Information Retrieval. 1.1 Introduction
Chapter 1 Web-Mining and Information Retrieval 1.1 Introduction The World Wide Web or simply the web may be seen as a huge collection of documents freely produced and published by a very large number of
More informationOkapi in TIPS: The Changing Context of Information Retrieval
Okapi in TIPS: The Changing Context of Information Retrieval Murat Karamuftuoglu, Fabio Venuti Centre for Interactive Systems Research Department of Information Science City University {hmk, fabio}@soi.city.ac.uk
More informationInformation Retrieval
Natural Language Processing SoSe 2015 Information Retrieval Dr. Mariana Neves June 22nd, 2015 (based on the slides of Dr. Saeedeh Momtazi) Outline Introduction Indexing Block 2 Document Crawling Text Processing
More informationInformation Retrieval
Introduction to Information Retrieval CS3245 Information Retrieval Lecture 9: IR Evaluation 9 Ch. 7 Last Time The VSM Reloaded optimized for your pleasure! Improvements to the computation and selection
More informationImpact of Search Engines on Page Popularity
Impact of Search Engines on Page Popularity Junghoo John Cho (cho@cs.ucla.edu) Sourashis Roy (roys@cs.ucla.edu) University of California, Los Angeles Impact of Search Engines on Page Popularity J. Cho,
More informationKnowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A.
Knowledge Retrieval Franz J. Kurfess Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A. 1 Acknowledgements This lecture series has been sponsored by the European
More informationParsing II Top-down parsing. Comp 412
COMP 412 FALL 2018 Parsing II Top-down parsing Comp 412 source code IR Front End Optimizer Back End IR target code Copyright 2018, Keith D. Cooper & Linda Torczon, all rights reserved. Students enrolled
More informationPilot: Collaborative Glossary Platform
Pilot: Collaborative Glossary Platform CI Context Viewpoint 1. As-Is Workflow Model Workflow Name: Term / Definition Collection Domain Context: Typically each group / community has its own list of terms
More information+ Page 4 + Abstract. 1.0 Introduction
+ Page 4 + ----------------------------------------------------------------- Tonta, Yasar. "Analysis of Search Failures in Document Retrieval Systems: A Review." The Public-Access Computer Systems Review
More informationComparative Analysis of Clicks and Judgments for IR Evaluation
Comparative Analysis of Clicks and Judgments for IR Evaluation Jaap Kamps 1,3 Marijn Koolen 1 Andrew Trotman 2,3 1 University of Amsterdam, The Netherlands 2 University of Otago, New Zealand 3 INitiative
More informationHow to Use Google Scholar An Educator s Guide
http://scholar.google.com/ How to Use Google Scholar An Educator s Guide What is Google Scholar? Google Scholar provides a simple way to broadly search for scholarly literature. Google Scholar helps you
More informationBetter Contextual Suggestions in ClueWeb12 Using Domain Knowledge Inferred from The Open Web
Better Contextual Suggestions in ClueWeb12 Using Domain Knowledge Inferred from The Open Web Thaer Samar 1, Alejandro Bellogín 2, and Arjen P. de Vries 1 1 Centrum Wiskunde & Informatica, {samar,arjen}@cwi.nl
More informationQUERY EXPANSION USING WORDNET WITH A LOGICAL MODEL OF INFORMATION RETRIEVAL
QUERY EXPANSION USING WORDNET WITH A LOGICAL MODEL OF INFORMATION RETRIEVAL David Parapar, Álvaro Barreiro AILab, Department of Computer Science, University of A Coruña, Spain dparapar@udc.es, barreiro@udc.es
More informationAudit and Assurance Overview
Chartered Professional Accountants of Canada, CPA Canada, CPA are trademarks and/or certification marks of the Chartered Professional Accountants of Canada. 2018, Chartered Professional Accountants of
More informationCSE 115. Introduction to Computer Science I
CSE 115 Introduction to Computer Science I Announcements Lab activites/lab exams submit regularly to autograder.cse.buffalo.edu Announcements Lab activites/lab exams submit regularly to autograder.cse.buffalo.edu
More informationInformation Search in Web Archives
Information Search in Web Archives Miguel Costa Advisor: Prof. Mário J. Silva Co-Advisor: Prof. Francisco Couto Department of Informatics, Faculty of Sciences, University of Lisbon PhD thesis defense,
More informationUnit 2: Collaborative Working
Candidate MW Unit 2: Collaborative Working Assessment AO1 The focus of this unit is for candidates to work collaboratively with a small group of their peers to produce a final product (possibly a magazine
More information