Adding user context to IR test collections

Similar documents
Developing a Test Collection for the Evaluation of Integrated Search Lykke, Marianne; Larsen, Birger; Lund, Haakon; Ingwersen, Peter

Inter and Intra-Document Contexts Applied in Polyrepresentation

A Preliminary Investigation into the Search Behaviour of Users in a Collection of Digitized Broadcast Audio

Building A Multilingual Test Collection for Metadata Records

Aggregation for searching complex information spaces. Mounia Lalmas

Retrieval Evaluation. Hongning Wang

Knowledge Retrieval. Franz J. Kurfess. Computer Science Department California Polytechnic State University San Luis Obispo, CA, U.S.A.

Searching for Expertise

A User Study on Features Supporting Subjective Relevance for Information Retrieval Interfaces

Digital Communication and Aesthetics,

Building Institutional Repositories: Emerging Challenges

RSLIS at INEX 2012: Social Book Search Track

Tilburg University. Authoritative re-ranking of search results Bogers, A.M.; van den Bosch, A. Published in: Advances in Information Retrieval

TREC 2017 Dynamic Domain Track Overview

Discounted Cumulated Gain based Evaluation of Multiple Query IR Sessions

Document Clustering for Mediated Information Access The WebCluster Project

DL User Interfaces. Giuseppe Santucci Dipartimento di Informatica e Sistemistica Università di Roma La Sapienza

Databases and Information Retrieval Integration TIETS42. Kostas Stefanidis Autumn 2016

Retrieval Evaluation

Master Project. Various Aspects of Recommender Systems. Prof. Dr. Georg Lausen Dr. Michael Färber Anas Alzoghbi Victor Anthony Arrascue Ayala

Using Statistical Properties of Text to Create. Metadata. Computer Science and Electrical Engineering Department

Institutional Repository using DSpace. Yatrik Patel Scientist D (CS)

SUMMON WEB-SCALE DISCOVERY. ADA University Baku 02/04/2014

MASTER OF DISASTER MANAGEMENT CURRICULUM

KNOW At The Social Book Search Lab 2016 Suggestion Track

TEXT CHAPTER 5. W. Bruce Croft BACKGROUND

Relevance in XML Retrieval: The User Perspective

Automated Cognitive Walkthrough for the Web (AutoCWW)

Metadata for Digital Collections: A How-to-Do-It Manual

Subjective Relevance: Implications on Interface Design for Information Retrieval Systems

Digital Preservation at NARA

Particular experience in design and implementation of a Current Research Information System in Russia: national specificity

CSCI 599: Applications of Natural Language Processing Information Retrieval Evaluation"

SciVerse Scopus. 1. Scopus introduction and content coverage. 2. Scopus in comparison with Web of Science. 3. Basic functionalities of Scopus

Patent Terminlogy Analysis: Passage Retrieval Experiments for the Intellecutal Property Track at CLEF

Curriculum for the main subject at Master s level in. IT and Cognition, The 2015 curriculum. Adjusted 2017 and 2018

Information Retrieval

An Investigation of Basic Retrieval Models for the Dynamic Domain Task

Self-tuning ongoing terminology extraction retrained on terminology validation decisions

Formulating XML-IR Queries

Multimodal Information Spaces for Content-based Image Retrieval

Review of. Amanda Spink. and her work in. Web Searching and Retrieval,

Information Search in Web Archives

The least common denominator does not satisfy user s needs: metadata schemes for digital audio archives

Self Introduction. Presentation Outline. College of Information 3/31/2016. Multilingual Information Access to Digital Collections

MLIS eportfolio Guidelines

Film and Media Studies,

CADIAL Search Engine at INEX

Multilingual Image Search from a user s perspective

INFORMATION RETRIEVAL SYSTEM: CONCEPT AND SCOPE

CS506/606 - Topics in Information Retrieval

Student Usability Project Recommendations Define Information Architecture for Library Technology

Headings: Academic Libraries. Database Management. Database Searching. Electronic Information Resource Searching Evaluation. Web Portals.

Search Engines Chapter 8 Evaluating Search Engines Felix Naumann

PROGRAMME SYLLABUS Information Architecture and Innovation (Two Years), 120

Film and Media Studies,

Organizing Economic Information

Extending the Facets concept by applying NLP tools to catalog records of scientific literature

CURRICULUM MASTER OF DISASTER MANAGEMENT

A model of information searching behaviour to facilitate end-user support in KOS-enhanced systems

Human-Generated learning object metadata

A RECOMMENDER SYSTEM FOR SOCIAL BOOK SEARCH

Open Research Online The Open University s repository of research publications and other research outputs

A Framework for Source Code metrics

Chapter 8. Evaluating Search Engine

Adaptable and Adaptive Web Information Systems. Lecture 1: Introduction

The Necessity of a New Culture of Electronic Publishing C A S L I N

Effective metadata for social book search from a user perspective Huurdeman, H.C.; Kamps, J.; Koolen, M.H.A.

Only the original curriculum in Danish language has legal validity in matters of discrepancy

Assignment 1. Assignment 2. Relevance. Performance Evaluation. Retrieval System Evaluation. Evaluate an IR system

ANNUAL REPORT Visit us at project.eu Supported by. Mission

Archives in a Networked Information Society: The Problem of Sustainability in the Digital Information Environment

Bulgarian Folk Songs in a Digital Library

How user-friendly are user interfaces of open access digital repositories?

Supporting a Locale of One: Global Content Delivery for the Individual

School of Engineering & Built Environment

INFORMATION RETRIEVAL SYSTEM USING FUZZY SET THEORY - THE BASIC CONCEPT

University of Amsterdam at INEX 2010: Ad hoc and Book Tracks

Survey of Existing Services in the Mathematical Digital Libraries and Repositories in the EuDML Project

Language Model Document Priors based on Citation and Co-citation Analysis

Robin Dale RLG

Data Replication: Automated move and copy of data. PRACE Advanced Training Course on Data Staging and Data Movement Helsinki, September 10 th 2013

Towards a Long Term Research Agenda for Digital Library Research. Yannis Ioannidis University of Athens

COURSE SPECIFICATION

Prior Art Retrieval Using Various Patent Document Fields Contents

Easy Ed: An Integration of Technologies for Multimedia Education 1

CHAPTER THREE INFORMATION RETRIEVAL SYSTEM

Community Health Graduate Program Assessment Plan AY

User-Centered Evaluation of a Discovery Layer System with Google Scholar

Using the Library s Online Resources

Enas El-Sayed Mohammed El-Sharawy

4 FEBRUARY, Information architecture in theory

Dmesure: a readability platform for French as a foreign language

Cookies, fake news and single search boxes: the role of A&I services in a changing research landscape

This is the accepted version of a paper presented at International Conference on Web Engineering (ICWE).

Search Evaluation. Tao Yang CS293S Slides partially based on text book [CMS] [MRS]

Lecture 9: I: Web Retrieval II: Webology. Johan Bollen Old Dominion University Department of Computer Science

CSCI 5417 Information Retrieval Systems! What is Information Retrieval?

Development of Search Engines using Lucene: An Experience

DIGITAL ARCHIVING OF SPECIFIC SCIENTIFIC INFORMATION IN THE CZECH REPUBLIC

Transcription:

Adding user context to IR test collections Birger Larsen Information Systems and Interaction Design Royal School of Library and Information Science Copenhagen, Denmark blar @ iva.dk

Outline RSLIS and ISID A cognitive view on information needs isearch a test collection for task based and integrated search B. Larsen UAM, October 2011 2

RSLIS A small higher education institution under the Danish Ministry of Culture Full university status 2000 BSc, MSc and PhD programmes 200 BSc students accepted a year 75 MSc students accepted a year 1 degree course taught in English Continuing education courses Two locations Copenhagen and Aalborg 40 faculty members B. Larsen UAM, October 2011 3

Research @ RSLIS Seven research groups since April 2010 (including cultural mediation, library and knowledge management, research analysis and bibliometrics) Information Systems and Interaction Design Interaction between users, information and systems in given contexts Research in theories, models and tools for design, construction and evaluation of interactive and motivated solution for access to information 9 faculty members RSLIS ITlab OpenLab for students (10 desktops) USElab (4 powerful desktops; Tobii EyeTracker) ITlab (5 racked linux servers; 4 power desktops; 20 TB storage) Peter Ingwersen Birger Larsen Haakon Lund Toine Bogers

ISID research interests Task based IR and domain specific IR isearch Aggregated and integrated search Citation analysis and bibliometrics for IR MUMIA User based IR and personalisation, including interactive approaches e.g. eye tracking Recommender systems Cultural heritage information access and multimedia metadata Collaboration on formal approaches IR, e.g. quantum IR and subjective logic for IR LARM CoSound B. Larsen UAM, October 2011 5

A COGNITIVE VIEW ON INFORMATION NEEDS B. Larsen UAM, October 2011 6

Cognitive communication system at a given point in time Context Situation A Generator World Model Information Cognitive Emotional Level of System State of Uncertainty Recipient World Model Problem Space Current Cognitive State Interpretation Cognitive free fall Information processing stages Generated object Signs Linguistic Level of System Transformation Interaction Perceived object Context Situation B B. Larsen UAM, October 2011 Ingwersen & Järvelin, The Turn (2005) 7

Information needs Queries have a background and a context Work tasks as an important trigger B. Larsen UAM, October 2011 Ingwersen, JDoc (1996) 8

B. Larsen UAM, October 2011 Ingwersen, JDoc (1996) 9

The Cranfield/TREC IR model Search request Relevance assessment Documents Representation Representation Database Query Matching Result Recall base Evaluation Evaluation Result Ingwersen & Järvelin, The Turn (2005) UAM, October 2011 10

IR test collections Three parts 1. Document corpus a collection of documents (text, audio, video, web pages) 2. Topics a set of information needs, usually described at several levels (TREC: title; description; narrative) 3. Relevance assessments judgements of the relevance of documents for the topics IR test collections facilitate Comparison of the performance of different IR models and variants Using a set of standard performance measures, e.g., Mean Average Precision (MAP): Precision calculated after each relevant retrieved document, then averaged over each topic and mean over all topics Precision @ 10 (P@10): Precision after 10 retrieved documents Cumulated Gain (NDCG): Gain cumulated after each retrieved document; allows comparison to ideal model + use of non binary relevance assessments B. Larsen UAM, October 2011 11

Information Retrieval (IR) as Formula 1? B. Larsen UAM, October 2011 12

A cognitive view on information needs Human interaction with information systems including IR systems is a cognitive activity Queries and information needs have a background and context that might be important for retrieval Several distinct aspects may be identified Most IR test collections contain only a few of these our work on the isearch test collection B. Larsen UAM, October 2011 13

THE ISEARCH TEST COLLECTION B. Larsen UAM, October 2011 14

Why is Google so easy and the library so hard? (Claire Duddy student) United Kingdom Serials Group 2009 Annual conference (Stone, 2010) B. Larsen UAM, October 2011 15

What is Integrated Search? Integrated search scenario Library search GUI Search engine Indexes Own catalogue ebrary Full text journals BUT how to develop and test existing and new integrated search solutions? build a test collection B. Larsen UAM, October 2011 16

Integrated Search Move from federated search search conducted in several disjoint sources and top results presented for each... to integrated search Search conducted in one system integrating several sources and document types Same query across all document types, presenting results in one ranked list Challenge: different documents types, genres and levels of descriptions For instance, articles, conference papers, books, audio, images, video, web pages Full text, title only, metadata with/without abstracts, citations, links, anchor text, user tagging, intellectual indexing and controlled vocabulary Real world digital library need, e.g., Danish State Library (http://www.statsbiblioteket.dk/summa/features text in english) Similar to universal search in web retrieval B. Larsen UAM, October 2011 17

Integrated Search Comparison to standard web retrieval Similarities Heterogeneous document corpus Differences Sources provide different but distinct levels of description For instance, very sparse data for a full book (title, authors, classification codes, controlled and uncontrolled index terms and notes) versus the full text of an article, with self assigned classification and uncontrolled keywords Conscious effort to include each type in the results regardless of level of description Different scale cannot rely on large quantity data Challenges How to rank and integrate different types of documents? How to exploit different levels and types of descriptions? How to present results? B. Larsen UAM, October 2011 18

Integrated Search Test Collection Co funded project with Denmark s Electronic Research Library Research team: Marianne Lykke, Birger Larsen, Haakon Lund, Toine Bogers, Peter Ingwersen & Christina Lioma Goals to create a test collection of scholarly documents that facilitates the design and test of integrated search IR models, e.g., Which standard IR models perform best for this task? Is parameter tuning sufficient to avoid over emphasis of some document types (e.g., full text), or do special measures need to be taken? to base topics on realistic information needs and to obtain realistic relevance assessments to obtain a realistic and diverse document corpus Focus on textual documents, but other media could be used also B. Larsen UAM, October 2011 19

isearch Test Collection Domain and document subsets Physics chosen as domain Availability of documents because of self archiving in open access repositories and e print archives arxiv.org 500,000+ documents (metadata + full text) Complex and specific information needs Sufficiently large research field from which to recruit topic authors Document subset extracted (453,254 in total) 143,569 (32%) arxiv.org full text PDF e prints + metadata 291,244 (64%) arxiv.org e print metadata (title, authors, subject, source, abstract) 18,441 (4%) book records (title, authors, subject, source) Approx. 6,2 GB B. Larsen UAM, October 2011 20

isearch Test Collection Topicst Collection Topics 23 physics master s and PhD students + lecturers recruited Created 65 topics based on own tasks Thorough descriptions in five fields based on user studies: Which central search terms would you use to express your situation and information need? Keywords What are you looking for? current information need Why are you looking for this? underlying work task What is your background knowledge of this topic? current knowledge state What should an ideal answer contain to solve your problem or task? Formed the basis for relevance assessments B. Larsen UAM, October 2011 21

isearch Test Collection Topics 5 perspectives Perspective Question a) Current information need What are you looking for? b) Work task situation Why are you looking for this? c) Current knowledge state d) Ideal answer e) Adequate search terms What is you background knowledge of this topic? What should am ideal answer contain to solve problem or task? Which central search terms would you use to express situation and information need? Based on the cognitive view on IR, e.g. Ingwersen & Järvelin (2005) B. Larsen UAM, October 2011 22

Example: isearch topic No. 49 1. Keywords: ZnO, rf magnetron sputtering, photo luminescence, al doped, green luminescence 2. Information Need: Information on characterization by photo luminescence of highly doped ZnO films 3. Work Task: For my master thesis I work with characterization of ZnO films by photo luminescence. The films are manufactured by RF magnetron sputtering and have thicknesses of approximately 100 nm. The films are either intrinsic or doped with Al. Green luminescence are of particular interest, but other defect modes are also of interest. The aim is to document a simple way of characterizing films in a non intrusive manor, and maybe to implement the technique in the production to monitor film growth. In particular information on sub band gab excitation is interesting as only a 405 nm laser is readily available at the institute 4. Background: I have worked with the topic for a year and a half. We have made experiments with photo luminescence and have observed green luminescence. I have read quite a lot of review articles on the subject and have been seeking articles with comparable parameters 5. Ideal Answer: An article containing examples of luminescence from samples made by rf magnetron sputtering. Graphs with photoluminescence data from ZnO films are essential. Ideally Al doped ZnO films would be featured in the article B. Larsen UAM, October 2011 23

isearch Test Collection Assessed documents Topic authors agreed to assess up to 200 documents per topic These were identified through iterative subject searches by the research team Similar to searches by information specialists (using document fields, Boolean combinations, classification codes etc. in Lucene) Separate searches adapted to each document subset Proportional to the corpus distribution where possible Assessments collected through online interface Graded relevance: Highly, Fairly, Marginally + Non relevant Additional background and satisfaction data collected through questionnaires B. Larsen UAM, October 2011 24

isearch Test Collection Collection Statistics (Lykke et al., 2010) B. Larsen UAM, October 2011 25

isearch Test Collection Usage Testing aspects of integrated search How do standard IR models perform for this task? How are relevant documents in different subsets ranked? How does parameter tuning affect each subset? Can better performing integrated models be developed? How to weight or fuse subsets? How to exploit different types and level of description? Studying other aspects Can added user context improve retrieval? IR in a restricted highly specialised scientific domain High precision IR using graded relevance Use of citations in IR Theoretical aspects, e.g., effect of topic fields in Polyrepresentation B. Larsen UAM, October 2011 26

isearch Test Collection Usage Plan to release the collection in fall 2011 Have proposed an ECIR 2012 workshop on Taskbased and aggregated search isearch will be provided for participants Facilitate exploratory experiments in both or either aspect B. Larsen UAM, October 2011 27

Wrap up ISID group and research interests isearch with extended user need descriptions Integrated/aggregated search B. Larsen UAM, October 2011 28

References Peter Ingwersen. Cognitive perspectives of information retrieval interaction: Elements of a cognitive IR theory. Journal of Documentation, 52:3 50, 1996. Peter Ingwersen and Kalvero Järvelin. The turn: integration of information seeking and retrieval in context. Springer Verlag New York, Inc., Secaucus, NJ, USA, 2005. Marianne Lykke, Birger Larsen, Haakon Lund & Peter Ingwersen (2010): Developing a Test Collection for the Evaluation of Integrated Search. In: Proceedings of ECIR 2010, 32nd European Conference on IR Research. Berlin: Springer, p. 627 630. Graham Stone. Searching life, the universe and everything? The implementation of Summon at the University of Huddersfield. Library Quarterly, 20(1):24 52, 2010. B. Larsen UAM, October 2011 29

Thank you! B. Larsen UAM, October 2011 30