NTCIR-11 Temporalia. Temporal Information Access

Similar documents
Temporal Information Access (Temporalia-2)

Overview of NTCIR-11 Temporal Information Access (Temporalia) Task

NTCIR Temporalia: A Test Collection for Temporal Information Access Research

NTCIR-11 Temporal Information Access Task

MPI-INF at the NTCIR-11 Temporal Query Classification Task

Using machine learning to predict temporal orientation of search engines queries in the Temporalia challenge

A Survey of Temporal Web Search Experience

Using machine learning to predict temporal orientation of search engines queries in the Temporalia challenge

Advanced Topics in Information Retrieval. Learning to Rank. ATIR July 14, 2016

Time-aware Approaches to Information Retrieval

Understanding the Query: THCIB and THUIS at NTCIR-10 Intent Task. Junjun Wang 2013/4/22

HUKB at NTCIR-12 IMine-2 task: Utilization of Query Analysis Results and Wikipedia Data for Subtopic Mining

MPI-INF AT THE NTCIR-11 TEMPORAL QUERY CLASSIFICATION TASK

Overview of the NTCIR-10 INTENT-2 Task

Building Test Collections. Donna Harman National Institute of Standards and Technology

Understanding the Query: THCIB and THUIS at NTCIR-10 Intent Task

Overview of the Patent Retrieval Task at the NTCIR-6 Workshop

Aggregation for searching complex information spaces. Mounia Lalmas

University of Alicante at NTCIR-9 GeoTime

CLEF-IP 2009: Exploring Standard IR Techniques on Patent Retrieval

Understanding User s Search Behavior towards Spiky Events

Time-Surfer: Time-Based Graphical Access to Document Content

GTE-Cluster: A Temporal Search Interface for Implicit Temporal Queries

Overview of the NTCIR-12 MobileClick-2 Task

Overview of the NTCIR-13 OpenLiveQ Task

Information Search in Web Archives

Temporal Query Classification at Different Granularities

Overview of the Patent Mining Task at the NTCIR-8 Workshop

KGO at the NTCIR-12 Temporalia Task

Understanding the use of Temporal Expressions on Persian Web Search

Applying the KISS Principle for the CLEF- IP 2010 Prior Art Candidate Patent Search Task

Information Extraction based Approach for the NTCIR-10 1CLICK-2 Task

PRIS at TAC2012 KBP Track

IRCE at the NTCIR-12 IMine-2 Task

A Micro-analysis of Topic Variation for a Geotemporal Query

Overview of the TREC 2013 Crowdsourcing Track

NUS-I2R: Learning a Combined System for Entity Linking

Using Web Snippets and Web Query-logs to Measure Implicit Temporal Intents in Queries

Proceedings of NTCIR-9 Workshop Meeting, December 6-9, 2011, Tokyo, Japan

Natural Language Processing SoSe Question Answering. (based on the slides of Dr. Saeedeh Momtazi) )

Query Classification System Based on Snippet Summary Similarities for NTCIR-10 1CLICK-2 Task

Overview of Patent Retrieval Task at NTCIR-5

Question Answering Systems

Ghent University-IBCN Participation in TAC-KBP 2015 Cold Start Slot Filling task

NTU Approaches to Subtopic Mining and Document Ranking at NTCIR-9 Intent Task

University of Virginia Department of Computer Science. CS 4501: Information Retrieval Fall 2015

From CLIR to CLIE: Some Lessons in NTCIR Evaluation

Document Structure Analysis in Associative Patent Retrieval

Wikipedia and the Web of Confusable Entities: Experience from Entity Linking Query Creation for TAC 2009 Knowledge Base Population

R 2 D 2 at NTCIR-4 Web Retrieval Task

A PRELIMINARY STUDY ON THE EXTRACTION OF SOCIO-TOPICAL WEB KEYWORDS

An Investigation of Basic Retrieval Models for the Dynamic Domain Task

Natural Language Processing SoSe Question Answering. (based on the slides of Dr. Saeedeh Momtazi)

Use of Temporal Expressions in Web Search

Annotation and Feature Engineering

Annotating Spatio-Temporal Information in Documents

Developing Focused Crawlers for Genre Specific Search Engines

Building Search Applications

Master Project. Various Aspects of Recommender Systems. Prof. Dr. Georg Lausen Dr. Michael Färber Anas Alzoghbi Victor Anthony Arrascue Ayala

Overview of the NTCIR-13 OpenLiveQ Task

Comparative Analysis of Clicks and Judgments for IR Evaluation

Introduction to Text Mining. Hongning Wang

The Use of Domain Modelling to Improve Performance Over a Query Session

Information Retrieval

Beyond Ten Blue Links Seven Challenges

Visualization and text mining of patent and non-patent data

A Survey Of Different Text Mining Techniques Varsha C. Pande 1 and Dr. A.S. Khandelwal 2

Searching the Deep Web

TREC 2017 Dynamic Domain Track Overview

Modern Retrieval Evaluations. Hongning Wang

Evaluation of the Document Categorization in Fixed-point Observatory

Natural Language Processing. SoSe Question Answering

cocobean A Socially Ranked Question Answering System ethan deyoung jerry ye jimmy chen Srinivasan Ramaswamy advisor: marti hearst

Wikipedia Retrieval Task ImageCLEF 2011

Query Subtopic Mining Exploiting Word Embedding for Search Result Diversification

Learning Temporal-Dependent Ranking Models

Context based Re-ranking of Web Documents (CReWD)

CADIAL Search Engine at INEX

Final Project Discussion. Adam Meyers Montclair State University

ResPubliQA 2010

Search Engine Architecture. Hongning Wang

A Comparative Study Weighting Schemes for Double Scoring Technique

MIRACLE at ImageCLEFmed 2008: Evaluating Strategies for Automatic Topic Expansion

Information Retrieval

CIRGDISCO at RepLab2012 Filtering Task: A Two-Pass Approach for Company Name Disambiguation in Tweets

Ngram Search Engine with Patterns Combining Token, POS, Chunk and NE Information

University of Amsterdam at INEX 2010: Ad hoc and Book Tracks

On Modeling Temporal Dynamics Forgetting and Remembering for Intelligent Information Access

Entity-centric Topic Extraction and Exploration: A Network-based Approach

Learning to Rank for Faceted Search Bridging the gap between theory and practice

The 1st International Workshop on Diversity in Document Retrieval

Entity and Knowledge Base-oriented Information Retrieval

An Ensemble Dialogue System for Facts-Based Sentence Generation

TempWeb rd Temporal Web Analytics Workshop

Searching the Deep Web

Overview of Classification Subtask at NTCIR-6 Patent Retrieval Task

Semi-supervised Learning

BIG DATA SCIENTIST Certification. Big Data Scientist

Parsing tree matching based question answering

Annotation and Evaluation

Transcription:

NTCIR-11 Temporalia Temporal Information Access Hideo Joho (Univ. of Tsukuba) Adam Jatowt (Kyoto Univ.) Roi Blanco (Yahoo! Research) Hajime Naka (Univ. of Tsukuba) Shuhei Yamamoto (Univ. of Tsukuba) https://sites.google.com/site/ntcirtemporalia

Importance of Time in Search When search results do not match temporal intent behind query: tokyo weather doc with yesterday s results economy forecast japan doc with past forecasts previous tokyo olympics doc about achieved milestones in preparations for forthcoming Olympics etc.

Temporal IR 1.5% of queries are explicit temporal queries [Nunes 2008] Germany 1920s, Olympics 1956 Many queries are implicitly temporal, e.g.: einstein childhood, past depressions, fashion trend, kyoto weather, olympics, population increase expectation Queries with temporal component require special treatment [Arikan 2009; Berberich 2010; Kanhabua 2010] By matching query temporality with document temporality Recent survey on Temporal IR contains over 150 citations [Campos 2014] Arikan, I., Bedathur, S.J. and Berberich, K. Time Will Tell: Leveraging Temporal Expressions in IR. WSDM2009 Berberich et al. Language Modeling Approach for Temporal Information Needs. ECIR 2010 Kanhabua, N. and Norvag,K., Determining Time of Queries for Reranking Search Results. ECDL2010 Nunes, S., Ribeiro, C., and David, G. Use of Temporal Expressions in Web Search. ECIR2008 Campos, R., Dias, G. Jorge, A.M., and Jatowt, A. Survey of Temporal Information Retrieval and Related Applications. ACM Comput. Surv. 47(2): 15 (2014)

Our Objective Offer standardized test bench for evaluating different aspects of Temporal Information Retrieval Foster research to understand temporal aspects of search needs and to accommodate them in the best way Search aspects: Intent Query Context Document processing Result ranking Result presentation User feedback. The problem is not trivial

NTCIR-11 Temporalia: Subtasks Temporal Query Intent Classification (TQIC) Classify query to temporal classes Temporal Information Retrieval (TIR) Rank documents for temporal queries Search aspects: Intent Query Context Document processing Result ranking Result presentation User feedback. Just a pilot task...

Participation 7 countries

Participating Teams Total number of participating teams 9 6 for TQIC (17 runs) 5 for TIR + 1 organizer (18 runs) Team ID Institution Country TQIC TIR HITSZ Graduate School of Harbin Institute of Technology at Shenzhen P.R.C. HULTECH University of Caen France TUTA1 University of Tokushima Japan Andd7 Dhirubhai Ambani Institute of Information and Communication Technology India MPII Max Planck Institute for Informatics Germany UniMAN University of Manchester UK BRKLY U.C. Berkeley USA OSKAT Osaka Kyoiku University Japan ORG Temporalia Organizers Japan, Spain

Temporal Query Intent Classification

Background: Search for Present, Past and Future Related Information on the Web Survey of 110 users about their recent search queries Past: 24.5% Present: 57.3% Future: 7.2% Target time of sought information Joho, H., Jatowt, A., Blanco, R. A survey of temporal web search experience. TempWeb2013

Query Temporal Categories About recent things/events About past things/events Recency Past Query Future Atemporal About future things/events Coarsest-grained distinction, yet intuitive No temporal search intent

TQIC: Query Examples Query issuing time provided

Collecting 300 seed temp. expressions from dictionaries & query logs TQIC Dataset Creation Submitting to three major web search engines Collecting 10-20 query suggestions for each input query Removing duplicates and annotating by organizers (1.5K queries) 100 dry run queries (high agreement) 300 formal run queries (75% with high agreement, 25% with medium agreement)

TQIC Approaches Team Runs HITSZ 3 Num. of features 3 features + ngrams and POS ngrams HULTECH 2 11 features TUTA1 (top) 3 Andd7 3 MPII 3 External data sources Web search results (titles, snippets) Web search results (snippets) 3 feature groups AOL 500K query session 3 features + bag of words 6 feature groups + ngrams External tools POS tagger, NER TempoWordnet [1], GTE [2] POS tagger, SUTime [3], NER - Porter Stemmer DMOZ directory UniMan 3 19 features Wikipedia 2 thesauri, POS tagger, NER POS tagger, TempoWordnet [1], ManTIME [5] Classifiers Classifier voting & rule-based method Ensemble learning (8 classifiers) Semi-supervised & supervised classifiers (Log. Reg., SVMLin [4]) SVM, NB, DT & their agreement NB, DT & simulated annealing SVM & Random Forrest [1] https://tempowordnet.greyc.fr/ [2] http://www.ccc.ipt.pt/~ricardo/software [3] http://nlp.stanford.edu/software/sutime.shtml [4] http://vikas.sindhwani.org/svmlin.html [5] http://www.cs.man.ac.uk/~filannim/projects/tempeval-3/

TQIC Results Top for all Top for past Top for recency Top for atemporal Mean Acc. Future: 0.68 Mean Acc. Recent: 0.56 Mean Acc. Past: 0.73 Mean Acc. Atemp.: 0.69 Mean Acc. All: 0.66

TQIC: Confusion Matrix

Pearson s Correlation of Temporal Classes

Temporal Information Retrieval

NTCiR-11: Temporal Information Retrieval Temporal Information Retrieval (TIR) Returning top 100 documents for each subtopic of same topic: past, recency, future and atemporal Query 1. DocA 2. DocB 3. DocC 4. 5. topical + temporal relevance ranking https://sites.google.com/site/ntcirtemporalia/

Document Collection LivingKnowledge Corpus [http://livingknowledge.europarchive.org/] Crawl time: April 2011 and March 2013 Source: annotated news and blog feeds Size: 20GB uncompressed (>5GB zipped) 3.8M documents from 1500 different blogs and news sources Text only Provided annotations: Time Annotations, Named Entities and Sentence Splitting Participants can choose to use annotations or not Participants can use external collections such as Wikipedia

Example Document More details at: https://sites.google.com/site/ntcirtemporalia M. Matthews et al. Searching Through Time in the New York Times, HCIR, 2010

Sub-Topic Temporal Categories and their Characteristics Unquestionable king in news domain terminology gap, foreign country? Recency Past Subtopic Future Atemporal Scarce information? credibility issues? Recent information overload? Omnia mutantur

Temporalia: Topic Examples

Workshop series: 30 university students searching for their interesting topics within collection using Solr TIR Topics Creation Workshop series: expanding successful topics into 4 temporal subtopics Selecting 80 candidate topics (organizers) Removing similar topics, discarding low quality topics and editing (organizers) Dry runs: 15 topics (60 subtopics) Formal runs: 50 topics (200 subtopics)

TIR: Relevance Assessment (Formal Run) 1. About 36K documents pooled for evaluation Depth of 20 2. Workshop participants found one highly relevant and one nonrelevant document for each subtopic Used as test questions for crowdsourcing platform after manual verification 3. Crowdsourcing with common settings [Kazai 2013]: min. 3 judgments per document 5 docs/task min. 120sec work time/task $0.1/task G. Kazai, J. Kamps, and N. Milic-Frayling. An analysis of human factors and label accuracy in crowdsourcing relevance judgments. Information Retrieval, 16(2):138 178, 2013.

TIR: Relevance Assessment L2: Highly relevant exhaustive information related to search question L1: Relevant some information related to search question contained (non exhaustive) L0: Not relevant no information related to search question

TIR Approaches Team Runs Used tags HITSZ 3 TUTA1 3 Andd7 3 time annotations of collection time annotations of collection time annotations of collection External data sources - External tools Own TQIC classification method BRKLY 3 - - - OKSAT 3 time annotations of collection Approach Learning to rank, BM25 - POS tagger Learning to rank - POS tagger Lucene Wikipedia, Web search results Logistic regression model, blind relevance feedback - Plural set of terms ORG (top) 3 - - - BM25 (Solr)

TIR Results Top for all Top for past Top for atemporal Mean ndcg@20 Future: 0.43 Mean ndcg@20 Recent: 0.46 Mean ndcg@20 Past: 0.39 Mean ndcg@20 Atemp.: 0.44 Mean ndcg@20 All: 0.42

TIR Results per Topic Topic: teaching English as a second language Future: future learning methods Past: past learning methods before 2000 Recency: latest technologies available Atemp.: definition of second languages Topic: comet elenin Past: other comets spotted around the time when Elenin was discovered Topics tend to have varying difficulty over different temporal classes

Conclusions Pilot task First task entirely devoted to temporal information retrieval Subtasks: Temporal Query Intent Classification (TQIC) Temporal Information Retrieval (TIR) 8 different teams participating Provided task-specific document collection Available for anyone Much room for future improvement

Round Table Discussion Day: Thu 11th December Time: 16:05-18:00 Location: NII 20F, Room: 2001A

Proposal of Temporalia-2 Temporal Intent Disambiguation (TID) Subtask Challenge Class (4) Assign probability to each temporal intent class for a query Atemporal, past, recency, future Temporally Diversified Retrieval (TDR) Subtask Challenge Diversified retrieval Data: Living Knowledge news and blogs Evaluation metric To be decided Relevance Diversity Proportionality Temporal Search Result Presentation (TSRP) Subtask Challenge Open-ended & exploratory task Design of interactive interface with good visualization and flexibility for information access

Thank you for Participation! Any suggestions for improving Temporalia ver.2 are welcome! tc4fia@googlegroups.com https://sites.google.com/site/ntcirtemporalia