Heading-aware Snippet Generation for Web Search

Similar documents
From Passages into Elements in XML Retrieval

A Practical Passage-based Approach for Chinese Document Retrieval

RMIT University at TREC 2006: Terabyte Track

LaHC at CLEF 2015 SBS Lab

INEX REPORT. Report on INEX 2011

Information Retrieval

INEX REPORT. Report on INEX 2012

Northeastern University in TREC 2009 Million Query Track

European Web Retrieval Experiments at WebCLEF 2006

(Not Too) Personalized Learning to Rank for Contextual Suggestion

INEX REPORT. Report on INEX 2013

Verbose Query Reduction by Learning to Rank for Social Book Search Track

Information Retrieval. (M&S Ch 15)

Passage Retrieval and other XML-Retrieval Tasks. Andrew Trotman (Otago) Shlomo Geva (QUT)

Reducing Redundancy with Anchor Text and Spam Priors

Comparative Analysis of Clicks and Judgments for IR Evaluation

University of Santiago de Compostela at CLEF-IP09

Semantic Extensions to Syntactic Analysis of Queries Ben Handy, Rohini Rajaraman

Term Frequency Normalisation Tuning for BM25 and DFR Models

University of Alicante at NTCIR-9 GeoTime

Citation for published version (APA): He, J. (2011). Exploring topic structure: Coherence, diversity and relatedness

Overview of the INEX 2008 Ad Hoc Track

Better Contextual Suggestions in ClueWeb12 Using Domain Knowledge Inferred from The Open Web

CS47300: Web Information Search and Management

CLEF-IP 2009: Exploring Standard IR Techniques on Patent Retrieval

IITH at CLEF 2017: Finding Relevant Tweets for Cultural Events

Ranked Retrieval. Evaluation in IR. One option is to average the precision scores at discrete. points on the ROC curve But which points?

Overview of the INEX 2008 Ad Hoc Track

TREC 2016 Dynamic Domain Track: Exploiting Passage Representation for Retrieval and Relevance Feedback

(Not Too) Personalized Learning to Rank for Contextual Suggestion

CS54701: Information Retrieval

Relevance in XML Retrieval: The User Perspective

NUSIS at TREC 2011 Microblog Track: Refining Query Results with Hashtags

ResPubliQA 2010

TREC-10 Web Track Experiments at MSRA

CS473: Course Review CS-473. Luo Si Department of Computer Science Purdue University

QCRI at TREC 2014: Applying the KISS principle for the TTG task in the Microblog Track

iarabicweb16: Making a Large Web Collection More Accessible for Research

Chapter 27 Introduction to Information Retrieval and Web Search

Information Retrieval Spring Web retrieval

Web Search Ranking. (COSC 488) Nazli Goharian Evaluation of Web Search Engines: High Precision Search

Chapter 6: Information Retrieval and Web Search. An introduction

University of Amsterdam at INEX 2010: Ad hoc and Book Tracks

A Deep Relevance Matching Model for Ad-hoc Retrieval

Combining Strategies for XML Retrieval

DCU and 2010: Ad-hoc and Data-Centric tracks

Melbourne University at the 2006 Terabyte Track

A Comparative Analysis of Cascade Measures for Novelty and Diversity

Search Engines Chapter 8 Evaluating Search Engines Felix Naumann

HARD Track Overview in TREC 2004 (Notebook) High Accuracy Retrieval from Documents

R 2 D 2 at NTCIR-4 Web Retrieval Task

Search Evaluation. Tao Yang CS293S Slides partially based on text book [CMS] [MRS]

Document Structure Analysis in Associative Patent Retrieval

Overview of the INEX 2010 Ad Hoc Track

Word Indexing Versus Conceptual Indexing in Medical Image Retrieval

Information Retrieval

Combining fields for query expansion and adaptive query expansion

Applying the KISS Principle for the CLEF- IP 2010 Prior Art Candidate Patent Search Task

A System for Query-Specific Document Summarization

Comment Extraction from Blog Posts and Its Applications to Opinion Mining

Microsoft Cambridge at TREC 13: Web and HARD tracks

Passage Retrieval and other XML-Retrieval Tasks

Finding Relevant Documents using Top Ranking Sentences: An Evaluation of Two Alternative Schemes

Running CLEF-IP experiments using a graphical query builder

ECNU at 2017 ehealth Task 2: Technologically Assisted Reviews in Empirical Medicine

Automatically Generating Queries for Prior Art Search

Identifying and Ranking Relevant Document Elements

MeSH-based dataset for measuring the relevance of text retrieval

Better Contextual Suggestions in ClueWeb12 Using Domain Knowledge Inferred from The Open Web

The TREC 2005 Terabyte Track

CSCI 599: Applications of Natural Language Processing Information Retrieval Evaluation"

MAXLCA: A NEW QUERY SEMANTIC MODEL FOR XML KEYWORD SEARCH

University of Waterloo: Logistic Regression and Reciprocal Rank Fusion at the Microblog Track

Classification and retrieval of biomedical literatures: SNUMedinfo at CLEF QA track BioASQ 2014

Fall CS646: Information Retrieval. Lecture 2 - Introduction to Search Result Ranking. Jiepu Jiang University of Massachusetts Amherst 2016/09/12

CADIAL Search Engine at INEX

CS47300: Web Information Search and Management

Retrieval Quality vs. Effectiveness of Relevance-Oriented Search in XML Documents

Using the K-Nearest Neighbor Method and SMART Weighting in the Patent Document Categorization Subtask at NTCIR-6

University of Virginia Department of Computer Science. CS 4501: Information Retrieval Fall 2015

Information Retrieval May 15. Web retrieval

Using XML Logical Structure to Retrieve (Multimedia) Objects

A Machine Learning Approach for Displaying Query Results in Search Engines

Automatic people tagging for expertise profiling in the enterprise

Kikori-KS: An Effective and Efficient Keyword Search System for Digital Libraries in XML

SEO According to Google

CriES 2010

Exploiting Index Pruning Methods for Clustering XML Collections

CS47300 Web Information Search and Management

Information Retrieval I

An Adaptive Intent Resolving Scheme for Service Discovery and Integration

MG4J: Managing Gigabytes for Java. MG4J - intro 1

A Document Graph Based Query Focused Multi- Document Summarizer

Navigation Retrieval with Site Anchor Text

Full-Text Indexing For Heritrix

WEIGHTING QUERY TERMS USING WORDNET ONTOLOGY

Improving Difficult Queries by Leveraging Clusters in Term Graph

Improving Data Access Performance by Reverse Indexing

Overview of the NTCIR-13 OpenLiveQ Task

James Mayfield! The Johns Hopkins University Applied Physics Laboratory The Human Language Technology Center of Excellence!

Transcription:

Heading-aware Snippet Generation for Web Search Tomohiro Manabe and Keishi Tajima Graduate School of Informatics, Kyoto Univ. {manabe@dl.kuis, tajima@i}.kyoto-u.ac.jp

Web Search Result Snippets Are short summaries of web page text jogging benefit Search Popular exercise... Jogging... One benefit is to improve fitness.... Benefit of sprint is weight loss.... One benefit of this exercise is protection from stress. Search engine users read them and judge relevance of original pages to search intents 2

Key Idea To generate snippets, search engines rank sentences Headings of sentences are important to rank them E.g., a query jogging A heading Jogging Sentences about jogging without keyword jogging Popular exercise Running Jogging Slow running. One benefit is to improve fitness. We propose heading-aware generation methods 3

Key Idea To generate snippets, search engines rank sentences Headings of sentences are important to rank them E.g., a query jogging A heading Jogging Sentences about jogging without keyword jogging Popular exercise Running Jogging Slow running. One benefit is to improve fitness. We propose heading-aware generation methods 4

Key Idea To generate snippets, search engines rank sentences Headings of sentences are important to rank them E.g., a query jogging A heading Jogging Sentences about jogging without keyword jogging Popular exercise Running Jogging Slow running. One benefit is to improve fitness. We propose heading-aware generation methods 5

Key Idea To generate snippets, search engines rank sentences Headings of sentences are important to rank them E.g., a query jogging A heading Jogging Sentences about jogging without keyword jogging Popular exercise Running Jogging Slow running. One benefit is to improve fitness. We propose heading-aware generation methods 6

Outline of This Presentation I. Our definition of hierarchical heading structure II. Heading-aware snippet generation methods III. Current evaluation result 7

Our definition of hierarchical heading structure Hierarchical heading structure and its components 8

In Short Hierarchical Heading Structure is composed of nested logical blocks associated with headings Each heading describes a block topic briefly Popular exercise Running Jogging Slow running. One benefit is to improve fitness. Sprint Benefit of sprint is weight loss. Swimming Front Crawl One benefit of this exercise is protection from stress. 9

In Short Hierarchical Heading Structure is composed of nested logical blocks associated with headings Each heading describes a block topic briefly Popular exercise Running Jogging Slow running. One benefit is to improve fitness. Sprint Benefit of sprint is weight loss. Swimming Front Crawl One benefit of this exercise is protection from stress. 10

A Heading is A highly summarized topic description for a part of a web page Heading words are words in headings Popular exercise Running Jogging Slow running. One benefit is to improve fitness. Sprint Benefit of sprint is weight loss. Swimming Front Crawl One benefit of this exercise is protection from stress. 11

A Heading is A highly summarized topic description for a part of a web page Heading words are words in headings Popular exercise Running Jogging Slow running. One benefit is to improve fitness. Sprint Benefit of sprint is weight loss. Swimming Front Crawl One benefit of this exercise is protection from stress. 12

A block is A part of a web page associated with a heading Note that An entire web page is also a block There is one-to-one correspondence between headings and blocks Popular exercise Running Jogging Slow running. One benefit is to improve fitness. Sprint Benefit of sprint is weight loss. Swimming Front Crawl One benefit of this exercise is protection from stress. 13

A block is A part of a web page associated with a heading Note that An entire web page is also a block There is one-to-one correspondence between headings and blocks Popular exercise Running Jogging Slow running. One benefit is to improve fitness. Sprint Benefit of sprint is weight loss. Swimming Front Crawl One benefit of this exercise is protection from stress. 14

Hierarchical Heading Structure A block may include other blocks entirely Blocks in a page form hierarchical heading structure Its root is the entire page Our methods focus on such structure Popular exercise Running Jogging Slow running. One benefit is to improve fitness. Sprint Benefit of sprint is weight loss. Swimming Front Crawl One benefit of this exercise is protection from stress. 15

Hierarchical Heading Structure Extraction from Web Pages is NOT a trivial problem Throughout this presentation, the structure is given For evaluation, we used previously proposed method Its implementation is at https://github.com/tmanabe Manabe, T., Tajima, K.: Extracting logical hierarchical structure of HTML documents based on headings. VLDB 8(12), 1606 1617 (2015) 16

Heading-aware Snippet Generation Methods 17

Basic Snippet Generation Method Input: Web page Sentence segmentation Sentence Scoring Sentence Selection Output: Snippets 1. Split the page into semantically coherent fragments (e.g. sentences) 2. Score the fragments By using scoring functions based on query keyword occurrences E.g., TFIDF and BM25 3. Select the fragments In desc. order of their scores Until the snippet length reaches limit 18

Basic Snippet Generation Method Input: Web page Sentence segmentation Sentence Scoring Sentence Selection Output: Snippets 1. Split the page into semantically coherent fragments (e.g. sentences) 2. Score the fragments By using scoring functions based on query keyword occurrences E.g., TFIDF and BM25 3. Select the fragments In desc. order of their scores Until the snippet length reaches limit 19

Basic Snippet Generation Method Input: Web page Sentence segmentation Sentence Scoring Sentence Selection Output: Snippets 1. Split the page into semantically coherent fragments (e.g. sentences) 2. Score the fragments By using scoring functions based on query keyword occurrences E.g., TFIDF and BM25 3. Select the fragments In desc. order of their scores Until the snippet length reaches limit 20

Basic Snippet Generation Method Input: Web page Sentence segmentation Sentence Scoring Sentence Selection Output: Snippets 1. Split the page into semantically coherent fragments (e.g. sentences) 2. Score the fragments By using scoring functions based on query keyword occurrences E.g., TFIDF and BM25 3. Select the fragments In desc. order of their scores Until the snippet length reaches limit 21

Basic Snippet Generation Method Input: Web page Sentence segmentation Sentence Scoring Sentence Selection Output: Snippets 1. Split the page into semantically coherent fragments (e.g. sentences) 2. Score the fragments By using scoring functions based on query keyword occurrences E.g., TFIDF and BM25 3. Select the fragments In desc. order of their scores Until the snippet length reaches limit 22

Basic Snippet Generation Method Input: Web page Sentence segmentation Sentence Scoring Sentence Selection Output: Snippets 1. Split the page into semantically coherent fragments (e.g. sentences) 2. Score the fragments By using scoring functions based on query keyword occurrences E.g., TFIDF and BM25 3. Select the fragments In desc. order of their scores Until the snippet length reaches limit 23

Four Methods All follow these steps 1. Baseline method Query keywords in sentence indicate importance 2. Existing method Heading words in sentence also indicate importance 3. Our method Query keywords in headings also indicate importance 4. Combined method All three ideas 24

Four Methods All follow these steps 1. Baseline method Query keywords in sentence indicate importance 2. Existing method Heading words in sentence also indicate importance 3. Our method Query keywords in headings also indicate importance 4. Combined method All three ideas 25

Input 1. Baseline Method Input A web page Segmentation Scoring Selection Output Popular exercise Running Jogging Slow running. One benefit is to improve fitness. 26

Input 1. Baseline Method Segmentation First step The method segments the page into sentences NOT the main topic of our research Popular exercise Running Jogging Slow running. One benefit is to improve fitness. Scoring Selection Output Popular exercise Running Jogging Slow running. One benefit is to improve fitness. 27

Input 1. Baseline Method Segmentation Scoring Second step The method scores the sentences based on the number of query keywords in them The main topic of our research But as the baseline, we used BM25(Query keywords) jogging benefit Search Popular exercise Selection Output Running Jogging Slow running. One benefit is to improve fitness. Robertson, S., Walker, S., Jones, S., Hancock-Beaulieu, M., Gatford, M.: Okapi at TREC-3. In: TREC. pp. 109-126 (1996) 28

Input 1. Baseline Method Segmentation Scoring Third step The method selects the sentences It simply scans the sentences in desc. order of score If there remains space to include sentence, it does so NOT the main topic of our research Jogging One benefit is to improve fitness. Benefit of sprint is weight loss. One benefit of this exercise is protection from stress. Selection Output 29

Input 1. Baseline Method Output Title (or URL if there is no title) Snippets Segmentation Scoring Selection Output Popular exercise... Jogging... One benefit is to improve fitness.... Benefit of sprint is weight loss.... One benefit of this exercise is protection from stress. 30

Four Methods 1. Baseline method Query keywords in sentence indicate importance 2. Existing method Heading words in sentence also indicate importance 3. Our method Query keywords in headings also indicate importance of sentences in their associated blocks 4. Combined method All three ideas 31

2. Existing Method An existing idea of Pembe and Güngör Heading words in sentences indicate importance Because the heading words are expected to be important words in the block Pembe, F.C., Güngör, T.: Structure-preserving and query-biased document summarization for web searching. Online Info. Rev. 33(4), 696-719 (2009) 32

2. Existing Method Existing method counts heading words in sentences to score the sentences because heading words are important for the sentences Heading words of a sentence are words in hierarchical headings of the block including it Hierarchical headings: headings of ancestor-or-self blocks 33

Heading Words of a Sentence Heading words of Slow running. are: Popular, exercise, running, and jogging The sentence includes a heading word running It is important Popular exercise Running Jogging Slow running. One benefit is to improve fitness. Sprint Benefit of sprint is weight loss. Swimming Front Crawl One benefit of this exercise is protection from stress. 34

Input 2. Existing Method Segmentation Scoring The scoring function for sentences Simply, BM25 Query keywords + BM25(Heading words) Problem Summation of BM25 scores produces worse ranking when they count the same words In other words, when heading words include some query keywords Selection Output Robertson, S., Zaragoza, H., Taylor, M.: Simple BM25 extension to multiple weighted fields. In: CIKM. pp. 42-49 (2004) 35

Input 2. Existing Method Therefore, we split words into three types Segmentation Scoring Selection Output Sum up their scores Not query keywords Query keywords Not heading words (Do not care) Query-only words Heading words Heading-only words Query-heading words I.e., BM25 Query only + BM25 Heading only + BM25(Query heading) 36

Three Word Types For Slow running. and a query jogging benefit Query-heading word is jogging Heading-only words are popular, exercise, running Query-only word is benefit Popular exercise Running Jogging Slow running. One benefit is to improve fitness. Sprint Benefit of sprint is weight loss. Swimming Front Crawl One benefit of this exercise is protection from stress. 37

Three Word Types For Slow running. and a query jogging benefit Query-heading word is jogging Heading-only words are popular, exercise, running Query-only word is benefit Popular exercise Running Jogging Slow running. One benefit is to improve fitness. Sprint Benefit of sprint is weight loss. Swimming Front Crawl One benefit of this exercise is protection from stress. 38

Three Word Types For Slow running. and a query jogging benefit Query-heading word is jogging Heading-only words are popular, exercise, running Query-only word is benefit Popular exercise Running Jogging Slow running. One benefit is to improve fitness. Sprint Benefit of sprint is weight loss. Swimming Front Crawl One benefit of this exercise is protection from stress. 39

Three Word Types For Slow running. and a query jogging benefit Query-heading word is jogging Heading-only words are popular, exercise, running Query-only word is benefit Popular exercise Running Jogging Slow running. One benefit is to improve fitness. Sprint Benefit of sprint is weight loss. Swimming Front Crawl One benefit of this exercise is protection from stress. 40

Input 2. Existing Method Segmentation Scoring Modified output Showed headings separately to improve readability Popular exercise > Running > Sprint Benefit of sprint is weight loss. > Swimming > Front Crawl One benefit of this exercise is protection from stress. Selection Output Headings shown iff sentences in their blocks are chosen Pembe, F.C., Güngör, T.: Structure-preserving and query-biased document summarization for web searching. Online Info. Rev. 33(4), 696-719 (2009) 41

Input 2. Existing Method Other steps are same as those of the baseline method Segmentation Scoring Selection Output 42

Four Methods 1. Baseline method Query keywords in sentence indicate importance 2. Existing method Heading words in sentence also indicate importance 3. Our method Query keywords in headings also indicate importance of sentences in their associated blocks 4. Combined method All three ideas 43

3. Our Method The existing idea Heading words in sentences indicate importance of them Our idea Query keywords in a heading indicate importance of sentences in its associated blocks 44

Omission of Heading Words Heading words are very often omitted Sentence Slow running is talking about popular exercise, running, and jogging but the heading words popular, exercise, and jogging are omitted Popular exercise Running Jogging Slow running. One benefit is to improve fitness. Sprint Benefit of sprint is weight loss. Swimming Front Crawl One benefit of this exercise is protection from stress. 45

3. Our Method Takes the omission of heading words into account Assigns high scores to sentences including query keywords within either Sentences themselves Their hierarchical headings 46

Input 3. Our Method Segmentation Scoring Sentence scoring modified in different way Each sentence comprises two fields Contents of the sentence itself Its hierarchical headings We use BM25F Scoring function for documents comprising multiple fields BM25F = к qq w(к,ss) kk 1 +w(к,ss) log NN sf к +0.5 sf к +0.5 Selection Output, w к, SS = occurs(к,ff,ss) bbbbbbbbbb ff 1 bb +bb length(ff,ss) avglength(ff) 47

Query Keywords in headings The sentence Slow running. NOT include a query keyword jogging in the sentence itself Includes jogging in the hierarchical headings of the sentence Popular exercise Running Jogging Slow running. One benefit is to improve fitness. Sprint Benefit of sprint is weight loss. Swimming Front Crawl One benefit of this exercise is protection from stress. 48

Query Keywords in headings The sentence One benefit is includes a query keyword jogging in the hierarchical headings Includes another query keyword benefit in the sentence itself Important for a query jogging benefit Popular exercise Running Jogging Slow running. One benefit is to improve fitness. Sprint Benefit of sprint is weight loss. Swimming Front Crawl One benefit of this exercise is protection from stress. 49

Input 3. Our Method Other steps are same as those of the existing method Segmentation Scoring Selection Output 50

Four Methods 1. Baseline method Query keywords in sentence indicate importance 2. Existing method Heading words in sentence also indicate importance 3. Our method Query keywords in headings also indicate importance of sentences in their associated blocks 4. Combined method All three ideas 51

Input 4. Combined Method Segmentation Scoring The two ideas are independent The combined method adopts both The scoring function is: BM25F Query only + BM25F Heading only + BM25F(Query heading) Selection Output 52

Input 4. Combined Method Other steps are same as those of the existing and our methods Segmentation Scoring Selection Output 53

Four Methods 1. Baseline method Query keywords in sentence indicate importance 2. Existing method Heading words in sentence also indicate importance 3. Our method Query keywords in headings also indicate importance of sentences in their associated blocks 4. Combined method All three ideas 54

Current Evaluation Result On web search 55

Evaluation Methodology The most important feature of snippets: Judgeability To what extent snippets help to judge relevance of pages In the INEX snippet retrieval track Relevance judgments under different conditions are compared Based on the entire documents Only based on their snippets If they agree, the snippets provides high judgeability and the snippet generation method is effective Length limit of snippets: 180 letters for a page Trappett, M., Geva, S., Trotman, A., Scholer, F., Sanderson, M.: Overview of the INEX 2013 snippet retrieval track. In: CLEF (2013) 56

Data Set Target of INEX is XML while our target is web We used data set for TREC 2014 web track ad-hoc task 50 keyword queries ClueWeb12 document collection Relevance judgement based on the entire pages We used only subset of the collection Top-20 pages for each query (1,000 in total) from baseline Generated by Indri with Waterloo spam filter 57

User Experiment To obtain snippet-based relevance judgment With 4 participants In each period, each participant is required to: 1. Read intent description behind a query 2. Scan titles and snippets of top-20 search result items 3. Judge whether each page is relevant by only them Each snippets was judged once, each participant judged a page once and used all methods evenly 58

Evaluation Measures From INEX Recall Pages correctly judged as relevant on their snippets Negative recall Pages relevant as a whole Pages correctly judged as irrelevant on their snippets Pages irrelevant as a whole Geometric mean of them Recall Negative Recall Primary evaluation measure 59

General Queries Comparison of mean evaluation scores Method Recall Negative recall Geometric mean Baseline 0.475 0.828 0.512 Existing 0.373 0.780 0.386 Ours 0.438 0.777 0.456 Combined 0.396 0.776 0.401 60

General Queries Comparison of mean evaluation scores Method Recall Negative recall Geometric mean Baseline 0.475 0.828 0.512 Existing 0.373 0.780 0.386 Ours 0.438 0.777 0.456 Combined 0.396 0.776 0.401 61

Single Queries TREC splits queries into several types Single queries have clear and focused intents Method Recall Negative recall Geometric mean Baseline 0.431 0.837 0.488 Existing 0.336 0.804 0.392 Ours 0.375 0.816 0.443 Combined 0.491 0.795 0.530 62

Single Queries TREC splits queries into several types Single queries have clear and focused intents Method Recall Negative recall Geometric mean Baseline 0.431 0.837 0.488 Existing 0.336 0.804 0.392 Ours 0.375 0.816 0.443 Combined 0.491 0.795 0.530 63

Query Length and Geometric Mean Scores Users need pages including all keywords in relation Finding such pages gets more difficult for longer queries Headings may help indicating relation Method 2 keywords 3 keywords 4+ keywords Baseline 0.585 0.503 0.394 Existing 0.406 0.387 0.350 Ours 0.543 0.378 0.362 Combined 0.393 0.388 0.425

Query Length and Geometric Mean Scores Users need pages including all keywords in relation Finding such pages gets more difficult for longer queries Headings may help indicating relation Method 2 keywords 3 keywords 4+ keywords Baseline 0.585 0.503 0.394 Existing 0.406 0.387 0.350 Ours 0.543 0.378 0.362 Combined 0.393 0.388 0.425

Conclusion Introduced a new idea for heading-aware snippet generation Query keyword in headings of sentences indicate importance of them Compared baseline and 3 heading-aware generation methods Evaluation result indicated that heading-aware methods were: Not effective for general queries Effective only for queries: representing its intent clearly containing four or more keywords Additional evaluation with more queries is needed 66