Ryen W. White, Peter Bailey, Liwei Chen Microsoft Corporation

Similar documents
Mining the Search Trails of Surfing Crowds: Identifying Relevant Websites from User Activity Data

Characterization and Modeling of Deleted Questions on Stack Overflow

Learning Ontology-Based User Profiles: A Semantic Approach to Personalized Web Search

Ryen W. White, Matthew Richardson, Mikhail Bilenko Microsoft Research Allison Heath Rice University

PUTTING CONTEXT INTO SEARCH AND SEARCH INTO CONTEXT. Susan Dumais, Microsoft Research

Ryen White, Susan Dumais, Jaime Teevan Microsoft Research. {ryenw, sdumais,

Personalized Models of Search Satisfaction. Ahmed Hassan and Ryen White

Predicting Short-Term Interests Using Activity-Based Search Context

Traveler s Path to Purchase

CS473: Course Review CS-473. Luo Si Department of Computer Science Purdue University

Assessing the Scenic Route: Measuring the Value of Search Trails in Web Logs

Extracting Users Interest From Web Log Files

ELEC6910Q Analytics and Systems for Social Media and Big Data Applications Lecture 4. Prof. James She

Learning Temporal-Dependent Ranking Models

The Open University s repository of research publications and other research outputs. Search Personalization with Embeddings

Personalized Web Search

Social Behavior Prediction Through Reality Mining

DIGITAL GLOBAL AGENCY Search Engine Optimization- A Case Study. Edited by Digital Global Agency in house team March 2017

Re-contextualization and contextual Entity exploration. Sebastian Holzki

Aggregation for searching complex information spaces. Mounia Lalmas

How to register and enter scores for a NASP virtual tournament. Basic Steps:

SEO CASE STUDY. Pulis Professional Plumbing is a group of plumbers who are specialized in plumbing more than 20 years of experience.

Deep Character-Level Click-Through Rate Prediction for Sponsored Search

Latent Space Model for Road Networks to Predict Time-Varying Traffic. Presented by: Rob Fitzgerald Spring 2017

Large-Scale Validation and Analysis of Interleaved Search Evaluation

Providing link suggestions based on recent user activity

Evaluating the Usefulness of Sentiment Information for Focused Crawlers

Minghai Liu, Rui Cai, Ming Zhang, and Lei Zhang. Microsoft Research, Asia School of EECS, Peking University

Logging into the Firepower System

Deep Tracking: Biologically Inspired Tracking with Deep Convolutional Networks

This page contains information regarding legacy Altium Vault Technologies. For the latest technology available, see Altium Vault.

WebSci and Learning to Rank for IR

Microsoft Exam

SEO WITH SHOPIFY: DOES SHOPIFY HAVE GOOD SEO?

Advanced SEO Training Details Call Us:

Comparative Analysis of Clicks and Judgments for IR Evaluation

Ranking Algorithms For Digital Forensic String Search Hits

Fluidity Trader Historical Data for Ensign Software Playback

Blackboard 5. Instructor Manual Level One Release 5.5

A Dynamic Bayesian Network Click Model for Web Search Ranking

Adding user context to IR test collections

Digital Libraries. Patterns of Use. 12 Oct 2004 CS 5244: Patterns of Use 1

CSCI 599: Applications of Natural Language Processing Information Retrieval Evaluation"

Unsupervised Learning. Presenter: Anil Sharma, PhD Scholar, IIIT-Delhi

Instructor: Stefan Savev

I N D E X GOOGLE DROPS KEYWORD TOOL. REPLACES IT WITH KEYWORD PLANNER BENEFITS OF GUEST BLOGGING FOR SEO

SEOHUNK INTERNATIONAL D-62, Basundhara Apt., Naharkanta, Hanspal, Bhubaneswar, India

CS 229 Final Project - Using machine learning to enhance a collaborative filtering recommendation system for Yelp

Acurian on. The Role of Technology in Patient Recruitment

User Guide Help Topics

The Kinect Sensor. Luís Carriço FCUL 2014/15

Wrapper: An Application for Evaluating Exploratory Searching Outside of the Lab

Automatic people tagging for expertise profiling in the enterprise

Query Phrase Expansion using Wikipedia for Patent Class Search

Automatic Search Engine Evaluation with Click-through Data Analysis. Yiqun Liu State Key Lab of Intelligent Tech. & Sys Jun.

Context based Re-ranking of Web Documents (CReWD)

NinthDecimal Mobile Audience Q Insights Report

Understanding the use of Temporal Expressions on Persian Web Search

Use this procedure to submit an invoice for services provided to OPG.

Should the Word Survey Be Avoided in Invitation Messaging?

Predicting Next Search Actions with Search Engine Query Logs

The HyperContext Prototype

Challenges on Combining Open Web and Dataset Evaluation Results: The Case of the Contextual Suggestion Track

Final Exam Search Engines ( / ) December 8, 2014

Modern Retrieval Evaluations. Hongning Wang

Understanding Online Social Network Usage from a Network Perspective

Setting Up Your Public Profile

New Products & Innovation

Mining Web Data. Lijun Zhang

RSLIS at INEX 2012: Social Book Search Track

LIST OF ACRONYMS & ABBREVIATIONS

Mining Web Data. Lijun Zhang

The Finer Things In Alteryx Ken Black 7/2/18

SEO ISSUES FOUND ON YOUR SITE (JANUARY 2, 2017)

Navigating Cisco Prime Health and Utilization Monitor Tasks in LMS 4.1

Link Farming in Twitter

No Trade Secrets. Microsoft does not claim any trade secret rights in this documentation.

Data Analysis and Solver Plugins for KSpread USER S MANUAL. Tomasz Maliszewski

arxiv: v1 [cs.cv] 2 Sep 2018

Notes: Notes: Primo Ranking Customization

SEO ISSUES FOUND ON YOUR SITE (DECEMBER 30, 2016)

Industrial Machinery. Search Marketing Case Study

Evaluation of Seed Selection Strategies for Vehicle to Vehicle Epidemic Information Dissemination

ArchiMate 3 Practitioner (Level 1 & 2) Lesson Plan. This course covers all learning materials for ArchiMate v3

Internet Usage Transaction Log Studies: The Next Generation

Developing a Test Collection for the Evaluation of Integrated Search Lykke, Marianne; Larsen, Birger; Lund, Haakon; Ingwersen, Peter

SUPPORTING SYNCHRONOUS SOCIAL Q&A THROUGHOUT THE QUESTION LIFECYCLE

whitehvac.com SEO ISSUES FOUND ON YOUR SITE (JANUARY 2, 2017)

Telkomtelstra Corporate Website Increase a Business Experience through telkomtelstra Website

IITH at CLEF 2017: Finding Relevant Tweets for Cultural Events

Domain Specific Search Engine for Students

User Guide. ISI Web of Knowledge Service for UK Education. Web of Knowledge. Helpdesk: Helpline:

Online Caching. Jiazhong Nie directed by Prof. Manfred Warmuth SOE UC, Santa Cruz. May 16, jiazhong (UCSC) May 16, / 35

UNDERSTANDING AND IMPROVING WEB SEARCH USING LARGE-SCALE BEHAVIORAL LOGS. Susan Dumais, Microsoft Research

Using the Prime Performance Manager Web Interface

CS47300: Web Information Search and Management

Marketer's Guide. User guide for marketing analysts and business users

Contextual Search using Cognitive Discovery Capabilities

Better Contextual Suggestions in ClueWeb12 Using Domain Knowledge Inferred from The Open Web

Home Page. Title Page. Page 1 of 14. Go Back. Full Screen. Close. Quit

Transcription:

Ryen W. White, Peter Bailey, Liwei Chen Microsoft Corporation

Motivation Information behavior is embedded in external context Context motivates the problem, influences interaction IR community theorized about context Context sensitive search, user studies of search context User interest models can enhance post-query behavior & general browsing by leveraging contextual info. e.g., personalization, information filtering, etc. Little is known about the value of different contextual sources for user interest modeling

Overview A systematic, log-based study of five contextual sources for user interest modeling during Web interaction Assume user has browsed to URL Evaluate the predictive value of five contexts of URL: Interaction: recent interactions preceding URL Collection: pages that link to URL Task: pages sharing search engine queries with URL Historic: long term interests of current user Social: combined long-term interests those who visit URL Domain is website recommendation not search results

Data Sources Anonymized URLs visited by users of a widelydistributed browser toolbar 4 months of logs (Aug 08 Nov 08 inclusive): Past: Aug-Sep used to create user histories Present: Oct-Nov used for current behavior and future interests 250K users randomly selected from a larger user pool once most active users (top 1%) were removed Chosen users with at least 100 page visits in Past

Trails and Terminal URLs From logs we extracted millions of browse trails Temporally-ordered sequence of URLs comprising all pages visited by a user per Web browser instance Terminate with 30-minute inactivity timeout A set of 5M terminal URLs (u t ) obtained by randomlysampling all URLs in the trails Terminal URLs demarcate past and future events Task = Learn user interest models from contexts for u t, use those models to predict future user interests

Building User Interest Models Classified context URLs in the Open Directory Project human-edited Web directory (ODP, dmoz.org) Automatically assigned category labels via URL match URL back-off used if no exact match obtained Represent interests as list of ODP category labels Labels ranked in descending order by frequency For example, for a British golf enthusiast, the top of their user interest profile might resemble: ODP Category Labels Frequency Sports/Golf/Courses/Europe/United Kingdom 102 Sports/Golf/Driving Ranges 86 Sports/Golf/Instruction/Golf Schools 63

Selecting Contexts Ingwersen and Järvelin (2005) developed nested model of context stratification representing main contextual influences on people engaged in information behavior Dimensions used Others challenging to model via logs e.g., cognitive and affective state, infrastructures, etc.

Defining Contexts None (u t only): Interest model for terminal URL Interaction (u t-5 u t-1 ): Interest model for five Web pages immediately preceding u t Task: Interest model for pages encountered during the same or similar tasks Walk on search engine click graph from u t to queries and then back out to pages

Defining Contexts Collection: Interest model pages linking to u t We obtained a set of in-links for each u t from a search engine index, built model from pages linking to u t Historic: Interest model for each user based on their long-term Web page visit history Social: Interest model from combination of the historic contexts of users that also visit u t What is the effectiveness of different context sources for user interest modeling?

Methodology Found instances of u t in Present set (Oct-Nov 08 logs) Used all actions after u t as source of future behavior Futures specific to each user and each u t Used to gauge predictive value of each context Created three interest models representing future interests (ranked list of ODP labels & frequencies): Short: within one hour of u t Medium: within one day of u t Long: within one week of u t Filtered {u t } to help ensure experimental integrity e.g., no more than 10 u t per user

Methodology Divided filtered {u t } into 10 equally-sized runs Each run contained at most one u t from each user Experimental procedure: For each u t in each run: Build ground truth for short-, medium-, and long-term future interest models Build interest models for different contexts (and combinations) Determine predictive accuracy of each model Used five measures to determine prediction accuracy P@1, P@3, Mean Reciprocal Rank, ndcg, and F1 F1 tracked well with others - focus on that here

F1 F1 score score Findings Context comparison Predictive performance of contextual sources for different futures Interaction context & Task context most predictive of short-term interests 1.2 0.7 Task context most predictive of medium-term interests 0.6 1 Historic context most predictive of long-term interests 0.8 0.5 0.4 0.6 None 0.3 Interaction 0.4 Task 0.2 0.2 Collection 0.1 Social 0 Historic Short Medium Long Temporal Temporal distance distance from from u t t

F1 score Findings Handling near misses Near miss between prediction and ground truth regarded as total miss Use one/two/three-level back-off on both ground truth and prediction 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 Back-off to top two ODP levels No back-off Short Medium Long Temporal distance from u t None Interaction Task Collection Social Historic

F1 score Findings Improved confidence Basing predictions & ground truth on small # page visits may skew results Repeat experiment & ignore labels based on < 5 page visits 0.8 0.7 0.6 0.5 Predicted and ground truth labels based on 5 pages No back-off 0.4 0.3 0.2 0.1 0 Short Medium Long Temporal distance from u t Interaction Task Collection Social Historic

Findings Combining contexts Rank Short Medium Long Sources F1 score Sources F1 score Sources F1 score 1 n, i, t, h, s, c 0.72 ** n, i, t, h, s, c 0.53 ** n, i, t, s, h, c 0.45 ** 2 n, i, s, h, c 0.71 ** n, i, t, h, c 0.52 ** n, i, s, h, c 0.43 ** 3 n, i, t, h, c 0.71 ** n, i, t 0.49 ** n, i, t, h, c 0.43 * 4 n, i, h, c 0.71 ** n, i, s, h, c 0.48 * s, h 0.43 * 5 n, i, s, t, c 0.69 ** n, i, h, t 0.48 * n, i, s, h, t 0.42 * Overlap beats single contextual sources Key contexts still important Short = Interaction (i) and Task (t) Medium = Task (t) Long = Historic (h) Supports polyrepresentation theory (Ingwersen, 1994) Overlap between sources boosts predictive accuracy

Summary of Findings Performance of context dependent on distance between u t and end of prediction window Short-term interests predicted by task/interaction contexts Topical interest may not be highly dynamic, even if queries and information needs are Medium-term interests best predicted by task context More likely to include task variants appearing in next day Long-term interests predicted by historic/social contexts Interest may be invariant over time, users visiting same pages may have similar interests Overlap effective - many contexts reinforce key interests

Conclusions and Take-away Systematic study of context for user interest modeling Studied predictive value of five context sources Value varied with duration of prediction Short: interaction/task, Medium: task, Long: historic/social Overlap was more effective than any individual source Source must be tailored to modeling task Search/recommendation systems should not treat all contextual sources equally Weights should be assigned to each source based on the nature of the prediction task