Supervised Random Walks

Similar documents
Topic mash II: assortativity, resilience, link prediction CS224W

Mining Web Data. Lijun Zhang

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

Mining Web Data. Lijun Zhang

Link Farming in Twitter

Network embedding. Cheng Zheng

Information Networks: PageRank

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

node2vec: Scalable Feature Learning for Networks

CSE 190 Lecture 16. Data Mining and Predictive Analytics. Small-world phenomena

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

Big Data Analytics CSCI 4030

Introduction to Data Mining

How to organize the Web?

Using PageRank in Feature Selection

Link Prediction for Social Network

Part 1: Link Analysis & Page Rank

Supervised Link Prediction with Path Scores

CS224W Project: Recommendation System Models in Product Rating Predictions

arxiv: v1 [cs.si] 17 Nov 2010

Knowledge Discovery and Data Mining 1 (VO) ( )

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

CS6200 Information Retreival. The WebGraph. July 13, 2015

Recommendation System for Location-based Social Network CS224W Project Report

Big Data Analytics CSCI 4030

Link Analysis and Web Search

Internal Link Prediction in Early Stage using External Network

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

IITH at CLEF 2017: Finding Relevant Tweets for Cultural Events

Web consists of web pages and hyperlinks between pages. A page receiving many links from other pages may be a hint of the authority of the page

Bilevel Sparse Coding

3 announcements: Thanks for filling out the HW1 poll HW2 is due today 5pm (scans must be readable) HW3 will be posted today

Discriminate Analysis

Using PageRank in Feature Selection

Sampling Large Graphs: Algorithms and Applications

CENTRALITIES. Carlo PICCARDI. DEIB - Department of Electronics, Information and Bioengineering Politecnico di Milano, Italy

E-commercial Recommendation Algorithms Based on Link Analysis

Hybrid Rating Prediction using Graph Regularization from Network Structure Group 45

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

Extracting Information from Complex Networks

CS425: Algorithms for Web Scale Data

Slides based on those in:

Hotel Recommendation Based on Hybrid Model

Online Social Networks and Media

Comparison of Optimization Methods for L1-regularized Logistic Regression

Modeling Dynamic Behavior in Large Evolving Graphs

Sampling Large Graphs: Algorithms and Applications

Proximity Prestige using Incremental Iteration in Page Rank Algorithm

CS224W Final Report Emergence of Global Status Hierarchy in Social Networks

CS294-1 Assignment 2 Report

CPSC 340: Machine Learning and Data Mining. Feature Selection Fall 2016

CPSC 340: Machine Learning and Data Mining. Ranking Fall 2016

Learning to Discover Social Circles in Ego Networks

TRANSDUCTIVE LINK SPAM DETECTION

Lecture 5: Graphs. Rajat Mittal. IIT Kanpur

CS6220: DATA MINING TECHNIQUES

COLLABORATIVE LOCATION AND ACTIVITY RECOMMENDATIONS WITH GPS HISTORY DATA

Combining Review Text Content and Reviewer-Item Rating Matrix to Predict Review Rating

Week 10: DTMC Applications Randomized Routing. Network Performance 10-1

Automatic Summarization

Using Social Networks to Improve Movie Rating Predictions

CS224W: Analysis of Networks Jure Leskovec, Stanford University

Link Structure Analysis

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

PageRank and related algorithms

Lecture 8: Linkage algorithms and web search

Influence Maximization in Location-Based Social Networks Ivan Suarez, Sudarshan Seshadri, Patrick Cho CS224W Final Project Report

CS249: SPECIAL TOPICS MINING INFORMATION/SOCIAL NETWORKS

Domain-specific user preference prediction based on multiple user activities

DS504/CS586: Big Data Analytics Graph Mining Prof. Yanhua Li

Use of KNN for the Netflix Prize Ted Hong, Dimitris Tsamis Stanford University

DS504/CS586: Big Data Analytics Graph Mining Prof. Yanhua Li

Incorporating Satellite Documents into Co-citation Networks for Scientific Paper Searches

Einführung in Web und Data Science Community Analysis. Prof. Dr. Ralf Möller Universität zu Lübeck Institut für Informationssysteme

Classify My Social Contacts into Circles Stanford University CS224W Fall 2014

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

Algorithms and Applications in Social Networks. 2017/2018, Semester B Slava Novgorodov

G(B)enchmark GraphBench: Towards a Universal Graph Benchmark. Khaled Ammar M. Tamer Özsu

Mathematical Methods and Computational Algorithms for Complex Networks. Benard Abola

Fast Inbound Top- K Query for Random Walk with Restart

Your Name: Section: INTRODUCTION TO STATISTICAL REASONING Computer Lab #4 Scatterplots and Regression

CS 124/LINGUIST 180 From Languages to Information

CPSC 340: Machine Learning and Data Mining. Recommender Systems Fall 2017

Recognizing Handwritten Digits Using the LLE Algorithm with Back Propagation

CSE 494: Information Retrieval, Mining and Integration on the Internet

Explore Co-clustering on Job Applications. Qingyun Wan SUNet ID:qywan

Link Prediction and Anomoly Detection

My Best Current Friend in a Social Network

Link Analysis in Weibo

Spectral Methods for Network Community Detection and Graph Partitioning

CS 124/LINGUIST 180 From Languages to Information

VisoLink: A User-Centric Social Relationship Mining

Data Mining Lecture 2: Recommender Systems

Scalable Network Analysis

Semi-Supervised Clustering with Partial Background Information

Top-N Recommendations from Implicit Feedback Leveraging Linked Open Data

DSCI 575: Advanced Machine Learning. PageRank Winter 2018

Recommender Systems - Introduction. Data Mining Lecture 2: Recommender Systems

Limitations of Matrix Completion via Trace Norm Minimization

Efficient Mining Algorithms for Large-scale Graphs

Transcription:

Supervised Random Walks Pawan Goyal CSE, IITKGP September 8, 2014 Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 1 / 17

Correlation Discovery by random walk Problem definition Estimate the importance/affinity of node B with respect to another node A in the graph. Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 2 / 17

Correlation Discovery by random walk Problem definition Estimate the importance/affinity of node B with respect to another node A in the graph. Framework: Random walk with restarts Goal: Compute the importance of node B for node A" Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 2 / 17

Correlation Discovery by random walk Problem definition Estimate the importance/affinity of node B with respect to another node A in the graph. Framework: Random walk with restarts Goal: Compute the importance of node B for node A" Consider a random walker that starts from node A, choosing among the available edges every time Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 2 / 17

Correlation Discovery by random walk Problem definition Estimate the importance/affinity of node B with respect to another node A in the graph. Framework: Random walk with restarts Goal: Compute the importance of node B for node A" Consider a random walker that starts from node A, choosing among the available edges every time Except that, before he makes a choice, with probability c, he goes back to node A (restart) Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 2 / 17

Random walk with restarts Let u A (B) denote the steady state probability that the random walker will find himself at node B. Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 3 / 17

Random walk with restarts Let u A (B) denote the steady state probability that the random walker will find himself at node B. u A (B) is what we want, the importance of B with respect to A. Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 3 / 17

Random walk with restarts Let u A (B) denote the steady state probability that the random walker will find himself at node B. u A (B) is what we want, the importance of B with respect to A. u A = (u A (1),...,u A (N)) Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 3 / 17

Random walk with restarts Let u A (B) denote the steady state probability that the random walker will find himself at node B. u A (B) is what we want, the importance of B with respect to A. u A = (u A (1),...,u A (N)) Steady-state vector: u A = (1 c)au A + cv A Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 3 / 17

Random walk with restarts Let u A (B) denote the steady state probability that the random walker will find himself at node B. u A (B) is what we want, the importance of B with respect to A. u A = (u A (1),...,u A (N)) Steady-state vector: u A = (1 c)au A + cv A A: transition matrix, c: restart probability, v A : restart vector with all its N elements zero except for the entry corresponding to node A. Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 3 / 17

The problem of link prediction and recommendation Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 4 / 17

The problem of link prediction and recommendation Link Prediction We are given a snapshot of a social network at time t Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 4 / 17

The problem of link prediction and recommendation Link Prediction We are given a snapshot of a social network at time t We seek to predict the edges that will be added to the network during the interval from time t to a future time t Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 4 / 17

The problem of link prediction and recommendation Link Prediction We are given a snapshot of a social network at time t We seek to predict the edges that will be added to the network during the interval from time t to a future time t e.g. we are given a large network, say Facebook, at time t and for each user we would like to predict what new edges (friendships) that particular user will create between t and t Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 4 / 17

The problem of link prediction and recommendation Link Prediction We are given a snapshot of a social network at time t We seek to predict the edges that will be added to the network during the interval from time t to a future time t e.g. we are given a large network, say Facebook, at time t and for each user we would like to predict what new edges (friendships) that particular user will create between t and t Link Recommendation Problem The same problem can also be viewed as a link recommendation problem, where we aim to suggest to each user a list of people that the user is likely to create new connections to. Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 4 / 17

Challenges Involved Sparsity Real networks are really sparse, in Facebook, a typical user is connected to about 100-200 out of more than 500 million nodes Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 5 / 17

Challenges Involved Sparsity Real networks are really sparse, in Facebook, a typical user is connected to about 100-200 out of more than 500 million nodes Can it be modeled using network features only? New edges in Facebook social network Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 5 / 17

Creation of New Links: Important questions How do network and node features interact? How important it is to have common interests and characteristics? Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 6 / 17

Creation of New Links: Important questions How do network and node features interact? How important it is to have common interests and characteristics? How important it is to be in the same social circle and be close in the network in order to eventually connect. Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 6 / 17

Creation of New Links: Important questions How do network and node features interact? How important it is to have common interests and characteristics? How important it is to be in the same social circle and be close in the network in order to eventually connect. Develop a method that combines the features of nodes (user profile) and edges (interaction) with the network structure Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 6 / 17

Supervised Random Walks Basic Idea In a supervised way, learn how to bias a PageRank-like random walk on the network so that it visits given nodes (positive training examples) more often than the others. Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 7 / 17

Supervised Random Walks Basic Idea In a supervised way, learn how to bias a PageRank-like random walk on the network so that it visits given nodes (positive training examples) more often than the others. Use node and edge features to learn edge strengths. Random walk on such a weighted network will be more likely to visit positive than negative nodes. Link Prediction: positive : nodes to which new edges will be created in the future, negative: all other nodes. Link recommendation: positive : nodes to which user clicks on Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 7 / 17

Learning Task Training data A source node s is given, along with the training examples to which s will create links in the future. Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 8 / 17

Learning Task Training data A source node s is given, along with the training examples to which s will create links in the future. Goal Learn a function that assigns a strength (random walk probability) to each edge. Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 8 / 17

Link Prediction: Baseline Approaches Link Prediction as a classification task Take nodes to which s has created edges as positive training examples, all other nodes as negative training examples Learn a classifier that predicts where node s is going to create links Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 9 / 17

Link Prediction: Baseline Approaches Link Prediction as a classification task Take nodes to which s has created edges as positive training examples, all other nodes as negative training examples Learn a classifier that predicts where node s is going to create links Random walk with restarts Start a random walk at node s and compute the proximity of each other node to node s. Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 9 / 17

Relation to personalized PageRank We are given a source node s and a set of destination nodes d 1,...,d k D to which s will create edges in the future Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 10 / 17

Relation to personalized PageRank We are given a source node s and a set of destination nodes d 1,...,d k D to which s will create edges in the future Aim is to bias the random walk such that it will visit nodes d i more often than the other nodes in the network Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 10 / 17

Relation to personalized PageRank We are given a source node s and a set of destination nodes d 1,...,d k D to which s will create edges in the future Aim is to bias the random walk such that it will visit nodes d i more often than the other nodes in the network Can we directly set an arbitrary transition probability to each edge? Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 10 / 17

Relation to personalized PageRank We are given a source node s and a set of destination nodes d 1,...,d k D to which s will create edges in the future Aim is to bias the random walk such that it will visit nodes d i more often than the other nodes in the network Can we directly set an arbitrary transition probability to each edge? Would result in drastic over-fitting Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 10 / 17

Relation to personalized PageRank We are given a source node s and a set of destination nodes d 1,...,d k D to which s will create edges in the future Aim is to bias the random walk such that it will visit nodes d i more often than the other nodes in the network Can we directly set an arbitrary transition probability to each edge? Would result in drastic over-fitting Instead, we assign the transition probability for each edge (u, v) based on features of nodes u and v, as well as features of edge (u,v). Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 10 / 17

Problem Formulation Directed graph G(V, E) Node s, destination nodes D = {d 1,...,d k } and no-link nodes L = {l 1,...,l n } Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 11 / 17

Problem Formulation Directed graph G(V, E) Node s, destination nodes D = {d 1,...,d k } and no-link nodes L = {l 1,...,l n } Each edge (u,v) has a feature vector ψ(u,v) that describes the nodes u and v (e.g., gender, age, hometown) and the interaction attributes (e.g., time of edge creation, messages exchanges, photos appeared together in) Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 11 / 17

Problem Formulation Directed graph G(V, E) Node s, destination nodes D = {d 1,...,d k } and no-link nodes L = {l 1,...,l n } Each edge (u,v) has a feature vector ψ(u,v) that describes the nodes u and v (e.g., gender, age, hometown) and the interaction attributes (e.g., time of edge creation, messages exchanges, photos appeared together in) Compute the strength a uv = f w (ψ uv ) for edge (u,v). We want to learn the function f w (ψ) in the training phase of the algorithm Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 11 / 17

Predicting new edges using Edge Strength Edge strengths of all edges are calculated using f w Random walk with restarts is run from s Stationary distribution p of the random walk assigns each node u a probability p u Top ranked nodes are predicted as destinations of future links of s Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 12 / 17

Using edge weights Function f w (ψ uv ) combines the attributes ψ uv and the parameter vector w to output a non-negative weight a uv for each edge We use this to build the random walk stochastic transition matrix Q such that Q uv = a uv,(u,v) E w a uw Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 13 / 17

Using edge weights Function f w (ψ uv ) combines the attributes ψ uv and the parameter vector w to output a non-negative weight a uv for each edge We use this to build the random walk stochastic transition matrix Q such that Q uv = a uv,(u,v) E w a uw Corresponding matrix for random walk with restart: Q uv = (1 c)q uv + c1(v = s) Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 13 / 17

Using edge weights Function f w (ψ uv ) combines the attributes ψ uv and the parameter vector w to output a non-negative weight a uv for each edge We use this to build the random walk stochastic transition matrix Q such that Q uv = a uv,(u,v) E w a uw Corresponding matrix for random walk with restart: Q uv = (1 c)q uv + c1(v = s) Verify that Q is row stochastic Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 13 / 17

Using edge weights Function f w (ψ uv ) combines the attributes ψ uv and the parameter vector w to output a non-negative weight a uv for each edge We use this to build the random walk stochastic transition matrix Q such that Q uv = a uv,(u,v) E w a uw Corresponding matrix for random walk with restart: Q uv = (1 c)q uv + c1(v = s) Verify that Q is row stochastic P 1 n is the stationary distribution of the Random walk with restarts, and is the solution of the following equation: P = PQ Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 13 / 17

Optimization Problem Aim: Learn the parameters w of function f w (ψ uv ) that assigns each edge a strength of a uv Criterion: Assign the weights such that the random walk is more likely to visit nodes in D than L, i.e., p l < p d, for each d D and l L Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 14 / 17

Optimization Problem Aim: Learn the parameters w of function f w (ψ uv ) that assigns each edge a strength of a uv Criterion: Assign the weights such that the random walk is more likely to visit nodes in D than L, i.e., p l < p d, for each d D and l L Optimization function min w F(w) = w 2 such that d D,l L : p l < p d p i s are the pagerank scores A smaller w is preferred simply for regularization Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 14 / 17

Optimization function: Softer version min w F(w) = w 2 + λ d D,l L h(p l p d ) h(.) : loss function such that h(.) = 0 as p l < p d and h(.) > 0 for p l p d > 0 Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 15 / 17

Features used for the Facebook Network For each edge (i,j), Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 16 / 17

Features used for the Facebook Network For each edge (i,j), The number of common friends between the two nodes Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 16 / 17

Features used for the Facebook Network For each edge (i,j), The number of common friends between the two nodes Communication and observation features: probability of communication and profile observation in one week period Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 16 / 17

Features used for the Facebook Network For each edge (i,j), The number of common friends between the two nodes Communication and observation features: probability of communication and profile observation in one week period Edge initiator: Individual making the friend request is encoded as +1 or -1 Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 16 / 17

Features used for the Facebook Network For each edge (i,j), The number of common friends between the two nodes Communication and observation features: probability of communication and profile observation in one week period Edge initiator: Individual making the friend request is encoded as +1 or -1 Edge age Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 16 / 17

References Random Walk with Restarts: Pan, Jia-Yu, et al. Automatic multimedia cross-modal correlation discovery. Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 2004. Supervised Random Walks: Backstrom, Lars, and Jure Leskovec. Supervised random walks: predicting and recommending links in social networks. Proceedings of the fourth ACM international conference on Web search and data mining. ACM, 2011. Pawan Goyal (IIT Kharagpur) Supervised Random Walks September 8, 2014 17 / 17