QUINT: On Query-Specific Optimal Networks

Similar documents
QUINT: On Query-Specific Optimal Networks

arxiv: v1 [cs.si] 18 Oct 2017

Supervised Link Prediction with Path Scores

Topic mash II: assortativity, resilience, link prediction CS224W

Online Social Networks and Media

Link Prediction for Social Network

Predicting Disease-related Genes using Integrated Biomedical Networks

Relational Retrieval Using a Combination of Path-Constrained Random Walks

Fast Nearest Neighbor Search on Large Time-Evolving Graphs

Random Walk Inference and Learning. Carnegie Mellon University 7/28/2011 EMNLP 2011, Edinburgh, Scotland, UK

Combine the PA Algorithm with a Proximal Classifier

Relation Learning with Path Constrained Random Walks

Mining Web Data. Lijun Zhang

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

Scalable Network Analysis

Hierarchical Graph Clustering: Quality Metrics & Algorithms

CS224W: Analysis of Networks Jure Leskovec, Stanford University

Link Prediction and Anomoly Detection

This Talk. Map nodes to low-dimensional embeddings. 2) Graph neural networks. Deep learning architectures for graphstructured

CS249: ADVANCED DATA MINING

Supervised Random Walks

A Brief Review of Representation Learning in Recommender 赵鑫 RUC

Outsourcing Privacy-Preserving Social Networks to a Cloud

The link prediction problem for social networks

Mining Web Data. Lijun Zhang

Hadoop Based Link Prediction Performance Analysis

Link Sign Prediction and Ranking in Signed Directed Social Networks

Learning to Rank on Network Data

AspEm: Embedding Learning by Aspects in Heterogeneous Information Networks

Learning Dense Models of Query Similarity from User Click Logs

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

Bipartite Edge Prediction via Transductive Learning over Product Graphs

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

CS249: SPECIAL TOPICS MINING INFORMATION/SOCIAL NETWORKS

Learning to Rank Networked Entities

arxiv: v1 [cs.si] 12 Jan 2019

Mining Social Network Graphs

TGNet: Learning to Rank Nodes in Temporal Graphs. Qi Song 1 Bo Zong 2 Yinghui Wu 1,3 Lu-An Tang 2 Hui Zhang 2 Guofei Jiang 2 Haifeng Chen 2

Composite Self-concordant Minimization

node2vec: Scalable Feature Learning for Networks

Machine Learning. Support Vector Machines. Fabio Vandin November 20, 2017

GIVEN a large graph and a query node, finding its k-

Semi-Supervised Clustering with Partial Background Information

over Multi Label Images

Graph-based Semi- Supervised Learning as Optimization

Link Prediction in Networks with Nodes Attributes. by Similarity Propagation

The Comparative Study of Machine Learning Algorithms in Text Data Classification*

Effective Latent Space Graph-based Re-ranking Model with Global Consistency

Jianyong Wang Department of Computer Science and Technology Tsinghua University

How to organize the Web?

Efficient Iterative Semi-supervised Classification on Manifold

Recommendation System for Location-based Social Network CS224W Project Report

Sampling Large Graphs: Algorithms and Applications

Uncovering the Formation of Triadic Closure in Social Networks. Zhanpeng Fang and Jie Tang Tsinghua University

Structured prediction using the network perceptron

Analysis of Biological Networks. 1. Clustering 2. Random Walks 3. Finding paths

Reddit Recommendation System Daniel Poon, Yu Wu, David (Qifan) Zhang CS229, Stanford University December 11 th, 2011

GraphGAN: Graph Representation Learning with Generative Adversarial Nets

Introduction to Data Mining

Overview Citation. ML Introduction. Overview Schedule. ML Intro Dataset. Introduction to Semi-Supervised Learning Review 10/4/2010

CSE 573: Artificial Intelligence Autumn 2010

Link Prediction in a Modified Heterogeneous Bibliographic Network

Query Independent Scholarly Article Ranking

Applying SnapVX to Real-World Problems

De#anonymizing,Social,Networks, and,inferring,private,attributes, Using,Knowledge,Graphs,

INF4820, Algorithms for AI and NLP: Hierarchical Clustering

SCALABLE, LOW LATENCY MODEL SERVING AND MANAGEMENT WITH VELOX

Learning to Match. Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li

Proximal operator and methods

ECS289: Scalable Machine Learning

Parallel and Distributed Sparse Optimization Algorithms

PERSONALIZED TAG RECOMMENDATION

C N O S N T S RA R INT N - T BA B SE S D E L O L C O A C L S E S A E RC R H

1 Case study of SVM (Rob)

Network-based auto-probit modeling for protein function prediction

Markov Networks in Computer Vision

Introduction to Optimization Problems and Methods

CS224W: Social and Information Network Analysis Project Report: Edge Detection in Review Networks

Learning a Distance Metric for Structured Network Prediction. Stuart Andrews and Tony Jebara Columbia University

BEAR: Block Elimination Approach for Random Walk with Restart on Large Graphs

Jure Leskovec, Cornell/Stanford University. Joint work with Kevin Lang, Anirban Dasgupta and Michael Mahoney, Yahoo! Research

Markov Networks in Computer Vision. Sargur Srihari

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

DS Machine Learning and Data Mining I. Alina Oprea Associate Professor, CCIS Northeastern University

CS224W Project: Recommendation System Models in Product Rating Predictions

Link Prediction Benchmarks

Review: Identification of cell types from single-cell transcriptom. method

Extracting Information from Complex Networks

Personalized Information Retrieval

Link prediction in multiplex bibliographical networks

Numerical Geometry of Nonrigid Shapes. CS 468, Spring 2013 Differential Geometry for Computer Science Justin Solomon and Adrian Butscher

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

Deep-Q: Traffic-driven QoS Inference using Deep Generative Network

S-MART: Novel Tree-based Structured Learning Algorithms Applied to Tweet Entity Linking

Network embedding. Cheng Zheng

Evaluation of different biological data and computational classification methods for use in protein interaction prediction.

A Survey on Postive and Unlabelled Learning

Machine Learning in Biology

TPA: Fast, Scalable, and Accurate Method for Approximate Random Walk with Restart on Billion Scale Graphs

Regularization and Markov Random Fields (MRF) CS 664 Spring 2008

Transcription:

QUINT: On Query-Specific Optimal Networks Presenter: Liangyue Li Joint work with Yuan Yao (NJU) -1- Jie Tang (Tsinghua) Wei Fan (Baidu) Hanghang Tong (ASU)

Node Proximity: What? Node proximity: the closeness (a.k.a., relevance, or similarity) between two nodes What is the closest node to 4? 0.13 1 4 0.10 2 3 0.13 0.13 5 7 6 0.04 9 0.08 8 0.05 0.05 0.03 10 11 0.04 12 0.02-2 -

Node Proximity: Why? Biology [Ni+] Social Network [Lerman+] E-commerce [Chen+] Disaster Mgtm [Zheng+] - 3 -

- 4 - Node Proximity: How? Random Walk with Restart (RWR) Idea: summarize multiple weighted relationships btw nodes Variants: A Electric networks: SAEC[Faloutsos+] Katz [Katz], [Huang+] Matrix-Forest-based Alg [Chobotarev+] 1 H 1 1 1 D E I 1 1 1 1 F J 1 1 Prox (A, B) = G B Score (Red Path) + Score (Green Path) + Score (Blue Path) + Score (Purple Path) +

Node Proximity: RWR 9 10 2 12 1 3 8 11 4 5 6 7-5 -

Node Proximity -- RWR Detail: a random walker starts from s (a) transmit to one neighbor with (b) go back to s with prob Formulation (1 c) r s = car s +(1 p ca ij c)e s Ranking vector Adjacent matrix Restart prob Starting vector Assumption How to best leverage the fixed input graph A - 6 -

Node Proximity: Learning RWR Goal Use side information to learn better graph Side info: user feedback, node attributes Key Idea: Infer optimal edge weights X min w kwk2 + h(q(y, s) Q(x, s)) Map edge attributes to weights x2p,y2n Limitation: Fixed topology Match user preferences Q =(I ca) 1 J. Tang, T. Lou and J. Kleinberg. Transfer Link Prediction across Heterogeneous Social Networks. TOIS, 2015. L. Backstrom and J. Leskovec. Supervised random walks: predicting and recommending links in social networks. WSDM, 2011. A. - 7Agarwal, - S. Chakrabarti, and S. Aggarwal. Learning to rank networked Arizona entities. State KDD, University 2006.

Algorithmic Questions Q1: optimal weights or optimal topology? Q2: one-fits-all or one-fits-one? Q3: offline learning or online learning? - 8 -

Q1: Optimal Weights or Topology? Observation: real network is noisy and incomplete Challenge: learn optimal weights and topology 0.13 1 4 0.10 2 3 0.13 Missing edge 0.13 5 6 0.04 9 0.08 8 0.05 0.03 10 11 Noisy edge 0.04 12 0.02 7 0.05-9 -

Q2: One-fits-all, or one-fits-one? Observation: optimal network for different queries might be different Query Node P 1 Positive Nodes 4 2 3 5 9 8 6 7 10 11 12 Negative Nodes N N 1 Negative Nodes 4 2 3 Query Node 5 7 6 9 8 10 11 12 Positive Nodes P Challenge: How to tailor learning for each query - 10 -

Q3: Offline or Online Learning Observation: Learning RWR: costly iterative sub-routine to compute a single gradient vector Learning topology: parameter space expands to O(n 2 ) One-fits-one: one optimal network for each query Challenge: How to perform query-specific online learning? - 11 -

Query-specific Optimal Network Learning s Query Node P 1 Positive Nodes 4 2 3 5 7 6 9 8 10 11 12 Negative Nodes N A Given: An input network, a query node, positive nodes P and negative nodes N Learn: An optimal network A s specific to the query s - 12 -

Roadmap Motivations Proposed Solutions: QUINT Empirical Evaluations Conclusions - 13 -

QUINT - Formulations Q =(I ca) 1 Optimization Formulation (hard version) Remarks Matching Input Network Larger parameter space Query-specific Optimal Network Positive nodes No exception is allowed in the constraint Negative nodes arg min ka s Ak 2 F A s s.t., Q(x, s) > Q(y, s), 8x 2 P, 8y 2 N Matching Preference(hard) O(n 2 ) - 14 -

QUINT - Formulations Q =(I ca) 1 Optimization Formulation (soft version) arg min L(A s ) = ka s Ak 2 F A s + P g(q(y, s) Remarks Characteristic x2p,y2n Wilcoxon-Mann-Whitney (WMW) loss Loss function Q(x, s)) Penalty to the violation of preferences Q(y, s) < Q(x, s) ) g( ) =0 Q(y, s) > Q(x, s) ) g( ) > 0-15 -

QUINT -- Optimization Q =(I ca) 1 Gradient Descent Based Solution Gradient @L(A s ) @A s =2 (A s A)+ P =2 (A s A)+ P x,y x2p,y2n @g(d yx ) @d yx @g(q(y,s) Q(x,s)) @A s ( @Q(y,s) @A s @Q(x,s) @A s ) Derivative of an Inverse Differentiable @Q @A s (i,j) = Q @(I ca s) @A s (i,j) Q = cqjij Q @Q(x, s) @A s (i, j) = cq(x, i)q(j, s) - 16 -

QUINT -- Optimization Q =(I ca) 1 Intuition @Q(x, s) @A s (i, j) Complexity O(T 1 P N (T 2 m + n 2 )) Observation Usually = cq(x, i)q(j, s) Complexity: quadratic Query node s j Neighbor of Q(j, s) Q(x, i) T 1, T 2, P, N m, n Q: how to scale up? s / @Q(x, s) @A s (i, j) x i Positive node Neighbor of x - 17 -

QUINT Scale-up Q =(I ca) 1 Key idea: Optimal network is rank-one perturbation to original network Details: arg min L(f, g) = f,g kfg0 k 2 F + (kfk2 + kgk 2 ) + P g(q(y, s) Q(x, s)) Optimization: alternating gradient descent Complexity: x2p,y2n O(T 1 P N (T 2 m + n)) - 18 -

QUINT Variant #1 Key idea: apply Taylor Approximation for Details: Q =(I ca) 1 I + P k i=1 ck A k Complexity: using 1 st order Taylor O(T 1 P N n) Q Benefit: accessing faster Q(i, j) - 19 -

QUINT Variant #2 Key idea: Only update neighborhood of the query node and the pos/neg nodes (Localized Rank-One Perturbation) Complexity O(T 1 P N max( N(s), N(P, N ) )) N(s) :Neighbors of s N(P, N ):Neighbors of pos/neg nodes max( N(s), N(P, N ) ) n Benefit: usually sub-linear to n - 20 -

Roadmap Motivations Proposed Solutions: QUINT Empirical Evaluations Conclusions - 21 -

Datasets 10+ diverse networks - 22 -

Effectiveness: MAP (Higher is better) MAP: Mean Average Precision 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 Admic/Adar Common Nbr SRW RWR wizan_dual ProSIN QUINT-Basic QUINT-Basic1st QUINT-rankOne Astro-Ph GR-QC Hep-TH Hep-PH Protein Airport Oregon NBA Email Gene Last.fm - 23 -

Effectiveness: HLU (Higher is better) HLU: Half-life Utility 90 80 70 60 50 40 30 20 10 0 Astro-Ph GR-QC Hep-TH Hep-PH Protein Airport Oregon NBA Email Gene Last.fm - 24 -

Effectiveness: AUC (Higher is better) 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0-25 - Astro-Ph GR-QC Hep-TH Hep-PH Protein Airport Oregon NBA Email Gene Last.fm

Effectiveness: Precision@20 (Higher is better) 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 Astro-Ph GR-QC Hep-TH Hep-PH Protein Airport Oregon NBA Email Gene Last.fm - 26 -

Effectiveness: Recall@5 (Higher is better) - 27-0.2 0.18 0.16 0.14 0.12 0.1 0.08 0.06 0.04 0.02 0 Astro-Ph GR-QC Hep-TH Hep-PH Protein Airport Oregon NBA Email Gene Last.fm

Effectiveness: MPR (Lower is better) MPR: Mean Percentile Ranking 0.5 0.45 0.4 0.35 0.3 0.25 0.2 0.15 0.1 0.05 0 Astro-Ph GR-QC Hep-TH Hep-PH Protein Airport Oregon NBA Email Gene Last.fm - 28 -

Efficiency -- Twitter 10 3 10 3 Running Time (second) 10 2 10 1 10 0 Running Time (second) 0.34 0.32 0.3 0.28 0.26 0.24 0.22 0.2 0.18 0.16 0 0.5 1 1.5 2 2.5 3 3.5 4 # Nodes QUINT rankone QUINT Basic1st QUINT rankone x 10 7 Running Time (second) 10 2 10 1 10 0 Running Time (second) 0.34 0.32 0.3 0.28 0.26 0.24 0.22 0.2 0.18 0.16 0 5 10 15 # Edges QUINT rankone QUINT Basic1st QUINT rankone x 10 8 1s 10 1 0 0.5 1 1.5 2 2.5 3 3.5 4 0 5 10 15 # Nodes 10 7 x 7 # Edges 10 8 x 8 10 1 QUINT-rankOne scales sub-linearly - 29 -

Roadmap Motivations Proposed Solutions: QUINT Empirical Evaluations Conclusions - 30 -

0.34 0.32 0.3 0.28 0.26 0.24 0.22 0.2 0.18 0.16 QUINT rankone 0 5 10 15 x 10 8-31 - Conclusion: QUINT Goals: Learn Optimal network (for Node Proximity) Q1 Q2 Q3 Existing Optimal weights One-fit-all offline QUINT Optimal topology One-fit-one online Algorithms: VERY efficient way to compute Rank-1 approx + Taylor approx + local search Results: consistently better on 10+ networks & 6 metrics sublinear scalability, near real-time response on billionscale networks @Q(x, s) @A s (i, j) Admic/Adar Common Nbr SRW 10 3 RWR wizan_dual ProSIN 0.9 QUINT-Basic QUINT-Basic1st QUINT-rankOne 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 Astro-Ph GR-QC Hep-TH Hep-PH Protein Airport Oregon NBA Email Gene Last.fm Running Time (second) 10 2 10 1 10 0 Query node Arizona State 10 University 1 s / j Neighbor of s Running Time (second) Q(j, s) Q(x, i) @Q(x, s) @As(i, j) # Edges x Positive node i Neighbor of x QUINT Basic1st QUINT rankone 0 5 10 15 # Edges x 10 8