Personalized Recommendations using Knowledge Graphs. Rose Catherine Kanjirathinkal & Prof. William Cohen Carnegie Mellon University

Similar documents
Personalized Recommendations using Knowledge Graphs: A Probabilistic Logic Programming Approach

Personalized Entity Recommendation: A Heterogeneous Information Network Approach

CS249: ADVANCED DATA MINING

Mining Trusted Information in Medical Science: An Information Network Approach

c 2014 by Xiao Yu. All rights reserved.

Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman Stanford University Infinite data. Filtering data streams

arxiv: v2 [cs.ir] 2 Mar 2017

AspEm: Embedding Learning by Aspects in Heterogeneous Information Networks

User Guided Entity Similarity Search Using Meta-Path Selection in Heterogeneous Information Networks

TriRank: Review-aware Explainable Recommendation by Modeling Aspects

Heterogeneous Graph-Based Intent Learning with Queries, Web Pages and Wikipedia Concepts

Thanks to Jure Leskovec, Anand Rajaraman, Jeff Ullman

Web Personalization & Recommender Systems

Web Personalization & Recommender Systems

Using Data Mining to Determine User-Specific Movie Ratings

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

Relational Retrieval Using a Combination of Path-Constrained Random Walks

Assignment 5: Collaborative Filtering

Big Data Analytics Influx of data pertaining to the 4Vs, i.e. Volume, Veracity, Velocity and Variety

Bayesian Personalized Ranking for Las Vegas Restaurant Recommendation

Holistic Analysis of Multi-Source, Multi- Feature Data: Modeling and Computation Challenges

5/13/2009. Introduction. Introduction. Introduction. Introduction. Introduction

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

COLLABORATIVE LOCATION AND ACTIVITY RECOMMENDATIONS WITH GPS HISTORY DATA

Leveraging Knowledge Graphs for Web-Scale Unsupervised Semantic Parsing. Interspeech 2013

Recommender Systems. Techniques of AI

Demystifying movie ratings 224W Project Report. Amritha Raghunath Vignesh Ganapathi Subramanian

DATABASES AND DATABASE USERS CHAPTER 1

Use of KNN for the Netflix Prize Ted Hong, Dimitris Tsamis Stanford University

Batch-Incremental vs. Instance-Incremental Learning in Dynamic and Evolving Data

Integrating Meta-Path Selection with User-Preference for Top-k Relevant Search in Heterogeneous Information Networks

(Not Too) Personalized Learning to Rank for Contextual Suggestion

BBS654 Data Mining. Pinar Duygulu

Predict the box office of US movies

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

Domain-specific user preference prediction based on multiple user activities

Effective Latent Space Graph-based Re-ranking Model with Global Consistency

SCENARIO BASED ADAPTIVE PREPROCESSING FOR STREAM DATA USING SVM CLASSIFIER

Community-Based Recommendations: a Solution to the Cold Start Problem

CS 124/LINGUIST 180 From Languages to Information

Part 12: Advanced Topics in Collaborative Filtering. Francesco Ricci

Introduction to Data Mining

CS 124/LINGUIST 180 From Languages to Information

Reddit Recommendation System Daniel Poon, Yu Wu, David (Qifan) Zhang CS229, Stanford University December 11 th, 2011

Presented by: Dimitri Galmanovich. Petros Venetis, Alon Halevy, Jayant Madhavan, Marius Paşca, Warren Shen, Gengxin Miao, Chung Wu

A KNOWLEDGE BASED ONLINE RECORD MATCHING OVER QUERY RESULTS FROM MULTIPLE WEB DATABASE

Combining Review Text Content and Reviewer-Item Rating Matrix to Predict Review Rating

Tom hanks movie in order

Random Walk Inference and Learning. Carnegie Mellon University 7/28/2011 EMNLP 2011, Edinburgh, Scotland, UK

Sparse Estimation of Movie Preferences via Constrained Optimization

Collaborative Filtering using Weighted BiPartite Graph Projection A Recommendation System for Yelp

Part I: Data Mining Foundations

CS 229 Final Project - Using machine learning to enhance a collaborative filtering recommendation system for Yelp

Recommender Systems - Content, Collaborative, Hybrid

WebSci and Learning to Rank for IR

Ambiguity. Potential for Personalization. Personalization. Example: NDCG values for a query. Computing potential for personalization

Fast Contextual Preference Scoring of Database Tuples

Relation Learning with Path Constrained Random Walks

Graph Classification in Heterogeneous

CS 124/LINGUIST 180 From Languages to Information

International Journal of Advance Engineering and Research Development. A Facebook Profile Based TV Shows and Movies Recommendation System

Department of Computer Science & Engineering The Graduate School, Chung-Ang University. CAU Artificial Intelligence LAB

Chapter 5: Summary and Conclusion CHAPTER 5 SUMMARY AND CONCLUSION. Chapter 1: Introduction

Automatic Domain Partitioning for Multi-Domain Learning

Is Brad Pitt Related to Backstreet Boys? Exploring Related Entities

Mining Web Data. Lijun Zhang

Refinement of keyword queries over structured data with ontologies and users

Query Sugges*ons. Debapriyo Majumdar Information Retrieval Spring 2015 Indian Statistical Institute Kolkata

Pseudo-Implicit Feedback for Alleviating Data Sparsity in Top-K Recommendation

ETL and OLAP Systems

NYU CSCI-GA Fall 2016

Basic Network Concepts

An Efficient Methodology for Image Rich Information Retrieval

Collaborative Filtering using Euclidean Distance in Recommendation Engine

Learning Ranking Functions with Implicit Feedback

A System for Identifying Voyage Package Using Different Recommendations Techniques

Opportunities and challenges in personalization of online hotel search

Personalized Web Search

Towards a hybrid approach to Netflix Challenge

Using Text Learning to help Web browsing

Comparison of Recommender System Algorithms focusing on the New-Item and User-Bias Problem

Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman Stanford University

VALLIAMMAI ENGINEERING COLLEGE SRM Nagar, Kattankulathur DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING QUESTION BANK VII SEMESTER

Structured Learning. Jun Zhu

A Performance Assessment on Various Data mining Tool Using Support Vector Machine

Rating Prediction Using Preference Relations Based Matrix Factorization

Detecting and Analyzing Communities in Social Network Graphs for Targeted Marketing

Knowledge Discovery and Data Mining 1 (VO) ( )

Mining Web Data. Lijun Zhang

Infrequent Weighted Itemset Mining Using SVM Classifier in Transaction Dataset

Introduction p. 1 What is the World Wide Web? p. 1 A Brief History of the Web and the Internet p. 2 Web Data Mining p. 4 What is Data Mining? p.

CompSci 101 Exam 2 Sec02 Fall 2016

Inverted Index for Fast Nearest Neighbour

Recommender Systems. Collaborative Filtering & Content-Based Recommending

PReFacTO: Preference Relations Based. Factor Model with Topic Awareness and. Offset

An Open Source Discussion Group Recommendation System. A Project Presented to. The Faculty of the Department of Computer Science

Query Likelihood with Negative Query Generation

Recommender Systems 6CCS3WSN-7CCSMWAL

Efficiently Multiplying Sparse Matrix - Sparse Vector for Social Network Analysis

Resource and Performance Distribution Prediction for Large Scale Analytics Queries

Transcription:

+ Personalized Recommendations using Knowledge Graphs Rose Catherine Kanjirathinkal & Prof. William Cohen Carnegie Mellon University

+ The Problem 2 n Generate content-based recommendations on sparse real world data using knowledge graphs (KG) n Knowledge Graph: n Think of the content (also referred to as entities) as nodes n Add links between: n Items-Entities: e.g. Bridge of Spies ßà Tom Hanks n User-Items: e.g. Alice ßà Saving Private Ryan n Entities-Entities: e.g. Tom Hanks ßà Best Actor n Items-Items: e.g. Finding Nemo ßà Finding Dory

+ Example 3 Bob Alice Kumar The Da Vinci Code User-Item Item-Item Item-Entity Inferno The Terminal Saving Private Ryan Good Will Hunting Actor Actor Director Actor Felicity Jones Tom Hanks Steven Spielberg Matt Damon Award Oscars Entity-Entity Award Golden Globe

+ Example 4 The Da Vinci Code Item-Item Item-Entity Inferno Bob Alice Kumar User-Item The Terminal Saving Private Ryan? The Bridge of Spies Good Will Hunting Actor Actor Director Actor Felicity Jones Tom Hanks Steven Spielberg Matt Damon Award Oscars Entity-Entity Award Golden Globe

+ Proposed Approach 5 n Given that a user has liked specific movies and entities in the past, rank new movies (e.g. The Bridge of Spies) for that user using ProPPR n ProPPR: Programming with Personalized PageRank n First order probabilistic logic system n Accepts rules and queries in a language similar to stochastic logic programs n Inference using a variant of personalized PageRank n During training, can learn weights of edges for performing the walk

The search space is a graph! Score for a query soln (e.g., Z=sport for about(a,z) ) depends on random walk probability of reaching a node 6

+ Observe: 7 User 2 User 1 User 3 Item 1 Item 2 Entity 1 Entity 2 Entity 3 Category A Entity Y Entity X KB

+ Want to learn: 8 User 2 User 1 User 3 Item P Item Q Entity 1 Entity 2 Entity 3 Category A Entity Y Entity X KB

+ Proposed Approach Step 1: SeedSet 9 n Step 1: Generate a seedset seedset(u,e) :- reviewed(u,m), link(m,x), related(x,e), isentity(e). related(x,x) :-. related(x,e) :- link(x,z), related(z,e). n E.g. seedset(alice,e) à E = TomHanks, StevenSpielberg

+ Approach 1: EntitySim 10 reviewed(u,m) :- seedset(u,e1), likesentity(u,e1), related(e1,e2), link(e2,m), isapplicable(u,m). likesentity(u,e) :- { f(u,e)}.

reviewed(alice, M) + E = TomHanks seedset(alice, E), likesentity(alice, E), related(e, X), link(x, M), isapplicable(alice, M) E = SSpielberg 11 seedset(alice, TomHanks), likesentity(alice, TomHanks), related(tomhanks, X), link(x, M), isapplicable(alice, M) wt = l(alice,tomhanks) X = TomHanks link(tomhanks, M), isapplicable(alice, M) seedset(alice, SSpielberg), likesentity(alice, SSpielberg), related(sspielberg, X), link(x, M), isapplicable(alice, M) wt = l(alice,sspielberg) X = SSpielberg link(sspielberg, M), isapplicable(alice, M) M = CaptainPhillips CaptainPhillips M = BridgeOfSpies BridgeOfSpies M = BridgeOfSpies

+ Approach 2: TypeSim 12 n Types of entities/nodes available n E.g. Tom Hanks is an Actor, Pittsburgh is a City n Additional Rule A: Learn the popularity/predictability of each type and entity n E.g. predictive power of Actor > Country, Tom Hanks > lesser known n Additional Rule B: Learn Type Associations general traversal probability between types n E.g. Actor à Movie > Country à Movie

+ Approach 2: TypeSim - RuleSet 13 reviewed(u,m) :- seedset(u,e), likesentity(u,e), popularentity(e), related(e,x), link(x,m), isapplicable(u,m). popularentity(e) :- entityoftype(e,t), populartype(t) { p(e)}. populartype(t) :- { p(t)}. related(x,x) :-. related(x,e) :- link(x,z), typeassoc(x,z), related(z,e). typeassoc(x,z) :- entityoftype(x,s), entityoftype(z,t), typesim(s,t). typesim(s,t) :- { t(s,t)}.

+ Approach 3: GraphLF 14 n Latent Factor models successful in Collaborative Filtering n Map users and items to the same feature space of hidden dimensions n E.g. comedy vs. drama, amount of action, depth of character development, un-interpretable n Each factor measures user s preference for movies that are high in that factor n Predict based on rating data; no user/item information required

+ Approach 3: GraphLF - RuleSet 15 reviewed(u,m) :- related(u,e), related(e,x), link(x,m), isapplicable(u,m). related(u,e) :- seedset(u,e), simlf(u,e). related(x,x) :-. related(x,y) :- link(x, Z), simlf(x, Z), related(z, Y). simlf(x, Y) :- isdim(d), val(x, D), val(y, D). val(x,d) :- { v(x,d)}.

+ Model Complexity 16 n Model Complexity = number of parameters learned n EntitySim: O(n), n = #users n Because of constant seedset size n TypeSim: O(n + e + t 2 ), e = #entities, t = #types n t 2 : Because of type association between pairs of types n GraphLF: O(n + e + m), m = #items n Typically, m >> t n EnitySim < TypeSim < GraphLF

17 + Recommendation using KG: Stateof-the-art method n HeteRec_p [X. Yu, X. Ren, Y. Sun, Q. Gu, B. Sturt, U. Khandelwal, B. Norick, and J. Han. Personalized entity recommendation: A heterogeneous information network approach. In Proc. 7th ACM Int. Conf. on Web Search and Data Mining, WSDM 14 ] n Main Idea: Find user-item preferences when the rating is not explicitly available, using Metapaths n Metapath : a path on the TYPE / schema of the KG. n E.g. User à Movie à Actor à Movie n Drawbacks: n Choose & tune hyper parameters: (a) Metapaths (b) number of clusters n Requires a rich KB with types for entities and links.

+ Experiments 18 Dataset #Users #Items #Reviews Yelp (2013) 43,873 11,537 229,907 IM100K-UIUC 943 1360 89,626 IM100K* 940 1566 93,948 n Timestamp sort and 80% earlier à Training, 20% newer à Test n Content in Yelp: location city/state, type of business (restaurant, hospital, shopping), cuisine (american, sushi, indian), n Content in IM100K: actor, director, studio, genre, country, language

+ Additional Baselines 19 n Popularity: Recommend the popular items to users. n Co-Click: Estimate conditional probabilities between items and recommend items with an aggregated conditional probability calculated using the training data of the target user. n NMF: Non-negative matrix factorization without using the paths n Hybrid-SVM: Use SVM-based ranking function to learn a global recommendation model with user implicit feedback and metapaths based similarity measures n Naïve bayes uses the content of items as features

+ Yelp: Performance Comparison Method P@1 MRR 20 Popularity 0.0074 0.0228 Co-Click 0.0147 0.0371 NMF 0.0162 0.0382 Hybrid SVM 0.0122 0.0337 HeteRec_p 0.0213 0.0513 EntitySim 0.0221 0.0641 TypeSim 0.0444 0.0973 [ 89%] GraphLF 0.0482 [ 126%] 0.0966 NB 0 0.0087 n Using Type info & Latent Factorization gives improvements n TypeSim vs. GraphLF: No clear winner n NB: Using content without graph poor performance

+ IM100K: Performance Comparison (on IM100K-UIUC) Method P@1 MRR Popularity 0.0731 0.1923 Co-Click 0.0668 0.2041 NMF 0.2064 0.4938 Hybrid SVM 0.2087 0.4493 HeteRec_p 0.2121 0.5530 EntitySim 0.3485 0.5010 TypeSim 0.353 [ 66%] 0.5053 GraphLF 0.3248 0.4852 [ 12%] NB 0.312 0.4069 21 n Slightly different datasets: Methods cannot be compared directly appear to be comparable n EntitySim & NB: good performance n Conjecture: simple methods suffice with enough training examples per user, enough content per item

+ Rating Matrix Density 22 n Density = #Ratings / (#Users x #Items) n Density of Yelp = 0.00077 n Density of IM100K* = 0.06382 n (82 times more dense) (b) IMDb Feedback Distribution n Study the performance as density increases: (c) Yelp Feedback Distribution n Create datasets by filtering out users and businesses with lesser than k ratings, where k = 10, 25, 50, 100

+ Performance vs. Density 23 0.6 0.4 EntitySim TypeSim GraphLF NB Density 0.15 0.1 MRR Density 0.2 0.05 0 0 0 20 40 60 80 100 k

+ Conclusions 24 n Proposed 3 methods that use KGs for making personalized recommendations n EntitySim: Uses the graph links n TypeSim: Uses additional type information n GraphLF: Combines Latent Factorization with Graphs n Our methods gave large improvements compared to the state-ofthe art method that uses knowledge graphs n Studied the behavior of the methods as rating matrix density increased: n Type info became redundant n In sparse datasets, KG is an important source of information, especially at low densities.

+ 25 Thank You!