Personalized Recommendations using Knowledge Graphs. Rose Catherine Kanjirathinkal & Prof. William Cohen Carnegie Mellon University

2. The Problem

- Generate content-based recommendations on sparse real-world data using knowledge graphs (KGs)
- Knowledge graph:
  - Think of the content (also referred to as entities) as nodes
  - Add links between:
    - Items and entities, e.g. Bridge of Spies ↔ Tom Hanks
    - Users and items, e.g. Alice ↔ Saving Private Ryan
    - Entities and entities, e.g. Tom Hanks ↔ Best Actor
    - Items and items, e.g. Finding Nemo ↔ Finding Dory

3. Example

[Figure: example knowledge graph. Users: Bob, Alice, Kumar. Items: The Da Vinci Code, Inferno, The Terminal, Saving Private Ryan, Good Will Hunting. Entities: Felicity Jones (Actor), Tom Hanks (Actor), Steven Spielberg (Director), Matt Damon (Actor), Oscars (Award), Golden Globe (Award). Link types: User-Item, Item-Item, Item-Entity, Entity-Entity.]

4. Example

[Figure: the same knowledge graph (users Bob, Alice, Kumar; items The Da Vinci Code, Inferno, The Terminal, Saving Private Ryan, Good Will Hunting; entities Felicity Jones, Tom Hanks, Steven Spielberg, Matt Damon, Oscars, Golden Globe), with The Bridge of Spies added as a candidate item marked "?".]

5. Proposed Approach

- Given that a user has liked specific movies and entities in the past, rank new movies (e.g. The Bridge of Spies) for that user using ProPPR
- ProPPR: Programming with Personalized PageRank
  - First-order probabilistic logic system
  - Accepts rules and queries in a language similar to stochastic logic programs
  - Inference uses a variant of personalized PageRank
  - During training, it can learn the weights of the edges used in the walk

6. The search space is a graph!

The score for a query solution (e.g. Z = sport for about(a,Z)) depends on the random-walk probability of reaching a node.
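The random-walk scoring on this slide can be illustrated with plain personalized PageRank on a toy graph. This is a minimal sketch and not ProPPR itself: the graph, the restart probability `alpha`, the iteration count, and the function name `personalized_pagerank` are all assumptions made for illustration.

```python
def personalized_pagerank(edges, seed, alpha=0.2, iterations=50):
    """Power-iteration personalized PageRank that restarts at `seed`.

    edges: dict mapping a node to its list of out-neighbours (toy data).
    alpha: restart probability (an assumed value, not from the slides).
    """
    nodes = set(edges) | {v for outs in edges.values() for v in outs}
    rank = dict.fromkeys(nodes, 0.0)
    rank[seed] = 1.0
    for _ in range(iterations):
        nxt = dict.fromkeys(nodes, 0.0)
        for node, mass in rank.items():
            outs = edges.get(node, [])
            nxt[seed] += alpha * mass  # restart step
            if outs:
                share = (1 - alpha) * mass / len(outs)
                for v in outs:
                    nxt[v] += share
            else:
                # Dangling node: the walk returns to the seed.
                nxt[seed] += (1 - alpha) * mass
        rank = nxt
    return rank


# Hypothetical toy graph in the spirit of the Alice example.
graph = {
    "alice": ["saving_private_ryan", "the_terminal"],
    "saving_private_ryan": ["tom_hanks", "steven_spielberg"],
    "the_terminal": ["tom_hanks", "steven_spielberg"],
    "tom_hanks": ["bridge_of_spies", "captain_phillips"],
    "steven_spielberg": ["bridge_of_spies"],
}
scores = personalized_pagerank(graph, "alice")
```

In this toy graph, `bridge_of_spies` is reachable through both entities, so it ends up with a higher score than `captain_phillips`.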

7. Observe:

[Figure: graph over User 1, User 2, User 3, Item 1, Item 2, Entity 1, Entity 2, Entity 3, Category A, Entity X and Entity Y, with a region labeled KB.]

8. Want to learn:

[Figure: graph over User 1, User 2, User 3, Item P, Item Q, Entity 1, Entity 2, Entity 3, Category A, Entity X and Entity Y, with a region labeled KB.]

9. Proposed Approach, Step 1: SeedSet

- Step 1: Generate a seed set

    seedset(U,E) :- reviewed(U,M), link(M,X), related(X,E), isentity(E).
    related(X,X) :- .
    related(X,E) :- link(X,Z), related(Z,E).

- E.g. seedset(alice, E) → E = TomHanks, StevenSpielberg
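The seed set rules can be read as graph reachability: start from the items a user reviewed, follow `link` edges, and keep the nodes that are entities. The sketch below shows that reading on toy facts. It is an approximation under stated assumptions: ProPPR ranks solutions by random-walk probability rather than returning a hard set, and the `max_hops` bound, the data, and the function signature are hypothetical.

```python
def seedset(user, reviewed, link, entities, max_hops=2):
    """Entities reachable from the user's reviewed items via link edges."""
    frontier = set()
    for movie in reviewed.get(user, []):       # reviewed(U, M)
        frontier.update(link.get(movie, []))   # link(M, X)
    reached = set(frontier)                    # related(X, X)
    for _ in range(max_hops):                  # related(X, E) :- link(X, Z), related(Z, E)
        frontier = {z for x in frontier for z in link.get(x, [])} - reached
        if not frontier:
            break
        reached |= frontier
    return reached & entities                  # isentity(E)


# Hypothetical toy facts.
reviewed = {"alice": ["saving_private_ryan"]}
link = {"saving_private_ryan": ["tom_hanks", "steven_spielberg"]}
entities = {"tom_hanks", "steven_spielberg", "felicity_jones"}

alice_seeds = seedset("alice", reviewed, link, entities)
```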

10. Approach 1: EntitySim

    reviewed(U,M) :- seedset(U,E1), likesentity(U,E1), related(E1,E2), link(E2,M), isapplicable(U,M).
    likesentity(U,E) :- { f(U,E)}.

11. Proof graph for the query reviewed(alice, M)

reviewed(alice, M)
  seedset(alice, E), likesentity(alice, E), related(E, X), link(X, M), isapplicable(alice, M)
    E = TomHanks:
      seedset(alice, TomHanks), likesentity(alice, TomHanks), related(tomhanks, X), link(X, M), isapplicable(alice, M)
      wt = l(alice,tomhanks)
      X = TomHanks:
        link(tomhanks, M), isapplicable(alice, M)
          M = CaptainPhillips → CaptainPhillips
          M = BridgeOfSpies → BridgeOfSpies
    E = SSpielberg:
      seedset(alice, SSpielberg), likesentity(alice, SSpielberg), related(sspielberg, X), link(X, M), isapplicable(alice, M)
      wt = l(alice,sspielberg)
      X = SSpielberg:
        link(sspielberg, M), isapplicable(alice, M)
          M = BridgeOfSpies → BridgeOfSpies

12. Approach 2: TypeSim

- Types of entities/nodes are available
  - E.g. Tom Hanks is an Actor, Pittsburgh is a City
- Additional rule A: learn the popularity/predictability of each type and entity
  - E.g. predictive power of Actor > Country, Tom Hanks > lesser-known actors
- Additional rule B: learn type associations, i.e. the general traversal probability between types
  - E.g. Actor → Movie > Country → Movie

13. Approach 2: TypeSim - RuleSet

    reviewed(U,M) :- seedset(U,E), likesentity(U,E), popularentity(E), related(E,X), link(X,M), isapplicable(U,M).
    popularentity(E) :- entityoftype(E,T), populartype(T) { p(E)}.
    populartype(T) :- { p(T)}.
    related(X,X) :- .
    related(X,E) :- link(X,Z), typeassoc(X,Z), related(Z,E).
    typeassoc(X,Z) :- entityoftype(X,S), entityoftype(Z,T), typesim(S,T).
    typesim(S,T) :- { t(S,T)}.

14. Approach 3: GraphLF

- Latent factor models are successful in collaborative filtering
- Map users and items to the same feature space of hidden dimensions
  - E.g. comedy vs. drama, amount of action, depth of character development, or uninterpretable dimensions
- Each factor measures the user's preference for movies that are high in that factor
- Predict based on rating data; no user/item information required

15. Approach 3: GraphLF - RuleSet

    reviewed(U,M) :- related(U,E), related(E,X), link(X,M), isapplicable(U,M).
    related(U,E) :- seedset(U,E), simlf(U,E).
    related(X,X) :- .
    related(X,Y) :- link(X,Z), simlf(X,Z), related(Z,Y).
    simlf(X,Y) :- isdim(D), val(X,D), val(Y,D).
    val(X,D) :- { v(X,D)}.
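One way to read the `simlf` rule: it can fire once per latent dimension D, and each firing depends on the learned `val` weights of both arguments for that dimension. Summing over dimensions is then loosely analogous to a dot product of two latent vectors. The sketch below illustrates only that analogy. It is not how ProPPR computes scores, and the function name `sim_lf`, the weights, and the two dimensions are hypothetical.

```python
def sim_lf(val, x, y, dims):
    """Dot-product style similarity over latent dimensions.

    val: dict mapping (node, dimension) to a weight, standing in for
         the learned v(X,D) features (toy values, not learned here).
    """
    return sum(val.get((x, d), 0.0) * val.get((y, d), 0.0) for d in dims)


# Hypothetical weights for a user and an entity over two dimensions.
dims = [0, 1]
val = {
    ("alice", 0): 1.0, ("alice", 1): 0.5,
    ("tom_hanks", 0): 2.0, ("tom_hanks", 1): 4.0,
}
similarity = sim_lf(val, "alice", "tom_hanks", dims)
```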

16. Model Complexity

- Model complexity = number of parameters learned
- EntitySim: O(n), n = #users
  - Because the seed set size is constant
- TypeSim: O(n + e + t^2), e = #entities, t = #types
  - t^2: because of the type association between pairs of types
- GraphLF: O(n + e + m), m = #items
  - Typically, m >> t
- EntitySim < TypeSim < GraphLF
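The growth rates above can be compared on concrete sizes. The sketch below drops all constant factors and uses made-up dataset sizes, so the numbers only indicate relative order. Note that TypeSim comes out smaller than GraphLF only when the number of items exceeds the square of the number of types, which holds for the sizes assumed here.

```python
def parameter_counts(n_users, n_entities, n_types, n_items):
    """Order-of-growth parameter counts, with constant factors dropped."""
    return {
        "EntitySim": n_users,
        "TypeSim": n_users + n_entities + n_types ** 2,
        "GraphLF": n_users + n_entities + n_items,
    }


# Hypothetical sizes, not taken from the datasets in the slides.
counts = parameter_counts(n_users=1000, n_entities=5000, n_types=10, n_items=2000)
```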

17. Recommendation using KG: State-of-the-art method

- HeteRec_p [X. Yu, X. Ren, Y. Sun, Q. Gu, B. Sturt, U. Khandelwal, B. Norick, and J. Han. Personalized entity recommendation: A heterogeneous information network approach. In Proc. 7th ACM Int. Conf. on Web Search and Data Mining, WSDM '14]
- Main idea: find user-item preferences when the rating is not explicitly available, using metapaths
  - Metapath: a path over the types (the schema) of the KG
  - E.g. User → Movie → Actor → Movie
- Drawbacks:
  - Must choose and tune hyperparameters: (a) metapaths, (b) number of clusters
  - Requires a rich KB with types for entities and links

18. Experiments

Columns: Dataset, #Users, #Items, #Reviews
  Yelp (2013): 43,873 11, ,907
  IM100K-UIUC: ,626
  IM100K*: ,948

- Sort by timestamp: the earliest 80% → training, the newest 20% → test
- Content in Yelp: location city/state, type of business (restaurant, hospital, shopping), cuisine (American, sushi, Indian), ...
- Content in IM100K: actor, director, studio, genre, country, language
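The timestamp-based split can be sketched as follows. This assumes a single global sort over all reviews (the slide does not say whether the split is global or per user), and the record layout with a `timestamp` key is hypothetical.

```python
def chronological_split(reviews, train_fraction=0.8):
    """Sort reviews by timestamp; the earliest fraction becomes training data."""
    ordered = sorted(reviews, key=lambda r: r["timestamp"])
    cut = int(len(ordered) * train_fraction)
    return ordered[:cut], ordered[cut:]


# Hypothetical reviews with out-of-order timestamps.
reviews = [
    {"user": f"u{t % 3}", "item": f"i{t}", "timestamp": t}
    for t in [7, 2, 9, 4, 1, 10, 3, 8, 5, 6]
]
train, test = chronological_split(reviews)
```

Every test review is newer than every training review, so the model is never evaluated on feedback that predates its training data.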

19. Additional Baselines

- Popularity: recommend the popular items to users
- Co-Click: estimate conditional probabilities between items and recommend items with an aggregated conditional probability calculated using the training data of the target user
- NMF: non-negative matrix factorization without using the paths
- Hybrid-SVM: use an SVM-based ranking function to learn a global recommendation model with user implicit feedback and metapath-based similarity measures
- Naive Bayes (NB): uses the content of items as features

20. Yelp: Performance Comparison

Method        MRR
Popularity
Co-Click
NMF
Hybrid-SVM
HeteRec_p
EntitySim
TypeSim       [89%]
GraphLF       [126%]
NB

- Using type information and latent factorization gives improvements
- TypeSim vs. GraphLF: no clear winner
- NB: using content without the graph gives poor performance
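The comparison is reported in MRR (mean reciprocal rank). As a reminder of how such a number arises, here is a minimal sketch of the common first-hit definition: for each user, take the reciprocal of the rank of the first relevant item, then average over users. The slides do not state which MRR variant was used, so treat this as one plausible reading. The rankings and relevance sets are toy data.

```python
def mean_reciprocal_rank(ranked_lists, relevant):
    """Average over users of 1 / rank of the first relevant item (0 if none)."""
    total = 0.0
    for user, ranking in ranked_lists.items():
        reciprocal = 0.0
        for position, item in enumerate(ranking, start=1):
            if item in relevant[user]:
                reciprocal = 1.0 / position
                break
        total += reciprocal
    return total / len(ranked_lists)


# Hypothetical rankings: alice's first hit is at rank 2, bob's at rank 1.
ranked_lists = {"alice": ["a", "b", "c"], "bob": ["x", "y"]}
relevant = {"alice": {"b"}, "bob": {"x"}}
mrr = mean_reciprocal_rank(ranked_lists, relevant)
```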

21. IM100K: Performance Comparison (on IM100K-UIUC)

Method        MRR
Popularity
Co-Click
NMF
Hybrid-SVM
HeteRec_p
EntitySim
TypeSim       [66%]
GraphLF       [12%]
NB

- Slightly different datasets: the methods cannot be compared directly, but the results appear to be comparable
- EntitySim and NB: good performance
- Conjecture: simple methods suffice with enough training examples per user and enough content per item

22. Rating Matrix Density

- Density = #Ratings / (#Users x #Items)
- Density of Yelp = ...
- Density of IM100K* = ...06382 (82 times more dense)
- Study the performance as density increases:
  - Create datasets by filtering out users and businesses with fewer than k ratings, where k = 10, 25, 50, 100

[Figures: (b) IMDb Feedback Distribution; (c) Yelp Feedback Distribution]
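The density formula and the filtering step can be sketched together. The sketch assumes a single filtering pass that uses counts from the unfiltered data (the slide does not say whether filtering is repeated until stable), and it represents ratings as unique (user, item) pairs. All data is hypothetical.

```python
from collections import Counter


def density(ratings):
    """#Ratings / (#Users x #Items) for a list of unique (user, item) pairs."""
    users = {u for u, _ in ratings}
    items = {i for _, i in ratings}
    return len(ratings) / (len(users) * len(items))


def filter_min_ratings(ratings, k):
    """Keep ratings whose user and item each have at least k ratings (one pass)."""
    user_counts = Counter(u for u, _ in ratings)
    item_counts = Counter(i for _, i in ratings)
    return [(u, i) for u, i in ratings
            if user_counts[u] >= k and item_counts[i] >= k]


# Hypothetical ratings: u3 and item c have a single rating each.
ratings = [("u1", "a"), ("u1", "b"), ("u2", "a"), ("u2", "b"), ("u3", "c")]
dense_ratings = filter_min_ratings(ratings, k=2)
```

Dropping the sparsely rated user and item raises the density of the toy matrix, which is the effect the filtered datasets are meant to produce.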

23. Performance vs. Density

[Figure: MRR of EntitySim, TypeSim, GraphLF and NB plotted against rating matrix density; axis labels: MRR, Density, k.]

24. Conclusions

- Proposed 3 methods that use KGs for making personalized recommendations
  - EntitySim: uses the graph links
  - TypeSim: uses additional type information
  - GraphLF: combines latent factorization with graphs
- Our methods gave large improvements compared to the state-of-the-art method that uses knowledge graphs
- Studied the behavior of the methods as rating matrix density increased:
  - Type information became redundant
  - In sparse datasets, the KG is an important source of information, especially at low densities

25. Thank You!

Personalized Recommendations using Knowledge Graphs: A Probabilistic Logic Programming Approach

Rose Catherine
Language Technologies Institute, Carnegie Mellon University, Pittsburgh, PA, USA
rosecatherinek@cs.cmu.edu

ABSTRACT