Deep Learning for Recommender Systems

Similar documents
Opportunities and challenges in personalization of online hotel search

Weighted Alternating Least Squares (WALS) for Movie Recommendations) Drew Hodun SCPD. Abstract

MACHINE LEARNING Example: Google search

CSE 158 Lecture 8. Web Mining and Recommender Systems. Extensions of latent-factor models, (and more on the Netflix prize)

CS224W Project: Recommendation System Models in Product Rating Predictions

Data Mining Techniques

CSE 258 Lecture 8. Web Mining and Recommender Systems. Extensions of latent-factor models, (and more on the Netflix prize)

Benchmarking Spark ML using BigBench. Sweta Singh TPCTC 2016

Recommendation Systems

Data Mining Techniques

Leveraging AI on the Cloud to transform your business. Florida Business Analytics Forum 2018 at University of South Florida

Event: PASS SQL Saturday - DC 2018 Presenter: Jon Tupitza, CTO Architect

Know your neighbours: Machine Learning on Graphs

Document Information

Automation.

Assignment 5: Collaborative Filtering

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

Practical Methodology. Lecture slides for Chapter 11 of Deep Learning Ian Goodfellow

Recommender Systems - Content, Collaborative, Hybrid

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

By Atul S. Kulkarni Graduate Student, University of Minnesota Duluth. Under The Guidance of Dr. Richard Maclin

As a reference, please find a version of the Machine Learning Process described in the diagram below.

Music Recommendation at Spotify

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL., NO., DEC User Action Interpretation for Online Content Optimization

Deep Character-Level Click-Through Rate Prediction for Sponsored Search

Performance Comparison of Algorithms for Movie Rating Estimation

How GPUs Power Comcast's X1 Voice Remote and Smart Video Analytics. Jan Neumann Comcast Labs DC May 10th, 2017

Memorization and Generalization for Music Recommendation: From Wide and Deep to Content-based DNN with Psuedo-MF and Transfer Learning

Taking Your Application Design to the Next Level with Data Mining

CS 124/LINGUIST 180 From Languages to Information

Datameer for Data Preparation:

Build a system health check for Db2 using IBM Machine Learning for z/os

Data Science Training

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

Ranking by Alternating SVM and Factorization Machine

Deep Learning Applications

arxiv: v1 [cs.ir] 10 Sep 2018

Real-time Recommendations on Spark. Jan Neumann, Sridhar Alla (Comcast Labs) DC Spark Interactive Meetup East May

Bayesian Personalized Ranking for Las Vegas Restaurant Recommendation

Crossing the AI Chasm. What We Learned from building Apache PredictionIO (incubating)

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

BAYESIAN GLOBAL OPTIMIZATION

BordaRank: A Ranking Aggregation Based Approach to Collaborative Filtering

Introduction. Chapter Background Recommender systems Collaborative based filtering

arxiv: v1 [cs.ir] 15 Sep 2017

CS249: ADVANCED DATA MINING

Sparkling Water. August 2015: First Edition

On hybrid modular recommendation systems for video streaming

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

Pseudo-Implicit Feedback for Alleviating Data Sparsity in Top-K Recommendation

Overview and Practical Application of Machine Learning in Pricing

Code Mania Artificial Intelligence: a. Module - 1: Introduction to Artificial intelligence and Python:

COMP 465: Data Mining Recommender Systems

Music Recommendation with Implicit Feedback and Side Information

Deploying, Managing and Reusing R Models in an Enterprise Environment

Introduction to Deep Learning in Signal Processing & Communications with MATLAB

A Quick Guide on Training a neural network using Keras.

CS535 Big Data Fall 2017 Colorado State University 10/10/2017 Sangmi Lee Pallickara Week 8- A.

Decision models for the Digital Economy

Building a Scalable Recommender System with Apache Spark, Apache Kafka and Elasticsearch

Improved Recommendation System Using Friend Relationship in SNS

Keras: Handwritten Digit Recognition using MNIST Dataset

AI Dining Suggestion App. CS 297 Report Bao Pham ( ) Advisor: Dr. Chris Pollett

Analysis of Big Data Tim Miller, Sr. Analytics Consultant Teradata Alexander Kolovos, Ph.D., Advanced Analytics Software Engineer Teradata

AI/ML for E-Commerce in India

Team Members: Yingda Hu (yhx640), Brian Zhan (bjz002) James He (jzh642), Kevin Cheng (klc954)

Thanks to Jure Leskovec, Anand Rajaraman, Jeff Ullman

Scaled Machine Learning at Matroid

CSE 546 Machine Learning, Autumn 2013 Homework 2

arxiv: v1 [cs.ir] 22 Nov 2017

COPYRIGHT DATASHEET

Transformation and aggregation preprocessing for top-k recommendation GAP rules induction

Personalized Web Search

Transforming Transport Infrastructure with GPU- Accelerated Machine Learning Yang Lu and Shaun Howell

Machine Learning with Python

What You Will Learn. What You Will Learn. How to Get Started with Wistia & 5 Ways It Generates More Leads. with Josh White

Introduction to Data Science. Introduction to Data Science with Python. Python Basics: Basic Syntax, Data Structures. Python Concepts (Core)

Tutorial on Keras CAP ADVANCED COMPUTER VISION SPRING 2018 KISHAN S ATHREY

Research at Google: With a Case of YouTube-8M. Joonseok Lee, Google Research

Index. Umberto Michelucci 2018 U. Michelucci, Applied Deep Learning,

Machine Learning Track

Machine Learning in the Process Industry. Anders Hedlund Analytics Specialist

Google GSuite Intro Demo of GSuite and GCP integration

Qualifio for brands 40 INTERACTIVE CAMPAIGN IDEAS TO BOOST YOUR MARKETING AND GENERATE LEADS

Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman Stanford University Infinite data. Filtering data streams

ECS289: Scalable Machine Learning

arxiv: v1 [cs.ir] 19 Dec 2018

TIBCO Analytics Meetup. Michael O Connell and the TIBCO Data Science Team April 25th, 2017

Polytechnic University of Tirana

Recommender Systems. Master in Computer Engineering Sapienza University of Rome. Carlos Castillo

Use of KNN for the Netflix Prize Ted Hong, Dimitris Tsamis Stanford University

Data Science Essentials

STREAMING RANKING BASED RECOMMENDER SYSTEMS

Computational Databases: Inspirations from Statistical Software. Linnea Passing, Technical University of Munich

Consumer Insights. YouGov Omnibus, 5 th -6 th April

Recommendation System for Netflix

Recommender Systems: Practical Aspects, Case Studies. Radek Pelánek

Computational Intelligence Meets the NetFlix Prize

XES Tensorflow Process Prediction using the Tensorflow Deep-Learning Framework

Efficient Deep Learning Optimization Methods

Transcription:

join at Slido.com with #bigdata2018 Deep Learning for Recommender Systems Oliver Gindele @tinyoli oliver.gindele@datatonic.com Big Data Conference Vilnius 28.11.2018

Who is Oliver? + Head of Machine Learning + PhD in computational physics Who is datatonic? We are a strong team of data scientists, machine learning experts, software engineers and mathematicians. Our mission is to provide tailor-made systems to help your organization get smart actionable insights from large data volumes.

Recommender Systems

Recommender Systems

Recommender Systems

Collaborative Filtering - Introduction

Collaborative Filtering - Introduction Objective: + Netflix Prize (2009) + Solve via SVD (ALS or SGD) + Regression problem

Finding Love with Numbers

Online Dating Dataset - LibimSeTi

Online Dating Dataset - LibimSeTi + http://www.libimseti.cz/ + 2005 + 17,359,346 ratings + 135,359 users + Ratings: 1-10 + Female (%): 69 + Male (%): 31 + Mean(rating): 5.9 + Std(rating): 3.1 userid profileid rating gender

Tensorflow: High Level APIs Dataset: Estimator:

Tensorflow: High Level APIs Dataset: Estimator: Easy distributed training (Cloud ML Engine, Kubeflow) Boosted Trees GMM KMeans AR Models

Tensorflow: High Level APIs Feature Columns:

Dataset API (tf.data)

MF Model

Estimator API (tf.estimator)

Going Deeper - Beyond MF Neural Collaborative Filtering (He et al.)

Going Deeper - Beyond MF Add user and item (profile) characteristics User Metadata Item Metadata userid age gender itemid genre length 1225 30-40 F 44044 Drama 129

Results - LibimSeTi MF MLP MF + MLP Research [1] RMSE 2.137 2.112 2.071 2.077 MAE 1.552 1.541 1.432 1.410 Training details: + 40 epochs + MLP: 4 layers (256 units pyramid) + Adam optimiser + Results calculated on held out test set (5 rating per user) + No tuning of hyperparameters [1] Trust-Based Recommendation: an Empirical Analysis, O Doherty, Jouili, Van Roy (2008)

How can we do better?

Better/More Data

Better/More Data Single Customer View Typical information available to Retailers & CPG companies Personal Details Demographics Rule-Based Segmentation Purchase History Online Interactions Support Interactions Pro-Active Machine Learning Model Output Propensity Scores Dynamic Segmentation Churn Prediction Recommendations Lifetime Value Sentiment Scores $ Analytics & Forecasting Personalised Web Experience Personalised Advertising Customer Interactions Dynamic Pricing

Better Loss Functions

Better Loss functions + Implicit feedback (Hu, Koren, Volinsky 2008): + Logistic Matrix Factorisation (Johnson, Spotify, 2014): + Use ranking loss (or pairwise loss functions)

Improved Metrics

Precision@k Precision@5 = 3/5 Of the top k recommendations, how many were relevant (interacted with)?

(Normalized) Discounted Cumulative Gain DCG given by formula 1 2 3 4 5 6 7 8 9 1 / log 2 (1+1) 1 / log 2 (3+1) 1 / log 2 (4+1) 1 / log 2 (8+1) DCG = 2.246 Best DCG = 2.562 = 0.877

Go even deeper and embed all the things!

Deep Recommender Systems - Advances Wide & Deep model (Cheng et al., 2016) Continuous Features Categorical Features

Results - Real Client Data SVD (MF) MF - Implicit Feedback model Wide & Deep P@5 0.33 0.51 0.79 NDCG@5 0.18 0.30 0.37 Training details: + Feature columns include user demographics to complement the lack of interactions of cold users + 100 epochs + Adam optimiser + Results calculated on held out test set (up to 5 ratings per user) + Tuning of the dimension size of the embedding vector of the deep part

Deep Recommender Systems - Advances Deep Neural Networks for YouTube Recommendations (Covington, Adams, Sargi, 2016)

Deep Recommender Systems - Advances Latent Cross: Making Use of Context in Recurrent Recommender Systems (Alex Beutel, Paul Covington, Sagar Jain et al., 2018)

Takeaways

Takeaways + Tensorflow can do more than vision or translation + High level APIs make model building and training painless + Custom algorithms and specific loss functions are easily implemented + Deep Recommender systems work well on real data + Embeddings and hidden layers allow for many ways to improve a recommender system

Thank you. Blog.datatonic.com facebook.com/datatonic linkedin.com/company/datatonic twitter.com/teamdatatonic