BordaRank: A Ranking Aggregation Based Approach to Collaborative Filtering

Size: px
Start display at page:

Download "BordaRank: A Ranking Aggregation Based Approach to Collaborative Filtering"

Transcription

1 BordaRank: A Ranking Aggregation Based Approach to Collaborative Filtering Yeming TANG Department of Computer Science and Technology Tsinghua University Beijing, China tym13@mails.tsinghua.edu.cn Qiuli TONG Information Technology Center Tsinghua, University Beijing, China tql@tsinghua.edu.cn Abstract Recommender systems are widely used in today s online applications. Traditional rating-oriented methods predict user ratings on items, but they fail to capture user preference among different items. This paper regards recommendation problem as a ranking task and proposes a new ranking-oriented collaborative filtering framework based on ranking aggregation methods. In this framework, recommendation lists are generated according to item rankings given by users who are similar to the target user. Then, a two-step method called BordaRank is proposed to further explain the framework. The method first uses item collaborative filtering to predict unknown ratings and then uses Borda count method to aggregate item rankings of neighbors. Finally, BordaRank is modified as a pure rankingoriented method, which could be directly applied on the sparse rating matrix without rating prediction as an intermediate step. The methods are evaluated on real world movie rating data. Experimental results show that BordaRank improves the precision and recall of original rating-oriented methods and modified BordaRank also outperforms traditional methods. Index Terms Recommender System, Ranking Aggregation, Collaborative Filtering, Information Retrieval I. INTRODUCTION Recommender systems helps people filter out the information of interest from the mass quickly and effectively. Broadly speaking, recommendation problem is defined as a rating prediction problem. Since Netflix Prize competition was held in 2006, many rating prediction algorithms were proposed to solve recommendation problems. Most of these algorithms are based on collaborative filtering techniques. One category of these algorithms uses neighborhood-based collaborative filtering methods to generate item recommendations. Such as Grouplens [1], one of the early generation of collaborative filtering algorithms, uses user rating data to calculate user similarity or item similarity. According to the calculated similarity, Grouplens predicts how well users will like new items based on similar users or items. Another category of these algorithms uses model-based collaborative filtering methods to generate item recommendations. These algorithms use user rating data to train models to predict user ratings by machine learning or data mining algorithms [2]. For example, Paterek [3] improves the set of predictors used in Netflix Cinematch by adding biases to Singular Value Decomposition (SVD) models and performs a lower error rate than the original models. Liu et al. [4] extend Restricted Boltzmann Machine (RBM) and propose Content- Based Restricted Boltzmann Machine (CBRBM); the model is applied on both rows and columns of rating matrix to predict a better result. Apart from rating-oriented recommendation algorithms, some researchers tried to combine information retrieval techniques into recommender systems, such as Learning to Rank (L2R), Normalized Discounted Cumulative Gain (NDCG) evaluation metric, etc. For example, Liu and Yang [5] propose EigenRank which improves memory based collaborative filtering methods by pairwise methods in learning to rank. In this paper, we propose an novel algorithm of rankingoriented collaborative filtering which uses ranking aggregation method to generate item recommendations. We first describe the framework of ranking aggregation based collaborative filtering algorithms. In this framework, we introduce an algorithm which uses Borda count method to aggregate item rankings derived from the predicted user-item rating matrix. We further modified our algorithm to a pure ranking-oriented collaborative filtering algorithm which could be applied directly on the original sparse rating matrix. Finally, we perform a 5-fold cross validation on MovieLens data set [6] to validate the effectiveness of our algorithms. We also discussed the impact of different neighborhood sizes and different scoring functions in section IV. II. RELATED WORKS There are mainly three parts of related works: neighborhood-based collaborative filtering, ranking-oriented collaborative filtering and ranking aggregation methods. A. Neighborhood-based collaborative filtering Neighborhood-based collaborative filtering (also called memory-based collaborative filtering) uses neighborhood to estimate the target user s ratings. The ratings are represented as a user-item rating matrix, which is highly sparse due to the huge number of users and items. The goal of neighborhoodbased collaborative filtering is to fill the user-item rating matrix. Neighborhood-based methods fall into two categories: User Collaborative Filtering (UserCF) and Item Collaborative Filtering (ItemCF) /16/$31.00 copyright 2016 IEEE ICIS 2016, June 26-29, 2016, Okayama, Japan

2 User collaborative filtering uses user neighborhood to predict the target user s ratings. The user neighborhood contains users whose ratings are similar to the target user s existing ratings. User collaborative filtering estimates the target user s unknown ratings by taking the similarity weighted average ratings of users in his or her neighborhood. The recommendation list could be generated by sorting the target user s unrated items in descending order and selecting top-n recommendations. On the contrast, item collaborative filtering uses item neighborhood to predict the target user s ratings. The item neighborhood contains items which are similar to the target user s previous rated items. Item collaborative filtering estimates the item rating by taking the similarity weighted average ratings of the item s neighborhoods. Similarity measurement is the key concept in neighborhoodbased collaborative filtering. Because of the sparsity of the user-item rating matrix, one of the shortcomings of neighborhood-based collaborative filtering is that it is unlikely to find highly similar users or items. The performance of neighborhood-based collaborative filtering decreases when the rating matrix is sparse. To alleviate this problem, dimension reduction methods such as Singular Value Decomposition (SVD), Principle Component Analysis (PCA) etc. are applied to represent user vector in a reduced space. Many hybrid methods, such as content-boosted approaches [4], are also proposed. B. Ranking-oriented collaborative filtering Traditional rating-oriented recommendation algorithms is aimed at minimizing the predicted rating errors. However, rating-oriented methods are deficient in capturing the item preference of the target user, due to user ratings could be biased depending on different users. Some users prefer to use higher ratings, while others prefer to use lower ratings. Though methods that normalize user ratings when aggregate the ratings of neighborhoods are proposed, Instead of estimating user-item rating matrix, rankingoriented collaborative filtering generates recommendation lists directly. Ranking-oriented collaborative filtering methods combine thoughts from learning to rank, especially pairwise or listwise algorithms. For pairwise algorithm, the input is item pairs which indicate the target user s preference between rated items; the output is predicted preference on unrated items. One of the state of the art pairwise collaborative filtering algorithms is EigenRank [5]. It defines a preference function on item pairs, uses Kendall Rank Correlation Coefficient (KRCC) to measure similarity between users, and generate item recommendations by maximizing the loss function based on user preference. Listwise algorithm models user preference on item lists and directly learns on the lists to optimize model parameters. Unlike most of rating-oriented approaches use Root-Mean- Square Error (RMSE) or Mean Average Error (MAE) as their loss functions, listwise approaches define the loss functions on whole lists. For example, Weimer et al. [7] propose CofiRank which uses Maximum Margin Matrix Factorization (MMMF) optimizes NDCG. C. Ranking aggregation methods Ranking aggregation methods have been used in recommender systems for aggregating multiple results from different recommendation algorithms [8]. However, ranking-oriented collaborative filtering based on ranking aggregation is rarely researched due to most existing ranking-oriented collaborative filtering methods are based on ranking generation. The most representative ranking aggregation methods falls into two categories: unsupervised methods and supervised methods. For example, for unsupervised methods, Borda count method [9] is voting mechanism which considers voter s preference on all candidates; for supervised methods, Cranking which proposed in [10] uses Markov model on permutations to aggregate multiple rankings. III. RANKING AGGREGATION BASED COLLABORATIVE FILTERING In this section, we presented a new ranking-oriented collaborative filtering approach based on ranking aggregation methods. We first introduce a two-step recommendation method based on both rating-oriented and ranking-oriented method. To alleviate the rating data sparsity problem, we use ratingoriented collaborative filtering to fill the rating matrix. We applied original Borda count method described in section III-C to aggregate items rankings of the target user s neighbors. At last, we propose a pure ranking aggregation based collaborative filtering method, which could be directly applied on the original sparse rating matrix. A. Feature extraction and problem definition In general, a recommender system will collect users demographic information and items meta-data. These are raw data to be extracted and transformed to feature representations. For categorical data (gender, occupation etc.), because they are hard to compare their similarities, categorical data are represented as vectors using one-of-k coding. If a data field can take k different discrete values, the field should be represented as a k-dimension binary vector, which corresponds with k possible values. One-of-k coding representation can also be used for multi-valued data fields. For numerical data (age, date etc.), such transformation is not needed. In feature selection and extraction, we extract various features from raw data, for example, rate time in a day, rate date interval, ratings of different category items etc. For all numerical features, we further extract the standard deviation, skewness, mean, median, minimum, maximum etc. to describe the distribution attributes of the values. In order to adjusting features which are measured on different scales, all features are normalized using x x x = (1) x max x min where x is the feature vector and x is the normalized feature vector.

3 The problem is defined as follows. Given a set of m users U = {u 1,u 2,...,u m }, a set of n items I = {i 1,i 2,...,i n }, and a user-item rating matrix R m n, for each user u find a permutation ˆπ u, which is a bijection from I to {1, 2,...,n}, to fit the users tastes best. The rating that the user u gives to item i is denoted by r u,i, while r u,i =0if the user u has not rated item i. The set of items rated by user u is denoted by I u and the set of users who have rated item i is denoted by U i. B. Similarity measurement For neighborhood-based collaborative filtering, similarity measurement is a major step. We use Euclidean distance based similarity to measure the user and item similarity. Euclidean distance based similarity is defined as 1 s x,y = (2) 1+ x y 2 where x and y are user or item feature vectors. We select k nearest neighbors as the neighborhood of user and item. The neighborhood of user u is denoted by N u and the neighborhood of item i is denoted by N i. Although our algorithm uses Euclidean distance based similarity measurement, it can be easily replaced by other similarity measurement, such as cosine similarity [11] or Pearson correlation coefficient [1] [12]. C. Borda count based collaborative filtering In this section, we introduce the basic ranking aggregation based collaborative filtering framework. A two-step method called BordaRank is introduced to further explain the proposed framework. In the first step, BordaRank uses item collaborative filtering to fill the sparse rating matrix. This step allows Borda count method to be applied for aggregating neighbors rankings in our method. Then, BordaRank derives rankings from the predicted rating matrix and generates the item recommendation to the target user. In our ranking aggregation based collaborative filtering framework, the ranking of items given by user u is denoted by π u. A recommendation algorithm should estimate a permutation ˆπ u : I {1, 2,...,n} for each user u as the recommendation list. For rating based collaborative filtering, the recommendation list ˆπ u of the target user u is generated by ranking all items according his or her estimated ratings in descending order: ˆπ u = π u where π u is the item permutation in descending order of the target user u s predicted ratings. However, for ranking aggregation based collaborative filtering algorithm, the estimated permutation ˆπ u of the target user u are generated by the items permutations of his or her neighbors: ˆπ u = f(π v1,π v2,...,π vk ) v N u where k = N u, π v is the permutation which ranks all items according previous estimated ratings of user v in descending order, function f is the ranking aggregation algorithm, for example, Borda count method in our algorithm which will be discussed later. Borda count [9] is originally an election method. In the election, each voter ranks all candidates in order of preference. Borda count gives each candidate a score according to his or her rank in each ballot. The candidate who gains the highest score wins the election. In the following part of this section, we will give a formal description of how Borda count works in our algorithm. Noting that Borda count method requires each voter to rank all the candidates, the sparse rating matrix has to be filled. The unknown ratings are predicted by item collaborative filtering: j N ˆr u,i = i I u s i,j r u,j (3) j N i I u s i,j where ˆr u,i is the estimated rating which user u will give to item i, r u,j is the actual rating which user u gives to item j, s i,j is the similarity between item i and item j, N i is the neighborhood of item i and I u is the item set rated by user i. Though user-item rating matrix is filled using item collaborative filtering, there could still be unpredicted ratings in the rating matrix, because the rated item set of the target user s neighborhood may not cover all the items: I v I v N u We adopt a simple strategy to deal with these unpredicted ratings by filling them by 0 values. The next step is aggregating neighbors permutations which derived from rating matrix and generate the estimated permutation of the target user. Inspired by Borda count method, we apply a scoring function on each item according to its ranking in the permutation. Then, for each item, we adds up the all the similarity weighted scores which the item gains among all neighbors permutations: Γ u (i) = v N u s u,v score u (i) (4) where Γ u (i) is the similarity weighted score sum of item i for the target user u, s u,v is the similarity between user u and user v. The most common formula of the scoring function [13] is linear function score u (i) =n π u (i)+1 (5) where n is the total number of items, π v (i) is the ranking of item i in permutation π v. Our algorithm uses this scoring function as well. The estimated permutation ˆπ u for user u is generated by sorting the similarity weighted score sum Γ u (i) of each item i in descending order. Other variants of scoring function formula could also be applied. For example, the General Election in Nauru held on June 19, 2010 used reciprocal function formula; the election of the most valuable NBA player uses piecewise function formula. We will further compare and discuss the selection of scoring function formula in section IV.

4 D. Modified Borda count based collaborative filtering Although Borda count based collaborative filtering performs a good result, it still needs a rating prediction step to alleviate data sparsity problem. In this section, we describe a pure ranking-oriented collaborative filtering algorithm which applies modified Borda count method directly on the original sparse rating matrix without any rating prediction. Modified Borda count method does not require users to rank all items. It allows that each user could only rank his or her top-k favorite items, where 1 k I. In this section, π u is represented as the partial permutation of user u on items: π u : I {1, 2,...,k u } where k u is the total number of items which user u have actually ranked. The scoring function needs to be modified to score u (i) =k u π u (i) (6) where π u (i) is the rank position of item i in the partial permutation π u. Substitute the scoring function into equation (4), we can obtain the similarity weighted score sum as the following equation: Γ u (i) = s u,v (k u π u (i)) (7) v N u Our ranking-oriented collaborative filtering algorithm generates the estimated permutation ˆπ u of the target user u by sorting the score sum Γ u (i) in descending order. IV. EXPERIMENTS We use MovieLens data set [6] to evaluate the proposed method. This data set consists of 100,000 ratings by 943 users on 1682 movies. The rating ranges from 1 to 5, where 1 means the worst and 5 means the best. Each user has rated at least 20 movies. The data set also contains basic information of users and items, such as user gender, movie release date. A. Comparison with other algorithms We perform a 5-fold cross validation to evaluate BordaRank and modified BordaRank algorithm. The data is equally split into five disjoint sets, from u1 to u5. Each set is used for test data while others are used for training data. We also implement user collaborative filtering and item collaborative filtering as baseline algorithms. In the evaluation, a user gives an item rating larger than 3, i.e. the user rates 4 or 5, is regarded as the user likes the item. F 1 score and coverage are used to measure all the algorithms and the results are shown in table I and table II. According to the result, BordaRank has a higher F 1 score than item collaborative filtering, which indicates that ranking aggregation in BordaRank could have a better recommendation effort than simply sorting the item ratings of the target user. The modified BordaRank performs a better result than other rating prediction based algorithms, which indicates that ranking-oriented collaborative filtering can capture user preference on items more precisely. However, the item coverages of BordaRank and modified Borda Rank are lower than the coverage of item collaborative filtering BordaRank modified BordaRank Fig. 1. F1 score of BordaRank and modified BordaRank 0.10 BordaRank modified BordaRank Fig. 2. Coverage of BordaRank and modified BordaRank B. Impact of neighborhood size The item coverage of a recommender system indicates how many items are able to be recommended by the system. In collaborative filtering, the item coverage is mainly affected by the neighborhood size. We compare the impact of different neighborhood size on BordaRank and modified BordaRank. The result is shown in figure 1 and figure 2. In figure 1, the F1 score of both BordaRank and modified BordaRank increases with neighborhood size increases. The F1 score tends to converge when neighborhood size is larger than 50. In figure 2, the coverage of BordaRank and modified BordaRank decreases with neighborhood size increases. The coverage decreases sharply while neighborhood size is smaller than 30.

5 TABLE I THE F1 SCORE OF EACH METHOD IN 5-FOLD CROSS VALIDATION UserCF ItemCF BordaRank modified BordaRank TABLE II THE ITEM COVERAGE OF EACH METHOD IN 5-FOLD CROSS VALIDATION UserCF ItemCF BordaRank modified BordaRank C. Impact of different scoring functions We designed four different scoring functions for Borda count based collaborative filtering: linear score u (i) =n π u (i)+1 reciprocal score u (i) = 1 π u (i) logarithmic score u (i) =log 2 (n π u (i)+1) polynomial score u (i) =(n +1) 2 2(n +1)π u (i)+π u (i) 2 The results of the methods using these scoring functions are shown in table III, table IV and table V. According to the result, the logarithmic scoring function performs better than linear and reciprocal scoring functions, though the latter are more common. We further implements and evaluates modified BordaRank using the logarithmic scoring function. In the pure ranking-oriented collaborative filtering, the precision and recall of modified BordaRank using logarithmic scoring function is lower than using linear scoring function, while the coverage of modified BordaRank using logarithmic scoring function is better. V. CONCLUSIONS AND FUTURE WORK In this paper, we proposed a new ranking-oriented collaborative filtering framework based on ranking aggregation methods. Differs from traditional rating-oriented collaborative filtering and ranking generation based collaborative filtering, our algorithm generates item recommendations from item rankings which are given by the neighbors of the target user. In the proposed framework, we use Borda count method as ranking aggregation method and provide BordaRank algorithm. BordaRank uses rating predict results from item collaborative filtering method and aggregates neighborhoods rankings to give item recommendations. Experimental results shows BordaRank improves precision and recall of item collaborative filtering method on a considerable scale. We further proposed modified BordaRank which is a pure ranking-oriented collaborative filtering method. The modified BordaRank could be directly applied on the sparse rating matrix. Finally, we evaluated both methods on MovieLens data set and compared the impact of different neighborhood sizes and different selections of scoring function. Both BordaRank and modified BordaRank have higher precision and recall than other baseline methods. For future work, we would like to investigate different ranking aggregation methods for our ranking aggregation based collaborative filtering framework. We would also like to apply machine learning algorithms to optimize the parameters in the methods. REFERENCES [1] P. Resnick, N. Iacovou, M. Suchak, P. Bergstrom, and J. Riedl, GroupLens : An Open Architecture for Collaborative Filtering of Netnews, Proceedings of the 1994 ACM conference on Computer supported cooperative work, pp , [2] J. L. Herlocker, J. A. Konstan, L. G. Terveen, and J. T. Riedl, Evaluating collaborative filtering recommender systems, ACM Transactions on Information Systems (TOIS), vol. 22, no. 1, pp. 5 53, [Online]. Available: [3] A. Paterek, Improving regularized singular value decomposition for collaborative filtering, in Proceedings of KDD cup and workshop, 2007, pp [4] Y. Liu, Q. Tong, Z. Du, and L. Hu, Content-Boosted Restricted Boltzmann Machine for Recommendation, in Artificial Neural Networks and Machine Learning ICANN 2014, S. Wermter, C. Weber, W. Duch, T. Honkela, P. Koprinkova-Hristova, S. Magg, G. Palm, and A. Villa, Eds. Springer International Publishing, 2014, pp [Online]. Available: 97 [5] N. N. Liu and Q. Yang, EigenRank, in Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR 08, 2008, p. 83. [Online]. Available: s &partnerid=tzotx3y1 [6] F. M. Harper and J. A. Konstan, The MovieLens Datasets: History and Context, ACM Trans. Interact. Intell. Syst., vol. 5, no. 4, pp. 19:1-19:19, [Online]. Available: [7] M. Weimer, A. Karatzoglou, M. Bruch, Q. V. Le, and A. Smola, CofiRank Maximum Margin Matrix Factorization for Collaborative Ranking, Advances in neural information processing systems, pp. 1 3, [Online]. Available: [8] H. Wu, Y. Hua, B. Li, and Y. Pei, Personalized Recommendation via Rank Aggregation in Social Tagging Systems, in Proceedings of the 10th International Conference on Fuzzy Systems and Knowledge Discovery. Shenyang: IEEE, 2013, pp [9] M. Dummett, The Borda count and agenda manipulation, Social Choice and Welfare, vol. 15, no. 2, pp , 1998.

6 TABLE III THE PRECISION OF METHODS USING DIFFERENT SCORING FUNCTION linear reciprocal logarithmic polynomial modified linear modified logarithmic TABLE IV THE RECALL OF METHODS USING DIFFERENT SCORING FUNCTION linear reciprocal logarithmic polynomial modified linear modified logarithmic TABLE V THE COVERAGE OF METHODS USING DIFFERENT SCORING FUNCTION linear reciprocal logarithmic polynomial modified linear modified logarithmic [10] J. D. Lafferty, G. Lebanon, and J. D. Lafferty, Cranking: Combining Rankings Using Conditional Probability Models on Permutations, in Proceedings of the Nineteenth International Conference on Machine Learning, ser. ICML 02. San Francisco, CA, USA: Morgan Kaufmann Publishers Inc., 2002, pp [Online]. Available: [11] S. Wang, J. Sun, B. J. Gao, and J. Ma, VSRank: A Novel Framework for Ranking-Based Collaborative Filtering, ACM Transactions on Intelligent Systems and Technology (TIST), vol. 5, no. 3, pp. 51:1-51:24, [Online]. Available: [12] J. Herlocker, J. Konstan, and J. Riedl, An Empirical Analysis of Design Choices in Neighborhood-Based Collaborative Filtering Algroithms, Information retrieval, pp , [Online]. Available: [13] P. Emerson, The original Borda count and partial voting, Social Choice and Welfare, vol. 40, no. 2, pp , 2013.

Performance Comparison of Algorithms for Movie Rating Estimation

Performance Comparison of Algorithms for Movie Rating Estimation Performance Comparison of Algorithms for Movie Rating Estimation Alper Köse, Can Kanbak, Noyan Evirgen Research Laboratory of Electronics, Massachusetts Institute of Technology Department of Electrical

More information

Improving the Accuracy of Top-N Recommendation using a Preference Model

Improving the Accuracy of Top-N Recommendation using a Preference Model Improving the Accuracy of Top-N Recommendation using a Preference Model Jongwuk Lee a, Dongwon Lee b,, Yeon-Chang Lee c, Won-Seok Hwang c, Sang-Wook Kim c a Hankuk University of Foreign Studies, Republic

More information

EigenRank: A Ranking-Oriented Approach to Collaborative Filtering

EigenRank: A Ranking-Oriented Approach to Collaborative Filtering EigenRank: A Ranking-Oriented Approach to Collaborative Filtering ABSTRACT Nathan N. Liu Department of Computer Science and Engineering Hong Kong University of Science and Technology, Hong Kong, China

More information

Collaborative Filtering based on User Trends

Collaborative Filtering based on User Trends Collaborative Filtering based on User Trends Panagiotis Symeonidis, Alexandros Nanopoulos, Apostolos Papadopoulos, and Yannis Manolopoulos Aristotle University, Department of Informatics, Thessalonii 54124,

More information

The Principle and Improvement of the Algorithm of Matrix Factorization Model based on ALS

The Principle and Improvement of the Algorithm of Matrix Factorization Model based on ALS of the Algorithm of Matrix Factorization Model based on ALS 12 Yunnan University, Kunming, 65000, China E-mail: shaw.xuan820@gmail.com Chao Yi 3 Yunnan University, Kunming, 65000, China E-mail: yichao@ynu.edu.cn

More information

A Time-based Recommender System using Implicit Feedback

A Time-based Recommender System using Implicit Feedback A Time-based Recommender System using Implicit Feedback T. Q. Lee Department of Mobile Internet Dongyang Technical College Seoul, Korea Abstract - Recommender systems provide personalized recommendations

More information

Comparison of Recommender System Algorithms focusing on the New-Item and User-Bias Problem

Comparison of Recommender System Algorithms focusing on the New-Item and User-Bias Problem Comparison of Recommender System Algorithms focusing on the New-Item and User-Bias Problem Stefan Hauger 1, Karen H. L. Tso 2, and Lars Schmidt-Thieme 2 1 Department of Computer Science, University of

More information

Extension Study on Item-Based P-Tree Collaborative Filtering Algorithm for Netflix Prize

Extension Study on Item-Based P-Tree Collaborative Filtering Algorithm for Netflix Prize Extension Study on Item-Based P-Tree Collaborative Filtering Algorithm for Netflix Prize Tingda Lu, Yan Wang, William Perrizo, Amal Perera, Gregory Wettstein Computer Science Department North Dakota State

More information

Collaborative Filtering using a Spreading Activation Approach

Collaborative Filtering using a Spreading Activation Approach Collaborative Filtering using a Spreading Activation Approach Josephine Griffith *, Colm O Riordan *, Humphrey Sorensen ** * Department of Information Technology, NUI, Galway ** Computer Science Department,

More information

Use of KNN for the Netflix Prize Ted Hong, Dimitris Tsamis Stanford University

Use of KNN for the Netflix Prize Ted Hong, Dimitris Tsamis Stanford University Use of KNN for the Netflix Prize Ted Hong, Dimitris Tsamis Stanford University {tedhong, dtsamis}@stanford.edu Abstract This paper analyzes the performance of various KNNs techniques as applied to the

More information

A Scalable, Accurate Hybrid Recommender System

A Scalable, Accurate Hybrid Recommender System A Scalable, Accurate Hybrid Recommender System Mustansar Ali Ghazanfar and Adam Prugel-Bennett School of Electronics and Computer Science University of Southampton Highfield Campus, SO17 1BJ, United Kingdom

More information

Additive Regression Applied to a Large-Scale Collaborative Filtering Problem

Additive Regression Applied to a Large-Scale Collaborative Filtering Problem Additive Regression Applied to a Large-Scale Collaborative Filtering Problem Eibe Frank 1 and Mark Hall 2 1 Department of Computer Science, University of Waikato, Hamilton, New Zealand eibe@cs.waikato.ac.nz

More information

Proposing a New Metric for Collaborative Filtering

Proposing a New Metric for Collaborative Filtering Journal of Software Engineering and Applications 2011 4 411-416 doi:10.4236/jsea.2011.47047 Published Online July 2011 (http://www.scip.org/journal/jsea) 411 Proposing a New Metric for Collaborative Filtering

More information

Robustness and Accuracy Tradeoffs for Recommender Systems Under Attack

Robustness and Accuracy Tradeoffs for Recommender Systems Under Attack Proceedings of the Twenty-Fifth International Florida Artificial Intelligence Research Society Conference Robustness and Accuracy Tradeoffs for Recommender Systems Under Attack Carlos E. Seminario and

More information

Hybrid Recommendation Models for Binary User Preference Prediction Problem

Hybrid Recommendation Models for Binary User Preference Prediction Problem JMLR: Workshop and Conference Proceedings 18:137 151, 2012 Proceedings of KDD-Cup 2011 competition Hybrid Recommation Models for Binary User Preference Prediction Problem Siwei Lai swlai@nlpr.ia.ac.cn

More information

Jeff Howbert Introduction to Machine Learning Winter

Jeff Howbert Introduction to Machine Learning Winter Collaborative Filtering Nearest es Neighbor Approach Jeff Howbert Introduction to Machine Learning Winter 2012 1 Bad news Netflix Prize data no longer available to public. Just after contest t ended d

More information

An Empirical Comparison of Collaborative Filtering Approaches on Netflix Data

An Empirical Comparison of Collaborative Filtering Approaches on Netflix Data An Empirical Comparison of Collaborative Filtering Approaches on Netflix Data Nicola Barbieri, Massimo Guarascio, Ettore Ritacco ICAR-CNR Via Pietro Bucci 41/c, Rende, Italy {barbieri,guarascio,ritacco}@icar.cnr.it

More information

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

CS246: Mining Massive Datasets Jure Leskovec, Stanford University CS6: Mining Massive Datasets Jure Leskovec, Stanford University http://cs6.stanford.edu /6/01 Jure Leskovec, Stanford C6: Mining Massive Datasets Training data 100 million ratings, 80,000 users, 17,770

More information

A Recommender System Based on Improvised K- Means Clustering Algorithm

A Recommender System Based on Improvised K- Means Clustering Algorithm A Recommender System Based on Improvised K- Means Clustering Algorithm Shivani Sharma Department of Computer Science and Applications, Kurukshetra University, Kurukshetra Shivanigaur83@yahoo.com Abstract:

More information

Content-based Dimensionality Reduction for Recommender Systems

Content-based Dimensionality Reduction for Recommender Systems Content-based Dimensionality Reduction for Recommender Systems Panagiotis Symeonidis Aristotle University, Department of Informatics, Thessaloniki 54124, Greece symeon@csd.auth.gr Abstract. Recommender

More information

amount of available information and the number of visitors to Web sites in recent years

amount of available information and the number of visitors to Web sites in recent years Collaboration Filtering using K-Mean Algorithm Smrity Gupta Smrity_0501@yahoo.co.in Department of computer Science and Engineering University of RAJIV GANDHI PROUDYOGIKI SHWAVIDYALAYA, BHOPAL Abstract:

More information

The Tourism Recommendation of Jingdezhen Based on Unifying User-based and Item-based Collaborative filtering

The Tourism Recommendation of Jingdezhen Based on Unifying User-based and Item-based Collaborative filtering The Tourism Recommendation of Jingdezhen Based on Unifying User-based and Item-based Collaborative filtering Tao Liu 1, a, Mingang Wu 1, b and Donglan Ying 2, c 1 School of Information Engineering, Jingdezhen

More information

Comparing State-of-the-Art Collaborative Filtering Systems

Comparing State-of-the-Art Collaborative Filtering Systems Comparing State-of-the-Art Collaborative Filtering Systems Laurent Candillier, Frank Meyer, Marc Boullé France Telecom R&D Lannion, France lcandillier@hotmail.com Abstract. Collaborative filtering aims

More information

A Constrained Spreading Activation Approach to Collaborative Filtering

A Constrained Spreading Activation Approach to Collaborative Filtering A Constrained Spreading Activation Approach to Collaborative Filtering Josephine Griffith 1, Colm O Riordan 1, and Humphrey Sorensen 2 1 Dept. of Information Technology, National University of Ireland,

More information

Recommendation Algorithms: Collaborative Filtering. CSE 6111 Presentation Advanced Algorithms Fall Presented by: Farzana Yasmeen

Recommendation Algorithms: Collaborative Filtering. CSE 6111 Presentation Advanced Algorithms Fall Presented by: Farzana Yasmeen Recommendation Algorithms: Collaborative Filtering CSE 6111 Presentation Advanced Algorithms Fall. 2013 Presented by: Farzana Yasmeen 2013.11.29 Contents What are recommendation algorithms? Recommendations

More information

Collaborative Filtering using Euclidean Distance in Recommendation Engine

Collaborative Filtering using Euclidean Distance in Recommendation Engine Indian Journal of Science and Technology, Vol 9(37), DOI: 10.17485/ijst/2016/v9i37/102074, October 2016 ISSN (Print) : 0974-6846 ISSN (Online) : 0974-5645 Collaborative Filtering using Euclidean Distance

More information

arxiv: v1 [cs.ir] 1 Jul 2016

arxiv: v1 [cs.ir] 1 Jul 2016 Memory Based Collaborative Filtering with Lucene arxiv:1607.00223v1 [cs.ir] 1 Jul 2016 Claudio Gennaro claudio.gennaro@isti.cnr.it ISTI-CNR, Pisa, Italy January 8, 2018 Abstract Memory Based Collaborative

More information

Assignment 5: Collaborative Filtering

Assignment 5: Collaborative Filtering Assignment 5: Collaborative Filtering Arash Vahdat Fall 2015 Readings You are highly recommended to check the following readings before/while doing this assignment: Slope One Algorithm: https://en.wikipedia.org/wiki/slope_one.

More information

Prowess Improvement of Accuracy for Moving Rating Recommendation System

Prowess Improvement of Accuracy for Moving Rating Recommendation System 2017 IJSRST Volume 3 Issue 1 Print ISSN: 2395-6011 Online ISSN: 2395-602X Themed Section: Scienceand Technology Prowess Improvement of Accuracy for Moving Rating Recommendation System P. Damodharan *1,

More information

New user profile learning for extremely sparse data sets

New user profile learning for extremely sparse data sets New user profile learning for extremely sparse data sets Tomasz Hoffmann, Tadeusz Janasiewicz, and Andrzej Szwabe Institute of Control and Information Engineering, Poznan University of Technology, pl.

More information

Hotel Recommendation Based on Hybrid Model

Hotel Recommendation Based on Hybrid Model Hotel Recommendation Based on Hybrid Model Jing WANG, Jiajun SUN, Zhendong LIN Abstract: This project develops a hybrid model that combines content-based with collaborative filtering (CF) for hotel recommendation.

More information

Progress Report: Collaborative Filtering Using Bregman Co-clustering

Progress Report: Collaborative Filtering Using Bregman Co-clustering Progress Report: Collaborative Filtering Using Bregman Co-clustering Wei Tang, Srivatsan Ramanujam, and Andrew Dreher April 4, 2008 1 Introduction Analytics are becoming increasingly important for business

More information

Thanks to Jure Leskovec, Anand Rajaraman, Jeff Ullman

Thanks to Jure Leskovec, Anand Rajaraman, Jeff Ullman Thanks to Jure Leskovec, Anand Rajaraman, Jeff Ullman http://www.mmds.org Overview of Recommender Systems Content-based Systems Collaborative Filtering J. Leskovec, A. Rajaraman, J. Ullman: Mining of Massive

More information

A Recursive Prediction Algorithm for Collaborative Filtering Recommender Systems

A Recursive Prediction Algorithm for Collaborative Filtering Recommender Systems A Recursive rediction Algorithm for Collaborative Filtering Recommender Systems ABSTRACT Jiyong Zhang Human Computer Interaction Group, Swiss Federal Institute of Technology (EFL), CH-1015, Lausanne, Switzerland

More information

QoS Management of Web Services

QoS Management of Web Services QoS Management of Web Services Zibin Zheng (Ben) Supervisor: Prof. Michael R. Lyu Department of Computer Science & Engineering The Chinese University of Hong Kong Dec. 10, 2010 Outline Introduction Web

More information

Movie Recommender System - Hybrid Filtering Approach

Movie Recommender System - Hybrid Filtering Approach Chapter 7 Movie Recommender System - Hybrid Filtering Approach Recommender System can be built using approaches like: (i) Collaborative Filtering (ii) Content Based Filtering and (iii) Hybrid Filtering.

More information

Using Social Networks to Improve Movie Rating Predictions

Using Social Networks to Improve Movie Rating Predictions Introduction Using Social Networks to Improve Movie Rating Predictions Suhaas Prasad Recommender systems based on collaborative filtering techniques have become a large area of interest ever since the

More information

How to Get Endorsements? Predicting Facebook Likes Using Post Content and User Engagement

How to Get Endorsements? Predicting Facebook Likes Using Post Content and User Engagement How to Get Endorsements? Predicting Facebook Likes Using Post Content and User Engagement Wei-Fan Chen, Yi-Pei Chen and Lun-Wei Ku Institute of Information Science, Academia Sinica, Taipei, Taiwan {viericwf,ypc82,lwku}@iis.sinica.edu.tw

More information

Project Report. An Introduction to Collaborative Filtering

Project Report. An Introduction to Collaborative Filtering Project Report An Introduction to Collaborative Filtering Siobhán Grayson 12254530 COMP30030 School of Computer Science and Informatics College of Engineering, Mathematical & Physical Sciences University

More information

Michele Gorgoglione Politecnico di Bari Viale Japigia, Bari (Italy)

Michele Gorgoglione Politecnico di Bari Viale Japigia, Bari (Italy) Does the recommendation task affect a CARS performance? Umberto Panniello Politecnico di Bari Viale Japigia, 82 726 Bari (Italy) +3985962765 m.gorgoglione@poliba.it Michele Gorgoglione Politecnico di Bari

More information

A Constrained Spreading Activation Approach to Collaborative Filtering

A Constrained Spreading Activation Approach to Collaborative Filtering A Constrained Spreading Activation Approach to Collaborative Filtering Josephine Griffith 1, Colm O Riordan 1, and Humphrey Sorensen 2 1 Dept. of Information Technology, National University of Ireland,

More information

Data Mining Techniques

Data Mining Techniques Data Mining Techniques CS 60 - Section - Fall 06 Lecture Jan-Willem van de Meent (credit: Andrew Ng, Alex Smola, Yehuda Koren, Stanford CS6) Recommender Systems The Long Tail (from: https://www.wired.com/00/0/tail/)

More information

Part 11: Collaborative Filtering. Francesco Ricci

Part 11: Collaborative Filtering. Francesco Ricci Part : Collaborative Filtering Francesco Ricci Content An example of a Collaborative Filtering system: MovieLens The collaborative filtering method n Similarity of users n Methods for building the rating

More information

Recommender Systems New Approaches with Netflix Dataset

Recommender Systems New Approaches with Netflix Dataset Recommender Systems New Approaches with Netflix Dataset Robert Bell Yehuda Koren AT&T Labs ICDM 2007 Presented by Matt Rodriguez Outline Overview of Recommender System Approaches which are Content based

More information

Learning Bidirectional Similarity for Collaborative Filtering

Learning Bidirectional Similarity for Collaborative Filtering Learning Bidirectional Similarity for Collaborative Filtering Bin Cao 1, Jian-Tao Sun 2, Jianmin Wu 2, Qiang Yang 1, and Zheng Chen 2 1 The Hong Kong University of Science and Technology, Hong Kong {caobin,

More information

Recommendation Systems

Recommendation Systems Recommendation Systems CS 534: Machine Learning Slides adapted from Alex Smola, Jure Leskovec, Anand Rajaraman, Jeff Ullman, Lester Mackey, Dietmar Jannach, and Gerhard Friedrich Recommender Systems (RecSys)

More information

Know your neighbours: Machine Learning on Graphs

Know your neighbours: Machine Learning on Graphs Know your neighbours: Machine Learning on Graphs Andrew Docherty Senior Research Engineer andrew.docherty@data61.csiro.au www.data61.csiro.au 2 Graphs are Everywhere Online Social Networks Transportation

More information

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

CS246: Mining Massive Datasets Jure Leskovec, Stanford University CS6: Mining Massive Datasets Jure Leskovec, Stanford University http://cs6.stanford.edu Customer X Buys Metalica CD Buys Megadeth CD Customer Y Does search on Metalica Recommender system suggests Megadeth

More information

Collaborative Filtering and Recommender Systems. Definitions. .. Spring 2009 CSC 466: Knowledge Discovery from Data Alexander Dekhtyar..

Collaborative Filtering and Recommender Systems. Definitions. .. Spring 2009 CSC 466: Knowledge Discovery from Data Alexander Dekhtyar.. .. Spring 2009 CSC 466: Knowledge Discovery from Data Alexander Dekhtyar.. Collaborative Filtering and Recommender Systems Definitions Recommendation generation problem. Given a set of users and their

More information

Part 11: Collaborative Filtering. Francesco Ricci

Part 11: Collaborative Filtering. Francesco Ricci Part : Collaborative Filtering Francesco Ricci Content An example of a Collaborative Filtering system: MovieLens The collaborative filtering method n Similarity of users n Methods for building the rating

More information

Towards Time-Aware Semantic enriched Recommender Systems for movies

Towards Time-Aware Semantic enriched Recommender Systems for movies Towards Time-Aware Semantic enriched Recommender Systems for movies Marko Harasic, Pierre Ahrendt,Alexandru Todor, Adrian Paschke FU Berlin Abstract. With the World Wide Web moving from passive to active,

More information

Using Data Mining to Determine User-Specific Movie Ratings

Using Data Mining to Determine User-Specific Movie Ratings Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology ISSN 2320 088X IMPACT FACTOR: 6.017 IJCSMC,

More information

Recommender System Optimization through Collaborative Filtering

Recommender System Optimization through Collaborative Filtering Recommender System Optimization through Collaborative Filtering L.W. Hoogenboom Econometric Institute of Erasmus University Rotterdam Bachelor Thesis Business Analytics and Quantitative Marketing July

More information

A PERSONALIZED RECOMMENDER SYSTEM FOR TELECOM PRODUCTS AND SERVICES

A PERSONALIZED RECOMMENDER SYSTEM FOR TELECOM PRODUCTS AND SERVICES A PERSONALIZED RECOMMENDER SYSTEM FOR TELECOM PRODUCTS AND SERVICES Zui Zhang, Kun Liu, William Wang, Tai Zhang and Jie Lu Decision Systems & e-service Intelligence Lab, Centre for Quantum Computation

More information

A probabilistic model to resolve diversity-accuracy challenge of recommendation systems

A probabilistic model to resolve diversity-accuracy challenge of recommendation systems A probabilistic model to resolve diversity-accuracy challenge of recommendation systems AMIN JAVARI MAHDI JALILI 1 Received: 17 Mar 2013 / Revised: 19 May 2014 / Accepted: 30 Jun 2014 Recommendation systems

More information

Survey on Collaborative Filtering Technique in Recommendation System

Survey on Collaborative Filtering Technique in Recommendation System Survey on Collaborative Filtering Technique in Recommendation System Omkar S. Revankar, Dr.Mrs. Y.V.Haribhakta Department of Computer Engineering, College of Engineering Pune, Pune, India ABSTRACT This

More information

Recommender System. What is it? How to build it? Challenges. R package: recommenderlab

Recommender System. What is it? How to build it? Challenges. R package: recommenderlab Recommender System What is it? How to build it? Challenges R package: recommenderlab 1 What is a recommender system Wiki definition: A recommender system or a recommendation system (sometimes replacing

More information

Recommender Systems: User Experience and System Issues

Recommender Systems: User Experience and System Issues Recommender Systems: User Experience and System ssues Joseph A. Konstan University of Minnesota konstan@cs.umn.edu http://www.grouplens.org Summer 2005 1 About me Professor of Computer Science & Engineering,

More information

Recommendation System Using Yelp Data CS 229 Machine Learning Jia Le Xu, Yingran Xu

Recommendation System Using Yelp Data CS 229 Machine Learning Jia Le Xu, Yingran Xu Recommendation System Using Yelp Data CS 229 Machine Learning Jia Le Xu, Yingran Xu 1 Introduction Yelp Dataset Challenge provides a large number of user, business and review data which can be used for

More information

Introduction to Data Mining

Introduction to Data Mining Introduction to Data Mining Lecture #7: Recommendation Content based & Collaborative Filtering Seoul National University In This Lecture Understand the motivation and the problem of recommendation Compare

More information

Collaborative Filtering: A Comparison of Graph-Based Semi-Supervised Learning Methods and Memory-Based Methods

Collaborative Filtering: A Comparison of Graph-Based Semi-Supervised Learning Methods and Memory-Based Methods 70 Computer Science 8 Collaborative Filtering: A Comparison of Graph-Based Semi-Supervised Learning Methods and Memory-Based Methods Rasna R. Walia Collaborative filtering is a method of making predictions

More information

Reddit Recommendation System Daniel Poon, Yu Wu, David (Qifan) Zhang CS229, Stanford University December 11 th, 2011

Reddit Recommendation System Daniel Poon, Yu Wu, David (Qifan) Zhang CS229, Stanford University December 11 th, 2011 Reddit Recommendation System Daniel Poon, Yu Wu, David (Qifan) Zhang CS229, Stanford University December 11 th, 2011 1. Introduction Reddit is one of the most popular online social news websites with millions

More information

SHILLING ATTACK DETECTION IN RECOMMENDER SYSTEMS USING CLASSIFICATION TECHNIQUES

SHILLING ATTACK DETECTION IN RECOMMENDER SYSTEMS USING CLASSIFICATION TECHNIQUES SHILLING ATTACK DETECTION IN RECOMMENDER SYSTEMS USING CLASSIFICATION TECHNIQUES Parneet Kaur Department of Computer Science & Engg. Thapar University, Patiala, Punjab, India Abstract Collaborative filtering

More information

A Survey on Various Techniques of Recommendation System in Web Mining

A Survey on Various Techniques of Recommendation System in Web Mining A Survey on Various Techniques of Recommendation System in Web Mining 1 Yagnesh G. patel, 2 Vishal P.Patel 1 Department of computer engineering 1 S.P.C.E, Visnagar, India Abstract - Today internet has

More information

Collaborative Filtering Applied to Educational Data Mining

Collaborative Filtering Applied to Educational Data Mining Collaborative Filtering Applied to Educational Data Mining KDD Cup 200 July 25 th, 200 BigChaos @ KDD Team Dataset Solution Overview Michael Jahrer, Andreas Töscher from commendo research Dataset Team

More information

Collaborative Filtering using Weighted BiPartite Graph Projection A Recommendation System for Yelp

Collaborative Filtering using Weighted BiPartite Graph Projection A Recommendation System for Yelp Collaborative Filtering using Weighted BiPartite Graph Projection A Recommendation System for Yelp Sumedh Sawant sumedh@stanford.edu Team 38 December 10, 2013 Abstract We implement a personal recommendation

More information

Data Mining Techniques

Data Mining Techniques Data Mining Techniques CS 6 - Section - Spring 7 Lecture Jan-Willem van de Meent (credit: Andrew Ng, Alex Smola, Yehuda Koren, Stanford CS6) Project Project Deadlines Feb: Form teams of - people 7 Feb:

More information

Alleviating the Sparsity Problem in Collaborative Filtering by using an Adapted Distance and a Graph-based Method

Alleviating the Sparsity Problem in Collaborative Filtering by using an Adapted Distance and a Graph-based Method Alleviating the Sparsity Problem in Collaborative Filtering by using an Adapted Distance and a Graph-based Method Beau Piccart, Jan Struyf, Hendrik Blockeel Abstract Collaborative filtering (CF) is the

More information

CptS 570 Machine Learning Project: Netflix Competition. Parisa Rashidi Vikramaditya Jakkula. Team: MLSurvivors. Wednesday, December 12, 2007

CptS 570 Machine Learning Project: Netflix Competition. Parisa Rashidi Vikramaditya Jakkula. Team: MLSurvivors. Wednesday, December 12, 2007 CptS 570 Machine Learning Project: Netflix Competition Team: MLSurvivors Parisa Rashidi Vikramaditya Jakkula Wednesday, December 12, 2007 Introduction In current report, we describe our efforts put forth

More information

Variational Bayesian PCA versus k-nn on a Very Sparse Reddit Voting Dataset

Variational Bayesian PCA versus k-nn on a Very Sparse Reddit Voting Dataset Variational Bayesian PCA versus k-nn on a Very Sparse Reddit Voting Dataset Jussa Klapuri, Ilari Nieminen, Tapani Raiko, and Krista Lagus Department of Information and Computer Science, Aalto University,

More information

Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman Stanford University Infinite data. Filtering data streams

Mining of Massive Datasets Jure Leskovec, Anand Rajaraman, Jeff Ullman Stanford University  Infinite data. Filtering data streams /9/7 Note to other teachers and users of these slides: We would be delighted if you found this our material useful in giving your own lectures. Feel free to use these slides verbatim, or to modify them

More information

NLMF: NonLinear Matrix Factorization Methods for Top-N Recommender Systems

NLMF: NonLinear Matrix Factorization Methods for Top-N Recommender Systems 1 NLMF: NonLinear Matrix Factorization Methods for Top-N Recommender Systems Santosh Kabbur and George Karypis Department of Computer Science, University of Minnesota Twin Cities, USA {skabbur,karypis}@cs.umn.edu

More information

COMP 465: Data Mining Recommender Systems

COMP 465: Data Mining Recommender Systems //0 movies COMP 6: Data Mining Recommender Systems Slides Adapted From: www.mmds.org (Mining Massive Datasets) movies Compare predictions with known ratings (test set T)????? Test Data Set Root-mean-square

More information

Reproducing and Prototyping Recommender Systems in R

Reproducing and Prototyping Recommender Systems in R Reproducing and Prototyping Recommender Systems in R Ludovik Çoba, Panagiotis Symeonidis, Markus Zanker Free University of Bozen-Bolzano, 39100, Bozen-Bolzano, Italy {lucoba,psymeonidis,markus.zanker}@unibz.it

More information

International Journal of Advanced Research in. Computer Science and Engineering

International Journal of Advanced Research in. Computer Science and Engineering Volume 4, Issue 1, January 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Design and Architecture

More information

Learning to Match. Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li

Learning to Match. Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li Learning to Match Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li 1. Introduction The main tasks in many applications can be formalized as matching between heterogeneous objects, including search, recommendation,

More information

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

CS246: Mining Massive Datasets Jure Leskovec, Stanford University CS6: Mining Massive Datasets Jure Leskovec, Stanford University http://cs6.stanford.edu Training data 00 million ratings, 80,000 users, 7,770 movies 6 years of data: 000 00 Test data Last few ratings of

More information

Performance of Recommender Algorithms on Top-N Recommendation Tasks

Performance of Recommender Algorithms on Top-N Recommendation Tasks Performance of Recommender Algorithms on Top- Recommendation Tasks Paolo Cremonesi Politecnico di Milano Milan, Italy paolo.cremonesi@polimi.it Yehuda Koren Yahoo! Research Haifa, Israel yehuda@yahoo-inc.com

More information

Grey forecast model for accurate recommendation in presence of data sparsity and correlation

Grey forecast model for accurate recommendation in presence of data sparsity and correlation Grey forecast model for accurate recommendation in presence of data sparsity and correlation Feng Xie a,b,, Zhen Chen b,c, Jiaxing Shang a, Geoffrey C. Fox d a Department of Automation, Tsinghua University

More information

Novel Boosting Frameworks to Improve the Performance of Collaborative Filtering

Novel Boosting Frameworks to Improve the Performance of Collaborative Filtering JMLR: Workshop and Conference Proceedings 29:87 99, 2013 ACML 2013 Novel Boosting Frameworks to Improve the Performance of Collaborative Filtering Xiaotian Jiang Zhendong Niu Jiamin Guo Ghulam Mustafa

More information

Two Collaborative Filtering Recommender Systems Based on Sparse Dictionary Coding

Two Collaborative Filtering Recommender Systems Based on Sparse Dictionary Coding Under consideration for publication in Knowledge and Information Systems Two Collaborative Filtering Recommender Systems Based on Dictionary Coding Ismail E. Kartoglu 1, Michael W. Spratling 1 1 Department

More information

On hybrid modular recommendation systems for video streaming

On hybrid modular recommendation systems for video streaming On hybrid modular recommendation systems for video streaming Evripides Tzamousis Maria Papadopouli arxiv:1901.01418v1 [cs.ir] 5 Jan 2019 Abstract The technological advances in networking, mobile computing,

More information

Improving Results and Performance of Collaborative Filtering-based Recommender Systems using Cuckoo Optimization Algorithm

Improving Results and Performance of Collaborative Filtering-based Recommender Systems using Cuckoo Optimization Algorithm Improving Results and Performance of Collaborative Filtering-based Recommender Systems using Cuckoo Optimization Algorithm Majid Hatami Faculty of Electrical and Computer Engineering University of Tabriz,

More information

Knowledge Discovery and Data Mining 1 (VO) ( )

Knowledge Discovery and Data Mining 1 (VO) ( ) Knowledge Discovery and Data Mining 1 (VO) (707.003) Data Matrices and Vector Space Model Denis Helic KTI, TU Graz Nov 6, 2014 Denis Helic (KTI, TU Graz) KDDM1 Nov 6, 2014 1 / 55 Big picture: KDDM Probability

More information

Top-N Recommendations from Implicit Feedback Leveraging Linked Open Data

Top-N Recommendations from Implicit Feedback Leveraging Linked Open Data Top-N Recommendations from Implicit Feedback Leveraging Linked Open Data Vito Claudio Ostuni, Tommaso Di Noia, Roberto Mirizzi, Eugenio Di Sciascio Polytechnic University of Bari, Italy {ostuni,mirizzi}@deemail.poliba.it,

More information

Clustering and Dimensionality Reduction. Stony Brook University CSE545, Fall 2017

Clustering and Dimensionality Reduction. Stony Brook University CSE545, Fall 2017 Clustering and Dimensionality Reduction Stony Brook University CSE545, Fall 2017 Goal: Generalize to new data Model New Data? Original Data Does the model accurately reflect new data? Supervised vs. Unsupervised

More information

Feature-weighted User Model for Recommender Systems

Feature-weighted User Model for Recommender Systems Feature-weighted User Model for Recommender Systems Panagiotis Symeonidis, Alexandros Nanopoulos, and Yannis Manolopoulos Aristotle University, Department of Informatics, Thessaloniki 54124, Greece {symeon,

More information

Slope One Predictors for Online Rating-Based Collaborative Filtering

Slope One Predictors for Online Rating-Based Collaborative Filtering Slope One Predictors for Online Rating-Based Collaborative Filtering Daniel Lemire Anna Maclachlan February 7, 2005 Abstract Rating-based collaborative filtering is the process of predicting how a user

More information

The Comparative Study of Machine Learning Algorithms in Text Data Classification*

The Comparative Study of Machine Learning Algorithms in Text Data Classification* The Comparative Study of Machine Learning Algorithms in Text Data Classification* Wang Xin School of Science, Beijing Information Science and Technology University Beijing, China Abstract Classification

More information

General Instructions. Questions

General Instructions. Questions CS246: Mining Massive Data Sets Winter 2018 Problem Set 2 Due 11:59pm February 8, 2018 Only one late period is allowed for this homework (11:59pm 2/13). General Instructions Submission instructions: These

More information

Application of Dimensionality Reduction in Recommender System -- A Case Study

Application of Dimensionality Reduction in Recommender System -- A Case Study Application of Dimensionality Reduction in Recommender System -- A Case Study Badrul M. Sarwar, George Karypis, Joseph A. Konstan, John T. Riedl Department of Computer Science and Engineering / Army HPC

More information

Yelp Recommendation System

Yelp Recommendation System Yelp Recommendation System Jason Ting, Swaroop Indra Ramaswamy Institute for Computational and Mathematical Engineering Abstract We apply principles and techniques of recommendation systems to develop

More information

Influence in Ratings-Based Recommender Systems: An Algorithm-Independent Approach

Influence in Ratings-Based Recommender Systems: An Algorithm-Independent Approach Influence in Ratings-Based Recommender Systems: An Algorithm-Independent Approach Al Mamunur Rashid George Karypis John Riedl Abstract Recommender systems have been shown to help users find items of interest

More information

Feature Selection Using Modified-MCA Based Scoring Metric for Classification

Feature Selection Using Modified-MCA Based Scoring Metric for Classification 2011 International Conference on Information Communication and Management IPCSIT vol.16 (2011) (2011) IACSIT Press, Singapore Feature Selection Using Modified-MCA Based Scoring Metric for Classification

More information

Collaborative Filtering via Euclidean Embedding

Collaborative Filtering via Euclidean Embedding Collaborative Filtering via Euclidean Embedding Mohammad Khoshneshin Management Sciences Department University of Iowa Iowa City, IA 52242 USA mohammad-khoshneshin@uiowa.edu W. Nick Street Management Sciences

More information

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

CS246: Mining Massive Datasets Jure Leskovec, Stanford University CS6: Mining Massive Datasets Jure Leskovec, Stanford University http://cs6.stanford.edu //8 Jure Leskovec, Stanford CS6: Mining Massive Datasets High dim. data Graph data Infinite data Machine learning

More information

Rating Prediction Using Preference Relations Based Matrix Factorization

Rating Prediction Using Preference Relations Based Matrix Factorization Rating Prediction Using Preference Relations Based Matrix Factorization Maunendra Sankar Desarkar and Sudeshna Sarkar Department of Computer Science and Engineering, Indian Institute of Technology Kharagpur,

More information

Community-Based Recommendations: a Solution to the Cold Start Problem

Community-Based Recommendations: a Solution to the Cold Start Problem Community-Based Recommendations: a Solution to the Cold Start Problem Shaghayegh Sahebi Intelligent Systems Program University of Pittsburgh sahebi@cs.pitt.edu William W. Cohen Machine Learning Department

More information

Detection of Obfuscated Attacks in Collaborative Recommender Systems 1

Detection of Obfuscated Attacks in Collaborative Recommender Systems 1 Detection of Obfuscated Attacks in Collaborative Recommender Systems 1 Chad Williams and Bamshad Mobasher and Robin Burke and Jeff Sandvig and Runa Bhaumik 2 Abstract. The vulnerability of collaborative

More information

KNOW At The Social Book Search Lab 2016 Suggestion Track

KNOW At The Social Book Search Lab 2016 Suggestion Track KNOW At The Social Book Search Lab 2016 Suggestion Track Hermann Ziak and Roman Kern Know-Center GmbH Inffeldgasse 13 8010 Graz, Austria hziak, rkern@know-center.at Abstract. Within this work represents

More information