Point-of-Interest Recommendation in Location- Based Social Networks with Personalized Geo-Social Influence

Size: px
Start display at page:

Download "Point-of-Interest Recommendation in Location- Based Social Networks with Personalized Geo-Social Influence"

Transcription

1 MOBILE INFORMATION SYSTEMS Point-of-Interest Recommendation in Location- Based Social Networks with Personalized Geo-Social Influence HUANG Liwei 1, MA Yutao 2,3,*, LIU Yanbo 1 1 Beijing Institute of Remote Sensing, Beijing , China 2 School of Computer, Wuhan University, Wuhan , China 3 WISET Automation Company Limited, Wuhan Iron and Steel Group Corporation, Wuhan , China Abstract: Point-of-interest (POI recommendation is a popular topic on location-based social networks (LBSNs. Geographical proximity, known as a unique feature of LBSNs, significantly affects user check-in behavior. However, most of prior studies characterize the geographical influence based on a universal or personalized distribution of geographic distance, leading to unsatisfactory recommendation results. In this paper, the personalized geographical influence in a two-dimensional geographical space is modeled using the data field method, and we propose a semi-supervised probabilistic model based on a factor graph model to integrate different factors such as the geographical influence. Moreover, a distributed learning algorithm is used to scale up our method to large-scale data sets. Experimental results based on the data sets from Foursquare and Gowalla show that our method outperforms other competing POI recommendation techniques. Keywords: point-of-interest recommendation; location-based social networks; geo-social influence; data field; factor graph model I. INTRODUCTION The wide use of mobile devices and loca- tion-based services (LBSs sparks the boom of location-based social networks (LBSNs, such as Foursquare and Gowalla, which can affect human activities in the physical and virtual worlds[1]. A point-of-interest (POI is a specific spatial location (such as a hotel that could be interesting or useful to someone else in a LBSN. A user s visits to a given POI, also known as check-in activities, by and large reflect his/her preference for the corresponding place in the real world. In recent years, POI recommendation has attracted increasing attention due to the importance of understanding human behavior and the potential business value related to that. Unlike the traditional recommender systems that focus on non-spatial items (such as books and movies, POI recommendation requires users to visit or check in locations in a LBSN. Geographical proximity has a great impact on user check-in behavior[2], that is, nearby friends always share more common POIs than the others who live far away[3]. However, human mobility behavior presents a challenge for the methods that make recommendations based on the distances between users places of residence. To address the issue, some researchers have attempted to recommend possible POIs to target users in 21

2 Recently, POI recommendation has emerged as a popular topic in the field of recommender systems. According to the data available on the Internet such as user check-in records, GPS trajectory data[13], and text data[14], the simplest way to provide a POI recommendation service is with those conventional recommendation techniques such as collaborative filtering. However, using only the information of user check-in records is not enough to asterms of the geographic distance between locations. Prior studies assume that the distances between POIs that a user visited before follow a universal or personalized distribution[1, 2, 4-6], and the probability of a user visiting a POI can be calculated according to the geographic distance between them. Unfortunately, these studies are subject to some limitations, e.g., the deficiency of location s intrinsic characteristics and the difficulty in finding a reasonable reference location[7]. Therefore, some recent studies began to model the geographical influence over two-dimensional geographic coordinates (latitude and longitude[7], aiming at characterizing user check-in behavior better. On the other hand, according to the social correlation[8] of social theories, human movement and mobility patterns are also affected by their social friendships, and a few recommender systems have utilized the social influence in conventional social networks to improve prediction performance[9]. For example, according to the result of our experiment conducted on the data sets which were collected from Foursquare[10] and Gowalla[5], although less than 10% of a user s check-ins are also visited by his/her friends, the probability of checking in the same POI for two friends is, on average, much higher than that for two strangers. To sum up, a unified framework integrating different factors seems to be a feasible solution to improve the quality of POI recommendation. How to model the personalized geographical influence in terms of geographic spatial distribution, and further characterize user check-in behavior better by integrating the social influence with the geographical influence, is the challenge we face. To address the issue, in this paper we propose an effective and scalable POI recommendation framework that returns the top K POIs with the highest scores to each target user. The main contributions of this paper are summarized as follows: (1 We model the personalized geographical influence on user check-in behavior in terms of geographic spatial distribution using the data field method[11], because it has the advantage in describing the non-contact inter- action between data objects. In particular, to alleviate the cold-start problem, we learn the effect of each location on individual users by considering both his/her historical check-in records and the geographic spatial distribution of all POIs. (2 We propose a unified and flexible recommendation framework based on a factor graph model[12] to integrate different factors (such as user preference and geo-social influence, which can automatically determine the degree of each factor s contribution by optimizing the model parameters. Besides, we propose a distributed learning algorithm to improve the scalability of our method when dealing with large-scale data sets. (3 According to the two large-scale real data sets collected from Foursquare and Gowalla, we conduct extensive experiments to evaluate the recommendation accuracy and scalability of our method. The experimental results show that our method achieves better performance than several state-of-the-art POI recommendation approaches. The remainder of this paper is organized as follows. Section II presents an overview of the related work, and Section III formalizes the POI recommendation problem. In section IV, we model the personalized two-dimensional geographical influence using the data field method; in Section V, we introduce a unified geo-social recommendation framework in detail; in Section VI, we discuss the experimental results on the two real-world data sets. Finally, Section VII summarizes this paper. II. RELATED WORK This paper presents a semi-supervised probabilistic model based on a factor graph model to integrate different factors. To recommend more appropriate POIs to target users, both the personalized geographical influence and the social influence on individual user s check-in behavior are considered. 22

3 sure accurate POI recommendation results, which suggests that more additional information should be considered together. The geographical feature of POIs is actually a unique identifier distinguishing them from other non-spatial items[7]. On the one hand, some prior studies[3] calculated the similarity between users in terms of their geographical influence, and then recommended possible POIs using collaborative filtering techniques. On the other hand, other prior studies investigated the geographical influence of locations. For example, Cheng[15] et al. viewed locations as ordinary non-spatial items and quantified the geographical influence of locations by predefining a range; Kurashima[16] et al. proposed a geo-topic model based on the assumption that if a location is closer to a user s current location or the locations he/she visited, it is more likely to be visited by the user; Lian[17] et al. proposed a weighted matrix factorization model which augmented user s and POI s latent factors in the model with activity area vectors of users and influence area vectors of POIs, respectively. In many prior studies[1, 2, 4, 5], the distances between locations visited by the same user were modeled based on a common distribution for all users, e.g., a power-law distribution or a multi-center Gaussian distribution. Although this benefits the quality of POI recommendation to some extent, a common distribution for all users often ignores the personalized geographical influence. Therefore, Zhang[6] et al. utilized kernel density estimation to estimate the distribution of distances between pairs of locations for each individual user. Furthermore, a latest study[7] attempted to characterize user check-in behavior using two-dimensional kernel density estimation. Inspired by the findings in traditional social networks, several POI recommendation approaches[3, 5, 14] took into consideration the social influence, but the results indicated that the effect of such a factor was not significant, because there was, on average, a small overlap of a user s check-ins to his/her friends check-ins. Besides this one, some researchers have also utilized the information of time and content to provide better recommendation results. For example, Gao[18] et al. studied the temporal cyclic patterns of check-ins in terms of temporal non-uniformness and temporal consecutiveness, and they proposed a timeaware POI recommendation model; Yin[19] et al. proposed a location-content-aware topic model in consideration of both personal interest and local preference. III. PROBLEM DEFINITION We explore the information of user preference, social influence and personalized geographical influence, and then integrate them into a unified probabilistic model to recommend proper POIs for target users. Next, we formalize the problem of this study. Given a set of venues (POIs V = {v 1, v 2,, v M }, L = {l 1, l 2,, l M } is the set of locations of POIs, where M is the number of POIs and each location has longitude and latitude coordinates. Let U = {u 1, u 2,, u N } be the set of users and G be the matrix of the relationships between users, where N is the number of users. C = (c ij N M is a check-in frequency matrix with each element (also defined as check-in variable representing the frequency of checking in POI v j (with l j by user u i. Assuming that C L is an observed check-in action matrix, the set of observations can be denoted as O = (U; V; G; C L. According to all the observations, our task is to predict the probability that a given user visits a new POI, and to recommend the top-k POIs to the user. IV. MODELING PERSONALIZED GEOGRAPHICAL INFLUENCE According to the Tobler s first law of geography[20], the locations near the place where a user usually visits, are more likely to be recommended to the user. Users differ in the geographic spatial distribution of already-visited locations. The frequency distribution of a user s historical check-ins reflects his/her preference for different locations, and the pref- 23

4 erences of various users for the same POI are also different. Therefore, modeling the geographical influence of locations on user checkin behavior is the key to solving the problem in this paper. Unlike the statistical method used in the latest study[7], in this paper we use the data field method[11] to model the mutual interaction between locations by constructing virtual fields in a two-dimensional geographical space. Inspired by physical fields such as electromagnetic fields, the concept of data field is used to depict the interaction between objects associated with the data points in a specific space[11]. In a data space Ω R P, let D = {x 1, x 2,, x n } denote a data set of P-dimensional independent data points, where x i = {x i1, x i2,, x ip } T with i = 1, 2,, n. If a data point is viewed as a particle, it generates a virtual field around itself and certain forces acting upon other data points within the field, and the strength of such forces is often inversely proportional to the distance from it. The whole data space constitutes a data field. The potential function of data fields mathematically models how the strength of data on a given task is diffused from individual samples to the population through the interaction between data objects. Thus, the potential value of a data point x = {x 1, x 2,, x P } T in the universe of discourse is defined as[11] (1 where m i is the mass of data point x i, x - x i is the distance between x i and x, and σ (0, + is an impact factor that determines the area of each data point s influence. Note that K(x is the unit potential function satisfying and. According to the definition, σ has a great impact on the spatial distribution of virtual fields. Hence, it should be optimized to make the potential value of each data point diverse by minimizing the uncertainty of the original data distribution. Entropy has been widely used as an indicator for the disorder or randomness in a closed thermodynamic system. Then, the concept of potential entropy H is introduced to measure the uncertainty of potential values, and the optimal value of σ can be obtained by minimizing the potential entropy, which is defined as follows: (3 where n is the number of data points and This study assumes that the location of a POI represents a two-dimensional data point and that the number of user check-ins for the location is the mass of the data point. A data field in the two-dimensional geographic spatial space is then constructed. We illustrate the modeling process with a simple example. Figure 1 shows the distributions of two users check-in locations, and Figure 2 further shows the corresponding distributions of personal check-in locations in Figure 1 in terms of data field. In each data field, we can calculate the potential value of a given point, which is regarded as the geographical influence of a location on an individual user. The higher the potential value, the more likely a user is to prefer the corresponding location.. Because Gaussian function matches the physical nature of data fields, Gaussian potential can be chosen to characterize the scalar potential of data fields. Therefore, φ(x can be written as (2 Fig.1 Distributions of personal check-in locations 24

5 V. A UNIFIED POI RECOMMENDATION FRAMEWORK 5.1 Model specification The main goal of this paper is to build a model based on a factor graph model[12] to predict the probability that a given user visits a new POI. Here, we present a simple example. As shown in Figure 3, the observed variables, including users (u 1 ~ u 5 and their friendships (denoted by dashed lines in Figure 3, POIs (v 1 ~ v 5, and locations associated with the POIs (l 1 ~ l 5, are the input of our model; the latent variable, namely the check-in variable (whose definition please refer to Section III, is solved by our model. Because only a fraction of the check-in variable s values are exactly known (denoted by solid lines in Figure 3, our task is to predict the missing entries (viz. unlabelled data in each check-in variable vector. This is a typical semi-supervised learning process. In our model, we use the state factor f and the correlation factor g to model the geographical influence and the social influence, respectively, and the two types of factors are described as follows: -- State factor: Given the check-in records of user u i, f(u i, C i represents the posterior probability that user u i visits a given POI, which reflects the geographical influence of POIs on decision-making behavior of users. -- Correlation factor: g(c i, G(C i denotes the correlation between the check-in variables of different users, where G(C i is the set of the vectors of other users related to C i. It reflects the influence of friends check-in behavior on user s decisions. Given observation data O and location information L, the joint distribution over C is defined as (4 Latitude Visited Location Potential Value Longitude Longitude Fig.2 The corresponding distributions of personal check-in locations in terms of data field u 1 c 11 c 12 u 2 u Fig.3 Graphical representation of a simple example u u 5 v ( l v ( l v ( l v ( l v ( l 1 1 c c 33 c 43 c 44 c Latitude Visited Location Potential Value gc ( 2, C 4 C 2 C C 5 gc ( 1, C 2 4 gc C C ( 4, C gc ( 2, C 3 gc ( 3, C 4 f ( u 3, C 3 f f ( u, C ( u 2 2 5, C 5 f ( u 4, C 4 f ( u 1, C 1 u 2 u u 5 4 u u 3 1 In this study, due to uneven probability distribution of users check-ins, we use an exponential-linear function to instantiate the two types of factors, and the check-in variable c ij predicted by our model is assumed to be continuous in order to generate more accurate sorting. In particular, we define the state factor as (5 where α is a weighting vector and Ф is defined as a vector of feature functions for the state factor. Similarly, the correlation factor can be defined as (6 where β is a weighting vector and Ω is defined as a vector of feature functions for the correlation factor. The normalization factors Z α and Z β, also 25

6 known as the partition function, are defined as malization factor which is defined as (7 (8 (13 Next, the log-likelihood objective function l(θ can be defined as 5.2 Factor definition The feature functions for the state factor and the correlation factor are introduced in brief. -- Personalized geographical influence: Based on the spatial location distribution of all POIs visited by users, the potential value of each location is calculated by the data field method (see Eq. (2, and it represents the state feature of a given location (y ij. The corresponding feature function is defined in quadratic form as (9 -- Social influence: In this model, the social influence on POI recommendation is modeled in terms of the friendship between users, and its feature function is defined as (10 where S ik is the correlation value between two users u i and u k. In this paper, S ik is defined as (14 where C C L represents the entries in the check-in action matrix C except the check-in variable s values in the matrix C L. To obtain the optimal value of θ, we maximize the log-likelihood objective function l(θ, i.e., (15 We then take the gradient of each θ with regard to the objective function: (11 where Γ(u is the set of friends of u and e ik = 1 represents a direct friendship between u i and u k. 5.3 Model learning The task of model learning is to estimate a parameter configuration θ = [α, β] to maximize the log-likelihood of observation data. For simplicity, all feature functions for the checkin variable are concatenated and denoted as w(c ij = (Φ(u i, c ij T, T. Then, Eq. (4 can be written as where (12 and Z is a nor- (16 Here, we calculate approximately the gradients of θ using nonparametric belief propagation (NBP[21], which is an inference algorithm for graphical models that include continuous, non-gaussian random variables. Note that the NBP process is performed twice Algorithm 1 Parameter estimation Input: U, V, L, G, C L, and η = 0.01 Output: θ = [α, β] Initialize θ repeat Calculate E ( ( using NBP θ ( CC L W c P, G, L ij Calculate E ( ( P using NBP θ ( CG, L W cij Calculate the gradient of θ according to Eq. (16 Update parameter θ with the learning rate η: θnew θold η = + until convergence θ Return θnew Fig.4 An algorithm to estimate the model parameters 26

7 in each iteration in Algorithm 1 (see Figure 4: one is to estimate the marginal probability P(C G, L, and the other is to estimate P(C C L, G, L. Finally, using the newly learned gradient, we update each parameter configuration with a given learning rate η. 5.4 Inferring unknown check-ins With the learned parameter configuration, we can predict all unknown check-in variable s values by finding a check-in variable configuration that maximizes the joint probability: (17 Here, we calculate the marginal distribution function of the check-in variable using NBP and obtain the values of the variable by maximizing the marginal distribution function. 5.5 Distributed learning algorithm Our learning algorithm is indeed extensible and scalable to address the needs that real LBSNs may contain millions of users and POIs. To achieve scalability on large-scale data sets, a distributed learning algorithm is proposed based on the message passing interface (MPI to speed up the calculation of each parameter s gradient via NBP. This algorithm adopts a master/slave architecture. One master node is used to update the parameters, and other slave nodes are used to calculate their gradients. In the initial stage we partition the set of POIs into P subsets roughly equal, where P denotes the number of slave nodes. In each iteration in Algorithm 1, the Table I Data statistics Data source Foursquare Gowalla Duration ~ ~ No. of check-in records 2,073,740 6,442,890 No. of users 18, ,591 No. of unique locations 36,907 1,280,956 No. of friendship links 115, ,327 Average check-ins per user Average friends per user Average check-ins per location master node sends the updated parameter configuration to all slave nodes first. The slave nodes compute the marginal probability of the corresponding subset using NBP, and then send the gradients of the parameters obtained to the master node. Eventually, the master node sums all the gradients gathered from different subsets, and then updates the parameters using the gradient descent algorithm. VI. EXPERIMENT ANALYSIS 6.1 Data collection Two large-scale data sets collected from Foursquare[10] and Gowalla[5] were used in our experiment, and their key statistics are presented in Table Evaluation metrics As with previous work[1, 4], this paper utilizes two widely used metrics, i.e., Recall@k and Precision@k, to measure the quality of recommendation results. They are formally defined as (18 (19 where S u (k is the set of the top-k POIs recommended to user u and V u is the set of locations that have been actually visited by the user in the testing set. 6.3 Evaluated recommendation techniques To validate the effectiveness of our method, we compared it with five typical approaches. -- Non-negative Matrix Factorization (NMF [22]: it is widely used for collaborative filtering recently, and in our experiment it only utilizes the check-in action matrix. -- Multi-center Gaussian Model (MGM[2]: in such a model, the distances between visited locations and centers (i.e., the most popular locations are modeled as a universal multi-center Gaussian distribution for all users, and in 27

8 and 8.57%, respectively. Similarly, the Recall values of our method are, on average, inour experiment it utilizes geographical information and the check-in action matrix. -- Social Regularization (SR[9]: it is a stateof-the-art social recommendation algorithm, and in our experiment it utilizes the check-in action matrix and social relationships. -- Personalized Geo-Social Location Recommendation ( igslr[6]: igslr uses a kernel density estimation approach to model the personalized geographical influence on user check-in behavior, and in our experiment it utilizes social relationships, geographical information and the check-in action matrix by applying a product rule. -- Two-dimensional Geographic Coordinates for Location Recommendations (CoRe[7]: CoRe models geographical check-in probability density over latitude and longitude coordinates, and in our experiment it utilizes the same information that igslr used. 6.4 Comparison of performance In our experiment, each data set in question was divided into a training set and a testing set according to check-in time, that is, 70% of the previous check-in records were used as the training set and the rest were used as the testing set. The parameters of each method under discussion were set to their own optima for the two data sets. The experiment was repeated five times independently to reduce random errors generated during the experiment, and we used the average of evaluation metric values as the final result to increase reliability of the measurement. As shown in Figure 5, for the two data sets, our method ( outperforms other competing methods in terms of the evaluation metrics. On one hand, because our model considers more types of information for POI recommendation, it works better than NMF, MGM and SR. On the other hand, because our method can automatically determine the degree of each factor s contribution by optimizing the model parameters, it works better than igslr and CoRe. Compared with CoRe, the Precision values of our method for the two data sets are, on average, increased by 17% Precision Precision NMF MGM SR igslr CoRe (a Foursquare (Precision NMF MGM SR igslr CoRe (c Gowalla (Precision Recall Recall NMF MGM SR igslr CoRe (b Foursquare (Recall NMF MGM SR igslr CoRe (d Gowalla (Recall Fig.5 Values of the evaluation metrics for various methods Precision Precision State Factor Correlation Factor (a Foursquare (Precision State Factor Correlation Factor (c Gowalla (Precision Recall Recall State Factor Correlation Factor (b Foursquare (Recall 0.04 State Factor 0.02 Correlation Factor (d Gowalla (Recall Fig.6 Precision and Recall values of the model with different factors 28

9 Running time (s 20,000 18,000 16,000 14,000 12,000 10,000 8,000 6,000 4,000 2, Number of cores Fig.7 Running time for one iteration vs. the number of cores Precision k=10 k=20 k= k=40 k= Number of iterations Recall k=10 k=20 k= k=40 k= Number of iterations further improve performance, with the rising cost of complexity. 6.6 Scalability To evaluate the scalability of our distributed learning algorithm, we implemented it in C++ using MPI and conducted the experiment on a server running Windows Server 2012, with Intel Core i7 CPU (3.6 GHz and 32 GB memory. Given space limitations, only the case of Foursquare is presented in this paper. Figure 7 depicts the running time for one iteration in the distributed learning algorithm with respect to various numbers of slave nodes (viz. cores. With an increase of the number of cores, the efficiency of the algorithm is improved remarkably. Besides, we also analyzed the convergence process of the algorithm, as shown in Figure 8. It shows that the algorithm converges quickly after about five iterations with respect to Precision and Recall values, though the value of top-k (see Eqs. (18 and (19 ranges from 10 to 50. Thus, we set the maximum number of iterations to 10 and the threshold for the change of θ to 0.001, so as to reduce the running time of our distributed learning algorithm. Fig.8 Convergence of the distributed learning algorithm creased by 6.19% and 8.87%, respectively. 6.5 Factor effect analysis We evaluated the effects of different factors defined in our model on the evaluation metrics, and the results are shown in Figure 6. Although the two types of factors are useful for POI recommendation, the results are not good enough when using a single type of factors alone. Interestingly, the state factor has a greater impact on Precision and Recall, especially for Foursquare, which suggests that user check-in behavior is more likely to be affected by the geographical influence. Note that more factors such as user similarity and POI similarity can be defined and used in our model to VII. CONCLUSIONS To recommend more appropriate POIs to target users, in this paper we consider both the personalized geographical influence and the social influence on individual user check-in behavior. In particular, the personalized geographical influence in a two- dimensional geographical space is modeled using the data field method. A unified semi-supervised probabilistic model, which is based on a factor graph model, is then proposed to integrate the above factors. Furthermore, a distributed learning algorithm is proposed to scale up our method to handle large-scale data sets. Experimental results based on Foursquare and Gowalla show that our method outperforms other competing approaches with regard to the evaluation metrics. Finally, our future work will extend our model by integrating more useful factors such 29

10 as user-generated tags[23] and check-in time to improve its prediction performance. ACKNOWLEDGEMENTS We greatly appreciate the constructive comments and useful suggestions from the guest editors and anonymous reviewers, which help us improve the quality and readability of our paper. This work was supported by National Key Basic Research Program of China (973 Program under Grant No. 2014CB340404; National Natural Science Foundation of China under Grant Nos and ; Youth Chenguang Project of Science and Technology of Wuhan City under Grant No References [1] YE M, YIN P, LEE W C, et al. Exploiting geographical influence for collaborative point-of-interest recommendation[c]// Proceedings of the 34 th International ACM SIGIR Conference on Research and Development in Information Retrieval. New York, NY: ACM Press, 2011: [2] CHENG C, YANG H, KING I, et al. Fused matrix factorization with geographical and social influence in location-based social networks[c]// Proceedings of the 26 th AAAI Conference on Artificial Intelligence. Palo Alto, CA: AAAI Press, 2012: [3] GAO H, TANG J, LIU H. gscorr: modeling geo-social correlations for new check-ins on location-based social networks[c]// Proceedings of the 21 st ACM International Conference on Information and Knowledge Management. New York, NY: ACM Press, 2012: [4] LIU B, FU Y, YAO Z, et al. Learning geographical preferences for point-of-interest recommendation[c]// Proceedings of the 19 th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York, NY: ACM Press, 2013: [5] CHO E, MYERS S, LESKOVEC J. Friendship and mobility: user movement in location-based social networks[c]// Proceedings of the 17 th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York, NY: ACM Press, 2011: [6] ZHANG J, CHOW C. igslr: personalized geo-social location recommendation: a kernel density estimation approach[c]// Proceedings of the 21 st ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems. New York, NY: ACM Press, 2013: [7] ZHANG J, CHOW C. CoRe: Exploiting the personalized influence of two-dimensional geographic coordinates for location recommendations[j]. Information Sciences, 2015, 293: [8] ANAGNOSTOPOULOS A, KUMAR R, MAHDIAN M. Influence and correlation in social networks[c]// Proceedings of the 14 th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York, NY: ACM Press, 2008: [9] MA H, ZHOU D, LIU C, et al. Recommender systems with social regularization[c]// Proceedings of the 4 th ACM International Conference on Web Search and Data Mining. New York, NY: ACM Press, 2011: [10] GAO H, TANG J, LIU H. Exploring Social-Historical Ties on Location-Based Social Networks[C]// Proceedings of the 6 th International AAAI Conference on Weblogs and Social Media. Palo Alto, CA: AAAI Press, 2012: [11] LI D, DU Y. Artificial intelligence with uncertainty[m]. Boca Raton, FL: CRC Press, [12] KSCHISCHANG F, FREY B, LOELIGER H. Factor graphs and the sum-product algorithm[j]. IEEE Transactions on Information Theory, 2001, 47(2: [13] ZHENG V, ZHENG Y, XIE X, et al. Towards mobile intelligence: Learning from GPS history data for collaborative recommendation[j]. Artificial Intelligence, 2012, 184: [14] HU B, ESTER M. Spatial topic modeling in online social media for location recommendation[c]// Proceedings of the 7 th ACM Conference on Recommender Systems. New York, NY: ACM Press, 2013: [15] CHENG C, YANG H, LYU M R, et al. Where you like to go next: Successive point-of-interest recommendation[c]// Proceedings of the 23 rd International Joint Conference on Artificial Intelligence. Palo Alto, CA: AAAI Press, 2013: [16] KURASHIMA T, IWATA T, HOSHIDE T, et al. Geo topic model: joint modeling of user s activity area and interests for location recommendation[c]// Proceedings of the 6 th ACM International Conference on Web Search and Data Mining. New York, NY: ACM Press, 2013: [17] LIAN D, ZHAO C, XIE X, et al. GeoMF: joint geographical modeling and matrix factorization for point-of-interest recommendation[c]// Proceedings of the 20 th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York, NY: ACM Press, 2014: [18] GAO H, TANG J, HU X, et al. Exploring temporal effects for location recommendation on loca- 30

11 tion-based social networks[c]// Proceedings of the 7 th ACM conference on Recommender Systems. New York, NY: ACM Press, 2013: [19] YIN H, SUN Y, CUI B, et al. Lcars: A location-content-aware recommender system[c]// Proceedings of the 19 th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York, NY: ACM Press, 2013: [20] TOBLER W. A computer movie simulating urban growth in the Detroit region[j]. Economic geography, 1970, 46: [21] SUDDERTH E, IHLER A, ISARD M, et al. Nonparametric belief propagation[j]. Communications of the ACM, 2010, 53(10: [22] LEE D, SEUNG H. Learning the parts of objects by non-negative matrix factorization[j]. Nature, 1999, 401(6755: [23] MA T, ZHOU J, TANG M, et al. Social Network and Tag Sources Based Augmenting Collaborative Recommender System[J]. IEICE Transactions on Information and Systems, 2015, 98-D(4: Biographies HUANG Liwei, received his Ph.D. degree in Computer Science from the PLA University of Science and Technology. He is currently an engineer in the Beijing Institute of Remote Sensing. Dr. Huang s research focus is on data mining and machine learning. He is now a member of the China Computer Federation (CCF. MA Yutao, received his Ph.D. degree in Computer Science from Wuhan University. He is currently an Associate Professor in the School of Computer, Wuhan University. Dr. Ma was with the China Institute of Electronic System Engineering as a post-doctoral fellow, and he has been a visiting scholar in the Department of Electronic and Computer Engineering, Lehigh University. His research focus is on the development of large-scale software service systems and complex networks. He is now a senior member of CCF and a member of ACM and IEEE. He has also been the Vice Chair of the Services Society Young Scientists Forum (China and is a member of the CCF Technical Committee on Services Computing, the CCF Youth Working Committee, and the CCF Young Computer Scientists & Engineers Forum (Wuhan Branch. *The corresponding author. ytma@ whu.edu.cn LIU Yanbo, received her M.S. degree in Automation from the National University of Defense Technology. She is currently an engineer in the Beijing Institute of Remote Sensing. Her research focus is on information automation. 31

Recommendation System for Location-based Social Network CS224W Project Report

Recommendation System for Location-based Social Network CS224W Project Report Recommendation System for Location-based Social Network CS224W Project Report Group 42, Yiying Cheng, Yangru Fang, Yongqing Yuan 1 Introduction With the rapid development of mobile devices and wireless

More information

Combining Review Text Content and Reviewer-Item Rating Matrix to Predict Review Rating

Combining Review Text Content and Reviewer-Item Rating Matrix to Predict Review Rating Combining Review Text Content and Reviewer-Item Rating Matrix to Predict Review Rating Dipak J Kakade, Nilesh P Sable Department of Computer Engineering, JSPM S Imperial College of Engg. And Research,

More information

POI2Vec: Geographical Latent Representation for Predicting Future Visitors

POI2Vec: Geographical Latent Representation for Predicting Future Visitors : Geographical Latent Representation for Predicting Future Visitors Shanshan Feng 1 Gao Cong 2 Bo An 2 Yeow Meng Chee 3 1 Interdisciplinary Graduate School 2 School of Computer Science and Engineering

More information

igeorec: A Personalized and Efficient Geographical Location Recommendation Framework

igeorec: A Personalized and Efficient Geographical Location Recommendation Framework 1 : A Personalized and Efficient Geographical Location Recommendation Framework Jia-Dong Zhang, Chi-Yin Chow, Member, IEEE, and Yanhua Li, Member, IEEE Abstract Geographical influence has been intensively

More information

Preference-Aware POI Recommendation with Temporal and Spatial Influence

Preference-Aware POI Recommendation with Temporal and Spatial Influence Proceedings of the Twenty-inth International Florida Artificial Intelligence Research Society Conference Preference-Aware POI Recommendation with Temporal and Spatial Influence Madhuri Debnath madhuri.debnath@mavs.uta.edu

More information

Survey on Recommendation of Personalized Travel Sequence

Survey on Recommendation of Personalized Travel Sequence Survey on Recommendation of Personalized Travel Sequence Mayuri D. Aswale 1, Dr. S. C. Dharmadhikari 2 ME Student, Department of Information Technology, PICT, Pune, India 1 Head of Department, Department

More information

Social Interaction Based Video Recommendation: Recommending YouTube Videos to Facebook Users

Social Interaction Based Video Recommendation: Recommending YouTube Videos to Facebook Users Social Interaction Based Video Recommendation: Recommending YouTube Videos to Facebook Users Bin Nie, Honggang Zhang, Yong Liu Fordham University, Bronx, NY. Email: {bnie, hzhang44}@fordham.edu NYU Poly,

More information

Learning Graph-based POI Embedding for Location-based Recommendation

Learning Graph-based POI Embedding for Location-based Recommendation Learning Graph-based POI Embedding for Location-based Recommendation Min Xie, Hongzhi Yin, Hao Wang, Fanjiang Xu, Weitong Chen, Sen Wang Institute of Software, Chinese Academy of Sciences, China The University

More information

Privacy-Preserving of Check-in Services in MSNS Based on a Bit Matrix

Privacy-Preserving of Check-in Services in MSNS Based on a Bit Matrix BULGARIAN ACADEMY OF SCIENCES CYBERNETICS AND INFORMATION TECHNOLOGIES Volume 15, No 2 Sofia 2015 Print ISSN: 1311-9702; Online ISSN: 1314-4081 DOI: 10.1515/cait-2015-0032 Privacy-Preserving of Check-in

More information

Ranking Web Pages by Associating Keywords with Locations

Ranking Web Pages by Associating Keywords with Locations Ranking Web Pages by Associating Keywords with Locations Peiquan Jin, Xiaoxiang Zhang, Qingqing Zhang, Sheng Lin, and Lihua Yue University of Science and Technology of China, 230027, Hefei, China jpq@ustc.edu.cn

More information

Link Prediction for Social Network

Link Prediction for Social Network Link Prediction for Social Network Ning Lin Computer Science and Engineering University of California, San Diego Email: nil016@eng.ucsd.edu Abstract Friendship recommendation has become an important issue

More information

Joint Entity Resolution

Joint Entity Resolution Joint Entity Resolution Steven Euijong Whang, Hector Garcia-Molina Computer Science Department, Stanford University 353 Serra Mall, Stanford, CA 94305, USA {swhang, hector}@cs.stanford.edu No Institute

More information

Semi supervised clustering for Text Clustering

Semi supervised clustering for Text Clustering Semi supervised clustering for Text Clustering N.Saranya 1 Assistant Professor, Department of Computer Science and Engineering, Sri Eshwar College of Engineering, Coimbatore 1 ABSTRACT: Based on clustering

More information

A Survey on Postive and Unlabelled Learning

A Survey on Postive and Unlabelled Learning A Survey on Postive and Unlabelled Learning Gang Li Computer & Information Sciences University of Delaware ligang@udel.edu Abstract In this paper we survey the main algorithms used in positive and unlabeled

More information

Constrained Clustering with Interactive Similarity Learning

Constrained Clustering with Interactive Similarity Learning SCIS & ISIS 2010, Dec. 8-12, 2010, Okayama Convention Center, Okayama, Japan Constrained Clustering with Interactive Similarity Learning Masayuki Okabe Toyohashi University of Technology Tenpaku 1-1, Toyohashi,

More information

AN IMPROVED TAIPEI BUS ESTIMATION-TIME-OF-ARRIVAL (ETA) MODEL BASED ON INTEGRATED ANALYSIS ON HISTORICAL AND REAL-TIME BUS POSITION

AN IMPROVED TAIPEI BUS ESTIMATION-TIME-OF-ARRIVAL (ETA) MODEL BASED ON INTEGRATED ANALYSIS ON HISTORICAL AND REAL-TIME BUS POSITION AN IMPROVED TAIPEI BUS ESTIMATION-TIME-OF-ARRIVAL (ETA) MODEL BASED ON INTEGRATED ANALYSIS ON HISTORICAL AND REAL-TIME BUS POSITION Xue-Min Lu 1,3, Sendo Wang 2 1 Master Student, 2 Associate Professor

More information

An Empirical Analysis of Communities in Real-World Networks

An Empirical Analysis of Communities in Real-World Networks An Empirical Analysis of Communities in Real-World Networks Chuan Sheng Foo Computer Science Department Stanford University csfoo@cs.stanford.edu ABSTRACT Little work has been done on the characterization

More information

Semi-supervised Data Representation via Affinity Graph Learning

Semi-supervised Data Representation via Affinity Graph Learning 1 Semi-supervised Data Representation via Affinity Graph Learning Weiya Ren 1 1 College of Information System and Management, National University of Defense Technology, Changsha, Hunan, P.R China, 410073

More information

Supplementary Figure 1. Decoding results broken down for different ROIs

Supplementary Figure 1. Decoding results broken down for different ROIs Supplementary Figure 1 Decoding results broken down for different ROIs Decoding results for areas V1, V2, V3, and V1 V3 combined. (a) Decoded and presented orientations are strongly correlated in areas

More information

Random projection for non-gaussian mixture models

Random projection for non-gaussian mixture models Random projection for non-gaussian mixture models Győző Gidófalvi Department of Computer Science and Engineering University of California, San Diego La Jolla, CA 92037 gyozo@cs.ucsd.edu Abstract Recently,

More information

F2F: Friend-To-Friend Semantic Path Recommendation

F2F: Friend-To-Friend Semantic Path Recommendation F2F: Friend-To-Friend Semantic Path Recommendation Mohamed Mahmoud Hasan 1 and Hoda M. O. Mokhtar 2 1 Management Information Systems Department, Faculty of Commerce and Business Administration, Future

More information

A Brief Review of Representation Learning in Recommender 赵鑫 RUC

A Brief Review of Representation Learning in Recommender 赵鑫 RUC A Brief Review of Representation Learning in Recommender Systems @ 赵鑫 RUC batmanfly@qq.com Representation learning Overview of recommender systems Tasks Rating prediction Item recommendation Basic models

More information

IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, 2013 ISSN:

IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 1, Issue 5, Oct-Nov, 2013 ISSN: Semi Automatic Annotation Exploitation Similarity of Pics in i Personal Photo Albums P. Subashree Kasi Thangam 1 and R. Rosy Angel 2 1 Assistant Professor, Department of Computer Science Engineering College,

More information

Content-Aware Hierarchical Point-of-Interest Embedding Model for Successive POI Recommendation

Content-Aware Hierarchical Point-of-Interest Embedding Model for Successive POI Recommendation Content-Aware Hierarchical Point-of-Interest Embedding Model for Successive POI Recommendation Buru Chang 1, Yonggyu Park 1, Donghyeon Park 1, Seongsoon Kim 2 and Jaewoo Kang 1 1 Korea University 2 Naver

More information

Influence Maximization in Location-Based Social Networks Ivan Suarez, Sudarshan Seshadri, Patrick Cho CS224W Final Project Report

Influence Maximization in Location-Based Social Networks Ivan Suarez, Sudarshan Seshadri, Patrick Cho CS224W Final Project Report Influence Maximization in Location-Based Social Networks Ivan Suarez, Sudarshan Seshadri, Patrick Cho CS224W Final Project Report Abstract The goal of influence maximization has led to research into different

More information

COLLABORATIVE LOCATION AND ACTIVITY RECOMMENDATIONS WITH GPS HISTORY DATA

COLLABORATIVE LOCATION AND ACTIVITY RECOMMENDATIONS WITH GPS HISTORY DATA COLLABORATIVE LOCATION AND ACTIVITY RECOMMENDATIONS WITH GPS HISTORY DATA Vincent W. Zheng, Yu Zheng, Xing Xie, Qiang Yang Hong Kong University of Science and Technology Microsoft Research Asia WWW 2010

More information

FSRM Feedback Algorithm based on Learning Theory

FSRM Feedback Algorithm based on Learning Theory Send Orders for Reprints to reprints@benthamscience.ae The Open Cybernetics & Systemics Journal, 2015, 9, 699-703 699 FSRM Feedback Algorithm based on Learning Theory Open Access Zhang Shui-Li *, Dong

More information

Comparison of Recommender System Algorithms focusing on the New-Item and User-Bias Problem

Comparison of Recommender System Algorithms focusing on the New-Item and User-Bias Problem Comparison of Recommender System Algorithms focusing on the New-Item and User-Bias Problem Stefan Hauger 1, Karen H. L. Tso 2, and Lars Schmidt-Thieme 2 1 Department of Computer Science, University of

More information

Community-Based Recommendations: a Solution to the Cold Start Problem

Community-Based Recommendations: a Solution to the Cold Start Problem Community-Based Recommendations: a Solution to the Cold Start Problem Shaghayegh Sahebi Intelligent Systems Program University of Pittsburgh sahebi@cs.pitt.edu William W. Cohen Machine Learning Department

More information

A Novel Method for Activity Place Sensing Based on Behavior Pattern Mining Using Crowdsourcing Trajectory Data

A Novel Method for Activity Place Sensing Based on Behavior Pattern Mining Using Crowdsourcing Trajectory Data A Novel Method for Activity Place Sensing Based on Behavior Pattern Mining Using Crowdsourcing Trajectory Data Wei Yang 1, Tinghua Ai 1, Wei Lu 1, Tong Zhang 2 1 School of Resource and Environment Sciences,

More information

An Improved KNN Classification Algorithm based on Sampling

An Improved KNN Classification Algorithm based on Sampling International Conference on Advances in Materials, Machinery, Electrical Engineering (AMMEE 017) An Improved KNN Classification Algorithm based on Sampling Zhiwei Cheng1, a, Caisen Chen1, b, Xuehuan Qiu1,

More information

TriRank: Review-aware Explainable Recommendation by Modeling Aspects

TriRank: Review-aware Explainable Recommendation by Modeling Aspects TriRank: Review-aware Explainable Recommendation by Modeling Aspects Xiangnan He, Tao Chen, Min-Yen Kan, Xiao Chen National University of Singapore Presented by Xiangnan He CIKM 15, Melbourne, Australia

More information

Collaborative Filtering using a Spreading Activation Approach

Collaborative Filtering using a Spreading Activation Approach Collaborative Filtering using a Spreading Activation Approach Josephine Griffith *, Colm O Riordan *, Humphrey Sorensen ** * Department of Information Technology, NUI, Galway ** Computer Science Department,

More information

Unsupervised Interesting Places Discovery in Location-Based Social Sensing

Unsupervised Interesting Places Discovery in Location-Based Social Sensing Unsupervised Interesting Places Discovery in Location-Based Social Sensing Chao Huang, Dong Wang Department of Computer Science and Engineering University of Notre Dame Notre Dame, IN 46556 chuang7@nd.edu,

More information

A new improved ant colony algorithm with levy mutation 1

A new improved ant colony algorithm with levy mutation 1 Acta Technica 62, No. 3B/2017, 27 34 c 2017 Institute of Thermomechanics CAS, v.v.i. A new improved ant colony algorithm with levy mutation 1 Zhang Zhixin 2, Hu Deji 2, Jiang Shuhao 2, 3, Gao Linhua 2,

More information

CS249: ADVANCED DATA MINING

CS249: ADVANCED DATA MINING CS249: ADVANCED DATA MINING Recommender Systems II Instructor: Yizhou Sun yzsun@cs.ucla.edu May 31, 2017 Recommender Systems Recommendation via Information Network Analysis Hybrid Collaborative Filtering

More information

Query Independent Scholarly Article Ranking

Query Independent Scholarly Article Ranking Query Independent Scholarly Article Ranking Shuai Ma, Chen Gong, Renjun Hu, Dongsheng Luo, Chunming Hu, Jinpeng Huai SKLSDE Lab, Beihang University, China Beijing Advanced Innovation Center for Big Data

More information

Query-Sensitive Similarity Measure for Content-Based Image Retrieval

Query-Sensitive Similarity Measure for Content-Based Image Retrieval Query-Sensitive Similarity Measure for Content-Based Image Retrieval Zhi-Hua Zhou Hong-Bin Dai National Laboratory for Novel Software Technology Nanjing University, Nanjing 2193, China {zhouzh, daihb}@lamda.nju.edu.cn

More information

High Capacity Reversible Watermarking Scheme for 2D Vector Maps

High Capacity Reversible Watermarking Scheme for 2D Vector Maps Scheme for 2D Vector Maps 1 Information Management Department, China National Petroleum Corporation, Beijing, 100007, China E-mail: jxw@petrochina.com.cn Mei Feng Research Institute of Petroleum Exploration

More information

Note Set 4: Finite Mixture Models and the EM Algorithm

Note Set 4: Finite Mixture Models and the EM Algorithm Note Set 4: Finite Mixture Models and the EM Algorithm Padhraic Smyth, Department of Computer Science University of California, Irvine Finite Mixture Models A finite mixture model with K components, for

More information

STUDYING OF CLASSIFYING CHINESE SMS MESSAGES

STUDYING OF CLASSIFYING CHINESE SMS MESSAGES STUDYING OF CLASSIFYING CHINESE SMS MESSAGES BASED ON BAYESIAN CLASSIFICATION 1 LI FENG, 2 LI JIGANG 1,2 Computer Science Department, DongHua University, Shanghai, China E-mail: 1 Lifeng@dhu.edu.cn, 2

More information

Estimating the Information Rate of Noisy Two-Dimensional Constrained Channels

Estimating the Information Rate of Noisy Two-Dimensional Constrained Channels Estimating the Information Rate of Noisy Two-Dimensional Constrained Channels Mehdi Molkaraie and Hans-Andrea Loeliger Dept. of Information Technology and Electrical Engineering ETH Zurich, Switzerland

More information

Implementation of Parallel CASINO Algorithm Based on MapReduce. Li Zhang a, Yijie Shi b

Implementation of Parallel CASINO Algorithm Based on MapReduce. Li Zhang a, Yijie Shi b International Conference on Artificial Intelligence and Engineering Applications (AIEA 2016) Implementation of Parallel CASINO Algorithm Based on MapReduce Li Zhang a, Yijie Shi b State key laboratory

More information

VisoLink: A User-Centric Social Relationship Mining

VisoLink: A User-Centric Social Relationship Mining VisoLink: A User-Centric Social Relationship Mining Lisa Fan and Botang Li Department of Computer Science, University of Regina Regina, Saskatchewan S4S 0A2 Canada {fan, li269}@cs.uregina.ca Abstract.

More information

R. R. Badre Associate Professor Department of Computer Engineering MIT Academy of Engineering, Pune, Maharashtra, India

R. R. Badre Associate Professor Department of Computer Engineering MIT Academy of Engineering, Pune, Maharashtra, India Volume 7, Issue 4, April 2017 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Web Service Ranking

More information

CS224W: Social and Information Network Analysis Project Report: Edge Detection in Review Networks

CS224W: Social and Information Network Analysis Project Report: Edge Detection in Review Networks CS224W: Social and Information Network Analysis Project Report: Edge Detection in Review Networks Archana Sulebele, Usha Prabhu, William Yang (Group 29) Keywords: Link Prediction, Review Networks, Adamic/Adar,

More information

Using PageRank in Feature Selection

Using PageRank in Feature Selection Using PageRank in Feature Selection Dino Ienco, Rosa Meo, and Marco Botta Dipartimento di Informatica, Università di Torino, Italy fienco,meo,bottag@di.unito.it Abstract. Feature selection is an important

More information

Learning to Match. Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li

Learning to Match. Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li Learning to Match Jun Xu, Zhengdong Lu, Tianqi Chen, Hang Li 1. Introduction The main tasks in many applications can be formalized as matching between heterogeneous objects, including search, recommendation,

More information

A Constrained Spreading Activation Approach to Collaborative Filtering

A Constrained Spreading Activation Approach to Collaborative Filtering A Constrained Spreading Activation Approach to Collaborative Filtering Josephine Griffith 1, Colm O Riordan 1, and Humphrey Sorensen 2 1 Dept. of Information Technology, National University of Ireland,

More information

Dynamic Clustering of Data with Modified K-Means Algorithm

Dynamic Clustering of Data with Modified K-Means Algorithm 2012 International Conference on Information and Computer Networks (ICICN 2012) IPCSIT vol. 27 (2012) (2012) IACSIT Press, Singapore Dynamic Clustering of Data with Modified K-Means Algorithm Ahamed Shafeeq

More information

Improved Balanced Parallel FP-Growth with MapReduce Qing YANG 1,a, Fei-Yang DU 2,b, Xi ZHU 1,c, Cheng-Gong JIANG *

Improved Balanced Parallel FP-Growth with MapReduce Qing YANG 1,a, Fei-Yang DU 2,b, Xi ZHU 1,c, Cheng-Gong JIANG * 2016 Joint International Conference on Artificial Intelligence and Computer Engineering (AICE 2016) and International Conference on Network and Communication Security (NCS 2016) ISBN: 978-1-60595-362-5

More information

CLASSIFICATION WITH RADIAL BASIS AND PROBABILISTIC NEURAL NETWORKS

CLASSIFICATION WITH RADIAL BASIS AND PROBABILISTIC NEURAL NETWORKS CLASSIFICATION WITH RADIAL BASIS AND PROBABILISTIC NEURAL NETWORKS CHAPTER 4 CLASSIFICATION WITH RADIAL BASIS AND PROBABILISTIC NEURAL NETWORKS 4.1 Introduction Optical character recognition is one of

More information

Samuel Coolidge, Dan Simon, Dennis Shasha, Technical Report NYU/CIMS/TR

Samuel Coolidge, Dan Simon, Dennis Shasha, Technical Report NYU/CIMS/TR Detecting Missing and Spurious Edges in Large, Dense Networks Using Parallel Computing Samuel Coolidge, sam.r.coolidge@gmail.com Dan Simon, des480@nyu.edu Dennis Shasha, shasha@cims.nyu.edu Technical Report

More information

Recommendations in Location-based Social Networks: A Survey

Recommendations in Location-based Social Networks: A Survey Geoinformatica (2015) 19:525565 DOI 10.1007/s10707-014-0220-8 Recommendations in Location-based Social Networks: A Survey Jie Bao Yu Zheng David Wilkie Mohamed Mokbel Received: 23 September 2013 / Revised:

More information

Performance Assessment of DMOEA-DD with CEC 2009 MOEA Competition Test Instances

Performance Assessment of DMOEA-DD with CEC 2009 MOEA Competition Test Instances Performance Assessment of DMOEA-DD with CEC 2009 MOEA Competition Test Instances Minzhong Liu, Xiufen Zou, Yu Chen, Zhijian Wu Abstract In this paper, the DMOEA-DD, which is an improvement of DMOEA[1,

More information

CHAPTER 5 OPTIMAL CLUSTER-BASED RETRIEVAL

CHAPTER 5 OPTIMAL CLUSTER-BASED RETRIEVAL 85 CHAPTER 5 OPTIMAL CLUSTER-BASED RETRIEVAL 5.1 INTRODUCTION Document clustering can be applied to improve the retrieval process. Fast and high quality document clustering algorithms play an important

More information

Fast Efficient Clustering Algorithm for Balanced Data

Fast Efficient Clustering Algorithm for Balanced Data Vol. 5, No. 6, 214 Fast Efficient Clustering Algorithm for Balanced Data Adel A. Sewisy Faculty of Computer and Information, Assiut University M. H. Marghny Faculty of Computer and Information, Assiut

More information

Where s the Boss? : Monte Carlo Localization for an Autonomous Ground Vehicle using an Aerial Lidar Map

Where s the Boss? : Monte Carlo Localization for an Autonomous Ground Vehicle using an Aerial Lidar Map Where s the Boss? : Monte Carlo Localization for an Autonomous Ground Vehicle using an Aerial Lidar Map Sebastian Scherer, Young-Woo Seo, and Prasanna Velagapudi October 16, 2007 Robotics Institute Carnegie

More information

CS Introduction to Data Mining Instructor: Abdullah Mueen

CS Introduction to Data Mining Instructor: Abdullah Mueen CS 591.03 Introduction to Data Mining Instructor: Abdullah Mueen LECTURE 8: ADVANCED CLUSTERING (FUZZY AND CO -CLUSTERING) Review: Basic Cluster Analysis Methods (Chap. 10) Cluster Analysis: Basic Concepts

More information

Multiple Constraint Satisfaction by Belief Propagation: An Example Using Sudoku

Multiple Constraint Satisfaction by Belief Propagation: An Example Using Sudoku Multiple Constraint Satisfaction by Belief Propagation: An Example Using Sudoku Todd K. Moon and Jacob H. Gunther Utah State University Abstract The popular Sudoku puzzle bears structural resemblance to

More information

Mixture Models and the EM Algorithm

Mixture Models and the EM Algorithm Mixture Models and the EM Algorithm Padhraic Smyth, Department of Computer Science University of California, Irvine c 2017 1 Finite Mixture Models Say we have a data set D = {x 1,..., x N } where x i is

More information

A Survey on Personalized Travel Sequence Recommendation on Multi-Source Big Social Media

A Survey on Personalized Travel Sequence Recommendation on Multi-Source Big Social Media A Survey on Personalized Travel Sequence Recommendation on Multi-Source Big Social Media Chetan W. Rane 1, Supriya Sarkar 2 P.G. Student, Department of Computer Engineering, SKNSITS, Lonavala, Maharashtra,

More information

Feature Selecting Model in Automatic Text Categorization of Chinese Financial Industrial News

Feature Selecting Model in Automatic Text Categorization of Chinese Financial Industrial News Selecting Model in Automatic Text Categorization of Chinese Industrial 1) HUEY-MING LEE 1 ), PIN-JEN CHEN 1 ), TSUNG-YEN LEE 2) Department of Information Management, Chinese Culture University 55, Hwa-Kung

More information

A User Preference Based Search Engine

A User Preference Based Search Engine A User Preference Based Search Engine 1 Dondeti Swedhan, 2 L.N.B. Srinivas 1 M-Tech, 2 M-Tech 1 Department of Information Technology, 1 SRM University Kattankulathur, Chennai, India Abstract - In this

More information

Detecting and Analyzing Communities in Social Network Graphs for Targeted Marketing

Detecting and Analyzing Communities in Social Network Graphs for Targeted Marketing Detecting and Analyzing Communities in Social Network Graphs for Targeted Marketing Gautam Bhat, Rajeev Kumar Singh Department of Computer Science and Engineering Shiv Nadar University Gautam Buddh Nagar,

More information

Learning to Infer Social Ties in Large Networks

Learning to Infer Social Ties in Large Networks Learning to Infer Social Ties in Large Networks Wenbin Tang, Honglei Zhuang, and Jie Tang Department of Computer Science and Technology, Tsinghua University {tangwb06,honglei.zhuang}@gmail.com, jietang@tsinghua.edu.cn

More information

arxiv: v1 [stat.ml] 29 Nov 2016

arxiv: v1 [stat.ml] 29 Nov 2016 Probabilistic map-matching using particle filters Kira Kempinska 1, Toby Davies 1 and John Shawe-Taylor 2 arxiv:1611.09706v1 [stat.ml] 29 Nov 2016 1 Department of Security and Crime Science, University

More information

Probabilistic Abstraction Lattices: A Computationally Efficient Model for Conditional Probability Estimation

Probabilistic Abstraction Lattices: A Computationally Efficient Model for Conditional Probability Estimation Probabilistic Abstraction Lattices: A Computationally Efficient Model for Conditional Probability Estimation Daniel Lowd January 14, 2004 1 Introduction Probabilistic models have shown increasing popularity

More information

Tag Based Image Search by Social Re-ranking

Tag Based Image Search by Social Re-ranking Tag Based Image Search by Social Re-ranking Vilas Dilip Mane, Prof.Nilesh P. Sable Student, Department of Computer Engineering, Imperial College of Engineering & Research, Wagholi, Pune, Savitribai Phule

More information

A Constrained Spreading Activation Approach to Collaborative Filtering

A Constrained Spreading Activation Approach to Collaborative Filtering A Constrained Spreading Activation Approach to Collaborative Filtering Josephine Griffith 1, Colm O Riordan 1, and Humphrey Sorensen 2 1 Dept. of Information Technology, National University of Ireland,

More information

QUANTIZER DESIGN FOR EXPLOITING COMMON INFORMATION IN LAYERED CODING. Mehdi Salehifar, Tejaswi Nanjundaswamy, and Kenneth Rose

QUANTIZER DESIGN FOR EXPLOITING COMMON INFORMATION IN LAYERED CODING. Mehdi Salehifar, Tejaswi Nanjundaswamy, and Kenneth Rose QUANTIZER DESIGN FOR EXPLOITING COMMON INFORMATION IN LAYERED CODING Mehdi Salehifar, Tejaswi Nanjundaswamy, and Kenneth Rose Department of Electrical and Computer Engineering University of California,

More information

A Recommender System Based on Improvised K- Means Clustering Algorithm

A Recommender System Based on Improvised K- Means Clustering Algorithm A Recommender System Based on Improvised K- Means Clustering Algorithm Shivani Sharma Department of Computer Science and Applications, Kurukshetra University, Kurukshetra Shivanigaur83@yahoo.com Abstract:

More information

Clustering Lecture 5: Mixture Model

Clustering Lecture 5: Mixture Model Clustering Lecture 5: Mixture Model Jing Gao SUNY Buffalo 1 Outline Basics Motivation, definition, evaluation Methods Partitional Hierarchical Density-based Mixture model Spectral methods Advanced topics

More information

Image retrieval based on bag of images

Image retrieval based on bag of images University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 2009 Image retrieval based on bag of images Jun Zhang University of Wollongong

More information

Inferring Mobility Relationship via Graph Embedding

Inferring Mobility Relationship via Graph Embedding 147 Inferring Mobility Relationship via Graph Embedding YANWEI YU, Pennsylvania State University, USA HONGJIAN WANG, Pennsylvania State University, USA ZHENHUI LI, Pennsylvania State University, USA Inferring

More information

A Parallel Evolutionary Algorithm for Discovery of Decision Rules

A Parallel Evolutionary Algorithm for Discovery of Decision Rules A Parallel Evolutionary Algorithm for Discovery of Decision Rules Wojciech Kwedlo Faculty of Computer Science Technical University of Bia lystok Wiejska 45a, 15-351 Bia lystok, Poland wkwedlo@ii.pb.bialystok.pl

More information

Analysis of Directional Beam Patterns from Firefly Optimization

Analysis of Directional Beam Patterns from Firefly Optimization Analysis of Directional Beam Patterns from Firefly Optimization Nicholas Misiunas, Charles Thompson and Kavitha Chandra Center for Advanced Computation and Telecommunications Department of Electrical and

More information

Detecting Anomalous Trajectories and Traffic Services

Detecting Anomalous Trajectories and Traffic Services Detecting Anomalous Trajectories and Traffic Services Mazen Ismael Faculty of Information Technology, BUT Božetěchova 1/2, 66 Brno Mazen.ismael@vut.cz Abstract. Among the traffic studies; the importance

More information

An Information Theory Approach to Identify Sets of Key Players

An Information Theory Approach to Identify Sets of Key Players An Information Theory Approach to Identify Sets of Key Players Daniel Ortiz-Arroyo and Akbar Hussain Electronics Department Aalborg University Niels Bohrs Vej 8, 6700 Esbjerg Denmark do@cs.aaue.dk, akbar@cs.aaue.dk

More information

CS224W Project Write-up Static Crawling on Social Graph Chantat Eksombatchai Norases Vesdapunt Phumchanit Watanaprakornkul

CS224W Project Write-up Static Crawling on Social Graph Chantat Eksombatchai Norases Vesdapunt Phumchanit Watanaprakornkul 1 CS224W Project Write-up Static Crawling on Social Graph Chantat Eksombatchai Norases Vesdapunt Phumchanit Watanaprakornkul Introduction Our problem is crawling a static social graph (snapshot). Given

More information

Location-Based Social Networks: Locations

Location-Based Social Networks: Locations Chapter 9 Location-Based Social Networks: Locations Yu Zheng and Xing Xie Abstract While chapter 8 studies the research philosophy behind a location-based social network (LBSN) from the point of view of

More information

Transductive Phoneme Classification Using Local Scaling And Confidence

Transductive Phoneme Classification Using Local Scaling And Confidence 202 IEEE 27-th Convention of Electrical and Electronics Engineers in Israel Transductive Phoneme Classification Using Local Scaling And Confidence Matan Orbach Dept. of Electrical Engineering Technion

More information

Where Next? Data Mining Techniques and Challenges for Trajectory Prediction. Slides credit: Layla Pournajaf

Where Next? Data Mining Techniques and Challenges for Trajectory Prediction. Slides credit: Layla Pournajaf Where Next? Data Mining Techniques and Challenges for Trajectory Prediction Slides credit: Layla Pournajaf o Navigational services. o Traffic management. o Location-based advertising. Source: A. Monreale,

More information

1) Give decision trees to represent the following Boolean functions:

1) Give decision trees to represent the following Boolean functions: 1) Give decision trees to represent the following Boolean functions: 1) A B 2) A [B C] 3) A XOR B 4) [A B] [C Dl Answer: 1) A B 2) A [B C] 1 3) A XOR B = (A B) ( A B) 4) [A B] [C D] 2 2) Consider the following

More information

Classification. 1 o Semestre 2007/2008

Classification. 1 o Semestre 2007/2008 Classification Departamento de Engenharia Informática Instituto Superior Técnico 1 o Semestre 2007/2008 Slides baseados nos slides oficiais do livro Mining the Web c Soumen Chakrabarti. Outline 1 2 3 Single-Class

More information

IMPROVING THE RELEVANCY OF DOCUMENT SEARCH USING THE MULTI-TERM ADJACENCY KEYWORD-ORDER MODEL

IMPROVING THE RELEVANCY OF DOCUMENT SEARCH USING THE MULTI-TERM ADJACENCY KEYWORD-ORDER MODEL IMPROVING THE RELEVANCY OF DOCUMENT SEARCH USING THE MULTI-TERM ADJACENCY KEYWORD-ORDER MODEL Lim Bee Huang 1, Vimala Balakrishnan 2, Ram Gopal Raj 3 1,2 Department of Information System, 3 Department

More information

Building Classifiers using Bayesian Networks

Building Classifiers using Bayesian Networks Building Classifiers using Bayesian Networks Nir Friedman and Moises Goldszmidt 1997 Presented by Brian Collins and Lukas Seitlinger Paper Summary The Naive Bayes classifier has reasonable performance

More information

Generalized Inverse Reinforcement Learning

Generalized Inverse Reinforcement Learning Generalized Inverse Reinforcement Learning James MacGlashan Cogitai, Inc. james@cogitai.com Michael L. Littman mlittman@cs.brown.edu Nakul Gopalan ngopalan@cs.brown.edu Amy Greenwald amy@cs.brown.edu Abstract

More information

Enhancing Clustering Results In Hierarchical Approach By Mvs Measures

Enhancing Clustering Results In Hierarchical Approach By Mvs Measures International Journal of Engineering Research and Development e-issn: 2278-067X, p-issn: 2278-800X, www.ijerd.com Volume 10, Issue 6 (June 2014), PP.25-30 Enhancing Clustering Results In Hierarchical Approach

More information

SINGLE PASS DEPENDENT BIT ALLOCATION FOR SPATIAL SCALABILITY CODING OF H.264/SVC

SINGLE PASS DEPENDENT BIT ALLOCATION FOR SPATIAL SCALABILITY CODING OF H.264/SVC SINGLE PASS DEPENDENT BIT ALLOCATION FOR SPATIAL SCALABILITY CODING OF H.264/SVC Randa Atta, Rehab F. Abdel-Kader, and Amera Abd-AlRahem Electrical Engineering Department, Faculty of Engineering, Port

More information

Recommending friends and locations over a Heterogeneous Spatio-Temporal graph

Recommending friends and locations over a Heterogeneous Spatio-Temporal graph Recommending friends and locations over a Heterogeneous Spatio-Temporal graph Pavlos Kefalas and Panagiotis Symeonidis Department of Informatics, Aristotle University of Thessaloniki, 54124, Greece {kefalasp,symeon}@csd.auth.gr

More information

Hotel Recommendation Based on Hybrid Model

Hotel Recommendation Based on Hybrid Model Hotel Recommendation Based on Hybrid Model Jing WANG, Jiajun SUN, Zhendong LIN Abstract: This project develops a hybrid model that combines content-based with collaborative filtering (CF) for hotel recommendation.

More information

Using PageRank in Feature Selection

Using PageRank in Feature Selection Using PageRank in Feature Selection Dino Ienco, Rosa Meo, and Marco Botta Dipartimento di Informatica, Università di Torino, Italy {ienco,meo,botta}@di.unito.it Abstract. Feature selection is an important

More information

A PROPOSED HYBRID BOOK RECOMMENDER SYSTEM

A PROPOSED HYBRID BOOK RECOMMENDER SYSTEM A PROPOSED HYBRID BOOK RECOMMENDER SYSTEM SUHAS PATIL [M.Tech Scholar, Department Of Computer Science &Engineering, RKDF IST, Bhopal, RGPV University, India] Dr.Varsha Namdeo [Assistant Professor, Department

More information

New Message-Passing Decoding Algorithm of LDPC Codes by Partitioning Check Nodes 1

New Message-Passing Decoding Algorithm of LDPC Codes by Partitioning Check Nodes 1 New Message-Passing Decoding Algorithm of LDPC Codes by Partitioning Check Nodes 1 Sunghwan Kim* O, Min-Ho Jang*, Jong-Seon No*, Song-Nam Hong, and Dong-Joon Shin *School of Electrical Engineering and

More information

arxiv: v1 [cs.si] 12 Jan 2019

arxiv: v1 [cs.si] 12 Jan 2019 Predicting Diffusion Reach Probabilities via Representation Learning on Social Networks Furkan Gursoy furkan.gursoy@boun.edu.tr Ahmet Onur Durahim onur.durahim@boun.edu.tr arxiv:1901.03829v1 [cs.si] 12

More information

Particle Swarm Optimization applied to Pattern Recognition

Particle Swarm Optimization applied to Pattern Recognition Particle Swarm Optimization applied to Pattern Recognition by Abel Mengistu Advisor: Dr. Raheel Ahmad CS Senior Research 2011 Manchester College May, 2011-1 - Table of Contents Introduction... - 3 - Objectives...

More information

Sensor Tasking and Control

Sensor Tasking and Control Sensor Tasking and Control Outline Task-Driven Sensing Roles of Sensor Nodes and Utilities Information-Based Sensor Tasking Joint Routing and Information Aggregation Summary Introduction To efficiently

More information

A Game Map Complexity Measure Based on Hamming Distance Yan Li, Pan Su, and Wenliang Li

A Game Map Complexity Measure Based on Hamming Distance Yan Li, Pan Su, and Wenliang Li Physics Procedia 22 (2011) 634 640 2011 International Conference on Physics Science and Technology (ICPST 2011) A Game Map Complexity Measure Based on Hamming Distance Yan Li, Pan Su, and Wenliang Li Collage

More information

DIAL: A Distributed Adaptive-Learning Routing Method in VDTNs

DIAL: A Distributed Adaptive-Learning Routing Method in VDTNs : A Distributed Adaptive-Learning Routing Method in VDTNs Bo Wu, Haiying Shen and Kang Chen Department of Electrical and Computer Engineering Clemson University, Clemson, South Carolina 29634 {bwu2, shenh,

More information