Some rankings based on PageRank applied to the Valencia Metro

Similar documents
A Multilayer Model of Computer Networks

On Aggregation of Network Multiplexity

My Best Current Friend in a Social Network

EDITORS. Proceedings of the Workshop on Applied Topological Structures WATS'17. Editorial Universitat Politècnica de València

arxiv: v2 [physics.soc-ph] 29 Jan 2016

Dynamically Motivated Models for Multiplex Networks 1

IN logistics, it is necessary to planning to accomplish

Multiplex networks: a Generative Model and Algorithmic Complexity

Solving dynamic memory allocation problems in embedded systems with parallel variable neighborhood search strategies

Global dynamic routing for scale-free networks

Topology Optimization of Multiple Load Case Structures

Proximity Prestige using Incremental Iteration in Page Rank Algorithm

figures/diapbanxicofondoverde.png

COMP Page Rank

P-spline ANOVA-type interaction models for spatio-temporal smoothing

An Introduction to Multilayer Networks

An array is a collection of data that holds fixed number of values of same type. It is also known as a set. An array is a data type.

arxiv: v1 [physics.soc-ph] 29 Nov 2012

Spectrum Allocation Policies in Fragmentation Aware and Balanced Load Routing for Elastic Optical Networks

Simulation and Experimental Research Activities on Airborne Wind Energy Systems at UC3M

REGRESSION ANALYSIS : LINEAR BY MAUAJAMA FIRDAUS & TULIKA SAHA

Weighted Page Rank Algorithm based on In-Out Weight of Webpages

Open Access Research on the Prediction Model of Material Cost Based on Data Mining

Mathematical Analysis of Google PageRank

Centrality Rankings in Multiplex Networks

Algebraic Iterative Methods for Computed Tomography

CONVERGENCE RANK AND ITS APPLICATIONS

Deduction and Logic Implementation of the Fractal Scan Algorithm

Web Structure Mining using Link Analysis Algorithms

HUBify: Efficient Estimation of Central Entities across Multiplex Layer Compositions

The flare Package for High Dimensional Linear Regression and Precision Matrix Estimation in R

Seismic regionalization based on an artificial neural network

PARAMETERIZATION AND SAMPLING DESIGN FOR WATER NETWORKS DEMAND CALIBRATION USING THE SINGULAR VALUE DECOMPOSITION: APPLICATION TO A REAL NETWORK

Random Permutations, Random Sudoku Matrices and Randomized Algorithms

On the construction of Tanner graphs

A Modified Algorithm to Handle Dangling Pages using Hypothetical Node

Iteratively Re-weighted Least Squares for Sums of Convex Functions

STUDY OF THE DEVELOPMENT OF THE STRUCTURE OF THE NETWORK OF SOFIA SUBWAY

Jordan Boyd-Graber University of Maryland. Thursday, March 3, 2011

Sparse wavelet expansions for seismic tomography: Methods and algorithms

Distance-Hereditary Embeddings of Circulant Graphs

Mathematical Methods and Computational Algorithms for Complex Networks. Benard Abola

CENTRALITIES. Carlo PICCARDI. DEIB - Department of Electronics, Information and Bioengineering Politecnico di Milano, Italy

University of Maryland. Tuesday, March 2, 2010

Parameter Estimation of a DC Motor-Gear-Alternator (MGA) System via Step Response Methodology

Chapter 1 A New Parallel Algorithm for Computing the Singular Value Decomposition

FUZZY BOOLEAN ALGEBRAS AND LUKASIEWICZ LOGIC. Angel Garrido

Cellular Learning Automata-Based Color Image Segmentation using Adaptive Chains

Fast Iterative Solvers for Markov Chains, with Application to Google's PageRank. Hans De Sterck

Link Analysis in the Cloud

to ensure that both image processing and the intermediate representation of the coefficients are performed without significantly losing quality. The r

An R Package flare for High Dimensional Linear Regression and Precision Matrix Estimation

A Super Non-dominated Point for Multi-objective Transportation Problem

Appendix Brief Notes on Geometric Programming and Successive Convex Approximation

ME 391Q Network Flow Programming

Keyword: Quadratic Bézier Curve, Bisection Algorithm, Biarc, Biarc Method, Hausdorff Distances, Tolerance Band.

Centralities (4) By: Ralucca Gera, NPS. Excellence Through Knowledge

BIG DATA-DRIVEN FAST REDUCING THE VISUAL BLOCK ARTIFACTS OF DCT COMPRESSED IMAGES FOR URBAN SURVEILLANCE SYSTEMS

Analysis of guifi.net s Topology

I How does the formulation (5) serve the purpose of the composite parameterization

Soft Threshold Estimation for Varying{coecient Models 2 ations of certain basis functions (e.g. wavelets). These functions are assumed to be smooth an

Improving the quantitative testing of fast aspherics surfaces with null screen using Dijkstra algorithm

MSEC PLANT LAYOUT OPTIMIZATION CONSIDERING THE EFFECT OF MAINTENANCE

An Edge-Swap Heuristic for Finding Dense Spanning Trees

A substructure based parallel dynamic solution of large systems on homogeneous PC clusters

Lower estimate of the square-to-linear ratio for regular Peano curves

ELEG Compressive Sensing and Sparse Signal Representations

arxiv: v3 [math.co] 19 Nov 2015

SVD-Wavelet Algorithm for image compression

2.3 Algorithms Using Map-Reduce

Supplementary Material : Partial Sum Minimization of Singular Values in RPCA for Low-Level Vision

PROPERLY EVEN HARMONIOUS LABELINGS OF DISJOINT UNIONS WITH EVEN SEQUENTIAL GRAPHS

Optimal Detector Locations for OD Matrix Estimation

Definition 2.3: [5] Let, and, be two simple graphs. Then the composition of graphs. and is denoted by,

Calculating Web Page Authority Using the PageRank Algorithm. Math 45, Fall 2005 Levi Gill and Jacob Miles Prystowsky

Properly even harmonious labelings of disconnected graphs

Value added and participation in Global Value Chains: the case of Spain

A Connection between Network Coding and. Convolutional Codes

GRAPH COLORING INCONSISTENCIES IN IMAGE SEGMENTATION

SOM+EOF for Finding Missing Values

Robot Mapping. Least Squares Approach to SLAM. Cyrill Stachniss

Graphbased. Kalman filter. Particle filter. Three Main SLAM Paradigms. Robot Mapping. Least Squares Approach to SLAM. Least Squares in General

APPLICATION OF ALGORITHMS FOR AUTOMATIC GENERATION OF HEXAHEDRAL FINITE ELEMENT MESHES

Processing and Others. Xiaojun Qi -- REU Site Program in CVMA

BUILDING MODEL RECONSTRUCTION FROM DATA INTEGRATION INTRODUCTION

Outerplanar Obstructions for the Feedback Vertex Set

How to declare an array in C?

LAB 2 VECTORS AND MATRICES

Distributed constraint satisfaction problems to model railway scheduling problems

A Soft Constraint-Based Approach to QoS-Aware Service Selection

OPTIMAL 3D COORDINATION TO MAXIMIZE THE AVAILABLE STOPPING SIGHT DISTANCE IN TWO-LANE ROADS

An algorithmic method to extend TOPSIS for decision-making problems with interval data

Software Reliability Models: Failure rate estimation

Information Driven Healthcare:

A Native Approach to Cell to Switch Assignment Using Firefly Algorithm

The Travelling Salesman Problem. in Fuzzy Membership Functions 1. Abstract

Modelling Data Segmentation for Image Retrieval Systems

Advanced Operations Research Techniques IE316. Quiz 1 Review. Dr. Ted Ralphs

Performance Evaluation of an Interior Point Filter Line Search Method for Constrained Optimization

An Improved k-shell Decomposition for Complex Networks Based on Potential Edge Weights

Transcription:

Int. J. Complex Systems in Science vol. (1) (201), pp. 9 77 Some rankings based on PageRank applied to the Valencia Metro Francisco Pedroche 1,, Miguel Romance 2 and Regino Criado 2 1 Institut de Matemàtica Multidisciplinària, Universitat Politècnica de València 2 Department of Applied Mathematics, Rey Juan Carlos University, & Center for Biomedical Technology (CTB), Technical University of Madrid, Spain Abstract. In this paper we apply a recent model of Multiplex PageRank to the multiplex network formed with the 9 lines of the metro of Valencia (Spain). We compute the PageRank vector following different approaches and we compare the results with those recently obtained for the Madrid metro system. Keywords: Google matrix, PageRank, Multiplex networks MSC 2000: 90B15,9C15, 15B51 Corresponding author: pedroche@imm.upv.es Received: July 27th, 201 Published: November 30th, 201 1. Introduction The definition and computation of centrality measures in multiplex networks is a very active line of research [2], [3], [10]. One of its applications is focused on transportation systems [5], [1]. In this paper we deal with a metro system and we have three goals. First, we want to compare the results obtained by some recent centrality measures based on PageRank. Second, we would like to confirm if the obtained trends when using the Madrid metro system in [12] still hold when dealing with the Valencia metro system. Third, we shall identify some issues that should be taking into account when considering a metro system as a (multiplex) network. Note that our main objective is to compare some centrality measures in these graphs more than offering a reliable tool for these systems. To that end, one should take into account some other features (see, e.g., [8]).

70 Some rankings based on PageRank applied to the Valencia Metro-Tram system 2. Methodology The Valencia metro and tram system [13] is formed by 9 line stations. The number of stations (stops) in each line is shown in table 1. All the lines are linear graphs except lines and (that have a cycle), and line (that is a tree). The total number of stations is 13. For each line we construct an adjacency matrix of size 13 in the following form: we put the real nodes (stations) of the line and we add the rest of the stations by putting a loop in them (actually, we put a loop in all the stations). In this form we can consider the whole system as a multiplex. For example, line 1 (which is linear) has 0 stations that contribute in the adjacency matrix with 39 links. By putting a loop to all the elements (that is, by putting 1 in the diagonal of the adjacency matrix) we have that the number of nonzero elements of the adjacency matrix corresponding to line 1 is: 39*2 + 13=212. Each of these adjacency matrix represents a layer of the multiplex network. Line # 1 2 3 5 7 8 9 Number of stations 0 33 27 33 18 19 1 22 Table 1: Number of stations in each line of the Valencia metro. In each layer, the nodes that do not belong to a line are called virtual nodes, while the rest are called real nodes. One of the great advantages of our formulation is that we have a very easy tool to distinguish (to penalize, actually) these nodes. It suffices to assign to them a sufficiently small component of the personalization vector in the corresponding layer. We follow the same criterium as in [12] and we take the following value to the component of the personalization vector of any real node v real = α, max r where α is the usual parameter of Google s PageRank, and max r is the maximum number of real nodes over all the layers. In this example, max r = 0 (see Table 1). Therefore, v real = 0.85/0 0.0212 in the computations presented in this paper. The rest of the components of the personalization vector in each layer are scaled such as all the components of the personalization vector sum up to 1. 3. Ranking by degree A first ranking of the lines can be done by considering the degree of each node, that is, the number of links of each station. To perform this computation we

F. Pedroche et al 71 Ranking degree Name of the station Lines 1 5 Empalme 1, 2, 2 Àngel Guimerà 1, 2, 3, 5, 9 Benimaclet 3,,, 9 3 Alameda 3, 5, 7, 9 3 Colón 3, 5, 7, 9 3 Dr. Lluch, 3 Jesús 1, 2, 7 3 La Cadena, 3 Les Arenes, 3 Mediterrani 3 Primat Reig, 3 Rosas 3, 5, 9 3 Torrent 1, 2, 7 3 TVV 3 Vicent Andrés Estellés Table 2: Top 15 ranking by degree can construct the graph formed by the union of the 9 graphs corresponding to the lines (we call this graph the projected graph). We convert the resulting adjacency matrix to a matrix of (0, 1) elements. That is, we are not weighting by using the number of repeated links. For example, node 7 (Colon) and node 75 (Xàtiva) are connected by three lines that share the same tracks. As a result, the degree of station Colon is 3, although this station belongs to lines (by the way, this is an important difference with the Madrid metro system: in Madrid the majority of the lines do not share the tracks with other lines). The top-15 ranking by the degree of the nodes in the projected graph is shown in Table 2. Note that nodes with the same degree are sorted alphabetically. In the whole system there are 13 stations with degree 3.. Ranking by PageRank of the projected graph By considering the projected graph as before we can compute the usual PageRank. In this case we use an homogeneous distribution of the personalization vector, that is v = e/n. Note that all the nodes in the projected graph are real nodes and therefore we don t need to use virtual nodes. in Table 3 we show the resulting top-15 ranking. Only 107 iterations were needed to obtain convergence with a tolerance for the stopping criterium of tol = 10 10 (the

72 Some rankings based on PageRank applied to the Valencia Metro-Tram system Ranking PageRank Name of the station Lines 1 0.015309 Empalme 1, 2, 2 0.0122289 Benimaclet 3,,, 9 3 0.0121009 Àngel Guimerà 1, 2, 3, 5, 9 0.01123 Rosas 3, 5, 9 5 0.01108987 TVV 0.01098181 Torrent 1, 2, 7 7 0.010893 Vicent Andrés Estellés 8 0.0099722 Grau-Canyamelar, 8 9 0.009200 Primat Reig, 10 0.0092137 Jesús 1, 2, 7 11 0.009233 La Cadena, 12 0.009281 Alameda 3, 5, 7, 9 13 0.0091132 Colón 3, 5, 7, 9 1 0.0089131 Mediterrani 15 0.00878211 Las Arenas, Table 3: Top 15 ranking by usual PageRank of the projected graph iterative method stops when the norm of the difference of two consecutive iterated vectors is lower than tol). 5. Ranking by multiplex PageRank By using the multilayer model defined in [12] and the personalization vectors explained above we obtain the top-15 ranking shown in Table. 922 iterations are needed in order to obtain convergence with the same stopping tolerance as before, tol = 10 10. In this example, the test spent about 10 seconds of real clock time in a standard computer.. Ranking by average PageRank To serve as a further comparison, in this section we calculate the usual PageRank of each layer. To that end we use the concept of real and virtual nodes explained above and the corresponding personalization vector. After that, we assign to each node the mean value of the 9 corresponding values of the PageRank in each layer for that node. The top-15 ranking of this average PageRank for each node is shown in Table 5.

F. Pedroche et al 73 Ranking Multiplex PageRank Name of the station Lines 1 0.01011508 Àngel Guimerà 1, 2, 3, 5, 9 2 0.0100782 València Sud 1, 2, 7 3 0.0100990 Sant Isidre 1, 2, 7 0.01003807 Paiporta 1, 2, 7 5 0.0100377 Avinguda del Cid 3, 5, 9 0.0100157 Safranar 1, 2, 7 7 0.009989 Nou d Octubre 3, 5, 9 8 0.00997018 Xàtiva 3, 5, 9 9 0.0099501 Mislata 3, 5, 9 10 0.0099 Picanya 1, 2, 7 11 0.00992008 Mislata-Almassil 3, 5, 9 12 0.0099012 Patraix 1, 2, 7 13 0.00989700 La Cadena, 1 0.00988808 Faitanar 3, 5, 9 15 0.0098703 Quart de Poblet 3, 5, 9 Table : Top 15 ranking by Multiplex PageRank Ranking Average PageRank Name of the station Lines 1 0.011020 Àngel Guimerà 1, 2, 3, 5, 9 2 0.0119298 Benimaclet 3,,, 9 3 0.0118921 Alameda 3, 5, 7, 9 0.011502 Colón 3, 5, 7, 9 5 0.0107887 Torrent 1, 2, 7 0.010002 Empalme 1, 2, 7 0.0105702 Picanya 1, 2, 7 8 0.010285 Paiporta 1, 2, 7 9 0.0103815 Rosas 3, 5, 9 10 0.0103707 València Sud 1, 2, 7 11 0.0103238 Sant Isidre 1, 2, 7 12 0.01032881 Safranar 1, 2, 7 13 0.01032279 Patraix 1, 2, 7 1 0.01032111 Jesús 1, 2, 7 15 0.0101257 Manises 3, 5, 9 Table 5: Top 15 ranking by average PageRank

7 Some rankings based on PageRank applied to the Valencia Metro-Tram system 7 5 degree 3 2 1 8 10 12 1 1 PR proj x10 3 Figure 1: PageRank of the projected graph vs the degree of each node, 7. Comparisons By using a linear regression we obtain that the values of the degree of each node correlate with the usual (projected) PageRank with a value of the squared coefficient of determination r 2 = 0.8. Furthermore, taking into account that both rankings have ties we also employ the Kendall coefficient τ with penalty parameter (see [, 11, ]) to perform this comparison. This value results to be τ = 0.80 and hence there is a soft correlation between degree and PageRank, as it was expected. By using linear regression we get that the values of the PageRank of the projected graph correlate with the multiplex PageRank with r 2 = 0.12. The value of the Kendall coefficient results to be τ = 0.081. Finally, by using a linear regression we obtain that the values of the average PageRank correlate with the multiplex PageRank with a value of r 2 = 0.752 and the value of the Kendall coefficient results to be τ = 0.0. We also compute the correlation of of the degree of each node with the multiplex PageRank, obtaining r 2 = 0.23 and τ = 0.57. In this results we see that the correlaton between multiplex PageRank and average PageRank is greater than in the case of the Madrid metro system. This could be due to the fact that in Valencia there are more lines than share the same graph and therefore, the mean of the PageRank in each layer is closer to the multiplex PageRank than in the case of Madrid in which each line is represented with a different graph.

75 1 1 1 1 12 12 PRM x10 3 PRM x10 3 F. Pedroche et al 10 8 10 8 2 2 8 10 PRproj x10 3 12 1 1 8 10 12 1 1 PRaverage x10 3 Figure 2: PageRank of the projected graph vs multiplex PageRank (left panel) and average PageRank vs multiplex PageRank (right panel) 8. Conclusions We have shown that the basic principles outlined in [12] to apply the multiplex PageRank to a metro system (Madrid) still work to the valencian case. In particular, the criterium for assigning values for the personalization component of virtual nodes still work here and all the methodology can be adapted. Regarding the networks, we have noticed that there are some differences in the topological properties of the Madrid and Valencia metro systems. For example, in Madrid there s a cycle line, while in Valencia there is not. Other important difference is that in Valencia there a a great number of lines that share the same tracks. One issue that should be taken into account in future works is the convenience of taking into account the number of tracks that a line has (e.g, line 1 in Valencia metro has one track in some parts and two tracks in other parts). Other feature to be taken into consideration is that some lines are connected to others in their endings. As a consequence, the start and ending of a line do not have degree one. Regarding the results we have obtained that there exists a soft correlation between degree and PageRank of the projected graph (like in the Madrid metro) but a greater correlation between multiplex PageRank and average PageRank than in the case of Madrid. We think this difference is due to the main feature of the valencian metro: some lines share the same tracks, and therefore the mean of the PageRanks of each layer is more similar to the resulting PageRank of the multiplex network than in the Madrid case.

7 Some rankings based on PageRank applied to the Valencia Metro-Tram system Acknowledgements This work has been partially supported by the project MTM201-5990 (Spanish Ministry) and the grant URJC-Grupo de Excelencia Investigadora GARECOM (201-201). References [1] A. Aleta, S. Meloni, and Y. Moreno. A multilayer perspective for the analysis of urban transportation systems arxiv:107.00072v1 (201) [2] F. Battiston, V. Nicosia, V., and V. Latora. Structural measures for multiplex networks. Physical Review E 89 (201), 03280 [3] S. Boccaletti, G. Bianconi, R. Criado, C.I. del Genio, J. Gómez-Gardeñes, M. Romance, I. Sendiña-Nadal, Z. Wang and M. Zanin, The structure and dynamics of multilayer networks Physics Reports 5(1) (201), 1-122. [] R. Criado, E. García, F. Pedroche and M. Romance, A new method for comparing rankings through complex networks: Model and analysis of competitiveness of major European soccer leagues, Chaos 23 (2013), 0311. doi:10.103/1.82 [5] S. Derrible, Network Centrality of Metro Systems, PLoS ONE, 7(7) (2012), e0575. doi:10.1371/journal.pone.000575 [] R. Fagin, R. Kumar, M. Mahdian, D. Sivakumar and E. Vee, Comparing Partial Rankings, SIAM J. Discrete Math., 20(3) (200), 28-8. doi:10.1137/0503088x [7] E. García, F. Pedroche and M. Romance, On the localization of the Personalized PageRank of Complex Networks, Linear Algebra and its Applications 39, 0 (2013). [8] D. Gattuso, and E. Miriello. Compared Analysis of Metro Networks Supported by Graph Theory. Networks and Spatial Economics, 5: (2005) 3951. [9] A. Halu, R. J. Mondragón, P. Panzarasa,and G. Bianconi, Multiplex PageRank, Plos One, Vol. 8, issue 10, 2013. [10] M. Kivelä, A. Arenas, M Barthelemy, J.P. Gleeson, Y. Moreno and M.A. Porter, Multilayer networks, Journal of Complex Networks 2 (3) (201), 203-271.

F. Pedroche et al 77 [11] F. Pedroche, R. Criado, E. García, M. Romance and V.E. Sánchez, Comparing series of rankings with ties by using complex networks: An analysis of the Spanish stock market (IBEX-35 index), Networks Het.Media 10 (1) (2015), 101-125. doi:10.393/nhm.2015.10.101 [12] F. Pedroche, M. Romance, R. Criado, A biplex approach to PageRank centrality: From classic to multiplex networks, Chaos: An Interdisciplinary Journal of Nonlinear Science. 2, 05301 (201) [13] http://www.metrovalencia.es/