Topology of the Erasmus student mobility network

Similar documents
Example for calculation of clustering coefficient Node N 1 has 8 neighbors (red arrows) There are 12 connectivities among neighbors (blue arrows)

Overlay (and P2P) Networks

TELCOM2125: Network Science and Analysis

Network Theory: Social, Mythological and Fictional Networks. Math 485, Spring 2018 (Midterm Report) Christina West, Taylor Martins, Yihe Hao

Properties of Biological Networks

Basics of Network Analysis

First-improvement vs. Best-improvement Local Optima Networks of NK Landscapes

Empirical analysis of online social networks in the age of Web 2.0

Lesson 18-1 Lesson Lesson 18-1 Lesson Lesson 18-2 Lesson 18-2

SNA 8: network resilience. Lada Adamic

1. Performance Comparison of Interdependent and Isolated Systems

Introduction to network metrics

Topology Enhancement in Wireless Multihop Networks: A Top-down Approach

Supplementary text S6 Comparison studies on simulated data

(Social) Networks Analysis III. Prof. Dr. Daning Hu Department of Informatics University of Zurich

Wednesday, March 8, Complex Networks. Presenter: Jirakhom Ruttanavakul. CS 790R, University of Nevada, Reno

Section 2-2 Frequency Distributions. Copyright 2010, 2007, 2004 Pearson Education, Inc

An Empirical Analysis of Communities in Real-World Networks

Lesson 4. Random graphs. Sergio Barbarossa. UPC - Barcelona - July 2008

Getting to Know Your Data

THE DEPENDENCY NETWORK IN FREE OPERATING SYSTEM

Chapter 2 - Graphical Summaries of Data

Networks and stability

Supplementary material to Epidemic spreading on complex networks with community structures

RANDOM-REAL NETWORKS

ECE 158A - Data Networks

Summary: What We Have Learned So Far

CS224W: Analysis of Networks Jure Leskovec, Stanford University

Models of Network Formation. Networked Life NETS 112 Fall 2017 Prof. Michael Kearns

Dynamic network generative model

Universal Behavior of Load Distribution in Scale-free Networks

CSCI5070 Advanced Topics in Social Computing

Structural Analysis of Paper Citation and Co-Authorship Networks using Network Analysis Techniques

Online Social Networks and Media

Universal Properties of Mythological Networks Midterm report: Math 485

CS-E5740. Complex Networks. Scale-free networks

Network Thinking. Complexity: A Guided Tour, Chapters 15-16

Lecture Slides. Elementary Statistics Twelfth Edition. by Mario F. Triola. and the Triola Statistics Series. Section 2.1- #

Algorithmic and Economic Aspects of Networks. Nicole Immorlica

Constructing a G(N, p) Network

caution in interpreting graph-theoretic diagnostics

BMC Bioinformatics. Open Access. Abstract

Graph-theoretic Properties of Networks

Chapter 2. Frequency distribution. Summarizing and Graphing Data

Structural and Temporal Properties of and Spam Networks

Signal Processing for Big Data

Constructing a G(N, p) Network

Characteristics of Preferentially Attached Network Grown from. Small World

Complex Design Networks: Structure and Dynamics

A Generating Function Approach to Analyze Random Graphs

The Topology and Dynamics of Complex Man- Made Systems

An introduction to the physics of complex networks

CS 534: Computer Vision Segmentation and Perceptual Grouping

Downloaded from

Different network topologies

1 General description of the projects

Vocabulary. 5-number summary Rule. Area principle. Bar chart. Boxplot. Categorical data condition. Categorical variable.

2.1 Objectives. Math Chapter 2. Chapter 2. Variable. Categorical Variable EXPLORING DATA WITH GRAPHS AND NUMERICAL SUMMARIES

Graphing with Microsoft Excel

Analysis of guifi.net s Topology

Comparative Evaluation of Community Detection Algorithms: A Topological Approach

Analysis of Internet Topologies

Analytical reasoning task reveals limits of social learning in networks

arxiv: v3 [cs.ni] 3 May 2017

1 More configuration model

Community detection. Leonid E. Zhukov

Predict Topic Trend in Blogosphere

Cycles in Random Graphs

Frequency Distributions

Impact of Clustering on Epidemics in Random Networks

Stable Statistics of the Blogograph

Complex networks: A mixture of power-law and Weibull distributions

Large-Scale Networks. PageRank. Dr Vincent Gramoli Lecturer School of Information Technologies

An Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization

CAIM: Cerca i Anàlisi d Informació Massiva

Social, Information, and Routing Networks: Models, Algorithms, and Strategic Behavior

MATH: HIGH SCHOOL M.EE.A-CED.1

Global dynamic routing for scale-free networks

AND NUMERICAL SUMMARIES. Chapter 2

Section 1.1 The Distance and Midpoint Formulas

Plan of the lecture I. INTRODUCTION II. DYNAMICAL PROCESSES. I. Networks: definitions, statistical characterization, examples II. Modeling frameworks

Package fastnet. September 11, 2018

Statistical Analysis of the Metropolitan Seoul Subway System: Network Structure and Passenger Flows arxiv: v1 [physics.soc-ph] 12 May 2008

Extracting Information from Complex Networks

David Easley and Jon Kleinberg January 24, 2007

CSCI5070 Advanced Topics in Social Computing

Topic mash II: assortativity, resilience, link prediction CS224W

Graph Symmetry and Social Network Anonymization

Benchmarks for testing community detection algorithms on directed and weighted graphs with overlapping communities

Advanced Algorithms and Models for Computational Biology -- a machine learning approach

Sixth Grade SOL Tracker Name:

Package fastnet. February 12, 2018

Lecture Series on Statistics -HSTC. Frequency Graphs " Dr. Bijaya Bhusan Nanda, Ph. D. (Stat.)

Characterizing Web Usage Regularities with Information Foraging Agents

D&B Market Insight Release Notes. November, 2015

Specific Communication Network Measure Distribution Estimation

Phase Transitions in Random Graphs- Outbreak of Epidemics to Network Robustness and fragility

Models for the growth of the Web

Graph Theory. Graph Theory. COURSE: Introduction to Biological Networks. Euler s Solution LECTURE 1: INTRODUCTION TO NETWORKS.

An Evolving Network Model With Local-World Structure

Transcription:

Topology of the Erasmus student mobility network Aranka Derzsi, Noemi Derzsy, Erna Káptalan and Zoltán Néda

The Erasmus student mobility network Aim: study the network s topology (structure) and its characteristics for the directed and non-directed graphs, as well for the weighted and non-weighted graphs reveal the network of professional connections between universities Dataset: Data for year 2003 The number of universities that take part in the program in this period: 2333 The number of exchanged students: 134330 The students that take part in an Erasmus mobility form a connection with two universities: in which they study and the one they visit within the framework of the Erasmus program

The Erasmus student mobility network o Nodes: universities that are members of the Erasmus framework (2333) o Connections: students that benefit of an Erasmus scholarship and travel from one university to the other Two directed networks derive depending what type of connection is considered: Incoming: university A Outgoing: university A university B university B Two undirected networks derive In or outgoing: university A university B In and outgoing: university A university B For each case a weighted network can be constructed, as well (the strength of the links is proportional with the number of students in the given exchange)

The NNESM network topology large number of connections: L=134330 large number of nodes: N=2333 NOT informative visualization

The Erasmus Network k-degree component of the NNESM network for k=55. The 180 nodes of this component are presented in hierarchical arrangement. The most highly connected 10 hubs are indicated by their names.

Topological properties of the NNESM network degree distribution with exponential tail! Degree-rank plot on semi-log axis for the NNESM network (filled circles). The thick continuous line fits the exponential tail generated by the highly connected universities (more than 50 connections). high global clustering coefficient: CNNESMg=0.183 high local clustering coefficient: CNNESMl=0.292 several connected components: a giant one (99% of nodes) with average distance lnnesm=2.91

Topological properties of the NNESM network Cumulative degree distribution function r(k) is the rank-degree distibution function Cumulative distribution exponential => the tail of the degree distribution is also exponential

Selective linking (assortative mixing) NO selection based on nation or study language of the participating universities Do highly connected universities tend to connect to universities with large number of connections? Degree of a node neighbors (a) and the local clustering coefficients of the nodes (b) plotted as a function of the degree of a node (filled circles). The continuous line on both panels indicate averages of the points taken with a moving average of length 50 on the horizontal axes. Averages calculated for the network obtained from the configuration model appear with dashed black lines. NO assortative mixing present in the system!

Configuration model the model has same network size and degree distribution free-endpoint links Step 1: Preliminary link allocation procedure randomly select two free-end links belonging to different nodes and connect them multiple connections between two nodes are allowed process repeated until all free-end links are connected Step 2: Random local rewiring rewire randomly multiple connections process repeated until all multiple connections are eliminated from the system The generated network s topology few connected components C local C global l NNESM 0.292 0.183 2.91 Model 0.269 0.132 2.75 The model reproduces well the NNESM network s characteristics!

Understanding the exponential degree distribution Our assumption: k cn ~ i i n i ki degree in the NNESM network of a university with rank r=i ni the size of this university (total number of students) c proportionality constant the tail of the size-rank plot of the universities r(n) should follow an exponential trend

Random sampling universities from the NNESM network arranged in random order number of students for 150 universities looked up on the internet Degree of the universities in the NNESM network as a function of their sizes. Open circles are results for random sampling on 150 universities, and filled circles present the average for an interval of length 5000 on the horizontal axes. The continuous line is a linear fit of the data points. The tail of the size-rank distribution also follows an exponential trend

University size-distribution no data available for European universities University size-distribution by ranking plots: rank-size distribution of the largest UK (a) and US (b) universities. Exponential trend in the system! Exponential size-distribution observed in other group-sizes: Fish-school (J.J. Anderson, Fish. Bull. 79 (1981) 315 323) Cattle-farm (C.T.K. Ching, Am. J. Agric. Econ. 55 (1973) 500 502) (E. Bonabeau, L. Dagorn, Phys. Rev. E 51 (1995) R5220 R5223) (H.-S. Niwa, J. Theoret. Biol. 224 (2003) 451 457) maximum entropy principle

The directed and weighted network study Approach: take into account the direction of the mobility between pairs of universities (directedness) include the number of students traveling on each of these directed links (weight) construct two networks based on these ideas: IN (incoming links) network OUT (outgoing links) network construct a randomly directed network by using the undirected configuration network assign weights to the connections

The directed and weighted network study Part of the weighted ESM network: the figure shows the largest connected component obtained by using links on which there are at least 15 student exchanges in year 2003. This component contains 149 elements, the highlighted nodes are the hubs characterized with the most intense student mobilities.

Some characteristics of the directed and weighted network unidirectional links: 2/3 (random direction) bidirectional links: 1/3 partitioning the number of students on the directed links Scaling of the total number of students sent out from a given university as a function of the number of OUT connections the university has. The continuous line shows a power-law fit with exponent 1.17.

Adapting the configuration model for the directed and weighted network randomly change the links to symmetric or asymmetric type of connections Partitioning methods: 1. randomly assign the k OUT links 2. preferential linking Rank of universities as a function of their IN and OUT degree for the directed ESM network (filled symbols) and for randomly directed configuration network (open symbols). Comparison of weight distributions for the directed and weighted networks.

Conclusions NOT scale-free social network degree distribution function with exponential tail node degree proportional with university size: k ~ N exponential size-distribution explained with maximum entropy principle revealed the most important hubs in network simple random connection models are helpful in understanding the topology A. Derzsi, N. Derzsy, E. Káptalan and Z. Néda; Topology of the Erasmus student mobility network, Physica A, vol. 390, pp. 2601-2610 (2011)