CS-E5740. Complex Networks. Scale-free networks

Similar documents
Summary: What We Have Learned So Far

(Social) Networks Analysis III. Prof. Dr. Daning Hu Department of Informatics University of Zurich

Wednesday, March 8, Complex Networks. Presenter: Jirakhom Ruttanavakul. CS 790R, University of Nevada, Reno

Lesson 4. Random graphs. Sergio Barbarossa. UPC - Barcelona - July 2008

Models of Network Formation. Networked Life NETS 112 Fall 2017 Prof. Michael Kearns

CSE 258 Lecture 12. Web Mining and Recommender Systems. Social networks

CSE 158 Lecture 11. Web Mining and Recommender Systems. Social networks

Overlay (and P2P) Networks

COMP6237 Data Mining and Networks. Markus Brede. Lecture slides available here:

M.E.J. Newman: Models of the Small World

CAIM: Cerca i Anàlisi d Informació Massiva

Exercise set #2 (29 pts)

Characteristics of Preferentially Attached Network Grown from. Small World

CS249: SPECIAL TOPICS MINING INFORMATION/SOCIAL NETWORKS

Critical Phenomena in Complex Networks

Example for calculation of clustering coefficient Node N 1 has 8 neighbors (red arrows) There are 12 connectivities among neighbors (blue arrows)

CS-E5740. Complex Networks. Network analysis: key measures and characteristics

Machine Learning and Modeling for Social Networks

An Evolving Network Model With Local-World Structure

Advanced Algorithms and Models for Computational Biology -- a machine learning approach

Random Graphs CS224W

Erdős-Rényi Model for network formation

Topology and Dynamics of Complex Networks

Small-World Models and Network Growth Models. Anastassia Semjonova Roman Tekhov

Nick Hamilton Institute for Molecular Bioscience. Essential Graph Theory for Biologists. Image: Matt Moores, The Visible Cell

FITNESS-BASED GENERATIVE MODELS FOR POWER-LAW NETWORKS

Graph-theoretic Properties of Networks

An introduction to the physics of complex networks

Properties of Biological Networks

Lecture on Game Theoretical Network Analysis

Plan of the lecture I. INTRODUCTION II. DYNAMICAL PROCESSES. I. Networks: definitions, statistical characterization, examples II. Modeling frameworks

Complex Networks. Structure and Dynamics

V 1 Introduction! Mon, Oct 15, 2012! Bioinformatics 3 Volkhard Helms!

Introduction to Networks and Business Intelligence

Graph Theory. Graph Theory. COURSE: Introduction to Biological Networks. Euler s Solution LECTURE 1: INTRODUCTION TO NETWORKS.

6. Overview. L3S Research Center, University of Hannover. 6.1 Section Motivation. Investigation of structural aspects of peer-to-peer networks

A Generating Function Approach to Analyze Random Graphs

Higher order clustering coecients in Barabasi Albert networks

arxiv:cond-mat/ v5 [cond-mat.dis-nn] 16 Aug 2006

UNIVERSITA DEGLI STUDI DI CATANIA FACOLTA DI INGEGNERIA

Eciency of scale-free networks: error and attack tolerance

Universal Behavior of Load Distribution in Scale-free Networks

Homophily-Based Network Formation Models

Phase Transitions in Random Graphs- Outbreak of Epidemics to Network Robustness and fragility

RANDOM-REAL NETWORKS

V 2 Clusters, Dijkstra, and Graph Layout

Networks and stability

Networks and Discrete Mathematics

The Shape of the Internet. Slides assembled by Jeff Chase Duke University (thanks to Vishal Misra and C. Faloutsos)


6.207/14.15: Networks Lecture 5: Generalized Random Graphs and Small-World Model

Constructing a G(N, p) Network

arxiv:cond-mat/ v1 [cond-mat.dis-nn] 3 Aug 2000

Economic Networks. Theory and Empirics. Giorgio Fagiolo Laboratory of Economics and Management (LEM) Sant Anna School of Advanced Studies, Pisa, Italy

Structure of biological networks. Presentation by Atanas Kamburov

SI Networks: Theory and Application, Fall 2008

Epidemic spreading on networks

Network Thinking. Complexity: A Guided Tour, Chapters 15-16

Complex networks: A mixture of power-law and Weibull distributions

Small World Properties Generated by a New Algorithm Under Same Degree of All Nodes

An Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

arxiv: v1 [cs.si] 11 Jan 2019

Topic II: Graph Mining

Random Simplicial Complexes

Peer-to-Peer Data Management

Topic mash II: assortativity, resilience, link prediction CS224W

E6885 Network Science Lecture 5: Network Estimation and Modeling

The Topology and Dynamics of Complex Man- Made Systems

Supply chains involve complex webs of interactions among suppliers, manufacturers,

Social, Information, and Routing Networks: Models, Algorithms, and Strategic Behavior

CSCI5070 Advanced Topics in Social Computing

Social and Technological Network Data Analytics. Lecture 5: Structure of the Web, Search and Power Laws. Prof Cecilia Mascolo

Signal Processing for Big Data

Attack Vulnerability of Network with Duplication-Divergence Mechanism

Degree Distribution, and Scale-free networks. Ralucca Gera, Applied Mathematics Dept. Naval Postgraduate School Monterey, California

Failure in Complex Social Networks

Attack vulnerability of complex networks

Case Studies in Complex Networks

Constructing a G(N, p) Network

Exploiting the Scale-free Structure of the WWW

The missing links in the BGP-based AS connectivity maps

Hyperbolic Geometry of Complex Network Data

Introduction to network metrics

The Establishment Game. Motivation

A generative model for feedback networks

NETWORKS. David J Hill Research School of Information Sciences and Engineering The Australian National University

Basic Graph Properties

SNA 8: network resilience. Lada Adamic

How Do Real Networks Look? Networked Life NETS 112 Fall 2014 Prof. Michael Kearns

γ : constant Goett 2 P(k) = k γ k : degree

Distances in power-law random graphs

Specific Communication Network Measure Distribution Estimation

3 Global Properties of Networks

Synchronizability and Connectivity of Discrete Complex Systems

Impact of Clustering on Epidemics in Random Networks

Chapter 1. Social Media and Social Computing. October 2012 Youn-Hee Han

Internet as a Complex Network. Guanrong Chen City University of Hong Kong

Modeling Traffic of Information Packets on Graphs with Complex Topology

Scalable P2P architectures

Transcription:

CS-E5740 Complex Networks Scale-free networks

Course outline 1. Introduction (motivation, definitions, etc. ) 2. Static network models: random and small-world networks 3. Growing network models: scale-free networks 4. Percolation, error & attack tolerance of networks, epidemic models 5. Network analysis 6. Social networks & (socio)dynamic models 7. Weighted networks 8. Clustering, sampling, inference 9. Temporal networks & multilayer networks

From last week Network Degrees Paths Clustering Real-world Fat-tailed short high Regular lattices Fixed long high* Erdős-Renyi Poissonian short low Configuration Free to choose short low Watts-Strogatz Fixed to Poisson short high Barabási-Albert Power-law* (depending short on layout & clustering lowmeasure)

On Network Models Toy models of networks are common in network science These do not attempt to capture everything there is to networks Rather, the target is to design simplified models that capture some aspects of reality Such models may: Tell something about the origins of networks or their characteristics Allow for simulating processes on networks under controlled circumstances

Degree distributions in model networks so far...

...however, real-world networks look like this: the Internet subcontracting network of a car manufacturer K. Claffy world trade

there are HUBS, nodes of very high degree the Internet subcontracting network of a car manufacturer K. Claffy world trade

Degree examples P (ki) P (ki) Internet autonomous systems 10 0 10 1 10 2 10 3 10 4 10 5 10 6 10 7 10 8 10 9 10 10 10 0 10 1 10 2 10 3 10 4 10 5 10 0 10 1 10 2 10 3 10 4 10 5 10 6 10 7 10 8 last.fm social network 10 9 10 0 10 1 10 2 10 3 10 4 Degree k i 10 0 Flights between airports 10 1 10 2 10 3 10 4 10 5 10 0 10 1 10 2 10 3 10 10 1 10 2 10 3 10 4 10 5 0 Protein interactions (C.Elegans) 10 6 10 0 10 1 10 2 10 3 Degree k i

Degree distributions in real-world networks The tail of the degree distribution can often be approximated with a power law (merkki tarkoittaa verrannollinen networks with power-law distributed degrees are called scale-free networks

What does a power-law degree distribution mean? blue = power law many nodes of small degree red = Poisson degrees not very far from average <k> = <k> = 4

What does a power-law degree distribution mean? blue = power law many nodes of small degree red = Poisson degrees not very far from average the power law distribution has a fat tail <k> = <k> = 4

What does a power-law degree distribution mean? blue = power law many nodes of small degree red = Poisson degrees not very far from average the power law distribution has a fat tail so nodes of very high degrees are always present <k> = <k> = 4

Scale-Free Networks Networks with power-law degree distributions are called scale-free networks For comparison, the Poisson distribution behaves like this: This is because there is no characteristic scale in the distribution If degrees are rescaled, the form of the distribution does not change: Poisson power-law

Power laws are common See: A. Clauset et. al. POWER-LAW DISTRIBUTIONS IN EMPIRICAL DATA, SIAM Rev., 51(4), 661 703 (2009)

Power laws are common See: A. Clauset et. al. POWER-LAW DISTRIBUTIONS IN EMPIRICAL DATA, SIAM Rev., 51(4), 661 703 (2009)

The cumulative degree distribution is also a power law potenssilaki:

Moments of the power-law distribution (but if the moments of the underlying network are infinite, they will change if we get a bigger sample)

Disclaimer: reality check Note! there are other distributions that look like power laws (e.g. stretched exponential, lognormal) and those often better describe real data... not all networks are scale-free, but typically their degree distributions are still broad

Power-law fitting Power-laws give straight lines in log-log plots Draw points to log-log plot, fit a line (e.g. with least squares fit) Tong et al. Science (2004) Find maximum likelihood estimate for parameters of the PL distribution Goodness-of-fit tests (p-value for data being produced with PL) Test for alternative hypothesis (lognormal etc.) See: A. Clauset et. al. POWER-LAW DISTRIBUTIONS IN EMPIRICAL DATA, SIAM Rev., 51(4), 661 703 (2009)

Why are there power laws? Distribution of wealth Pareto distribution, 80/20 Rich get richer Word appearance in texts Zipf s law Any dynamic system where elements get larger with probability that is proportional to their current size (Simon 1955) Networks: preferential attachment

The Barabási-Albert scale-free model Model definition Take a small seed network (a few connected nodes) step 2. is called preferential attachment Repeat until you have N nodes: 1. Add a new node with m stubs (unconnected links) 2. Connect each stub to an existing node i, chosen with probability p i = k i / P k i.

Animation: B-A network growth (m=1)

Properties of the BA model P (k) = 2m2 k 3 C(k, N) / (ln N)2 N No k-dependence! Way too small for large k C(k, N)! 0 when N!1 ` / ln N ln(ln N)

Real-wold networks: between randomness and order Network Degrees Paths Clustering Real-world Fat-tailed short high Erdős-Renyi Poissonian short low Regular lattices Fixed long high* Configuration Free to choose short low Watts-Strogatz Fixed to Poisson short high Barabási-Albert Power-law short low * (depending on layout & clustering measure)

Q: which mechanism leads to preferential attachment in networks? A: any process where links are followed

Following a link leads to high degree nodes Follow a random link, what is the probability that node i with degree k i is picked? There are in total 2m= ki endpoints for links, a node with degree ki has ki endpoints leading to it Example network: 1 4 2 3 Each link picked with probability 1/4 1 2 2 4 2 3 3 4 p( follow random link, reach node i ) = After that, each node is picked with probability 1/2 p 1 = 1 4 1 2 = 1 8 p 2 =3 1 4 1 2 = 3 8 p 3 =2 1 4 1 2 = 2 8 p 4 =2 1 4 1 2 = 2 8 k i P j k j

Following a link leads to high degree nodes Follow a random link, what is the expected degree of the node at the end of the link? Nodes with degree k have k opportunities to be picked: p(k nn = k) / kp(k) Expected value depends on the 2nd moment: hk nn i = X k kp(k nn = k) = X k k kp(k) hki = hk2 i hki

Friendship paradox On average, a person has less friends than a friend has friends Direct consequence of following a link (and variation in degree distributions in social networks) hk nn i = hk2 i hki Applies to any type of network: Network Note that expected neighbour degree hki hk nn i p(k nn >k) Short messages 2.2 146 0.62 Airports & flights 11 65 0.93 Protein interaction 3.0 20 0.85 Internet AS 13 1445 0.96 hk nn iis different than expected average neighbour degree

Following link as implicit preferential attachment Links are often followed in networks even when it is not explicit Triadic closure (social networks) Vertex copying (protein interaction networks, citation networks) Random walks (WWW) Models of network growth with these processes as part of them lead to preferential attachment and power-laws

Other scale-free network models: Holme-Kim the Holme-Kim Model motivation: to get realistic clustering 1. preferential attachment 1. Take a small seed network 1. Take a small seed network 2. Create a new vertex with 2. Create a new vertex with m edges m edges probability p probability 1-p 3. Connect the first of the m edges to existing 3. Connect the first of the m edges to existing vertices with a probability proportional to vertices with a probability proportional to their degree k (just like BA) their degree k (just like BA) 4. 4. With With probability probability p, p, connect connect the the next next edge edge to to a a random random neighbour neighbour of of the the vertex vertex of of step step 3., 3., otherwise do 3. again otherwise do 3. again 5. Repeat 2.-4. until the network 5. Repeat 2.-4. until the network has grown to desired size has grown to desired size of N vertices of N vertices 2A. connect to neighbour (implicit preferential attachment) 2B. preferential attachment for large N, ie clustering more realistic! This type of clustering is found in many real-world networks.

Clustering coefficient as a function of degree Avg clustering given degree c(k) Avg clustering given degree c(k) 0.45 0.40 0.35 0.30 0.25 0.20 0.15 0.10 0.05 0.00 0.45 0.40 0.35 0.30 0.25 0.20 0.15 0.10 0.05 0.00 Internet autonomous systems 10 1 10 2 10 3 10 4 10 5 last.fm social network 10 1 10 2 10 3 10 4 Degree k 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0.0 0.07 0.06 0.05 0.04 0.03 0.02 0.01 0.00 Flights between airports 10 1 10 2 10 3 Protein interactions (C.Elegans) 10 1 10 2 10 3 Degree k

Real-wold networks: between randomness and order Network Degrees Paths Clustering Real-world Fat-tailed short high Erdős-Renyi Poissonian short low Regular lattices Fixed long high* Configuration Free to choose short low Watts-Strogatz Fixed to Poisson short high Barabási-Albert Power-law short low Holme-Kim Power-law short high * (depending on layout & clustering measure)

Other scale-free network models: vertex copying the vertex copying model (Kleinberg, Kumar): motivation: citations or WWW link lists are often copied a local explanation to preferential attachment asymptotically SF with γ 3 1. copy a vertex 1. 1. Take Take a a small small seed seed network network 2. rewire edges with p 2. 2. Pick Pick a a random random vertex vertex 3. Make a copy of it 3. Make a copy of it 4. 4. With With probability probability p, p, move move each each edge edge of of the the copy copy to to point point to to a a random random vertex vertex 5. Repeat 2.-4. until the network 5. Repeat 2.-4. until the network has has grown grown to to desired desired size size of N vertices of N vertices

Summary Real-wold networks have fat-tailed degree distributions Following a link -> preferential attachment Many realistic processes have this component Any model of network growth where one follows a link leads to scale-free networks Following a link twice from a node, or following link twice can be used to create clustering

Degree distributions Scale-free networks are resistant to random failures but vulnerable to attacks and good at spreading epidemics More on this next week matter