Example for calculation of clustering coefficient Node N 1 has 8 neighbors (red arrows) There are 12 connectivities among neighbors (blue arrows)

Similar documents
Properties of Biological Networks

Graph Theory. Graph Theory. COURSE: Introduction to Biological Networks. Euler s Solution LECTURE 1: INTRODUCTION TO NETWORKS.

How Do Real Networks Look? Networked Life NETS 112 Fall 2014 Prof. Michael Kearns

(Social) Networks Analysis III. Prof. Dr. Daning Hu Department of Informatics University of Zurich

Structure of biological networks. Presentation by Atanas Kamburov

Wednesday, March 8, Complex Networks. Presenter: Jirakhom Ruttanavakul. CS 790R, University of Nevada, Reno

Lesson 4. Random graphs. Sergio Barbarossa. UPC - Barcelona - July 2008

Signal Processing for Big Data

CS-E5740. Complex Networks. Scale-free networks

Critical Phenomena in Complex Networks

CAIM: Cerca i Anàlisi d Informació Massiva

Overlay (and P2P) Networks

V 2 Clusters, Dijkstra, and Graph Layout

Voluntary State Curriculum Algebra II

V 1 Introduction! Mon, Oct 15, 2012! Bioinformatics 3 Volkhard Helms!

arxiv:cond-mat/ v1 21 Oct 1999

Summary: What We Have Learned So Far

Models of Network Formation. Networked Life NETS 112 Fall 2017 Prof. Michael Kearns

RANDOM-REAL NETWORKS

Case Studies in Complex Networks

M.E.J. Newman: Models of the Small World

V 2 Clusters, Dijkstra, and Graph Layout"

Topology of the Erasmus student mobility network

CSCI5070 Advanced Topics in Social Computing

Advanced Algorithms and Models for Computational Biology -- a machine learning approach

V 2 Clusters, Dijkstra, and Graph Layout

Nick Hamilton Institute for Molecular Bioscience. Essential Graph Theory for Biologists. Image: Matt Moores, The Visible Cell

Biological Networks Analysis Dijkstra s algorithm and Degree Distribution

6. Overview. L3S Research Center, University of Hannover. 6.1 Section Motivation. Investigation of structural aspects of peer-to-peer networks

The Betweenness Centrality Of Biological Networks

TELCOM2125: Network Science and Analysis

E6885 Network Science Lecture 5: Network Estimation and Modeling

Basics of Network Analysis

Higher order clustering coecients in Barabasi Albert networks

Graph-theoretic Properties of Networks

Empirical analysis of online social networks in the age of Web 2.0

Ohio Tutorials are designed specifically for the Ohio Learning Standards to prepare students for the Ohio State Tests and end-ofcourse

Mathematics of networks. Artem S. Novozhilov

1 More configuration model

1 Large-scale structure in networks

Erdős-Rényi Model for network formation

Models for the growth of the Web

Network Thinking. Complexity: A Guided Tour, Chapters 15-16

Complex Networks. Structure and Dynamics

Peer-to-Peer Data Management

Response Network Emerging from Simple Perturbation

Clustering CS 550: Machine Learning

The quantitative analysis of interactions takes bioinformatics to the next higher dimension: we go from 1D to 2D with graph theory.

1 Random Graph Models for Networks

Distances in power-law random graphs

DOWNLOAD PDF BIG IDEAS MATH VERTICAL SHRINK OF A PARABOLA

Constructing a G(N, p) Network

The Topology and Dynamics of Complex Man- Made Systems

Network Theory: Social, Mythological and Fictional Networks. Math 485, Spring 2018 (Midterm Report) Christina West, Taylor Martins, Yihe Hao

Mathematics Scope & Sequence Algebra I

Graph Theory for Network Science

Link Prediction for Social Network

Graph Theory for Network Science

An Evolving Network Model With Local-World Structure

Universal Behavior of Load Distribution in Scale-free Networks

3 Global Properties of Networks

NETWORK BASICS OUTLINE ABSTRACT ORGANIZATION NETWORK MEASURES. 1. Network measures 2. Small-world and scale-free networks 3. Connectomes 4.

Scaling Properties of the Internet Graph

Degree Distribution: The case of Citation Networks

Networks and Discrete Mathematics

Basic Graph Properties

Special-topic lecture bioinformatics: Mathematics of Biological Networks

Cluster Analysis. Mu-Chun Su. Department of Computer Science and Information Engineering National Central University 2003/3/11 1

Eciency of scale-free networks: error and attack tolerance

A Study of Random Duplication Graphs and Degree Distribution Pattern of Protein-Protein Interaction Networks

Supply chains involve complex webs of interactions among suppliers, manufacturers,

1 Large-scale network structures

Minnesota Academic Standards for Mathematics 2007

Small-World Models and Network Growth Models. Anastassia Semjonova Roman Tekhov

CS249: SPECIAL TOPICS MINING INFORMATION/SOCIAL NETWORKS

Biological Networks Analysis Network Motifs. Genome 373 Genomic Informatics Elhanan Borenstein

Examples of Complex Networks

ECS 253 / MAE 253, Lecture 8 April 21, Web search and decentralized search on small-world networks

Universal Properties of Mythological Networks Midterm report: Math 485

Constructing a G(N, p) Network

Network Analysis. 1. Large and Complex Networks. Scale-free Networks Albert Barabasi Society

Curriculum Map: Mathematics

Scale Free Network Growth By Ranking. Santo Fortunato, Alessandro Flammini, and Filippo Menczer

Social, Information, and Routing Networks: Models, Algorithms, and Strategic Behavior

caution in interpreting graph-theoretic diagnostics

Lecture on Game Theoretical Network Analysis

TELCOM2125: Network Science and Analysis

Methods of Technical Risk Assessment in a Regional Context

Course of study- Algebra Introduction: Algebra 1-2 is a course offered in the Mathematics Department. The course will be primarily taken by

Topology and Dynamics of Complex Networks

CW High School. Algebra I A

Characterizing Web Usage Regularities with Information Foraging Agents

Social and Technological Network Data Analytics. Lecture 5: Structure of the Web, Search and Power Laws. Prof Cecilia Mascolo

Introduction to Applied and Pre-calculus Mathematics (2008) Correlation Chart - Grade 10 Introduction to Applied and Pre-Calculus Mathematics 2009

Simplicial Complexes of Networks and Their Statistical Properties

1 General description of the projects

Attack vulnerability of complex networks

Analyzing the Trade-offs in the Robustness of Supply Networks against Disruptions

Analyzing the Characteristics of Gnutella Overlays

Modeling Traffic of Information Packets on Graphs with Complex Topology

Transcription:

Example for calculation of clustering coefficient Node N 1 has 8 neighbors (red arrows) There are 12 connectivities among neighbors (blue arrows)

Average clustering coefficient of a graph Overall measure of network s clustering: the arithmetic mean of the C N for all nodes Can be used as another measure of network topology

Degree distribution Degree of a vertex specifies the # of edges by which it is connected to other vertices Degree distribution of a graph is a function measuring the total # of vertices in the graph with a given degree p( k ) = 1 N v V deg( v i 1 i ) = k Counts how many vertices have degree k

The degree distribution is a way to classify graphs into categories Random network: Poisson distribution Scale-free network: Power law linear distribution log-log plot

Remember scale-free networks Scale-free means that while the vast majority of vertices are weakly connected, there also exist some highly inter-connected super-vertices or hubs The term scale-free expresses that the ratio of highly to weakly connected vertices remains the same irrespective of the total # of links in the network Scale-free network is very simple, elegant & intuitive To produce an artificial scale-free network possessing small-world properties, only 2 basic rules need to be followed: Growth: the network is seeded with a small # of vertices Preferential attachment: the more connectivities a vertex has, the more likely new vertices will be connected to it (equivalent to the rich become richer!)

Power law A power law is any polynomial relationship that exhibits the property of scale invariance. The most common power laws relate two scalar quantities and have the form f ( x ) = k ax where a and k are constants. k is typically called the scaling exponent, where the word "scaling" denotes the fact that a power-law function satisfies f ( cx ) where c is a constant. Thus, a rescaling of the function's argument changes the constant of proportionality but preserves the shape of the function itself. Take the logarithm of both sides to clarify This expression has the form of a linear relationship with slope k. Rescaling the argument produces linear shift of the function up or down but leaves both the basic form and the slope k unchanged. [ f ( x )] k log x log a log = + f ( x ) [ f ( cx )] k log x k log c log a log = + + www.wikipedia.org

Power law An example power law graph, being used to demonstrate ranking of popularity. To the right is the long tail, to the left are the few that dominate. log-log plot Also known as the 80-20 rule: for many events, roughly 80% of the effects come from 20% of the causes. www.wikipedia.org

P(k) decays much slower for large k than the exponential decay of a Poisson distribution: highly connected hubs occur at much larger frequency than expected in random graphs.

In scale-free networks, the degree distribution follows a power law: p( k ) γ, = k γ R +

Degree distribution of a graph Measures the proportion of nodes in G that have degree k. m k is the number of nodes of degree k and m is the total number of all nodes P(k) is not a single number, but a set of numbers, with one degree value for each k (k = 0, 1, 2,...) Division by m assures that the sum of P(k) over all values of k is equal to 1

Biological systems: have clustering coefficients that are higher than those of randomly connected graphs with increasing size of the network, the clustering coefficient tends to decrease Implications: biological networks contain many nodes with low degree, but only relatively small numbers of hubs. Hubs are highly connected nodes that are surrounded by dense clusters of nodes with relatively low degree. Example: In both protein interaction network and the transcription regulatory network in yeast, hubs are much more often connected to nodes of low degree, rather than to other hubs

Characterizations of graphs (degree distributions, clustering coefficients, etc) are performed using comparisons with random graphs. Random graph: edges are randomly associated with nodes degree distribution is a Poisson distribution (skinny bell curve with small variance), most nodes have a degree that is close to average.

Most biological systems are not organized randomly A network whose degree distribution more or less follows a power law is called a scale-free network. In biological and other real-world systems: a few hubs are connected to disproportionally many other nodes, while most other nodes are associated with much fewer edges than in a random graph P(k) often follows a power-law distribution

Example of a scale-free network

Summary: network models A hierarchical architecture implies that sparsely connected areas, with communication between the different highly clustered neighborhoods being maintained by a few nodes.

A path in a graph is an alternating sequence of nodes and edges, beginning at a point and ending at a point, and which does not visit any point more than once. The first node is called the start node and the last node is called the end node. Both of them are called end or terminal nodes of the path. The other vertices in the path are internal nodes. Nodes A and D are connected by 5 paths: A B D A B E D A B E C D A E D A E C D

Two paths are independent (also called internally vertexdisjoint) if they do not have any internal node in common, Only 2 of these paths are independent: A B D and either A E D or A E C D. A B D A B E D A B E C D A E D A E C D

Example of paths

Shortest path (or distance) The shortest path between two nodes contains important information about the structure of the network. In random networks, the average shortest path is proportional to the logarithm of the number of nodes, log m. In scale-free networks, the average shortest path is shorter and increases more slowly for larger networks.

An algorithm to calculate the shortest path. See animation example in: http://www.sce.carleton.ca/faculty/chinneck/po.html Visit algorithm animations Dijkstra s Algorithm for Shortest Route Problems

Average shortest path (or distance) of metabolic networks In an analysis of over 40 metabolic networks with 200-500 nodes, Jeong and colleagues found that a few hubs dominated the connectivity patterns. As a consequence, the average shortest path was essentially independent of the size of the network and always had an amazingly low value of about three reaction steps. In a different interpretation, one may consider the average path as an indicator of how quickly information can be sent through the network. According to this criterion, biological networks are indeed very efficient.

Construction of different types of network

Small-world networks It is scale-free (has a power-law degree distribution and therefore a small average shortest distance). It has a high clustering coefficient. Special structure of small-world networks: consist of several loosely connected clusters of nodes, each surrounding a hub. In this organization, most nodes are not neighbors of each other, but moving from any one node to any other node in the network does not take many steps, because of the existence of the well-connected hubs.

Social networks have small world properties The small-world property is reminiscent of John Guare s famous play Six Degrees of Separation, whose premise is that essentially every human is connected to every other human by a chain of six or fewer acquaintances.

Properties of small-world networks The small-world property emerges automatically when a network grows by the addition of new links preferentially to nodes that are already highly connected. It is not surprising that the hubs in a gene or protein network are often among the most important components and that their destruction or mutation is frequently fatal. Identification of hubs may be important for drug target identification.