Introduction to Network Analysis. Some materials adapted from Lada Adamic, UMichigan

Similar documents
Copyright 2008, Lada Adamic. School of Information University of Michigan

Network Thinking. Complexity: A Guided Tour, Chapters 15-16

(Social) Networks Analysis III. Prof. Dr. Daning Hu Department of Informatics University of Zurich

Social Networks. Slides by : I. Koutsopoulos (AUEB), Source:L. Adamic, SN Analysis, Coursera course

Wednesday, March 8, Complex Networks. Presenter: Jirakhom Ruttanavakul. CS 790R, University of Nevada, Reno

M.E.J. Newman: Models of the Small World

CAIM: Cerca i Anàlisi d Informació Massiva

CS-E5740. Complex Networks. Scale-free networks

Properties of Biological Networks

Critical Phenomena in Complex Networks

Social, Information, and Routing Networks: Models, Algorithms, and Strategic Behavior

Examples of Complex Networks

Chapter 1. Social Media and Social Computing. October 2012 Youn-Hee Han

Nick Hamilton Institute for Molecular Bioscience. Essential Graph Theory for Biologists. Image: Matt Moores, The Visible Cell

6.207/14.15: Networks Lecture 5: Generalized Random Graphs and Small-World Model

Girls Talk Math Summer Camp

Topology and Dynamics of Complex Networks

Small-World Models and Network Growth Models. Anastassia Semjonova Roman Tekhov

SNA 8: network resilience. Lada Adamic

Complex Networks. Structure and Dynamics

Introduction to Complex Networks Analysis

Lesson 4. Random graphs. Sergio Barbarossa. UPC - Barcelona - July 2008

Advanced Algorithms and Models for Computational Biology -- a machine learning approach

Models of Network Formation. Networked Life NETS 112 Fall 2017 Prof. Michael Kearns

ECS 253 / MAE 253, Lecture 8 April 21, Web search and decentralized search on small-world networks

Heuristics for the Critical Node Detection Problem in Large Complex Networks

Introduction to Networks and Business Intelligence

Overview of Network Theory, I

Social Network Analysis With igraph & R. Ofrit Lesser December 11 th, 2014

Topologies and Centralities of Replied Networks on Bulletin Board Systems

SI Networks: Theory and Application, Fall 2008

An Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization

SI Networks: Theory and Application, Fall 2008

Graph-theoretic Properties of Networks

CSCI5070 Advanced Topics in Social Computing

caution in interpreting graph-theoretic diagnostics

Signal Processing for Big Data

Failure in Complex Social Networks

Erdős-Rényi Model for network formation

What is a Network? Theory and Applications of Complex Networks. Network Example 1: High School Friendships

Biological Networks Analysis

Example for calculation of clustering coefficient Node N 1 has 8 neighbors (red arrows) There are 12 connectivities among neighbors (blue arrows)

Networks and Discrete Mathematics

David Easley and Jon Kleinberg January 24, 2007

Theory and Applications of Complex Networks

Structure of biological networks. Presentation by Atanas Kamburov

TELCOM2125: Network Science and Analysis

Distributed Data Management

Graph Theory. Graph Theory. COURSE: Introduction to Biological Networks. Euler s Solution LECTURE 1: INTRODUCTION TO NETWORKS.

Introduction to the Special Issue on AI & Networks

Volume 2, Issue 11, November 2014 International Journal of Advance Research in Computer Science and Management Studies

STRUCTURAL ANALYSIS OF ELECTRICAL NETWORKS

Overlay (and P2P) Networks

Structural Analysis of Paper Citation and Co-Authorship Networks using Network Analysis Techniques

An Evolving Network Model With Local-World Structure

Design Robust Networks against Overload-Based Cascading Failures

Topic mash II: assortativity, resilience, link prediction CS224W

ECS 289 / MAE 298, Lecture 9 April 29, Web search and decentralized search on small-worlds

Network Mathematics - Why is it a Small World? Oskar Sandberg

Data mining --- mining graphs

Introduction to Engineering Systems, ESD.00. Networks. Lecturers: Professor Joseph Sussman Dr. Afreen Siddiqi TA: Regina Clewlow

Machine Learning and Modeling for Social Networks

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

Epidemic spreading on networks

Algorithmic and Economic Aspects of Networks. Nicole Immorlica

Characteristics of Preferentially Attached Network Grown from. Small World

MINFS544: Business Network Data Analytics and Applications

Peer-to-Peer Data Management

Exploiting the Scale-free Structure of the WWW

Online Social Networks 2

- relationships (edges) among entities (nodes) - technology: Internet, World Wide Web - biology: genomics, gene expression, proteinprotein

CSE 258 Lecture 12. Web Mining and Recommender Systems. Social networks

A quick review. The clustering problem: Hierarchical clustering algorithm: Many possible distance metrics K-mean clustering algorithm:

1 Degree Distributions

Distributed Data Management. Christoph Lofi Institut für Informationssysteme Technische Universität Braunschweig

Attack vulnerability of complex networks

CSCI5070 Advanced Topics in Social Computing

Graph Exploitation Testbed

Information Security in Scale Free Networks

On Complex Dynamical Networks. G. Ron Chen Centre for Chaos Control and Synchronization City University of Hong Kong

A Generating Function Approach to Analyze Random Graphs

Simulation and Analysis of Cascading Failure in Critical Infrastructure

Resilient Networking. Thorsten Strufe. Module 3: Graph Analysis. Disclaimer. Dresden, SS 15

6. Overview. L3S Research Center, University of Hannover. 6.1 Section Motivation. Investigation of structural aspects of peer-to-peer networks

RANDOM-REAL NETWORKS

Social Networks and Graph Theory. Holly Hirst Mathematical Sciences Appalachian State NCCTM - November 2, 2017

CSE 158 Lecture 11. Web Mining and Recommender Systems. Social networks

The importance of networks permeates

A Statistical Method for Synthetic Power Grid Generation based on the U.S. Western Interconnection

Fast Topology Management in Large Overlay Networks

CSE 316: SOCIAL NETWORK ANALYSIS INTRODUCTION. Fall 2017 Marion Neumann

Why Do Computer Viruses Survive In The Internet?

COMP6237 Data Mining and Networks. Markus Brede. Lecture slides available here:

Small World Properties Generated by a New Algorithm Under Same Degree of All Nodes

SMALL WORLD NETWORK. -Naveen Teja Vempati -Ravi Teja Mitta


THE COMPARISON OF IMMUNIZATION MECHANISMS AGAINST DISSEMINATING INFORMATION IN SOCIAL NETWORKS

How Do Real Networks Look? Networked Life NETS 112 Fall 2014 Prof. Michael Kearns

Supply chains involve complex webs of interactions among suppliers, manufacturers,

CS 322: (Social and Information) Network Analysis Jure Leskovec Stanford University

Transcription:

Introduction to Network Analysis Some materials adapted from Lada Adamic, UMichigan

Outline n the role of networks in life, nature, and research n why model networks: structure & dynamics n models (structure): n Erdos-Renyi random graph n Watts-Strogatz small world model n Barabasi-Albert scale-free networks n implications (dynamics): n diffusion of disease and information n search by navigating the network n resilience n IR applications

n Networks are collections of points joined by lines. What are networks? node Network Graph edge points lines vertices edges, arcs math nodes links computer science sites bonds physics actors ties, relations sociology

examples: terrorist networks Six degrees of Mohammed Atta Uncloaking Terrorist Networks, by Valdis Krebs

examples: boards of directors Source: http://theyrule.net

n Friendster examples: online social networks "Vizster: Visualizing Online Social Networks." Jeffrey Heer and danah boyd. IEEE Symposium on Information Visualization ( InfoViz 2005).

examples: Networks of personal homepages Stanford MIT homophily: what attributes are predictive of friendship? group cohesion Source: Lada A. Adamic and Eytan Adar, Friends and neighbors on the web, Social Networks, 25(3):211-230, July 2003.

examples: internet Source: Bill Cheswick http://www.cheswick.com/ches/map/gallery/index.html

examples: airline networks Source: Northwest Airlines WorldTraveler Magazine

examples: railway networks Source: TRTA, March 2003 - Tokyo rail map

other examples, e.g. natural language processing n Wordnet Source: http://wordnet.princeton.edu/man/wnlicens.7wn

examples: gene regulatory networks n gene regulatory networks n humans have only 30,000 genes, 98% shared with chimps n the complexity is in the interaction of genes n can we predict what result of the inhibition of one gene will be? Source: http://www.zaik.uni-koeln.de/bioinformatik/regulatorynets.html.en

n Citric acid cycle n Metabolites participate in chemical reactions examples: metabolic networks Source: undetermined

Campaign Contributions from Oil Companies (from http://oilmoney.priceofoil.org/)

Biochemical pathways (Roche) Source: Roche Applied Science, http://www.expasy.org/cgi-bin/show_thumbnails.pl

n Sample points from the surface n Connect each point to the k closest points as measured by Euclidean distance Creating a network from a surface 2 2 2 ) ( ) ( ) ( ), ( z z y y x x b a b a b a b a dist + + =

Medical Patients Creating a network from data Name Age Weight Height HR SBP DBP SpO 2 D. Johnson 32 153 70 82 134 72 98% S. Knell 47 169 65 130 169 93 99% P. Bryne 42 128 61 102 129 77 98% A. Amit 39 191 68 121 143 92 96% 1.) Measure the distance between pairs dist( a, b) = i w ( a i i b i 2 ) 2.) Connect each patient to its k nearest neighbors 0.1 0.92 S. Vega 0.75 0.03 D. Johnson S. Knell P. Bryne A. Amit M. Sa 0.41 0.62 N. Patters

Medical Patients Creating a network from data Name Age Weight Height HR SBP DBP SpO 2 D. Johnson 32 153 70 82 134 72 98% S. Knell 47 169 65 130 169 93 99% P. Bryne 42 128 61 102 129 77 98% A. Amit 39 191 68 121 143 92 96% 1.) Measure the distance between pairs dist( a, b) = i w ( a i i b i 2 ) 2.) Connect each patient to its k nearest neighbors S. Vega D. Johnson S. Knell P. Bryne A. Amit M. Sa N. Patters

Graph partitioning Goal: Partition the graph into multiple groups (clusters) Find the two clusters Social network of 62 dolphins (Lusseau et al., 2003) But where do we cut? Identify the different parts of the rabbit Predict how the community will split when departs

modeling networks: random networks n Nodes connected at random n Number of edges incident on each node is Poisson distributed Poisson distribution

Erdos-Renyi random graphs n What happens to the size of the giant component as the density of the network increases? http://ccl.northwestern.edu/netlogo/models/run.cgi?giantcomponent.884.534

modeling networks: small worlds n Small worlds n a friend of a friend is also frequently a friend n but only six hops separate any two people in the world Arnold S. thomashawk, Flickr; http://creativecommons.org/licenses/by-nc/2.0/deed.en

Small world models n Duncan Watts and Steven Strogatz n a few random links in an otherwise structured graph make the network a small world: the average shortest path is short regular lattice: my friend s friend is always my friend small world: mostly structured with a few random connections random graph: all connections random Source: Watts, D.J., Strogatz, S.H.(1998) Collective dynamics of 'small-world' networks. Nature 393:440-442.

Watts Strogatz Small World Model n As you rewire more and more of the links and random, what happens to the clustering coefficient and average shortest path relative to their values for the regular lattice? http://projects.si.umich.edu/netlearn/netlogo4/smallworldws.html

SIS models and small worlds n SIS model: nodes return to susceptible state after being infected n What is the role of random shortcuts in diffusion? http://projects.si.umich.edu/netlearn/netlogo4/smallworldws.html

modeling networks: power law networks n Many real world networks contain hubs: highly connected nodes n Usually the distribution of edges is extremely skewed number of nodes with so many edges many nodes with few edges fat tail: a few nodes with a very large number of edges number of edges no typical number of edges

network growth & resulting structure n random attachment: new node picks any existing node to attach to n preferential attachment: new node picks from existing nodes according to their degrees http://projects.si.umich.edu/netlearn/netlogo4/randprefattachment.html

What implications does this have? n Robustness n Search n Spread of disease n Opinion formation n Spread of computer viruses n Gossip

How do we search? Mary Who could introduce me to Richard Gere? Bob Jane Richard Gere spaceodissey, Flickr; http://creativecommons.org/licenses/by/2.0/deed.en Friends collage luc, Flickr; http://creativecommons.org/licenses/by/2.0/deed.en

power-law graph number of nodes found 94 63 67 54 2 6 1

Poisson graph number of nodes found 93 19 11 3 15 7 1

Power-law networks are robust to random breakdown

But are especially vulnerable to targeted attack n Targeting and removing hubs can quickly break up the network

In social networks, it s nice to be a hub mike

But it depends on what you re sharing

The role of hubs in epidemics n In a power-law network, a virus can persist no matter how low its infectiousness n Many real world networks do exhibit power-laws: n needle sharing n sexual contacts n email networks

Spread of computer viruses can be affected by the underlying network

SI models & network structure n Will random or preferential attachment lead to faster diffusion? random growth preferential growth http://projects.si.umich.edu/netlearn/netlogo4/badiffusion.html

resilience: power grids and cascading failures n Vast system of electricity generation, transmission & distribution is essentially a single network n Power flows through all paths from source to sink (flow calculations are important for other networks, even social ones) n All AC lines within an interconnect must be in sync n If frequency varies too much (as line approaches capacity), a circuit breaker takes the generator out of the system n Larger flows are sent to neighboring parts of the grid triggering a cascading failure Source:.wikipedia.org/wiki/File:UnitedStatesPowerGrid.jpg

Cascading failures n n n n n n n 1:58 p.m. The Eastlake, Ohio, First Energy generating plant shuts down (maintenance problems). 3:06 p.m. A First Energy 345-kV transmission line fails south of Cleveland, Ohio. 3:17 p.m. Voltage dips temporarily on the Ohio portion of the grid. Controllers take no action, but power shifted by the first failure onto another power line causes it to sag into a tree at 3:32 p.m., bringing it offline as well. While Mid West ISO and First Energy controllers try to understand the failures, they fail to inform system controllers in nearby states. 3:41 and 3:46 p.m. Two breakers connecting First Energy s grid with American Electric Power are tripped. 4:05 p.m. A sustained power surge on some Ohio lines signals more trouble building. 4:09:02 p.m. Voltage sags deeply as Ohio draws 2 GW of power from Michigan. 4:10:34 p.m. Many transmission lines trip out, first in Michigan and then in Ohio, blocking the eastward flow of power. Generators go down, creating a huge power deficit. In seconds, power surges out of the East, tripping East coast generators to protect them. Source: Eric J. Lerner, What's wrong with the electric grid? http://www.aip.org/tip/inphfa/vol-9/iss-5/p8.html

n Rumor spreading n Urban legends n Word of mouth (movies, products) (dis) information cascades n Web is selfcorrecting: n Satellite image hoax is first passed around, then exposed, hoax fact is blogged about, then written up on urbanlegends.about.com Source: undetermined

Actual satellite images of the effect of the blackout 20 hours prior to blackout 7 hours after blackout Source: NOAA, U.S. Government

IR applications: online info retrieval n It s in the links: n links to URLs can be interpreted as endorsements or recommendations n the more links a URL receives, the more likely it is to be a good/ entertaining/provocative/authoritative/interesting information source n but not all link sources are created equal n a link from a respected information source n a link from a page created by a spammer an important page, e.g. slashdot if a web page is slashdotted, it gains attention Many webpages scattered across the web

Ranking pages by tracking a drunk n A random walker following edges in a network for a very long time will spend a proportion of time at each node which can be used as a measure of importance n Various eigenvalue metrics yield variations of importance measures

Wrap up n networks are everywhere and can be used to describe many, many systems n by modeling networks we can start to understand their properties and the implications those properties have for processes occurring on the network