Algorithmic and Economic Aspects of Networks. Nicole Immorlica

Similar documents
Networks in economics and finance. Lecture 1 - Measuring networks

How Do Real Networks Look? Networked Life NETS 112 Fall 2014 Prof. Michael Kearns

Constructing a G(N, p) Network

bar foo for sum-product for for for for for for end for end for end for end for end for end for end for sum-product foo bar Definition

Graph Theory and Network Measurment

THE KNOWLEDGE MANAGEMENT STRATEGY IN ORGANIZATIONS. Summer semester, 2016/2017

CS224W: Analysis of Networks Jure Leskovec, Stanford University

Constructing a G(N, p) Network

Graph and Digraph Glossary

Social-Network Graphs

Some Graph Theory for Network Analysis. CS 249B: Science of Networks Week 01: Thursday, 01/31/08 Daniel Bilar Wellesley College Spring 2008

Erdős-Rényi Model for network formation

An Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization

Introduction to network metrics

Tie strength, social capital, betweenness and homophily. Rik Sarkar

Algorithms and Applications in Social Networks. 2017/2018, Semester B Slava Novgorodov

CS 151. Graphs. Monday, November 19, 12

Monday, November 19, 12. Test #2 is Wednesday. For each data structure, you should know: the supported operations

RANDOM-REAL NETWORKS

Recap! CMSC 498J: Social Media Computing. Department of Computer Science University of Maryland Spring Hadi Amiri

Definition f(n) = O(g(n)) if there exists n 0, c such that for all n n 0, f(n) cg(n). g is an asymptotic upper bound on f.

Distribution-Free Models of Social and Information Networks

Graph Theory for Network Science

Complex-Network Modelling and Inference

David Easley and Jon Kleinberg January 24, 2007

Social Network Analysis

Network Basics. CMSC 498J: Social Media Computing. Department of Computer Science University of Maryland Spring Hadi Amiri

CAIM: Cerca i Anàlisi d Informació Massiva

The Basics of Network Structure

Structural Analysis of Paper Citation and Co-Authorship Networks using Network Analysis Techniques

Topology Enhancement in Wireless Multihop Networks: A Top-down Approach

Non Overlapping Communities

Using! to Teach Graph Theory

Graphs / Networks. CSE 6242/ CX 4242 Feb 18, Centrality measures, algorithms, interactive applications. Duen Horng (Polo) Chau Georgia Tech

Mathematical Foundations

Economics of Information Networks

Social, Information, and Routing Networks: Models, Algorithms, and Strategic Behavior

Basics of Network Analysis

SLANG Session 4. Jason Quinley Roland Mühlenbernd Seminar für Sprachwissenschaft University of Tübingen

UNIVERSITA DEGLI STUDI DI CATANIA FACOLTA DI INGEGNERIA

1 Homophily and assortative mixing

Introduction. Computer Networks: Introduction 1

Computer-to-Computer Networks (cont.)

Math.3336: Discrete Mathematics. Chapter 10 Graph Theory

Supplementary material to Epidemic spreading on complex networks with community structures

Graph Theory. Network Science: Graph theory. Graph theory Terminology and notation. Graph theory Graph visualization

Mining and Analyzing Online Social Networks

Extracting Information from Complex Networks

Social Network Analysis With igraph & R. Ofrit Lesser December 11 th, 2014

ECE 158A - Data Networks

Random Graph Model; parameterization 2

Graph Theory Review. January 30, Network Science Analytics Graph Theory Review 1

Small-World Models and Network Growth Models. Anastassia Semjonova Roman Tekhov

Section 2.7 BIPARTITE NETWORKS

Example for calculation of clustering coefficient Node N 1 has 8 neighbors (red arrows) There are 12 connectivities among neighbors (blue arrows)

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

Structure of Social Networks

CS249: SPECIAL TOPICS MINING INFORMATION/SOCIAL NETWORKS

CS 322: (Social and Information) Network Analysis Jure Leskovec Stanford University

Random Simplicial Complexes

Paths, Circuits, and Connected Graphs

Introduction to Algorithms. Lecture 11. Prof. Patrick Jaillet

Econ 430 Lecture 3: Significance and Structural Properties of N

Community detection algorithms survey and overlapping communities. Presented by Sai Ravi Kiran Mallampati

Basic Network Concepts

Randomized Rumor Spreading in Social Networks

Erdös-Rényi Graphs, Part 2

beyond social networks

Web 2.0 Social Data Analysis

Lecture 7. s.t. e = (u,v) E x u + x v 1 (2) v V x v 0 (3)

An introduction to the physics of complex networks

Big-Θ. Big-Θ example

Modeling and Simulating Social Systems with MATLAB

Graph theory. Po-Shen Loh. June We begin by collecting some basic facts which can be proved via bare-hands techniques.

Analytical reasoning task reveals limits of social learning in networks

L20-21: Graph Data Structure

1. a graph G = (V (G), E(G)) consists of a set V (G) of vertices, and a set E(G) of edges (edges are pairs of elements of V (G))

CSE 258 Lecture 12. Web Mining and Recommender Systems. Social networks

Mining Social Network Graphs

Graph Theory. Graph Theory. COURSE: Introduction to Biological Networks. Euler s Solution LECTURE 1: INTRODUCTION TO NETWORKS.

Week 5 Video 5. Relationship Mining Network Analysis

Networks, Crowds, and Markets:

Strategic Network Formation

Using graph theoretic measures to predict the performance of associative memory models

CSE 158 Lecture 11. Web Mining and Recommender Systems. Social networks

Topic mash II: assortativity, resilience, link prediction CS224W

CS4514 Computer Networks

Graphs / Networks CSE 6242/ CX Centrality measures, algorithms, interactive applications. Duen Horng (Polo) Chau Georgia Tech

Chapter 1. Social Media and Social Computing. October 2012 Youn-Hee Han

Chapter 5: Euler Paths and Circuits The Mathematics of Getting Around

Lesson 4. Random graphs. Sergio Barbarossa. UPC - Barcelona - July 2008

1 Counting triangles and cliques

Elements of Graph Theory

CSE 190 Lecture 16. Data Mining and Predictive Analytics. Small-world phenomena

Chapter 5: Euler Paths and Circuits The Mathematics of Getting Around

6.207/14.15: Networks Lecture 5: Generalized Random Graphs and Small-World Model

Social Networks. Slides by : I. Koutsopoulos (AUEB), Source:L. Adamic, SN Analysis, Coursera course

Graph Algorithms: Part 1. Dr. Baldassano Yu s Elite Education

Extracting Information from Social Networks

15 Graph Theory Counting the Number of Relations. 2. onto (surjective).

Transcription:

Algorithmic and Economic Aspects of Networks Nicole Immorlica

Syllabus 1. Jan. 8 th (today): Graph theory, network structure 2. Jan. 15 th : Random graphs, probabilistic network formation 3. Jan. 20 th : Epidemics 4. Feb. 3 rd : Search 5. Feb. 5 th : Game theory, strategic network formation 6. Feb. 12 th : Diffusion 7. Feb. 19 th : Learning 8. Feb. 26 th : Markets 9. Mar. 5 th : TBA 10. Mar. 12 th : Final project presentations

Assignments Readings, weekly From Social and Economic Networks, by Jackson Research papers, one per week Reaction papers, weekly Class presentations Two problem sets (?) Final projects, end of term Survey paper Theoretical/empiracle analysis

Grading (Approximate) Participation, class presentations [15%] Reaction papers [25%] Problem sets [20%] Final project [40%]

Networks A network is a graph that represents relationships between independent entities. Graph of friendships (or in the virtual world, networks like facebook) Graph of scientific collaborations Web graph (links between webpages) Internet: Inter/Intra-domain graph

New Testament Social Network

New Testament Social Network Visualization from ManyEyes

United Routes Network Honolulu Seattle

United Routes Network

Erdos Collaboration Network Harry Buhrman Lance Fortnow

Erdos Collaboration Network Visualization from Orgnet

Graph Theory Graph G = (V,E) A set V of n nodes or vertices V = {i} A subset E of V x V of m pairs of vertices, called edges E = {(i,j)} Edges can be directed (pair order matters), or undirected.

Drawing Graphs Undirected graphs Directed graphs

Representing Graphs List of edges (A,B), (A,C), (B,C), (B,D), (C,D), (D,E) Node adjacency list A: B, C; B: A, C; C: A, B; D: B, C, E; E: D Adjacency matrix A ij = 1 if (i,j) is an edge, else = 0 B C A D E A 0 1 1 0 0 B 1 0 1 1 0 C 1 1 0 1 0 D 0 1 1 0 1 E 0 0 0 1 0 A B C D E

Modeling Networks symmetric networks = undirected graphs friendship agents agents

Example: Arpanet

Example: Arpanet SRI UTAH LINC MIT CASE UCSB STAN SDC BBN CARN UCLA RAND HARV

Neighborhoods and Degrees neighborhood of node i = nodes j with edge to i degree of node i = number of neighbors SRI UTAH LINC MIT CASE UCSB STAN SDC BBN CARN UCLA RAND HARV

Paths and Cycles path = sequence of nodes such that each consecutive pair is connected SRI UTAH LINC MIT CASE UCSB STAN SDC BBN CARN UCLA RAND HARV MIT UTAH SRI STAN

Paths and Cycles simple path = one that does not repeat nodes SRI UTAH LINC MIT CASE UCSB STAN SDC BBN CARN UCLA RAND HARV

Paths and Cycles cycle = path that starts and ends at same node SRI UTAH LINC MIT CASE UCSB STAN SDC BBN CARN UCLA RAND HARV

Special Graphs Trees = a unique path between each pair of vertices Stars = an edge from each node to center node Cliques (complete graphs) = an edge between each pair of vertices, written K n

Connectivity A graph is connected if there is a path between every pair of nodes.

SRI UTAH LINC MIT CASE UCSB STAN SDC BBN CARN UCLA RAND HARV

Connected components SRI UTAH LINC MIT CASE UCSB STAN SDC BBN CARN UCLA RAND HARV

Giant Components Friendless person Your TA from last quarter You Weird guy in high school Most of the world Your friends Your piano teacher Remote tribe on desert island

Study Guide Graph representations Neighbors and degrees Paths and cycles Connectedness Giant component

Network Questions How popular are we? How connected are we? How tight-knit are we? How important am I? What is degree dist.? What is diameter? What is clustering coeff.? What is centrality?

Degree Distributions relative frequency of nodes w/different degrees P(d) = fraction of nodes with degree d P(d) = probability random node has deg. d

Degree Distributions P(d) = fraction of nodes with degree d SRI UTAH LINC MIT CASE UCSB STAN SDC BBN CARN UCLA RAND HARV P(0) = 0, P(1) = 0, P(2) = 7/13, P(3) = 4/13, P(4) = 2/13

Degree Distributions Frequency 1 Star deg. dist. A clique? star? Clique deg. dist. 3/4 Arpanet deg. dist. 1/2 1/4 0 0 1 2 3 4 5 6 7 8 Degree

Degree Distributions Some special degree distributions Poisson degree distribution: P(d) = for some 0 < p < 1. n d p d (1-p) d Scale-free (power-law) degree dist.: P(d) = cd -

Poisson vs Power-law Frequency 1 3/4 Power-law: P(d) = cd - Poisson: P(d) = n d p d (1-p) d 1/2 1/4 0 0 1 2 3 4 5 6 7 8 Degree

Log-log plots Power-law: P(d) = cd - log [ P(d) ] = log [ cd - ] = log [ c ] log [ d ] So a straight line on a log-log plot.

Log-log plots Log Frequency 0 Power-law: log P(d) = log [ c ] log [ d ] -2-4 -6-8 A straight line, slope = exponent Poisson 0 1 2 3 4 5 6 7 8 Log Degree

Alternate views Frequency (Number) There are j nodes with degree exactly d Log-log plot Degree

Alternate views Frequency (Number) There are j nodes with degree at least d Log-log plot Degree Cumulative Distribution

Alternate views Degree The j th most popular node has degree d Log-log plot Rank

Example: Collaboration Graph Power law exp: c = 2.97 With exponential decay factor, c = 2.46

Example: Inter-Domain Internet Power law exponent: 2.15 < c < 2.2

Example: Web Graph In-Degree Power law exponent: c = 2.09

Q1. How popular are we? Many social networks have power-law degree distributions. A few very popular people, many many unpopular people.

Diameter How far apart are the nodes of a graph? How far apart are nodes i and j? What is the length of a path from i to j?

Length of a Path The length of a path is # of edges it contains. SRI UTAH LINC MIT CASE UCSB STAN SDC BBN CARN UCLA RAND HARV LINC MIT BBN RAND SDC has length 4.

Distance The distance between two nodes is the length of the shortest path between them. SRI UTAH LINC MIT CASE UCSB STAN SDC BBN CARN UCLA RAND HARV Distance between LINC and SDC is 3.

Computing Distance Grow a breadth-first search tree.

Computing Distance Source Target Neighbors (Distance 1) Distance 2

Diameter The diameter of a network is the maximum distance between any two nodes. SRI UTAH LINC MIT CASE UCSB STAN SDC BBN CARN UCLA RAND Diameter is 5. HARV

Diameter Of a. clique? star? tree? tree of height k? binary tree?

Computing Diameter Squaring adjacency matrix. Height of arbitrary BFS tree (2-approx).

Computing Diameter Squaring Adjacency matrix. A ij = 1 if (i,j) is an edge. (A 2 ) ij > 0 if there is a k s.t. (i,k) is an edge and (k,j) is an edge. A p represents paths of length exactly p. (A+Identity) p represents paths of length p.

Computing Diameter Height of arbitrary BFS tree (2-approx). Consider max shortest path (i 1,, i k ). Height of tree when reaching i 1 is at most k. Number of remaining levels is at most k. Height of BFS tree is at most twice the diameter k (the length of the maximum shortest path).

Final Project #1 Efficient algorithms for approximating diameters (and other statistics) of big graphs.

Examples Collaboration graph 401,000 nodes, 676,000 edges (average degree 3.37) Diameter: 23, Average distance: 7.64 Cross-post graph, giant component 30,000 nodes, 800,000 edges (average degree 53.3) Diameter: 13, Average distance: 3.8 Web graph 200 million nodes, 1.5 billion edges (average degree 15) Average connected distance: 16 Inter-domain Internet 3500 nodes, 6500 edges (average degree 3.71) 95% of pairs of nodes within distance 5

Q2. How connected are we? Many social networks have small diameter. There are short connections between most people (6 degrees of separation).

Clustering Coefficient How many of your friends are also friends?

Clustering Coefficient The clustering coeff. of a node is the fraction of its neighbors that are connected. Clustering coeff. = 13/72

Clustering Coefficient The clustering coefficient of a graph is the average clustering coefficient of its nodes, Or the fraction of triangles among all connected triples of nodes.

Examples Collaboration graph Clustering coefficient is 0.14 Density of edges is 0.000008 Cross-post graph Clustering coefficient is 0.4492 Density of edges is 0.0016

Q 3. How tight-knit are we? Social networks have high clustering coeff. Many of our friends are friends.

Centrality Measures of centrality Degree-based: how connected is a node Closeness: how easy can a node reach others Betweenness: how important is a node in connecting other nodes Neighbor s characteristics: how important, central, or influential a nodes neighbors are

Degree Centrality The degree centrality of a node is d(i) / (n-1) where d(i) is the degree of node i. Degree centrality = 6/9

Degree Centrality Low degree centrality, but close to all nodes.

Closeness Centrality The closeness centrality of a node i is ± d(i,j) where ± is a discounting factor in [0,1] and d(i,j) is length of shortest path from i to j. Closeness centrality = 3± + 8± 2

Closeness Centrality High closeness centrality, but peripheral.

Betweenness Centrality Betweennes centrality of node i is fraction of shortest paths passing through i: k,j P i (k,j)/p(k,j) / (n-1)(n-2)/2 where P i (k,j) is # of shortest paths from j to k through i; P(k,j) is total # of shortest paths. Betweennes centrality = [30 *(1/2)] / [12*11/2] = 15/66 = 5/22

Final Project #2 Investigate how these properties change as social network evolves.

Assignment: Readings: Social and Economic Networks, Part I Graph Structure in the Web, Broder et al The Strength of Weak Ties, Granovetter Reactions: Reaction paper to one of research papers, or a research paper of your choice Presentation volunteer?