Random graph models with fixed degree sequences: choices, consequences and irreducibilty proofs for sampling

Similar documents
arxiv: v3 [stat.me] 10 Oct 2017

Configuring Random Graph Models with Fixed Degree Sequences \ast

Intro to Random Graphs and Exponential Random Graph Models

Today. Types of graphs. Complete Graphs. Trees. Hypercubes.

1 More configuration model

Supplementary material to Epidemic spreading on complex networks with community structures

DS UNIT 4. Matoshri College of Engineering and Research Center Nasik Department of Computer Engineering Discrete Structutre UNIT - IV

Structure of Social Networks

CS281 Section 9: Graph Models and Practical MCMC

Distribution-Free Models of Social and Information Networks

Math 170- Graph Theory Notes

Graph Theory S 1 I 2 I 1 S 2 I 1 I 2

Assignment 4 Solutions of graph problems

Discrete mathematics II. - Graphs

Paths, Circuits, and Connected Graphs

Introduction to network metrics

Math 776 Graph Theory Lecture Note 1 Basic concepts

Lecture 4: Trees and Art Galleries

Sequential Algorithms for Generating Random Graphs. Amin Saberi Stanford University

Testing the Cluster Structure of Graphs Christian Sohler

Stanford University CS359G: Graph Partitioning and Expanders Handout 18 Luca Trevisan March 3, 2011

Mesh segmentation. Florent Lafarge Inria Sophia Antipolis - Mediterranee

Algorithms: Graphs. Amotz Bar-Noy. Spring 2012 CUNY. Amotz Bar-Noy (CUNY) Graphs Spring / 95

Chapter 1. Introduction

Mean Field and Variational Methods finishing off

Math 454 Final Exam, Fall 2005

Chapter 12 and 11.1 Planar graphs, regular polyhedra, and graph colorings

Basic Combinatorics. Math 40210, Section 01 Fall Homework 4 Solutions

CS 361 Data Structures & Algs Lecture 11. Prof. Tom Hayes University of New Mexico

Community Detection. Community

2-uniform words: cycle graphs, and an algorithm to verify specific word-representations of graphs

Extremal Graph Theory: Turán s Theorem

Algorithms. Graphs. Algorithms

Orientations of Planar Graphs

Exponential Random Graph Models for Social Networks

CSCI5070 Advanced Topics in Social Computing


Statistical network modeling: challenges and perspectives

Professor: Padraic Bartlett. Lecture 9: Trees and Art Galleries. Week 10 UCSB 2015

Matching. Algorithms and Networks

Introduction to Randomized Algorithms

Mean Field and Variational Methods finishing off

Lesson 4. Random graphs. Sergio Barbarossa. UPC - Barcelona - July 2008

Henneberg construction

CS6702 GRAPH THEORY AND APPLICATIONS 2 MARKS QUESTIONS AND ANSWERS

MAT 280: Laplacian Eigenfunctions: Theory, Applications, and Computations Lecture 18: Introduction to Spectral Graph Theory I. Basics of Graph Theory

Computational Geometry

Induction Review. Graphs. EECS 310: Discrete Math Lecture 5 Graph Theory, Matching. Common Graphs. a set of edges or collection of two-elt subsets

Notes in Computational Geometry Voronoi Diagrams

6.207/14.15: Networks Lecture 5: Generalized Random Graphs and Small-World Model

Math 575 Exam 3. (t). What is the chromatic number of G?

Math236 Discrete Maths with Applications

DEFINITION OF GRAPH GRAPH THEORY GRAPHS ACCORDING TO THEIR VERTICES AND EDGES EXAMPLE GRAPHS ACCORDING TO THEIR VERTICES AND EDGES

1 Some Solution of Homework

Graphs That Are Randomly Traceable from a Vertex

Complex-Network Modelling and Inference

K 4,4 e Has No Finite Planar Cover

Lecture 6: Spectral Graph Theory I

Graphs. Introduction To Graphs: Exercises. Definitions:

Basics of Graph Theory

Available online at ScienceDirect. Procedia Computer Science 20 (2013 )

Chapter 2 Graphs. 2.1 Definition of Graphs

Section 10.1: Graphs and Graph Models. Introduction to Graphs Definition of a Graph Types of Graphs Examples of Graphs

1 Matchings with Tutte s Theorem

Constructing a G(N, p) Network

CAIM: Cerca i Anàlisi d Informació Massiva

6 Randomized rounding of semidefinite programs

Randomized Algorithms Week 4: Decision Problems

Split-Decomposition Trees with Prime Nodes: Enumeration and Random Generation of Cactus Graphs. Maryam Bahrani Jérémie Lumbroso

Networks and Algebraic Statistics

Estimating and Sampling Graphs with Multidimensional Random Walks. Group 2: Mingyan Zhao, Chengle Zhang, Biao Yin, Yuchen Liu

PACKING DIGRAPHS WITH DIRECTED CLOSED TRAILS

Approximation Algorithms: The Primal-Dual Method. My T. Thai

Math.3336: Discrete Mathematics. Chapter 10 Graph Theory

Lecture 9 - Matrix Multiplication Equivalences and Spectral Graph Theory 1

1. a graph G = (V (G), E(G)) consists of a set V (G) of vertices, and a set E(G) of edges (edges are pairs of elements of V (G))

The vertex set is a finite nonempty set. The edge set may be empty, but otherwise its elements are two-element subsets of the vertex set.

On Realizations of a Joint Degree Matrix

Package assortnet. January 18, 2016

GraphBLAS Mathematics - Provisional Release 1.0 -

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

Social Networks. Slides by : I. Koutsopoulos (AUEB), Source:L. Adamic, SN Analysis, Coursera course

Workshop report 1. Daniels report is on website 2. Don t expect to write it based on listening to one project (we had 6 only 2 was sufficient

On Universal Cycles of Labeled Graphs

Some important models

Regular Hypergraphs: Asymptotic Counting and Loose Hamilton Cycles

Assignment # 4 Selected Solutions

Chordal Graphs: Theory and Algorithms

CS 311 Discrete Math for Computer Science Dr. William C. Bulko. Graphs

Combinatorics Summary Sheet for Exam 1 Material 2019

CPCS Discrete Structures 1

ECS 20 Lecture 17b = Discussion D8 Fall Nov 2013 Phil Rogaway

Graphs. Pseudograph: multiple edges and loops allowed

Outline. CS38 Introduction to Algorithms. Approximation Algorithms. Optimization Problems. Set Cover. Set cover 5/29/2014. coping with intractibility

Hamiltonian cycles in bipartite quadrangulations on the torus

Vertical decomposition of a lattice using clique separators

The Mathematical Description of Networks

Lecture 2 - Graph Theory Fundamentals - Reachability and Exploration 1

Math 778S Spectral Graph Theory Handout #2: Basic graph theory

Course Introduction / Review of Fundamentals of Graph Theory

Transcription:

Random graph models with fixed degree sequences: choices, consequences and irreducibilty proofs for sampling Joel Nishimura 1, Bailey K Fosdick 2, Daniel B Larremore 3 and Johan Ugander 4 1 Arizona State Univ. 2 Colorado State 3 Univ. of Colorado 4 Stanford Univ. ASU Discrete Math Seminar 2018

See paper for the numerous literature connections

What is notable about a graph?

Interpretation requires a Null Model karate club

model error Interpretation requires a Null Model Erdős Rényi fixed degree sequence application specific simulation replicated experiments implementation difficulty and/or required understanding

Stub Matching edges to stubs join 2 stubs drop stub labels Vertex-labeled Stub-labeled

Self-loops edges to stubs join 2 stubs drop stub labels

Self-loops and Multiedges Self-loops and multiedges are asymptotically rare (for reasonable degree sequences) Have been frequently been ignored, or simply deleted BUT they can also have large impacts on finite sized null models AND in null models which allow self-loops or multiedges, stub matching does not sample adjacency matrices uniformly at random

model error Interpretation requires a Null Model Erdős Rényi fixed degree sequence??? application specific simulation replicated experiments implementation difficulty and/or required understanding

simple multiedges self-loops vertexlabeled stublabeled Stub labeled graphs are biased against multiedges and self-loops

Consider 1,2,2,1 Uniformly samples from d and e are the same Uniform samples are different

There s a choice of graphs and it matters

Example 1 Geometer s collaboration graph n=9,072 m=22,577 Nodes: computational geometry researchers. Edges: collaboration on a book or paper Degree assortativity Do high productivity authors coauthor with other high productivity authors?

Multiedges, self-loops, and labeling?

Stub-labeling isn t causal Consider a collaboration network, and two potential stub labelings: Node i Stub 1: first paper Stub 2: second paper Node i Stub 1: first paper Stub 2: second paper Node j Stub 1: first paper Stub 2: second paper Node j Stub 1: first paper Stub 2: second paper

Vertex-Labeling is Causal Consider a collaboration network: Nodes: authors Edges: papers/books with unique title Suppose you order each edge s arrival Each vertex labeled graph has m! edge orderings i.e. all adjacency matrices correspond to the same number of timelines where papers were produced in different orders.

Example 2 Swallow graph n=17 Nodes: barn swallows. Edges: bird-bird interactions Trait assortativity (based on bird color) Do birds of a similar color interact together?

Example 3 South Indian village social support network n=782 Nodes: villagers Edges: reported social support Community detection via modularity maximization Modularity has a built in stub-labeled Chung-Lu null model Chung Lu estimation Do results change if we use vertex labeled model? # of edges observed in configuratio n models

Sampling graphs uniformly at random is surprisingly difficult (except pseudo-graphs)

Sampling graphs uniformly at random

Sampling via Markov chain Monte Carlo G 0 G 1 G 2 G 3 G 4 G 5 Goal: A sequence of degree constrained graphs such that subsampling from this sequence approximates a set of graphs drawn uniformly at random.

Double Edge Swaps

, the Graph of Graphs

, the Graph of Graphs

Dealing with Constraints

Dealing with Constraints no self-loops

Dealing with Constraints

MCMC requirements 1. Random walks can reach any graph -Irreducibility/GOG connected 2. Balanced transition probabilities -P(G i G j ) = P(G j G i ) -i.e. edges will be weighted but undirected 3. Markov chain is aperiodic -otherwise subsampling can be biased NOTE: There are mixing time results for some degree sequences. There are also numerical methods to gauge convergence. I will not discuss either.

Is the GoG periodic? Nope! Or

Are transition probabilities balanced? Stub-labeled GoG Vertex-labeled GoG GoG is an undirected simple graph GoG is a directed pseudograph

Stub-labeled GoG Vertex-labeled GoG GoG is an undirected simple graph

Is the GoG connected? Most difficult of the 3 questions Need special proof for each of choice of selfloops/multiedges Stub labeled GoG connectivity iff vertex labeled GoG Connectivity, because the following swap permutes stubs:

Connectivity of Graph of Pseudographs start target diff # of stubs per node # gold = # maroon

Connectivity of Graph of Pseudographs swap start target can always find a graph one edge closer to target

Connectivity on other GoGs?

Disconnectivity of loopy graphs Consider graphs with self-loops but no multiedges There are no swaps between these graphs Two directions for generalizations: cycles and cliques

Degree sequence: 2,2,,2} Swaps can: 1. Merge two cycles into a larger cycle (or do the reverse). 2. Swap two edges inside a cycles, preserving cycle length 3. Make a self-loop & reduce cycle length by 1 (or do the reverse), but only for cycles of length 4 or more. Swaps cannot make every edge a self-loop This can be further generalized

A taxonomy of V 3) V k are vertices k distance from a vertex in V 0 2) Vertices in V 1 have a neighbor in V 0 1) Let V 0 be vertices without a self-loop

Let V k be vertices k hops from a vertex without a selfloop

Deg seq: n+1, n+1,n-1,,n-1 No swaps are possible

Q 1 and Q 2 are exactly the problems

Proof of 4.20 outline graphs with a fixed degree sequence increasing number of self-loops connected components

graphs with most self-loops in yellow m*-loopy graphs: graphs with the most self-loops increasing number of self-loops connected components

graphs with most self-loops in yellow increasing number of self-loops m*-loopy graphs: graphs with the most self-loops Note: connectivity of follows from connectivity of simple graphs and an exchange lemma. connected components

graphs with most self-loops in yellow increasing number of self-loops m*-loopy graphs: graphs with the most self-loops The GoG is disconnected iff there is some component where: U connected components

Zooming into

Easy case: Harder case:

What do we know about? Maximum number of self-loops implies no open wedges in V 0. & No sequence of swaps can net create open wedges in V 0.

Example: V 4 is empty in any Open Wedge

Q 1 Q 2

is m*-loopy Decreasing any degree in K 0 leaves V u1 with excess degree.

is also m*-loopy By an alternating cycle/path argument. Thus Q: Can a different swap connect loopy-graphs?

Triangle swaps connect the GoG

Bonus: other constraints Connected Graphs GoG known to be connected, but algorithms require complicated data-structures to track effect of edge changes. Graphs with the same clustering coefficients Or, triangle constraints

Triangle MCMC constraints Total number of triangles Number of triangles incident at each node Do these affect connectedness in simple graphs?

Can we constrain number of triangles

How about triangle sequence

And more!

Thanks for listening!