Sequential Algorithms for Generating Random Graphs. Amin Saberi Stanford University

Similar documents
A local switch Markov chain on given degree graphs with application in connectivity of Peer-to-Peer networks

By Joseph Blitzstein and Persi Diaconis Stanford University

Networks and Algebraic Statistics

Models and Algorithms for Complex Networks

Graph Theory S 1 I 2 I 1 S 2 I 1 I 2

Math 776 Graph Theory Lecture Note 1 Basic concepts

Cycles in Random Graphs

Random graph models with fixed degree sequences: choices, consequences and irreducibilty proofs for sampling

On Realizations of a Joint Degree Matrix

A Sequential Importance Sampling Algorithm for Generating Random Graphs with Prescribed Degrees

Acyclic orientations of graphs

Branching Distributional Equations and their Applications

CALCULATING the number of vertices of a bounded

Analyzing the Peeling Decoder

Chapter 1. Introduction

On approximately counting colourings of small degree graphs

Sequential Monte Carlo for Counting Vertex Covers in General Graphs

Counting the Number of Eulerian Orientations

Efficient Markov Chain Monte Carlo Algorithms For MIMO and ISI channels

Peer-to-Peer Networks based on Random Transformations of Connected Regular Undirected Graphs

Networks in economics and finance. Lecture 1 - Measuring networks

On the construction of Tanner graphs

Sampling Large Graphs: Algorithms and Applications

GAMES Webinar: Rendering Tutorial 2. Monte Carlo Methods. Shuang Zhao

SEARCHING WITH MULTIPLE RANDOM WALK QUERIES

Algebraic statistics for network models

The flip Markov chain for connected regular graphs

Online Stochastic Matching CMSC 858F: Algorithmic Game Theory Fall 2010

Regular Hypergraphs: Asymptotic Counting and Loose Hamilton Cycles

K-SATURATED GRAPHS CLIFFORD BRIDGES, AMANDA DAY, SHELLY MANBER

Random Simplicial Complexes

Isabel Beichl. National Institute of Standards & Technology 100 Bureau Drive Gaithersburg, MD 20899

Bipartite Graph based Construction of Compressed Sensing Matrices

Coloring Random Graphs

Supplementary material to Epidemic spreading on complex networks with community structures

Judicious bisections

The loop-erased random walk and the uniform spanning tree on the four-dimensional discrete torus

CS249: SPECIAL TOPICS MINING INFORMATION/SOCIAL NETWORKS

Random Simplicial Complexes

Rumour spreading in the spatial preferential attachment model

Introduction to network metrics

Approximately Counting Embeddings into Random Graphs

Random Tilings with the GPU

Orientations of Planar Graphs

Graph Theory. Graph Theory. COURSE: Introduction to Biological Networks. Euler s Solution LECTURE 1: INTRODUCTION TO NETWORKS.

Statistical network modeling: challenges and perspectives

CS281 Section 9: Graph Models and Practical MCMC

Hashing. Dr. Ronaldo Menezes Hugo Serrano. Ronaldo Menezes, Florida Tech

Monochromatic loose-cycle partitions in hypergraphs

Math 170- Graph Theory Notes

Random Numbers Random Walk

Choosing a Random Peer

Distributed Construction of a Fault-Tolerant Network from a Tree

Estimating and Sampling Graphs with Multidimensional Random Walks. Group 2: Mingyan Zhao, Chengle Zhang, Biao Yin, Yuchen Liu

Sampling Large Graphs: Algorithms and Applications

Web consists of web pages and hyperlinks between pages. A page receiving many links from other pages may be a hint of the authority of the page

Are we there yet? When to stop a Markov chain while generating random graphs

Lesson 4. Random graphs. Sergio Barbarossa. UPC - Barcelona - July 2008

Subgraph Frequencies:

CS388C: Combinatorics and Graph Theory

Sparsity and homomorphisms of graphs and digraphs

Markov Chain Monte Carlo (part 1)

Bounds on graphs with high girth and high chromatic number

248 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 58, NO. 1, JANUARY 2012

Preferential attachment models and their generalizations

Impact of Clustering on Epidemics in Random Networks

Enumerating the decomposable neighbours of a decomposable graph under a simple perturbation scheme

Data mining --- mining graphs

Math 778S Spectral Graph Theory Handout #2: Basic graph theory

Chapter 11. Network Community Detection

EFFICIENT ALGORITHMS FOR ASSORTATIVE EDGE SWITCH IN LARGE LABELED NETWORKS

caution in interpreting graph-theoretic diagnostics

Sequential Monte Carlo Method for counting vertex covers

Exact Sampling for Hardy- Weinberg Equilibrium

Assessing Data Mining Results via Swap Randomization

Some important models

arxiv: v1 [cs.ds] 20 Dec 2017

Review of Sets. Review. Philippe B. Laval. Current Semester. Kennesaw State University. Philippe B. Laval (KSU) Sets Current Semester 1 / 16

Available online at ScienceDirect. Procedia Computer Science 20 (2013 )

Understanding Disconnection and Stabilization of Chord

Lecture 1: Examples, connectedness, paths and cycles

Expanders via Local Edge Flips

Research Project Brief

Peer-to-peer clustering

Models of Network Formation. Networked Life NETS 112 Fall 2017 Prof. Michael Kearns

Randomized Progressive Edge-Growth (RandPEG)

Quantitative Biology II!

Combinatorics Summary Sheet for Exam 1 Material 2019

HAMILTON CYCLES IN RANDOM LIFTS OF COMPLETE GRAPHS

The critical node problem in stochastic networks with discrete-time Markov chain

The Complexity of Change

CSCI5070 Advanced Topics in Social Computing

Solutions to Exam Data structures (X and NV)

Approximately Counting Embeddings into Random Graphs

Algorithms IV. Dynamic Programming. Guoqiang Li. School of Software, Shanghai Jiao Tong University

Web search before Google. (Taken from Page et al. (1999), The PageRank Citation Ranking: Bringing Order to the Web.)

1 More stochastic block model. Pr(G θ) G=(V, E) 1.1 Model definition. 1.2 Fitting the model to data. Prof. Aaron Clauset 7 November 2013

Fitting Social Network Models Using the Varying Truncation S. Truncation Stochastic Approximation MCMC Algorithm

What Does Robustness Say About Algorithms?

Cooperative Computing for Autonomous Data Centers

Transcription:

Sequential Algorithms for Generating Random Graphs Amin Saberi Stanford University

Outline Generating random graphs with given degrees (joint work with M Bayati and JH Kim) Generating random graphs with large girth (joint work with M Bayati and A Montanari)

Problem Given integers Generate a simple graph with that degree sequence chosen uniformly at random Example 7/3/2008

Application Generating large sparse graphs, with sparse (typically power-law) degree sequence Simulating Internet topology (e.g. Inet) Biological Networks (motif detection) motif: sub-networks with higher frequency than random Coding theory: bipartite graphs with no small stopping sets 7/3/2008

Existing methods (Theory) Markov Chain Monte Carlo method The switch chain It is ``Rapidly mixing : [Kannan-Tetali-Vampala 99] [Cooper-Dyer-GreenHill 05] [Feder-Guetz-S.-Mihail 07] Running time at least O(n 7 ) Jerrum-Sinclair chain (Walk on the self-reducibility tree) Running time at least O(n 4 ) A month on Pentium 2 for n = 1000 7/3/2008

Existing methods (Practice) Lots of heuristics: INET, PEG, Example: Milo et al., Science 2002; Kashtan et al. 2004 on Network Motifs 7/3/2008 The heuristic used for generating random graphs has a substantial bias

New method: Sequential Importance Sampling Very successful in practice: Knuth 76: for counting self-avoiding random walks estimating the running time of heuristics More recently for random graph generation: Chen-Diaconis-Holmes-Liu 05, Blitzstein-Diaconis 05 No analysis! (with the exception of this work and Blanchet 06) 7/3/2008 7

Our Algorithm Repeat remaining degree Same calculation in 10 microseconds degree Add an edge between (i,j) with probab. Until edges are added 43 21 0 or there are no valid choices left 1 0 3 10 2 21 0 2 10 7/3/2008 failure 2 10

Analysis of the Algorithm Theorem 1 (Bayati-Kim-S. 07): The running time of the algorithm is Furthermore, if Or if the degree sequence is regular and Algorithm is successful with probability The probability of generating any graph is where L is the number of graphs 7/3/2008 9

Basic Idea of The tree of execution The empty graph Choosing the k th edge the prob. of choosing a sub-tree should be proportional to the number of valid leaves 7/3/2008

Basic Idea of The tree of execution The empty graph Choosing the k th edge the prob. of choosing a sub-tree should be proportional to the number of valid leaves 7/3/2008

Basic Idea of The empty graph The tree of execution Choosing the k th edge the prob. of choosing a sub-tree should be proportional to the number of valid leaves Technical ingredient: concentration results on the 7/3/2008 distribution of leaves in each sub-tree (improving Kim-Vu 06, McKay-Wormald 91)

Analysis of the Algorithm Theorem 1 (Bayati-Kim-S. 07): The running time of the algorithm is Furthermore, if Or if the degree sequence is regular and Algorithm is successful with probability The probability of generating any graph is where L is the number of graphs 7/3/2008 13

Sequential Importance Sampling Consider a run of the algorithm Let P r be the probability of the edge chosen in step r Define Crucial observation: X is an unbiased estimator for the number of graphs 7/3/2008 14

Using SIS to get an FPRAS By taking several samples of X, we can have a good estimate of L Then using the right rejection sampling: Theorem 2 (Bayati-Kim-S. 07): Can generate any graph with probability of uniform. In time An FPRAS for counting and random generation 7/3/2008 15

Outline Generating random graphs with given degrees (joint work with M Bayati and JH Kim) Generating random graphs with large girth (joint work with M Bayati and A Montanari)

Graphs with large girth Check bits (mod 2) Challenge: Construct graphs with given degrees with no short cycles Amraoui-Montanari-Urbanke 06. 7/3/2008 17

Example: Triangle free graphs. Consider all graphs with n vertices and m edges. Let G be one such graph chosen uniformly at random. Can think of G as Erdös-Renyi graph where = number of triangle sub-graphs of G. Same phase transition holds when we need graphs of girth k. 7/3/2008 18

Our Algorithm Initialize G by an empty graph with vertices Repeat Choose a pair from the set of suitable pairs and set Until m edges are added or there are no suitable pair available (failure) 7/3/2008 19

Theorem (Bayati, Montanari, S. 07) For a suitable P ij and the output distribution of our algorithm is asymptotically uniform. i.e. Output dist. of algorithm Uniform probability Furthermore, the algorithm is successful almost surely. Remark: can be extended to degree sequences applicable to LDPC codes 7/3/2008 20

What is P ij Consider the partially constructed graph G with t edges. Let S be the n n matrix of all suitable pairs. Let A, A c be adjacency matrix of G and its complement. Coordinate-wise multiplication can be calculated quickly (e.g. with MATLAB) 7/3/2008 21

Where does h come from? The execution tree: Problem: estimate the number of valid leaves of each subtree In other words estimate the number of extensions of the partial graph that do not have short cycles 7/3/2008

Where does h come from? add the remaining m - k edges uniformly at random, and compute the expected number of small cycles Y. Assuming the distribution of small cycles is Poisson, the probability of having no small cycles is e -Y 7/3/2008

Summary Random simple graphs with given degrees Random bipartite graphs with given degrees and large girth More extensive analysis of SIS?