Statistical network modeling: challenges and perspectives

Size: px
Start display at page:

Download "Statistical network modeling: challenges and perspectives"

Transcription

1 Statistical network modeling: challenges and perspectives Harry Crane Department of Statistics Rutgers August 1, 2017 Harry Crane (Rutgers) Network modeling JSM IOL / 25

2 Statistical network modeling: challenges and perspectives Harry Crane Department of Statistics Rutgers August 1, 2017 Subject of upcoming book on network modeling (Summer 2018) Based on joint work with Walter Dempsey (Harvard) Will speak Thursday, August 4 at 11:35 AM in Room CC-309. Harry Crane (Rutgers) Network modeling JSM IOL / 25

3 Modeling network data: What is it? Where does it come from? Harry Crane (Rutgers) Network modeling JSM IOL / 25

4 Modeling network data: What is it? Where does it come from? Assumed Setting: Population Observed network (sample) Guiding Question: How to model network data in the presence of sampling? Harry Crane (Rutgers) Network modeling JSM IOL / 25

5 Key Points and Further Reading Important directions of future statistics research: 1 Network sampling 2 Network dynamics mostly ignored in the current statistics literature References: H. Crane and W. Dempsey. (2015). A framework for statistical network modeling. arxiv: H. Crane and W. Dempsey. (2016). Edge exchangeable models for interaction networks. Journal of the American Statistical Association. H. Crane. Principles of Statistical Network Modeling (tentative title). Forthcoming, H. Crane. (2017). Exchangeable graph-valued Feller processes. Probability Theory and Related Fields. H. Crane. (2017). Combinatorial Lévy processes. Annals of Applied Probability. Available at Harry Crane (Rutgers) Network modeling JSM IOL / 25

6 Network modeling Conventional Definition: A (parameterized) statistical model is a family of probability distributions M = {P θ : θ Θ}, each defined on the sample space. Population or Sample model? And what s the connection? Population Observed network (sample)??? Model {P θ : θ Θ}??? Problem: How to draw sound inferences about population model based on sampled network? Need to model data in a manner consistent with (i) population model and (ii) sampling mechanism. Harry Crane (Rutgers) Network modeling JSM IOL / 25

7 Network modeling Revised Definition: A (parameterized) statistical model is (i) a family of probability distributions M = {P θ : θ Θ}, each defined on the sample space, along with (ii) a description of the sampling scheme (S n,n ) 1 n N used in observing a sample of each size n. Population Observed network (sample) Y N S n,n (Y N ) Model {P θ : θ Θ}??? Sampling scheme S m,n necessary to establish relationship between observation and population. Sampling mechanism often (almost always) left out of model specification. Harry Crane (Rutgers) Network modeling JSM IOL / 25

8 Preview Will discuss 3 network modeling scenarios: 1 Scenario 1: Vertex sampling ERGMs, graphon models, Aldous Hoover theorem, vertex exchangeability 2 Scenario 2: Edge sampling edge exchangeability, Hollywood model 3 Scenario 3: Time-varying networks Graph-valued Markov chains Modeling network dynamics under sampling Harry Crane (Rutgers) Network modeling JSM IOL / 25

9 Scenario 1: Inference from sampled networks Only n of all N Facebook account users are sampled and their relationships to one another are recorded as an array Y n = (Y ij ) 1 i,j n with { 1, i and j Facebook friends, Y ij = 0, otherwise. We are interested in using information Y n to infer structure of entire Facebook network. Population Y N Sample Y n How should Y n be modeled if we are interested in inferring the structure of relationships among all students in the population Y N? Depends on how Y n was sampled from Y N. Harry Crane (Rutgers) Network modeling JSM IOL / 25

10 Scenario 1: ERGM as population model Given any sufficient statistics (T 1,..., T k ) and parameters (θ 1,..., θ k ), assign probability { k } Pr(Y = y; θ, T ) exp θ i T i (y), y = (y ij ) 1 i,j N {0, 1} N N. i=1 Holland and Leinhardt (1981), Frank and Strauss (1986), Wasserman and Pattison (1996), Wasserman and Faust (1994). Typical approach: Estimate θ by fitting ERGM (θ) to Y n, obtain ˆθ n and use as estimate for θ in population. Population Y N Sample Y n Model ERGM (θ)??? Parameter θ θ Harry Crane (Rutgers) Network modeling JSM IOL / 25

11 Consistency under subsampling for ERGM Theorem (Shalizi Rinaldo) Model for S n,n (Y n ) is ERGM (θ) if and only if sufficient statistics T have separable increments. separable increments: in-degree, out-degree, reciprocity non-separable increments: transitivity (triangles), most higher order statistics Unclear how to proceed with inference for θ based on sample. Population Y N Sample Y n Model ERGM (θ)??? Parameter θ θ Estimate??? ˆθ n Harry Crane (Rutgers) Network modeling JSM IOL / 25

12 Graphon models (Glorified Erdős Rényi model) Originally appeared as φ-processes in Diaconis Freedman (1980), also Aldous (1983). Graphon φ : [0, 1] [0, 1] [0, 1] Associate i.i.d. sequence U 1, U 2,... of Uniform[0, 1] random variables to vertices 1, 2,.... Construct Y N = (Y ij ) 1 i,j N by conditionally independently for all 1 i, j N. Pr(Y ij = 1 U 1, U 2,...) = φ(u i, U j ) (1) Population Y N Sample Y n Model graphon (φ) graphon (φ) Parameter φ φ Estimate ˆφ n ˆφ n Harry Crane (Rutgers) Network modeling JSM IOL / 25

13 Implications of Graphon models Vertex exchangeable: Assign same probability to Theorem (Aldous Hoover) Every vertex exchangeable model for countable population is a mixture of graphon models. Vertex-exchangeability sampled vertices representative of the population of all vertices not realistic in most applications. Cannot explain sparse behavior. Graphon models achieve consistency under subsampling, but vertex exchangeability implies unrealistic sampling scheme and network properties. Harry Crane (Rutgers) Network modeling JSM IOL / 25

14 Scenario 2: Phone calls from a database Entries are sampled uniformly at random from a large database of phone calls (or s). Each observation (C i, R i ) contains identity of the caller C i and receiver R i on the ith sampled call. Interested in inferring the structure of connections among users in the database. Caller Receiver Time of Call (a) (b) 15: (c) (a) 15: (d) (e) 16: (c) (a) 15: Call sequence X 1 = (a, b), X 2 = (c, a), X 3 = (d, e), X 4 = (a, c) induces network:.... Harry Crane (Rutgers) Network modeling JSM IOL / 25

15 Interaction Networks Dataset vertices edges Actor collaborations actors movies Enron corpus employees s Karate club dataset club members social interactions Wikipedia voting Wikipedia admin. votes US Airport airports flights Scientific collaborations scientists articles UC Irvine online community members online messages Political blogs Websites hyperlinks These datasets are driven by interactions Edges are the units not represented as a (vertex-labeled) graph Harry Crane (Rutgers) Network modeling JSM IOL / 25

16 Edge exchangeable models Phone calls are sampled uniformly from the database exchangeable sequence of pairs (C 1, R 1 ), (C 2, R 2 ),.... Assign same probability to Edge exchangeability Size-biased vertex sampling Hollywood model: Easy for estimation, prediction, and testing questions. Sparse with probability 1 for 1/2 < α < 1. Power law with exponent α + 1 for 0 < α < 1. H. Crane and W. Dempsey. (2016). Edge exchangeable models for interaction networks. Journal of the American Statistical Association, to appear. Harry Crane (Rutgers) Network modeling JSM IOL / 25

17 Scenario 3: Dynamics in social networks For each day t = 0, 1, 2,... an array Y(t) = (Y ij (t)) 1 i,j N records interactions among a sample of Twitter users with { 1, i retweeted or liked a tweet by j, Y ij (t) = 0, otherwise. Interested in modeling how past interaction behavior may be indicative of future behavior for the population of all Twitter users. What kinds of Markov models can be fit to this data? Harry Crane (Rutgers) Network modeling JSM IOL / 25

18 Scenario 3: Dynamics in social networks For each day t = 0, 1, 2,... an array Y(t) = (Y ij (t)) 1 i,j N records interactions among a sample of Twitter users with { 1, i retweeted or liked a tweet by j, Y ij (t) = 0, otherwise. Interested in modeling how past interaction behavior may be indicative of future behavior for the population of all Twitter users. What kinds of Markov models can be fit to this data? Harry Crane (Rutgers) Network modeling JSM IOL / 25

19 Illustration: TERGM θ: parameter vector η(θ): natural parameter g(g, G ): sufficient statistic for transition from G to G Observations: P{Γ t+1 = G Γ t = G; θ} exp{η(θ) g(g, G )}. Typically choose g so that g(g, G ) = g(g σ, G σ ) for arbitrary permutations σ : [n] [n]. Not projective in general (Shalizi & Rinaldo, 2013) Robins and Pattison (2001), Hanneke, Fu and Xing (2010), Kravitsky & Handcock (2012) What do exchangeability and projective Markov properties imply about transition behavior? Harry Crane (Rutgers) Network modeling JSM IOL / 25

20 Time-varying network models Basic assumptions: (Y(t)) t 0 is a Markov process satisfying exchangeability: for all σ : N N, (Y σ (t)) t 0 has same finite-dimensional distributions as Y. projective Markov property: (Y(t) [n] ) t 0 is a Markov chain for every n = 1, 2,.... Harry Crane (Rutgers) Network modeling JSM IOL / 25

21 Time-varying network models Basic assumptions: (Y(t)) t 0 is a Markov process satisfying exchangeability: for all σ : N N, (Y σ (t)) t 0 has same finite-dimensional distributions as Y. projective Markov property: (Y(t) [n] ) t 0 is a Markov chain for every n = 1, 2,.... Harry Crane (Rutgers) Network modeling JSM IOL / 25

22 Time-varying network models Basic assumptions: (Y(t)) t 0 is a Markov process satisfying exchangeability: for all σ : N N, (Y σ (t)) t 0 has same finite-dimensional distributions as Y. projective Markov property: (Y(t) [n] ) t 0 is a Markov chain for every n = 1, 2,.... Harry Crane (Rutgers) Network modeling JSM IOL / 25

23 Time-varying network models Basic assumptions: (Y(t)) t 0 is a Markov process satisfying exchangeability: for all σ : N N, (Y σ (t)) t 0 has same finite-dimensional distributions as Y. projective Markov property: (Y(t) [n] ) t 0 is a Markov chain for every n = 1, 2,.... Harry Crane (Rutgers) Network modeling JSM IOL / 25

24 Rewiring processes (Crane, 2015, 2017) Let φ : [0, 1] [0, 1] [0, 1] [0, 1] be a Markovian graphon : Write φ(u, v) = (φ 0 (u, v), φ 1 (u, v)) for (u, v) [0, 1] [0, 1]. Rewiring process: Make a transition G G by generating a transition probability for each pair of vertices: ( ) 1 φ0 (U i, U j ) φ 0 (U i, U j ) 1 φ 1 (U i, U j ) φ 1 (U i, U j ) Rewire every edge according to transition probability determined by φ(u i, U j ). Theorem (Crane (2017)) Every exchangeable, projective Markov chain is a rewiring process. H. Crane. (2015). Time-varying network models. Bernoulli. H. Crane. (2017). Exchangeable graph-valued Feller processes. Probability Theory and Related Fields. H. Crane. (2017+). Combinatorial Lévy processes. Annals of Applied Probability. Harry Crane (Rutgers) Network modeling JSM IOL / 25

25 High Level Takeaways sampling statistical units network rep. model selection vertices graph ERGM edge sampling edges edge-labeled network SBM snowball ego-network graphon traceroute. paths. 1 Sampling scheme a key part of modeling. Sampling scheme often depends on the network itself. 2 Edge sampling, snowball sampling, traceroute sampling all depends on network Sampling affects invariance principles of model affects inference. 3 Vertex exchangeability, edge exchangeability, relative exchangeability, relational exchangeability How are network dynamics affected by sampling? Wide open.. Harry Crane (Rutgers) Network modeling JSM IOL / 25

26 Further reading Network Analysis: E. Kolaczyk. (2009). Statistical Analysis of Network Data: Methods and Models. Network Modeling: H. Crane and W. Dempsey. (2015). A framework for statistical network modeling. arxiv: H. Crane. Principles of Statistical Network Modeling (tentative title). Forthcoming, Goldenberg, Zheng, Fienberg and Airoldi. (2009). A survey of statistical network models. arxiv: Exchangeability: D.J. Aldous. (1983). Exchangeability and Related Topics. H. Crane and W. Dempsey. (2016). Edge exchangeable models for interaction networks. Journal of the American Statistical Association. H. Crane and H. Towsner. (2015). Relative exchangeability. Dynamic networks: H. Crane. (2015). Time-varying network models. Bernoulli. H. Crane. (2017). Exchangeable graph-valued Feller processes. Probability Theory and Related Fields. H. Crane. (2017+). Combinatorial Lévy processes. Annals of Applied Probability. Harry Crane (Rutgers) Network modeling JSM IOL / 25

Transitivity and Triads

Transitivity and Triads 1 / 32 Tom A.B. Snijders University of Oxford May 14, 2012 2 / 32 Outline 1 Local Structure Transitivity 2 3 / 32 Local Structure in Social Networks From the standpoint of structural individualism, one

More information

Statistical Modeling of Complex Networks Within the ERGM family

Statistical Modeling of Complex Networks Within the ERGM family Statistical Modeling of Complex Networks Within the ERGM family Mark S. Handcock Department of Statistics University of Washington U. Washington network modeling group Research supported by NIDA Grant

More information

Statistical Methods for Network Analysis: Exponential Random Graph Models

Statistical Methods for Network Analysis: Exponential Random Graph Models Day 2: Network Modeling Statistical Methods for Network Analysis: Exponential Random Graph Models NMID workshop September 17 21, 2012 Prof. Martina Morris Prof. Steven Goodreau Supported by the US National

More information

Networks and Algebraic Statistics

Networks and Algebraic Statistics Networks and Algebraic Statistics Dane Wilburne Illinois Institute of Technology UC Davis CACAO Seminar Davis, CA October 4th, 2016 dwilburne@hawk.iit.edu (IIT) Networks and Alg. Stat. Oct. 2016 1 / 23

More information

Algebraic statistics for network models

Algebraic statistics for network models Algebraic statistics for network models Connecting statistics, combinatorics, and computational algebra Part One Sonja Petrović (Statistics Department, Pennsylvania State University) Applied Mathematics

More information

Instability, Sensitivity, and Degeneracy of Discrete Exponential Families

Instability, Sensitivity, and Degeneracy of Discrete Exponential Families Instability, Sensitivity, and Degeneracy of Discrete Exponential Families Michael Schweinberger Pennsylvania State University ONR grant N00014-08-1-1015 Scalable Methods for the Analysis of Network-Based

More information

CS281 Section 9: Graph Models and Practical MCMC

CS281 Section 9: Graph Models and Practical MCMC CS281 Section 9: Graph Models and Practical MCMC Scott Linderman November 11, 213 Now that we have a few MCMC inference algorithms in our toolbox, let s try them out on some random graph models. Graphs

More information

Lecture 19, August 16, Reading: Eric Xing Eric CMU,

Lecture 19, August 16, Reading: Eric Xing Eric CMU, Machine Learning Application II: Social Network Analysis Eric Xing Lecture 19, August 16, 2010 Reading: Eric Xing Eric Xing @ CMU, 2006-2010 1 Social networks Eric Xing Eric Xing @ CMU, 2006-2010 2 Bio-molecular

More information

Edge-exchangeable graphs and sparsity

Edge-exchangeable graphs and sparsity Edge-exchangeable graphs and sparsity Tamara Broderick Department of EECS Massachusetts Institute of Technology tbroderick@csail.mit.edu Diana Cai Department of Statistics University of Chicago dcai@uchicago.edu

More information

Intro to Random Graphs and Exponential Random Graph Models

Intro to Random Graphs and Exponential Random Graph Models Intro to Random Graphs and Exponential Random Graph Models Danielle Larcomb University of Denver Danielle Larcomb Random Graphs 1/26 Necessity of Random Graphs The study of complex networks plays an increasingly

More information

An Introduction to Exponential Random Graph (p*) Models for Social Networks

An Introduction to Exponential Random Graph (p*) Models for Social Networks An Introduction to Exponential Random Graph (p*) Models for Social Networks Garry Robins, Pip Pattison, Yuval Kalish, Dean Lusher, Department of Psychology, University of Melbourne. 22 February 2006. Note:

More information

Technical report. Network degree distributions

Technical report. Network degree distributions Technical report Network degree distributions Garry Robins Philippa Pattison Johan Koskinen Social Networks laboratory School of Behavioural Science University of Melbourne 22 April 2008 1 This paper defines

More information

Fitting Social Network Models Using the Varying Truncation S. Truncation Stochastic Approximation MCMC Algorithm

Fitting Social Network Models Using the Varying Truncation S. Truncation Stochastic Approximation MCMC Algorithm Fitting Social Network Models Using the Varying Truncation Stochastic Approximation MCMC Algorithm May. 17, 2012 1 This talk is based on a joint work with Dr. Ick Hoon Jin Abstract The exponential random

More information

arxiv: v1 [cs.si] 11 Jan 2019

arxiv: v1 [cs.si] 11 Jan 2019 A Community-aware Network Growth Model for Synthetic Social Network Generation Furkan Gursoy furkan.gursoy@boun.edu.tr Bertan Badur bertan.badur@boun.edu.tr arxiv:1901.03629v1 [cs.si] 11 Jan 2019 Dept.

More information

Statistical Matching using Fractional Imputation

Statistical Matching using Fractional Imputation Statistical Matching using Fractional Imputation Jae-Kwang Kim 1 Iowa State University 1 Joint work with Emily Berg and Taesung Park 1 Introduction 2 Classical Approaches 3 Proposed method 4 Application:

More information

Random graph models with fixed degree sequences: choices, consequences and irreducibilty proofs for sampling

Random graph models with fixed degree sequences: choices, consequences and irreducibilty proofs for sampling Random graph models with fixed degree sequences: choices, consequences and irreducibilty proofs for sampling Joel Nishimura 1, Bailey K Fosdick 2, Daniel B Larremore 3 and Johan Ugander 4 1 Arizona State

More information

Mathematics of networks. Artem S. Novozhilov

Mathematics of networks. Artem S. Novozhilov Mathematics of networks Artem S. Novozhilov August 29, 2013 A disclaimer: While preparing these lecture notes, I am using a lot of different sources for inspiration, which I usually do not cite in the

More information

Generative models for social network data

Generative models for social network data Generative models for social network data Kevin S. Xu (University of Toledo) James R. Foulds (University of California-San Diego) SBP-BRiMS 2016 Tutorial About Us Kevin S. Xu Assistant professor at University

More information

Recovering Temporally Rewiring Networks: A Model-based Approach

Recovering Temporally Rewiring Networks: A Model-based Approach : A Model-based Approach Fan Guo fanguo@cs.cmu.edu Steve Hanneke shanneke@cs.cmu.edu Wenjie Fu wenjief@cs.cmu.edu Eric P. Xing epxing@cs.cmu.edu School of Computer Science, Carnegie Mellon University,

More information

Discrete Mathematics Course Review 3

Discrete Mathematics Course Review 3 21-228 Discrete Mathematics Course Review 3 This document contains a list of the important definitions and theorems that have been covered thus far in the course. It is not a complete listing of what has

More information

Exponential Random Graph (p*) Models for Social Networks

Exponential Random Graph (p*) Models for Social Networks Exponential Random Graph (p*) Models for Social Networks Author: Garry Robins, School of Behavioural Science, University of Melbourne, Australia Article outline: Glossary I. Definition II. Introduction

More information

10-701/15-781, Fall 2006, Final

10-701/15-781, Fall 2006, Final -7/-78, Fall 6, Final Dec, :pm-8:pm There are 9 questions in this exam ( pages including this cover sheet). If you need more room to work out your answer to a question, use the back of the page and clearly

More information

Package hergm. R topics documented: January 10, Version Date

Package hergm. R topics documented: January 10, Version Date Version 3.1-0 Date 2016-09-22 Package hergm January 10, 2017 Title Hierarchical Exponential-Family Random Graph Models Author Michael Schweinberger [aut, cre], Mark S.

More information

Exponential Random Graph (p ) Models for Affiliation Networks

Exponential Random Graph (p ) Models for Affiliation Networks Exponential Random Graph (p ) Models for Affiliation Networks A thesis submitted in partial fulfillment of the requirements of the Postgraduate Diploma in Science (Mathematics and Statistics) The University

More information

Applied Math Research Showcase: Statistics, Algebra, & Randomization

Applied Math Research Showcase: Statistics, Algebra, & Randomization Applied Math Research Showcase: Statistics, Algebra, & Randomization Sonja Petrović Illinois Institute of Technology April 2017 (math.iit.edu/-spetrov1) Alg Stats, Ntwks, Random Alg April 2017 0 / 15 My

More information

E6885 Network Science Lecture 5: Network Estimation and Modeling

E6885 Network Science Lecture 5: Network Estimation and Modeling E 6885 Topics in Signal Processing -- Network Science E6885 Network Science Lecture 5: Network Estimation and Modeling Ching-Yung Lin, Dept. of Electrical Engineering, Columbia University October 7th,

More information

Extracting Information from Complex Networks

Extracting Information from Complex Networks Extracting Information from Complex Networks 1 Complex Networks Networks that arise from modeling complex systems: relationships Social networks Biological networks Distinguish from random networks uniform

More information

Monte Carlo for Spatial Models

Monte Carlo for Spatial Models Monte Carlo for Spatial Models Murali Haran Department of Statistics Penn State University Penn State Computational Science Lectures April 2007 Spatial Models Lots of scientific questions involve analyzing

More information

Clustering. Mihaela van der Schaar. January 27, Department of Engineering Science University of Oxford

Clustering. Mihaela van der Schaar. January 27, Department of Engineering Science University of Oxford Department of Engineering Science University of Oxford January 27, 2017 Many datasets consist of multiple heterogeneous subsets. Cluster analysis: Given an unlabelled data, want algorithms that automatically

More information

Subgraph Frequencies:

Subgraph Frequencies: Subgraph Frequencies: The Empirical and Extremal Geography of Large Graph Collections Johan Ugander, Lars Backstrom, Jon Kleinberg World Wide Web Conference May 16, 2013 Graph collections Neighborhoods:

More information

Exponential Random Graph Models for Social Networks

Exponential Random Graph Models for Social Networks Exponential Random Graph Models for Social Networks ERGM Introduction Martina Morris Departments of Sociology, Statistics University of Washington Departments of Sociology, Statistics, and EECS, and Institute

More information

Random Simplicial Complexes

Random Simplicial Complexes Random Simplicial Complexes Duke University CAT-School 2015 Oxford 8/9/2015 Part I Random Combinatorial Complexes Contents Introduction The Erdős Rényi Random Graph The Random d-complex The Random Clique

More information

Models for Network Graphs

Models for Network Graphs Models for Network Graphs Gonzalo Mateos Dept. of ECE and Goergen Institute for Data Science University of Rochester gmateosb@ece.rochester.edu http://www.ece.rochester.edu/~gmateosb/ March 29, 2018 Network

More information

Turing Workshop on Statistics of Network Analysis

Turing Workshop on Statistics of Network Analysis Turing Workshop on Statistics of Network Analysis Day 1: 29 May 9:30-10:00 Registration & Coffee 10:00-10:45 Eric Kolaczyk Title: On the Propagation of Uncertainty in Network Summaries Abstract: While

More information

The 2-core of a Non-homogeneous Hypergraph

The 2-core of a Non-homogeneous Hypergraph July 16, 2012 k-cores A Hypergraph G on vertex set V is a collection E of subsets of V. E is the set of hyperedges. For ordinary graphs, e = 2 for all e E. The k-core of a (hyper)graph is the maximal subgraph

More information

M.E.J. Newman: Models of the Small World

M.E.J. Newman: Models of the Small World A Review Adaptive Informatics Research Centre Helsinki University of Technology November 7, 2007 Vocabulary N number of nodes of the graph l average distance between nodes D diameter of the graph d is

More information

An Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization

An Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization An Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization Pedro Ribeiro (DCC/FCUP & CRACS/INESC-TEC) Part 1 Motivation and emergence of Network Science

More information

Exchangeability and continuum limits of discrete random structures

Exchangeability and continuum limits of discrete random structures Exchangeability and continuum limits of discrete random structures August 22, 2010 Introduction Probability measures and random variables Exchangeability and de Finetti s theorem Exchangeability and de

More information

1. a graph G = (V (G), E(G)) consists of a set V (G) of vertices, and a set E(G) of edges (edges are pairs of elements of V (G))

1. a graph G = (V (G), E(G)) consists of a set V (G) of vertices, and a set E(G) of edges (edges are pairs of elements of V (G)) 10 Graphs 10.1 Graphs and Graph Models 1. a graph G = (V (G), E(G)) consists of a set V (G) of vertices, and a set E(G) of edges (edges are pairs of elements of V (G)) 2. an edge is present, say e = {u,

More information

Bayesian Inference for Exponential Random Graph Models

Bayesian Inference for Exponential Random Graph Models Dublin Institute of Technology ARROW@DIT Articles School of Mathematics 2011 Bayesian Inference for Exponential Random Graph Models Alberto Caimo Dublin Institute of Technology, alberto.caimo@dit.ie Nial

More information

University of Toronto Department of Mathematics

University of Toronto Department of Mathematics University of Toronto Department of Mathematics MAT332H1F, Graph Theory Midterm, October 21, 2014 Instructor: Kasra Rafi First Last Student Number Instructions: No aids allowed. Write solutions on the

More information

arxiv: v2 [math.st] 16 Sep 2016

arxiv: v2 [math.st] 16 Sep 2016 On the Geometry and Extremal Properties of the Edge-Degeneracy Model Nicolas Kim Dane Wilburne Sonja Petrović Alessandro Rinaldo arxiv:160.00180v [math.st] 16 Sep 016 Abstract The edge-degeneracy model

More information

Understanding Disconnection and Stabilization of Chord

Understanding Disconnection and Stabilization of Chord Understanding Disconnection and Stabilization of Chord Zhongmei Yao Joint work with Dmitri Loguinov Internet Research Lab Department of Computer Science Texas A&M University, College Station, TX 77843

More information

Snowball sampling for estimating exponential random graph models for large networks I

Snowball sampling for estimating exponential random graph models for large networks I Snowball sampling for estimating exponential random graph models for large networks I Alex D. Stivala a,, Johan H. Koskinen b,davida.rolls a,pengwang a, Garry L. Robins a a Melbourne School of Psychological

More information

Sequential Algorithms for Generating Random Graphs. Amin Saberi Stanford University

Sequential Algorithms for Generating Random Graphs. Amin Saberi Stanford University Sequential Algorithms for Generating Random Graphs Amin Saberi Stanford University Outline Generating random graphs with given degrees (joint work with M Bayati and JH Kim) Generating random graphs with

More information

Algorithms and Applications in Social Networks. 2017/2018, Semester B Slava Novgorodov

Algorithms and Applications in Social Networks. 2017/2018, Semester B Slava Novgorodov Algorithms and Applications in Social Networks 2017/2018, Semester B Slava Novgorodov 1 Lesson #1 Administrative questions Course overview Introduction to Social Networks Basic definitions Network properties

More information

Bayesian Fused Lasso regression for dynamic binary networks

Bayesian Fused Lasso regression for dynamic binary networks Bayesian Fused Lasso regression for dynamic binary networks arxiv:1710.01369v1 [stat.ap] 3 Oct 017 Brenda Betancourt Department of Statistics, UC Santa Cruz and Abel Rodríguez Department of Statistics,

More information

Impact of Clustering on Epidemics in Random Networks

Impact of Clustering on Epidemics in Random Networks Impact of Clustering on Epidemics in Random Networks Joint work with Marc Lelarge INRIA-ENS 8 March 2012 Coupechoux - Lelarge (INRIA-ENS) Epidemics in Random Networks 8 March 2012 1 / 19 Outline 1 Introduction

More information

Network Centrality. Saptarshi Ghosh Department of CSE, IIT Kharagpur Social Computing course, CS60017

Network Centrality. Saptarshi Ghosh Department of CSE, IIT Kharagpur Social Computing course, CS60017 Network Centrality Saptarshi Ghosh Department of CSE, IIT Kharagpur Social Computing course, CS60017 Node centrality n Relative importance of a node in a network n How influential a person is within a

More information

Tied Kronecker Product Graph Models to Capture Variance in Network Populations

Tied Kronecker Product Graph Models to Capture Variance in Network Populations Purdue University Purdue e-pubs Department of Computer Science Technical Reports Department of Computer Science 2012 Tied Kronecker Product Graph Models to Capture Variance in Network Populations Sebastian

More information

Simplicial Complexes of Networks and Their Statistical Properties

Simplicial Complexes of Networks and Their Statistical Properties Simplicial Complexes of Networks and Their Statistical Properties Slobodan Maletić, Milan Rajković*, and Danijela Vasiljević Institute of Nuclear Sciences Vinča, elgrade, Serbia *milanr@vin.bg.ac.yu bstract.

More information

Generating Similar Graphs from Spherical Features

Generating Similar Graphs from Spherical Features Generating Similar Graphs from Spherical Features Dalton Lunga Department of ECE, Purdue University West Lafayette, IN 4797-235 USA dlunga@purdue.edu Sergey Kirshner Department of Statistics, Purdue University

More information

Wednesday, March 8, Complex Networks. Presenter: Jirakhom Ruttanavakul. CS 790R, University of Nevada, Reno

Wednesday, March 8, Complex Networks. Presenter: Jirakhom Ruttanavakul. CS 790R, University of Nevada, Reno Wednesday, March 8, 2006 Complex Networks Presenter: Jirakhom Ruttanavakul CS 790R, University of Nevada, Reno Presented Papers Emergence of scaling in random networks, Barabási & Bonabeau (2003) Scale-free

More information

Network cross-validation by edge sampling

Network cross-validation by edge sampling Network cross-validation by edge sampling Abstract Many models and methods are now available for network analysis, but model selection and tuning remain challenging. Cross-validation is a useful general

More information

Randomized Rumor Spreading in Social Networks

Randomized Rumor Spreading in Social Networks Randomized Rumor Spreading in Social Networks Benjamin Doerr (MPI Informatics / Saarland U) Summary: We study how fast rumors spread in social networks. For the preferential attachment network model and

More information

On Asymptotic Cost of Triangle Listing in Random Graphs

On Asymptotic Cost of Triangle Listing in Random Graphs On Asymptotic Cost of Triangle Listing in Random Graphs Di Xiao, Yi Cui, Daren B.H. Cline, Dmitri Loguinov Internet Research Lab Department of Computer Science and Engineering Texas A&M University May

More information

Homework 4: Clustering, Recommenders, Dim. Reduction, ML and Graph Mining (due November 19 th, 2014, 2:30pm, in class hard-copy please)

Homework 4: Clustering, Recommenders, Dim. Reduction, ML and Graph Mining (due November 19 th, 2014, 2:30pm, in class hard-copy please) Virginia Tech. Computer Science CS 5614 (Big) Data Management Systems Fall 2014, Prakash Homework 4: Clustering, Recommenders, Dim. Reduction, ML and Graph Mining (due November 19 th, 2014, 2:30pm, in

More information

Extending the Rash model to a multiclass parametric network model

Extending the Rash model to a multiclass parametric network model Proceedings of the st International Conference and Exhibition on Future RFID Technologies Eszterhazy Karoly University of Applied Sciences and Bay Zoltán Nonprofit Ltd. for Applied Research Eger, Hungary,

More information

DYNAMIC NETWORK ANALYSIS WITH MISSING DATA: THEORY AND METHODS

DYNAMIC NETWORK ANALYSIS WITH MISSING DATA: THEORY AND METHODS Statistica Sinica 28 (2018), 1245-1264 doi:https://doi.org/10.5705/ss.202016.0108 DYNAMIC NETWORK ANALYSIS WITH MISSING DATA: THEORY AND METHODS Zack W. Almquist and Carter T. Butts University of Minnesota

More information

Models of Network Formation. Networked Life NETS 112 Fall 2017 Prof. Michael Kearns

Models of Network Formation. Networked Life NETS 112 Fall 2017 Prof. Michael Kearns Models of Network Formation Networked Life NETS 112 Fall 2017 Prof. Michael Kearns Roadmap Recently: typical large-scale social and other networks exhibit: giant component with small diameter sparsity

More information

A local structure graph model for network analysis

A local structure graph model for network analysis Graduate Theses and Dissertations Graduate College 2014 A local structure graph model for network analysis Emily Taylor Casleton Iowa State University Follow this and additional works at: http://lib.dr.iastate.edu/etd

More information

Stat 547 Assignment 3

Stat 547 Assignment 3 Stat 547 Assignment 3 Release Date: Saturday April 16, 2011 Due Date: Wednesday, April 27, 2011 at 4:30 PST Note that the deadline for this assignment is one day before the final project deadline, and

More information

Branching Distributional Equations and their Applications

Branching Distributional Equations and their Applications Branching Distributional Equations and their Applications Mariana Olvera-Cravioto UNC Chapel Hill molvera@unc.edu August 22nd, 2018 Bath-UNAM-CMAT, Lecture 3 Branching Distributional Equations and their

More information

05 - Surfaces. Acknowledgements: Olga Sorkine-Hornung. CSCI-GA Geometric Modeling - Daniele Panozzo

05 - Surfaces. Acknowledgements: Olga Sorkine-Hornung. CSCI-GA Geometric Modeling - Daniele Panozzo 05 - Surfaces Acknowledgements: Olga Sorkine-Hornung Reminder Curves Turning Number Theorem Continuous world Discrete world k: Curvature is scale dependent is scale-independent Discrete Curvature Integrated

More information

Computational Issues with ERGM: Pseudo-likelihood for constrained degree models

Computational Issues with ERGM: Pseudo-likelihood for constrained degree models Computational Issues with ERGM: Pseudo-likelihood for constrained degree models For details, see: Mark S. Handcock University of California - Los Angeles MURI-UCI June 3, 2011 van Duijn, Marijtje A. J.,

More information

Convexization in Markov Chain Monte Carlo

Convexization in Markov Chain Monte Carlo in Markov Chain Monte Carlo 1 IBM T. J. Watson Yorktown Heights, NY 2 Department of Aerospace Engineering Technion, Israel August 23, 2011 Problem Statement MCMC processes in general are governed by non

More information

Bayesian Inference for Sample Surveys

Bayesian Inference for Sample Surveys Bayesian Inference for Sample Surveys Trivellore Raghunathan (Raghu) Director, Survey Research Center Professor of Biostatistics University of Michigan Distinctive features of survey inference 1. Primary

More information

SOS3003 Applied data analysis for social science Lecture note Erling Berge Department of sociology and political science NTNU.

SOS3003 Applied data analysis for social science Lecture note Erling Berge Department of sociology and political science NTNU. SOS3003 Applied data analysis for social science Lecture note 04-2009 Erling Berge Department of sociology and political science NTNU Erling Berge 2009 1 Missing data Literature Allison, Paul D 2002 Missing

More information

Chapter 1. Social Media and Social Computing. October 2012 Youn-Hee Han

Chapter 1. Social Media and Social Computing. October 2012 Youn-Hee Han Chapter 1. Social Media and Social Computing October 2012 Youn-Hee Han http://link.koreatech.ac.kr 1.1 Social Media A rapid development and change of the Web and the Internet Participatory web application

More information

A noninformative Bayesian approach to small area estimation

A noninformative Bayesian approach to small area estimation A noninformative Bayesian approach to small area estimation Glen Meeden School of Statistics University of Minnesota Minneapolis, MN 55455 glen@stat.umn.edu September 2001 Revised May 2002 Research supported

More information

Random Generation of the Social Network with Several Communities

Random Generation of the Social Network with Several Communities Communications of the Korean Statistical Society 2011, Vol. 18, No. 5, 595 601 DOI: http://dx.doi.org/10.5351/ckss.2011.18.5.595 Random Generation of the Social Network with Several Communities Myung-Hoe

More information

STATISTICAL METHODS FOR NETWORK ANALYSIS

STATISTICAL METHODS FOR NETWORK ANALYSIS NME Workshop 1 DAY 2: STATISTICAL METHODS FOR NETWORK ANALYSIS Martina Morris, Ph.D. Steven M. Goodreau, Ph.D. Samuel M. Jenness, Ph.D. Supported by the US National Institutes of Health Today we will cover

More information

Topology Enhancement in Wireless Multihop Networks: A Top-down Approach

Topology Enhancement in Wireless Multihop Networks: A Top-down Approach Topology Enhancement in Wireless Multihop Networks: A Top-down Approach Symeon Papavassiliou (joint work with Eleni Stai and Vasileios Karyotis) National Technical University of Athens (NTUA) School of

More information

1 Degree Distributions

1 Degree Distributions Lecture Notes: Social Networks: Models, Algorithms, and Applications Lecture 3: Jan 24, 2012 Scribes: Geoffrey Fairchild and Jason Fries 1 Degree Distributions Last time, we discussed some graph-theoretic

More information

Algorithms, Games, and Networks February 21, Lecture 12

Algorithms, Games, and Networks February 21, Lecture 12 Algorithms, Games, and Networks February, 03 Lecturer: Ariel Procaccia Lecture Scribe: Sercan Yıldız Overview In this lecture, we introduce the axiomatic approach to social choice theory. In particular,

More information

3 : Representation of Undirected GMs

3 : Representation of Undirected GMs 0-708: Probabilistic Graphical Models 0-708, Spring 202 3 : Representation of Undirected GMs Lecturer: Eric P. Xing Scribes: Nicole Rafidi, Kirstin Early Last Time In the last lecture, we discussed directed

More information

1 More stochastic block model. Pr(G θ) G=(V, E) 1.1 Model definition. 1.2 Fitting the model to data. Prof. Aaron Clauset 7 November 2013

1 More stochastic block model. Pr(G θ) G=(V, E) 1.1 Model definition. 1.2 Fitting the model to data. Prof. Aaron Clauset 7 November 2013 1 More stochastic block model Recall that the stochastic block model (SBM is a generative model for network structure and thus defines a probability distribution over networks Pr(G θ, where θ represents

More information

Alessandro Del Ponte, Weijia Ran PAD 637 Week 3 Summary January 31, Wasserman and Faust, Chapter 3: Notation for Social Network Data

Alessandro Del Ponte, Weijia Ran PAD 637 Week 3 Summary January 31, Wasserman and Faust, Chapter 3: Notation for Social Network Data Wasserman and Faust, Chapter 3: Notation for Social Network Data Three different network notational schemes Graph theoretic: the most useful for centrality and prestige methods, cohesive subgroup ideas,

More information

Social networks, social space, social structure

Social networks, social space, social structure Social networks, social space, social structure Pip Pattison Department of Psychology University of Melbourne Sunbelt XXII International Social Network Conference New Orleans, February 22 Social space,

More information

DS504/CS586: Big Data Analytics Graph Mining Prof. Yanhua Li

DS504/CS586: Big Data Analytics Graph Mining Prof. Yanhua Li Welcome to DS504/CS586: Big Data Analytics Graph Mining Prof. Yanhua Li Time: 6:00pm 8:50pm R Location: AK232 Fall 2016 Graph Data: Social Networks Facebook social graph 4-degrees of separation [Backstrom-Boldi-Rosa-Ugander-Vigna,

More information

TWITTER USE IN ELECTION CAMPAIGNS: TECHNICAL APPENDIX. Jungherr, Andreas. (2016). Twitter Use in Election Campaigns: A Systematic Literature

TWITTER USE IN ELECTION CAMPAIGNS: TECHNICAL APPENDIX. Jungherr, Andreas. (2016). Twitter Use in Election Campaigns: A Systematic Literature 1 Jungherr, Andreas. (2016). Twitter Use in Election Campaigns: A Systematic Literature Review. Journal of Information Technology & Politics. (Forthcoming). Technical Appendix Coding Process 1. Keyword

More information

arxiv: v1 [stat.ml] 22 Mar 2016

arxiv: v1 [stat.ml] 22 Mar 2016 Completely random measures for modeling power laws in sparse graphs arxiv:1603.06915v1 [stat.ml] 22 Mar 2016 Diana Cai Department of Statistics University of Chicago dcai@uchicago.edu Tamara Broderick

More information

Overlay (and P2P) Networks

Overlay (and P2P) Networks Overlay (and P2P) Networks Part II Recap (Small World, Erdös Rényi model, Duncan Watts Model) Graph Properties Scale Free Networks Preferential Attachment Evolving Copying Navigation in Small World Samu

More information

University of Cambridge Engineering Part IIB Paper 4F10: Statistical Pattern Processing Handout 11: Non-Parametric Techniques.

University of Cambridge Engineering Part IIB Paper 4F10: Statistical Pattern Processing Handout 11: Non-Parametric Techniques. . Non-Parameteric Techniques University of Cambridge Engineering Part IIB Paper 4F: Statistical Pattern Processing Handout : Non-Parametric Techniques Mark Gales mjfg@eng.cam.ac.uk Michaelmas 23 Introduction

More information

Structural Balance and Transitivity. Social Network Analysis, Chapter 6 Wasserman and Faust

Structural Balance and Transitivity. Social Network Analysis, Chapter 6 Wasserman and Faust Structural Balance and Transitivity Social Network Analysis, Chapter 6 Wasserman and Faust Balance Theory Concerned with how an individual's attitudes or opinions coincide with those of others in a network

More information

Structure of Social Networks

Structure of Social Networks Structure of Social Networks Outline Structure of social networks Applications of structural analysis Social *networks* Twitter Facebook Linked-in IMs Email Real life Address books... Who Twitter #numbers

More information

Research Proposal. Guo, He. November 3rd, 2016

Research Proposal. Guo, He. November 3rd, 2016 1 Research Proposal Guo, He November 3rd, 2016 1 Introduction Can the special structure possessed by social networks be exploited algorithmically? Answering this question requires a formal definition of

More information

Random Simplicial Complexes

Random Simplicial Complexes Random Simplicial Complexes Duke University CAT-School 2015 Oxford 9/9/2015 Part II Random Geometric Complexes Contents Probabilistic Ingredients Random Geometric Graphs Definitions Random Geometric Complexes

More information

TRANSITIVE GRAPHS UNIQUELY DETERMINED BY THEIR LOCAL STRUCTURE

TRANSITIVE GRAPHS UNIQUELY DETERMINED BY THEIR LOCAL STRUCTURE TRANSITIVE GRAPHS UNIQUELY DETERMINED BY THEIR LOCAL STRUCTURE JOSHUA FRISCH AND OMER TAMUZ Abstract. We give an example of an infinite, vertex transitive graph that has the following property: it is the

More information

Minimal Universal Bipartite Graphs

Minimal Universal Bipartite Graphs Minimal Universal Bipartite Graphs Vadim V. Lozin, Gábor Rudolf Abstract A graph U is (induced)-universal for a class of graphs X if every member of X is contained in U as an induced subgraph. We study

More information

Convex and Distributed Optimization. Thomas Ropars

Convex and Distributed Optimization. Thomas Ropars >>> Presentation of this master2 course Convex and Distributed Optimization Franck Iutzeler Jérôme Malick Thomas Ropars Dmitry Grishchenko from LJK, the applied maths and computer science laboratory and

More information

Structured Learning. Jun Zhu

Structured Learning. Jun Zhu Structured Learning Jun Zhu Supervised learning Given a set of I.I.D. training samples Learn a prediction function b r a c e Supervised learning (cont d) Many different choices Logistic Regression Maximum

More information

When Network Embedding meets Reinforcement Learning?

When Network Embedding meets Reinforcement Learning? When Network Embedding meets Reinforcement Learning? ---Learning Combinatorial Optimization Problems over Graphs Changjun Fan 1 1. An Introduction to (Deep) Reinforcement Learning 2. How to combine NE

More information

Behavioral Data Mining. Lecture 9 Modeling People

Behavioral Data Mining. Lecture 9 Modeling People Behavioral Data Mining Lecture 9 Modeling People Outline Power Laws Big-5 Personality Factors Social Network Structure Power Laws Y-axis = frequency of word, X-axis = rank in decreasing order Power Laws

More information

DS504/CS586: Big Data Analytics Graph Mining Prof. Yanhua Li

DS504/CS586: Big Data Analytics Graph Mining Prof. Yanhua Li Welcome to DS504/CS586: Big Data Analytics Graph Mining Prof. Yanhua Li Time: 6:00pm 8:50pm R Location: AK 233 Spring 2018 Service Providing Improve urban planning, Ease Traffic Congestion, Save Energy,

More information

Networks in economics and finance. Lecture 1 - Measuring networks

Networks in economics and finance. Lecture 1 - Measuring networks Networks in economics and finance Lecture 1 - Measuring networks What are networks and why study them? A network is a set of items (nodes) connected by edges or links. Units (nodes) Individuals Firms Banks

More information

Graceful Labeling for Cycle of Graphs

Graceful Labeling for Cycle of Graphs International Journal of Mathematics Research. ISSN 0976-5840 Volume 6, Number (014), pp. 173 178 International Research Publication House http://www.irphouse.com Graceful Labeling for Cycle of Graphs

More information

6. Overview. L3S Research Center, University of Hannover. 6.1 Section Motivation. Investigation of structural aspects of peer-to-peer networks

6. Overview. L3S Research Center, University of Hannover. 6.1 Section Motivation. Investigation of structural aspects of peer-to-peer networks , University of Hannover Random Graphs, Small-Worlds, and Scale-Free Networks Wolf-Tilo Balke and Wolf Siberski 05.12.07 * Original slides provided by K.A. Lehmann (University Tübingen, Germany) 6. Overview

More information

Lecture 27: Learning from relational data

Lecture 27: Learning from relational data Lecture 27: Learning from relational data STATS 202: Data mining and analysis December 2, 2017 1 / 12 Announcements Kaggle deadline is this Thursday (Dec 7) at 4pm. If you haven t already, make a submission

More information

Simulating from the Polya posterior by Glen Meeden, March 06

Simulating from the Polya posterior by Glen Meeden, March 06 1 Introduction Simulating from the Polya posterior by Glen Meeden, glen@stat.umn.edu March 06 The Polya posterior is an objective Bayesian approach to finite population sampling. In its simplest form it

More information