Statistical Methods for Network Analysis: Exponential Random Graph Models

Size: px
Start display at page:

Download "Statistical Methods for Network Analysis: Exponential Random Graph Models"

Transcription

1 Day 2: Network Modeling Statistical Methods for Network Analysis: Exponential Random Graph Models NMID workshop September 17 21, 2012 Prof. Martina Morris Prof. Steven Goodreau Supported by the US National Institutes of Health

2 Example: clustering of ties One of the defining features of social networks they are more clustered than a simple random graph Color = grade Color = race Add Health school friendship network Descriptive statistic in the literature: clustering coefficient (the % of all 2-stars that are closed 3-cycles)

3 But why? Friend of a friend, or birds of a feather? Two theories about the process that generates 3cycles: 1. Homophily: People tend to chose friends who are like them, in grade, race, etc. ( birds of a feather ), triad closure is a by-product 2. Transitivity: People who have friends in common tend to become friends ( friend of a friend ), closure is the key process So, for three actors in the same grade Cycle-closing tie forms because of transitivity but also homophily

4 Transitivity and homophily are confounded But not completely Any tie may be : Triangle forming: Grade Pair: Yes No Within Both Homophily Across Transitivity NA Suggests we should be able to disentangle these statistically

5 Generative Model: Basic idea We want to model the probability of a tie as a function of things like: Nodal attributes (that influence degree and mixing) The propensity for certain configurations (like 3-cycles) The ties may be dependent Note that nodal attribute effects do not induce dyad dependence So we model the joint distribution directly

6 Generative Model: Testing Suppose kids have a tendency to become friends with their friends friends, and this is the only generative process occurring. Presumably, this would mean that you would observe more triangles than expected by chance in the graph. How would you test this in an empirical dataset? You begin by observing x triangles. You then determine the probability of observing x or more triangles in a network of your size for which ties are distributed randomly (a random graph ), and see if this is less than 5%. To do this, you might try enumerating all possible networks for a fixed number of nodes, and count their triangles.

7 Example: for 4 nodes: # of dyads is 4*3/2 = 6 # of possible networks = 2 6 = 64 for 10 nodes: # of dyads is 10*9/2 = 45 # of possible networks = trillion for 20 nodes: # of dyads is 20*19/2 = 190 # of possible networks = If one is only interested in calculating probabilities of triangle counts in a random graph, one can use theorems from graph theory. However, we have already seen that other processes (for example, homophily) can affect this statistic as well. Testing arbitrary combinations of generative processes requires a general statistical framework

8 Exponential Random Graph Model (ERGM) Probability of observing a graph (set of relationships) y on a set of nodes X : exp where: z = vector of network statistics = vector of model parameters c = numerator summed over all possible networks on node set X Exponential family model, so has well understood statistical properties Bahadur (1961), Besag (1974), Frank (1986) Very general and flexible

9 ERG Model specification z ( x) The z y terms in the model represent network statistics Counts of network configurations e.g., edges, 2-stars, 3-cycles, ties where nodal attributes match The number of terms can range from minimal (# of edges) to saturated (one term for every dyad) Model specification involves: 1. choosing the set of network statistics 2. choosing the homogeneity constraints on the parameter

10 ERG Model specification: Network Statistics Example models: p1 (Holland and Leinhardt 1981) z(x) I i (x) = number of in-ties for each actor O i (x) = number of out-ties for each actor M(x) = number of mutual dyads (i j, j i) Markov graphs (Frank and Strauss 1986) E(x) = number of ties (undirected) S k (x) = number of k-stars T(x) = number of 3cycles p* / ERGM (Wasserman and Pattison 1996) the most general form

11 Equivalent representation: The conditional probability of a tie This expression for the probability of a network under a given model is equivalent to an expression for the probability of an individual tie conditional on the rest of the network. That is, PY ( y) exp z( y) c can be re expressed as logit(p( Y ij PY ( ij 1 rest of the graph) 1 rest of the graph)) = log PY ( ij 0 rest of the graph) z( y) where zy ( ) represents the change in z y) when Y ij is toggled between 0 and 1

12 ERG Models: Homogeneity Constraints on p1 models have two types of network statistics: degrees, and mutuality no homogeneity constraints on degree parameters every node has a unique parameter, fits nodal degrees exactly homogeneity constraint on the mutuality term so all dyads are homogeneous with respect to reciprocity ERGMs in general have many more network configuration terms but researchers have typically imposed homogeneity constraints on all of these terms (not always a good idea, it turns out) There is also a middle ground: heterogeneity by group # edges # of edges by race # of edges between nodes of the same race # of 3-cycles # of racially homogenous 3-cycles # of racially heterogeneous 3-cycles

13 ERG Model Estimation exp zx ( ) P( X x) exp This is the likelihood function to maximize c - Normalizing constant c makes direct ML estimation of the θ vector impossible - e.g., with 50 nodes, there are graphs - Old approach: pseudolikelihood based on logistic regression approximation Besag 1974, Strauss & Ikeda 1990) - not good when dependence among ties is strong (Geyer and Thompson 1992, Handcock 2003) - MCMC is really the right approach (Geyer and Thompson 1992, Crouch et al. 1998) - theoretically guarantees estimation, but implementation for networks has been a challenge

14 The stumbling block: Model Degeneracy Definition: The graph used to estimate a model is in fact extremely unlikely under this model. Instead, the model places almost all of its probability mass on graphs that bear no resemblance to the original graph (Handcock 2003).

15 What degeneracy looks like The average is exactly what we wanted. All possible 3cycles But none of the graphs will have this average density and clustering. So this model would not have produced a graph with the values we want. And if you try to estimate this model on a graph with those features, it will bounce back and forth between the two regions and not converge.

16 Morals: Network models do not work like linear models. the dyadic dependent terms have highly nonlinear effects Start simple and slowly add complexity does not work here. simple models are most likely degenerate It is going to take some time to develop intuition.

17 ERG Models: new model statistics To capture clustering use a different function of the triad counts 1. Old: t x = # of completed 3-cycles in the graph or C x = 3-cycles as a percent of all 2-stars Here, every 3-cycle here has the same impact, 2. New : Weight the marginal impact of the number of shared partners geometrically weighted edgewise shared partner (GWESP) statistic 1 1 p i = # of ties btwn nodes with i partners in common 3. The more partners in common, the more likely two nodes are linked, but with a declining marginal return (Snijders et al. 2004, Hunter and Handcock 2004)

18 Back to the friendship networks Grade 7 Grade 8 Grade 9 Grade 10 Grade 11 Grade 12 Grade NA White (non-hispanic) Black (non-hispanic) Hispanic (of any race) Asian / Native Am / Other (non-hispanic) Race NA How much of the clustering is due to homophily, and how much to transitivity?

19 Test this by comparing four models Model Edges Network Statistics z(x) # of edges Edges + Attributes (homophily) Edges + GWESP (transitivity) Edges + Attributes + GWESP (both) # of edges # of edges for each race, sex,grade # of edges that are within-race, within-grade, within-sex # of edges weighted shared partners # of edges # of edges for each race, sex,grade # of edges that are within-race, within-grade, within-sex weighted shared partners

20 Testing goodness of fit Traditional GOF stats can be used (AIC, BIC) We also take another approach We are interested in how well we fit aggregate properties of the network structure that we did not include as model terms This helps to identify what the model gets wrong We use 3 higher order statistics: Degree distribution Shared partner distribution (non-parametric) (local clustering) Geodesic distance distribution (global clustering)

21 DATA MODEL MODEL COEFFICIENTS SIMULATED DATA (draws from the prob. dist.) HIGHER ORDER GRAPH STATISTICS OF DATA HIGHER ORDER GRAPH STATISTICS OF SIMULATED DATA GOODNESS OF FIT OF MODEL TO DATA

22 Goodness of fit measure 1: degree distribution Data: node Model: Bernoulli (i.e. edges only)

23 Goodness of fit measure 1: degree distribution Data: node Model: Bernoulli (i.e. edges only)

24 Goodness of fit measure 2: ESP distribution (local clustering) Data: Model: Bernoulli (i.e. edges only) This edge has an ESP value of 3

25 Goodness of fit measure 2: ESP distribution (local clustering) Data: Model: Bernoulli (i.e. edges only) This edge has an ESP value of 3

26 Goodness of fit measure 3: geodesic distribution (global clustering) Data: Model: Bernoulli (i.e. edges only) C A B A/B have geodesic 2 A/C have geodesic

27 Goodness of fit measure 3: geodesic distribution (global clustering) Data: Model: Bernoulli (i.e. edges only) C A B A/B have geodesic 2 A/C have geodesic

28 Goodness of fit measures assembled School 10, Model: Bernoulli degree edgewise shared partner geodesic

29 School 10 degree edgewise shared partner geodesic Model: Edges AIC: Model: Edges + Attributes AIC: Model: Edges + GWESP AIC: Model: Edges + Attributes + GWESP AIC:

30 Attributes + GWESP Model as network size increases n: degree edgewise shared partner geodesic

31 Parameter values: Across all 59 schools Grade 8 Grade 9 Grade 10 Grade 11 Grade 12 Black Hispanic Asian Native Am Other Female Grade 7 Grade 8 Grade 9 Grade 10 Grade 11 Grade 12 White Black Hispanic Asian Native Am Other Sex GWESP Nodal Sociality Attribute Effects parameters Homophily Homophily parameters Transitivity While there is variation across the schools, there are also clear and systematic patterns that are shared. And the homophily effects are generally stronger than the transitivity effects.

32 Transitivity: before and after homophily controls Shared partner parameters homophily coefficients with shared partner Shared partner coefficients with attributes Transitivity estimates fall by nearly 25% once you control for the homophily effect. So some of the observed triad closure is driven by homophily Shared homophily partner coefficients coefficients w/o shared w/o attributes partner

33 Grade mixing: before and after transitivity controls Homophily parameters for grade homophily coefficients with shared partner Grade 7 Grade 8 Grade 9 Grade 10 Grade 11 Grade 12 Grade based homophily estimates fall by about 14% after controlling for transitivity. So part of the observed assortative mixing by grade is driven by friend of a friend homophily coefficients w/o shared partner

34 Race Mixing: before and after transitivity controls Homophily parameters for race homophily coefficients with shared partner White (non-hispanic) Black (non-hispanic) Hispanic (of any race) Asian (non-hispanic) Native American (non-hispanic) Other (non-hispanic) Race based homophily estimates do not consistently fall, and often rise once you control for the transitivity effect. So the assortative mixing here is driven by birds of a feather. Triad closure actually helps to reduce segregation homophily coefficients w/o shared partner

35 Summary Both transitivity and homophily play a role in clustering friendships Homophily alone would generate the distribution of path lengths A simple parametric form of transitivity captures local clustering 25% of the transitivity effect is a by-product of homophily Grade mixing is typically stronger than race mixing but also less robust to the transitivity confound All 4 of these models begin to fail on the largest schools There is more clustering than these models predict Suggests additional sources of heterogeneity Or perhaps an endogeneous fissioning of groups

STATISTICAL METHODS FOR NETWORK ANALYSIS

STATISTICAL METHODS FOR NETWORK ANALYSIS NME Workshop 1 DAY 2: STATISTICAL METHODS FOR NETWORK ANALYSIS Martina Morris, Ph.D. Steven M. Goodreau, Ph.D. Samuel M. Jenness, Ph.D. Supported by the US National Institutes of Health Today we will cover

More information

STATNET WEB THE EASY WAY TO LEARN (OR TEACH) STATISTICAL MODELING OF NETWORK DATA WITH ERGMS

STATNET WEB THE EASY WAY TO LEARN (OR TEACH) STATISTICAL MODELING OF NETWORK DATA WITH ERGMS statnetweb Workshop (Sunbelt 2018) 1 STATNET WEB THE EASY WAY TO LEARN (OR TEACH) STATISTICAL MODELING OF NETWORK DATA WITH ERGMS SUNBELT June 26, 2018 Martina Morris, Ph.D. Skye Bender-deMoll statnet

More information

Statistical Modeling of Complex Networks Within the ERGM family

Statistical Modeling of Complex Networks Within the ERGM family Statistical Modeling of Complex Networks Within the ERGM family Mark S. Handcock Department of Statistics University of Washington U. Washington network modeling group Research supported by NIDA Grant

More information

Transitivity and Triads

Transitivity and Triads 1 / 32 Tom A.B. Snijders University of Oxford May 14, 2012 2 / 32 Outline 1 Local Structure Transitivity 2 3 / 32 Local Structure in Social Networks From the standpoint of structural individualism, one

More information

Exponential Random Graph Models for Social Networks

Exponential Random Graph Models for Social Networks Exponential Random Graph Models for Social Networks ERGM Introduction Martina Morris Departments of Sociology, Statistics University of Washington Departments of Sociology, Statistics, and EECS, and Institute

More information

Fitting Social Network Models Using the Varying Truncation S. Truncation Stochastic Approximation MCMC Algorithm

Fitting Social Network Models Using the Varying Truncation S. Truncation Stochastic Approximation MCMC Algorithm Fitting Social Network Models Using the Varying Truncation Stochastic Approximation MCMC Algorithm May. 17, 2012 1 This talk is based on a joint work with Dr. Ick Hoon Jin Abstract The exponential random

More information

Computational Issues with ERGM: Pseudo-likelihood for constrained degree models

Computational Issues with ERGM: Pseudo-likelihood for constrained degree models Computational Issues with ERGM: Pseudo-likelihood for constrained degree models For details, see: Mark S. Handcock University of California - Los Angeles MURI-UCI June 3, 2011 van Duijn, Marijtje A. J.,

More information

An Introduction to Exponential Random Graph (p*) Models for Social Networks

An Introduction to Exponential Random Graph (p*) Models for Social Networks An Introduction to Exponential Random Graph (p*) Models for Social Networks Garry Robins, Pip Pattison, Yuval Kalish, Dean Lusher, Department of Psychology, University of Melbourne. 22 February 2006. Note:

More information

Exponential Random Graph (p*) Models for Social Networks

Exponential Random Graph (p*) Models for Social Networks Exponential Random Graph (p*) Models for Social Networks Author: Garry Robins, School of Behavioural Science, University of Melbourne, Australia Article outline: Glossary I. Definition II. Introduction

More information

Exponential Random Graph Models Under Measurement Error

Exponential Random Graph Models Under Measurement Error Exponential Random Graph Models Under Measurement Error Zoe Rehnberg Advisor: Dr. Nan Lin Abstract Understanding social networks is increasingly important in a world dominated by social media and access

More information

Exponential Random Graph (p ) Models for Affiliation Networks

Exponential Random Graph (p ) Models for Affiliation Networks Exponential Random Graph (p ) Models for Affiliation Networks A thesis submitted in partial fulfillment of the requirements of the Postgraduate Diploma in Science (Mathematics and Statistics) The University

More information

Instability, Sensitivity, and Degeneracy of Discrete Exponential Families

Instability, Sensitivity, and Degeneracy of Discrete Exponential Families Instability, Sensitivity, and Degeneracy of Discrete Exponential Families Michael Schweinberger Pennsylvania State University ONR grant N00014-08-1-1015 Scalable Methods for the Analysis of Network-Based

More information

University of Groningen

University of Groningen University of Groningen A framework for the comparison of maximum pseudo-likelihood and maximum likelihood estimation of exponential family random graph models van Duijn, Maria; Gile, Krista J.; Handcock,

More information

Alessandro Del Ponte, Weijia Ran PAD 637 Week 3 Summary January 31, Wasserman and Faust, Chapter 3: Notation for Social Network Data

Alessandro Del Ponte, Weijia Ran PAD 637 Week 3 Summary January 31, Wasserman and Faust, Chapter 3: Notation for Social Network Data Wasserman and Faust, Chapter 3: Notation for Social Network Data Three different network notational schemes Graph theoretic: the most useful for centrality and prestige methods, cohesive subgroup ideas,

More information

CHAPTER 3. BUILDING A USEFUL EXPONENTIAL RANDOM GRAPH MODEL

CHAPTER 3. BUILDING A USEFUL EXPONENTIAL RANDOM GRAPH MODEL CHAPTER 3. BUILDING A USEFUL EXPONENTIAL RANDOM GRAPH MODEL Essentially, all models are wrong, but some are useful. Box and Draper (1979, p. 424), as cited in Box and Draper (2007) For decades, network

More information

V2: Measures and Metrics (II)

V2: Measures and Metrics (II) - Betweenness Centrality V2: Measures and Metrics (II) - Groups of Vertices - Transitivity - Reciprocity - Signed Edges and Structural Balance - Similarity - Homophily and Assortative Mixing 1 Betweenness

More information

Bayesian Inference for Exponential Random Graph Models

Bayesian Inference for Exponential Random Graph Models Dublin Institute of Technology ARROW@DIT Articles School of Mathematics 2011 Bayesian Inference for Exponential Random Graph Models Alberto Caimo Dublin Institute of Technology, alberto.caimo@dit.ie Nial

More information

Statistical network modeling: challenges and perspectives

Statistical network modeling: challenges and perspectives Statistical network modeling: challenges and perspectives Harry Crane Department of Statistics Rutgers August 1, 2017 Harry Crane (Rutgers) Network modeling JSM IOL 2017 1 / 25 Statistical network modeling:

More information

Extending ERGM Functionality within statnet: Building Custom User Terms. David R. Hunter Steven M. Goodreau Statnet Development Team

Extending ERGM Functionality within statnet: Building Custom User Terms. David R. Hunter Steven M. Goodreau Statnet Development Team Extending ERGM Functionality within statnet: Building Custom User Terms David R. Hunter Steven M. Goodreau Statnet Development Team Sunbelt 2012 ERGM basic expression Probability of observing a network

More information

Web 2.0 Social Data Analysis

Web 2.0 Social Data Analysis Web 2.0 Social Data Analysis Ing. Jaroslav Kuchař jaroslav.kuchar@fit.cvut.cz Structure (2) Czech Technical University in Prague, Faculty of Information Technologies Software and Web Engineering 2 Contents

More information

Snowball sampling for estimating exponential random graph models for large networks I

Snowball sampling for estimating exponential random graph models for large networks I Snowball sampling for estimating exponential random graph models for large networks I Alex D. Stivala a,, Johan H. Koskinen b,davida.rolls a,pengwang a, Garry L. Robins a a Melbourne School of Psychological

More information

Technical report. Network degree distributions

Technical report. Network degree distributions Technical report Network degree distributions Garry Robins Philippa Pattison Johan Koskinen Social Networks laboratory School of Behavioural Science University of Melbourne 22 April 2008 1 This paper defines

More information

Networks and Algebraic Statistics

Networks and Algebraic Statistics Networks and Algebraic Statistics Dane Wilburne Illinois Institute of Technology UC Davis CACAO Seminar Davis, CA October 4th, 2016 dwilburne@hawk.iit.edu (IIT) Networks and Alg. Stat. Oct. 2016 1 / 23

More information

Algebraic statistics for network models

Algebraic statistics for network models Algebraic statistics for network models Connecting statistics, combinatorics, and computational algebra Part One Sonja Petrović (Statistics Department, Pennsylvania State University) Applied Mathematics

More information

arxiv: v3 [stat.co] 4 May 2017

arxiv: v3 [stat.co] 4 May 2017 Efficient Bayesian inference for exponential random graph models by correcting the pseudo-posterior distribution Lampros Bouranis, Nial Friel, Florian Maire School of Mathematics and Statistics & Insight

More information

Package hergm. R topics documented: January 10, Version Date

Package hergm. R topics documented: January 10, Version Date Version 3.1-0 Date 2016-09-22 Package hergm January 10, 2017 Title Hierarchical Exponential-Family Random Graph Models Author Michael Schweinberger [aut, cre], Mark S.

More information

1 Homophily and assortative mixing

1 Homophily and assortative mixing 1 Homophily and assortative mixing Networks, and particularly social networks, often exhibit a property called homophily or assortative mixing, which simply means that the attributes of vertices correlate

More information

Network Structure and Diffusion

Network Structure and Diffusion Duke Population Research Institute, Duke University, Durham, NC 27705 www.dupri.duke.edu Network Structure and Diffusion James Moody June 2009 PWP-DUKE-2009-004 Duke Population Research Institute On-line

More information

Tie strength, social capital, betweenness and homophily. Rik Sarkar

Tie strength, social capital, betweenness and homophily. Rik Sarkar Tie strength, social capital, betweenness and homophily Rik Sarkar Networks Position of a node in a network determines its role/importance Structure of a network determines its properties 2 Today Notion

More information

Complex-Network Modelling and Inference

Complex-Network Modelling and Inference Complex-Network Modelling and Inference Lecture 8: Graph features (2) Matthew Roughan http://www.maths.adelaide.edu.au/matthew.roughan/notes/ Network_Modelling/ School

More information

The Network Analysis Five-Number Summary

The Network Analysis Five-Number Summary Chapter 2 The Network Analysis Five-Number Summary There is nothing like looking, if you want to find something. You certainly usually find something, if you look, but it is not always quite the something

More information

Generating Similar Graphs from Spherical Features

Generating Similar Graphs from Spherical Features Generating Similar Graphs from Spherical Features Dalton Lunga Department of ECE, Purdue University West Lafayette, IN 4797-235 USA dlunga@purdue.edu Sergey Kirshner Department of Statistics, Purdue University

More information

Rockefeller College University at Albany

Rockefeller College University at Albany Rockefeller College University at Albany Problem Set #7: Handling Egocentric Network Data Adapted from original by Peter V. Marsden, Harvard University Egocentric network data sometimes known as personal

More information

Tied Kronecker Product Graph Models to Capture Variance in Network Populations

Tied Kronecker Product Graph Models to Capture Variance in Network Populations Purdue University Purdue e-pubs Department of Computer Science Technical Reports Department of Computer Science 2012 Tied Kronecker Product Graph Models to Capture Variance in Network Populations Sebastian

More information

Centrality Book. cohesion.

Centrality Book. cohesion. Cohesion The graph-theoretic terms discussed in the previous chapter have very specific and concrete meanings which are highly shared across the field of graph theory and other fields like social network

More information

Assessing Degeneracy in Statistical Models of Social Networks 1

Assessing Degeneracy in Statistical Models of Social Networks 1 Assessing Degeneracy in Statistical Models of Social Networks 1 Mark S. Handcock University of Washington, Seattle Working Paper no. 39 Center for Statistics and the Social Sciences University of Washington

More information

Community Detection as a Method to Control For Homophily in Social Networks

Community Detection as a Method to Control For Homophily in Social Networks Community Detection as a Method to Control For Homophily in Social Networks Hannah Worrall Senior Honors/QSSS Thesis April 30, 2014 Contents 1 Introduction 2 1.1 Background........................... 2

More information

Social networks, social space, social structure

Social networks, social space, social structure Social networks, social space, social structure Pip Pattison Department of Psychology University of Melbourne Sunbelt XXII International Social Network Conference New Orleans, February 22 Social space,

More information

Monte Carlo Maximum Likelihood for Exponential Random Graph Models: From Snowballs to Umbrella Densities

Monte Carlo Maximum Likelihood for Exponential Random Graph Models: From Snowballs to Umbrella Densities Monte Carlo Maximum Likelihood for Exponential Random Graph Models: From Snowballs to Umbrella Densities The Harvard community has made this article openly available. Please share how this access benefits

More information

EpiModel: An R Package for Mathematical Modeling of Infectious Disease over Networks

EpiModel: An R Package for Mathematical Modeling of Infectious Disease over Networks EpiModel: An R Package for Mathematical Modeling of Infectious Disease over Networks June 1, 2017 Samuel M. Jenness Emory University Steven M. Goodreau University of Washington Martina Morris University

More information

Social Networks, Social Interaction, and Outcomes

Social Networks, Social Interaction, and Outcomes Social Networks, Social Interaction, and Outcomes Social Networks and Social Interaction Not all links are created equal: Some links are stronger than others Some links are more useful than others Links

More information

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University http://cs224w.stanford.edu 10/10/2011 Jure Leskovec, Stanford CS224W: Social and Information Network Analysis, http://cs224w.stanford.edu

More information

DAY 2: NETWORK MODELING

DAY 2: NETWORK MODELING DAY 2: NETWORK MODELING NMID WORKSHOP SEPTEMBER 17 21, 2012 PROF. MARTINA MORRIS PROF. STEVEN GOODREAU Supported by the US National Institutes of Health Outline for today 1. Wrap up of compartmental models

More information

Unit 2: Graphs and Matrices. ICPSR University of Michigan, Ann Arbor Summer 2015 Instructor: Ann McCranie

Unit 2: Graphs and Matrices. ICPSR University of Michigan, Ann Arbor Summer 2015 Instructor: Ann McCranie Unit 2: Graphs and Matrices ICPSR University of Michigan, Ann Arbor Summer 2015 Instructor: Ann McCranie Four main ways to notate a social network There are a variety of ways to mathematize a social network,

More information

Tied Kronecker Product Graph Models to Capture Variance in Network Populations

Tied Kronecker Product Graph Models to Capture Variance in Network Populations Tied Kronecker Product Graph Models to Capture Variance in Network Populations Sebastian Moreno, Sergey Kirshner +, Jennifer Neville +, S.V.N. Vishwanathan + Department of Computer Science, + Department

More information

Graph Structure Over Time

Graph Structure Over Time Graph Structure Over Time Observing how time alters the structure of the IEEE data set Priti Kumar Computer Science Rensselaer Polytechnic Institute Troy, NY Kumarp3@rpi.edu Abstract This paper examines

More information

PNet for Dummies. An introduction to estimating exponential random graph (p*) models with PNet. Version Nicholas Harrigan

PNet for Dummies. An introduction to estimating exponential random graph (p*) models with PNet. Version Nicholas Harrigan PNet for Dummies An introduction to estimating exponential random graph (p*) models with PNet Version 1.04 Nicholas Harrigan To download the latest copy of this manual go to: http://www.sna.unimelb.edu.au/pnet/pnet.html#download

More information

1 More configuration model

1 More configuration model 1 More configuration model In the last lecture, we explored the definition of the configuration model, a simple method for drawing networks from the ensemble, and derived some of its mathematical properties.

More information

1 Degree Distributions

1 Degree Distributions Lecture Notes: Social Networks: Models, Algorithms, and Applications Lecture 3: Jan 24, 2012 Scribes: Geoffrey Fairchild and Jason Fries 1 Degree Distributions Last time, we discussed some graph-theoretic

More information

Part II. C. M. Bishop PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 8: GRAPHICAL MODELS

Part II. C. M. Bishop PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 8: GRAPHICAL MODELS Part II C. M. Bishop PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 8: GRAPHICAL MODELS Converting Directed to Undirected Graphs (1) Converting Directed to Undirected Graphs (2) Add extra links between

More information

Small and Other Worlds: Global Network Structures from Local Processes 1

Small and Other Worlds: Global Network Structures from Local Processes 1 Small and Other Worlds: Global Network Structures from Local Processes 1 Garry Robins, Philippa Pattison, and Jodie Woolcock University of Melbourne Using simulation, we contrast global network structures

More information

Preliminary results from an agent-based adaptation of friendship games

Preliminary results from an agent-based adaptation of friendship games Preliminary results from an agent-based adaptation of friendship games David S. Dixon June 29, 2011 This paper presents agent-based model (ABM) equivalents of friendshipbased games and compares the results

More information

Motivation: Shortcomings of Hidden Markov Model. Ko, Youngjoong. Solution: Maximum Entropy Markov Model (MEMM)

Motivation: Shortcomings of Hidden Markov Model. Ko, Youngjoong. Solution: Maximum Entropy Markov Model (MEMM) Motivation: Shortcomings of Hidden Markov Model Maximum Entropy Markov Models and Conditional Random Fields Ko, Youngjoong Dept. of Computer Engineering, Dong-A University Intelligent System Laboratory,

More information

A stochastic agent-based model of pathogen propagation in dynamic multi-relational social networks

A stochastic agent-based model of pathogen propagation in dynamic multi-relational social networks A stochastic agent-based model of pathogen propagation in dynamic multi-relational social networks Bilal Khan, Kirk Dombrowski, and Mohamed Saad, Journal of Transactions of Society Modeling and Simulation

More information

Structured Learning. Jun Zhu

Structured Learning. Jun Zhu Structured Learning Jun Zhu Supervised learning Given a set of I.I.D. training samples Learn a prediction function b r a c e Supervised learning (cont d) Many different choices Logistic Regression Maximum

More information

ECE 6504: Advanced Topics in Machine Learning Probabilistic Graphical Models and Large-Scale Learning

ECE 6504: Advanced Topics in Machine Learning Probabilistic Graphical Models and Large-Scale Learning ECE 6504: Advanced Topics in Machine Learning Probabilistic Graphical Models and Large-Scale Learning Topics Bayes Nets: Inference (Finish) Variable Elimination Graph-view of VE: Fill-edges, induced width

More information

caution in interpreting graph-theoretic diagnostics

caution in interpreting graph-theoretic diagnostics April 17, 2013 What is a network [1, 2, 3] What is a network [1, 2, 3] What is a network [1, 2, 3] What is a network [1, 2, 3] What is a network a collection of more or less identical agents or objects,

More information

Tom A.B. Snijders Christian E.G. Steglich Michael Schweinberger Mark Huisman

Tom A.B. Snijders Christian E.G. Steglich Michael Schweinberger Mark Huisman Manual for SIENA version 3.2 Provisional version Tom A.B. Snijders Christian E.G. Steglich Michael Schweinberger Mark Huisman University of Groningen: ICS, Department of Sociology Grote Rozenstraat 31,

More information

CS281 Section 9: Graph Models and Practical MCMC

CS281 Section 9: Graph Models and Practical MCMC CS281 Section 9: Graph Models and Practical MCMC Scott Linderman November 11, 213 Now that we have a few MCMC inference algorithms in our toolbox, let s try them out on some random graph models. Graphs

More information

Tom A.B. Snijders Christian E.G. Steglich Michael Schweinberger Mark Huisman

Tom A.B. Snijders Christian E.G. Steglich Michael Schweinberger Mark Huisman Manual for SIENA version 3.2 Provisional version Tom A.B. Snijders Christian E.G. Steglich Michael Schweinberger Mark Huisman University of Groningen: ICS, Department of Sociology Grote Rozenstraat 31,

More information

Structural Balance and Transitivity. Social Network Analysis, Chapter 6 Wasserman and Faust

Structural Balance and Transitivity. Social Network Analysis, Chapter 6 Wasserman and Faust Structural Balance and Transitivity Social Network Analysis, Chapter 6 Wasserman and Faust Balance Theory Concerned with how an individual's attitudes or opinions coincide with those of others in a network

More information

Lecture 3: Conditional Independence - Undirected

Lecture 3: Conditional Independence - Undirected CS598: Graphical Models, Fall 2016 Lecture 3: Conditional Independence - Undirected Lecturer: Sanmi Koyejo Scribe: Nate Bowman and Erin Carrier, Aug. 30, 2016 1 Review for the Bayes-Ball Algorithm Recall

More information

Tom A.B. Snijders Christian Steglich Michael Schweinberger Mark Huisman

Tom A.B. Snijders Christian Steglich Michael Schweinberger Mark Huisman Manual for SIENA version 2.1 Tom A.B. Snijders Christian Steglich Michael Schweinberger Mark Huisman ICS, Department of Sociology Grote Rozenstraat 31, 9712 TG Groningen, The Netherlands February 14, 2005

More information

Subgraph Frequencies:

Subgraph Frequencies: Subgraph Frequencies: The Empirical and Extremal Geography of Large Graph Collections Johan Ugander, Lars Backstrom, Jon Kleinberg World Wide Web Conference May 16, 2013 Graph collections Neighborhoods:

More information

Estimation of Item Response Models

Estimation of Item Response Models Estimation of Item Response Models Lecture #5 ICPSR Item Response Theory Workshop Lecture #5: 1of 39 The Big Picture of Estimation ESTIMATOR = Maximum Likelihood; Mplus Any questions? answers Lecture #5:

More information

Chapter 4 Graphs and Matrices. PAD637 Week 3 Presentation Prepared by Weijia Ran & Alessandro Del Ponte

Chapter 4 Graphs and Matrices. PAD637 Week 3 Presentation Prepared by Weijia Ran & Alessandro Del Ponte Chapter 4 Graphs and Matrices PAD637 Week 3 Presentation Prepared by Weijia Ran & Alessandro Del Ponte 1 Outline Graphs: Basic Graph Theory Concepts Directed Graphs Signed Graphs & Signed Directed Graphs

More information

1 More stochastic block model. Pr(G θ) G=(V, E) 1.1 Model definition. 1.2 Fitting the model to data. Prof. Aaron Clauset 7 November 2013

1 More stochastic block model. Pr(G θ) G=(V, E) 1.1 Model definition. 1.2 Fitting the model to data. Prof. Aaron Clauset 7 November 2013 1 More stochastic block model Recall that the stochastic block model (SBM is a generative model for network structure and thus defines a probability distribution over networks Pr(G θ, where θ represents

More information

CS 229 Midterm Review

CS 229 Midterm Review CS 229 Midterm Review Course Staff Fall 2018 11/2/2018 Outline Today: SVMs Kernels Tree Ensembles EM Algorithm / Mixture Models [ Focus on building intuition, less so on solving specific problems. Ask

More information

arxiv: v2 [stat.co] 17 Jan 2013

arxiv: v2 [stat.co] 17 Jan 2013 Bergm: Bayesian Exponential Random Graphs in R A. Caimo, N. Friel National Centre for Geocomputation, National University of Ireland, Maynooth, Ireland Clique Research Cluster, Complex and Adaptive Systems

More information

Algorithmic and Economic Aspects of Networks. Nicole Immorlica

Algorithmic and Economic Aspects of Networks. Nicole Immorlica Algorithmic and Economic Aspects of Networks Nicole Immorlica Syllabus 1. Jan. 8 th (today): Graph theory, network structure 2. Jan. 15 th : Random graphs, probabilistic network formation 3. Jan. 20 th

More information

Simulation of SpatiallyEmbedded Networks

Simulation of SpatiallyEmbedded Networks Simulation of SpatiallyEmbedded Networks Carter T. Butts12 Ryan Acton1, Zack Almquist1 Department of Sociology 2 Institute for Mathematical Behavioral Sciences University of California, Irvine 1 Prepared

More information

#We will get a bit of practice here using some example ego network data #based on a school, with information on grade and race

#We will get a bit of practice here using some example ego network data #based on a school, with information on grade and race ##### #A short tutorial on analyzing ego network data in R #for the Social Networks and Health #workshop at Duke University on May 17, 2016 #In this tutorial we learn some basic functionality #for R with

More information

Computer Vision Group Prof. Daniel Cremers. 4a. Inference in Graphical Models

Computer Vision Group Prof. Daniel Cremers. 4a. Inference in Graphical Models Group Prof. Daniel Cremers 4a. Inference in Graphical Models Inference on a Chain (Rep.) The first values of µ α and µ β are: The partition function can be computed at any node: Overall, we have O(NK 2

More information

Statistical Analysis and Data Mining, 5 (4):

Statistical Analysis and Data Mining, 5 (4): Provided by the author(s) and University College Dublin Library in accordance with publisher policies. Please cite the published version when available. Title Review of Statistical Network Analysis: Models,

More information

Algorithms and Applications in Social Networks. 2017/2018, Semester B Slava Novgorodov

Algorithms and Applications in Social Networks. 2017/2018, Semester B Slava Novgorodov Algorithms and Applications in Social Networks 2017/2018, Semester B Slava Novgorodov 1 Lesson #1 Administrative questions Course overview Introduction to Social Networks Basic definitions Network properties

More information

Recap! CMSC 498J: Social Media Computing. Department of Computer Science University of Maryland Spring Hadi Amiri

Recap! CMSC 498J: Social Media Computing. Department of Computer Science University of Maryland Spring Hadi Amiri Recap! CMSC 498J: Social Media Computing Department of Computer Science University of Maryland Spring 2015 Hadi Amiri hadi@umd.edu Announcement CourseEvalUM https://www.courseevalum.umd.edu/ 2 Announcement

More information

An open software system for the advanced statistical analysis of social networks

An open software system for the advanced statistical analysis of social networks StOCNET An open software system for the advanced statistical analysis of social networks User s Manual version 1.7 February 2006 Groningen: ICS / Science Plus http://stat.gamma.rug.nl/stocnet/ Peter Boer

More information

V4 Matrix algorithms and graph partitioning

V4 Matrix algorithms and graph partitioning V4 Matrix algorithms and graph partitioning - Community detection - Simple modularity maximization - Spectral modularity maximization - Division into more than two groups - Other algorithms for community

More information

Universal Properties of Mythological Networks Midterm report: Math 485

Universal Properties of Mythological Networks Midterm report: Math 485 Universal Properties of Mythological Networks Midterm report: Math 485 Roopa Krishnaswamy, Jamie Fitzgerald, Manuel Villegas, Riqu Huang, and Riley Neal Department of Mathematics, University of Arizona,

More information

SELECTION OF A MULTIVARIATE CALIBRATION METHOD

SELECTION OF A MULTIVARIATE CALIBRATION METHOD SELECTION OF A MULTIVARIATE CALIBRATION METHOD 0. Aim of this document Different types of multivariate calibration methods are available. The aim of this document is to help the user select the proper

More information

This online supplement includes four parts: (1) an introduction to SAB models; (2) an

This online supplement includes four parts: (1) an introduction to SAB models; (2) an Supplemental Materials Till Stress Do Us Part: On the Interplay Between Perceived Stress and Communication Network Dynamics by Y. Kalish et al., 2015, Journal of Applied Psychology http://dx.doi.org/10.1037/apl0000023

More information

An Introduction to Markov Chain Monte Carlo

An Introduction to Markov Chain Monte Carlo An Introduction to Markov Chain Monte Carlo Markov Chain Monte Carlo (MCMC) refers to a suite of processes for simulating a posterior distribution based on a random (ie. monte carlo) process. In other

More information

Generative models for social network data

Generative models for social network data Generative models for social network data Kevin S. Xu (University of Toledo) James R. Foulds (University of California-San Diego) SBP-BRiMS 2016 Tutorial About Us Kevin S. Xu Assistant professor at University

More information

Graphical Models. David M. Blei Columbia University. September 17, 2014

Graphical Models. David M. Blei Columbia University. September 17, 2014 Graphical Models David M. Blei Columbia University September 17, 2014 These lecture notes follow the ideas in Chapter 2 of An Introduction to Probabilistic Graphical Models by Michael Jordan. In addition,

More information

Models of Network Formation. Networked Life NETS 112 Fall 2017 Prof. Michael Kearns

Models of Network Formation. Networked Life NETS 112 Fall 2017 Prof. Michael Kearns Models of Network Formation Networked Life NETS 112 Fall 2017 Prof. Michael Kearns Roadmap Recently: typical large-scale social and other networks exhibit: giant component with small diameter sparsity

More information

Web Structure Mining Community Detection and Evaluation

Web Structure Mining Community Detection and Evaluation Web Structure Mining Community Detection and Evaluation 1 Community Community. It is formed by individuals such that those within a group interact with each other more frequently than with those outside

More information

Lecture 19, August 16, Reading: Eric Xing Eric CMU,

Lecture 19, August 16, Reading: Eric Xing Eric CMU, Machine Learning Application II: Social Network Analysis Eric Xing Lecture 19, August 16, 2010 Reading: Eric Xing Eric Xing @ CMU, 2006-2010 1 Social networks Eric Xing Eric Xing @ CMU, 2006-2010 2 Bio-molecular

More information

Section 7.13: Homophily (or Assortativity) By: Ralucca Gera, NPS

Section 7.13: Homophily (or Assortativity) By: Ralucca Gera, NPS Section 7.13: Homophily (or Assortativity) By: Ralucca Gera, NPS Are hubs adjacent to hubs? How does a node s degree relate to its neighbors degree? Real networks usually show a non-zero degree correlation

More information

Week 5 Video 5. Relationship Mining Network Analysis

Week 5 Video 5. Relationship Mining Network Analysis Week 5 Video 5 Relationship Mining Network Analysis Today s Class Network Analysis Network Analysis Analysis of anything that can be seen as connections between nodes Most common social networks Connections

More information

Source-Destination Connection in Brain Network

Source-Destination Connection in Brain Network Source-Destination Connection in Brain Network May 7, 018 Li Kaijian, 515030910566 Contents 1 Introduction 3 1.1 What is Brain Network...................... 3 1. Different Kinds of Brain Network................

More information

Dynamic Bayesian network (DBN)

Dynamic Bayesian network (DBN) Readings: K&F: 18.1, 18.2, 18.3, 18.4 ynamic Bayesian Networks Beyond 10708 Graphical Models 10708 Carlos Guestrin Carnegie Mellon University ecember 1 st, 2006 1 ynamic Bayesian network (BN) HMM defined

More information

TELCOM2125: Network Science and Analysis

TELCOM2125: Network Science and Analysis School of Information Sciences University of Pittsburgh TELCOM2125: Network Science and Analysis Konstantinos Pelechrinis Spring 2015 Figures are taken from: M.E.J. Newman, Networks: An Introduction 2

More information

Link Prediction for Social Network

Link Prediction for Social Network Link Prediction for Social Network Ning Lin Computer Science and Engineering University of California, San Diego Email: nil016@eng.ucsd.edu Abstract Friendship recommendation has become an important issue

More information

CS224W: Analysis of Networks Jure Leskovec, Stanford University

CS224W: Analysis of Networks Jure Leskovec, Stanford University CS224W: Analysis of Networks Jure Leskovec, Stanford University http://cs224w.stanford.edu 11/13/17 Jure Leskovec, Stanford CS224W: Analysis of Networks, http://cs224w.stanford.edu 2 Observations Models

More information

Online Supplement to Bayesian Simultaneous Edit and Imputation for Multivariate Categorical Data

Online Supplement to Bayesian Simultaneous Edit and Imputation for Multivariate Categorical Data Online Supplement to Bayesian Simultaneous Edit and Imputation for Multivariate Categorical Data Daniel Manrique-Vallier and Jerome P. Reiter August 3, 2016 This supplement includes the algorithm for processing

More information

Estimation of Bilateral Connections in a Network: Copula vs. Maximum Entropy

Estimation of Bilateral Connections in a Network: Copula vs. Maximum Entropy Estimation of Bilateral Connections in a Network: Copula vs. Maximum Entropy Pallavi Baral and Jose Pedro Fique Department of Economics Indiana University at Bloomington 1st Annual CIRANO Workshop on Networks

More information

Maximizing the Spread of Influence through a Social Network. David Kempe, Jon Kleinberg and Eva Tardos

Maximizing the Spread of Influence through a Social Network. David Kempe, Jon Kleinberg and Eva Tardos Maximizing the Spread of Influence through a Social Network David Kempe, Jon Kleinberg and Eva Tardos Group 9 Lauren Thomas, Ryan Lieblein, Joshua Hammock and Mary Hanvey Introduction In a social network,

More information

SNAscript. friend.mat <- as.matrix(read.table("elfriend.dat",header=false,sep=" "))

SNAscript. friend.mat <- as.matrix(read.table(elfriend.dat,header=false,sep= )) SNAscript ##### Loading the Adjacency Matrices and Nodal Attributes ##data can be loaded from: ##https://www.stats.ox.ac.uk/~snijders/siena/lazega_lawyers_data.htm #####Formatting Data into Standard R

More information

CPSC 340: Machine Learning and Data Mining. Principal Component Analysis Fall 2017

CPSC 340: Machine Learning and Data Mining. Principal Component Analysis Fall 2017 CPSC 340: Machine Learning and Data Mining Principal Component Analysis Fall 2017 Assignment 3: 2 late days to hand in tonight. Admin Assignment 4: Due Friday of next week. Last Time: MAP Estimation MAP

More information

Machine Learning. Sourangshu Bhattacharya

Machine Learning. Sourangshu Bhattacharya Machine Learning Sourangshu Bhattacharya Bayesian Networks Directed Acyclic Graph (DAG) Bayesian Networks General Factorization Curve Fitting Re-visited Maximum Likelihood Determine by minimizing sum-of-squares

More information