Section 7.13: Homophily (or Assortativity) By: Ralucca Gera, NPS

Size: px
Start display at page:

Download "Section 7.13: Homophily (or Assortativity) By: Ralucca Gera, NPS"

Transcription

1 Section 7.13: Homophily (or Assortativity) By: Ralucca Gera, NPS

2 Are hubs adjacent to hubs? How does a node s degree relate to its neighbors degree? Real networks usually show a non-zero degree correlation (defined later compared to random). If it is positive, the network is said to have assortatively mixed degrees (assortativity based other attributes can also be considered). If it is negative, it is disassortatively mixed. According to Newman, social networks tend to be assortatively mixed, while other kinds of networks are generally disassortatively mixed. 2

3 Partitioning the network Sociologists have observed network partitioning based on the following characteristics: Friendships, aquintances, business relationships Relationships based on certain characteristics: Age Nationality Language Education Income level Homophily: It is the tendency of individuals to choose friends with similar characteristic. Like links with like.

4 Example of homophily 4

5 Assortativity by race James Moody 5

6 Friendship networks at a US highschool Homophily or Assortative mixing Homophily (or assortative mixing) is the strong tendency of people to associate with others whom they perceive are similar to themselves in some way Disassortative mixing is the tendency for people to associate with others who are unlike them (such as gender in sexual contact networks, where the majority of partnerships are between opposite sex individuals) 6

7 Assortative mixing Titter data: political retweet network Red = Republicans Blue = Democrats Note that they mostly tweet and re-tweet to each other Conover et at.,

8 Homophily (or Assortative mixing) Homophily or assortativity is a common property of social networks (but not necessary): Papers in citation networks tend to cite papers in the same field Websites tend to point to websites in the same language Political views Race Obesity 8

9 Homophily in Gephi here 9

10 Homophily in Python In NetworkX, to check degree Assortativity: r = nx.attribute_assortativity_coefficient(g) To check an attribute s assortativity: assortivity_val=nx.attribute_assortativity_coefficient(g, "color ) The attribute color can be replaced by other attributes that your data was tagged with. 10

11 Disassortative Disassortative mixing: like links with dislike. Dissasortative networks are the ones in which adjacent nodes tend to be dissimilar: Dating network (females/males) Food web (predator/prey) Economic networks (producers/consumers) 11

12 Assortative mixing (homophily) We will study two types of assortative mixing: 1. Based on enumerative characteristics (the characteristics don t fall in any particular order), such as: 1. Nationality 2. Race 3. Gender 2. Based on scalar characteristics, such as: 1. Age 2. Income 3. By degree: high degree connect to high degree 12

13 Based on enumerative characteristics (characteristics that don t fall in any particular order), such as: 1. Nationality 2. Race 3. Gender 13

14 Possible defn assortativity A network is assortative if there is a significant fraction of the edges between same-type vertices How to quantify how assortative the network is? Let be the assortative class of vertex i, S = # Then: What is the assortativity if we consider V(G) as the only class? Does it make sense? 14

15 Alternative definitions Alternate defintions, as before, will allow to compare the Assortativity of the current network to the one of a random graph: Compute the fraction of edges in in the given network Compute the fraction of edges in in a random graph Consider their difference 15

16 Compute the fraction of edges in in the given network Let be the class of vertex i Let be the total number of classes Let ) = be the Kronecker that accounts for vertices in the same class. Then the number of edges of the same type is: Checks if vertices are in the same class 16

17 Compute the fraction of edges in in the random network Construct a random graph with the same degree distrib. Let be the class of vertex i Let be the total number of classes Let ( ) = be the Kronecker Pick an arbitrary edge in the random graph: If pick a vertex i there are deg edges incident with it, so deg choices for i to be the 1st end vertex of our arbitrary edge and then there are deg choices for j to be the other end edges are placed at random, the expected number of edges between i and j is # of vertices at the end of the E(G) edges Checks if vertices are in the same class 17

18 Let be the class of vertex i Let be the total number of classes Compute the fraction of edges in in the random network Let ( ) = be the Kronecker Checks if vertices are in the same class The expected number of edges between i and j if m edges are placed at random is Then the number of edges of the same type is: duplications: as you choose vertex j above, the edge ji will be counted after edge ij was counted 18

19 Consider their difference - = Checks if vertices are in the same class And now normalize by m = E(G) : Modularity = Which is the same defn as the modularity in community detection (assortativity based on deg). 19

20 Q =, Modularity Checks if vertices are in the same class Measure used to quantify the like vertices being connected to like vertices -1 < Q < 0 means there are fewer edges between like vertices in a class compared to a random network i.e. disassortative network 0 < Q < 1 means there are more edges between like vertices in a class compared to a random network i.e. assortative network Q = 0 means it behaves like a random network. 20

21 Ranges of the modularity values Range: -1 < Q < 1 However, it never reaches 1 (we ll see shortly), even in a perfectly assortative network (where all like nodes are adjacent) The modularity is generally far from 1 So what is a good modularity value (Q >?) that says the network is assortative? Should we normalize Q again? What are some choices? 21

22 Enumerative characteristics We normalize the modularity value Q, by the maximum value that it can get Perfect mixing is when all edges fall between vertices of the same type 1 Q max 1 2m (2m k k i j ij 2m (c i,c j )) Then, the assortativity coefficient is given by: Q ( Aij kk i j /2 m) ( ci, c ) ij j 1 1 Q 2 m ( kk /2 m) ( c, c ) max ij i j i j 22

23 Alternative form for edge list An alternative faster to compute formula, if the network is given by a list of edges rather than adjacency matrix: Let be the fraction of 2m A ij(c i,r)(c j, s) ij edges that join vertices of type to vertices of type Let a r 1 be the fraction of ends of 2m k i(c i, r) ij edges attached to vertices of type r We further have that the edges between group and is (c i,c j ) (c i, r)(c j,r) e rs 1 r 23

24 Q 1 2m r r ij A ij k i k j 2m r Alternative form for edge list (c i,r)(c j,r) 1 2m A ij(c i,r)(c j,r) 1 ij 2m (e rr a r 2 ) i k 1 i(c i,r) 2m k i(c j, r) j Where we defined a r 1 is the fraction of edges 2m k i(c i, r) ij between nodes of type and e rs 1 is the fraction of edges between and, 2m A ij(c i,r)(c j, s) ij and deg 24

25 Based on scalar characteristics, such as: 1. Age 2. Income 25

26 Scalar characteristics Scalar characteristics take values that come in a particular order (enumerative) and take numerical values Such as age, income Age: In this case two people can be considered similar if they are born the same day or within a year or within x years etc. If people are friends with others of the same age, we consider the network assortatively mixed by age (or stratified by age) 26

27 Assortativity by race James Moody 27

28 Assortativity by grade/age James Moody 28

29 Scalar characteristics When we consider scalar characteristics we basically have an approximate notion of similarity between adjacent vertices (i.e. how far/close the values are) There is no approximate similarity that can be measured this way when we talk about enumerative characteristics; rather present/absent 29

30 Friendship networks at a US highschool Recall Homophily (or assortative mixing) is the strong tendency of people to associate with others whom they perceive are similar to themselves in some way Disassortative mixing is the tendency for people to associate with others who are unlike them (such as gender in sexual contact networks, where the majority of partnerships are between opposite sex individuals) 30

31 Scalar characteristics Friendships at the same US high school: each dot represents a friendship (an edge from the network) Denser along the y = x line (because of the way data is displayed) Sparser as the difference in grades increases 31

32 Strongly assortative Top: scatter plot of 1141 married couples Bottom: same data showing a histogram of the age difference Data: 1995 US National Survey of Family Growth Newman, Phys Rev E. 67, (2003) 32

33 Scalar characteristics How do we measure this assortative mixing? Would the idea of the enumartive assortative mixing work? That is to place vertices in bins based on scalar values: Treat vertices that fall in the same bin (such as age) as like vertices or identical Apply modularity metric for enumerative characteristics 33

34 Scalar characteristics This approach misses much of the point of scalar part of the scalar characteristics Why are two vertices identical if they are in the same bin? Should they only be approximately the same type? Why two vertices that fall in different bins are different? That is, if the bins are close enough (1 grade apart), could the vertices be more related than these that fall farther (like 4 grades apart)? We don t have just a binary choice as we did for enumerative characteristics 34

35 Scalar characteristics We will use a covariance measure (next slide) Let x i (or x j ) be the value of the scalar characteristic of vertex i (j), such as the age Then the expected mean of at the end of an edge over all edges of the graph (not just the mean value of ) is: And so ij ij Ax ij i i kx i i 1 A 2m 2m ij A vertex of degree appears at the end of edges i kx i i 35

36 Scalar characteristics What is the covariance of two variables i and j (based on the scalar characteristic x i and x j ) over all edges of the graph? cov(x i, x j ) Which is similar to the enumerative modularity Q = ij A ij (x i )(x j ) ij A ij m ij (A ij k i k j 2m )x i x j Either 0 or 1 Product of values (a function of two variables) 36

37 Scalar characteristics If the covariance is positive, we have assortativity mixing, If the covariance is negative, we have disassortativity As with the modularity Q we need to normalize the above covariance value to be closer to 1 for a perfectly mixed network 37

38 Scalar characteristics Recall 1 kk i j cov( x, x ) ( A ) xx 2m 2m i j ij i j ij Setting x i =x j for all the existing edges (i,j) we get the maximum possible value (which is the variance of the variable) for perfectly assortative network: 1 kk 1 kk 2 i j i j max ( Ax ij i xx i j) ( kiij ) xx i j 2m ij 2m 2m ij 2m This can then be used for comparison 38

39 Scalar characteristics Then the assortativity coefficient r is defined as: r ij ij (A ij k i k j /2m)x i x j (k i ij k i k j /2m)x i x j Similar to the enumerative one again r=1 Perfectly assortative network r=-1 Perfectly disassortative network r=0 no correlation Similar to the Pearson correlation coefficient 39

40 Simpler reformulation for Corr. Coeff. =.621 for the network below (strongly assort.) is the fraction of edges in a network that connect a vertex of type i to one of type j, form the matrix above. Ref: Newman, Phys Rev E. 67, (2003) 40

41 Computer Science faculty 88 Computer Science faculty: vertices are PhD granting institutions in North America Edge (i,j) means that PhD at i and now faculty at j labels are US census regions + Canada Five Lectures on Networks, by Aaron Clauset 41

42 Assortative community structure Communities are tight and dense groups that connect to other dense groups Five Lectures on Networks, by Aaron Clauset 42

43 Community structure Five Lectures on Networks, by Aaron Clauset 43

44 By degree: high degree connect to high degree 44

45 Assortative mixing by degree A special case is when the characteristic of interest is the degree of the node Commonly used in social networks (the most used one of the scalar characteristics) More interesting since degree is a topologial property of the network (not just a value like in the previously seen scalars) In this case studying the assortative mixing will show if high/low-degree nodes connect with other high/low-degree nodes or not

46 Assortative mixing by degree Assortative network by degree core of high degrees and a periphery of low degrees (Figure (a) below) Disassortative network by degree uniform (low degree adjacent to high degree) Either 0 or 1 (a) (b)

47 Assortative mixing by degree

48 Assortative mixing by degree measured like the other scalar assortative requires the same adjacency information of the network cov(k i, k j ) 1 2m ij (A ij k i k j 2m )k i k j The correlation coefficient is the same normalize value: degcorr_ coeff ij ij ( A i ij ( k ij k k i k k i j / 2m) k k j i i j / 2m) k k The scalar is the degree j 48

49 Bottom line Same formula: Q ( Aij kk i j /2 m) ( ci, c ) ij j 1 1 Q 2 m ( kk /2 m) ( c, c ) max i j i j If enumerative: either the property is or is not present (the delta Kronecker function is used) If scalar: use the value of the scalar instead of the delta Kronecker function In particular one could use the degree ij 49

50 Examples Newman, Phys. Rev. E. 67, (2003) 50

51 Same slide with more details Newman, Phys. Rev. E. 67, (2003) 51

52 Assortative mixing by degree The assortative mixing by degree coefficient,, measured how connected are the vertices of the same degree (compared to random graphs) In general the absolute values of the assortativity coefficient r is not large in observed networks (the correlation between the degrees is not especially strong) 52

53 Table 8.1 page 237 r = assortativity coefficient 53

54 Assortative mixing by degree Technological/biological/information networks tend to have negative (disassortative mixing based on degree), while social networks tend to have positive (assortative) Good time to share your thoughts on why. Why? One possible reason is that technological, biological and information networks are simple graphs (no edge multiplicity) and Maslov et al. showed these networks tend to be disassortative since the number of vertices between high degree vertices is limited by the number of vertices of high degree 54

55 Assortative mixing by degree The multiplicity of edges in social networks tend to drive the value of up There might be other biases in place in social networks that cause a slight assortative mixing: Communities: low degree nodes adjacent to low degree nodes within the same community, and the high degree ones connecting the communities might have better chances of being adjacent 55

56 References Besides the references at the bottom left of different slides, this is a good overall reference: Newman, Mark EJ. "Mixing patterns in networks." Physical Review E 67.2 (2003):

1 Homophily and assortative mixing

1 Homophily and assortative mixing 1 Homophily and assortative mixing Networks, and particularly social networks, often exhibit a property called homophily or assortative mixing, which simply means that the attributes of vertices correlate

More information

V2: Measures and Metrics (II)

V2: Measures and Metrics (II) - Betweenness Centrality V2: Measures and Metrics (II) - Groups of Vertices - Transitivity - Reciprocity - Signed Edges and Structural Balance - Similarity - Homophily and Assortative Mixing 1 Betweenness

More information

Section 7.12: Similarity. By: Ralucca Gera, NPS

Section 7.12: Similarity. By: Ralucca Gera, NPS Section 7.12: Similarity By: Ralucca Gera, NPS Motivation We talked about global properties Average degree, average clustering, ave path length We talked about local properties: Some node centralities

More information

Node Similarity. Ralucca Gera, Applied Mathematics Dept. Naval Postgraduate School Monterey, California

Node Similarity. Ralucca Gera, Applied Mathematics Dept. Naval Postgraduate School Monterey, California Node Similarity Ralucca Gera, Applied Mathematics Dept. Naval Postgraduate School Monterey, California rgera@nps.edu Motivation We talked about global properties Average degree, average clustering, ave

More information

SNA 8: network resilience. Lada Adamic

SNA 8: network resilience. Lada Adamic SNA 8: network resilience Lada Adamic Outline Node vs. edge percolation Resilience of randomly vs. preferentially grown networks Resilience in real-world networks network resilience Q: If a given fraction

More information

TELCOM2125: Network Science and Analysis

TELCOM2125: Network Science and Analysis School of Information Sciences University of Pittsburgh TELCOM2125: Network Science and Analysis Konstantinos Pelechrinis Spring 2015 Figures are taken from: M.E.J. Newman, Networks: An Introduction 2

More information

Basics of Network Analysis

Basics of Network Analysis Basics of Network Analysis Hiroki Sayama sayama@binghamton.edu Graph = Network G(V, E): graph (network) V: vertices (nodes), E: edges (links) 1 Nodes = 1, 2, 3, 4, 5 2 3 Links = 12, 13, 15, 23,

More information

TELCOM2125: Network Science and Analysis

TELCOM2125: Network Science and Analysis School of Information Sciences University of Pittsburgh TELCOM2125: Network Science and Analysis Konstantinos Pelechrinis Spring 2015 Figures are taken from: M.E.J. Newman, Networks: An Introduction 2

More information

Topic mash II: assortativity, resilience, link prediction CS224W

Topic mash II: assortativity, resilience, link prediction CS224W Topic mash II: assortativity, resilience, link prediction CS224W Outline Node vs. edge percolation Resilience of randomly vs. preferentially grown networks Resilience in real-world networks network resilience

More information

Social Networks, Social Interaction, and Outcomes

Social Networks, Social Interaction, and Outcomes Social Networks, Social Interaction, and Outcomes Social Networks and Social Interaction Not all links are created equal: Some links are stronger than others Some links are more useful than others Links

More information

Networks in economics and finance. Lecture 1 - Measuring networks

Networks in economics and finance. Lecture 1 - Measuring networks Networks in economics and finance Lecture 1 - Measuring networks What are networks and why study them? A network is a set of items (nodes) connected by edges or links. Units (nodes) Individuals Firms Banks

More information

Centralities (4) By: Ralucca Gera, NPS. Excellence Through Knowledge

Centralities (4) By: Ralucca Gera, NPS. Excellence Through Knowledge Centralities (4) By: Ralucca Gera, NPS Excellence Through Knowledge Some slide from last week that we didn t talk about in class: 2 PageRank algorithm Eigenvector centrality: i s Rank score is the sum

More information

Complex-Network Modelling and Inference

Complex-Network Modelling and Inference Complex-Network Modelling and Inference Lecture 8: Graph features (2) Matthew Roughan http://www.maths.adelaide.edu.au/matthew.roughan/notes/ Network_Modelling/ School

More information

V4 Matrix algorithms and graph partitioning

V4 Matrix algorithms and graph partitioning V4 Matrix algorithms and graph partitioning - Community detection - Simple modularity maximization - Spectral modularity maximization - Division into more than two groups - Other algorithms for community

More information

Introduction to network metrics

Introduction to network metrics Universitat Politècnica de Catalunya Version 0.5 Complex and Social Networks (2018-2019) Master in Innovation and Research in Informatics (MIRI) Instructors Argimiro Arratia, argimiro@cs.upc.edu, http://www.cs.upc.edu/~argimiro/

More information

Clustering. Robert M. Haralick. Computer Science, Graduate Center City University of New York

Clustering. Robert M. Haralick. Computer Science, Graduate Center City University of New York Clustering Robert M. Haralick Computer Science, Graduate Center City University of New York Outline K-means 1 K-means 2 3 4 5 Clustering K-means The purpose of clustering is to determine the similarity

More information

Social Networks. Slides by : I. Koutsopoulos (AUEB), Source:L. Adamic, SN Analysis, Coursera course

Social Networks. Slides by : I. Koutsopoulos (AUEB), Source:L. Adamic, SN Analysis, Coursera course Social Networks Slides by : I. Koutsopoulos (AUEB), Source:L. Adamic, SN Analysis, Coursera course Introduction Political blogs Organizations Facebook networks Ingredient networks SN representation Networks

More information

Community Structure and Beyond

Community Structure and Beyond Community Structure and Beyond Elizabeth A. Leicht MAE: 298 April 9, 2009 Why do we care about community structure? Large Networks Discussion Outline Overview of past work on community structure. How to

More information

Supplementary material to Epidemic spreading on complex networks with community structures

Supplementary material to Epidemic spreading on complex networks with community structures Supplementary material to Epidemic spreading on complex networks with community structures Clara Stegehuis, Remco van der Hofstad, Johan S. H. van Leeuwaarden Supplementary otes Supplementary ote etwork

More information

Centralities (4) Ralucca Gera, Applied Mathematics Dept. Naval Postgraduate School Monterey, California Excellence Through Knowledge

Centralities (4) Ralucca Gera, Applied Mathematics Dept. Naval Postgraduate School Monterey, California Excellence Through Knowledge Centralities (4) Ralucca Gera, Applied Mathematics Dept. Naval Postgraduate School Monterey, California rgera@nps.edu Excellence Through Knowledge Different types of centralities: Betweenness Centrality

More information

Disclaimer. Lect 2: empirical analyses of graphs

Disclaimer. Lect 2: empirical analyses of graphs 462 Page 1 Lect 2: empirical analyses of graphs Tuesday, September 11, 2007 8:30 AM Disclaimer These are my personal notes from this lecture. They may be wrong or inaccurate, and have not carefully been

More information

Extracting Information from Complex Networks

Extracting Information from Complex Networks Extracting Information from Complex Networks 1 Complex Networks Networks that arise from modeling complex systems: relationships Social networks Biological networks Distinguish from random networks uniform

More information

1 More configuration model

1 More configuration model 1 More configuration model In the last lecture, we explored the definition of the configuration model, a simple method for drawing networks from the ensemble, and derived some of its mathematical properties.

More information

THE KNOWLEDGE MANAGEMENT STRATEGY IN ORGANIZATIONS. Summer semester, 2016/2017

THE KNOWLEDGE MANAGEMENT STRATEGY IN ORGANIZATIONS. Summer semester, 2016/2017 THE KNOWLEDGE MANAGEMENT STRATEGY IN ORGANIZATIONS Summer semester, 2016/2017 SOCIAL NETWORK ANALYSIS: THEORY AND APPLICATIONS 1. A FEW THINGS ABOUT NETWORKS NETWORKS IN THE REAL WORLD There are four categories

More information

Universal Properties of Mythological Networks Midterm report: Math 485

Universal Properties of Mythological Networks Midterm report: Math 485 Universal Properties of Mythological Networks Midterm report: Math 485 Roopa Krishnaswamy, Jamie Fitzgerald, Manuel Villegas, Riqu Huang, and Riley Neal Department of Mathematics, University of Arizona,

More information

Tie strength, social capital, betweenness and homophily. Rik Sarkar

Tie strength, social capital, betweenness and homophily. Rik Sarkar Tie strength, social capital, betweenness and homophily Rik Sarkar Networks Position of a node in a network determines its role/importance Structure of a network determines its properties 2 Today Notion

More information

Social Network Analysis

Social Network Analysis Social Network Analysis Mathematics of Networks Manar Mohaisen Department of EEC Engineering Adjacency matrix Network types Edge list Adjacency list Graph representation 2 Adjacency matrix Adjacency matrix

More information

A Generating Function Approach to Analyze Random Graphs

A Generating Function Approach to Analyze Random Graphs A Generating Function Approach to Analyze Random Graphs Presented by - Vilas Veeraraghavan Advisor - Dr. Steven Weber Department of Electrical and Computer Engineering Drexel University April 8, 2005 Presentation

More information

CS224W: Analysis of Networks Jure Leskovec, Stanford University

CS224W: Analysis of Networks Jure Leskovec, Stanford University CS224W: Analysis of Networks Jure Leskovec, Stanford University http://cs224w.stanford.edu 11/13/17 Jure Leskovec, Stanford CS224W: Analysis of Networks, http://cs224w.stanford.edu 2 Observations Models

More information

Web 2.0 Social Data Analysis

Web 2.0 Social Data Analysis Web 2.0 Social Data Analysis Ing. Jaroslav Kuchař jaroslav.kuchar@fit.cvut.cz Structure (2) Czech Technical University in Prague, Faculty of Information Technologies Software and Web Engineering 2 Contents

More information

Clustering analysis of gene expression data

Clustering analysis of gene expression data Clustering analysis of gene expression data Chapter 11 in Jonathan Pevsner, Bioinformatics and Functional Genomics, 3 rd edition (Chapter 9 in 2 nd edition) Human T cell expression data The matrix contains

More information

Community Structure Detection. Amar Chandole Ameya Kabre Atishay Aggarwal

Community Structure Detection. Amar Chandole Ameya Kabre Atishay Aggarwal Community Structure Detection Amar Chandole Ameya Kabre Atishay Aggarwal What is a network? Group or system of interconnected people or things Ways to represent a network: Matrices Sets Sequences Time

More information

CS-E5740. Complex Networks. Network analysis: key measures and characteristics

CS-E5740. Complex Networks. Network analysis: key measures and characteristics CS-E5740 Complex Networks Network analysis: key measures and characteristics Course outline 1. Introduction (motivation, definitions, etc. ) 2. Static network models: random and small-world networks 3.

More information

Machine Learning and Modeling for Social Networks

Machine Learning and Modeling for Social Networks Machine Learning and Modeling for Social Networks Olivia Woolley Meza, Izabela Moise, Nino Antulov-Fatulin, Lloyd Sanders 1 Introduction to Networks Computational Social Science D-GESS Olivia Woolley Meza

More information

Summary: What We Have Learned So Far

Summary: What We Have Learned So Far Summary: What We Have Learned So Far small-world phenomenon Real-world networks: { Short path lengths High clustering Broad degree distributions, often power laws P (k) k γ Erdös-Renyi model: Short path

More information

Graph-theoretic Properties of Networks

Graph-theoretic Properties of Networks Graph-theoretic Properties of Networks Bioinformatics: Sequence Analysis COMP 571 - Spring 2015 Luay Nakhleh, Rice University Graphs A graph is a set of vertices, or nodes, and edges that connect pairs

More information

Graph Theory for Network Science

Graph Theory for Network Science Graph Theory for Network Science Dr. Natarajan Meghanathan Professor Department of Computer Science Jackson State University, Jackson, MS E-mail: natarajan.meghanathan@jsums.edu Networks or Graphs We typically

More information

Network Theory: Social, Mythological and Fictional Networks. Math 485, Spring 2018 (Midterm Report) Christina West, Taylor Martins, Yihe Hao

Network Theory: Social, Mythological and Fictional Networks. Math 485, Spring 2018 (Midterm Report) Christina West, Taylor Martins, Yihe Hao Network Theory: Social, Mythological and Fictional Networks Math 485, Spring 2018 (Midterm Report) Christina West, Taylor Martins, Yihe Hao Abstract: Comparative mythology is a largely qualitative and

More information

CAIM: Cerca i Anàlisi d Informació Massiva

CAIM: Cerca i Anàlisi d Informació Massiva 1 / 72 CAIM: Cerca i Anàlisi d Informació Massiva FIB, Grau en Enginyeria Informàtica Slides by Marta Arias, José Balcázar, Ricard Gavaldá Department of Computer Science, UPC Fall 2016 http://www.cs.upc.edu/~caim

More information

An Optimal Allocation Approach to Influence Maximization Problem on Modular Social Network. Tianyu Cao, Xindong Wu, Song Wang, Xiaohua Hu

An Optimal Allocation Approach to Influence Maximization Problem on Modular Social Network. Tianyu Cao, Xindong Wu, Song Wang, Xiaohua Hu An Optimal Allocation Approach to Influence Maximization Problem on Modular Social Network Tianyu Cao, Xindong Wu, Song Wang, Xiaohua Hu ACM SAC 2010 outline Social network Definition and properties Social

More information

Modularity CMSC 858L

Modularity CMSC 858L Modularity CMSC 858L Module-detection for Function Prediction Biological networks generally modular (Hartwell+, 1999) We can try to find the modules within a network. Once we find modules, we can look

More information

1 Degree Distributions

1 Degree Distributions Lecture Notes: Social Networks: Models, Algorithms, and Applications Lecture 3: Jan 24, 2012 Scribes: Geoffrey Fairchild and Jason Fries 1 Degree Distributions Last time, we discussed some graph-theoretic

More information

Community detection. Leonid E. Zhukov

Community detection. Leonid E. Zhukov Community detection Leonid E. Zhukov School of Data Analysis and Artificial Intelligence Department of Computer Science National Research University Higher School of Economics Network Science Leonid E.

More information

Modeling the Language of Risk in Activism. Hailey Reeves and Evan Brown Mentor: William Lippitt

Modeling the Language of Risk in Activism. Hailey Reeves and Evan Brown Mentor: William Lippitt Modeling the Language of Risk in Activism Hailey Reeves and Evan Brown Mentor: William Lippitt Project Motivation Modeling perceived risks and possible influences that discourage or negatively influence

More information

IAT 355 Visual Analytics. Data and Statistical Models. Lyn Bartram

IAT 355 Visual Analytics. Data and Statistical Models. Lyn Bartram IAT 355 Visual Analytics Data and Statistical Models Lyn Bartram Exploring data Example: US Census People # of people in group Year # 1850 2000 (every decade) Age # 0 90+ Sex (Gender) # Male, female Marital

More information

THE DEPENDENCY NETWORK IN FREE OPERATING SYSTEM

THE DEPENDENCY NETWORK IN FREE OPERATING SYSTEM Vol. 5 (2012) Acta Physica Polonica B Proceedings Supplement No 1 THE DEPENDENCY NETWORK IN FREE OPERATING SYSTEM Tomasz M. Gradowski a, Maciej J. Mrowinski a Robert A. Kosinski a,b a Warsaw University

More information

MITOCW watch?v=r6-lqbquci0

MITOCW watch?v=r6-lqbquci0 MITOCW watch?v=r6-lqbquci0 The following content is provided under a Creative Commons license. Your support will help MIT OpenCourseWare continue to offer high quality educational resources for free. To

More information

An Efficient Algorithm for Community Detection in Complex Networks

An Efficient Algorithm for Community Detection in Complex Networks An Efficient Algorithm for Community Detection in Complex Networks Qiong Chen School of Computer Science & Engineering South China University of Technology Guangzhou Higher Education Mega Centre Panyu

More information

An Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization

An Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization An Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization Pedro Ribeiro (DCC/FCUP & CRACS/INESC-TEC) Part 1 Motivation and emergence of Network Science

More information

Alessandro Del Ponte, Weijia Ran PAD 637 Week 3 Summary January 31, Wasserman and Faust, Chapter 3: Notation for Social Network Data

Alessandro Del Ponte, Weijia Ran PAD 637 Week 3 Summary January 31, Wasserman and Faust, Chapter 3: Notation for Social Network Data Wasserman and Faust, Chapter 3: Notation for Social Network Data Three different network notational schemes Graph theoretic: the most useful for centrality and prestige methods, cohesive subgroup ideas,

More information

Statistical Methods for Network Analysis: Exponential Random Graph Models

Statistical Methods for Network Analysis: Exponential Random Graph Models Day 2: Network Modeling Statistical Methods for Network Analysis: Exponential Random Graph Models NMID workshop September 17 21, 2012 Prof. Martina Morris Prof. Steven Goodreau Supported by the US National

More information

Machine Learning for Data Science (CS4786) Lecture 11

Machine Learning for Data Science (CS4786) Lecture 11 Machine Learning for Data Science (CS4786) Lecture 11 Spectral Clustering Course Webpage : http://www.cs.cornell.edu/courses/cs4786/2016fa/ Survey Survey Survey Competition I Out! Preliminary report of

More information

Correlation. January 12, 2019

Correlation. January 12, 2019 Correlation January 12, 2019 Contents Correlations The Scattterplot The Pearson correlation The computational raw-score formula Survey data Fun facts about r Sensitivity to outliers Spearman rank-order

More information

Mining Social Network Graphs

Mining Social Network Graphs Mining Social Network Graphs Analysis of Large Graphs: Community Detection Rafael Ferreira da Silva rafsilva@isi.edu http://rafaelsilva.com Note to other teachers and users of these slides: We would be

More information

Testing Random- Number Generators

Testing Random- Number Generators Testing Random- Number Generators Raj Jain Washington University Saint Louis, MO 63131 Jain@cse.wustl.edu These slides are available on-line at: http://www.cse.wustl.edu/~jain/cse574-06/ 27-1 Overview

More information

Introduction to Engineering Systems, ESD.00. Networks. Lecturers: Professor Joseph Sussman Dr. Afreen Siddiqi TA: Regina Clewlow

Introduction to Engineering Systems, ESD.00. Networks. Lecturers: Professor Joseph Sussman Dr. Afreen Siddiqi TA: Regina Clewlow Introduction to Engineering Systems, ESD.00 Lecture 7 Networks Lecturers: Professor Joseph Sussman Dr. Afreen Siddiqi TA: Regina Clewlow The Bridges of Königsberg The town of Konigsberg in 18 th century

More information

Theory and Applications of Complex Networks

Theory and Applications of Complex Networks Theory and Applications of Complex Networks 1 Theory and Applications of Complex Networks Class One College of the Atlantic David P. Feldman 12 September 2008 http://hornacek.coa.edu/dave/ 1. What is a

More information

Research and Analysis of Structural Hole and Matching Coefficient

Research and Analysis of Structural Hole and Matching Coefficient J. Software Engineering & Applications, 010, 3, 1080-1087 doi:10.436/jsea.010.31117 Published Online November 010 (http://www.scirp.org/journal/jsea) Research and Analysis of Structural Hole and Matching

More information

Clustering CS 550: Machine Learning

Clustering CS 550: Machine Learning Clustering CS 550: Machine Learning This slide set mainly uses the slides given in the following links: http://www-users.cs.umn.edu/~kumar/dmbook/ch8.pdf http://www-users.cs.umn.edu/~kumar/dmbook/dmslides/chap8_basic_cluster_analysis.pdf

More information

Glossary Common Core Curriculum Maps Math/Grade 6 Grade 8

Glossary Common Core Curriculum Maps Math/Grade 6 Grade 8 Glossary Common Core Curriculum Maps Math/Grade 6 Grade 8 Grade 6 Grade 8 absolute value Distance of a number (x) from zero on a number line. Because absolute value represents distance, the absolute value

More information

Graph Theory for Network Science

Graph Theory for Network Science Graph Theory for Network Science Dr. Natarajan Meghanathan Professor Department of Computer Science Jackson State University, Jackson, MS E-mail: natarajan.meghanathan@jsums.edu Networks or Graphs We typically

More information

Youtube Graph Network Model and Analysis Yonghyun Ro, Han Lee, Dennis Won

Youtube Graph Network Model and Analysis Yonghyun Ro, Han Lee, Dennis Won Youtube Graph Network Model and Analysis Yonghyun Ro, Han Lee, Dennis Won Introduction A countless number of contents gets posted on the YouTube everyday. YouTube keeps its competitiveness by maximizing

More information

A Simple Acceleration Method for the Louvain Algorithm

A Simple Acceleration Method for the Louvain Algorithm A Simple Acceleration Method for the Louvain Algorithm Naoto Ozaki, Hiroshi Tezuka, Mary Inaba * Graduate School of Information Science and Technology, University of Tokyo, Tokyo, Japan. * Corresponding

More information

Degree Distribution, and Scale-free networks. Ralucca Gera, Applied Mathematics Dept. Naval Postgraduate School Monterey, California

Degree Distribution, and Scale-free networks. Ralucca Gera, Applied Mathematics Dept. Naval Postgraduate School Monterey, California Degree Distribution, and Scale-free networks Ralucca Gera, Applied Mathematics Dept. Naval Postgraduate School Monterey, California rgera@nps.edu Overview Degree distribution of observed networks Scale

More information

CSE 258 Lecture 12. Web Mining and Recommender Systems. Social networks

CSE 258 Lecture 12. Web Mining and Recommender Systems. Social networks CSE 258 Lecture 12 Web Mining and Recommender Systems Social networks Social networks We ve already seen networks (a little bit) in week 3 i.e., we ve studied inference problems defined on graphs, and

More information

The vertex set is a finite nonempty set. The edge set may be empty, but otherwise its elements are two-element subsets of the vertex set.

The vertex set is a finite nonempty set. The edge set may be empty, but otherwise its elements are two-element subsets of the vertex set. Math 3336 Section 10.2 Graph terminology and Special Types of Graphs Definition: A graph is an object consisting of two sets called its vertex set and its edge set. The vertex set is a finite nonempty

More information

ALTERNATIVES TO BETWEENNESS CENTRALITY: A MEASURE OF CORRELATION COEFFICIENT

ALTERNATIVES TO BETWEENNESS CENTRALITY: A MEASURE OF CORRELATION COEFFICIENT ALTERNATIVES TO BETWEENNESS CENTRALITY: A MEASURE OF CORRELATION COEFFICIENT Xiaojia He 1 and Natarajan Meghanathan 2 1 University of Georgia, GA, USA, 2 Jackson State University, MS, USA 2 natarajan.meghanathan@jsums.edu

More information

Spectral Methods for Network Community Detection and Graph Partitioning

Spectral Methods for Network Community Detection and Graph Partitioning Spectral Methods for Network Community Detection and Graph Partitioning M. E. J. Newman Department of Physics, University of Michigan Presenters: Yunqi Guo Xueyin Yu Yuanqi Li 1 Outline: Community Detection

More information

COMP6237 Data Mining and Networks. Markus Brede. Lecture slides available here:

COMP6237 Data Mining and Networks. Markus Brede. Lecture slides available here: COMP6237 Data Mining and Networks Markus Brede Brede.Markus@gmail.com Lecture slides available here: http://users.ecs.soton.ac.uk/mb8/stats/datamining.html Outline Why? The WWW is a major application of

More information

Data Mining: Concepts and Techniques. (3 rd ed.) Chapter 3. Chapter 3: Data Preprocessing. Major Tasks in Data Preprocessing

Data Mining: Concepts and Techniques. (3 rd ed.) Chapter 3. Chapter 3: Data Preprocessing. Major Tasks in Data Preprocessing Data Mining: Concepts and Techniques (3 rd ed.) Chapter 3 1 Chapter 3: Data Preprocessing Data Preprocessing: An Overview Data Quality Major Tasks in Data Preprocessing Data Cleaning Data Integration Data

More information

Graph Theory. Network Science: Graph theory. Graph theory Terminology and notation. Graph theory Graph visualization

Graph Theory. Network Science: Graph theory. Graph theory Terminology and notation. Graph theory Graph visualization Network Science: Graph Theory Ozalp abaoglu ipartimento di Informatica Scienza e Ingegneria Università di ologna www.cs.unibo.it/babaoglu/ ranch of mathematics for the study of structures called graphs

More information

1 Large-scale structure in networks

1 Large-scale structure in networks 1 Large-scale structure in networks Most of the network measures we have considered so far can tell us a great deal about the shape of a network. However, these measures generally describe patterns as

More information

CSE 158 Lecture 11. Web Mining and Recommender Systems. Social networks

CSE 158 Lecture 11. Web Mining and Recommender Systems. Social networks CSE 158 Lecture 11 Web Mining and Recommender Systems Social networks Assignment 1 Due 5pm next Monday! (Kaggle shows UTC time, but the due date is 5pm, Monday, PST) Assignment 1 Assignment 1 Social networks

More information

Community Detection as a Method to Control For Homophily in Social Networks

Community Detection as a Method to Control For Homophily in Social Networks Community Detection as a Method to Control For Homophily in Social Networks Hannah Worrall Senior Honors/QSSS Thesis April 30, 2014 Contents 1 Introduction 2 1.1 Background........................... 2

More information

Algorithms and Applications in Social Networks. 2017/2018, Semester B Slava Novgorodov

Algorithms and Applications in Social Networks. 2017/2018, Semester B Slava Novgorodov Algorithms and Applications in Social Networks 2017/2018, Semester B Slava Novgorodov 1 Lesson #1 Administrative questions Course overview Introduction to Social Networks Basic definitions Network properties

More information

Economics of Information Networks

Economics of Information Networks Economics of Information Networks Stephen Turnbull Division of Policy and Planning Sciences Lecture 4: December 7, 2017 Abstract We continue discussion of the modern economics of networks, which considers

More information

What is a Network? Theory and Applications of Complex Networks. Network Example 1: High School Friendships

What is a Network? Theory and Applications of Complex Networks. Network Example 1: High School Friendships 1 2 Class One 1. A collection of nodes What is a Network? 2. A collection of edges connecting nodes College of the Atlantic 12 September 2008 http://hornacek.coa.edu/dave/ 1. What is a network? 2. Many

More information

Exponential Random Graph Models for Social Networks

Exponential Random Graph Models for Social Networks Exponential Random Graph Models for Social Networks ERGM Introduction Martina Morris Departments of Sociology, Statistics University of Washington Departments of Sociology, Statistics, and EECS, and Institute

More information

Acknowledgement: BYU-Idaho Economics Department Faculty (Principal authors: Rick Hirschi, Ryan Johnson, Allan Walburger and David Barrus)

Acknowledgement: BYU-Idaho Economics Department Faculty (Principal authors: Rick Hirschi, Ryan Johnson, Allan Walburger and David Barrus) Math Review Acknowledgement: BYU-Idaho Economics Department Faculty (Principal authors: Rick Hirschi, Ryan Johnson, Allan Walburger and David Barrus) Section 1 - Graphing Data Graphs It is said that a

More information

David Easley and Jon Kleinberg January 24, 2007

David Easley and Jon Kleinberg January 24, 2007 Networks: Spring 2007 Graph Theory David Easley and Jon Kleinberg January 24, 2007 A graph is simply a way of encoding the pairwise relationships among a set of objects: we will refer to the objects as

More information

Jure Leskovec, Cornell/Stanford University. Joint work with Kevin Lang, Anirban Dasgupta and Michael Mahoney, Yahoo! Research

Jure Leskovec, Cornell/Stanford University. Joint work with Kevin Lang, Anirban Dasgupta and Michael Mahoney, Yahoo! Research Jure Leskovec, Cornell/Stanford University Joint work with Kevin Lang, Anirban Dasgupta and Michael Mahoney, Yahoo! Research Network: an interaction graph: Nodes represent entities Edges represent interaction

More information

Navigation in Networks. Networked Life NETS 112 Fall 2017 Prof. Michael Kearns

Navigation in Networks. Networked Life NETS 112 Fall 2017 Prof. Michael Kearns Navigation in Networks Networked Life NETS 112 Fall 2017 Prof. Michael Kearns The Navigation Problem You are an individual (vertex) in a very large social network You want to find a (short) chain of friendships

More information

My favorite application using eigenvalues: partitioning and community detection in social networks

My favorite application using eigenvalues: partitioning and community detection in social networks My favorite application using eigenvalues: partitioning and community detection in social networks Will Hobbs February 17, 2013 Abstract Social networks are often organized into families, friendship groups,

More information

Extracting Communities from Networks

Extracting Communities from Networks Extracting Communities from Networks Ji Zhu Department of Statistics, University of Michigan Joint work with Yunpeng Zhao and Elizaveta Levina Outline Review of community detection Community extraction

More information

Degree Distribution: The case of Citation Networks

Degree Distribution: The case of Citation Networks Network Analysis Degree Distribution: The case of Citation Networks Papers (in almost all fields) refer to works done earlier on same/related topics Citations A network can be defined as Each node is a

More information

This research aims to present a new way of visualizing multi-dimensional data using generalized scatterplots by sensitivity coefficients to highlight

This research aims to present a new way of visualizing multi-dimensional data using generalized scatterplots by sensitivity coefficients to highlight This research aims to present a new way of visualizing multi-dimensional data using generalized scatterplots by sensitivity coefficients to highlight local variation of one variable with respect to another.

More information

Social & Information Network Analysis CS 224W

Social & Information Network Analysis CS 224W Social & Information Network Analysis CS 224W Final Report Alexandre Becker Jordane Giuly Sébastien Robaszkiewicz Stanford University December 2011 1 Introduction The microblogging service Twitter today

More information

Statistics can best be defined as a collection and analysis of numerical information.

Statistics can best be defined as a collection and analysis of numerical information. Statistical Graphs There are many ways to organize data pictorially using statistical graphs. There are line graphs, stem and leaf plots, frequency tables, histograms, bar graphs, pictographs, circle graphs

More information

Matchings, Ramsey Theory, And Other Graph Fun

Matchings, Ramsey Theory, And Other Graph Fun Matchings, Ramsey Theory, And Other Graph Fun Evelyne Smith-Roberge University of Waterloo April 5th, 2017 Recap... In the last two weeks, we ve covered: What is a graph? Eulerian circuits Hamiltonian

More information

Online Social Networks and Media

Online Social Networks and Media Online Social Networks and Media Absorbing Random Walks Link Prediction Why does the Power Method work? If a matrix R is real and symmetric, it has real eigenvalues and eigenvectors: λ, w, λ 2, w 2,, (λ

More information

2009 HMMT Team Round. Writing proofs. Misha Lavrov. ARML Practice 3/2/2014

2009 HMMT Team Round. Writing proofs. Misha Lavrov. ARML Practice 3/2/2014 Writing proofs Misha Lavrov ARML Practice 3/2/2014 Warm-up / Review 1 (From my research) If x n = 2 1 x n 1 for n 2, solve for x n in terms of x 1. (For a more concrete problem, set x 1 = 2.) 2 (From this

More information

8: Statistics. Populations and Samples. Histograms and Frequency Polygons. Page 1 of 10

8: Statistics. Populations and Samples. Histograms and Frequency Polygons. Page 1 of 10 8: Statistics Statistics: Method of collecting, organizing, analyzing, and interpreting data, as well as drawing conclusions based on the data. Methodology is divided into two main areas. Descriptive Statistics:

More information

Exam Review: Ch. 1-3 Answer Section

Exam Review: Ch. 1-3 Answer Section Exam Review: Ch. 1-3 Answer Section MDM 4U0 MULTIPLE CHOICE 1. ANS: A Section 1.6 2. ANS: A Section 1.6 3. ANS: A Section 1.7 4. ANS: A Section 1.7 5. ANS: C Section 2.3 6. ANS: B Section 2.3 7. ANS: D

More information

Community-Based Recommendations: a Solution to the Cold Start Problem

Community-Based Recommendations: a Solution to the Cold Start Problem Community-Based Recommendations: a Solution to the Cold Start Problem Shaghayegh Sahebi Intelligent Systems Program University of Pittsburgh sahebi@cs.pitt.edu William W. Cohen Machine Learning Department

More information

Algorithms: Graphs. Amotz Bar-Noy. Spring 2012 CUNY. Amotz Bar-Noy (CUNY) Graphs Spring / 95

Algorithms: Graphs. Amotz Bar-Noy. Spring 2012 CUNY. Amotz Bar-Noy (CUNY) Graphs Spring / 95 Algorithms: Graphs Amotz Bar-Noy CUNY Spring 2012 Amotz Bar-Noy (CUNY) Graphs Spring 2012 1 / 95 Graphs Definition: A graph is a collection of edges and vertices. Each edge connects two vertices. Amotz

More information

CSE 258 Lecture 6. Web Mining and Recommender Systems. Community Detection

CSE 258 Lecture 6. Web Mining and Recommender Systems. Community Detection CSE 258 Lecture 6 Web Mining and Recommender Systems Community Detection Dimensionality reduction Goal: take high-dimensional data, and describe it compactly using a small number of dimensions Assumption:

More information

Homework 4: Clustering, Recommenders, Dim. Reduction, ML and Graph Mining (due November 19 th, 2014, 2:30pm, in class hard-copy please)

Homework 4: Clustering, Recommenders, Dim. Reduction, ML and Graph Mining (due November 19 th, 2014, 2:30pm, in class hard-copy please) Virginia Tech. Computer Science CS 5614 (Big) Data Management Systems Fall 2014, Prakash Homework 4: Clustering, Recommenders, Dim. Reduction, ML and Graph Mining (due November 19 th, 2014, 2:30pm, in

More information

Complex networks Phys 682 / CIS 629: Computational Methods for Nonlinear Systems

Complex networks Phys 682 / CIS 629: Computational Methods for Nonlinear Systems Complex networks Phys 682 / CIS 629: Computational Methods for Nonlinear Systems networks are everywhere (and always have been) - relationships (edges) among entities (nodes) explosion of interest in network

More information

What students need to know for ALGEBRA I Students expecting to take Algebra I should demonstrate the ability to: General:

What students need to know for ALGEBRA I Students expecting to take Algebra I should demonstrate the ability to: General: What students need to know for ALGEBRA I 2015-2016 Students expecting to take Algebra I should demonstrate the ability to: General: o Keep an organized notebook o Be a good note taker o Complete homework

More information

Chapter 3 Analyzing Normal Quantitative Data

Chapter 3 Analyzing Normal Quantitative Data Chapter 3 Analyzing Normal Quantitative Data Introduction: In chapters 1 and 2, we focused on analyzing categorical data and exploring relationships between categorical data sets. We will now be doing

More information