Tie strength, social capital, betweenness and homophily. Rik Sarkar

Similar documents
Social Networks, Social Interaction, and Outcomes

CS224W: Analysis of Networks Jure Leskovec, Stanford University

Web 2.0 Social Data Analysis

Economics of Information Networks

Recap! CMSC 498J: Social Media Computing. Department of Computer Science University of Maryland Spring Hadi Amiri

Cascades. Rik Sarkar. Social and Technological Networks. University of Edinburgh, 2018.

Clusters and Communities

Social and Information Network Analysis Positive and Negative Relationships

CSE 258 Lecture 12. Web Mining and Recommender Systems. Social networks

TELCOM2125: Network Science and Analysis

CSE 158 Lecture 11. Web Mining and Recommender Systems. Social networks

Online Social Networks and Media. Positive and Negative Edges Strong and Weak Edges Strong Triadic Closure

Non Overlapping Communities

Navigation in Networks. Networked Life NETS 112 Fall 2017 Prof. Michael Kearns

Web 2.0 Social Data Analysis

Structure of Social Networks

Social-Network Graphs

Basic Network Concepts

David Easley and Jon Kleinberg January 24, 2007

Network Community Detection

Algorithmic and Economic Aspects of Networks. Nicole Immorlica

1 Homophily and assortative mixing

THE KNOWLEDGE MANAGEMENT STRATEGY IN ORGANIZATIONS. Summer semester, 2016/2017

Mining Social Network Graphs

Distribution-Free Models of Social and Information Networks

Positive and Negative Links

Web Structure Mining Community Detection and Evaluation

Information Networks and Semantics. Srinath Srinivasa IIIT Bangalore

CSE 158 Lecture 11. Web Mining and Recommender Systems. Triadic closure; strong & weak ties

Extracting Information from Complex Networks

Rockefeller College University at Albany

Statistical Methods for Network Analysis: Exponential Random Graph Models

Tie-Strength and Strategies in Social Capital Management

Econ 430 Lecture 3: Significance and Structural Properties of N

beyond social networks

Failure in Complex Social Networks

Guanxi in the Chinese Web. University of Victoria Victoria, BC, Canada {yul, val,

CAIM: Cerca i Anàlisi d Informació Massiva

CSE 255 Lecture 13. Data Mining and Predictive Analytics. Triadic closure; strong & weak ties

CSE 158 Lecture 13. Web Mining and Recommender Systems. Triadic closure; strong & weak ties

Transitivity and Triads

Local Algorithms for Sparse Spanning Graphs

Detecting and Analyzing Communities in Social Network Graphs for Targeted Marketing

Centrality Book. cohesion.

1 Degree Distributions

CS 124/LINGUIST 180 From Languages to Information. Social Networks: Small Worlds, Weak Ties, and Power Laws. Dan Jurafsky Stanford University

Network Science: Principles and Applications

Lecture Note: Computation problems in social. network analysis

MINFS544: Business Network Data Analytics and Applications

Social-Tie-Based Information Dissemination in Mobile Opportunistic Social Networks

V2: Measures and Metrics (II)

CS 534: Computer Vision Segmentation and Perceptual Grouping

Week 5 Video 5. Relationship Mining Network Analysis

Systems, ESD.00. Networks II. Lecture 8. Lecturers: Professor Joseph Sussman Dr. Afreen Siddiqi TA: Regina Clewlow

Alessandro Del Ponte, Weijia Ran PAD 637 Week 3 Summary January 31, Wasserman and Faust, Chapter 3: Notation for Social Network Data

RANDOM-REAL NETWORKS

Section 7.13: Homophily (or Assortativity) By: Ralucca Gera, NPS

An Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization

Centralities (4) By: Ralucca Gera, NPS. Excellence Through Knowledge

SmallBlue: Unlock Collective Intelligence from Information Flows in Social Networks

Dijkstra s algorithm for shortest paths when no edges have negative weight.

Introduction to Engineering Systems, ESD.00. Networks. Lecturers: Professor Joseph Sussman Dr. Afreen Siddiqi TA: Regina Clewlow

How to explore big networks? Question: Perform a random walk on G. What is the average node degree among visited nodes, if avg degree in G is 200?

Graph Data Management

Algorithms and Applications in Social Networks. 2017/2018, Semester B Slava Novgorodov

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

Social and Technological Networks

Math.3336: Discrete Mathematics. Chapter 10 Graph Theory

Ma/CS 6b Class 10: Ramsey Theory

Modeling and Simulating Social Systems with MATLAB

Characterizing Graphs (3) Characterizing Graphs (1) Characterizing Graphs (2) Characterizing Graphs (4)

Social-Aware Routing in Delay Tolerant Networks

Lecture 6: Graph Properties

Clustering Algorithms on Graphs Community Detection 6CCS3WSN-7CCSMWAL

Ma/CS 6b Class 12: Ramsey Theory

Graph Theory. Network Science: Graph theory. Graph theory Terminology and notation. Graph theory Graph visualization

Data-Intensive Computing with MapReduce

Social Network Analysis as an Intelligence Technique: the Iranian Nuclear Weapons Program Revisited.

Matching Theory. Figure 1: Is this graph bipartite?

Assignment # 4 Selected Solutions

MC 302 GRAPH THEORY 10/1/13 Solutions to HW #2 50 points + 6 XC points

Mathematical Foundations

Community Detection. Community

Section 3.1: Nonseparable Graphs Cut vertex of a connected graph G: A vertex x G such that G x is not connected. Theorem 3.1, p. 57: Every connected

The Basics of Network Structure

Discrete Mathematics and Probability Theory Summer 2016 Dinh, Psomas, and Ye HW 2

arxiv: v1 [stat.ml] 2 Nov 2010

Counting Triangles & The Curse of the Last Reducer. Siddharth Suri Sergei Vassilvitskii Yahoo! Research

V10 Metabolic networks - Graph connectivity

MCL. (and other clustering algorithms) 858L

Supplementary material to Epidemic spreading on complex networks with community structures

Community Structure Detection. Amar Chandole Ameya Kabre Atishay Aggarwal

Degree Distribution: The case of Citation Networks

BUBBLE RAP: Social-Based Forwarding in Delay-Tolerant Networks

Networks in economics and finance. Lecture 1 - Measuring networks

Vertex Cover Approximations

Social Networks. Slides by : I. Koutsopoulos (AUEB), Source:L. Adamic, SN Analysis, Coursera course

Social Network Analysis of the Planetary Data System

SOMSN: An Effective Self Organizing Map for Clustering of Social Networks

Big Data Analytics Influx of data pertaining to the 4Vs, i.e. Volume, Veracity, Velocity and Variety

Transcription:

Tie strength, social capital, betweenness and homophily Rik Sarkar

Networks Position of a node in a network determines its role/importance Structure of a network determines its properties 2

Today Notion of strong ties (close friends) and weak ties (remote acquaintances) How they influence the network and spread of information Friendships and their evolution Central locations

Strong and weak ties Survey of job seekers show people often find jobs through social contacts More important: people more often find jobs through acquaintances (weak ties) than close friends (strong ties) Strength of weak ties. Mark S. Granovetter, American journal of Sociology, 1973

Strong and weak ties Explanation: A close friend is likely in the same community and has the same information sources Person in a different community is more likely to have new information, that you do not already know Weak ties are more critical: they can act as bridges across communities Other observation: Job information does not travel far long paths are not involved

Weak ties in social action Psychology: People do not often act on global information (radio, tv) etc People are more likely to act when confirmed by friends (creates trust) Therefore, people are more likely trust a leader when confirmed by direct familiarity or common friends acting as intermediaries A society without bridges is fragmented The leader does not reach a large number of people that trust him

Weak ties in social action Example (from Granovetter): A small town needs to coordinate action on a social issues If everyone works at different places in nearby industries Then people only know their families. There are no workacquaintances, etc. Organizing a protest is hard If everyone works at the same large industry Likely there are work-acquaintances (weak ties) Social action works better See also: Ted talk: Online social change: Easy to organize, hard to win (can you model and explain this?)

Triadic closure: Friends of Friends If two people have a friend in common, they are more likely to become friends Triadic closure If B & C both know A They are likely to meet, may be for extended time Likely to trust each-other

Bridge: Removing a bridge will disconnect network Rare in real networks Local bridge (A, B): If A, B have no friends in common Deleting (A, B) will increase distance to d > 2 d Is called the span of the bridge (A, B) Bridges

Strong triadic closure Suppose we know some ties to be strong, some to be weak For some definition of strong/ weak Strong triadic closure: If ab and bc are strong, then edge ac exists (may be weak, but it is there)

Strong triadic closure Theorem: if a network satisfies strong triadic closure and node A has 2 strong ties then any bridge involving A must be a weak tie. Proof: Easy! In real world, triadic closure is reasonably important Many examples People want their friends to be friends (otherwise it is hard to have groups) Absence of triadic closure implies poor relation between friends, stress etc

An experiment: Cell phone social net Network from phone conversations 18 weeks of all mobile calls for ~20% of US population, 90% had a mobile phone link: at least 1 reciprocating call. tie strength : aggregated duration of calls Onella et al. Structure and tie strengths in mobile communication networks. PNAS 2007

Observations Most people talk to few others, few talk to many people Power law-like distribution Hubs are relatively rare Strong ties are within clusters Onella et al. Structure and tie strengths in mobile communication networks. PNAS 2007

Possible network structures Efficiency: Inter-cluster ties are strong Eg. Highways, Internet routers, water distribution, etc, to allow large flows (C) Dyadic: tie strength depends on individual relationship only Simulated as random(b) Strength of weak ties (A) Opposite of c Argument: Social Information does not have a conservation requirement like transport or water

Other observations When strong ties are removed, network degrades slowly, but remains largely connected When the weak ties are removed, the network quickly and suddenly (phase transistion) falls apart. i.e disconnects into chunks Experiment: Spread a rumor in this network. Anyone having the rumor is likely to transmit probabilistically: ie. More likely in a longer conversation Observation: In majority of cases, people learn of it through ties of intermediate strength.

Neighborhood based tie strength N r (p): neighborhood of r hops centered at p. Sometimes written as B r (p) N(p) = N 1 (p) Neighborhood overlap of ab: A more continuous notion of strength And derived from the network Potential experiment : compare with other definitions of strengths Zero (or small, depending on definition of N) when ab is a local bridge

Neighborhood overlap Vs phone call duration

Embeddedness of an edge The number of common friends Higher embeddedness implies more people monitoring the relation B does not want to cheat A since E will no longer trust B But B can sacrifice relation with C without losing any direct friend

Structural holes B has is part of a bridge that spans a gap/ hole in the network B has early access to information from other parts of network Interesting ideas occur as synthesis of multiple ideas B has control over what the group learns from c and d B has reason to not allow triangles to form On the other hand, B s relations are not so protected by embeddedness How people actually behave in such situations is not well understood Tension between closure and brokerage

Social capital The ability to secure benefits by virtue of membership (and position) in social networks or other social structures Sometimes used as a property of a group

Betweenness & graph partitioning We want to split network into tightly knit groups (communities etc) Idea: Identify the bridges and remove them Bridges are central to the network They lie on shortest paths Betweenness of edge (e) (or vertex (v)): We send 1 unit of traffic between every pair of nodes in the network, and measure what fraction passes through e, assuming the flow is split equally among all shortest paths.

Partitioning (Girvan-newman) Repeat: Find edge e of highest betweenness Remove e Produces a hierarchic paritioning structure as the graph decomposes into smaller components

Computing betweenness Computing all shortest paths separately is inefficient A more efficient way: From each node: Step 1: Compute BFS tree Step 2: Find #shortest paths to each node Step 3: Find the flow through each edge

Computing betweenness From each node: Step 1: Compute BFS tree layers Step 2: Compute #shortest path to a node as sum of shortest paths to neighbors in previous layer of BFS

Computing betweenness Step 3: Work up from bottom layer: Every node receives 1 unit of flow for itself, plus whatever it needs to handle for nodes lower down

Computing betweenness Finally: Do this for all nodes, and add up Complexity?

Other Centrality measures Degree centrality nodes with high degree Eigen vector centrality (similar to pagerank) K-core: A maximal connected subgraph where every vertex has degree k or more in the subgraph

Homophily We are similar to our friends Not always explained by things intrinsic to the network like simple triadic closure External contexts like Culture, hobbies, interests influence networks Suppose the network has 2 types of nodes (eg. Male, female), fractions p and q Expected fraction of cross-gender edges: 2pq A test for homophily: Fraction of cross gender edges < 2pq

Homophily: The obesity epidemic Christakis and fowler (See ted talk: hidden influence of social networks) Is it that: People are selecting similar people? Other correlated hommophilic factors (existing food/cultural habits ) affecting data? Are obese friends influencing the habits causing more people to be obese? Authors argue that tracking data over a period of time shows significant evidence of the influence hypothesis It is an epidemic

S Social foci: affiliation networks

d Triadic closure in affiliation networks

Triadic Closures From student email dataset 32

Focal closure Classes as foci 33

Membership closure 34