Web 2.0 Social Data Analysis

Similar documents
Networks, Crowds, and Markets:

David Easley and Jon Kleinberg January 24, 2007

Web 2.0 Social Data Analysis

CS224W: Analysis of Networks Jure Leskovec, Stanford University

CIP- OSN Online Social Networks as Graphs. Dr. Thanassis Tiropanis -

Structure of Social Networks

THE KNOWLEDGE MANAGEMENT STRATEGY IN ORGANIZATIONS. Summer semester, 2016/2017

CSE 258 Lecture 12. Web Mining and Recommender Systems. Social networks

RANDOM-REAL NETWORKS

Social Networks, Social Interaction, and Outcomes

CSE 158 Lecture 11. Web Mining and Recommender Systems. Social networks

Network Thinking. Complexity: A Guided Tour, Chapters 15-16

Economics of Information Networks

Algorithms and Applications in Social Networks. 2017/2018, Semester B Slava Novgorodov

Tie strength, social capital, betweenness and homophily. Rik Sarkar

Math 443/543 Graph Theory Notes 10: Small world phenomenon and decentralized search

M.E.J. Newman: Models of the Small World

CSE 158 Lecture 11. Web Mining and Recommender Systems. Triadic closure; strong & weak ties

An Optimal Allocation Approach to Influence Maximization Problem on Modular Social Network. Tianyu Cao, Xindong Wu, Song Wang, Xiaohua Hu

CSE 255 Lecture 13. Data Mining and Predictive Analytics. Triadic closure; strong & weak ties

Examples of Complex Networks

Network Mathematics - Why is it a Small World? Oskar Sandberg

CSE 158 Lecture 13. Web Mining and Recommender Systems. Triadic closure; strong & weak ties

CS 124/LINGUIST 180 From Languages to Information. Social Networks: Small Worlds, Weak Ties, and Power Laws. Dan Jurafsky Stanford University

ECS 253 / MAE 253, Lecture 8 April 21, Web search and decentralized search on small-world networks

Extracting Information from Complex Networks

Networks and Discrete Mathematics

Theory and Applications of Complex Networks

Chapter 1. Social Media and Social Computing. October 2012 Youn-Hee Han

CS224W: Social and Information Network Analysis Jure Leskovec, Stanford University

CSE 190 Lecture 16. Data Mining and Predictive Analytics. Small-world phenomena

What is a Network? Theory and Applications of Complex Networks. Network Example 1: High School Friendships

Nick Hamilton Institute for Molecular Bioscience. Essential Graph Theory for Biologists. Image: Matt Moores, The Visible Cell

CSE 316: SOCIAL NETWORK ANALYSIS INTRODUCTION. Fall 2017 Marion Neumann

Basic Network Concepts

A Generating Function Approach to Analyze Random Graphs

Econ 430 Lecture 3: Significance and Structural Properties of N

Case Studies in Complex Networks

1 Comparing networks, and why social networks are different

How Do Real Networks Look? Networked Life NETS 112 Fall 2014 Prof. Michael Kearns

Social, Information, and Routing Networks: Models, Algorithms, and Strategic Behavior

Example 1: An algorithmic view of the small world phenomenon

Introduction to Networks and Business Intelligence

Complex Networks. Structure and Dynamics

TELCOM2125: Network Science and Analysis

- relationships (edges) among entities (nodes) - technology: Internet, World Wide Web - biology: genomics, gene expression, proteinprotein

Failure in Complex Social Networks

1 Degree Distributions

The Establishment Game. Motivation

arxiv: v1 [cs.si] 11 Jan 2019

Introduction to Complex Networks Analysis

Properties of Biological Networks

Modeling and Simulating Social Systems with MATLAB

Complex networks Phys 682 / CIS 629: Computational Methods for Nonlinear Systems

A Document as a Small World. selecting the term which co-occurs in multiple clusters. domains, from earthquake sequences [6] to register

An Investigation into the Free/Open Source Software Phenomenon using Data Mining, Social Network Theory, and Agent-Based

Recap! CMSC 498J: Social Media Computing. Department of Computer Science University of Maryland Spring Hadi Amiri

CAIM: Cerca i Anàlisi d Informació Massiva

Networks in economics and finance. Lecture 1 - Measuring networks

An Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization

Networks and stability

SLANG Session 4. Jason Quinley Roland Mühlenbernd Seminar für Sprachwissenschaft University of Tübingen

Algorithmic and Economic Aspects of Networks. Nicole Immorlica

Lesson 18. Laura Ricci 08/05/2017

Machine Learning and Modeling for Social Networks

Party hard! The maths of connections. Colva Roney-Dougal. March 23rd, University of St Andrews

Structural Analysis of Paper Citation and Co-Authorship Networks using Network Analysis Techniques

Introduction Types of Social Network Analysis Social Networks in the Online Age Data Mining for Social Network Analysis Applications Conclusion

The Directed Closure Process in Hybrid Social-Information Networks, with an Analysis of Link Formation on Twitter

Introduction To Graphs and Networks. Fall 2013 Carola Wenk

Breadth-First. Graphs in Python

Cascades. Rik Sarkar. Social and Technological Networks. University of Edinburgh, 2018.

Introduction to network metrics

Online Social Networks and Media. Positive and Negative Edges Strong and Weak Edges Strong Triadic Closure

Graph Theory and Network Measurment

Introduction to the Special Issue on AI & Networks

6. Overview. L3S Research Center, University of Hannover. 6.1 Section Motivation. Investigation of structural aspects of peer-to-peer networks

Online Social Networks and Media

Online Social Networks 2

Lecture 1: Networks: A brief introduction

Network Analysis Networks permeate our lives. Networks play a central role in determining the transmission of information about job opportunities, how

beyond social networks

Mathematics of Networks II

TELCOM2125: Network Science and Analysis

Graph Theory. Network Science: Graph theory. Graph theory Terminology and notation. Graph theory Graph visualization

Peer-to-Peer Data Management

UNIVERSITA DEGLI STUDI DI CATANIA FACOLTA DI INGEGNERIA

(Social) Networks Analysis III. Prof. Dr. Daning Hu Department of Informatics University of Zurich

Models of Network Formation. Networked Life NETS 112 Fall 2017 Prof. Michael Kearns

Information Networks and Semantics. Srinath Srinivasa IIIT Bangalore

Math/Stat 2300 Modeling using Graph Theory (March 23/25) from text A First Course in Mathematical Modeling, Giordano, Fox, Horton, Weir, 2009.

Transitivity and Triads

Small-World Models and Network Growth Models. Anastassia Semjonova Roman Tekhov

Degree Distribution: The case of Citation Networks

Navigation in Networks. Networked Life NETS 112 Fall 2017 Prof. Michael Kearns

L Modeling and Simulating Social Systems with MATLAB

Positive and Negative Links

Information Security in Scale Free Networks

CS 151. Graphs. Monday, November 19, 12

Monday, November 19, 12. Test #2 is Wednesday. For each data structure, you should know: the supported operations

Transcription:

Web 2.0 Social Data Analysis Ing. Jaroslav Kuchař jaroslav.kuchar@fit.cvut.cz Structure(1) Czech Technical University in Prague, Faculty of Information Technologies Software and Web Engineering

2 Contents Overview Graphs Strong and Weak Ties

3 Overview Wayne Zachary. An information flow model for conflict and fission in small groups. Journal of Anthropological Research, 33(4):452 473, 1977.

4 Applications Businesses analyze and improve communication flow in organization, partners, customers Law enforcement agencies (army) identify criminal and terrorist networks, key players Web Sites identify and recommend potential friends Organizations uncover conflicts of interests (government, lobbies, businesses)

5 Network datasets Collaboration Graphs - who works with whom Who-talks-to-Whom Graphs IM, call graphs Information Linkage Graphs pages and links, citations Technological Networks computers, power grid Networks in the Natural World food webs, cascading extinctions, neural connections

6 Graphs A graph is a way of specifying relationships among a collection of items. Objects Nodes Edges Directed Undirected Mathematical models and network structures

Arpanet in December 1970 http://som.csudh.edu/cis/lpress/history/arpamaps/ 7

Transit Schematics http://www.dpp.cz/dopravni-schemata/ 8

9 Graph Theory terminological jungle, in which any newcomer may plant a tree John A. Barnes. Social Networks. Number 26 in Modules in Anthropology. Addison Wesley, 1972. Path sequence of nodes connected by edges Cycles - path with at least three edges, first and last nodes are the same Connectivity if exists path between nodes

10 Graph Theory Components group of nodes, which are connected every node has a path to every other the group is not part of some larger group Giant Components - connected component that contains a significant fraction of all the nodes

The collaboration graph of the biological research center Structural Genomics of Pathogenic Protozoa (SGPP) Shawn M. Douglas, Gaetano T. Montelione, and Mark Gerstein. PubNet: a flexible system for visualizing literature derived networks. Genome Biology, 6(9), 2005. 11

American High School 18 month period Peter Bearman, James Moody, and Katherine Stovel. Chains of affection: The structure of adolescent romantic and sexual networks. American Journal of Sociology, 110(1):44 99, 2004. 12

13 Graph Theory Length - number of edges in the sequence Distance shortest path between nodes Breadth-first search

14 The Small-World Phenomenon The idea that the world looks small How short a path of friends it takes to get from you to almost anyone else Six degrees of separation I read somewhere that everybody on this planet is separated by only six other people. Six degrees of separation between us and everyone else on this planet. John Guare. Six Degrees of Separation: A Play. Vintage Books, 1990.

15 First experimental study Stanley Milgram 1960s $680 budget 296 random starters Forward letter to target person Stockbroker who lived in a suburb of Boston Given personal information about target Forward to someone the knew Same instructions

16 Instant Messaging The distribution of distances in the graph of all active Microsoft Instant Messenger user accounts, with an edge joining two users if they communicated at least once during a month-long observation period 240 million active user accounts average distance of 6.6

17 Paul Erdös Itinerant mathematician 1500 papers Erdös number the distance from him or her to Erdös 4 or 5 Albert Einstein 2 Enrico Fermi 3 James Watson - 6

18 Kevin Bacon Movie actors and actresses His or her distance in this graph to Kevin Bacon Average = 2.9 I found an incredibly obscure 1928 Soviet pirate film, Plenniki Morya, starring P. Savin with a Bacon number of 7, and whose supporting cast of 8 appeared nowhere else

19 Strong and Weak Ties how information flows through a social network how different nodes can play structurally distinct roles shape the evolution of the network Weight of Edge: Frequency of interaction, number of exchanged items, strength of relationship

20 Triadic Closure If two people in a social network have a friend in common, then there is an increased likelihood that they will become friends themselves at some point in the future Anatole Rapoport. Spread of information through a population with socio-structural bias I: Assumption of transitivity. Bulletin of Mathematical Biophysics, 15(4):523 533, December 1953.

21 The Clustering Coefficient simple social network measure probability that two randomly selected friends are friends with each other fraction of pairs of friends that are connected to each other by edges

22 The Clustering Coefficient A coefficient = 1/6 only C-D from all possible (C-D,D-E, B-E, B-C,C-E,D-B) A coefficient = 2/6 = 1/3 C-D and B-C from all possible (C-D,D-E, B-E, B-C,C-E,D-B)

23 Bridge and Local Bridge Bridge Local Bridge

24 Neighborhood overlap Helps identify local bridges Edge between A and B Local Bridge -> NO = 0 NO A-B # nodes who are neighbors of both A and B # nodes who are neighbors of at least one A and B

25 Neighborhood overlap Edge A-F NO = 1/6 Edge A-B NO = 0/8

Different node plays different role 26

27 Embeddedness Embeddedness of edge number of common neighbors the two endpoints have A-B = 2 Common E and F Local bridges has embeddedness = 0 Signifficant embeddedness -> easier trust, confidence (social, economic)

28 Structural hole Node B Access information from non-interacting parts Interface Novel ideas Gatekeeping

Questions 29

30 Resources D. Easley, J. Kleinberg: Networks, Crowds and Markets: Reasoning About a Highly Connected World, Cambridge University Press, 2010, ISBN 0521195330. http://www.slideshare.net/gcheliotis/socialnetwork-analysis-3273045