The Trouble with Community Detection

Size: px
Start display at page:

Download "The Trouble with Community Detection"

Transcription

1 HV 3 The Trouble with Community Detection Aaron Computer Science Dept. & BioFrontiers Institute University of Colorado, Boulder External Faculty, Santa Fe Institute 2016 Aaron Clauset 1 June 2016

2 D community detection large-scale structure = community structure vertices with same pattern of inter-community connections community interaction matrix 9 s D t 600 YQKDAPNY DPNY PNY 13

3 lies, damned lies, and community detection

4 lies, damned lies, and community detection many networks include metadata on their nodes: social networks food webs Internet protein interactions age, sex, ethnicity or race, etc. feeding mode, species body mass, etc. data capacity, physical location, etc. molecular weight, association with cancer, etc. metadata x is often used to evaluate the accuracy of community detection algs. if community detection method finds a partition that correlates with then we say that is good A A P x

5 ct and ation, we ally relevant d vice versa. We rates (Fig. 4). As e the highest loss rate ost conserved across all f p lost (R) for ultra-peripheral ative, but hardly surprising. The n-hub connectors (role R3) and wever, yields a surprising finding. The ial hubs class have many within-module lies, damned lies, and community detection re 3 Cartographic representation of the metabolic network of E. coli. Each circle presents a module and is coloured according to the KEGG pathway classification of the etabolites it contains. Certain important nodes are depicted as triangles (non-hub ctors), hexagons (connector hubs) and squares (provincial hubs). Interactions modules and nodes are depicted using lines, with thickness proportional to the Nature Publishing Group representation number of actual links. Inset: metabolic network of E. coli, which contains 473 metabolites and 574 links. This representation was obtained using the program Pajek. Each node is coloured according to the main colour of its module, as obtained from the cartographic NATURE VOL FEBRUARY !"#$%$&'()')*+,$

6 lies, damned lies, and community detection several recent studies claim these are the exception real networks either do not contain structural communities or communities exist but they do not correlate with metadata groups "classic" data sets online data sets Hric et al. (2014) maximum NMI between any partition layer of the metadata partitions and any layer returned by the community detection method [1] see Leskovec et al. (2009), and Yang & Leskovec (2012), and Hric, Darst & Fortunato (2014)

7 but wait! the idea x P 2 {P} use metadata to help select a partition that correlates with, from among the exponential number of plausible partitions a metadata-aware degree-corrected stochastic block model x metadata degree prior on community assignments edge probability block parameters x = {x u } d = {d u } P (s, x) = Y i p uv = d u d v su,s v st s i,x i [1] technical details in Newman & Clauset (2015) arxiv:

8 but wait! the idea x P 2 {P} use metadata to help select a partition that correlates with, from among the exponential number of plausible partitions a metadata-aware degree-corrected stochastic block model model likelihood (with adjacency matrix A) x P (A,, x) = X s network metadata P (A, s)p (s, x) = X s Y u<v p A uv uv (1 p uv ) 1 A uv Y u s u,x u inference performed using EM algorithm to choose and [1] technical details in Newman & Clauset (2015) arxiv:

9 networks with planted structure does this method recover known structure in synthetic data? we use the planted partition test: 2 groups, equal sizes, constant mean degree assign metadata x u = ground truth with prob. = {0.5,...,0.9} c in n c out n c out n c in n

10 networks with planted structure let mean degree c =8 when =0.5 metadata is irrelevant and we recover regular SBM behavior Fraction of correctly assigned nodes weaker undetectable stronger easy to detect [1] n = c in -c out

11 networks with planted structure let mean degree when for =0.5 > 0.5 metadata + SBM c =8 beats any algorithm without metadata, and beats metadata alone. metadata is irrelevant and we recover regular SBM behavior Fraction of correctly assigned nodes weaker undetectable stronger easy to detect [1] n = c in -c out

12 real-world networks

13 real-world networks 1. high school social network: 795 students in a medium-sized American high school and its feeder middle school 2. marine food web: predator-prey interactions among 488 species in Weddell Sea in Antarctica 3. Malaria gene recombinations: recombination events among 297 var genes 4. Facebook friendships: online friendships among 15,126 Harvard students and alumni 5. Internet graph: peering relations among 46,676 Autonomous Systems

14 real-world networks 1. high school social network: 795 students in a medium-sized American high school and its feeder middle school x = {grade 7-12, ethnicity, gender} method finds a good partition between high-school and middle-school NMI. = without metadata: NMI 2 [0.105, 0.384] [1] Add Health network data, designed by Udry, Bearman & Harris

15 real-world networks 1. high school social network: 795 students in a medium-sized American high school and its feeder middle school x = {grade 7-12, ethnicity, gender} method finds a good partition between blacks and whites (with others scattered among) NMI = without metadata: NMI 2 [0.120, 0.239] [1] Add Health network data, designed by Udry, Bearman & Harris

16 real-world networks 1. high school social network: 795 students in a medium-sized American high school and its feeder middle school x = {grade 7-12, ethnicity, gender} method finds no good partition between males/ females. instead, chooses a mixture of grade/ethnicity partitions NMI = without metadata: NMI 2 [0.000, 0.010] [1] Add Health network data, designed by Udry, Bearman & Harris

17 PNY DDPNY conclusions PNY 13

18 PNY DDPNY conclusions PNY 13 the trouble with community detection 1. many good local optima 2. naive optimization inconsistent against node metadata

19 PNY DDPNY conclusions PNY 13 the trouble with community detection 1. many good local optima 2. naive optimization inconsistent against node metadata two birds with one stone a metadata-aware stochastic block model choose a partition that correlates with edges, and metadata (if possible) learns metadata associations with communities more accurate than any algorithm that uses only edges or metadata highly scalable EM + Belief Propagation algorithm P (A,, x)

20 acknowledgements Funding support: Structure and inference in annotated networks M. E. J. Newman 1, 2 and Aaron Clauset 2, 3 1 Department of Physics and Center for the Study of Complex Systems, University of Michigan, Ann Arbor, MI 48109, USA 2 Santa Fe Institute, 1399 Hyde Park Road, Santa Fe, NM Department of Computer Science and BioFrontiers Institute, University of Colorado, Boulder, CO 80309, USA Mark Newman (Michigan) to appear, Nature Communications (arxiv: ) PHYSICAL REVIEW E 81, Benjamin H. Good (Harvard) Yva de Montjoye (MIT) Performance of modularity maximization in practical contexts Benjamin H. Good, 1,2, * Yves-Alexandre de Montjoye, 3,2, and Aaron Clauset 2, 1 Department of Physics, Swarthmore College, Swarthmore, Pennsylvania 19081, USA 2 Santa Fe Institute, 1399 Hyde Park Road, Santa Fe, New Mexico 87501, USA 3 Department of Applied Mathematics, Université Catholique de Louvain, 4 Avenue Georges Lemaitre, B-1348 Louvain-la-Neuve, Belgium Received 1 October 2009; published 15 April 2010 [1] a C implementation of the metadata SBM is available by request

21 PNY DDPNY PNY 13 fin community detection

22 real-world networks ut metadata 2. marine food web: predator-prey interactions among 488 species in Weddell Sea in Antarctica x = {species body mass, feeding mode, oceanic zone} partition recovers known correlation between body mass, trophic level, and ecosystem role: 8 Probability of community membership Mean body mass (g) [1] here, we re using a continuous FIG. metadata S4: Learned model priors, as a function of body mass, for the [2] Brose et al. (2005) three-community division of the Weddell Sea network shown Detritivore Carnivore Omnivore Herbivore Primary producer

23 real-world networks 3. Malaria gene recombinations: recombination events among 297 var genes x = {Cys-PoLV labels for HVR6 region} with metadata, partition discovers correlation with Cys labels (which are associated with severe disease) HVR6 without metadata with metadata NMI 2 [0.077, 0.675] NMI = [1] Larremore, Clauset & Buckee (2013)

24 real-world networks 3. Malaria gene recombinations: recombination events among 297 var genes x = {Cys-PoLV labels for HVR6 region} on adjacent region of gene, we find Cys-PoLV labels correlate with recombinant structure here, too HVR5 without metadata with metadata [1] Larremore, Clauset & Buckee (2013)

25 real-world networks 4. Facebook friendships: online friendships among 15,126 Harvard students and alumni (in Sept. 2005) x = {graduation year, dormitory} method finds a good partition between alumni, recent graduates, upperclassmen, sophomores, and freshmen NMI. = without metadata: NMI 2 [0.573, 0.641] Prior probability of membership None Year [1] Traud, Mucha & Porter (20012)

26 real-world networks 4. Facebook friendships: online friendships among 15,126 Harvard students and alumni (in Sept. 2005) x = {graduation year, dormitory} method finds a good partition among the dorms NMI. = without metadata: NMI 2 [0.074, 0.224] Prior probability of membership Dorm [1] Traud, Mucha & Porter (20012)

27 real-world networks 5. Internet graph: 262,953 peering relations among 46,676 Autonomous Systems x = {country location of AS} method finds a good partition along the lines of the 173 countries NMI. = without metadata: NMI 2 [0.398, 0.626] [1] here, we re using a continuous metadata model

Learning Community Structure in Networks with Metadata

Learning Community Structure in Networks with Metadata HV 3 Learning Community Structure in Networks with Metadata Aaron Clauset @aaronclauset Computer Science Dept. & BioFrontiers Institute University of Colorado, Boulder External Faculty, Santa Fe Institute

More information

Lecture 8: Generalized large-scale structure

Lecture 8: Generalized large-scale structure 3 Lecture 8: Generalized large-scale structure Aaron Clauset @aaronclauset Assistant Professor of Computer Science University of Colorado Boulder External Faculty, Santa Fe Institute 2017 Aaron Clauset

More information

Scale-free networks are rare

Scale-free networks are rare Scale-free networks are rare Anna Broido 1 and Aaron Clauset 2, 3 1. Applied Math Dept., CU Boulder 2. Computer Science Dept. & BioFrontiers Institute, CU Boulder 3. External Faculty, Santa Fe Institute

More information

Active Learning for Hidden Attributes in Networks

Active Learning for Hidden Attributes in Networks Active Learning for Hidden Attributes in Networks Xiao Ran Yan Department of Computer Science University of New Mexico Albuquerque, NM everyxt@gmail.com Jean-Baptiste Rouquier Complex Systems Institute

More information

Network structure, metadata and the prediction of missing nodes

Network structure, metadata and the prediction of missing nodes Network structure, metadata and the prediction of missing nodes Tiago P. Peixoto Universität Bremen, Germany & ISI Foundation, Italy Darko Hric Aalto University, Finland Santo Fortunato Indiana University,

More information

Community Detection: Comparison of State of the Art Algorithms

Community Detection: Comparison of State of the Art Algorithms Community Detection: Comparison of State of the Art Algorithms Josiane Mothe IRIT, UMR5505 CNRS & ESPE, Univ. de Toulouse Toulouse, France e-mail: josiane.mothe@irit.fr Karen Mkhitaryan Institute for Informatics

More information

Statistical Physics of Community Detection

Statistical Physics of Community Detection Statistical Physics of Community Detection Keegan Go (keegango), Kenji Hata (khata) December 8, 2015 1 Introduction Community detection is a key problem in network science. Identifying communities, defined

More information

Web 2.0 Social Data Analysis

Web 2.0 Social Data Analysis Web 2.0 Social Data Analysis Ing. Jaroslav Kuchař jaroslav.kuchar@fit.cvut.cz Structure (2) Czech Technical University in Prague, Faculty of Information Technologies Software and Web Engineering 2 Contents

More information

Edge Weight Method for Community Detection in Scale-Free Networks

Edge Weight Method for Community Detection in Scale-Free Networks Edge Weight Method for Community Detection in Scale-Free Networks Sorn Jarukasemratana Tsuyoshi Murata Tokyo Institute of Technology WIMS'14 June 2-4, 2014 - Thessaloniki, Greece Modularity High modularity

More information

An Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization

An Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization An Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization Pedro Ribeiro (DCC/FCUP & CRACS/INESC-TEC) Part 1 Motivation and emergence of Network Science

More information

Generative Models for Complex Network Structure

Generative Models for Complex Network Structure Generative Models for Complex Network Structure Aaron Clauset @aaronclauset Computer Science Dept. & BioFrontiers Institute University of Colorado, Boulder External Faculty, Santa Fe Institute 4 June 2013

More information

IF IS FOR OLD PEOPLE, ARE UNDERGRADUATES OLD PEOPLE?

IF  IS FOR OLD PEOPLE, ARE UNDERGRADUATES OLD PEOPLE? IF E-MAIL IS FOR OLD PEOPLE, ARE UNDERGRADUATES OLD PEOPLE? Wallace A. Wood, Bryant University, wwood@bryant.edu Suhong Li, Bryant University, sli@bryant.edu ====================================================================

More information

Community Structure Detection. Amar Chandole Ameya Kabre Atishay Aggarwal

Community Structure Detection. Amar Chandole Ameya Kabre Atishay Aggarwal Community Structure Detection Amar Chandole Ameya Kabre Atishay Aggarwal What is a network? Group or system of interconnected people or things Ways to represent a network: Matrices Sets Sequences Time

More information

Inferring Regulatory Networks by Combining Perturbation Screens and Steady State Gene Expression Profiles

Inferring Regulatory Networks by Combining Perturbation Screens and Steady State Gene Expression Profiles Supporting Information to Inferring Regulatory Networks by Combining Perturbation Screens and Steady State Gene Expression Profiles Ali Shojaie,#, Alexandra Jauhiainen 2,#, Michael Kallitsis 3,#, George

More information

Modularity CMSC 858L

Modularity CMSC 858L Modularity CMSC 858L Module-detection for Function Prediction Biological networks generally modular (Hartwell+, 1999) We can try to find the modules within a network. Once we find modules, we can look

More information

Section 7.13: Homophily (or Assortativity) By: Ralucca Gera, NPS

Section 7.13: Homophily (or Assortativity) By: Ralucca Gera, NPS Section 7.13: Homophily (or Assortativity) By: Ralucca Gera, NPS Are hubs adjacent to hubs? How does a node s degree relate to its neighbors degree? Real networks usually show a non-zero degree correlation

More information

What is a Network? Theory and Applications of Complex Networks. Network Example 1: High School Friendships

What is a Network? Theory and Applications of Complex Networks. Network Example 1: High School Friendships 1 2 Class One 1. A collection of nodes What is a Network? 2. A collection of edges connecting nodes College of the Atlantic 12 September 2008 http://hornacek.coa.edu/dave/ 1. What is a network? 2. Many

More information

Graph Theory Review. January 30, Network Science Analytics Graph Theory Review 1

Graph Theory Review. January 30, Network Science Analytics Graph Theory Review 1 Graph Theory Review Gonzalo Mateos Dept. of ECE and Goergen Institute for Data Science University of Rochester gmateosb@ece.rochester.edu http://www.ece.rochester.edu/~gmateosb/ January 30, 2018 Network

More information

Spectral Clustering and Community Detection in Labeled Graphs

Spectral Clustering and Community Detection in Labeled Graphs Spectral Clustering and Community Detection in Labeled Graphs Brandon Fain, Stavros Sintos, Nisarg Raval Machine Learning (CompSci 571D / STA 561D) December 7, 2015 {btfain, nisarg, ssintos} at cs.duke.edu

More information

Local higher-order graph clustering

Local higher-order graph clustering Local higher-order graph clustering Hao Yin Stanford University yinh@stanford.edu Austin R. Benson Cornell University arb@cornell.edu Jure Leskovec Stanford University jure@cs.stanford.edu David F. Gleich

More information

Eligibility Application for ENP Certification

Eligibility Application for ENP Certification Eligibility Application for ENP Certification Please fill in the following information as indicated. NENA Member $ Non-Member $ Re-Certification $ Re-Examination $1 Type of payment included: Check enclosed

More information

A Simple Acceleration Method for the Louvain Algorithm

A Simple Acceleration Method for the Louvain Algorithm A Simple Acceleration Method for the Louvain Algorithm Naoto Ozaki, Hiroshi Tezuka, Mary Inaba * Graduate School of Information Science and Technology, University of Tokyo, Tokyo, Japan. * Corresponding

More information

Theory and Applications of Complex Networks

Theory and Applications of Complex Networks Theory and Applications of Complex Networks 1 Theory and Applications of Complex Networks Class One College of the Atlantic David P. Feldman 12 September 2008 http://hornacek.coa.edu/dave/ 1. What is a

More information

Statistical Methods for Network Analysis: Exponential Random Graph Models

Statistical Methods for Network Analysis: Exponential Random Graph Models Day 2: Network Modeling Statistical Methods for Network Analysis: Exponential Random Graph Models NMID workshop September 17 21, 2012 Prof. Martina Morris Prof. Steven Goodreau Supported by the US National

More information

CS224W: Analysis of Networks Jure Leskovec, Stanford University

CS224W: Analysis of Networks Jure Leskovec, Stanford University CS224W: Analysis of Networks Jure Leskovec, Stanford University http://cs224w.stanford.edu 11/13/17 Jure Leskovec, Stanford CS224W: Analysis of Networks, http://cs224w.stanford.edu 2 Observations Models

More information

Hierarchical Problems for Community Detection in Complex Networks

Hierarchical Problems for Community Detection in Complex Networks Hierarchical Problems for Community Detection in Complex Networks Benjamin Good 1,, and Aaron Clauset, 1 Physics Department, Swarthmore College, 500 College Ave., Swarthmore PA, 19081 Santa Fe Institute,

More information

1 More stochastic block model. Pr(G θ) G=(V, E) 1.1 Model definition. 1.2 Fitting the model to data. Prof. Aaron Clauset 7 November 2013

1 More stochastic block model. Pr(G θ) G=(V, E) 1.1 Model definition. 1.2 Fitting the model to data. Prof. Aaron Clauset 7 November 2013 1 More stochastic block model Recall that the stochastic block model (SBM is a generative model for network structure and thus defines a probability distribution over networks Pr(G θ, where θ represents

More information

Telephone Survey Response: Effects of Cell Phones in Landline Households

Telephone Survey Response: Effects of Cell Phones in Landline Households Telephone Survey Response: Effects of Cell Phones in Landline Households Dennis Lambries* ¹, Michael Link², Robert Oldendick 1 ¹University of South Carolina, ²Centers for Disease Control and Prevention

More information

CPSC 217 Assignment 4

CPSC 217 Assignment 4 CPSC 217 Assignment 4 Due: Friday December 8, 2017 at 11:55pm Weight: 7% Sample Solution Length: Approximately 130 lines (No comments and no code for the A+ part) Individual Work: All assignments in this

More information

1 Homophily and assortative mixing

1 Homophily and assortative mixing 1 Homophily and assortative mixing Networks, and particularly social networks, often exhibit a property called homophily or assortative mixing, which simply means that the attributes of vertices correlate

More information

Spectral Methods for Network Community Detection and Graph Partitioning

Spectral Methods for Network Community Detection and Graph Partitioning Spectral Methods for Network Community Detection and Graph Partitioning M. E. J. Newman Department of Physics, University of Michigan Presenters: Yunqi Guo Xueyin Yu Yuanqi Li 1 Outline: Community Detection

More information

Topic mash II: assortativity, resilience, link prediction CS224W

Topic mash II: assortativity, resilience, link prediction CS224W Topic mash II: assortativity, resilience, link prediction CS224W Outline Node vs. edge percolation Resilience of randomly vs. preferentially grown networks Resilience in real-world networks network resilience

More information

Community detection algorithms survey and overlapping communities. Presented by Sai Ravi Kiran Mallampati

Community detection algorithms survey and overlapping communities. Presented by Sai Ravi Kiran Mallampati Community detection algorithms survey and overlapping communities Presented by Sai Ravi Kiran Mallampati (sairavi5@vt.edu) 1 Outline Various community detection algorithms: Intuition * Evaluation of the

More information

Advanced Algorithms and Models for Computational Biology -- a machine learning approach

Advanced Algorithms and Models for Computational Biology -- a machine learning approach Advanced Algorithms and Models for Computational Biology -- a machine learning approach Biological Networks & Network Evolution Eric Xing Lecture 22, April 10, 2006 Reading: Molecular Networks Interaction

More information

A Prehistory of Arithmetic

A Prehistory of Arithmetic A Prehistory of Arithmetic History and Philosophy of Mathematics MathFest August 8, 2015 Patricia Baggett Andrzej Ehrenfeucht Dept. of Math Sci. Computer Science Dept. New Mexico State Univ. University

More information

Statistical Analysis of List Experiments

Statistical Analysis of List Experiments Statistical Analysis of List Experiments Kosuke Imai Princeton University Joint work with Graeme Blair October 29, 2010 Blair and Imai (Princeton) List Experiments NJIT (Mathematics) 1 / 26 Motivation

More information

Final report - CS224W: Using Wikipedia pages traffic data to infer pertinence of links

Final report - CS224W: Using Wikipedia pages traffic data to infer pertinence of links Final report - CS224W: Using Wikipedia pages traffic data to infer pertinence of links Marc Thibault; SUNetID number: 06227968 Mael Trean; SUNetIDnumber: 06228438 I - Introduction/Motivation/Problem Definition

More information

Accelerating Genomic Sequence Alignment Workload with Scalable Vector Architecture

Accelerating Genomic Sequence Alignment Workload with Scalable Vector Architecture Accelerating Genomic Sequence Alignment Workload with Scalable Vector Architecture Dong-hyeon Park, Jon Beaumont, Trevor Mudge University of Michigan, Ann Arbor Genomics Past Weeks ~$3 billion Human Genome

More information

Community Detection based on Structural and Attribute Similarities

Community Detection based on Structural and Attribute Similarities Community Detection based on Structural and Attribute Similarities The Anh Dang, Emmanuel Viennet L2TI - Institut Galilée - Université Paris-Nord 99, avenue Jean-Baptiste Clément - 93430 Villetaneuse -

More information

Comparison of Centralities for Biological Networks

Comparison of Centralities for Biological Networks Comparison of Centralities for Biological Networks Dirk Koschützki and Falk Schreiber Bioinformatics Center Gatersleben-Halle Institute of Plant Genetics and Crop Plant Research Corrensstraße 3 06466 Gatersleben,

More information

Incoming, Outgoing Degree and Importance Analysis of Network Motifs

Incoming, Outgoing Degree and Importance Analysis of Network Motifs Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 4, Issue. 6, June 2015, pg.758

More information

Bulk Registration File Specifications

Bulk Registration File Specifications Bulk Registration File Specifications 2017-18 SCHOOL YEAR Summary of Changes Added new errors for Student ID and SSN Preparing Files for Upload (Option 1) Follow these tips and the field-level specifications

More information

2018 NSP Student Leader Contact Form

2018 NSP Student Leader Contact Form 2018 NSP Student Leader Contact Form Welcome to the Office of New Student Programs! We are extremely excited to have you on our team. Please complete the below form to confirm your acceptance. Student

More information

Silicosis Prevalence Among Medicare Beneficiaries,

Silicosis Prevalence Among Medicare Beneficiaries, Silicosis Prevalence Among Medicare Beneficiaries, 1999 2014 Megan Casey, RN, BSN, MPH Nurse Epidemiologist Expanding Research Partnerships: State of the Science June 21, 2017 National Institute for Occupational

More information

CAIM: Cerca i Anàlisi d Informació Massiva

CAIM: Cerca i Anàlisi d Informació Massiva 1 / 72 CAIM: Cerca i Anàlisi d Informació Massiva FIB, Grau en Enginyeria Informàtica Slides by Marta Arias, José Balcázar, Ricard Gavaldá Department of Computer Science, UPC Fall 2016 http://www.cs.upc.edu/~caim

More information

eschoolplus Alief Independent School Distirct ONLINE STUDENT ENROLLMENT

eschoolplus Alief Independent School Distirct ONLINE STUDENT ENROLLMENT ONLINE STUDENT ENROLLMENT How to Register with Enrollment Online 1. Go to https://aliefhac1.aliefisd.net/eo_parent/user/login.aspx 2. Click on Register New Account. 3. Complete the registration screen

More information

Community detection. Leonid E. Zhukov

Community detection. Leonid E. Zhukov Community detection Leonid E. Zhukov School of Data Analysis and Artificial Intelligence Department of Computer Science National Research University Higher School of Economics Network Science Leonid E.

More information

Understanding complex networks with community-finding algorithms

Understanding complex networks with community-finding algorithms Understanding complex networks with community-finding algorithms Eric D. Kelsic 1 SURF 25 Final Report 1 California Institute of Technology, Pasadena, CA 91126, USA (Dated: November 1, 25) In a complex

More information

My favorite application using eigenvalues: partitioning and community detection in social networks

My favorite application using eigenvalues: partitioning and community detection in social networks My favorite application using eigenvalues: partitioning and community detection in social networks Will Hobbs February 17, 2013 Abstract Social networks are often organized into families, friendship groups,

More information

Online Social Networks and Media. Community detection

Online Social Networks and Media. Community detection Online Social Networks and Media Community detection 1 Notes on Homework 1 1. You should write your own code for generating the graphs. You may use SNAP graph primitives (e.g., add node/edge) 2. For the

More information

The Generalized Topological Overlap Matrix in Biological Network Analysis

The Generalized Topological Overlap Matrix in Biological Network Analysis The Generalized Topological Overlap Matrix in Biological Network Analysis Andy Yip, Steve Horvath Email: shorvath@mednet.ucla.edu Depts Human Genetics and Biostatistics, University of California, Los Angeles

More information

Crawling and Detecting Community Structure in Online Social Networks using Local Information

Crawling and Detecting Community Structure in Online Social Networks using Local Information Crawling and Detecting Community Structure in Online Social Networks using Local Information Norbert Blenn, Christian Doerr, Bas Van Kester, Piet Van Mieghem Department of Telecommunication TU Delft, Mekelweg

More information

Graph Theory. Graph Theory. COURSE: Introduction to Biological Networks. Euler s Solution LECTURE 1: INTRODUCTION TO NETWORKS.

Graph Theory. Graph Theory. COURSE: Introduction to Biological Networks. Euler s Solution LECTURE 1: INTRODUCTION TO NETWORKS. Graph Theory COURSE: Introduction to Biological Networks LECTURE 1: INTRODUCTION TO NETWORKS Arun Krishnan Koenigsberg, Russia Is it possible to walk with a route that crosses each bridge exactly once,

More information

Tri-C College Credit Plus Application Instructions

Tri-C College Credit Plus Application Instructions Tri-C College Credit Plus Application Instructions 1. Go to the website: http://www.tri-c.edu/college-credit-plus/. This website provides you with enrollment information for prospective and current CCP

More information

Overview of Clustering

Overview of Clustering based on Loïc Cerfs slides (UFMG) April 2017 UCBL LIRIS DM2L Example of applicative problem Student profiles Given the marks received by students for different courses, how to group the students so that

More information

10701 Machine Learning. Clustering

10701 Machine Learning. Clustering 171 Machine Learning Clustering What is Clustering? Organizing data into clusters such that there is high intra-cluster similarity low inter-cluster similarity Informally, finding natural groupings among

More information

Supplementary material to Epidemic spreading on complex networks with community structures

Supplementary material to Epidemic spreading on complex networks with community structures Supplementary material to Epidemic spreading on complex networks with community structures Clara Stegehuis, Remco van der Hofstad, Johan S. H. van Leeuwaarden Supplementary otes Supplementary ote etwork

More information

Hierarchical Overlapping Community Discovery Algorithm Based on Node purity

Hierarchical Overlapping Community Discovery Algorithm Based on Node purity Hierarchical Overlapping ommunity Discovery Algorithm Based on Node purity Guoyong ai, Ruili Wang, and Guobin Liu Guilin University of Electronic Technology, Guilin, Guangxi, hina ccgycai@guet.edu.cn,

More information

Algorithms and Applications in Social Networks. 2017/2018, Semester B Slava Novgorodov

Algorithms and Applications in Social Networks. 2017/2018, Semester B Slava Novgorodov Algorithms and Applications in Social Networks 2017/2018, Semester B Slava Novgorodov 1 Lesson #1 Administrative questions Course overview Introduction to Social Networks Basic definitions Network properties

More information

M ost complex systems in various fields, such as social networks in social science, the Internet in engineering,

M ost complex systems in various fields, such as social networks in social science, the Internet in engineering, OEN SUBJECT AREAS: COMLEX NETWORKS COMUTER SCIENCE Received 19 September 2013 Accepted 27 January 2015 ublished 2 March 2015 Correspondence and requests for materials should be addressed to W.Z. (weixiong.

More information

HIHIM Application for Autumn 2018 Use this list to ensure that you have a complete application to the HIHIM Program.

HIHIM Application for Autumn 2018 Use this list to ensure that you have a complete application to the HIHIM Program. HIHIM Application for Autumn 2018 Use this list to ensure that you have a complete application to the HIHIM Program. Step 1 Type in your information. Hand-write only the signature and date. Step 2 Be sure

More information

HPU COURSE REQUEST FORM INSTRUCTIONS For past course syllabi:

HPU COURSE REQUEST FORM INSTRUCTIONS For past course syllabi: UNDERGRADUATE VISITING STUDENT PRE-APPROVED COURSE LIST: SPRING 2019 HPU COURSE REQUEST FORM INSTRUCTIONS For past course syllabi: http://apps.hpu.edu/cis/web/index.php/search For instructions on how to

More information

College and Career Readiness Mathematics. Time allowed: 90 minutes

College and Career Readiness Mathematics. Time allowed: 90 minutes Mathematics Assessment Program CCR-B1 College and Career Readiness Mathematics Time allowed: 90 minutes These tasks give you a chance to show what you know and how you reason, and to solve mathematical

More information

ESL Student Demographics and Achievements (2004 to 2013)

ESL Student Demographics and Achievements (2004 to 2013) ESL Student Demographics and Achievements (2004 to 2013) Scope of Analysis: Population: Students who passed any one or more of the English 80/82/83/361/362/363 courses Time range for the analysis: Summer

More information

1 Large-scale network structures

1 Large-scale network structures Inference, Models and Simulation for Complex Systems CSCI 7000-001 Lecture 15 25 October 2011 Prof. Aaron Clauset 1 Large-scale network structures The structural measures we met in Lectures 12a and 12b,

More information

Community Detection in Bipartite Networks:

Community Detection in Bipartite Networks: Community Detection in Bipartite Networks: Algorithms and Case Studies Kathy Horadam and Taher Alzahrani Mathematical and Geospatial Sciences, RMIT Melbourne, Australia IWCNA 2014 Community Detection,

More information

Social Data Management Communities

Social Data Management Communities Social Data Management Communities Antoine Amarilli 1, Silviu Maniu 2 January 9th, 2018 1 Télécom ParisTech 2 Université Paris-Sud 1/20 Table of contents Communities in Graphs 2/20 Graph Communities Communities

More information

Extracting Communities from Networks

Extracting Communities from Networks Extracting Communities from Networks Ji Zhu Department of Statistics, University of Michigan Joint work with Yunpeng Zhao and Elizaveta Levina Outline Review of community detection Community extraction

More information

Estimation of Unknown Parameters in Dynamic Models Using the Method of Simulated Moments (MSM)

Estimation of Unknown Parameters in Dynamic Models Using the Method of Simulated Moments (MSM) Estimation of Unknown Parameters in ynamic Models Using the Method of Simulated Moments (MSM) Abstract: We introduce the Method of Simulated Moments (MSM) for estimating unknown parameters in dynamic models.

More information

THE 2018 CENTER FOR DIVERSITY IN PUBLIC HEALTH LEADERSHIP TRAINING PROGRAMS ONLINE GUIDLINES

THE 2018 CENTER FOR DIVERSITY IN PUBLIC HEALTH LEADERSHIP TRAINING PROGRAMS ONLINE GUIDLINES THE 2018 CENTER FOR DIVERSITY IN PUBLIC HEALTH LEADERSHIP TRAINING PROGRAMS ONLINE GUIDLINES James A. Ferguson Emerging Infectious Diseases Research Initiatives for Student Enhancement (RISE) Fellowship

More information

Network Routing Protocol using Genetic Algorithms

Network Routing Protocol using Genetic Algorithms International Journal of Electrical & Computer Sciences IJECS-IJENS Vol:0 No:02 40 Network Routing Protocol using Genetic Algorithms Gihan Nagib and Wahied G. Ali Abstract This paper aims to develop a

More information

Community Structure and Beyond

Community Structure and Beyond Community Structure and Beyond Elizabeth A. Leicht MAE: 298 April 9, 2009 Why do we care about community structure? Large Networks Discussion Outline Overview of past work on community structure. How to

More information

Ensemble registration: Combining groupwise registration and segmentation

Ensemble registration: Combining groupwise registration and segmentation PURWANI, COOTES, TWINING: ENSEMBLE REGISTRATION 1 Ensemble registration: Combining groupwise registration and segmentation Sri Purwani 1,2 sri.purwani@postgrad.manchester.ac.uk Tim Cootes 1 t.cootes@manchester.ac.uk

More information

Quick Review of Graphs

Quick Review of Graphs COMP 102: Excursions in Computer Science Lecture 11: Graphs Instructor: (jpineau@cs.mcgill.ca) Class web page: www.cs.mcgill.ca/~jpineau/comp102 Quick Review of Graphs A graph is an abstract representation

More information

David Easley and Jon Kleinberg January 24, 2007

David Easley and Jon Kleinberg January 24, 2007 Networks: Spring 2007 Graph Theory David Easley and Jon Kleinberg January 24, 2007 A graph is simply a way of encoding the pairwise relationships among a set of objects: we will refer to the objects as

More information

Semantic Annotation for Semantic Social Networks. Using Community Resources

Semantic Annotation for Semantic Social Networks. Using Community Resources Semantic Annotation for Semantic Social Networks Using Community Resources Lawrence Reeve and Hyoil Han College of Information Science and Technology Drexel University, Philadelphia, PA 19108 lhr24@drexel.edu

More information

DESIGN OF EXPERIMENTS and ROBUST DESIGN

DESIGN OF EXPERIMENTS and ROBUST DESIGN DESIGN OF EXPERIMENTS and ROBUST DESIGN Problems in design and production environments often require experiments to find a solution. Design of experiments are a collection of statistical methods that,

More information

Dynamic network generative model

Dynamic network generative model Dynamic network generative model Habiba, Chayant Tantipathanananandh, Tanya Berger-Wolf University of Illinois at Chicago. In this work we present a statistical model for generating realistic dynamic networks

More information

Feature clustering and mutual information for the selection of variables in spectral data

Feature clustering and mutual information for the selection of variables in spectral data Feature clustering and mutual information for the selection of variables in spectral data C. Krier 1, D. François 2, F.Rossi 3, M. Verleysen 1 1,2 Université catholique de Louvain, Machine Learning Group

More information

Evaluating Classifiers

Evaluating Classifiers Evaluating Classifiers Charles Elkan elkan@cs.ucsd.edu January 18, 2011 In a real-world application of supervised learning, we have a training set of examples with labels, and a test set of examples with

More information

Database and Knowledge-Base Systems: Data Mining. Martin Ester

Database and Knowledge-Base Systems: Data Mining. Martin Ester Database and Knowledge-Base Systems: Data Mining Martin Ester Simon Fraser University School of Computing Science Graduate Course Spring 2006 CMPT 843, SFU, Martin Ester, 1-06 1 Introduction [Fayyad, Piatetsky-Shapiro

More information

2019 PROJECT UPLIFT & UPLIFT PLUS PARTICIPANT APPLICATION

2019 PROJECT UPLIFT & UPLIFT PLUS PARTICIPANT APPLICATION Default Question Block 2019 PROJECT UPLIFT & UPLIFT PLUS PARTICIPANT APPLICATION *TO BEGIN THE APPLICATION, CLICK THE BLUE BUTTON AT THE BOTTOM OF THE PAGE* Thank you for your interest in the Project Uplift

More information

Central Valley School District Math Curriculum Map Grade 8. August - September

Central Valley School District Math Curriculum Map Grade 8. August - September August - September Decimals Add, subtract, multiply and/or divide decimals without a calculator (straight computation or word problems) Convert between fractions and decimals ( terminating or repeating

More information

Application of a Genetic Algorithm to Variable Selection in Fuzzy Clustering

Application of a Genetic Algorithm to Variable Selection in Fuzzy Clustering Application of a Genetic Algorithm to Variable Selection in Fuzzy Clustering Christian Röver and Gero Szepannek University of Dortmund Department of Statistics 44221 Dortmund, Germany roever@statistik.uni-dortmund.de

More information

ECLT 5810 Clustering

ECLT 5810 Clustering ECLT 5810 Clustering What is Cluster Analysis? Cluster: a collection of data objects Similar to one another within the same cluster Dissimilar to the objects in other clusters Cluster analysis Grouping

More information

Adaptive Crossover in Genetic Algorithms Using Statistics Mechanism

Adaptive Crossover in Genetic Algorithms Using Statistics Mechanism in Artificial Life VIII, Standish, Abbass, Bedau (eds)(mit Press) 2002. pp 182 185 1 Adaptive Crossover in Genetic Algorithms Using Statistics Mechanism Shengxiang Yang Department of Mathematics and Computer

More information

ANALYSIS COMPUTER SCIENCE Discovery Science, Volume 9, Number 20, April 3, Comparative Study of Classification Algorithms Using Data Mining

ANALYSIS COMPUTER SCIENCE Discovery Science, Volume 9, Number 20, April 3, Comparative Study of Classification Algorithms Using Data Mining ANALYSIS COMPUTER SCIENCE Discovery Science, Volume 9, Number 20, April 3, 2014 ISSN 2278 5485 EISSN 2278 5477 discovery Science Comparative Study of Classification Algorithms Using Data Mining Akhila

More information

A FUZZY LOGIC BASED METHOD FOR EDGE DETECTION

A FUZZY LOGIC BASED METHOD FOR EDGE DETECTION Bulletin of the Transilvania University of Braşov Series I: Engineering Sciences Vol. 4 (53) No. 1-2011 A FUZZY LOGIC BASED METHOD FOR EDGE DETECTION C. SULIMAN 1 C. BOLDIŞOR 1 R. BĂZĂVAN 2 F. MOLDOVEANU

More information

THE ASSOCIATION BETWEEN CELL PHONE RESTRICTIONS AND YOUNG DRIVERS SELF REPORTED CELL PHONE USE

THE ASSOCIATION BETWEEN CELL PHONE RESTRICTIONS AND YOUNG DRIVERS SELF REPORTED CELL PHONE USE THE ASSOCIATION BETWEEN CELL PHONE RESTRICTIONS AND YOUNG DRIVERS SELF REPORTED CELL PHONE USE SAVIR Conference June 5 7, 2013 Johnathon P. Ehsani, Jessamyn G. Perlus, Ashley Russell, Kaigang Li, & Bruce

More information

Updating Personal Information

Updating Personal Information University of Connecticut Updating Personal Information Overview Self-service users can view and update their personal information at any time via self-service pages in Core-CT. This information includes:

More information

CINAHL Plus with Full Text

CINAHL Plus with Full Text Library Services CINAHL Plus with Full Text CINAHL Plus with Full Text is a database which provides details of articles in over 1,200 English-language nursing and allied health journals published since

More information

Non Overlapping Communities

Non Overlapping Communities Non Overlapping Communities Davide Mottin, Konstantina Lazaridou HassoPlattner Institute Graph Mining course Winter Semester 2016 Acknowledgements Most of this lecture is taken from: http://web.stanford.edu/class/cs224w/slides

More information

Students Preferences for Receiving Communication from the University: A Report from the Student Life Survey

Students Preferences for Receiving Communication from the University: A Report from the Student Life Survey Preferences for Receiving Communication from the University: A Report from the Student Life Survey Center for the Study of Student Life March 2017 INTRODUCTION This report explores questions on the Student

More information

New Jersey economic issues poll April 5-14, 2018 Stockton Polling Institute Weighted frequencies

New Jersey economic issues poll April 5-14, 2018 Stockton Polling Institute Weighted frequencies New Jersey economic issues poll April 5-14, 2018 Stockton Polling Institute Weighted frequencies Q1. How would you rate the U.S. economy: Frequency Valid Valid Excellent 47 6.6 6.6 6.6 Good 302 42.1 42.1

More information

SAMPLE - DO NOT SUBMIT

SAMPLE - DO NOT SUBMIT PAGE 2: PERSONAL INFORMATION NAME FIRST/GIVEN NAME MIDDLE NAME LAST/FAMILY/SURNAME PREFERRED FIRST NAME SUFFIX (e.g., Jr., Sr., I, II, III) DATE OF BIRTH (MM/DD/YYYY) SEX FEMALE MALE 2019 PROJECT UPLIFT

More information

Wisconsin. Model Academic Standards. for Mathematics

Wisconsin. Model Academic Standards. for Mathematics 2004 correlated to the Wisconsin Model Academic Standards for Mathematics Grade 8 3/2003 2004 McDougal Littell Middle School Math, Course 1 2004 correlated to the Wisconsin Model Academic Standards for

More information

Relationship of Mobile Phone Usage and GPA among the Undergraduate Students at the University of the West Indies - Mona Campus

Relationship of Mobile Phone Usage and GPA among the Undergraduate Students at the University of the West Indies - Mona Campus Relationship of Mobile Phone Usage and GPA among the Undergraduate Students at the University of the West Indies - Mona Campus Tashfeen Ahmad Researcher, Faculty of Social Sciences The University of the

More information

Distribution-Free Models of Social and Information Networks

Distribution-Free Models of Social and Information Networks Distribution-Free Models of Social and Information Networks Tim Roughgarden (Stanford CS) joint work with Jacob Fox (Stanford Math), Rishi Gupta (Stanford CS), C. Seshadhri (UC Santa Cruz), Fan Wei (Stanford

More information

Finding and evaluating community structure in networks

Finding and evaluating community structure in networks PHYSICAL REVIEW E 69, 026113 2004 Finding and evaluating community structure in networks M. E. J. Newman 1,2 and M. Girvan 2,3 1 Department of Physics and Center for the Study of Complex Systems, University

More information

A Naïve Soft Computing based Approach for Gene Expression Data Analysis

A Naïve Soft Computing based Approach for Gene Expression Data Analysis Available online at www.sciencedirect.com Procedia Engineering 38 (2012 ) 2124 2128 International Conference on Modeling Optimization and Computing (ICMOC-2012) A Naïve Soft Computing based Approach for

More information