A Local Algorithm for Structure-Preserving Graph Cut

Size: px
Start display at page:

Download "A Local Algorithm for Structure-Preserving Graph Cut"

Transcription

1 A Local Algorithm for Structure-Preserving Graph Cut Presenter: Dawei Zhou Dawei Zhou* (ASU) Si Zhang (ASU) M. Yigit Yildirim (ASU) Scott Alcorn (Early Warning) Hanghang Tong (ASU) Hasan Davulcu (ASU) Jingrui He (ASU)

2 Outline Background Related Work Proposed HOSPLOC Algorithm Experiments Conclusion - 2 -

3 Networks Network is everywhere Internet Social Network Online Transaction

4 Applications Networks Dense subgraph detection Internet Social Network Online Transaction Emerging Trends Detection Community Detection Money Laundering Detection

5 High-order structures in practice Stars M. Wilson, and C. Nicholas, 2008, October. Topological analysis of an online social network for older adults. In Proceedings of the 2008 ACM workshop on Search in social media (pp ). ACM. - 5Z. -Li, F. Yan, and Y. Jiang, Cross-layers cascade in multiplex networks. Autonomous Agents and Multi- Agent Systems, 29(6), pp

6 High-order structures in practice Triangle C. Klymko, D. Gleich, and T.-G. Kolda Using triangles to improve community detection in directed networks. arxiv preprint arxiv: (2014). J. W. Berry, B. Hendrickson, R. A. LaViolette, and C. A. Phillips, Tolerating the community detection - 6resolution - limit with edge weighting. Physical Review E, 83(5), p

7 High-order structures in practice Loops K.-K. R. Choo Money laundering risks of prepaid stored value cards. Australian Institute of Criminology. (363) B. J. Kloostra, C. Dalvi. and B. N. Behm, Crowe Horwath Llp, System and method for analyzing and - 7dispositioning - money laundering suspicious activity alerts. U.S. Patent Application Arizona 12/258,784. State University

8 Problem definition 1 2 Given: Graph G = (V, E), user-defined structure N. 3 Find: Find a structure-rich dense subgraph that largely preserves the user-defined structures. Example: G Dense Subgraph with rich 3-node lines User-define structure 3-node line - 8 -

9 Outline Background Related Work Proposed HOSPLOC Algorithm Experiments Conclusion - 9 -

10 Related works High-order graph clustering Frequency based methods [Strogatz, 98; Milo et al., 2012; Vinayagama et al., 16] Spectral clustering based methods [Benson et al., 2015; Tsourakakis et al., 2017] Pros: Improving our understanding of highorder organization of complex systems. Co-clustering based methods [Wu et al., 2016] Cons: Expensive time complexity!

11 Related works Local graph clustering Vector based methods [Spielman and Teng, 2004; Andersen et al., 2006] Approximated methods [Gharan and Trevisan, 2012] Randomized methods [Spielman and Teng, 2013; Andersen et al., 2016] Max-Flow based methods [Andersen and Lang, 2008; Orecchia and Zhu, 2014] Spectral clustering based methods [Chung, 2007] Pros: Running time does not depend on the size of the given graph. Cons: Unable to explore the high-order network structures!

12 Local high-order graph clustering? With HOSPLOC you can!

13 Outline Background Related Work Proposed HOSPLOC Algorithm Experiments Conclusion

14 HOSPLOC HO: High-Order SP: Structure-Preserving LO: Local C: Cut

15 Evaluation metric k th -order conductance Definition 1. For any cluster C in graph G and the k th -order structure, the k th -order conductance Φ(C, N) is defined as Φ C, N = cut C, N min μ C, N, μ( ҧ C, N) The number of network structures broken due to the partition of G into C and Cҧ The number of network structures in C. The number of network structures in C. ҧ

16 Evaluation metric Example Cut C Φ C, N = cut C, N min μ C, N, μ( ҧ C, N) 2 nd -order conductance Φ C, N = 1 min{4,11} = 1/4 Graph G 3 rd -order conductance Φ C, N = 2 min{3,34} =

17 Problem definition 1 2 Given: Graph G = (V, E), user-defined structure 3 N, initial vertex v. Find: Local cluster C including or near v that largely preserves user-defined structures N. 4 C = argmin Φ C, N C V Example: 3 1 G 4 2 Dense Subgraph with rich 3-node line User-define structure 3-node line

18 High-order random walks (HRW) Adjacency tensor Definition 2. Given a graph G = (V, E), the k th -order network structure N on G could be represented in a k-dimensional adjacency tensor T as follows Example: For the set of nodes {2, 4, 1} T 1,4,2 = 0 G: N: For the set of nodes {6, 8, 10} T 10,8,6 =

19 High-order random walks (HRW) Transition tensor Definition 3. Given a graph G = V, E and the adjacency tensor T for the k th -order network structure N, the corresponding transition tensor P could be computed as Example: For the set of nodes {2, 4, 1} P 1,4,2 = 0 G: N: For the set of nodes {6, 8, 10} P 10,8,6 = 1/3-19 -

20 High-order random walks (HRW) k th -order Markov chain interpretation Each vertex represents an individual state Depends on the last k 1 states (nodes) P(i 1,, i k ) = Pr(S t+1 = i 1 S t = i 2,, S t k+2 = i k ) Example: For the set of nodes {2, 4, 1} P 1,4,2 = P r v 1 v 4, v 2 = 0 G: N: For the set of nodes {6, 8, 10} P 10,8,6 = P r v 10 v 6, v 8 = 1/3-20 -

21 High-order random walks (HRW) Using the rank-1 approximation [Li and NG, 2013], the high-order random walks can formulated as q t = Pq (t 1) q (t k+1) Truncated HRW distribution Truncation threshold ε Round small values to 0 to reduce computational load

22 Locally search high-order structure by HRW Vector based graph cut Locally conduct high-order random walks to explore the user-defined structure N. Compute the permutation π of the returned HRW distribution vector q such that: Iteratively check the potential cuts C 1, C 2,, C n 1, where C i = π 1,, π i

23 HOSPLOC Algorithm

24 HOSPLOC Algorithm 3-node line Initialization C = Truncated HRW Dense Subgraph with rich 3-node line Vector-based Graph Cut

25 Theoretical analysis Effectiveness Provide a lower bound of effectiveness. Theorem 4. Let C be a cluster on graph G such that Φ C, N 1 c 2 (l+2). If HOSPLOC runs with starting vertex v Ck,ξ and returns a nonempty set C, then we have μ C C 2 b 1, where b controls the minimum volume of returned cluster C. Efficiency Lemma 5. Given graph G and the k th -order network structure N k 3, the time complexity of HOSPLOC is bounded by O(t max 2 bk φ 2k log3k m)

26 Outline Background Related Work Proposed HOSPLOC Algorithm Experiments Conclusion

27 Data sets Category Network Type Nodes Edges Citation Author Undirected 61, ,074 Paper Undirected 62,602 10,904 Infrastructure Airline Undirected 2,833 15,204 Oregon Undirected 7,352 15,665 Power Undirected 4,941 13,188 Social Epinion Undirected 75, ,837 Review Rating Bipartite 8,724 90,962 Financial PII Multipartite

28 Comparison methods Local graph clustering Nibble [Andersen et al., 2006] Nibble-PageRank [Spielman and Teng, 2013] LS-OQC [Tsourakakis et al., 2013] HOSPLOC We proposed! Global graph clustering NMF [Ding et al., 2008] User-defined High-Order Structure TSC [Benson et al., 2015] Triangle

29 Effectiveness analysis Worse a Φ(C, edge) b Φ(C, triangle) Better Better c Triangle Desnsity Worse 29 -

30 Effectiveness analysis Worse 97% smaller Better Our methods a Φ(C, edge)

31 Effectiveness analysis Worse 12.2% smaller Better Our methods b Φ(C, triangle)

32 Effectiveness analysis Better 80% larger Worse Our methods c Triangle density

33 log t Scalability analysis Polylogarithmic! Linear! Running Time t s (a) The number of vertices v.s running time t log(μ(c)) (b) The lower bound of log(μ(c)) v.s log(t) * Nibble = A special case of HOPLOC

34 Parameter analysis Converge Faster! Worse Better (a) Φ(C, edge) v.s Conductance upper bound φ (b) Φ(C, triangle) v.s Conductance upper bound φ Lower Conductance!

35 Case study 1: Community detection Bipartite Graph 4 th Order Structure Graph Cut Movie Review Network User-defined Structure Miniature of the Identified Cluster Rich 4-node loop!

36 Case study 2: Synthetic ID detection Multipartite Graph 5 th Order Structure Graph Cut PII Network User-defined Structure Miniature of the Identified Cluster Rich 5-node star!

37 Outline Background Related Work Proposed HOSPLOC Algorithm Experiments Conclusion

38 Conclusion Goals: Detect structure-preserving dense subgraph Preserve Structure Time Complexity High-order Clustering Yes Expensive Local Clustering No Scalable HOSPLOC Yes Scalable Solutions: HOSPLOC algorithm Theoretical guaranteed effectiveness. Polylogarithmic time complexity w.r.t # edges. Results: Consistently better than 5 state-of-arts on 6 real networks. Scalability = Polylogarithmic complexity. Sensitivity = Stable. Case studies w.r.t. higher-order (>3) structures

39 - 39 -

A Local Algorithm for Structure-Preserving Graph Cut

A Local Algorithm for Structure-Preserving Graph Cut A Local Algorithm for Structure-Preserving Graph Cut Dawei Zhou dzhou23@asu.edu Scott Alcorn Early Warnings LLC. scott.alcorn@earlywarning.com Si Zhang szhan172@asu.edu Hanghang Tong hanghang.tong@asu.edu

More information

A Local Algorithm for Structure-Preserving Graph Cut

A Local Algorithm for Structure-Preserving Graph Cut A Local Algorithm for Structure-Preserving Graph Cut Dawei Zhou, Si Zhang, Mehmet Yigit Yildirim, Scott Alcorn, Hanghang Tong, Hasan Davulcu, Jingrui He, ABSTRACT Nowadays, large-scale graph data is being

More information

Parallel Local Graph Clustering

Parallel Local Graph Clustering Parallel Local Graph Clustering Kimon Fountoulakis, joint work with J. Shun, X. Cheng, F. Roosta-Khorasani, M. Mahoney, D. Gleich University of California Berkeley and Purdue University Based on J. Shun,

More information

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

CS246: Mining Massive Datasets Jure Leskovec, Stanford University CS246: Mining Massive Datasets Jure Leskovec, Stanford University http://cs246.stanford.edu HITS (Hypertext Induced Topic Selection) Is a measure of importance of pages or documents, similar to PageRank

More information

CS246: Mining Massive Datasets Jure Leskovec, Stanford University

CS246: Mining Massive Datasets Jure Leskovec, Stanford University CS246: Mining Massive Datasets Jure Leskovec, Stanford University http://cs246.stanford.edu SPAM FARMING 2/11/2013 Jure Leskovec, Stanford C246: Mining Massive Datasets 2 2/11/2013 Jure Leskovec, Stanford

More information

Local higher-order graph clustering

Local higher-order graph clustering Local higher-order graph clustering Hao Yin Stanford University yinh@stanford.edu Austin R. Benson Cornell University arb@cornell.edu Jure Leskovec Stanford University jure@cs.stanford.edu David F. Gleich

More information

Computing Heat Kernel Pagerank and a Local Clustering Algorithm

Computing Heat Kernel Pagerank and a Local Clustering Algorithm Computing Heat Kernel Pagerank and a Local Clustering Algorithm Fan Chung and Olivia Simpson University of California, San Diego La Jolla, CA 92093 {fan,osimpson}@ucsd.edu Abstract. Heat kernel pagerank

More information

Parallel Local Graph Clustering

Parallel Local Graph Clustering Parallel Local Graph Clustering Julian Shun Joint work with Farbod Roosta-Khorasani, Kimon Fountoulakis, and Michael W. Mahoney Work appeared in VLDB 2016 2 Metric for Cluster Quality Conductance = Number

More information

Local Partitioning using PageRank

Local Partitioning using PageRank Local Partitioning using PageRank Reid Andersen Fan Chung Kevin Lang UCSD, UCSD, Yahoo! What is a local partitioning algorithm? An algorithm for dividing a graph into two pieces. Instead of searching for

More information

Finding and Visualizing Graph Clusters Using PageRank Optimization. Fan Chung and Alexander Tsiatas, UCSD WAW 2010

Finding and Visualizing Graph Clusters Using PageRank Optimization. Fan Chung and Alexander Tsiatas, UCSD WAW 2010 Finding and Visualizing Graph Clusters Using PageRank Optimization Fan Chung and Alexander Tsiatas, UCSD WAW 2010 What is graph clustering? The division of a graph into several partitions. Clusters should

More information

Lecture 11: Clustering and the Spectral Partitioning Algorithm A note on randomized algorithm, Unbiased estimates

Lecture 11: Clustering and the Spectral Partitioning Algorithm A note on randomized algorithm, Unbiased estimates CSE 51: Design and Analysis of Algorithms I Spring 016 Lecture 11: Clustering and the Spectral Partitioning Algorithm Lecturer: Shayan Oveis Gharan May nd Scribe: Yueqi Sheng Disclaimer: These notes have

More information

Diffusion and Clustering on Large Graphs

Diffusion and Clustering on Large Graphs Diffusion and Clustering on Large Graphs Alexander Tsiatas Thesis Proposal / Advancement Exam 8 December 2011 Introduction Graphs are omnipresent in the real world both natural and man-made Examples of

More information

Diffusion and Clustering on Large Graphs

Diffusion and Clustering on Large Graphs Diffusion and Clustering on Large Graphs Alexander Tsiatas Final Defense 17 May 2012 Introduction Graphs are omnipresent in the real world both natural and man-made Examples of large graphs: The World

More information

Fast Nearest Neighbor Search on Large Time-Evolving Graphs

Fast Nearest Neighbor Search on Large Time-Evolving Graphs Fast Nearest Neighbor Search on Large Time-Evolving Graphs Leman Akoglu Srinivasan Parthasarathy Rohit Khandekar Vibhore Kumar Deepak Rajan Kun-Lung Wu Graphs are everywhere Leman Akoglu Fast Nearest Neighbor

More information

Graph Partitioning Algorithms

Graph Partitioning Algorithms Graph Partitioning Algorithms Leonid E. Zhukov School of Applied Mathematics and Information Science National Research University Higher School of Economics 03.03.2014 Leonid E. Zhukov (HSE) Lecture 8

More information

Non-exhaustive, Overlapping k-means

Non-exhaustive, Overlapping k-means Non-exhaustive, Overlapping k-means J. J. Whang, I. S. Dhilon, and D. F. Gleich Teresa Lebair University of Maryland, Baltimore County October 29th, 2015 Teresa Lebair UMBC 1/38 Outline Introduction NEO-K-Means

More information

Segmentation: Clustering, Graph Cut and EM

Segmentation: Clustering, Graph Cut and EM Segmentation: Clustering, Graph Cut and EM Ying Wu Electrical Engineering and Computer Science Northwestern University, Evanston, IL 60208 yingwu@northwestern.edu http://www.eecs.northwestern.edu/~yingwu

More information

The extendability of matchings in strongly regular graphs

The extendability of matchings in strongly regular graphs The extendability of matchings in strongly regular graphs Sebastian Cioabă Department of Mathematical Sciences University of Delaware Villanova, June 5, 2014 Introduction Matching A set of edges M of a

More information

Spectral Clustering X I AO ZE N G + E L HA M TA BA S SI CS E CL A S S P R ESENTATION MA RCH 1 6,

Spectral Clustering X I AO ZE N G + E L HA M TA BA S SI CS E CL A S S P R ESENTATION MA RCH 1 6, Spectral Clustering XIAO ZENG + ELHAM TABASSI CSE 902 CLASS PRESENTATION MARCH 16, 2017 1 Presentation based on 1. Von Luxburg, Ulrike. "A tutorial on spectral clustering." Statistics and computing 17.4

More information

Visual Representations for Machine Learning

Visual Representations for Machine Learning Visual Representations for Machine Learning Spectral Clustering and Channel Representations Lecture 1 Spectral Clustering: introduction and confusion Michael Felsberg Klas Nordberg The Spectral Clustering

More information

Local Graph Clustering and Applications to Graph Partitioning

Local Graph Clustering and Applications to Graph Partitioning Local Graph Clustering and Applications to Graph Partitioning Veronika Strnadova June 12, 2013 1 Introduction Today there exist innumerable applications which require the analysis of some type of large

More information

arxiv: v3 [cs.ds] 15 Dec 2016

arxiv: v3 [cs.ds] 15 Dec 2016 Computing Heat Kernel Pagerank and a Local Clustering Algorithm Fan Chung and Olivia Simpson University of California, San Diego La Jolla, CA 92093 {fan,_osimpson}@ucsd.edu arxiv:1503.03155v3 [cs.ds] 15

More information

Parallel Local Graph Clustering

Parallel Local Graph Clustering Parallel Local Graph Clustering Julian Shun Farbod Roosta-Khorasani Kimon Fountoulakis Michael W. Mahoney UC Berkeley International Computer Science Institute { jshun@eecs, farbod@icsi, kfount@icsi, mmahoney@stat

More information

MapReduce Algorithms. Barna Saha. March 28, 2016

MapReduce Algorithms. Barna Saha. March 28, 2016 MapReduce Algorithms Barna Saha March 28, 2016 Complexity Model for MapReduce Minimum Spanning Tree in MapReduce Computing Dense Subgraph in MapReduce Complexity Model for MapReduce:MRC i Input: finite

More information

Explore Co-clustering on Job Applications. Qingyun Wan SUNet ID:qywan

Explore Co-clustering on Job Applications. Qingyun Wan SUNet ID:qywan Explore Co-clustering on Job Applications Qingyun Wan SUNet ID:qywan 1 Introduction In the job marketplace, the supply side represents the job postings posted by job posters and the demand side presents

More information

Reduce and Aggregate: Similarity Ranking in Multi-Categorical Bipartite Graphs

Reduce and Aggregate: Similarity Ranking in Multi-Categorical Bipartite Graphs Reduce and Aggregate: Similarity Ranking in Multi-Categorical Bipartite Graphs Alessandro Epasto J. Feldman*, S. Lattanzi*, S. Leonardi, V. Mirrokni*. *Google Research Sapienza U. Rome Motivation Recommendation

More information

A Recommender System Based on Local Random Walks and Spectral Methods

A Recommender System Based on Local Random Walks and Spectral Methods A Recommender System Based on Local Random Walks and Spectral Methods ABSTRACT Zeinab Abbassi Department of Computer Science,UBC 201 2366 Main Mall Vancouver, Canada zeinab@cs.ubc.ca In this paper, we

More information

Similarity Ranking in Large- Scale Bipartite Graphs

Similarity Ranking in Large- Scale Bipartite Graphs Similarity Ranking in Large- Scale Bipartite Graphs Alessandro Epasto Brown University - 20 th March 2014 1 Joint work with J. Feldman, S. Lattanzi, S. Leonardi, V. Mirrokni [WWW, 2014] 2 AdWords Ads Ads

More information

Outline. Main Questions. What is the k-means method? What can we do when worst case analysis is too pessimistic?

Outline. Main Questions. What is the k-means method? What can we do when worst case analysis is too pessimistic? Outline Main Questions 1 Data Clustering What is the k-means method? 2 Smoothed Analysis What can we do when worst case analysis is too pessimistic? 3 Smoothed Analysis of k-means Method What is the smoothed

More information

Clustering: Classic Methods and Modern Views

Clustering: Classic Methods and Modern Views Clustering: Classic Methods and Modern Views Marina Meilă University of Washington mmp@stat.washington.edu June 22, 2015 Lorentz Center Workshop on Clusters, Games and Axioms Outline Paradigms for clustering

More information

CSCI-B609: A Theorist s Toolkit, Fall 2016 Sept. 6, Firstly let s consider a real world problem: community detection.

CSCI-B609: A Theorist s Toolkit, Fall 2016 Sept. 6, Firstly let s consider a real world problem: community detection. CSCI-B609: A Theorist s Toolkit, Fall 016 Sept. 6, 016 Lecture 03: The Sparsest Cut Problem and Cheeger s Inequality Lecturer: Yuan Zhou Scribe: Xuan Dong We will continue studying the spectral graph theory

More information

Latent Space Model for Road Networks to Predict Time-Varying Traffic. Presented by: Rob Fitzgerald Spring 2017

Latent Space Model for Road Networks to Predict Time-Varying Traffic. Presented by: Rob Fitzgerald Spring 2017 Latent Space Model for Road Networks to Predict Time-Varying Traffic Presented by: Rob Fitzgerald Spring 2017 Definition of Latent https://en.oxforddictionaries.com/definition/latent Latent Space Model?

More information

Prepaid Access MIDWEST ANTI-MONEY LAUNDERING CONFERENCE Federal Reserve Bank of Kansas City March 5, 2014

Prepaid Access MIDWEST ANTI-MONEY LAUNDERING CONFERENCE Federal Reserve Bank of Kansas City March 5, 2014 Prepaid Access 2014 MIDWEST ANTI-MONEY LAUNDERING CONFERENCE Federal Reserve Bank of Kansas City March 5, 2014 Discussion Points Emerging Technology Prepaid Access What is it and how does it work? Open

More information

Strategies for simulating pedestrian navigation with multiple reinforcement learning agents

Strategies for simulating pedestrian navigation with multiple reinforcement learning agents Strategies for simulating pedestrian navigation with multiple reinforcement learning agents Francisco Martinez-Gil, Miguel Lozano, Fernando Ferna ndez Presented by: Daniel Geschwender 9/29/2016 1 Overview

More information

Graph Exploration: Taking the User into the Loop

Graph Exploration: Taking the User into the Loop Graph Exploration: Taking the User into the Loop Davide Mottin, Anja Jentzsch, Emmanuel Müller Hasso Plattner Institute, Potsdam, Germany 2016/10/24 CIKM2016, Indianapolis, US Where we are Background (5

More information

Numerical Linear Algebra for Data and Link Analysis

Numerical Linear Algebra for Data and Link Analysis Numerical Linear Algebra for Data and Link Analysis Leonid Zhukov, Yahoo! Inc David Gleich, Stanford SIAM PP-2006, San Francisco, CA Scientific computing Engineering Computing Information Retrieval continuum

More information

Cooperative Computing for Autonomous Data Centers

Cooperative Computing for Autonomous Data Centers Cooperative Computing for Autonomous Data Centers Jared Saia (U. New Mexico) Joint with: Jon Berry, Cindy Phillips (Sandia Labs), Aaron Kearns (U. New Mexico) Outline 1) Bounding clustering coefficients

More information

Testing the Cluster Structure of Graphs Christian Sohler

Testing the Cluster Structure of Graphs Christian Sohler Testing the Cluster Structure of Graphs Christian Sohler Very Large Networks Examples Social networks The World Wide Web Cocitation graphs Coauthorship graphs Data size GigaByte upto TeraByte (only the

More information

A Comparison of Pattern-Based Spectral Clustering Algorithms in Directed Weighted Network

A Comparison of Pattern-Based Spectral Clustering Algorithms in Directed Weighted Network A Comparison of Pattern-Based Spectral Clustering Algorithms in Directed Weighted Network Sumuya Borjigin 1. School of Economics and Management, Inner Mongolia University, No.235 West College Road, Hohhot,

More information

Graphs and Network Flows IE411. Lecture 21. Dr. Ted Ralphs

Graphs and Network Flows IE411. Lecture 21. Dr. Ted Ralphs Graphs and Network Flows IE411 Lecture 21 Dr. Ted Ralphs IE411 Lecture 21 1 Combinatorial Optimization and Network Flows In general, most combinatorial optimization and integer programming problems are

More information

Scalable Influence Maximization in Social Networks under the Linear Threshold Model

Scalable Influence Maximization in Social Networks under the Linear Threshold Model Scalable Influence Maximization in Social Networks under the Linear Threshold Model Wei Chen Microsoft Research Asia Yifei Yuan Li Zhang In collaboration with University of Pennsylvania Microsoft Research

More information

Random graph models with fixed degree sequences: choices, consequences and irreducibilty proofs for sampling

Random graph models with fixed degree sequences: choices, consequences and irreducibilty proofs for sampling Random graph models with fixed degree sequences: choices, consequences and irreducibilty proofs for sampling Joel Nishimura 1, Bailey K Fosdick 2, Daniel B Larremore 3 and Johan Ugander 4 1 Arizona State

More information

Spectral Graph Sparsification: overview of theory and practical methods. Yiannis Koutis. University of Puerto Rico - Rio Piedras

Spectral Graph Sparsification: overview of theory and practical methods. Yiannis Koutis. University of Puerto Rico - Rio Piedras Spectral Graph Sparsification: overview of theory and practical methods Yiannis Koutis University of Puerto Rico - Rio Piedras Graph Sparsification or Sketching Compute a smaller graph that preserves some

More information

Sybil defenses via social networks

Sybil defenses via social networks Sybil defenses via social networks Abhishek University of Oslo, Norway 19/04/2012 1 / 24 Sybil identities Single user pretends many fake/sybil identities i.e., creating multiple accounts observed in real-world

More information

Cycles in Random Graphs

Cycles in Random Graphs Cycles in Random Graphs Valery Van Kerrebroeck Enzo Marinari, Guilhem Semerjian [Phys. Rev. E 75, 066708 (2007)] [J. Phys. Conf. Series 95, 012014 (2008)] Outline Introduction Statistical Mechanics Approach

More information

LINK GRAPH ANALYSIS FOR ADULT IMAGES CLASSIFICATION

LINK GRAPH ANALYSIS FOR ADULT IMAGES CLASSIFICATION LINK GRAPH ANALYSIS FOR ADULT IMAGES CLASSIFICATION Evgeny Kharitonov *, ***, Anton Slesarev *, ***, Ilya Muchnik **, ***, Fedor Romanenko ***, Dmitry Belyaev ***, Dmitry Kotlyarov *** * Moscow Institute

More information

Rate-optimal Binary Locally Repairable Codes with Joint Information Locality CSDL. Yonsei University, Korea ( jh.kim06, my.nam,

Rate-optimal Binary Locally Repairable Codes with Joint Information Locality CSDL. Yonsei University, Korea ( jh.kim06, my.nam, Rate-optimal Binary Locally Repairable Codes with Joint Information Locality CSDL Jung-Hyun Kim, Mi-Young Nam, and Hong-Yeop Song Yonsei University, Korea ( jh.kim6, my.nam, hysong@yonsei.ac.kr) 25 / /

More information

A Local Algorithm for Product Return Prediction in E-Commerce

A Local Algorithm for Product Return Prediction in E-Commerce A Local Algorithm for Product Return Prediction in E-Commerce Yada Zhu 1, Jianbo Li 2, Jingrui He 3, Brian L. Quanz 1, Ajay A. Deshpande 1 1 IBM Research, Yorktown Heights, NY 2 Three Bridges Capital,

More information

Perfect matchings in O(nlogn) time in regular bipartite graph

Perfect matchings in O(nlogn) time in regular bipartite graph Perfect matchings in O(nlogn) time in regular bipartite graphs Research project for computational optimization Presented by:qing Li April 26, 2011 Outline i.. ii.. iii.. iv.. What is d regular bipartite

More information

Mesh segmentation. Florent Lafarge Inria Sophia Antipolis - Mediterranee

Mesh segmentation. Florent Lafarge Inria Sophia Antipolis - Mediterranee Mesh segmentation Florent Lafarge Inria Sophia Antipolis - Mediterranee Outline What is mesh segmentation? M = {V,E,F} is a mesh S is either V, E or F (usually F) A Segmentation is a set of sub-meshes

More information

Laplacian Paradigm 2.0

Laplacian Paradigm 2.0 Laplacian Paradigm 2.0 8:40-9:10: Merging Continuous and Discrete(Richard Peng) 9:10-9:50: Beyond Laplacian Solvers (Aaron Sidford) 9:50-10:30: Approximate Gaussian Elimination (Sushant Sachdeva) 10:30-11:00:

More information

Toward the joint design of electronic and optical layer protection

Toward the joint design of electronic and optical layer protection Toward the joint design of electronic and optical layer protection Massachusetts Institute of Technology Slide 1 Slide 2 CHALLENGES: - SEAMLESS CONNECTIVITY - MULTI-MEDIA (FIBER,SATCOM,WIRELESS) - HETEROGENEOUS

More information

Subgraph Frequencies:

Subgraph Frequencies: Subgraph Frequencies: The Empirical and Extremal Geography of Large Graph Collections Johan Ugander, Lars Backstrom, Jon Kleinberg World Wide Web Conference May 16, 2013 Graph collections Neighborhoods:

More information

An Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization

An Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization An Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization Pedro Ribeiro (DCC/FCUP & CRACS/INESC-TEC) Part 1 Motivation and emergence of Network Science

More information

Dimension-Independent Data Structures for Arbitrary Simplicial Complexes

Dimension-Independent Data Structures for Arbitrary Simplicial Complexes Dimension-Independent Data Structures for Arbitrary Simplicial Complexes David Canino (canino.david@gmail.com) The Incidence Simplicial (IS) data structure The Generalized Indexed data structure with Adjacencies

More information

Temporal Graphs KRISHNAN PANAMALAI MURALI

Temporal Graphs KRISHNAN PANAMALAI MURALI Temporal Graphs KRISHNAN PANAMALAI MURALI METRICFORENSICS: A Multi-Level Approach for Mining Volatile Graphs Authors: Henderson, Eliassi-Rad, Faloutsos, Akoglu, Li, Maruhashi, Prakash and Tong. Published:

More information

Bilinear Programming

Bilinear Programming Bilinear Programming Artyom G. Nahapetyan Center for Applied Optimization Industrial and Systems Engineering Department University of Florida Gainesville, Florida 32611-6595 Email address: artyom@ufl.edu

More information

ROBERTO BATTITI, MAURO BRUNATO. The LION Way: Machine Learning plus Intelligent Optimization. LIONlab, University of Trento, Italy, Apr 2015

ROBERTO BATTITI, MAURO BRUNATO. The LION Way: Machine Learning plus Intelligent Optimization. LIONlab, University of Trento, Italy, Apr 2015 ROBERTO BATTITI, MAURO BRUNATO. The LION Way: Machine Learning plus Intelligent Optimization. LIONlab, University of Trento, Italy, Apr 2015 http://intelligentoptimization.org/lionbook Roberto Battiti

More information

Relative Centrality and Local Community Detection

Relative Centrality and Local Community Detection Under consideration for publication in Network Science 1 Relative Centrality and Local Community Detection Cheng-Shang Chang, Chih-Jung Chang, Wen-Ting Hsieh, Duan-Shin Lee, Li-Heng Liou, Institute of

More information

Regular decomposition of large graphs and other structures: scalability and robustness towards missing data

Regular decomposition of large graphs and other structures: scalability and robustness towards missing data VTT TECHNICAL RESEARCH CENTRE OF FINLAND LTD Regular decomposition of large graphs and other structures: scalability and robustness towards missing data Hannu Reittu (VTT, Finland) Joint work with Ilkka

More information

Single link clustering: 11/7: Lecture 18. Clustering Heuristics 1

Single link clustering: 11/7: Lecture 18. Clustering Heuristics 1 Graphs and Networks Page /7: Lecture 8. Clustering Heuristics Wednesday, November 8, 26 8:49 AM Today we will talk about clustering and partitioning in graphs, and sometimes in data sets. Partitioning

More information

Local Higher-Order Graph Clustering

Local Higher-Order Graph Clustering Local Higher-Order Graph Clustering ABSTRACT Hao Yin Stanford University yinh@stanford.edu Jure Leskovec Stanford University jure@cs.stanford.edu Local graph clustering methods aim to find a cluster of

More information

MI2LS: Multi-Instance Learning from Multiple Information Sources

MI2LS: Multi-Instance Learning from Multiple Information Sources MI2LS: Multi-Instance Learning from Multiple Information Sources Dan Zhang 1, Jingrui He 2, Richard Lawrence 3 1 Facebook Incorporation, Menlo Park, CA 2 Stevens Institute of Technology Hoboken, NJ 3 IBM

More information

TPA: Fast, Scalable, and Accurate Method for Approximate Random Walk with Restart on Billion Scale Graphs

TPA: Fast, Scalable, and Accurate Method for Approximate Random Walk with Restart on Billion Scale Graphs TPA: Fast, Scalable, and Accurate Method for Approximate Random Walk with Restart on Billion Scale Graphs Minji Yoon Seoul National University riin55@snu.ac.kr Jinhong Jung Seoul National University jinhongjung@snu.ac.kr

More information

CS 5220: Parallel Graph Algorithms. David Bindel

CS 5220: Parallel Graph Algorithms. David Bindel CS 5220: Parallel Graph Algorithms David Bindel 2017-11-14 1 Graphs Mathematically: G = (V, E) where E V V Convention: V = n and E = m May be directed or undirected May have weights w V : V R or w E :

More information

Latest on Linear Sketches for Large Graphs: Lots of Problems, Little Space, and Loads of Handwaving. Andrew McGregor University of Massachusetts

Latest on Linear Sketches for Large Graphs: Lots of Problems, Little Space, and Loads of Handwaving. Andrew McGregor University of Massachusetts Latest on Linear Sketches for Large Graphs: Lots of Problems, Little Space, and Loads of Handwaving Andrew McGregor University of Massachusetts Latest on Linear Sketches for Large Graphs: Lots of Problems,

More information

The Cyclic Cycle Complex of a Surface

The Cyclic Cycle Complex of a Surface The Cyclic Cycle Complex of a Surface Allen Hatcher A recent paper [BBM] by Bestvina, Bux, and Margalit contains a construction of a cell complex that gives a combinatorial model for the collection of

More information

Pregel. Ali Shah

Pregel. Ali Shah Pregel Ali Shah s9alshah@stud.uni-saarland.de 2 Outline Introduction Model of Computation Fundamentals of Pregel Program Implementation Applications Experiments Issues with Pregel 3 Outline Costs of Computation

More information

Clustering in Networks

Clustering in Networks Clustering in Networks (Spectral Clustering with the Graph Laplacian... a brief introduction) Tom Carter Computer Science CSU Stanislaus http://csustan.csustan.edu/ tom/clustering April 1, 2012 1 Our general

More information

Minimum-Spanning-Tree problem. Minimum Spanning Trees (Forests) Minimum-Spanning-Tree problem

Minimum-Spanning-Tree problem. Minimum Spanning Trees (Forests) Minimum-Spanning-Tree problem Minimum Spanning Trees (Forests) Given an undirected graph G=(V,E) with each edge e having a weight w(e) : Find a subgraph T of G of minimum total weight s.t. every pair of vertices connected in G are

More information

Implementation of Network Community Profile using Local Spectral algorithm and its application in Community Networking

Implementation of Network Community Profile using Local Spectral algorithm and its application in Community Networking Implementation of Network Community Profile using Local Spectral algorithm and its application in Community Networking Vaibhav VPrakash Department of Computer Science and Engineering, Sri Jayachamarajendra

More information

Absorbing Random walks Coverage

Absorbing Random walks Coverage DATA MINING LECTURE 3 Absorbing Random walks Coverage Random Walks on Graphs Random walk: Start from a node chosen uniformly at random with probability. n Pick one of the outgoing edges uniformly at random

More information

Advanced Data Management

Advanced Data Management Advanced Data Management Medha Atre Office: KD-219 atrem@cse.iitk.ac.in Sept 26, 2016 defined Given a graph G(V, E) with V as the set of nodes and E as the set of edges, a reachability query asks does

More information

COMP 4601 Hubs and Authorities

COMP 4601 Hubs and Authorities COMP 4601 Hubs and Authorities 1 Motivation PageRank gives a way to compute the value of a page given its position and connectivity w.r.t. the rest of the Web. Is it the only algorithm: No! It s just one

More information

Modeling and Detecting Community Hierarchies

Modeling and Detecting Community Hierarchies Modeling and Detecting Community Hierarchies Maria-Florina Balcan, Yingyu Liang Georgia Institute of Technology Age of Networks Massive amount of network data How to understand and utilize? Internet [1]

More information

Data-Intensive Computing with MapReduce

Data-Intensive Computing with MapReduce Data-Intensive Computing with MapReduce Session 5: Graph Processing Jimmy Lin University of Maryland Thursday, February 21, 2013 This work is licensed under a Creative Commons Attribution-Noncommercial-Share

More information

Design and Analysis of Algorithms (VII)

Design and Analysis of Algorithms (VII) Design and Analysis of Algorithms (VII) An Introduction to Randomized Algorithms Guoqiang Li School of Software, Shanghai Jiao Tong University Randomized Algorithms algorithms which employ a degree of

More information

Data mining --- mining graphs

Data mining --- mining graphs Data mining --- mining graphs University of South Florida Xiaoning Qian Today s Lecture 1. Complex networks 2. Graph representation for networks 3. Markov chain 4. Viral propagation 5. Google s PageRank

More information

Query Sugges*ons. Debapriyo Majumdar Information Retrieval Spring 2015 Indian Statistical Institute Kolkata

Query Sugges*ons. Debapriyo Majumdar Information Retrieval Spring 2015 Indian Statistical Institute Kolkata Query Sugges*ons Debapriyo Majumdar Information Retrieval Spring 2015 Indian Statistical Institute Kolkata Search engines User needs some information search engine tries to bridge this gap ssumption: the

More information

The application of Randomized HITS algorithm in the fund trading network

The application of Randomized HITS algorithm in the fund trading network The application of Randomized HITS algorithm in the fund trading network Xingyu Xu 1, Zhen Wang 1,Chunhe Tao 1,Haifeng He 1 1 The Third Research Institute of Ministry of Public Security,China Abstract.

More information

Shape Optimizing Load Balancing for Parallel Adaptive Numerical Simulations Using MPI

Shape Optimizing Load Balancing for Parallel Adaptive Numerical Simulations Using MPI Parallel Adaptive Institute of Theoretical Informatics Karlsruhe Institute of Technology (KIT) 10th DIMACS Challenge Workshop, Feb 13-14, 2012, Atlanta 1 Load Balancing by Repartitioning Application: Large

More information

Community detection. Leonid E. Zhukov

Community detection. Leonid E. Zhukov Community detection Leonid E. Zhukov School of Data Analysis and Artificial Intelligence Department of Computer Science National Research University Higher School of Economics Network Science Leonid E.

More information

Towards New Heterogeneous Data Stream Clustering based on Density

Towards New Heterogeneous Data Stream Clustering based on Density , pp.30-35 http://dx.doi.org/10.14257/astl.2015.83.07 Towards New Heterogeneous Data Stream Clustering based on Density Chen Jin-yin, He Hui-hao Zhejiang University of Technology, Hangzhou,310000 chenjinyin@zjut.edu.cn

More information

Voronoi Diagram. Xiao-Ming Fu

Voronoi Diagram. Xiao-Ming Fu Voronoi Diagram Xiao-Ming Fu Outlines Introduction Post Office Problem Voronoi Diagram Duality: Delaunay triangulation Centroidal Voronoi tessellations (CVT) Definition Applications Algorithms Outlines

More information

Link Analysis and Web Search

Link Analysis and Web Search Link Analysis and Web Search Moreno Marzolla Dip. di Informatica Scienza e Ingegneria (DISI) Università di Bologna http://www.moreno.marzolla.name/ based on material by prof. Bing Liu http://www.cs.uic.edu/~liub/webminingbook.html

More information

Multiagent Graph Coloring: Pareto Efficiency, Fairness and Individual Rationality By Yaad Blum, Jeffrey S. Rosenchein

Multiagent Graph Coloring: Pareto Efficiency, Fairness and Individual Rationality By Yaad Blum, Jeffrey S. Rosenchein Multiagent Graph Coloring: Pareto Efficiency, Fairness and Individual Rationality By Yaad Blum, Jeffrey S. Rosenchein Presented by: Simina Brânzei Outline Introduction Background Coloring from an Economics

More information

Seminar on Algorithms and Data Structures: Multiple-Edge-Fault-Tolerant Approximate Shortest-Path Trees [1]

Seminar on Algorithms and Data Structures: Multiple-Edge-Fault-Tolerant Approximate Shortest-Path Trees [1] Seminar on Algorithms and Data Structures: Multiple-Edge-Fault-Tolerant Approximate Shortest-Path Trees [1] Philipp Rimle April 14, 2018 1 Introduction Given a graph with positively real-weighted undirected

More information

A Note on Scheduling Parallel Unit Jobs on Hypercubes

A Note on Scheduling Parallel Unit Jobs on Hypercubes A Note on Scheduling Parallel Unit Jobs on Hypercubes Ondřej Zajíček Abstract We study the problem of scheduling independent unit-time parallel jobs on hypercubes. A parallel job has to be scheduled between

More information

Community Detection. Community

Community Detection. Community Community Detection Community In social sciences: Community is formed by individuals such that those within a group interact with each other more frequently than with those outside the group a.k.a. group,

More information

Navigating the Maze of Graph Analytics Frameworks using Massive Graph Datasets

Navigating the Maze of Graph Analytics Frameworks using Massive Graph Datasets Navigating the Maze of Graph Analytics Frameworks using Massive Graph Datasets Nadathur Satish, Narayanan Sundaram, Mostofa Ali Patwary, Jiwon Seo, Jongsoo Park, M. Amber Hassaan, Shubho Sengupta, Zhaoming

More information

Monochromatic Tree Partition for Complete. Multipartite Graphs

Monochromatic Tree Partition for Complete. Multipartite Graphs Int. J. Contemp. Math. Sciences, Vol. 6, 2011, no. 43, 2129-2134 Monochromatic Tree Partition for Complete Multipartite Graphs Shili Wen and Peipei Zhu Department of Mathematics Zhejiang Normal University

More information

Extremal Graph Theory. Ajit A. Diwan Department of Computer Science and Engineering, I. I. T. Bombay.

Extremal Graph Theory. Ajit A. Diwan Department of Computer Science and Engineering, I. I. T. Bombay. Extremal Graph Theory Ajit A. Diwan Department of Computer Science and Engineering, I. I. T. Bombay. Email: aad@cse.iitb.ac.in Basic Question Let H be a fixed graph. What is the maximum number of edges

More information

Direct Routing: Algorithms and Complexity

Direct Routing: Algorithms and Complexity Direct Routing: Algorithms and Complexity Costas Busch, RPI Malik Magdon-Ismail, RPI Marios Mavronicolas, Univ. Cyprus Paul Spirakis, Univ. Patras 1 Outline 1. Direct Routing. 2. Direct Routing is Hard.

More information

Statistical Physics of Community Detection

Statistical Physics of Community Detection Statistical Physics of Community Detection Keegan Go (keegango), Kenji Hata (khata) December 8, 2015 1 Introduction Community detection is a key problem in network science. Identifying communities, defined

More information

Algorithms for Grid Graphs in the MapReduce Model

Algorithms for Grid Graphs in the MapReduce Model University of Nebraska - Lincoln DigitalCommons@University of Nebraska - Lincoln Computer Science and Engineering: Theses, Dissertations, and Student Research Computer Science and Engineering, Department

More information

Triangle Graphs and Simple Trapezoid Graphs

Triangle Graphs and Simple Trapezoid Graphs JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 18, 467-473 (2002) Short Paper Triangle Graphs and Simple Trapezoid Graphs Department of Computer Science and Information Management Providence University

More information

Mathematical Methods and Computational Algorithms for Complex Networks. Benard Abola

Mathematical Methods and Computational Algorithms for Complex Networks. Benard Abola Mathematical Methods and Computational Algorithms for Complex Networks Benard Abola Division of Applied Mathematics, Mälardalen University Department of Mathematics, Makerere University Second Network

More information

Big Data Management and NoSQL Databases

Big Data Management and NoSQL Databases NDBI040 Big Data Management and NoSQL Databases Lecture 10. Graph databases Doc. RNDr. Irena Holubova, Ph.D. holubova@ksi.mff.cuni.cz http://www.ksi.mff.cuni.cz/~holubova/ndbi040/ Graph Databases Basic

More information

Answer Set Programming or Hypercleaning: Where does the Magic Lie in Solving Maximum Quartet Consistency?

Answer Set Programming or Hypercleaning: Where does the Magic Lie in Solving Maximum Quartet Consistency? Answer Set Programming or Hypercleaning: Where does the Magic Lie in Solving Maximum Quartet Consistency? Fathiyeh Faghih and Daniel G. Brown David R. Cheriton School of Computer Science, University of

More information

Graphs / Networks. CSE 6242/ CX 4242 Feb 18, Centrality measures, algorithms, interactive applications. Duen Horng (Polo) Chau Georgia Tech

Graphs / Networks. CSE 6242/ CX 4242 Feb 18, Centrality measures, algorithms, interactive applications. Duen Horng (Polo) Chau Georgia Tech CSE 6242/ CX 4242 Feb 18, 2014 Graphs / Networks Centrality measures, algorithms, interactive applications Duen Horng (Polo) Chau Georgia Tech Partly based on materials by Professors Guy Lebanon, Jeffrey

More information