Dynamic and Historical Shortest-Path Distance Queries on Large Evolving Networks by Pruned Landmark Labeling
|
|
- Neal Beasley
- 5 years ago
- Views:
Transcription
1 WWW 14 Dynamic and Historical Shortest-Path Distance Queries on Large Evolving Networks by Pruned Landmark Labeling Takuya Akiba (U Tokyo) Yoichi Iwata (U Tokyo) Yuichi Yoshida (NII & PFI)
2 Distance on Graphs Context-Aware Search [Ukkonen+,CIKM 08][Potamias+,CIKM 09] Socially-Sensitive Search [Vieira+,CIKM 07][Yahia+,VLDB 08][Maniu+,CIKM 13] Social Network Analysis Biological Analysis Route Navigation
3 Traditional Static Distance Querying Given a static graph G = V, E 1. construct an index to 2. answer distance d G s, t s t Goal: Good trade-off Scalability Indexing time Index size Query Performance Query time (Precision)
4 Previous Static Methods Exact Methods Approx. Methods 2-Hop Cover [Cohen+,SODA 02] Landmark-based methods [Potamias+, CIKM 09] 2-Hop Cover [Cheng+,EDBT 09] Distance Sketch [Sarma+, WSDM 10] TD-based [Wei, SIGMOD 10] Highway 2-Hop [Jin+, SIGMOD 12] Path Sketch [Gubichev+, CIKM 10] TD-based [Akiba+, EDBT 12] Hierarchical HL [Abraham+,ESA 12] Landmark-based method + LCA, Shortcuts, [Qiao+, ICDE 12] Pruned Landmark Labeling [Akiba-Iwata-Yoshida, SIGMOD 13]
5 Real Networks are Dynamic and Evolving Online Social Networks: New friends Microblogs: New replies, new retweets Collaboration networks: New papers, etc.
6 Contributions Pruned Landmark Labeling [SIGMOD 13] State-of-the-art static exact method Contribution 1 Dynamic Pruned Landmark Labeling Contribution 2 Historical Pruned Landmark Labeling
7 Contribution 1: +Dynamic Update Dynamic Pruned Landmark Labeling Graph Indexing Index Incremental Update In milliseconds Change New edge Index Update
8 Distance Time Contribution 2: +Historical Queries Historical Pruned Landmark Labeling Index Query: transition of distance Time
9 Applications What will be made possible?
10 Real-time Network-aware Search B C New Edge A D A B Search Result Instantly Search Result A B C D
11 Network Evolution Analysis Ego Network of v (subgraph induced by v and neighbors) Distance to v v Real example using our method from Facebook subgraph
12 Network Evolution Analysis Ego Network of v (subgraph induced by v and neighbors) Distance to v v Cluster 1 Cluster 2 Real example using our method from Facebook subgraph
13 Network Evolution Analysis Ego Network of v (subgraph induced by v and neighbors) Distance to v v Cluster 1 Cluster 2 Cluster 1 Cluster 2 Real example using our method from Facebook subgraph
14 Network Evolution Analysis Average Distance Effective Diameter Closeness Centrality Temporal Hop Plot
15 Limitation: Graph Update Types Supported updates: Vertex additions and Edge additions Why no removals?
16 Why No Removals? Application Side 1. Additions are much more frequent than removals 2. There are networks even with no removals such as communication networks and collaboration networks 3. We can combine with periodical reconstruction to also reflect removals in, say, hours
17 Why No Removals? Technical Side Static Methods Efficiency: Fast and scalable Update: N/A Efficiency = Index size, indexing time, query time, etc Better Actually, also better! Dynamic PLL (this work) Efficiency: Comparable with static methods Update: Additions Fully Dynamic Methods Efficiency: Poor (Large index, low scalability) Update: Additions and removals Because removals are much more harder, methods would be too complex.
18 Contribution 1: Dynamic Pruned Landmark Labeling
19 Outline 1. Indexing Framework (=Data structure and query algorithm) 2-Hop Cover [Cohen+,SODA 02] 2. Offline Index Construction Pruned Labeling [SIGMOD 13] 3. Online Index Update Resumed PBFS & Prefixal Queries
20 Assumption Undirected Unweighted (We can easily obtain directed and/or weighted version)
21 Contribution 1: Dynamic Pruned Landmark Labeling Index Framework
22 2-Hop Labeling: Index Data Structure Commonly used framework (= Data Structure + Query Algo.) [Cohen+ 02], [Cheng+ 09], [Jin+ 12], [Abraham+ 12] and ours Index: label L v = l 1, δ 1, l 2, δ 2, v δ 1 δ 2 l 1 l 2 l i V, δ i = d G v, l i δ 3 l 3 Example L(1): L(2): L 3 : Vertex Distance Vertex Distance Vertex Distance d G 1,10 = 5
23 2-Hop Labeling: Query Algorithm Query: d G (s, t) = min l L s L(t) d G s, l + d G l, t Paths through common vertices L(1): Example Vertex Distance s t L 3 : Vertex Distance Distance between vertex 1 and 3: : = 7 Answer min{6, 7} = : = 6
24 2-Hop Labeling: Challenge Challenge: Computing (and maintaining) labels Correctness (Exactness) Sizes of labels (Index Size & Query Time) Efficiency (Scalability)
25 Contribution 1: Dynamic Pruned Landmark Labeling Offline Index Construction
26 Our Approach 1. Naïve Landmark Labeling Conduct a BFS from every vertex 2. Pruned Landmark Labeling Pruning during BFSs
27 Naïve Landmark Labeling (w/o pruning) 1. L 0 an empty index 2. For each vertex v 1, v 2,, v n Conduct a BFS from v i Label all the visited vertices L i u = L i 1 u v i, d G u, v i BFS from v 1 BFS from v 2 BFS from v 3 BFS from v n L 0 L 1 L 2 (n times ) L n Empty Result
28 Naïve Landmark Labeling (w/o pruning) After a BFS from 1 L 1 (1): L 1 (2): L 1 (3): Vertex 1 Distance 0 Vertex 1 Distance 2 Vertex 1 Distance After a BFS from 2 L 2 (1): L 2 (2): L 2 (3): Vertex 1 2 Distance 0 2 Vertex 1 2 Distance 2 0 Vertex 1 2 Distance 2 3
29 Naïve Landmark Labeling (w/o pruning) 1. L 0 an empty index 2. For each vertex v 1, v 2,, v n Conduct a BFS from v i Label all the visited vertices L i u = L i 1 u v i, d G u, v i Θ(nm) preprocessing time, Θ n 2 Inpractical! space
30 Pruned Landmark Labeling 1. L 0 an empty index 2. For each vertex v 1, v 2,, v n Conduct a pruned BFS from v i Label all the visited vertices L i u = L i 1 u v i, d G u, v i PBFS from v 1 PBFS from v 2 PBFS from v 3 PBFS from v n L 0 L 1 L 2 (n times ) L n Empty Result
31 Pruned BFS Pruned BFS from v i v i Distance δ u Current incomplete index L i 1 If QUERY v i, u, L i 1 We do not label u this time We do not traverse edges from u δ Prune u
32 Example First BFS from vertex 1 L 1 (1): L 1 (2): Vertex 1 Distance 0 Vertex 1 Distance 2 L 1 (6): Vertex 1 Distance 1 Second BFS from vertex 2 QUERY 2, 6, L 1 = = 3 = d(2,6) Vertex 6 is pruned.
33 Example The search space gets smaller and smaller
34 Theorems: Correctness Theorem 4.1 for any s, t and i. QUERY s, t, L i = QUERY s, t, L i Naïve: L 0 L 1 L 2 L n Equal through function QUERY Pruned: L 0 L 1 L 2 L n Empty Final
35 Theorems: Correctness Theorem 4.1 for any s, t and i. QUERY s, t, L i = QUERY s, t, L i Corollary 4.1 (Correctness) for any s, t QUERY s, t, L n = d G s, t i.e., our method is exact.
36 Performance in Real Networks We conduct BFSs from vertices with higher degree We can exploit the structures: Hubs [Theorem 4.3, SIGMOD 13] Core-fringe structure [Theorem 4.4, SIGMOD 13]
37 Contribution 1: Dynamic Pruned Landmark Labeling Online Index Update
38 Problem Setting Vertex addition: trivial Insert an isolated vertex Add edges Therefore, we concentrate on edge addition.
39 New edge: (u, v) Update of Naïve Labels For each vertex v 1, v 2,, v n Resume the BFS rooted at v i Update the labels
40 Example New edge Distance from the root on the last snapshot 2 Check and update the distance of endpoints Resume the BFS Not visited
41 New edge: (u, v) Update for Pruned Labels? For each vertex v 1, v 2,, v n Resume a pruned BFS rooted from v i Update the labels Too inefficient since it takes Ω(n) time
42 New edge: (u, v) Update for Pruned Labels? For each vertex v i? Resume a pruned BFS rooted from v i Update the labels
43 New edge: (u, v) Update for Pruned Labels For each vertex v i L v L(u) Resume a pruned BFS rooted from v i Update the labels Why? Pruned before endpoints still no need to traverse (The proof is in the paper)
44 Contribution 2: Historical Pruned Landmark Labeling
45 Problem Definition Offline Indexing Given: Graph timestamp for each vertex and edge Describing the time when each edge (or vertex) appeared
46 Problem Definition [Query 1] Snapshot Query Given: u, v and time τ Answer: d τ (u, v) [Query 2] Change-point Query Given: u, v Answer: {τ 1, τ 2, } and distances Distance at time τ Moments that distance d u, v has changed
47 Contribution 2: Historical Pruned Landmark Labeling Index Framework
48 Historical 2-Hop Labeling: Data Structure Normal 2-Hop Label L(1): Vertex Distance d G 1,10 = 5 Historical 2-Hop Label L(1): Vertex Distance Time d 1, 7 2 if time 3 Since there is no edge deletion, distance only decreases
49 Historical 2-Hop Labeling: Data Structure Normal 2-Hop Label L(1): Vertex Distance d G 1,10 = 5 Historical 2-Hop Label L(1): Vertex Distance Time d 1, 7 2 if time 3 Both queries in (almost) linear time, i.e. O L s + L t time
50 Contribution 2: Historical Pruned Landmark Labeling Offline Index Construction
51 Index Construction Algorithm Standard 2-Hop (Dynamic) 1. Naïve Labeling 2. Pruned Labeling Historical 2-Hop 1. Naïve Labeling 2. Pruned Labeling
52 Naïve Historical Labeling (w/o Pruning) 1. Start from an empty index L 0 2. For each vertex v 1, v 2,, v n Conduct a BFS from v i Add pairs to all the labels 3. L n is the final index Already non-trivial since networks are temporal
53 Dynamic Programming Let s be the source vertex. d: integer(distance), v V T[d][v] = least time that d G s, v d Initialization T 0 s = 0, T 0 v = v s Step T d + 1 v = min {max T d w, t(v, w) } w N v v t v, w edge creation time t v, v 0 Θ(Dm) time
54 BFS-like Dynamic Programming Edges from the vertices whose value did not change in the previous step No need to see in the next step To avoid vain edge check 1. Prepare a queue 2. Push a vertex when its value is changed 3. Only see edges from the vertices in the queue
55 Pruned Historical Labeling 1. Start from an empty index L 0 2. For each vertex v 1, v 2,, v n Conduct a pruned BFS-like DP from v i Add pairs to all the labels 3. L n is the final index
56 Suppose we are Pruned BFS from v i visiting u with distance δ and time t If QUERY v i, u, t, L i 1 We do not label u this time δ Prune u We do not traverse edges from u
57 Online Incremental Update Can be done similarly to the Dynamic PLL
58 Experimental Evaluation
59 Indexing Time (sec) Dynamic PLL: Indexing Time > 1 day > 1 day > 1 day Environment: Xeon X5670, 48GB Linux, g TD [EDBT 12] IS-Label [VLDB 13] DPLL 1 Epinion Enron P2P YouTube Wikipedia More scalable than other static methods
60 MB μs Dynamic PLL:Index Size and Query Time Index Size Query Time Epinions Enron Epinions Enron TD [EDBT 12] IS-Label [VLDB 13] DPLL Comparable Also Better Environment: Xeon X5670, 48GB Linux, g++
61 Time (ms) Dynamic PLL: Update Time Environment: Xeon X5670, 48GB Linux, g Index Rebuild Incremental Update 0.1 Epinion Enron P2P YouTube Wikipedia Update can be done in milliseconds
62 μs ms sec MB Historical PLL (in comparison with Dynamic PLL) Indexing Time Index Size DPLL HPLL DPLL HPLL 1 Epinions Enron 1 Epinions Enron Query Time Update Time 10 DPLL Epinions Enron HPLL (snapshot) HPLL (change-point) 0.1 Epinions Enron DPLL HPLL
63 μs ms sec MB Historical PLL (in comparison with Dynamic PLL) Indexing Time Index Size DPLL HPLL DPLL HPLL 1 Epinions Enron Epinions Enron Just slightly worse Query Timethan Dynamic PLL Update Time 1 10 DPLL Epinions Enron HPLL (snapshot) HPLL (change-point) 0.1 Epinions Enron DPLL HPLL
64 μs ms sec MB Historical PLL (in comparison with Dynamic PLL) Indexing Time Index Size DPLL HPLL DPLL HPLL 1 Epinions Enron 1 Epinions Enron Query Time Update Time 10 DPLL Epinions Enron HPLL (snapshot) HPLL (change-point) 0.1 Epinions Enron DPLL HPLL
65 Summary Distance queries on graphs Based on Pruned Landmark Labeling (PLL) Contribution 1: Dynamic PLL Applications: Real-time network-aware search Contribution 2: Historical PLL Applications: Network evolution analysis
Efficient Top-k Shortest-Path Distance Queries on Large Networks by Pruned Landmark Labeling with Application to Network Structure Prediction
Efficient Top-k Shortest-Path Distance Queries on Large Networks by Pruned Landmark Labeling with Application to Network Structure Prediction Takuya Akiba U Tokyo Takanori Hayashi U Tokyo Nozomi Nori Kyoto
More informationI/O Efficient Algorithms for Exact Distance Queries on Disk- Resident Dynamic Graphs
I/O Efficient Algorithms for Exact Distance Queries on Disk- Resident Dynamic Graphs Yishi Lin, Xiaowei Chen, John C.S. Lui The Chinese University of Hong Kong 9/4/15 EXACT DISTANCE QUERIES ON DYNAMIC
More informationApproximate Shortest Distance Computing: A Query-Dependent Local Landmark Scheme
Approximate Shortest Distance Computing: A Query-Dependent Local Landmark Scheme Miao Qiao, Hong Cheng, Lijun Chang and Jeffrey Xu Yu The Chinese University of Hong Kong {mqiao, hcheng, ljchang, yu}@secuhkeduhk
More informationQuerying Shortest Distance on Large Graphs
.. Miao Qiao, Hong Cheng, Lijun Chang and Jeffrey Xu Yu Department of Systems Engineering & Engineering Management The Chinese University of Hong Kong October 19, 2011 Roadmap Preliminary Related Work
More informationShortest paths on large graphs: Systems, Algorithms, Applications
Shortest paths on large graphs: Systems, Algorithms, Applications Andrey Gubichev TU München January 2012 Andrey Gubichev Shortest paths on large graphs 1 / 53 Outline Introduction Systems Algorithms Applications
More informationHighway Dimension and Provably Efficient Shortest Paths Algorithms
Highway Dimension and Provably Efficient Shortest Paths Algorithms Andrew V. Goldberg Microsoft Research Silicon Valley www.research.microsoft.com/ goldberg/ Joint with Ittai Abraham, Amos Fiat, and Renato
More informationarxiv: v5 [cs.db] 29 Mar 2016
Effective Keyword Search in Graphs arxiv:52.6395v5 [cs.db] 29 Mar 26 n 5 n n 3 n 4 n 6 n 2 (a) n : n 2 : Laurence Fishburne n 3 : Birth Date (info_type) n 4 : The Matrix ABSTRACT n 7 n 8 Mehdi Kargar n,
More informationCS4800: Algorithms & Data Jonathan Ullman
CS4800: Algorithms & Data Jonathan Ullman Lecture 11: Graphs Graph Traversals: BFS Feb 16, 2018 What s Next What s Next Graph Algorithms: Graphs: Key Definitions, Properties, Representations Exploring
More informationAdvanced Data Management
Advanced Data Management Medha Atre Office: KD-219 atrem@cse.iitk.ac.in Sept 26, 2016 defined Given a graph G(V, E) with V as the set of nodes and E as the set of edges, a reachability query asks does
More informationExperimental Study on Speed-Up Techniques for Timetable Information Systems
Experimental Study on Speed-Up Techniques for Timetable Information Systems Reinhard Bauer (rbauer@ira.uka.de) Daniel Delling (delling@ira.uka.de) Dorothea Wagner (wagner@ira.uka.de) ITI Wagner, Universität
More informationAn Experimental Study on Hub Labeling based Shortest Path Algorithms
An Experimental Study on Hub Labeling based Shortest Path Algorithms Ye Li #1 Leong Hou U #2 Man Lung Yiu 3 Ngai Meng Kou #4 # Department of Computer and Information Science, University of Macau 1 yb47438@umac.mo
More informationMemory-Efficient Fast Shortest Path Estimation in Large Social Networks
Proceedings of the Eighth International AAAI Conference on Weblogs and Social Media Memory-Efficient Fast Shortest Path Estimation in Large Social Networks Volodymyr Floreskul and Konstantin Tretyakov
More informationComputing A Near-Maximum Independent Set in Linear Time by Reducing-Peeling
Computing A Near-Maximum Independent Set in Linear Time by Reducing-Peeling Computer Science and Engineering Lijun Chang University of New South Wales, Australia Lijun.Chang@unsw.edu.au Joint work with
More informationEdge Classification in Networks
Charu C. Aggarwal, Peixiang Zhao, and Gewen He Florida State University IBM T J Watson Research Center Edge Classification in Networks ICDE Conference, 2016 Introduction We consider in this paper the edge
More informationReach for A : an Efficient Point-to-Point Shortest Path Algorithm
Reach for A : an Efficient Point-to-Point Shortest Path Algorithm Andrew V. Goldberg Microsoft Research Silicon Valley www.research.microsoft.com/ goldberg/ Joint with Haim Kaplan and Renato Werneck Einstein
More informationAlgorithm Design and Analysis
Algorithm Design and Analysis LECTURE 3 Data Structures Graphs Traversals Strongly connected components Sofya Raskhodnikova L3.1 Measuring Running Time Focus on scalability: parameterize the running time
More informationCS246: Mining Massive Datasets Jure Leskovec, Stanford University
CS246: Mining Massive Datasets Jure Leskovec, Stanford University http://cs246.stanford.edu HITS (Hypertext Induced Topic Selection) Is a measure of importance of pages or documents, similar to PageRank
More informationLarge Scale Graph Algorithms
Large Scale Graph Algorithms A Guide to Web Research: Lecture 2 Yury Lifshits Steklov Institute of Mathematics at St.Petersburg Stuttgart, Spring 2007 1 / 34 Talk Objective To pose an abstract computational
More informationSociaLite: A Datalog-based Language for
SociaLite: A Datalog-based Language for Large-Scale Graph Analysis Jiwon Seo M OBIS OCIAL RESEARCH GROUP Overview Overview! SociaLite: language for large-scale graph analysis! Extensions to Datalog! Compiler
More informationThe exact distance to destination in undirected world
The VLDB Journal (2012) 21:869 888 DOI 10.1007/s00778-012-0274-x REGULAR PAPER The exact distance to destination in undirected world Lijun Chang Jeffrey Xu Yu Lu Qin Hong Cheng Miao Qiao Received: 8 August
More informationOn Graph Query Optimization in Large Networks
On Graph Query Optimization in Large Networks Peixiang Zhao, Jiawei Han Department of omputer Science University of Illinois at Urbana-hampaign pzhao4@illinois.edu, hanj@cs.uiuc.edu September 14th, 2010
More informationParallel Pruned Landmark Labeling for Shortest Path Queries on Unit-Weight Networks
Parallel Pruned Landmark Labeling for Shortest Path Queries on Unit-Weight Networks Bachelor Thesis of Damir Ferizovic At the Department of Informatics Institute of Theoretical Informatics Reviewer: Second
More informationStrong Bridges and Strong Articulation Points of Directed Graphs
Strong Bridges and Strong Articulation Points of Directed Graphs Giuseppe F. Italiano Univ. of Rome Tor Vergata Based on joint work with Donatella Firmani, Luigi Laura, Alessio Orlandi and Federico Santaroni
More informationLecture 2 - Graph Theory Fundamentals - Reachability and Exploration 1
CME 305: Discrete Mathematics and Algorithms Instructor: Professor Aaron Sidford (sidford@stanford.edu) January 11, 2018 Lecture 2 - Graph Theory Fundamentals - Reachability and Exploration 1 In this lecture
More informationAlgorithm Design and Analysis
Algorithm Design and Analysis LECTURE 5 Exploring graphs Adam Smith 9/5/2008 A. Smith; based on slides by E. Demaine, C. Leiserson, S. Raskhodnikova, K. Wayne Puzzles Suppose an undirected graph G is connected.
More informationPoint-to-Point Shortest Path Algorithms with Preprocessing
Point-to-Point Shortest Path Algorithms with Preprocessing Andrew V. Goldberg Microsoft Research Silicon Valley www.research.microsoft.com/ goldberg/ Joint work with Chris Harrelson, Haim Kaplan, and Retato
More informationAlgorithms and Theory of Computation. Lecture 3: Graph Algorithms
Algorithms and Theory of Computation Lecture 3: Graph Algorithms Xiaohui Bei MAS 714 August 20, 2018 Nanyang Technological University MAS 714 August 20, 2018 1 / 18 Connectivity In a undirected graph G
More informationGraph Data Management
Graph Data Management Analysis and Optimization of Graph Data Frameworks presented by Fynn Leitow Overview 1) Introduction a) Motivation b) Application for big data 2) Choice of algorithms 3) Choice of
More informationGraph Symmetry and Social Network Anonymization
Graph Symmetry and Social Network Anonymization Yanghua XIAO ( 肖仰华 ) School of computer science Fudan University For more information, please visit http://gdm.fudan.edu.cn Graph isomorphism determination
More informationGIVEN a large graph and a query node, finding its k-
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL., NO., 2016 1 Efficient and Exact Local Search for Random Walk Based Top-K Proximity Query in Large Graphs Yubao Wu, Ruoming Jin, and Xiang Zhang
More informationAlgorithm Engineering for Route Planning: An Update International Symposium on Algorithms and Computation 2011 Dorothea Wagner December 6, 2011
Algorithm Engineering for Route Planning: An Update International Symposium on Algorithms and Computation 2011 Dorothea Wagner December 6, 2011 FACULTY FOR INFORMATICS INSTITUTE OF THEORETICAL INFORMATICS
More informationCS61BL. Lecture 5: Graphs Sorting
CS61BL Lecture 5: Graphs Sorting Graphs Graphs Edge Vertex Graphs (Undirected) Graphs (Directed) Graphs (Multigraph) Graphs (Acyclic) Graphs (Cyclic) Graphs (Connected) Graphs (Disconnected) Graphs (Unweighted)
More informationDynamic Graph Query Support for SDN Management. Ramya Raghavendra IBM TJ Watson Research Center
Dynamic Graph Query Support for SDN Management Ramya Raghavendra IBM TJ Watson Research Center Roadmap SDN scenario 1: Cloud provisioning Management/Analytics primitives Current Cloud Offerings Limited
More informationBiology, Physics, Mathematics, Sociology, Engineering, Computer Science, Etc
Motivation Motifs Algorithms G-Tries Parallelism Complex Networks Networks are ubiquitous! Biology, Physics, Mathematics, Sociology, Engineering, Computer Science, Etc Images: UK Highways Agency, Uriel
More informationKeyLabel Algorithms for Keyword Search in Large Graphs
KeyLabel Algorithms for Keyword Search in Large Graphs Yue Wang, Ke Wang, Ada Wai-Chee Fu, and Raymond Chi-Wing Wong School of Computing Science, Simon Fraser University Email: {ywa138, wangk }@cs.sfu.ca
More informationSub-Graph Finding Information over Nebula Networks
ISSN (e): 2250 3005 Volume, 05 Issue, 10 October 2015 International Journal of Computational Engineering Research (IJCER) Sub-Graph Finding Information over Nebula Networks K.Eswara Rao $1, A.NagaBhushana
More informationFast Fully Dynamic Landmark-based Estimation of Shortest Path Distances in Very Large Graphs
Fast Fully Dynamic Landmark-based Estimation of Shortest Path Distances in Very Large Graphs Konstantin Tretyakov University of Tartu Estonia kt@ut.ee Jaak Vilo University of Tartu Estonia jaak.vilo@ut.ee
More informationEfficient Aggregation for Graph Summarization
Efficient Aggregation for Graph Summarization Yuanyuan Tian (University of Michigan) Richard A. Hankins (Nokia Research Center) Jignesh M. Patel (University of Michigan) Motivation Graphs are everywhere
More informationEfficient Top-k Shortest-Path Distance Queries on Large Networks by Pruned Landmark Labeling
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence Efficient Top-k Shortest-Path Distance Queries on Large Networks by Pruned Landmark Labeling Takuya Akiba, Takanori Hayashi, Nozomi
More informationLarge Scale Complex Network Analysis using the Hybrid Combination of a MapReduce Cluster and a Highly Multithreaded System
Large Scale Complex Network Analysis using the Hybrid Combination of a MapReduce Cluster and a Highly Multithreaded System Seunghwa Kang David A. Bader 1 A Challenge Problem Extracting a subgraph from
More informationExtending In-Memory Relational Database Engines with Native Graph Support
Extending In-Memory Relational Database Engines with Native Graph Support EDBT 18 Mohamed S. Hassan 1 Tatiana Kuznetsova 1 Hyun Chai Jeong 1 Walid G. Aref 1 Mohammad Sadoghi 2 1 Purdue University West
More informationTimeReach: Historical Reachability Queries on Evolving Graphs
TimeReach: Historical Reachability Queries on Evolving Graphs Konstantinos Semertzidis, Kostas Lillis, Evaggelia Pitoura Computer Science and Engineering Department University of Ioannina, Greece {ksemer,klillis,pitoura}@cs.uoi.gr
More informationarxiv: v2 [cs.ds] 10 Jul 2015
ReHub. Extending Hub Labels for Reverse k-nearest Neighbor Queries on Large-Scale networks arxiv:1504.01497v2 [cs.ds] 10 Jul 2015 Alexandros Efentakis Research Center Athena efentakis@imis.athena-innovation.gr
More informationCSE 417: Algorithms and Computational Complexity. 3.1 Basic Definitions and Applications. Goals. Chapter 3. Winter 2012 Graphs and Graph Algorithms
Chapter 3 CSE 417: Algorithms and Computational Complexity Graphs Reading: 3.1-3.6 Winter 2012 Graphs and Graph Algorithms Slides by Kevin Wayne. Copyright 2005 Pearson-Addison Wesley. All rights reserved.
More informationMining Data Streams. Outline [Garofalakis, Gehrke & Rastogi 2002] Introduction. Summarization Methods. Clustering Data Streams
Mining Data Streams Outline [Garofalakis, Gehrke & Rastogi 2002] Introduction Summarization Methods Clustering Data Streams Data Stream Classification Temporal Models CMPT 843, SFU, Martin Ester, 1-06
More informationCSE 100 Minimum Spanning Trees Prim s and Kruskal
CSE 100 Minimum Spanning Trees Prim s and Kruskal Your Turn The array of vertices, which include dist, prev, and done fields (initialize dist to INFINITY and done to false ): V0: dist= prev= done= adj:
More informationCOP 4531 Complexity & Analysis of Data Structures & Algorithms
COP Complexity & Analysis of Data Structures & Algorithms Overview of Graphs Breadth irst Search, and Depth irst Search hanks to several people who contributed to these slides including Piyush Kumar and
More information1 Minimum Spanning Trees (MST) b 2 3 a. 10 e h. j m
Minimum Spanning Trees (MST) 8 0 e 7 b 3 a 5 d 9 h i g c 8 7 6 3 f j 9 6 k l 5 m A graph H(U,F) is a subgraph of G(V,E) if U V and F E. A subgraph H(U,F) is called spanning if U = V. Let G be a graph with
More informationTELCOM2125: Network Science and Analysis
School of Information Sciences University of Pittsburgh TELCOM2125: Network Science and Analysis Konstantinos Pelechrinis Spring 2015 Figures are taken from: M.E.J. Newman, Networks: An Introduction 2
More informationHierarchical Hub Labelings for Shortest Paths
Hierarchical Hub Labelings for Shortest Paths Ittai Abraham, Daniel Delling, Andrew V. Goldberg, and Renato F. Werneck Microsoft Research Silicon Valley {ittaia,dadellin,goldberg,renatow}@microsoft.com
More informationLecture 3: Graphs and flows
Chapter 3 Lecture 3: Graphs and flows Graphs: a useful combinatorial structure. Definitions: graph, directed and undirected graph, edge as ordered pair, path, cycle, connected graph, strongly connected
More informationAn Optimal and Progressive Approach to Online Search of Top-K Influential Communities
An Optimal and Progressive Approach to Online Search of Top-K Influential Communities Fei Bi, Lijun Chang, Xuemin Lin, Wenjie Zhang University of New South Wales, Australia The University of Sydney, Australia
More informationExtracting Information from Complex Networks
Extracting Information from Complex Networks 1 Complex Networks Networks that arise from modeling complex systems: relationships Social networks Biological networks Distinguish from random networks uniform
More informationLecture 8: PATHS, CYCLES AND CONNECTEDNESS
Discrete Mathematics August 20, 2014 Lecture 8: PATHS, CYCLES AND CONNECTEDNESS Instructor: Sushmita Ruj Scribe: Ishan Sahu & Arnab Biswas 1 Paths, Cycles and Connectedness 1.1 Paths and Cycles 1. Paths
More informationSeminar on Algorithms and Data Structures: Multiple-Edge-Fault-Tolerant Approximate Shortest-Path Trees [1]
Seminar on Algorithms and Data Structures: Multiple-Edge-Fault-Tolerant Approximate Shortest-Path Trees [1] Philipp Rimle April 14, 2018 1 Introduction Given a graph with positively real-weighted undirected
More informationDistribution-Free Models of Social and Information Networks
Distribution-Free Models of Social and Information Networks Tim Roughgarden (Stanford CS) joint work with Jacob Fox (Stanford Math), Rishi Gupta (Stanford CS), C. Seshadhri (UC Santa Cruz), Fan Wei (Stanford
More informationThree-Hop Distance Estimation in Social Graphs
Three-Hop Distance Estimation in Social Graphs Pascal Welke University of Bonn 1 Bonn, Germany welke@uni-bonn.de Alexander Markowetz Markowetz.de Bonn, Germany alexander@markowetz.de Torsten Suel New York
More informationThese notes present some properties of chordal graphs, a set of undirected graphs that are important for undirected graphical models.
Undirected Graphical Models: Chordal Graphs, Decomposable Graphs, Junction Trees, and Factorizations Peter Bartlett. October 2003. These notes present some properties of chordal graphs, a set of undirected
More informationMining Social Network Graphs
Mining Social Network Graphs Analysis of Large Graphs: Community Detection Rafael Ferreira da Silva rafsilva@isi.edu http://rafaelsilva.com Note to other teachers and users of these slides: We would be
More informationCopyright 2000, Kevin Wayne 1
Chapter 3 - Graphs Undirected Graphs Undirected graph. G = (V, E) V = nodes. E = edges between pairs of nodes. Captures pairwise relationship between objects. Graph size parameters: n = V, m = E. Directed
More informationAlgorithm and Complexity of Disjointed Connected Dominating Set Problem on Trees
Algorithm and Complexity of Disjointed Connected Dominating Set Problem on Trees Wei Wang joint with Zishen Yang, Xianliang Liu School of Mathematics and Statistics, Xi an Jiaotong University Dec 20, 2016
More informationA New Parallel Algorithm for Connected Components in Dynamic Graphs. Robert McColl Oded Green David Bader
A New Parallel Algorithm for Connected Components in Dynamic Graphs Robert McColl Oded Green David Bader Overview The Problem Target Datasets Prior Work Parent-Neighbor Subgraph Results Conclusions Problem
More informationAnswering Pattern Match Queries in Large Graph Databases Via Graph Embedding
Noname manuscript No. (will be inserted by the editor) Answering Pattern Match Queries in Large Graph Databases Via Graph Embedding Lei Zou Lei Chen M. Tamer Özsu Dongyan Zhao the date of receipt and acceptance
More informationRoute Planning. Reach ArcFlags Transit Nodes Contraction Hierarchies Hub-based labelling. Tabulation Dijkstra Bidirectional A* Landmarks
Route Planning Tabulation Dijkstra Bidirectional A* Landmarks Reach ArcFlags Transit Nodes Contraction Hierarchies Hub-based labelling [ADGW11] Ittai Abraham, Daniel Delling, Andrew V. Goldberg, Renato
More informationCS 5614: (Big) Data Management Systems. B. Aditya Prakash Lecture #21: Graph Mining 2
CS 5614: (Big) Data Management Systems B. Aditya Prakash Lecture #21: Graph Mining 2 Networks & Communi>es We o@en think of networks being organized into modules, cluster, communi>es: VT CS 5614 2 Goal:
More informationFast and Scalable Analysis of Massive Social Graphs
Fast and Scalable Analysis of Massive Social Graphs Xiaohan Zhao, Alessandra Sala, Haitao Zheng and Ben Y. Zhao Department of Computer Science, U. C. Santa Barbara, Santa Barbara, CA USA {xiaohanzhao,
More informationDecreasing the Diameter of Bounded Degree Graphs
Decreasing the Diameter of Bounded Degree Graphs Noga Alon András Gyárfás Miklós Ruszinkó February, 00 To the memory of Paul Erdős Abstract Let f d (G) denote the minimum number of edges that have to be
More informationNUMA-aware Graph-structured Analytics
NUMA-aware Graph-structured Analytics Kaiyuan Zhang, Rong Chen, Haibo Chen Institute of Parallel and Distributed Systems Shanghai Jiao Tong University, China Big Data Everywhere 00 Million Tweets/day 1.11
More informationAll-Pairs Nearly 2-Approximate Shortest-Paths in O(n 2 polylog n) Time
All-Pairs Nearly 2-Approximate Shortest-Paths in O(n 2 polylog n) Time Surender Baswana 1, Vishrut Goyal 2, and Sandeep Sen 3 1 Max-Planck-Institut für Informatik, Saarbrücken, Germany. sbaswana@mpi-sb.mpg.de
More informationCS/COE 1501 cs.pitt.edu/~bill/1501/ Graphs
CS/COE 1501 cs.pitt.edu/~bill/1501/ Graphs 5 3 2 4 1 0 2 Graphs A graph G = (V, E) Where V is a set of vertices E is a set of edges connecting vertex pairs Example: V = {0, 1, 2, 3, 4, 5} E = {(0, 1),
More informationADHOC SYSTEMSS ABSTRACT. the system. The diagnosis strategy. is an initiator of. can in the hierarchy. paper include. this. than.
Journal of Theoretical and Applied Information Technology 2005-2007 JATIT. All rights reserved. www.jatit.org A HIERARCHICAL APPROACH TO FAULT DIAGNOSIS IN LARGE S CALE SELF-DIAGNOSABLE WIRELESS ADHOC
More informationAdaptive Landmark Selection Strategies for Fast Shortest Path Computation in Large Real-World Graphs
Adaptive Landmark Selection Strategies for Fast Shortest Path Computation in Large Real-World Graphs Frank W. Takes and Walter A. Kosters Leiden Institute of Advanced Computer Science (LIACS), Leiden University,
More informationCounting and Exploring Sizes of Markov Equivalence Classes of Directed Acyclic Graphs
Journal of Machine Learning Research 16 (2015) 2589-2609 Submitted 9/14; Revised 3/15; Published 12/15 Counting and Exploring Sizes of Markov Equivalence Classes of Directed Acyclic Graphs Yangbo He heyb@pku.edu.cn
More informationGraphs. Motivations: o Networks o Social networks o Program testing o Job Assignment Examples: o Code graph:
Graphs Motivations: o Networks o Social networks o Program testing o Job Assignment Examples: o Code graph: S1: int x S2: If x > 0 then S3: X = x + 2; Else S4: X =x -1; End if S5: While x > 1 do S6: Print
More informationFundamental Algorithms
Fundamental Algorithms Chapter 8: Graphs Jan Křetínský Winter 2017/18 Chapter 8: Graphs, Winter 2017/18 1 Graphs Definition (Graph) A graph G = (V, E) consists of a set V of vertices (nodes) and a set
More informationPufferfish: A Semantic Approach to Customizable Privacy
Pufferfish: A Semantic Approach to Customizable Privacy Ashwin Machanavajjhala ashwin AT cs.duke.edu Collaborators: Daniel Kifer (Penn State), Bolin Ding (UIUC, Microsoft Research) idash Privacy Workshop
More informationBig Data Management and NoSQL Databases
NDBI040 Big Data Management and NoSQL Databases Lecture 10. Graph databases Doc. RNDr. Irena Holubova, Ph.D. holubova@ksi.mff.cuni.cz http://www.ksi.mff.cuni.cz/~holubova/ndbi040/ Graph Databases Basic
More informationStrongly connected: A directed graph is strongly connected if every pair of vertices are reachable from each other.
Directed Graph In a directed graph, each edge (u, v) has a direction u v. Thus (u, v) (v, u). Directed graph is useful to model many practical problems (such as one-way road in traffic network, and asymmetric
More informationUri Zwick Tel Aviv University The 6th Max-Planck Advanced Course on the Foundations of Computer Science (ADFOCS)
Approximating distances in graphs Uri Zwick Tel Aviv University The 6th Max-Planck Advanced Course on the Foundations of Computer Science (ADFOCS) All-Pairs Shortest Paths APSP algorithm mn time n 2 space
More informationEfficient Bufferless Packet Switching on Trees and Leveled Networks
Efficient Bufferless Packet Switching on Trees and Leveled Networks Costas Busch Malik Magdon-Ismail Marios Mavronicolas Abstract In bufferless networks the packets cannot be buffered while they are in
More informationAlgorithms: Lecture 10. Chalmers University of Technology
Algorithms: Lecture 10 Chalmers University of Technology Today s Topics Basic Definitions Path, Cycle, Tree, Connectivity, etc. Graph Traversal Depth First Search Breadth First Search Testing Bipartatiness
More informationCS 220: Discrete Structures and their Applications. graphs zybooks chapter 10
CS 220: Discrete Structures and their Applications graphs zybooks chapter 10 directed graphs A collection of vertices and directed edges What can this represent? undirected graphs A collection of vertices
More informationThe Shortest Path Problem
The Shortest Path Problem 1 Shortest-Path Algorithms Find the shortest path from point A to point B Shortest in time, distance, cost, Numerous applications Map navigation Flight itineraries Circuit wiring
More informationCS249: SPECIAL TOPICS MINING INFORMATION/SOCIAL NETWORKS
CS249: SPECIAL TOPICS MINING INFORMATION/SOCIAL NETWORKS Overview of Networks Instructor: Yizhou Sun yzsun@cs.ucla.edu January 10, 2017 Overview of Information Network Analysis Network Representation Network
More informationTie strength, social capital, betweenness and homophily. Rik Sarkar
Tie strength, social capital, betweenness and homophily Rik Sarkar Networks Position of a node in a network determines its role/importance Structure of a network determines its properties 2 Today Notion
More informationEnforcing Customizable Consistency Properties in Software-Defined Networks. Wenxuan Zhou, Dong Jin, Jason Croft, Matthew Caesar, Brighten Godfrey
Enforcing Customizable Consistency Properties in Software-Defined Networks Wenxuan Zhou, Dong Jin, Jason Croft, Matthew Caesar, Brighten Godfrey 1 Network changes control applications, changes in traffic
More informationTopology Enhancement in Wireless Multihop Networks: A Top-down Approach
Topology Enhancement in Wireless Multihop Networks: A Top-down Approach Symeon Papavassiliou (joint work with Eleni Stai and Vasileios Karyotis) National Technical University of Athens (NTUA) School of
More informationProblem Set 4. General Instructions
CS224W: Social and Information Network Analysis Fall 2013 Problem Set 4 Due 9:30am November 21, 2013 General Instructions These questions require thought, but do not require long answers. Please be as
More informationProbabilistic Graph Summarization
Probabilistic Graph Summarization Nasrin Hassanlou, Maryam Shoaran, and Alex Thomo University of Victoria, Victoria, Canada {hassanlou,maryam,thomo}@cs.uvic.ca 1 Abstract We study group-summarization of
More informationMechanism Design in Large Congestion Games
Mechanism Design in Large Congestion Games Ryan Rogers, Aaron Roth, Jonathan Ullman, and Steven Wu July 22, 2015 Routing Game l e (y) Routing Game Routing Game A routing game G is defined by Routing Game
More informationThese are not polished as solutions, but ought to give a correct idea of solutions that work. Note that most problems have multiple good solutions.
CSE 591 HW Sketch Sample Solutions These are not polished as solutions, but ought to give a correct idea of solutions that work. Note that most problems have multiple good solutions. Problem 1 (a) Any
More informationGraph Theory S 1 I 2 I 1 S 2 I 1 I 2
Graph Theory S I I S S I I S Graphs Definition A graph G is a pair consisting of a vertex set V (G), and an edge set E(G) ( ) V (G). x and y are the endpoints of edge e = {x, y}. They are called adjacent
More informationCS246: Mining Massive Datasets Jure Leskovec, Stanford University
CS246: Mining Massive Datasets Jure Leskovec, Stanford University http://cs246.stanford.edu SPAM FARMING 2/11/2013 Jure Leskovec, Stanford C246: Mining Massive Datasets 2 2/11/2013 Jure Leskovec, Stanford
More information12/5/17. trees. CS 220: Discrete Structures and their Applications. Trees Chapter 11 in zybooks. rooted trees. rooted trees
trees CS 220: Discrete Structures and their Applications A tree is an undirected graph that is connected and has no cycles. Trees Chapter 11 in zybooks rooted trees Rooted trees. Given a tree T, choose
More informationManaging and Mining Billion Node Graphs. Haixun Wang Microsoft Research Asia
Managing and Mining Billion Node Graphs Haixun Wang Microsoft Research Asia Outline Overview Storage Online query processing Offline graph analytics Advanced applications Is it hard to manage graphs? Good
More informationMulticore Triangle Computations Without Tuning
Multicore Triangle Computations Without Tuning Julian Shun and Kanat Tangwongsan Presentation is based on paper published in International Conference on Data Engineering (ICDE), 2015 1 2 Triangle Computations
More informationOn total domination and support vertices of a tree
On total domination and support vertices of a tree Ermelinda DeLaViña, Craig E. Larson, Ryan Pepper and Bill Waller University of Houston-Downtown, Houston, Texas 7700 delavinae@uhd.edu, pepperr@uhd.edu,
More informationLecture Note: Computation problems in social. network analysis
Lecture Note: Computation problems in social network analysis Bang Ye Wu CSIE, Chung Cheng University, Taiwan September 29, 2008 In this lecture note, several computational problems are listed, including
More informationAlgorithm Design and Analysis
Algorithm Design and Analysis LECTURE 4 Graphs Definitions Traversals Adam Smith 9/8/10 Exercise How can you simulate an array with two unbounded stacks and a small amount of memory? (Hint: think of a
More informationSolutions to relevant spring 2000 exam problems
Problem 2, exam Here s Prim s algorithm, modified slightly to use C syntax. MSTPrim (G, w, r): Q = V[G]; for (each u Q) { key[u] = ; key[r] = 0; π[r] = 0; while (Q not empty) { u = ExtractMin (Q); for
More information