Approximating Betweenness Centrality

Size: px
Start display at page:

Download "Approximating Betweenness Centrality"

Transcription

1 Approximating Betweenness Centrality David A. Bader, Shiva Kintali, Kamesh Madduri, and Milena Mihail College of Computing Georgia Institute of Technology {bader, kintali, kamesh, Abstract. Betweenness is a centrality measure based on shortest paths, widely used in complex network analysis. It is computationally-expensive to exactly determine betweenness; currently the fastest-known algorithm by Brandes requires O(nm) time for unweighted graphs and O(nm + n 2 log n) time for weighted graphs, where n is the number of vertices and m is the number of edges in the network. These are also the worstcase time bounds for computing the betweenness score of a single vertex. In this paper, we present a novel approximation algorithm for computing betweenness centrality of a given vertex, for both weighted and unweighted graphs. Our approximation algorithm is based on an adaptive sampling technique that significantly reduces the number of single-source shortest path computations for vertices with high centrality. We conduct an extensive experimental study on real-world graph instances, and observe that our random sampling algorithm gives very good betweenness approximations for biological networks, road networks and web crawls. 1 Introduction One of the fundamental problems in network analysis is to determine the importance (or the centrality) of a particular vertex (or an edge) in a network. Some of the well-known metrics for computing centrality are closeness [1], stress [2] and betweenness [3,4]. Of these indices, betweenness has been extensively used in recent years for the analysis of social-interaction networks, as well as other large-scale complex networks. Some applications include lethality in biological networks [5,6,7], study of sexual networks and AIDS [8], identifying key actors in terrorist networks [9,10], organizational behavior [11], and supply chain management processes [12]. Betweenness is also used as the primary routine in popular algorithms for clustering and community identification [13] in real-world networks. For instance, the Girvan-Newman [14] algorithm iteratively partitions a network by identifying edges with high betweenness scores, removing them and recomputing centrality scores. Betweenness is a global centrality metric that is based on shortest-path enumeration. Consider a graph G =(V,E), where V is the set of vertices representing actors or nodes in the complex network, and E, the set of edges representing the relationships between the vertices. The number of vertices and edges are denoted by n and m respectively. The graphs can be directed or undirected. A. Bonato and F.R.K. Chung (Eds.): WAW 2007, LNCS 4863, pp , c Springer-Verlag Berlin Heidelberg 2007

2 Approximating Betweenness Centrality 125 We will assume that each edge e E has a positive integer weight w(e). For unweighted graphs, we use w(e) =1.Apath from vertex s to t is defined as a sequence of edges u i,u i+1,0 i<l,whereu 0 = s and u l = t. Thelength of a path is the sum of the weights of edges. We use d(s, t) to denote the distance between vertices s and t (the minimum length of any path connecting s and t in G). Let us denote the total number of shortest paths between vertices s and t by λ st, and the number passing through vertex v by λ st (v). Let δ st (v) denote the fraction of shortest paths between s and t that pass through a particular vertex v i.e., δ st (v) = λst(v) λ st.wecallδ st (v) thepair-dependency of s, t on v. Betweenness centrality [3,4] of a vertex v is defined as BC(v) = δ st (v) s v t V Currently, the fastest known algorithm for exactly computing betweenness of all the vertices, designed by Brandes [15], requires at least O(nm) timefor unweighted graphs and O(nm + n 2 log n) time for weighted graphs, where n is the number of vertices and m is the number of edges. Thus, for large-scale graphs, exact centrality computation on current workstations is not practically viable. In prior work, we explored high performance computing techniques [16] that exploit the typical small-world graph topology to speed up exact centrality computation. We designed novel parallel algorithms to exactly compute various centrality metrics, optimized for real-world networks. We also demonstrate the capability to compute exact betweenness on several large-scale networks (vertices and edges in the order of millions) from the Internet and social interaction data; these networks are three orders of magnitude larger than instances that can be processed by current social network analysis packages. Fast centrality estimation is thus an important problem, as a good approximation would be an acceptable alternative to exact scores. Currently the fastest exact algorithms for shortest path enumeration-based metrics require n shortestpath computations; however, it is possible to estimate centrality by extrapolating scores from a fewer number of path computations. Using a random sampling technique, Eppstein and Wang [17] show that the closeness centrality of all vertices in a weighted, undirected graph can be approximated with high probability in O( log n ɛ (n log n + m)) time, and an additive error of at most ɛδ 2 G (ɛ is a fixed constant, and Δ G is the diameter of the graph). However, betweenness centrality scores are harder to estimate, and the quality of approximation is found to be dependent on the vertices from which the shortest path computations are initiated from (in this paper, we will refer to them as the set of source vertices for the approximation algorithm). Recently, Brandes and Pich [18] presented centrality estimation heuristics, where they experimented with different strategies for selecting the source vertices. They observe that a random selection of source vertices is superior to deterministic strategies. In addition to exact parallel algorithms, we also discussed parallel techniques to compute approximate betweenness centrality in [16], using a random source selection strategy.

3 126 D.A. Bader et al. While prior approaches approximate centrality scores of all vertices in the graph, there are no known algorithms to compute the centrality of a single vertex in time faster than computing the betweenness of all vertices. In this paper, we present a novel adaptive sampling-based algorithm for approximately computing betweenness centrality of a given vertex. Our primary result is as follows: Theorem: For 0 <ɛ<0.5, if the centrality of a vertex v is n 2 /t for some constant t 1, then with probability 1 2ɛ its centrality can be estimated to within a factor of 1/ɛ with ɛt samples of source vertices. The rest of this paper is organized as follows. We review the currently-known fastest sequential algorithm by Brandes in Section 2. We present our approximation algorithm based on adaptive sampling and its analysis in Section 3. Section 4 is an experimental study of our approximation technique on several real-world networks. We conclude with a summary of open problems in Section 5. 2 Exact Computation of Betweenness Centrality Brandes algorithm [15] shows how to compute centrality scores of all the vertices in the graph in the same asymptotic time bounds as n SSSP computations. 2.1 Brandes Algorithm Define the dependency of a source vertex s V on a vertex v V as δ s (v) = t s v V δ st(v). Then the betweenness score of v can be then expressed as BC(v) = s v V δ s (v). Also, let P s (v) denotethesetofpredecessors of a vertex v on shortest paths from s: P s (v) = {u V : u, v E,d(s, v) = d(s, u) + w(u, v)}. Brandes shows that the dependencies satisfy the following recursive relation, which is the most crucial step in the algorithm analysis. Theorem 1. The dependency of s V on any v V obeys δ s (v) = w:v P s(w) λ sv λ sw (1 + δ s (w)) First, n SSSP computations are done, one for each s V. The predecessor sets P s (v) are maintained during these computations. Next, for every s V, using the information from the shortest paths tree and predecessor sets along the paths, compute the dependencies δ s (v) for all other v V. To compute the centrality value of a vertex v, we finally compute the sum of all dependency values. The O(n 2 ) space requirements can be reduced to O(m+n) by maintaining a running centrality score. For additional details, we refer the reader to Brandes paper [15].

4 Approximating Betweenness Centrality Adaptive-Sampling Based Approximation The adaptive sampling technique was introduced by Lipton and Naughton [19] for estimating the size of the transitive closure of a digraph. Prior to their work, algorithms for estimating transitive closure were based on randomly sampling source-vertices, solving the single-source reachability problem for the sampled vertices, and using this information to estimate the size of the transitive closure. The Lipton-Naughton algorithm introduces adaptive sampling of source-vertices, that is, the number of samples varies with the information obtained from each sample. In this section, we give an adaptive sampling algorithm for computing betweenness of a given vertex v. It is a sampling algorithm in that it estimates the centrality by sampling a subset of vertices and performing SSSP computations from these vertices. It is termed adaptive, because the number of samples required varies with the information obtained from each sample. The following lemma is easy to see and the proof is omitted. Lemma 1. BC(v) is zero iff its neighboring vertices induce a clique. Let a i denote the dependency of the vertex v i on v i.e., a i = δ vi (v). Let A = ai = BC(v). It is easy to verify that 0 a i n 2and0 A (n 1)(n 2)/2. The quantity we wish to estimate is A. Consider doing so with the following algorithm: Algorithm 1. Repeatedly sample a vertex v i V ; perform SSSP (using BFS or Dijkstra s algorithm) from v i and maintain a running sum S of the dependency scores δ vi (v). Sample until S is greater than cn for some constant c 2. Let the total number of samples be k. The estimated betweenness centrality score of v, BC(v) is given by ns k. Let X i be the random variable representing the dependency of a randomly sampled vertex on v. The probability of an event x is denoted by Pr [ x ]. We establish the following lemmas to analyze the above algorithm. Lemma 2. Let E[X i ] denote the expectation of X i and Var[X i ] denote the variance of X i.then,e[x i ]=A/n, E[X i 2 ] A, andvar[x i ] A. The next lemma is useful in proving a lower bound on the expected number of samples made before stopping. The proof is presented in the Appendix. Lemma 3. Let k = ɛn 2 /A. Then, Pr [ X 1 + X X k cn ] Lemma 4. Let k ɛn 2 /A and d>0. Then [ ( k ) ] Pr n X i A da k i=1 ɛ (c ɛ) 2 1 ɛd 2

5 128 D.A. Bader et al. Theorem 2. Let à betheestimateofa in the above procedure and let A>0. Then for 0 <ɛ<0.5 with probability 1 2ɛ, Algorithm 1 estimates A to within a factor of 1/ɛ. Proof. There are two ways that the algorithm can fail: (i) it can stop too early to guarantee a good error bound, (ii) it can stop after enough samples but with a bad estimate. First we claim that the procedure is unlikely to stop with k n 2 /A. Wehave that Pr [( j)(j k) (X 1 + X X j cn) ] Pr [ X 1 + X X k cn ] where k = ɛn2, because the event to the right of the inequality implies the event A to the left. But by Lemma 3, the right side of this equation is at most ɛ/(c ɛ) 2. Substituting c = 2 and noting that 0 < ɛ < 0.5, we get that this probability is less than ɛ. Next we turn to the accuracy of the estimate. If k = ɛn 2 /A, by Lemma 4 the estimate, à = n k X i k is within da of A with probability 1/(ɛd 2 ). Letting d =1/ɛ, thisisjustɛ. Putting the two ways of failure together, we get that the total probability of failure is less than ɛ +(1 ɛ)ɛ, which is less than 2ɛ. Finally, note that if A>0, there must be at least one i such that a i > 0, so the algorithm will terminate. ThecasewhenA = 0 (i.e., centrality of v is 0) can be detected using Lemma 1 (before running the algorithm). An interesting aspect of our theorem is that the sampling is adaptive. usually such sampling procedures perform a fixed number of samples. Here it is critical that the algorithm adapts it behavior. Substituting A = n2 in our analysis we t get the following theorem. Theorem 3. For 0 <ɛ<0.5, if the centrality of a vertex v is n 2 /t for some constant t 1, thenwithprobability 1 2ɛ its centrality can be estimated to within a factor of 1/ɛ with ɛt samples of source vertices. Although our theoretical result is valid only for high centrality nodes, our experimental results (presented in the next section) show a similar behavior for all the vertices. i=1 4 Experimental Study We assess the quality of the sampling-based approximation algorithm on several real-world graph instances (see Table 1). We use the parallel centrality analysis

6 Approximating Betweenness Centrality 129 toolkit SNAP [20] to compute exact betweenness scores. Since the execution time and speedup achieved by the approximation approach are directly proportional to the number of BFS/shortest path computations, we do not report performance results in this section. For a detailed discussion of exact centrality computation in parallel, and optimizations for small-world graphs, please refer to [16]. Network Data We experiment with two synthetic graph instances and four real networks in this study. rand is an unweighted, undirected random network of 2000 vertices and 7980 edges, generated using the Erdős Rényi graph model [27]. This synthetic graph has a low diameter, low clustering, and a Gaussian degree distribution. pref-attach is a synthetic graph generated using the Preferential attachment model proposed by Barabási and Albert [28]. This model generates graphs with heavy-tailed degree distributions and scale-free properties. Vertices are added one at a time, and for each of them, we create a fixed number of edges connecting to existing vertices, with probability proportional to their degree. bio-pin is a biological network that represents interactions in the human proteome [29,23]. This graph is undirected, unweighted and exhibits small-world characteristics. crawl corresponds to the wb-cs-stanford network in the UF sparse matrix collection [24]. It is a directed graph, where vertices correspond to pages in the Stanford Computer Science domain, and edges represent links. cite is a directed graph from the Pajek network collection [25]. It corresponds to papers by and citing J. Lederberg ( ). road is a weighted graph of 3353 vertices and 4435 edges that corresponds to a large portion of the road network of Rome, Italy from 1999 [26]. Vertices correspond to intersections between roads, and edges correspond to roads or road segments. Edge weights are physical distances in metres. Road networks have more structure and a higher diameter than the other networks considered in this study. Table 1. Networks used in the experimental study Label Network n m Details Source rand random graph synthetic, undirected [21] pref-attach preferential attachment synthetic, undirected [22] bio-pin human protein interactions ,191 undirected [23] crawl web-crawl (stanford.edu) ,854 directed [24] cite Lederberg citation network ,601 directed [24,25] road Rome, Italy road network weighted, undirected [26] Methodology Our goal in this study is to quantify the approximation quality, and so we primarily compare the approximation results to exact scores. We first compute exact centrality scores of all the networks in Table 1. In most data sets, we are

7 130 D.A. Bader et al. interested in high-centrality vertices, as they are the critical entities and are used in further analysis. From the exact scores, we identify vertices whose centrality scores are an order of magnitude greater than the rest of the network. For these vertices, we study the trade-off between computation and approximation quality by varying the parameter c in Algorithm 1. We also show that it is easy to estimate scores of low-centrality vertices. We chose small networks for ease of analysis and visualization, but the approximation algorithm can be effectively applied to large networks as well (see, for instance, the networks considered in [16]). Experiments Figure 1 plots the distribution of exact and approximate betweenness scores for the six different test instances. Note that the synthetic networks, rand and pref-attach show significantly lower variation in exact centrality scores compared to the real instances. Also, there are a significant percentage of lowcentrality vertices (scores less than, or close to, n) incite, crawl and bio-pin. We apply Algorithm 1 to estimate betweenness centrality scores of all the vertices in the test instances. In order to visualize the data better, we plot a smoothed curve of the estimated betweenness centrality data that is superimposed with the exact centrality score scatter-plot. We set the parameter c in Algorithm 1 to 5 for these experiments. In addition, we impose a cut-off of n 20 on the number of samples. Observe that in all the networks, the estimated centrality scores are very close to the exact ones, and we are guaranteed to cut down on the computation by a factor of nearly 20. To further study the quality of approximation for high-centrality vertices, we select the top 1% of the vertices (about 30) ordered by exact centrality score in each network, and compute their estimated centrality scores using the adaptivesampling algorithm. Since the source vertices in the adaptive approach are chosen randomly, we repeat the experiment five times for each vertex and report the mean and variance in approximation error. Figure 2 plots the mean percentage approximation error in the computed scores for these high centrality vertices, when the value of c (see Algorithm 3) is set to 5. The vertices are sorted by exact centrality score on the X-axis. The error bars in the charts indicate the variance in estimated score due to random runs, for each network. For the random graph instance, the average error is about 5%, while it is roughly around 10% for the rest of the networks. Except for a few anomalous vertices, the error variance is within reasonable bounds in all the graph classes. Figure 3 plots the percentage of BFS/SSSP computations required for approximating the centrality scores, when c is set to 5. This algorithmic count is an indicator of the amount of work done by the approximation algorithm. The vertices are ordered again by their exact centrality scores from left to right, with the vertex with the least score to the left. A common trend we observe across all graph classes is that the percentage of source vertices decreases as the centrality score increases this implies that the scores of high centrality vertices can be approximated with lesser work using the adaptive sampling approach. Also,

8 Approximating Betweenness Centrality (a) rand (b) pref-attach (c) bio-pin (d) crawl (e) cite (f) road 131 Fig. 1. A scatter plot of exact betweenness scores of all the vertices (in sorted order), and a line plot of their estimated betweenness scores (the approximate betweenness scatter data is smoothed by a local smoothing technique using polynomial regression) this value is significantly lower for crawl, bio-pin and road compared to other graph classes. We can also vary the parameter c, which affects both the percentage of BFS/SSSP computations and the approximation quality. Table 2 summarizes

9 132 D.A. Bader et al. (a) rand (b) pref-attach (c) bio-pin (d) crawl (e) cite (f) road Fig. 2. Average estimated betweenness error percentage (in comparison to the exact centrality score) for multiple runs. The adaptive sampling parameter c is set to 5 for all experiments and the error bars indicate the variance. the average performance on each graph instance, for different values of c. Taking only high-centrality vertices into consideration, we report the mean approximation error and the number of samples for each graph instance. As expected, we

10 Approximating Betweenness Centrality 133 (a) rand (b) pref-attach (c) bio-pin (d) crawl (e) cite (f) road Fig. 3. The number of samples/sssp computations as a fraction of n, the total number of vertices. This algorithmic count is an indicator of the amount of work done by the approximation algorithm. The adaptive sampling parameter c is set to 5, and the error bars indicate the variance from 5 runs. find that the error decreases as the parameter c is increased, while the number of samples increases. Since the highest centrality value is around 10 n for the citation network, a significant number of shortest path computations have to be done even for calculating scores with a reasonable accuracy. But for other

11 134 D.A. Bader et al. Table 2. Observed average-case algorithmic counts, as the value of the sampling parameter c is varied. The average error percentage is the deviation of the estimated score from the exact score, and the k percentage indicates the number of samples/sssp computations. n Network rand pref-attach bio-pin crawl cite road t=2 Avg. error 16.28% 29.39% 46.72% 33.69% 32.51% 22.58% Avg. k n 11.31% 5.36% 1.30% 0.96% 17.00% 0.68 % t=5 Avg. error 6.51% 10.28% 10.49% 10.31% 9.98% 8.79% Avg. k n 27.37% 12.38% 3.20% 2.42% 43.85% 1.68% t=10 Avg. error 5.62% 6.13% 7.17% 7.04% 7.39% Avg. k n 54.51% 24.66% 6.33% 4.89% 3.29% graph instances, particularly the road network, web-crawl and the protein interaction network, c = 5 offers a good trade-off between computation and the approximation quality. 5 Conclusion and Open Problems We presented a novel approximation algorithm for computing betweenness centrality, of a given vertex, in both weighted and unweighted graphs. Our approximation algorithm is based on an adaptive sampling technique that significantly reduces the number of single-source shortest path computations for vertices with high centrality. We conduct an extensive experimental study on real-world graph instances, and observe that the approximation algorithm performs well on web crawls, road networks and biological networks. Approximating the centrality of all vertices in time less than O(nm) for unweighted graphs and O(nm + n 2 log n) for weighted graphs is a challenging open problem. Designing a fully dynamic algorithm for computing betweenness is very useful. Acknowledgments The authors are grateful to the ARC (Algorithms and Randomness Center) of the College of Computing, Georgia Institute of Technology, for funding this project. This work was also supported in part by NSF Grants CAREER CCF , NSF DBI , ITR EF/BIO , and DARPA Contract NBCH Kamesh Madduri s work is supported in part by the NASA Graduate Student Researcher Program Fellowship (NASA NP HQ). We thank Richard Lipton for helpful discussions.

12 Approximating Betweenness Centrality 135 References 1. Sabidussi, G.: The centrality index of a graph. Psychometrika 31, (1966) 2. Shimbel, A.: Structural parameters of communication networks. Bulletin of Mathematical Biophysics 15, (1953) 3. Freeman, L.: A set of measures of centrality based on betweenness. Sociometry 40(1), (1977) 4. Anthonisse, J.: The rush in a directed graph. In: Report BN9/71, Stichting Mathematisch Centrum, Amsterdam, Netherlands (1971) 5. Jeong, H., Mason, S., Barabási, A.L., Oltvai, Z.: Lethality and centrality in protein networks. Nature 411, (2001) 6. Pinney, J., McConkey, G., Westhead, D.: Decomposition of biological networks using betweenness centrality. In: McLysaght, A., Huson, D.H. (eds.) RECOMB LNCS (LNBI), vol. 3678, Springer, Heidelberg (2005) 7. del Sol, A., Fujihashi, H., O Meara, P.: Topology of small-world networks of proteinprotein complex structures. Bioinformatics 21(8), (2005) 8. Liljeros, F., Edling, C., Amaral, L., Stanley, H., Åberg, Y.: The web of human sexual contacts. Nature 411, (2001) 9. Krebs, V.: Mapping networks of terrorist cells. Connections 24(3), (2002) 10. Coffman, T., Greenblatt, S., Marcus, S.: Graph-based technologies for intelligence analysis. Communications of the ACM 47(3), (2004) 11. Buckley, N., van Alstyne, M.: Does make white collar workers more productive? Technical report, University of Michigan (2004) 12. Cisic, D., Kesic, B., Jakomin, L.: Research of the power in the supply chain. In: International Trade, Economics Working Paper Archive EconWPA (April 2000) 13. Newman, M.: The structure and function of complex networks. SIAM Review 45(2), (2003) 14. Girvan, M., Newman, M.: Community structure in social and biological networks. Proceedings of the National Academy of Sciences, USA 99(12), (2002) 15. Brandes, U.: A faster algorithm for betweenness centrality. J. Mathematical Sociology 25(2), (2001) 16. Bader, D., Madduri, K.: Parallel algorithms for evaluating centrality indices in realworld networks. In: ICPP. Proc. 35th Int l Conf. on Parallel Processing, Columbus, OH, IEEE Computer Society, Los Alamitos (2006) 17. Eppstein, D., Wang, J.: Fast approximation of centrality. In: SODA Proc. 12th Ann. Symp. Discrete Algorithms, Washington, DC, pp (2001) 18. Brandes, U., Pich, C.: Centrality estimation in large networks. In: To appear in Intl. Journal of Bifurcation and Chaos, Special Issue on Complex Networks Structure and Dynamics (2007) 19. Lipton, R., Naughton, J.: Estimating the size of generalized transitive closures. In: VLDB, pp (1989) 20. Madduri, K., Bader, D.: Small-world Network Analysis in Parallel: a toolkit for centrality analysis (2007), kamesh 21. Madduri, K., Bader, D.: GTgraph: A suite of synthetic graph generators (2006), kamesh/gtgraph 22. Barabási, A.L.: Network databases (2007), networks/resources.htm 23. Bader, D., Madduri, K.: A graph-theoretic analysis of the human protein interaction network using multicore parallel algorithms. In: HiCOMB Proc. 6th Workshop on High Performance Computational Biology, Long Beach, CA (March 2007)

13 136 D.A. Bader et al. 24. Davis, T.: University of Florida Sparse Matrix Collection (2007), Batagelj, V., Mrvar, A.: PAJEK datasets (2006), Demetrescu, C., Goldberg, A., Johnson, D.: 9th DIMACS implementation challenge Shortest Paths (2006), challenge9/ 27. Erdős, P., Rényi, A.: On random graphs I. Publicationes Mathematicae 6, (1959) 28. Barabási, A.L., Albert, R.: Emergence of scaling in random networks. Science 286(5439), (1999) 29. Peri, S., et al.: Development of human protein reference database as an initial platform for approaching systems biology in humans. Genome Research 13, (2003) Appendix Lemma 3 Let k = ɛn 2 /A. Then, ɛ Pr [ X 1 + X X k cn ] (c ɛ) 2 Proof. We have [( Pr [ X X k cn ]=Pr X 1 A ) ( + + X k A ) n n [( = Pr X 1 A ) ( + + X k A ) n n [ Pr X i An ] (c ɛ)n i cn ka ] n ] cn ɛn 1 (c ɛ) 2 n 2 Var[X i] i 1 = (c ɛ) 2 n 2 Var[X i ] i 1 (c ɛ) 2 n 2 ka ɛ = (c ɛ) 2 Note that we have used Chebychev s inequality and union bounds in the above proof. We bound the error in the estimated value of A with the following lemma. Lemma 4 Let k ɛn 2 /A and d>0. Then [ ( k ) ] Pr n X i A da k i=1 1 ɛd 2

14 Approximating Betweenness Centrality 137 Proof [ ( k ) ] [ ( k ) Pr n X i A t = Pr X i k i=1 i=1 [ ( k ) = Pr X i 1 n A i=1 n2 k 2 t 2 k Var[X i] k n A kt n kt n ] ] to Let k = λ n2,whereλ ɛ. Then the above probability is less than or equal A n 2 k 2 t 2 k Var[X i] which is just A2. Setting Ad = t gives λt2 1 λd 2 1 ɛd 2 n2 λ n2 A t2 A

ALTERNATIVES TO BETWEENNESS CENTRALITY: A MEASURE OF CORRELATION COEFFICIENT

ALTERNATIVES TO BETWEENNESS CENTRALITY: A MEASURE OF CORRELATION COEFFICIENT ALTERNATIVES TO BETWEENNESS CENTRALITY: A MEASURE OF CORRELATION COEFFICIENT Xiaojia He 1 and Natarajan Meghanathan 2 1 University of Georgia, GA, USA, 2 Jackson State University, MS, USA 2 natarajan.meghanathan@jsums.edu

More information

HPCS Scalable Synthetic Compact Applications #2 Graph Analysis

HPCS Scalable Synthetic Compact Applications #2 Graph Analysis HPCS Scalable Synthetic Compact Applications #2 Graph Analysis Contributors: David A. Bader (Georgia Tech), John Feo (Cray), John Gilbert (UC Santa Barbara), Jeremy Kepner (MIT/LL), David Koester (Mitre),

More information

Centrality Estimation in Large Networks

Centrality Estimation in Large Networks Centrality Estimation in Large Networks Ulrik Brandes Christian Pich Department of Computer and Information Science University of Konstanz August 18, 26 Abstract Centrality indices are an essential concept

More information

THE concept of interactome networks represents a very. Sparse Networks-Based Speedup Technique for Proteins Betweenness Centrality Computation

THE concept of interactome networks represents a very. Sparse Networks-Based Speedup Technique for Proteins Betweenness Centrality Computation Sparse Networks-Based Speedup Technique for Proteins Betweenness Centrality Computation Razvan Bocu, University College Cork, Dr Sabin Tabirca, University College Cork Abstract The study of proteomics

More information

HPCS Scalable Synthetic Compact Applications #2 Graph Analysis

HPCS Scalable Synthetic Compact Applications #2 Graph Analysis HPCS Scalable Synthetic Compact Applications #2 Graph Analysis Contributors: David A. Bader (Georgia Tech), John Feo (Cray), John Gilbert (UC Santa Barbara), Jeremy Kepner (MIT/LL), David Koester (Mitre),

More information

is the Capacitated Minimum Spanning Tree

is the Capacitated Minimum Spanning Tree Dynamic Capacitated Minimum Spanning Trees Raja Jothi and Balaji Raghavachari Department of Computer Science, University of Texas at Dallas Richardson, TX 75083, USA raja, rbk @utdallas.edu Abstract Given

More information

An Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization

An Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization An Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization Pedro Ribeiro (DCC/FCUP & CRACS/INESC-TEC) Part 1 Motivation and emergence of Network Science

More information

Scribe from 2014/2015: Jessica Su, Hieu Pham Date: October 6, 2016 Editor: Jimmy Wu

Scribe from 2014/2015: Jessica Su, Hieu Pham Date: October 6, 2016 Editor: Jimmy Wu CS 267 Lecture 3 Shortest paths, graph diameter Scribe from 2014/2015: Jessica Su, Hieu Pham Date: October 6, 2016 Editor: Jimmy Wu Today we will talk about algorithms for finding shortest paths in a graph.

More information

A Faster Parallel Algorithm and Efficient Multithreaded Implementations for Evaluating Betweenness Centrality on Massive Datasets

A Faster Parallel Algorithm and Efficient Multithreaded Implementations for Evaluating Betweenness Centrality on Massive Datasets A Faster Parallel Algorithm and Efficient Multithreaded Implementations for Evaluating Betweenness Centrality on Massive Datasets Kamesh Madduri Computational Research Division Lawrence Berkeley National

More information

Two-Level Heaps: A New Priority Queue Structure with Applications to the Single Source Shortest Path Problem

Two-Level Heaps: A New Priority Queue Structure with Applications to the Single Source Shortest Path Problem Two-Level Heaps: A New Priority Queue Structure with Applications to the Single Source Shortest Path Problem K. Subramani 1, and Kamesh Madduri 2, 1 LDCSEE, West Virginia University Morgantown, WV 26506,

More information

Response Network Emerging from Simple Perturbation

Response Network Emerging from Simple Perturbation Journal of the Korean Physical Society, Vol 44, No 3, March 2004, pp 628 632 Response Network Emerging from Simple Perturbation S-W Son, D-H Kim, Y-Y Ahn and H Jeong Department of Physics, Korea Advanced

More information

A Fast Algorithm to Find All-Pairs Shortest Paths in Complex Networks

A Fast Algorithm to Find All-Pairs Shortest Paths in Complex Networks Available online at www.sciencedirect.com Procedia Computer Science 9 (2012 ) 557 566 International Conference on Computational Science, ICCS 2012 A Fast Algorithm to Find All-Pairs Shortest Paths in Complex

More information

Topic: Local Search: Max-Cut, Facility Location Date: 2/13/2007

Topic: Local Search: Max-Cut, Facility Location Date: 2/13/2007 CS880: Approximations Algorithms Scribe: Chi Man Liu Lecturer: Shuchi Chawla Topic: Local Search: Max-Cut, Facility Location Date: 2/3/2007 In previous lectures we saw how dynamic programming could be

More information

Comparison of Centralities for Biological Networks

Comparison of Centralities for Biological Networks Comparison of Centralities for Biological Networks Dirk Koschützki and Falk Schreiber Bioinformatics Center Gatersleben-Halle Institute of Plant Genetics and Crop Plant Research Corrensstraße 3 06466 Gatersleben,

More information

Complementary Graph Coloring

Complementary Graph Coloring International Journal of Computer (IJC) ISSN 2307-4523 (Print & Online) Global Society of Scientific Research and Researchers http://ijcjournal.org/ Complementary Graph Coloring Mohamed Al-Ibrahim a*,

More information

ARTICLE IN PRESS. JID:JDA AID:300 /FLA [m3g; v 1.81; Prn:3/04/2009; 15:31] P.1 (1-10) Journal of Discrete Algorithms ( )

ARTICLE IN PRESS. JID:JDA AID:300 /FLA [m3g; v 1.81; Prn:3/04/2009; 15:31] P.1 (1-10) Journal of Discrete Algorithms ( ) JID:JDA AID:300 /FLA [m3g; v 1.81; Prn:3/04/2009; 15:31] P.1 (1-10) Journal of Discrete Algorithms ( ) Contents lists available at ScienceDirect Journal of Discrete Algorithms www.elsevier.com/locate/jda

More information

Centrality Measures. Computing Closeness and Betweennes. Andrea Marino. Pisa, February PhD Course on Graph Mining Algorithms, Università di Pisa

Centrality Measures. Computing Closeness and Betweennes. Andrea Marino. Pisa, February PhD Course on Graph Mining Algorithms, Università di Pisa Computing Closeness and Betweennes PhD Course on Graph Mining Algorithms, Università di Pisa Pisa, February 2018 Centrality measures The problem of identifying the most central nodes in a network is a

More information

Exascale Analytics of Massive Social Networks David A. Bader

Exascale Analytics of Massive Social Networks David A. Bader Exascale Analytics of Massive Social Networks David A. Bader SIAM AN09 Minisymposium on HPC on Massive Real-World Graphs Session 1: Wednesday, July 8, 10:30 AM - 12:30 PM Exascale Analytics of Massive

More information

A new edge selection heuristic for computing the Tutte polynomial of an undirected graph.

A new edge selection heuristic for computing the Tutte polynomial of an undirected graph. FPSAC 2012, Nagoya, Japan DMTCS proc. (subm.), by the authors, 1 12 A new edge selection heuristic for computing the Tutte polynomial of an undirected graph. Michael Monagan 1 1 Department of Mathematics,

More information

Parallel Evaluation of Hopfield Neural Networks

Parallel Evaluation of Hopfield Neural Networks Parallel Evaluation of Hopfield Neural Networks Antoine Eiche, Daniel Chillet, Sebastien Pillement and Olivier Sentieys University of Rennes I / IRISA / INRIA 6 rue de Kerampont, BP 818 2232 LANNION,FRANCE

More information

The Buss Reduction for the k-weighted Vertex Cover Problem

The Buss Reduction for the k-weighted Vertex Cover Problem The Buss Reduction for the k-weighted Vertex Cover Problem Hong Xu Xin-Zeng Wu Cheng Cheng Sven Koenig T. K. Satish Kumar University of Southern California, Los Angeles, California 90089, the United States

More information

PULP: Fast and Simple Complex Network Partitioning

PULP: Fast and Simple Complex Network Partitioning PULP: Fast and Simple Complex Network Partitioning George Slota #,* Kamesh Madduri # Siva Rajamanickam * # The Pennsylvania State University *Sandia National Laboratories Dagstuhl Seminar 14461 November

More information

Structural Analysis of Paper Citation and Co-Authorship Networks using Network Analysis Techniques

Structural Analysis of Paper Citation and Co-Authorship Networks using Network Analysis Techniques Structural Analysis of Paper Citation and Co-Authorship Networks using Network Analysis Techniques Kouhei Sugiyama, Hiroyuki Ohsaki and Makoto Imase Graduate School of Information Science and Technology,

More information

On Covering a Graph Optimally with Induced Subgraphs

On Covering a Graph Optimally with Induced Subgraphs On Covering a Graph Optimally with Induced Subgraphs Shripad Thite April 1, 006 Abstract We consider the problem of covering a graph with a given number of induced subgraphs so that the maximum number

More information

CASS-MT Task #7 - Georgia Tech. GraphCT: A Graph Characterization Toolkit. David A. Bader, David Ediger, Karl Jiang & Jason Riedy

CASS-MT Task #7 - Georgia Tech. GraphCT: A Graph Characterization Toolkit. David A. Bader, David Ediger, Karl Jiang & Jason Riedy CASS-MT Task #7 - Georgia Tech GraphCT: A Graph Characterization Toolkit David A. Bader, David Ediger, Karl Jiang & Jason Riedy October 26, 2009 Outline Motivation What is GraphCT? Package for Massive

More information

Theory of Computing. Lecture 4/5 MAS 714 Hartmut Klauck

Theory of Computing. Lecture 4/5 MAS 714 Hartmut Klauck Theory of Computing Lecture 4/5 MAS 714 Hartmut Klauck How fast can we sort? There are deterministic algorithms that sort in worst case time O(n log n) Do better algorithms exist? Example [Andersson et

More information

Who s The Weakest Link?

Who s The Weakest Link? Who s The Weakest Link? Nikhil Devanur, Richard J. Lipton, and Nisheeth K. Vishnoi {nikhil,rjl,nkv}@cc.gatech.edu Georgia Institute of Technology, Atlanta, GA 30332, USA. Abstract. In this paper we consider

More information

Distances in power-law random graphs

Distances in power-law random graphs Distances in power-law random graphs Sander Dommers Supervisor: Remco van der Hofstad February 2, 2009 Where innovation starts Introduction There are many complex real-world networks, e.g. Social networks

More information

Community Structure Detection. Amar Chandole Ameya Kabre Atishay Aggarwal

Community Structure Detection. Amar Chandole Ameya Kabre Atishay Aggarwal Community Structure Detection Amar Chandole Ameya Kabre Atishay Aggarwal What is a network? Group or system of interconnected people or things Ways to represent a network: Matrices Sets Sequences Time

More information

CSCI 5454 Ramdomized Min Cut

CSCI 5454 Ramdomized Min Cut CSCI 5454 Ramdomized Min Cut Sean Wiese, Ramya Nair April 8, 013 1 Randomized Minimum Cut A classic problem in computer science is finding the minimum cut of an undirected graph. If we are presented with

More information

A SOCIAL NETWORK ANALYSIS APPROACH TO ANALYZE ROAD NETWORKS INTRODUCTION

A SOCIAL NETWORK ANALYSIS APPROACH TO ANALYZE ROAD NETWORKS INTRODUCTION A SOCIAL NETWORK ANALYSIS APPROACH TO ANALYZE ROAD NETWORKS Kyoungjin Park Alper Yilmaz Photogrammetric and Computer Vision Lab Ohio State University park.764@osu.edu yilmaz.15@osu.edu ABSTRACT Depending

More information

Betweness Centrality ENDRIAS KAHSSAY

Betweness Centrality ENDRIAS KAHSSAY Betweness Centrality ENDRIAS KAHSSAY What is it? The centrality of a vertex is the fraction of the shortest paths that go through. A measure how important a vertex is in a Graph. An undirected graph colored

More information

On the Approximability of Modularity Clustering

On the Approximability of Modularity Clustering On the Approximability of Modularity Clustering Newman s Community Finding Approach for Social Nets Bhaskar DasGupta Department of Computer Science University of Illinois at Chicago Chicago, IL 60607,

More information

CS 598CSC: Approximation Algorithms Lecture date: March 2, 2011 Instructor: Chandra Chekuri

CS 598CSC: Approximation Algorithms Lecture date: March 2, 2011 Instructor: Chandra Chekuri CS 598CSC: Approximation Algorithms Lecture date: March, 011 Instructor: Chandra Chekuri Scribe: CC Local search is a powerful and widely used heuristic method (with various extensions). In this lecture

More information

Heuristic Algorithms for Multiconstrained Quality-of-Service Routing

Heuristic Algorithms for Multiconstrained Quality-of-Service Routing 244 IEEE/ACM TRANSACTIONS ON NETWORKING, VOL 10, NO 2, APRIL 2002 Heuristic Algorithms for Multiconstrained Quality-of-Service Routing Xin Yuan, Member, IEEE Abstract Multiconstrained quality-of-service

More information

Reducing Directed Max Flow to Undirected Max Flow and Bipartite Matching

Reducing Directed Max Flow to Undirected Max Flow and Bipartite Matching Reducing Directed Max Flow to Undirected Max Flow and Bipartite Matching Henry Lin Division of Computer Science University of California, Berkeley Berkeley, CA 94720 Email: henrylin@eecs.berkeley.edu Abstract

More information

Applications of Geometric Spanner

Applications of Geometric Spanner Title: Name: Affil./Addr. 1: Affil./Addr. 2: Affil./Addr. 3: Keywords: SumOriWork: Applications of Geometric Spanner Networks Joachim Gudmundsson 1, Giri Narasimhan 2, Michiel Smid 3 School of Information

More information

CSCI5070 Advanced Topics in Social Computing

CSCI5070 Advanced Topics in Social Computing CSCI5070 Advanced Topics in Social Computing Irwin King The Chinese University of Hong Kong king@cse.cuhk.edu.hk!! 2012 All Rights Reserved. Outline Graphs Origins Definition Spectral Properties Type of

More information

CAIM: Cerca i Anàlisi d Informació Massiva

CAIM: Cerca i Anàlisi d Informació Massiva 1 / 72 CAIM: Cerca i Anàlisi d Informació Massiva FIB, Grau en Enginyeria Informàtica Slides by Marta Arias, José Balcázar, Ricard Gavaldá Department of Computer Science, UPC Fall 2016 http://www.cs.upc.edu/~caim

More information

On the Complexity of the Policy Improvement Algorithm. for Markov Decision Processes

On the Complexity of the Policy Improvement Algorithm. for Markov Decision Processes On the Complexity of the Policy Improvement Algorithm for Markov Decision Processes Mary Melekopoglou Anne Condon Computer Sciences Department University of Wisconsin - Madison 0 West Dayton Street Madison,

More information

arxiv:cs/ v1 [cs.ds] 20 Feb 2003

arxiv:cs/ v1 [cs.ds] 20 Feb 2003 The Traveling Salesman Problem for Cubic Graphs David Eppstein School of Information & Computer Science University of California, Irvine Irvine, CA 92697-3425, USA eppstein@ics.uci.edu arxiv:cs/0302030v1

More information

A General Class of Heuristics for Minimum Weight Perfect Matching and Fast Special Cases with Doubly and Triply Logarithmic Errors 1

A General Class of Heuristics for Minimum Weight Perfect Matching and Fast Special Cases with Doubly and Triply Logarithmic Errors 1 Algorithmica (1997) 18: 544 559 Algorithmica 1997 Springer-Verlag New York Inc. A General Class of Heuristics for Minimum Weight Perfect Matching and Fast Special Cases with Doubly and Triply Logarithmic

More information

MAXIMAL PLANAR SUBGRAPHS OF FIXED GIRTH IN RANDOM GRAPHS

MAXIMAL PLANAR SUBGRAPHS OF FIXED GIRTH IN RANDOM GRAPHS MAXIMAL PLANAR SUBGRAPHS OF FIXED GIRTH IN RANDOM GRAPHS MANUEL FERNÁNDEZ, NICHOLAS SIEGER, AND MICHAEL TAIT Abstract. In 99, Bollobás and Frieze showed that the threshold for G n,p to contain a spanning

More information

Worst-Case Utilization Bound for EDF Scheduling on Real-Time Multiprocessor Systems

Worst-Case Utilization Bound for EDF Scheduling on Real-Time Multiprocessor Systems Worst-Case Utilization Bound for EDF Scheduling on Real-Time Multiprocessor Systems J.M. López, M. García, J.L. Díaz, D.F. García University of Oviedo Department of Computer Science Campus de Viesques,

More information

An Experimental Investigation into the Rank Function of the Heterogeneous Earliest Finish Time Scheduling Algorithm

An Experimental Investigation into the Rank Function of the Heterogeneous Earliest Finish Time Scheduling Algorithm An Experimental Investigation into the Rank Function of the Heterogeneous Earliest Finish Time Scheduling Algorithm Henan Zhao and Rizos Sakellariou Department of Computer Science, University of Manchester,

More information

On the Min-Max 2-Cluster Editing Problem

On the Min-Max 2-Cluster Editing Problem JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 29, 1109-1120 (2013) On the Min-Max 2-Cluster Editing Problem LI-HSUAN CHEN 1, MAW-SHANG CHANG 2, CHUN-CHIEH WANG 1 AND BANG YE WU 1,* 1 Department of Computer

More information

1 The Traveling Salesperson Problem (TSP)

1 The Traveling Salesperson Problem (TSP) CS 598CSC: Approximation Algorithms Lecture date: January 23, 2009 Instructor: Chandra Chekuri Scribe: Sungjin Im In the previous lecture, we had a quick overview of several basic aspects of approximation

More information

Dithering and Betweenness Centrality in Weighted Graphs

Dithering and Betweenness Centrality in Weighted Graphs GlobalSIP 24: Network Theory Dithering and Betweenness Centrality in Weighted Graphs Santiago Segarra and Alejandro Ribeiro Department of Electrical and Systems Engineering, University of Pennsylvania

More information

arxiv: v1 [cs.ds] 12 May 2013

arxiv: v1 [cs.ds] 12 May 2013 Full Square Rhomboids and Their Algebraic Expressions Mark Korenblit arxiv:105.66v1 [cs.ds] 1 May 01 Holon Institute of Technology, Israel korenblit@hit.ac.il Abstract. The paper investigates relationship

More information

Characteristics of Preferentially Attached Network Grown from. Small World

Characteristics of Preferentially Attached Network Grown from. Small World Characteristics of Preferentially Attached Network Grown from Small World Seungyoung Lee Graduate School of Innovation and Technology Management, Korea Advanced Institute of Science and Technology, Daejeon

More information

Exact Algorithms Lecture 7: FPT Hardness and the ETH

Exact Algorithms Lecture 7: FPT Hardness and the ETH Exact Algorithms Lecture 7: FPT Hardness and the ETH February 12, 2016 Lecturer: Michael Lampis 1 Reminder: FPT algorithms Definition 1. A parameterized problem is a function from (χ, k) {0, 1} N to {0,

More information

Improved Algorithms for Detecting Negative Cost Cycles in Undirected Graphs

Improved Algorithms for Detecting Negative Cost Cycles in Undirected Graphs Improved Algorithms for Detecting Negative Cost Cycles in Undirected Graphs Xiaofeng Gu 1, Kamesh Madduri 2,, K. Subramani 3,, and Hong-Jian Lai 1 1 Department of Mathematics, West Virginia University,

More information

A Faster Algorithm for the Single Source Shortest Path Problem with Few Distinct Positive Lengths

A Faster Algorithm for the Single Source Shortest Path Problem with Few Distinct Positive Lengths A Faster Algorithm for the Single Source Shortest Path Problem with Few Distinct Positive Lengths James B. Orlin Sloan School of Management, MIT, Cambridge, MA {jorlin@mit.edu} Kamesh Madduri College of

More information

A New Parallel Algorithm for Connected Components in Dynamic Graphs. Robert McColl Oded Green David Bader

A New Parallel Algorithm for Connected Components in Dynamic Graphs. Robert McColl Oded Green David Bader A New Parallel Algorithm for Connected Components in Dynamic Graphs Robert McColl Oded Green David Bader Overview The Problem Target Datasets Prior Work Parent-Neighbor Subgraph Results Conclusions Problem

More information

Monotone Paths in Geometric Triangulations

Monotone Paths in Geometric Triangulations Monotone Paths in Geometric Triangulations Adrian Dumitrescu Ritankar Mandal Csaba D. Tóth November 19, 2017 Abstract (I) We prove that the (maximum) number of monotone paths in a geometric triangulation

More information

Graph Sampling Approach for Reducing. Computational Complexity of. Large-Scale Social Network

Graph Sampling Approach for Reducing. Computational Complexity of. Large-Scale Social Network Journal of Innovative Technology and Education, Vol. 3, 216, no. 1, 131-137 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/1.12988/jite.216.6828 Graph Sampling Approach for Reducing Computational Complexity

More information

Lecture and notes by: Nate Chenette, Brent Myers, Hari Prasad November 8, Property Testing

Lecture and notes by: Nate Chenette, Brent Myers, Hari Prasad November 8, Property Testing Property Testing 1 Introduction Broadly, property testing is the study of the following class of problems: Given the ability to perform (local) queries concerning a particular object (e.g., a function,

More information

Chapter 9 Graph Algorithms

Chapter 9 Graph Algorithms Introduction graph theory useful in practice represent many real-life problems can be if not careful with data structures Chapter 9 Graph s 2 Definitions Definitions an undirected graph is a finite set

More information

Bottleneck Steiner Tree with Bounded Number of Steiner Vertices

Bottleneck Steiner Tree with Bounded Number of Steiner Vertices Bottleneck Steiner Tree with Bounded Number of Steiner Vertices A. Karim Abu-Affash Paz Carmi Matthew J. Katz June 18, 2011 Abstract Given a complete graph G = (V, E), where each vertex is labeled either

More information

Objective of the present work

Objective of the present work Objective of the present work Optimize Road Network Optimize Airline Network Optimize Rail Network Optimize Computer Network Optimize Social Network Spanners and Emulators For most graph algorithms, the

More information

Special course in Computer Science: Advanced Text Algorithms

Special course in Computer Science: Advanced Text Algorithms Special course in Computer Science: Advanced Text Algorithms Lecture 8: Multiple alignments Elena Czeizler and Ion Petre Department of IT, Abo Akademi Computational Biomodelling Laboratory http://www.users.abo.fi/ipetre/textalg

More information

Example for calculation of clustering coefficient Node N 1 has 8 neighbors (red arrows) There are 12 connectivities among neighbors (blue arrows)

Example for calculation of clustering coefficient Node N 1 has 8 neighbors (red arrows) There are 12 connectivities among neighbors (blue arrows) Example for calculation of clustering coefficient Node N 1 has 8 neighbors (red arrows) There are 12 connectivities among neighbors (blue arrows) Average clustering coefficient of a graph Overall measure

More information

Randomized Algorithms 2017A - Lecture 10 Metric Embeddings into Random Trees

Randomized Algorithms 2017A - Lecture 10 Metric Embeddings into Random Trees Randomized Algorithms 2017A - Lecture 10 Metric Embeddings into Random Trees Lior Kamma 1 Introduction Embeddings and Distortion An embedding of a metric space (X, d X ) into a metric space (Y, d Y ) is

More information

Thwarting Traceback Attack on Freenet

Thwarting Traceback Attack on Freenet Thwarting Traceback Attack on Freenet Guanyu Tian, Zhenhai Duan Florida State University {tian, duan}@cs.fsu.edu Todd Baumeister, Yingfei Dong University of Hawaii {baumeist, yingfei}@hawaii.edu Abstract

More information

New Upper Bounds for MAX-2-SAT and MAX-2-CSP w.r.t. the Average Variable Degree

New Upper Bounds for MAX-2-SAT and MAX-2-CSP w.r.t. the Average Variable Degree New Upper Bounds for MAX-2-SAT and MAX-2-CSP w.r.t. the Average Variable Degree Alexander Golovnev St. Petersburg University of the Russian Academy of Sciences, St. Petersburg, Russia alex.golovnev@gmail.com

More information

Lecture Note: Computation problems in social. network analysis

Lecture Note: Computation problems in social. network analysis Lecture Note: Computation problems in social network analysis Bang Ye Wu CSIE, Chung Cheng University, Taiwan September 29, 2008 In this lecture note, several computational problems are listed, including

More information

Chapter 9 Graph Algorithms

Chapter 9 Graph Algorithms Chapter 9 Graph Algorithms 2 Introduction graph theory useful in practice represent many real-life problems can be if not careful with data structures 3 Definitions an undirected graph G = (V, E) is a

More information

Lecture 11: Clustering and the Spectral Partitioning Algorithm A note on randomized algorithm, Unbiased estimates

Lecture 11: Clustering and the Spectral Partitioning Algorithm A note on randomized algorithm, Unbiased estimates CSE 51: Design and Analysis of Algorithms I Spring 016 Lecture 11: Clustering and the Spectral Partitioning Algorithm Lecturer: Shayan Oveis Gharan May nd Scribe: Yueqi Sheng Disclaimer: These notes have

More information

Parameterized graph separation problems

Parameterized graph separation problems Parameterized graph separation problems Dániel Marx Department of Computer Science and Information Theory, Budapest University of Technology and Economics Budapest, H-1521, Hungary, dmarx@cs.bme.hu Abstract.

More information

Fast and Simple Algorithms for Weighted Perfect Matching

Fast and Simple Algorithms for Weighted Perfect Matching Fast and Simple Algorithms for Weighted Perfect Matching Mirjam Wattenhofer, Roger Wattenhofer {mirjam.wattenhofer,wattenhofer}@inf.ethz.ch, Department of Computer Science, ETH Zurich, Switzerland Abstract

More information

CS249: SPECIAL TOPICS MINING INFORMATION/SOCIAL NETWORKS

CS249: SPECIAL TOPICS MINING INFORMATION/SOCIAL NETWORKS CS249: SPECIAL TOPICS MINING INFORMATION/SOCIAL NETWORKS Overview of Networks Instructor: Yizhou Sun yzsun@cs.ucla.edu January 10, 2017 Overview of Information Network Analysis Network Representation Network

More information

Clustering Using Graph Connectivity

Clustering Using Graph Connectivity Clustering Using Graph Connectivity Patrick Williams June 3, 010 1 Introduction It is often desirable to group elements of a set into disjoint subsets, based on the similarity between the elements in the

More information

CS261: A Second Course in Algorithms Lecture #16: The Traveling Salesman Problem

CS261: A Second Course in Algorithms Lecture #16: The Traveling Salesman Problem CS61: A Second Course in Algorithms Lecture #16: The Traveling Salesman Problem Tim Roughgarden February 5, 016 1 The Traveling Salesman Problem (TSP) In this lecture we study a famous computational problem,

More information

Assessing and Safeguarding Network Resilience to Nodal Attacks

Assessing and Safeguarding Network Resilience to Nodal Attacks Assessing and Safeguarding Network Resilience to Nodal Attacks Pin-Yu Chen and Alfred O. Hero III, Fellow, IEEE Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor,

More information

REDUCING GRAPH COLORING TO CLIQUE SEARCH

REDUCING GRAPH COLORING TO CLIQUE SEARCH Asia Pacific Journal of Mathematics, Vol. 3, No. 1 (2016), 64-85 ISSN 2357-2205 REDUCING GRAPH COLORING TO CLIQUE SEARCH SÁNDOR SZABÓ AND BOGDÁN ZAVÁLNIJ Institute of Mathematics and Informatics, University

More information

Problem Set 4. General Instructions

Problem Set 4. General Instructions CS224W: Social and Information Network Analysis Fall 2013 Problem Set 4 Due 9:30am November 21, 2013 General Instructions These questions require thought, but do not require long answers. Please be as

More information

6 Randomized rounding of semidefinite programs

6 Randomized rounding of semidefinite programs 6 Randomized rounding of semidefinite programs We now turn to a new tool which gives substantially improved performance guarantees for some problems We now show how nonlinear programming relaxations can

More information

The Betweenness Centrality Of Biological Networks

The Betweenness Centrality Of Biological Networks The Betweenness Centrality Of Biological Networks Shivaram Narayanan Thesis submitted to the Faculty of the Virginia Polytechnic Institute and State University in partial fulfillment of the requirements

More information

Critical Node Detection Problem. Panos Pardalos Distinguished Professor CAO, Dept. of Industrial and Systems Engineering, University of Florida

Critical Node Detection Problem. Panos Pardalos Distinguished Professor CAO, Dept. of Industrial and Systems Engineering, University of Florida Critical Node Detection Problem ITALY May, 2008 Panos Pardalos Distinguished Professor CAO, Dept. of Industrial and Systems Engineering, University of Florida Outline of Talk Introduction Problem Definition

More information

CS521 \ Notes for the Final Exam

CS521 \ Notes for the Final Exam CS521 \ Notes for final exam 1 Ariel Stolerman Asymptotic Notations: CS521 \ Notes for the Final Exam Notation Definition Limit Big-O ( ) Small-o ( ) Big- ( ) Small- ( ) Big- ( ) Notes: ( ) ( ) ( ) ( )

More information

arxiv: v1 [cs.dm] 24 Sep 2012

arxiv: v1 [cs.dm] 24 Sep 2012 A new edge selection heuristic for computing the Tutte polynomial of an undirected graph. arxiv:1209.5160v1 [cs.dm] 2 Sep 2012 Michael Monagan Department of Mathematics, Simon Fraser University mmonagan@cecms.sfu.ca

More information

The strong chromatic number of a graph

The strong chromatic number of a graph The strong chromatic number of a graph Noga Alon Abstract It is shown that there is an absolute constant c with the following property: For any two graphs G 1 = (V, E 1 ) and G 2 = (V, E 2 ) on the same

More information

Optimizing Energy Consumption and Parallel Performance for Static and Dynamic Betweenness Centrality using GPUs Adam McLaughlin, Jason Riedy, and

Optimizing Energy Consumption and Parallel Performance for Static and Dynamic Betweenness Centrality using GPUs Adam McLaughlin, Jason Riedy, and Optimizing Energy Consumption and Parallel Performance for Static and Dynamic Betweenness Centrality using GPUs Adam McLaughlin, Jason Riedy, and David A. Bader Motivation Real world graphs are challenging

More information

Online algorithms for clustering problems

Online algorithms for clustering problems University of Szeged Department of Computer Algorithms and Artificial Intelligence Online algorithms for clustering problems Summary of the Ph.D. thesis by Gabriella Divéki Supervisor Dr. Csanád Imreh

More information

1 Introduction and Results

1 Introduction and Results On the Structure of Graphs with Large Minimum Bisection Cristina G. Fernandes 1,, Tina Janne Schmidt,, and Anusch Taraz, 1 Instituto de Matemática e Estatística, Universidade de São Paulo, Brazil, cris@ime.usp.br

More information

A Genetic Algorithm for Graph Matching using Graph Node Characteristics 1 2

A Genetic Algorithm for Graph Matching using Graph Node Characteristics 1 2 Chapter 5 A Genetic Algorithm for Graph Matching using Graph Node Characteristics 1 2 Graph Matching has attracted the exploration of applying new computing paradigms because of the large number of applications

More information

Induced-universal graphs for graphs with bounded maximum degree

Induced-universal graphs for graphs with bounded maximum degree Induced-universal graphs for graphs with bounded maximum degree Steve Butler UCLA, Department of Mathematics, University of California Los Angeles, Los Angeles, CA 90095, USA. email: butler@math.ucla.edu.

More information

Lecture Notes on Karger s Min-Cut Algorithm. Eric Vigoda Georgia Institute of Technology Last updated for Advanced Algorithms, Fall 2013.

Lecture Notes on Karger s Min-Cut Algorithm. Eric Vigoda Georgia Institute of Technology Last updated for Advanced Algorithms, Fall 2013. Lecture Notes on Karger s Min-Cut Algorithm. Eric Vigoda Georgia Institute of Technology Last updated for 4540 - Advanced Algorithms, Fall 2013. Today s topic: Karger s min-cut algorithm [3]. Problem Definition

More information

Dual-Based Approximation Algorithms for Cut-Based Network Connectivity Problems

Dual-Based Approximation Algorithms for Cut-Based Network Connectivity Problems Dual-Based Approximation Algorithms for Cut-Based Network Connectivity Problems Benjamin Grimmer bdg79@cornell.edu arxiv:1508.05567v2 [cs.ds] 20 Jul 2017 Abstract We consider a variety of NP-Complete network

More information

Improved algorithms for constructing fault-tolerant spanners

Improved algorithms for constructing fault-tolerant spanners Improved algorithms for constructing fault-tolerant spanners Christos Levcopoulos Giri Narasimhan Michiel Smid December 8, 2000 Abstract Let S be a set of n points in a metric space, and k a positive integer.

More information

Isometric Diamond Subgraphs

Isometric Diamond Subgraphs Isometric Diamond Subgraphs David Eppstein Computer Science Department, University of California, Irvine eppstein@uci.edu Abstract. We test in polynomial time whether a graph embeds in a distancepreserving

More information

HPCGraph: Benchmarking Massive Graph Analytics on Supercomputers

HPCGraph: Benchmarking Massive Graph Analytics on Supercomputers HPCGraph: Benchmarking Massive Graph Analytics on Supercomputers George M. Slota 1, Siva Rajamanickam 2, Kamesh Madduri 3 1 Rensselaer Polytechnic Institute 2 Sandia National Laboratories a 3 The Pennsylvania

More information

Challenges in large-scale graph processing on HPC platforms and the Graph500 benchmark. by Nkemdirim Dockery

Challenges in large-scale graph processing on HPC platforms and the Graph500 benchmark. by Nkemdirim Dockery Challenges in large-scale graph processing on HPC platforms and the Graph500 benchmark by Nkemdirim Dockery High Performance Computing Workloads Core-memory sized Floating point intensive Well-structured

More information

Multicasting in the Hypercube, Chord and Binomial Graphs

Multicasting in the Hypercube, Chord and Binomial Graphs Multicasting in the Hypercube, Chord and Binomial Graphs Christopher C. Cipriano and Teofilo F. Gonzalez Department of Computer Science University of California, Santa Barbara, CA, 93106 E-mail: {ccc,teo}@cs.ucsb.edu

More information

22 Elementary Graph Algorithms. There are two standard ways to represent a

22 Elementary Graph Algorithms. There are two standard ways to represent a VI Graph Algorithms Elementary Graph Algorithms Minimum Spanning Trees Single-Source Shortest Paths All-Pairs Shortest Paths 22 Elementary Graph Algorithms There are two standard ways to represent a graph

More information

Faster Betweenness Centrality Based on Data Structure Experimentation

Faster Betweenness Centrality Based on Data Structure Experimentation Available online at www.sciencedirect.com Procedia Computer Science 18 (2013 ) 399 408 International Conference on Computational Science, ICCS 2013 Faster Betweenness Centrality Based on Data Structure

More information

Path centrality: A new centrality measure in networks

Path centrality: A new centrality measure in networks University of South Florida Scholar Commons Graduate Theses and Dissertations Graduate School 2010 Path centrality: A new centrality measure in networks Tharaka Alahakoon University of South Florida Follow

More information

The Encoding Complexity of Network Coding

The Encoding Complexity of Network Coding The Encoding Complexity of Network Coding Michael Langberg Alexander Sprintson Jehoshua Bruck California Institute of Technology Email: mikel,spalex,bruck @caltech.edu Abstract In the multicast network

More information

Centrality in Large Networks

Centrality in Large Networks Centrality in Large Networks Mostafa H. Chehreghani May 14, 2017 Table of contents Centrality notions Exact algorithm Approximate algorithms Conclusion Centrality notions Exact algorithm Approximate algorithms

More information

arxiv: v1 [cs.dm] 21 Dec 2015

arxiv: v1 [cs.dm] 21 Dec 2015 The Maximum Cardinality Cut Problem is Polynomial in Proper Interval Graphs Arman Boyacı 1, Tinaz Ekim 1, and Mordechai Shalom 1 Department of Industrial Engineering, Boğaziçi University, Istanbul, Turkey

More information