The missing links in the BGP-based AS connectivity maps

Similar documents
arxiv:cs/ v4 [cs.ni] 13 Sep 2004

(Social) Networks Analysis III. Prof. Dr. Daning Hu Department of Informatics University of Zurich

Complex networks: A mixture of power-law and Weibull distributions

Characterizing and Modelling Clustering Features in AS-Level Internet Topology

Analysis of Internet Topologies

An Empirical Study of Routing Bias in Variable-Degree Networks

Plan of the lecture I. INTRODUCTION II. DYNAMICAL PROCESSES. I. Networks: definitions, statistical characterization, examples II. Modeling frameworks

BGP Routing: A study at Large Time Scale

Complex Networks. Structure and Dynamics

Wednesday, March 8, Complex Networks. Presenter: Jirakhom Ruttanavakul. CS 790R, University of Nevada, Reno

BOSAM: A Tool for Visualizing Topological Structures Based on the Bitmap of Sorted Adjacent Matrix

POWER-LAWS AND SPECTRAL ANALYSIS OF THE INTERNET TOPOLOGY

NIT: A New Internet Topology Generator

On the Characteristics of BGP Routes

Science foundation under Grant No , DARPA award N , TCS Inc., and DIMI matching fund DIM

Structural Analysis of Paper Citation and Co-Authorship Networks using Network Analysis Techniques

On the Evaluation of AS Relationship Inferences

arxiv:cond-mat/ v1 [cond-mat.dis-nn] 3 Aug 2000

Modelling data networks research summary and modelling tools

Anatomy of a Large European IXP

Characteristics of Preferentially Attached Network Grown from. Small World

Attack Vulnerability of Network with Duplication-Divergence Mechanism

Strategies, approaches and ethical considerations

Impact of Topology on the Performance of Communication Networks

THE DEPENDENCY NETWORK IN FREE OPERATING SYSTEM

Statistical Analysis of the Metropolitan Seoul Subway System: Network Structure and Passenger Flows arxiv: v1 [physics.soc-ph] 12 May 2008

On the Importance of Local Connectivity for Internet Topology Models

CS-E5740. Complex Networks. Scale-free networks

Exercise set #2 (29 pts)

Topologies and Centralities of Replied Networks on Bulletin Board Systems

AS Connectedness Based on Multiple Vantage Points and the Resulting Topologies

Social and Technological Network Analysis. Lecture 6: Network Robustness and Applica=ons. Dr. Cecilia Mascolo

On the Origin of Power Laws in Internet Topologies Λ

Response Network Emerging from Simple Perturbation

On characterizing BGP routing table growth

Universal Behavior of Load Distribution in Scale-free Networks

Network Thinking. Complexity: A Guided Tour, Chapters 15-16

Summary: What We Have Learned So Far

Peer-to-Peer Streaming

Global dynamic routing for scale-free networks

On Reshaping of Clustering Coefficients in Degreebased Topology Generators

An Evolving Network Model With Local-World Structure

Scalable overlay Networks

Dynamics of Feedback-induced Packet Delay in Power-law Networks

Lesson 4. Random graphs. Sergio Barbarossa. UPC - Barcelona - July 2008

On Complex Dynamical Networks. G. Ron Chen Centre for Chaos Control and Synchronization City University of Hong Kong

Degree Distribution, and Scale-free networks. Ralucca Gera, Applied Mathematics Dept. Naval Postgraduate School Monterey, California

Supply chains involve complex webs of interactions among suppliers, manufacturers,

Failure in Complex Social Networks

AS Router Connectedness Based on Multiple Vantage Points and the Resulting Topologies

Congestion Propagation among Routers in the Internet

Properties of Biological Networks

Complex Network Metrology

Structure of biological networks. Presentation by Atanas Kamburov

1. Performance Comparison of Interdependent and Isolated Systems

Tearing Down the Internet

Overlay (and P2P) Networks

Exploiting the Scale-free Structure of the WWW

The Shape of the Internet. Slides assembled by Jeff Chase Duke University (thanks to Vishal Misra and C. Faloutsos)

Examples of Complex Networks


arxiv: v1 [cs.si] 2 Sep 2016

A Bandwidth-Broker Based Inter-Domain SLA Negotiation

Dynamic network generative model

Critical Phenomena in Complex Networks

On the Growth of Internet Application Flows: A Complex Network Perspective

Reducing Large Internet Topologies for Faster Simulations

Higher order clustering coecients in Barabasi Albert networks

Analysis and Comparison of Internet Topology Generators

Introduction to network metrics

Onset of traffic congestion in complex networks

M.E.J. Newman: Models of the Small World

Random Generation of the Social Network with Several Communities

On Routing Table Growth

Immunization for complex network based on the effective degree of vertex

Evolution and Enhancement of BitTorrent Network Topologies

Topological Resilience Analysis of Supply Networks under Random Disruptions and Targeted Attacks

Importance of IP Alias Resolution in Sampling Internet Topologies

Example for calculation of clustering coefficient Node N 1 has 8 neighbors (red arrows) There are 12 connectivities among neighbors (blue arrows)

BRITE: An Approach to Universal Topology Generation

SELF-HEALING NETWORKS: REDUNDANCY AND STRUCTURE

Impact of Multi-Access Links on the Internet Topology Modeling

Necessary backbone of superhighways for transport on geographical complex networks

arxiv: v1 [cs.ni] 13 Jul 2008

FITNESS-BASED GENERATIVE MODELS FOR POWER-LAW NETWORKS

Comparing static and dynamic measurements and models of the Internet s AS topology

of the Internet graph?

Dynamic Design of Cellular Wireless Networks via Self Organizing Mechanism

An Investigation into the Free/Open Source Software Phenomenon using Data Mining, Social Network Theory, and Agent-Based

arxiv:cond-mat/ v1 [cond-mat.dis-nn] 14 Sep 2005

An Exploratory Journey Into Network Analysis A Gentle Introduction to Network Science and Graph Visualization

From Centrality to Temporary Fame: Dynamic Centrality in Complex Networks

Fractal small-world dichotomy in real-world networks

Graph Model Selection using Maximum Likelihood

Lecture 4. Does the Internet have an Achilles' heel?

Cascading failures in complex networks with community structure

On-line Estimation of Power System Security Limits

Comparing static and dynamic measurements and models of the Internet s topology

COMMUNITY SHELL S EFFECT ON THE DISINTEGRATION OF SOCIAL NETWORKS

Analyzing the Characteristics of Gnutella Overlays

Transcription:

The missing links in the BGP-based AS connectivity maps Zhou, S; Mondragon, RJ http://arxiv.org/abs/cs/0303028 For additional information about this publication click this link. http://qmro.qmul.ac.uk/xmlui/handle/123456789/13070 Information about this research object was correct at the time of download; we occasionally make corrections to records, please therefore check the published record when citing. For more information contact scholarlycommunications@qmul.ac.uk

The missing links in the BGP-based AS connectivity maps Shi Zhou and Raul J. Mondragon Abstract A number of recent studies of the Internet topology at the autonomous systems level (AS graph) are based on the BGP-based AS connectivity maps (original maps). The so-called extended maps use additional data sources and contain more complete pictures of the AS graph. In this paper, we compare an original map, an extended map and a synthetic map generated by the Barabási-Albert model. We examine the recently reported rich-club phenomenon, alternative routing paths and attack tolerance. We point out that the majority of the missing links of the original maps are the connecting links between rich nodes (nodes with large numbers of links) of the extended maps. We show that the missing links are relevant because links between rich nodes can be crucial for the network structure. Index Terms Internet, topology, modeling, networks. I. INTRODUCTION number of recent studies [1], [2], [5], [6], [7], [8], [9] of Athe Internet topology at the autonomous systems level (AS graph) are based on the BGP-based AS connectivity maps ( original maps) [3] [13] [14]. These studies include the discovery of the power laws by Faloutsos et al [1] and the error and attack tolerance of the Internet by Barabási et al [8]. Chen et al [4] produced more complete AS maps (extended maps), which are constructed using additional data sources and typically have 20-50% or even more links than the original maps. (As we are considering only connectivity in this paper, we do not differentiate between in-degree links and out-degree links.) In this paper we compare an original map, an extended map and a synthetic map generated by the commonly used simulator of Internet topology, the Barabási-Albert (BA) model [2]. We examine the recently reported rich-club phenomenon [10], alternative routing paths and attack tolerance. We find out that the extended map shows a strong rich-club phenomenon, in which a small number of nodes with large numbers of links (rich nodes) are significantly better connected to each other comparing to the original map and the. Shi Zhou and Raul J. Mondragon are with the Department of Electronic Engineering, Queen Mary, University of London, Mile End Road, London, E1 4NS, United Kingdom. Email: {shi.zhou, r.j.mondragon}@elec.qmul.ac.uk This research is supported by the U.K. Engineering and Physical Sciences Research Council (EPSRC) under grant no. GR-R30136-01. The main contribution of this paper is to point out that the majority of the missing links of the original maps are the connecting links between rich nodes of the extended maps. We show that the missing links are relevant, because links between rich nodes can be crucial for the network structure. These results can be useful when choosing additional data sources to effectively improve the completeness of the AS maps and also useful for those modeling the Internet topology and studying its behavior. II. ORIGINAL AND EXTENDED AS MAPS Faloutsos et al [1] showed that the Internet topology at the autonomous systems level (AS graph) has a power law degree distribution P(k) ~ k y, where degree k is the number of links a node has. Barabási et al [2] showed that a power law degree distribution could arise from two generic mechanisms: 1) growth, where networks expand continuously by the addition of new nodes, and 2) preferential attachment, where new nodes are attached preferentially to nodes that are already well connected. The Barabási-Albert (BA) model (or so-called scale-free model) has generated great interests in various research areas. Many modifications of the have been introduced ever since. And the has been applied to the research on error and attack tolerance of the Internet [8]. The above studies of the power laws and the tolerance of the Internet are mainly based on the BGP-based AS connectivity maps, the original maps, which contain connectivity information of the AS graph and are constructed with the BGP routing tables collected by the Oregon route server (routeviews.oregon-ix.net) [3] [13] [14]. The Oregon route server connects to several operational routers within the Internet for the purpose of collecting BGP routing tables. Chen et al [5] showed that AS maps constructed solely from Oregon route server data contain only a portion of AS connectivity on the Internet. They constructed the extended maps [4] of the AS graph using additional data sources, such as the Internet Routing Registry (IRR) data and the Looking Glass (LG) data. The IRR maintains individual ISP s (Internet Service Provider) routing information in several public repositories to coordinate global routing policy. The Looking Glass sites are maintained by individual ISPs to help troubleshoot Internet-wide routing problems. The original maps typically miss 20-50% or even more of the physical links

in the extended maps of the AS graph. The degree distributions of the extended maps are heavily tailed, and they deviate from a strict power law. III. TOPOLOGICAL DIFFERENCES We compare an original map, an extended map and a synthetic map generated by the. The original map and the extended map are measured on May 26 th, 2001. As shown in Table I, the extended map and the original map have similar numbers of nodes and imum degrees, whereas the extended map has 40% more links than the original map. The has the similar size of the extended map, but it has a significantly smaller number of imum degree. We study the topological differences of the three maps by examining the rich-club phenomenon, alternative routing paths and attack tolerance. A. Rich-club phenomenon Power law topologies, such as the AS graph and the BA model, show the property that a small number of nodes have large numbers of links. We call these nodes rich nodes. Recently it was reported [10] that the Internet topology shows a rich-club phenomenon, in which rich nodes are very well connected to each other. Power law topologies can show significantly different degrees of the rich-club phenomenon. In this section we use the rich-club phenomenon to distinguish the structure differences between the networks. 1) Rich-club Coefficient To measure how well nodes are connected to each other, we defined the rich-club coefficient. The imum possible number of links that m nodes can have is m(m-1)/2. The node rank r is the rank of a node on a list sorted in a decreasing order of node degree, and r is normalized by the total number of nodes. The rich-club coefficient ϕ (r) is defined as the ratio of the actual number of links over the imum possible number of links between nodes with node rank less than r. Fig. 1 is a plot of the rich-club coefficient ϕ (r) against node rank r on a logarithmic scale. The plot shows that rich nodes of the two AS maps are significantly better connected to each other than those of the. And rich nodes of the extended map are better connected to each other than those of the original map. For example, ϕ (1%)=32% of the extend map means that the top 1% rich nodes have 32% of imum possible number of links, comparing with ϕ (1%)=17% of the original map and ϕ (1%)=5% of the. 2) Link distribution We define l(r i, r j ) as the number of links connecting nodes with node rank r i and r j, where node rank are divided into 5% bins and r i < r j. Fig. 2 is a 3D plot of l(r i, r j ) against corresponding node rank r i and r j. In all the three power law topologies, the majority of links are links connecting the top 5% rich nodes l(r i <5%). However, the extended map has a TABLE I NETWORK PROPERTIES Properties N 11174 11461 11461 L 23409 32730 34363 k average 4.2 5.7 6 k 2389 2432 329 N total number of nodes. L total number of links. k average average degree. k imum degree. 100% 0.0 ϕ (r) 10% 1.0 1% 2.0 01% 3.0 0.01% 4.0 0.1% 3 1% 2 10% 1 100% 0 r l(r i, r j ) 100% Fig. 1 Rich-club coefficientϕ (r) against node rank r 80% 60% r j Extended Map Original Map 40% 20% 5% 100% 80% 60% Fig. 2. Number of link l (r i, r j ) against node rank r i and r j 40% r i 8919 3795 5% 20%

TABLE II ALTERNATIVE ROUTING PATHS Properties Original Map Extended Map BA Model 3638 7150 100 average 5 23 0.12 10 4.0 4 10 3.0 3 10 2 2.0 5506 8474 683 average 129 207 1.42 imum triangle coefficient. average average triangle coefficient. imum rectangle coefficient. average average rectangle coefficient. 10 0.0 0 0 1 2 3 4 10 0 10 1 10 2 10 3 10 4 significant larger number of connecting links between the top 5% rich nodes l(r i <5%, r j <5%). This phenomenon that rich nodes are very well connected with each other is called the rich-club phenomenon. By comparison, the original map does not show the phenomenon as strong as the extended map. Whereas the does not show this phenomenon at all, instead, the top 5% rich nodes are connected to all nodes with similar probabilities regardless of their ranks. Fig. 2 also shows that the majority of the missing links of the original maps are the connecting links between rich nodes of the extended maps. B. Alternative Routing Paths In a topology, a triangle connects a node and two of its neighbor nodes. A rectangle connects a node, two of its neighbor nodes and one of its neighbor s neighbor nodes. The more triangles and rectangles a network has, the more possible alternative routing paths it can have. The triangle coefficient of a node is defined as the number of triangles the node has, and the rank of triangle coefficient r() is defined as the rank of a node on a list sorted in a decreasing order of the triangle coefficient. The rectangle coefficient of a node is defined as the number of rectangles the node has, and the rank of rectangle coefficient rank r() is defined as the rank of a node on a list sorted in a decreasing order of the rectangle coefficient. Fig. 3, Fig. 4 and Table II show that the two AS maps have significantly larger numbers of triangles and rectangles than the. And the extended map has more triangles and rectangles than the original map. C. Attack tolerance Barabási et al [8] shows that it is difficult to divide power law topologies into separate sub networks by removing nodes at random (error), but it is very easy to split them into sub networks by removing specific nodes (attack). To simulate an attack we first remove the best-connected node, and continue selecting and removing nodes in decreasing order of their degree k. In fact, this is equivalent of removing the members of the rich-club. Fig. 5 is a plot of the size of the largest cluster S, shown as a fraction of the total system size, against f the fraction of the nodes being removed in the attack mode. The figure shows that the extended map and the original 1 0.8 0.6 S 0.4 0.2 10 1 1.0 104.0 4 103.0 3 10 2 2.0 10 1 1.0 10 0 Fig. 3. The triangle coefficient against the triangle coefficient rank r() r() 0.0 0 1 2 3 4 10 0 10 1 10 2 10 3 10 4 r() Fig. 4. The rectangle coefficient against the rectangle coefficient rank r() 0 0 0.05 0.1 0.15 0.2 0.25 0.3 f Fig. 5. The size of the largest cluster, S, against f, the fraction of the nodes being removed in the attack mode. map are extremely vulnerable to attack. When the top 5% rich nodes are removed, both AS maps collapse into small pieces. The original map is more fragile than the extended map under

attack. Comparing with the two AS maps, the shows a higher degree of attack tolerance. We have measured the above properties for the AS maps measured on other dates, the main results are still valid. IV. DISCUSSION AND CONCLUSION A. The topological role of the missing links The extended map shows a rich-club phenomenon stronger than the original map. The majority of the missing links of the original map are the connecting links between rich nodes of the extended map. These links make the rich nodes of the extended map better connected to each other and result in a stronger rich-club phenomenon. As seen from the comparison between the original map and the extended map, the missing links are relevant, because links between rich nodes can be crucial for the network structure. B. Mapping the Internet An explanation of the incompleteness of the original map from a topological perspective is that, because the missing links are mostly the redundant links between rich nodes, they are likely to be the back-up routing paths in the actual Internet. Consequently they are difficult to be fully captured by BGP routing tables, which only provide local views of the global picture. It is important to keep measuring the Internet, because we still do not know if there are other missing links which are of fundamental importance for the description of the network topology. Our result can be useful when choosing additional data sources to effectively improve the completeness of the AS maps. C. Modeling the Internet The has much more links than the original map, but it does not show a rich-club phenomenon at all. Rich nodes of the are connected to all nodes with similar probabilities. This suggests that more links do not necessarily result in the rich-club phenomenon. This result also shows that the topological structure of the is fundamentally different from the AS graph. The rich-club phenomenon illustrates that networks obeying the same power law degree distribution can have different topological structures. This phenomenon is useful when distinguishing power law topologies and evaluating their generators. Recently we have introduced the Interactive Growth (IG) model of the Internet topology. This simple and dynamic model is a modification of the and based on the rich-club phenomenon. The IG model compares favorably with other Internet power law topology generators [11]. D. Redundancy and robustness The extended map has much more links than the original map and these links increase the network redundancy by forming alternative routing paths. Since the rich nodes of the extended map become more dominant in the network, the attack tolerance of the network is only slightly reinforced. This suggests that improved redundancy does not necessarily result in improved robustness. The has a similar size of the extended map and it has only a few triangles and rectangles. Nevertheless, the BA model shows a significantly higher degree of attack tolerance than the two AS maps. This illustrates that different topological structures have a great impact on network property. V. FUTURE WORK Other AS connectivity data sources, such as the RIPE routing information service [12], should be explored to evaluate our result and improve the AS maps. Also our result on attack tolerance shows that a small number of rich nodes are very important for the connectivity of the Internet. It would be interesting to investigate if a change of the local routing policy of one of these nodes has a major impact on the global network. REFERENCES [1] C. Faloutsos, P. Faloutsos, and M. Faloutsos, On Power-Law Relationships of the Internet Topology, in Proceedings of the ACM SIGCOMM 1999, pp. 251-262, Sept. 1999. [2] A.-L. Barabási and R. Albert, Emergence of Scaling in Random Networks, Science, pp. 509 512, Oct. 1999. [3] Routeviews, Route Views archive, http://archive.routeviews.org. [4] Q. Chen, H. Chang, R. Govindan, S. Jamin, S. J. Shenker and W. Willinger, The Origin of Power Laws in Internet Topologies (Revisited), in Proceedings of the IEEE INFOCOM 2002, June 2002. [5] L. A. N. Amaral, A. Scala, M. Barthelemy, and H. E. Stanley, Classes of Small-World Networks, in Proceedings of the National Academy of Sciences, vol. 97, no. 21, October 2000. [6] D. J. Watts and S. H. Strogatz, Collective Dynamics of Small- World Networks, Nature, vol. 363, pp. 202 204, 1998. [7] R. Albert and A.-L. Barabási, Topology of Evolving Networks: Local Events and Universality, Physical Review Letters, vol. 85, pp5234-5237, 2000. [8] R. Albert, H. Jeong, and A.-L. Barabási, Error and attack tolerance of complex networks, Science, pp. 378 381, July 2000. [9] D. S. Callaway, M. E. J. Newman, S. H. Strogatz, and D. J. Watts, Network Robustness and Fragility: Percolation on Random Graphs, Physical Review Letters, vol. 85, no. 25, pp. 5468, 2000. [10] S. Zhou and R. J. Mondragon, The rich-club phenomenon in the Internet topology, IEEE Comm Letters (submitted), Nov. 2002. [11] S. Zhou and R. J. Mondragon, Towards Modelling The Internet Topology The Interactive Growth Model, ITC18 (submitted), 2003. [12] The RIPE routing information service, http://www.ripe.net/ris. [13] National Laboratory for Applied Network Research (NLANR), Route Views archive, http://moat.nlanr.net/routing/rawdata. [14] Packet Clearing House (PCH), Route Views archive, http://www.pch.net/documents/data/routing-tables.