Charm: A charming network coordinate system

Similar documents
Vivaldi Practical, Distributed Internet Coordinates

Internet Coordinate Systems Tutorial

A Hierarchical Approach to Internet Distance Prediction

Vivaldi: A Decentralized Network Coordinate System. Authors: Frank Dabek, Russ Cox, Frans Kaashoek, Robert Morris MIT. Published at SIGCOMM 04

On the Utility of Inference Mechanisms

Pharos: A Decentralized and Hierarchical Network Coordinate System for Internet Distance Prediction

Attacks on Phoenix: A Virtual Coordinate System

Vivaldi: : A Decentralized Network Coordinate System

NetForecast: A Delay Prediction Scheme for Provider Controlled Networks

The Frog-Boiling Attack: Limitations of Anomaly Detection for Secure Network Coordinate Systems

Scalable Route Selection for IPv6 Multihomed Sites

Vivaldi Practical, Distributed Internet Coordinates

Experimental Study on Neighbor Selection Policy for Phoenix Network Coordinate System

Supporting Network Coordinates on PlanetLab

Vivaldi: A Decentralized Coordinate System

Vivaldi: A Decentralized Network Coordinate System

Stable and Accurate Network Coordinates

PlanetLab Deployment, Comparison and Analysis of Network Coordinate Systems

A view from inside a distributed Internet coordinate system

On Suitability of Euclidean Embedding of Internet Hosts

Modeling Distances in Large-Scale Networks by Matrix Factorization

Technical Report. An implementation of a coordinate based location system. David R. Spence. Number 576. November Computer Laboratory

Phoenix: Towards an Accurate, Practical and Decentralized Network Coordinate System

Tarantula: Towards an Accurate Network Coordinate System by Handling Major Portion of TIVs

Securing Virtual Coordinates by Enforcing Physical Laws

Building a low-latency, proximity-aware DHT-based P2P network

Improving Prediction Accuracy of Matrix Factorization Based Network Coordinate Systems

Geometric Exploration of the Landmark Selection Problem

A Reputation-Based Approach for Securing Vivaldi Embedding System

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, VOL. 24, NO. 12, DECEMBER

PIC: Practical Internet Coordinates for Distance Estimation

Pharos: accurate and decentralised network coordinate system Y. Chen 1 Y. Xiong 2 X. Shi 1 J. Zhu 3 B. Deng 1 X. Li 1

Exploiting Heterogeneity for Collective Data Downloading in Volunteer-based Networks

Fundamental Effects of Clustering on the Euclidean Embedding of Internet Hosts

HyCube: A distributed hash table based on a hierarchical hypercube geometry

Re-imagining Vivaldi with HTML5/WebGL. PhD Candidate: Djuro Mirkovic Supervisors: Prof. Grenville Armitage and Dr. Philip Branch

PIC: Practical Internet Coordinates for Distance Estimation

Differentiating Link State Advertizements to Optimize Control Overhead in Overlay Networks

Tuning Vivaldi: Achieving Increased Accuracy and Stability

Wired Geometric Routing

A Survey on Research on the Application-Layer Traffic Optimization (ALTO) Problem

Building Large-scale Distributed Systems with Network Coordinates

The Frog-Boiling Attack: Limitations of Secure Network Coordinate Systems

Network Positioning from the Edge

Exploiting Heterogeneity for Collective Data Downloading in Volunteer-based Networks

Visualizing Weighted Edges in Graphs

Coordinate-based Routing:

Vivaldi: A Decentralized Network Coordinate System

Comparing the performance of distributed hash tables under churn

Decentralized Pairwise Bandwidth Prediction

MISHIMA: Multilateration of Internet hosts hidden using malicious fast-flux agents (Short Paper)

Distributed Hash Table

Towards a Two-Tier Internet Coordinate System to Mitigate the Impact of Triangle Inequality Violations

Towards a Two-Tier Internet Coordinate System to Mitigate the Impact of Triangle Inequality Violations

Quasi-Chord: physical topology aware structured P2P network

An Optimal Median Calculation Algorithm for Estimating Internet Link Delays From Active Measurements.

[Kleinberg04] J. Kleinberg, A. Slivkins, T. Wexler. Triangulation and Embedding using Small Sets of Beacons. Proc. 45th IEEE Symposium on Foundations

Efficient Formation of Edge Cache Groups for Dynamic Content Delivery

Bridge-Node Selection and Loss Recovery in Island Multicast

Constructing Internet Coordinate System Based on Delay Measurement

Network Coordinates in the Wild

Landmark-based routing

Estimating Network Proximity and Latency

Hybrid Overlay Structure Based on Random Walks

Minerva: Learning to Infer Network Path Properties

Topology Optimization in Hybrid Tree/Mesh-based Peer-to-Peer Streaming System

Relative Network Positioning via CDN Redirections

Veracity: Practical Secure Network Coordinates via Vote-Based Agreements

PAPER A Proximity-Based Self-Organizing Hierarchical Overlay Framework for Distributed Hash Tables

A Hybrid Peer-to-Peer Architecture for Global Geospatial Web Service Discovery

Joint Server Selection and Routing for Geo-Replicated Services

Understanding Chord Performance

Survey of DHT Evaluation Methods

KoNKS: Konsensus-style Network Koordinate System

L-CAN: Locality Aware Structured Overlay for P2P Live Streaming

NTS6: IPv6 based Network Topology Service System of CERNET2

Intelligent Neighbor Selection in P2P with CapProbe and Vivaldi

Effective File Replication and Consistency Maintenance Mechanism in P2P Systems

Work in Progress: Detecting Triangle Inequality Violations in Internet Coordinate Systems by Supervised Learning

Querying Shortest Distance on Large Graphs

Towards Optimal Data Replication Across Data Centers

Securing Application-Level Topology Estimation Networks: Facing the Frog-Boiling Attack

Scalability In Peer-to-Peer Systems. Presented by Stavros Nikolaou

Exploring Proximity Based Peer Selection in a BitTorrent-like Protocol

Should we build Gnutella on a structured overlay? We believe

Approximate Shortest Distance Computing: A Query-Dependent Local Landmark Scheme

The performance of locality-aware topologies for peer-to-peer live streaming

Path Optimization in Stream-Based Overlay Networks

Network Delay Model for Overlay Network Application

Peer-to-Peer Networks Pastry & Tapestry 4th Week

PinPoint: A Ground-Truth Based Approach for IP Geolocation

Semester Project Network Mappings

Clustering algorithms and introduction to persistent homology

R/Kademlia: Recursive and Topology-aware Overlay Routing

Low-stress data embedding in the hyperbolic plane using multidimensional scaling

Greedy Virtual Coordinates for Geographic Routing

Load balancing in Content Delivery Networks in Novel Distributed Equilibrium

Evaluating DHT-Based Service Placement for Stream-Based Overlays

ProRenaTa: Proactive and Reactive Tuning to Scale a Distributed Storage System

Spray and forward: Efficient routing based on the Markov location prediction model for DTNs

Transcription:

Charm: A charming network coordinate system Çağatay Demiralp December 5, 27 Abstract We present a new network coordinate system, Charm, embedding round-trip time latencies into the Euclidean space. Charm is based on a particle system model with two spatial force interactions: attraction and repulsion. The update algorithm (Charming) that Charm uses is simple and lightweight: For each latency sample, the algorithm either brings the communicating nodes closer (attraction) or moves them farther (repulsion). We evaluate Charm by comparing with Vivaldi algorithm using simulations. Preliminary results suggests that Charm is fast and stable, and has a good converging rate. Introduction Methods that allow hosts to predict round-trip times (RTT) to other hosts in a distributed network can be useful in reducing communication overhead and increasing the utility of the network resources (i.e., bandwidth). There have been several schemes proposed for embedding the measured latencies into a metric space in order to create synthetic coordinates. Unfortunately most of these schemes do not produce stable and accurate coordinates under live network conditions. There are two reasons for this under-performance. First, RTT measurements are prone to random noise which makes modeling the variations between among RTT measurements very difficult. The second reason is more fundamental: While most of the existing network coordinate (NC) systems rely on metric embedding techniques, RTT measurements may not satisfy triangular inequality due to routing policies. This situation exacerbates with the presence of random noise. One of the approaches proposed to mitigate these problems is to use filters on the histories of measurements and coordinates [5, 3]. Charm should be seen as an attempt to develop a simple and stable NC system. Our algorithm considers two basic force interactions between hosts. At each latency sample, the algorithm either attracts the communicating nodes to each other or repels them from each other. In the following sections, we first give an overview of related work (section 2) and move on to discussing the A metric space is a pair of a point set X and a distance function d defined over X such that for all x i, x j, x k X, d(x i, x j ) = d(x j, x i ) (symmetry), d(x i, x j ) = x i = x j (positive definiteness) and d(x i, x k ) d(x i, x j ) + d(x j, x k ) (triangle inequality), are all true.

details of Charm system (section 3). We report the results from our simulations in section 4. And finally, we offer our conclusions and point some future work directions in section 5. 2 Previous Work Our method can be considered a simulation-based technique in a sense that we assume a model of a physical system and its equilibrium as the energy minimizing state. There have been several such methods proposed. For example, Vivaldi [2] is based on a simulation of springs, which we use as the baseline algorithm in our experiments to evaluate Charm. Similarly, Big-Bang Simulation [9] uses particle explosion interactions to determine network coordinates. At the other end of the spectrum of NC research, there are landmark-based systems. In landmark-based systems, distance between two nodes are computed indirectly via predetermined, fixed nodes. For example, GNP determines the coordinates of a node by measuring its latency to a fixed set of landmarks and then solving a nonlinear minimization problem iteratively [7]. To address some of the computational issues (i.e, being expensive) related to GNP, Virtual Landmarks [6] and ICS [] propose using PCA for linear approximation to the minimization problem. One of the potential drawbacks of using fixed landmarks is that it is not clear what happens in case of failure of these fixed nodes. In addition, there is a need to optimize the number and spatial configuration of the landmarks to avoid unbalanced workload. Concerned with these issues, PCoord [] and PIC [] use a hybrid approach, where landmarks are used only for bootstrapping; After bootstrapping, nodes adjust their coordinates with respect to the coordinates of peers. In a somewhat different approach, Lighthouse [8] assumes that shape of the network is a manifold and uses local coordinates on locally flat patches to create a global coordinate system. Not all NC systems use RTT measurements as their distance metric. For example, landmark-based IDMaps [4] use dissimilarities between the IP addresses of the hosts as the metric to be predicted. Note that all the methods we discussed here embed nodes into a Euclidean coordinate space. 3 Charm Charm is an NC coordinate system where nodes are modeled as particles with spatial force interactions: attraction and repulsion. Charm uses the same single algorithm, Charming, for continuously updating node coordinates both centralized and distributed settings. The algorithm corrects a constant fraction of the error between the measured and the current distance by either repelling or attracting the nodes. Algorithm shows Charming algorithm which takes new latency measurement l ij and the node j coordinates x j as parameters and computes the new x i and x j. There is only one constant (a correction fraction) c, which we set to. for all the experiments that we show results for. 2

Algorithm Charming(l ij, x j ). e = l ij x i x j 2 {difference between the current and measured distance} 2. d = 2 e c {length of repulsion/attraction} 3. x i = x i + d xj xi x j x i 2 {repel/attract x i } 4. x j = x j + d xi xj x i x j 2 {repel/attract x j } Since we compare Charming algorithm to both centralized and adaptive Vivaldi algorithms we give them here for completeness (see Algorithms 3 and 2). Further details for these algorithms can be found in [2]. The main difference between Charming and Vivaldi is that Charming adjusts the node coordinates bi-directionally. Note that mass-spring model used by Vivaldi is a particle system as well. We set the adaptive Vivaldi constants c e and c c to. and.25 throughout our comparisons. Algorithm 2 Centralized Vivaldi(l, x). while error(l, x) > ǫ do 2. for all i do 3. F = 4. for all j do 5. e = l ij x i x j 2 6. F = F + e xi xj x i x j 2 7. x i = x i + t F F 2 8. end for 9. end for. end while Algorithm 3 Vivaldi(l ij, x i, x j, e j ). w = ei e i+e j 2. e s = xi xj 2 lij l ij 3. e i = e s c e w + e i ( c e w) 4. δ = c c w 5. x i = x i + δ (l ij x i x j 2 ) xj xi x j x i 2 {adjust x i } 3

4 Experiments We evaluate the performance of Charming using about 3 nodes with RTT measurements sampled on a 2D regular grid. The spatial configuration of the nodes according to the latencies is shown in Figure 2a (a 3D view is given for better view of the layout). Initial values for coordinates are determined by sampling from a multivariate normal distribution (X N(µ, Σ) where µ = [ ] T and Σ = diag([ ] T ). In our experiments, we consider three cases: centralized embedding, new node accommodation, and distributed embedding. For each of the three cases Charming performs the best. Figure shows the evolution of the nodes for the centralized and adaptive Vivaldi (where δ constant changes adaptively), and Charming algorithms for a centralized coordinate update scenario. Figure 2b show the change of mean square error for each algorithm. 3.46 secs 3 3.78 secs 6 6.56 secs 2 2 4 2 2 2 2 4 3 3 2 2 3 3 3 2 2 3 6 5 4 3 2 2 3 4 5 5.25 secs 5 3.33 secs 4 6.94 secs 3 5 5 2 5 5 2 2 25 25 3 3 2 35 35 3 4 4 4 45 25 2 5 5 5 5 2 25 45 3 2 2 3 5 5 4 3 2 2 3 4 5 5.469 secs 3 3.25 secs 25 6.469 secs 2 5 2 5 5 5 5 5 2 2 5 25 2 3 25 2 5 5 5 5 2 25 3 25 2 5 5 5 5 2 25 25 25 2 5 5 5 5 2 25 Figure : Evolution of the nodes (for the toy network layout see Figure 2a when centralized Vivaldi (st row), adaptive Vivaldi (2nd row) and Charming (3rd row) algorithms are used in a centralized fashion. See Figure 2b for convergence and mean square error rates. 4

.5 6 5 x 4 mean square error charming adaptive vivaldi centralized vivaldi 4.5 mse 3 2 2 2 2 2 2 3 4 5 6 7 8 time (s) (a) (b) Figure 2: (a) shows the layout of our toy network where latencies (RTTs) are sampled on a regular grid for simulation purposes. There are about 3 nodes in the network. (b) Mean square error values from the simulation (shown in Figure comparing the centralized Vivaldi, adaptive Vivaldi, and Charming algorithms for a centralized NC computation. Notice the oscillatory behavior of adaptive Vivaldi algorithm. In Figure 4, we compare how well the adaptive Vivaldi and Charming algorithms accommodate the new nodes entering the network. The orange-colored nodes in Figure 4a represent the new nodes (coordinates of which are initialized from N(µ, Σ)) entering to a network with stabilized coordinates (shown as blue-colored nodes in the same figure). As seen Figures 4and 4b, Charming converges and stabilizes quick in this case as well. As the last case, we compare Charming and adaptive Vivaldi in a simulation where nodes measure latencies to every other node randomly (see Figure 4). While Charming still performs significantly better than Vivaldi, Vivaldi performed better than it did in the previous settings. This suggests that oscillations in Vivaldi could be due to localized measurements. 5 Conclusions and Future Work We presented Charm, a new network coordinate system using spatial force interactions of attraction and repulsion to embed measured latencies into a Euclidean space. The initial results show that the update algorithm (Charming) of this system converges faster and it is more stable than Vivaldi algorithm. However, without further evaluation on real and simulated network environments it would be premature to claim that these would scale to large networks and real world domains. In this study, we confirm that Vivaldi is very sensitive to parameter setting, initial conditions and prone to oscillations. The fact that adaptive Vivaldi worked better with random measurements from the complete set of the nodes could be important. This might suggest that oscillations in Vivaldi could be due to localized measurements, which is the most likely situation, for example, in the Internet. Combining Vivaldi with a landmark-based system (i.e., 5

4 35 3 25 2 5 5 5 3.33 secs 3 3.469 secs 4 6.78 secs 2 2 3 2 2 2 2 3 3 3 4 4 4 35 3 25 2 5 5 5 4 3 2 2 3 4 25.25 secs 25 3.56 secs 25 6.469 secs 2 2 2 5 5 5 5 5 5 5 5 5 5 5 5 2 2 2 25 3 25 2 5 5 5 5 25 3 25 2 5 5 5 5 2 25 3 25 2 5 5 5 5 2 Figure 3: Evolution of nodes after new nodes enter the system (for the configuration of the stable and entering nodes see Figure 4a) when adaptive Vivaldi (st row) and Charming (2nd row) algorithms are used. See Figure 4b for convergence and mean square error rates. GNP) could mitigate the problems related to oscillations. In the future, we would like to explore the behavior of Charming in distributed settings further. Since it will be easy to modify existing simulation-based NC algorithms to apply Charming-like interactions, we would like to experiment with hybrid algorithms similar to the one shown in Algorithm 4. Algorithm 4 Charming Vivaldi(l ij, x i, x j, e j ). w = ei e i+e j 2. e s = xi xj 2 lij l ij 3. e i = e s c e w + e i ( c e w) 4. δ = c c w 5. x i = x i + δ (l ij x i x j 2 ) xj xi x j x i 2 {repel/attract x i } 6. x j = x j + δ (l ij x i x j 2 ) xi xj x i x j 2 {repel/attract x j } 6

3.5 x 4 3 mean square error Charming adaptive Vivaldi.5.5 mse 2.5 2.5 2 2 2 2.5 2 3 4 5 6 7 8 time (s) (a) (b) Figure 4: (a) shows the stable nodes (in blue) and the nodes that are about to join the network (in orange). Latencies (RTTs) for the new nodes are sampled on a regular grid extended from the stable grid embedding of the existing nodes for simulation purposes. There are about 5 existing and 5 new nodes in the network. (b) Mean square error values from the simulation (shown in Figure 4 comparing adaptive Vivaldi and Charming algorithms in how well the new nodes are accommodated. Notice again the oscillatory behavior of adaptive Vivaldi algorithm. 4.33 secs 3 3.56 secs 3 6.56 secs 3 2 2 2 2 2 2 3 3 2 2 3 4 3 4 3 2 2 3 3 4 3 2 2 3.94 secs 3 3.469 secs 3 6.46 secs 25 2 2 2 5 5 5 2 2 5 2 3 2 2 3 3 3 2 2 3 3 3 2 2 3 Figure 5: Evolution of the nodes (see the network layout in Figure 2a when the adaptive Vivaldi (st row) and Charming (2nd row) algorithms are used in a distributed setting. See Figure 4 for convergence and mean square error rates. 7

References [] Manuel Costa, Miguel Castro, Antony Rowstron, and Peter Key. Pic: Practical internet coordinates for distance estimation. In ICDCS 4: Proceedings of the 24th International Conference on Distributed Computing Systems (ICDCS 4), pages 78 87, Washington, DC, USA, 24. IEEE Computer Society. [2] Frank Dabek, Russ Cox, Frans Kaashoek, and Robert Morris. Vivaldi: A decentralized network coordinate system. In Proceedings of the ACM SIGCOMM 4 Conference, Portland, Oregon, August 24. [3] C. de Launois, S. Uhlig, and O. Bonaventure. A stable and distributed network coordinate system. Technical report, Universite Catholique de Louvain, December 24. [4] Paul Francis, Sugih Jamin, Cheng Jin, Yixin Jin, Danny Raz, Yuval Shavitt, and Lixia Zhang. Idmaps: a global internet host distance estimation service. IEEE/ACM Trans. Netw., 9(5):525 54, 2. [5] Jonathan Ledlie, Peter Pietzuch, and Margo Seltzer. Stable and accurate network coordinates. In ICDCS 6: Proceedings of the 26th IEEE International Conference on Distributed Computing Systems, page 74, Washington, DC, USA, 26. IEEE Computer Society. [6] Hyuk Lim, Jennifer C. Hou, and Chong-Ho Choi. Constructing internet coordinate system based on delay measurement. In IMC 3: Proceedings 6 x 4 mean square error 5 Charming adaptive Vivaldi 4 mse 3 2 2 4 6 8 2 time (s) Figure 6: Mean square error values from the simulation (shown in Figure 4 comparing adaptive Vivaldi and Charming algorithms for a distributed NC computation. Note that, although it is still significantly slower than Charming, adaptive Vivaldi converges much faster than it did in the centralized computation. This result with the randomized nature of our distributed network simulation suggests that oscillations caused by Vivaldi could be due to localized measurements. This might also explain the relatively poor performance of Vivaldi in the Internet where most of the measurements are likely to come from a close neighbor instead of a random node of the complete network. 8

of the 3rd ACM SIGCOMM conference on Internet measurement, pages 29 42, New York, NY, USA, 23. ACM Press. [7] T. S. Eugene Ng and Hui Zhang. Predicting internet network distance with coordinates-based approaches. In INFOCOM, 22. [8] Marcelo Pias, Jon Crowcroft, Steve R. Wilbur, Tim Harris, and Saleem N. Bhatti. Lighthouses for scalable distributed location. In IPTPS, pages 278 29, 23. [9] Yuval Shavitt and Tomer Tankel. Big-bang simulation for embedding network distances in euclidean space. IEEE/ACM Trans. Netw., 2(6):993 6, 24. [] Liying Tang and Mark Crovella. Virtual landmarks for the internet. In IMC 3: Proceedings of the 3rd ACM SIGCOMM conference on Internet measurement, pages 43 52, New York, NY, USA, 23. ACM Press. [] Li wei Lehman and Steven Lerman. Pcoord: Network position estimation using peer-to-peer measurements. In NCA 4: Proceedings of the Network Computing and Applications, Third IEEE International Symposium on (NCA 4), pages 5 24, Washington, DC, USA, 24. IEEE Computer Society. 9