Internet Measurement Huaiyu Zhu, Rim Kaddah CS538 Fall 2011

Similar documents
Feng Wang, Zhuoqing Morley Mao, Jia Wang, Lixin Gao, Randy Bush. Presenter s Qihong Wu (Dauphin)

A Measurement Study on the Impact of Routing Events on End-to-End Internet Path Performance

Watching Data Streams Toward a Multi-Homed Sink Under Routing Changes Introduced by a BGP Beacon

6.829 BGP Recitation. Rob Beverly September 29, Addressing and Assignment

The Impact of Router Outages on the AS-Level Internet

Finding a Needle in a Haystack: Pinpointing Significant BGP Routing Changes in an IP Network

Happy Packets: Some Initial Results

Computer Networks CS 552

Computer Networks CS 552

California Fault Lines: Understanding the Causes and Impact of Network Failures

Taming BGP. An incremental approach to improving the dynamic properties of BGP. Geoff Huston. CAIA Seminar 18 August

Module 1b IS-IS. Prerequisites: The setup section of Module 1. The following will be the common topology used for the first series of labs.

Module 14 Transit. Objective: To investigate methods for providing transit services. Prerequisites: Modules 12 and 13, and the Transit Presentation

Measuring and Characterizing IPv6 Router Availability

Configuring IP SLAs LSP Health Monitor Operations

Investigating occurrence of duplicate updates in BGP announcements

CS557 Spring 2013 Midterm Exam

Module 16 An Internet Exchange Point

AS Connectedness Based on Multiple Vantage Points and the Resulting Topologies

Table of Contents 1 System Maintaining and Debugging Commands 1-1

Ping, tracert and system debugging commands

Advanced Computer Networks

Module 10 An IPv6 Internet Exchange Point

Module 18 Transit. Objective: To investigate methods for providing transit services. Prerequisites: Modules 12 and 13, and the Transit Presentation

Measurement of Highly Active Prefixes in BGP

CS 557 Internet Routing Policies

Lab Guide 2 - BGP Configuration

Table of Contents 1 System Maintenance and Debugging Commands 1-1

C. The ESP that is installed in the Cisco ASR 1006 Router does not support SSO.

Cisco IOS Optimized Edge Routing Command Reference

Pathlet Routing. P. Brighten Godfrey, Igor Ganichev, Scott Shenker, and Ion Stoica SIGCOMM (maurizio patrignani)

TCP Nice: A Mechanism for Background Transfers

Measuring IPv6 Adoption

EXAM TCP/IP NETWORKING Duration: 3 hours

Low-Rate TCP-Targeted DoS Attack Disrupts Internet Routing

How does a router know where to send a packet next?

Achieving scale: Large scale active measurements from PlanetLab

Outline Computer Networking. Inter and Intra-Domain Routing. Internet s Area Hierarchy Routing hierarchy. Internet structure

Pathlet Routing P. Brighten Godfrey Igor Ganichev, Scott Shenker, and Ion Stoica

Introduction to IP Routing. Geoff Huston

IPv6 Module 16 An IPv6 Internet Exchange Point

COMPUTER NETWORK. Homework #3. Due Date: May 22, 2017 in class

NetPilot: Automating Datacenter Network Failure Mitigation

DailyCatch: A Provider-centric View of Anycast Behaviour

Routing(2) Inter-domain Routing

Lecture 11: WAN Routing Alternatives"

BGP Convergence in much less than a second

LARGE SCALE IP ROUTING

Chapter 5: Maintaining and Troubleshooting Routing Solutions

User-level Internet Path Diagnosis

Revealing the load-balancing behavior of YouTube traffic of interdomain links

CS 5114 Network Programming Languages Control Plane. Nate Foster Cornell University Spring 2013

Module 19 Internet Exchange Points

A Comparison of Overlay Routing and Intelligent BGP Route Control

COURSE OUTLINE: Course: CCNP Route Duration: 40 Hours

BGP Scaling Techniques

Stable Internet Route Selection

Introduction. Routing & Addressing: Multihoming 10/25/04. The two SIGCOMM papers. A Comparison of Overlay Routing and Multihoming Route Control

Announcements. CS 5565 Network Architecture and Protocols. Project 2B. Project 2B. Project 2B: Under the hood. Routing Algorithms

Border Gateway Protocol - BGP

Christos Papadopoulos

CS4450. Computer Networks: Architecture and Protocols. Lecture 15 BGP. Spring 2018 Rachit Agarwal

Chapter 7 Lab 7-1, Configuring BGP with Default Routing

Networking Review & Grand Challenges

Internet Routing Dynamics

Cisco Performance Routing

Cisco XR Series Service Separation Architecture Tests

BGP Scaling Techniques

Chapter 16 OSPF Version 3 Commands

! Distance vector routing! Link state routing.! Path vector routing! BGP: Border Gateway Protocol! Route aggregation

EXAM TCP/IP NETWORKING Duration: 3 hours With Solutions

Open Source Routing. OSPF Testplan. Revision Date Author Comment Martin Winter Initial draft

Flooding Attacks by Exploiting Persistent Forwarding Loops

In questo documento mettiamo in evidenza:

Primitives for Active Internet Topology Mapping: Toward High-Frequency Characterization

Analysis of Country-wide Internet Outages Caused by Censorship

COM-208: Computer Networks - Homework 6

BGP Convergence in Virtual Private Networks

Important Lessons From Last Lecture Computer Networking. Outline. Routing Review. Routing hierarchy. Internet structure. External BGP (E-BGP)

Dynamics of Hot-Potato Routing in IP Networks

RFC2547 Convergence Characterization and Optimization

MPLS VPN--Inter-AS Option AB

An overview on Internet Measurement Methodologies, Techniques and Tools

BGP Policy Lab - Partial Routing

Routing(2) Inter-domain Routing

Understanding Network Delay Changes Caused by Routing Events

Inter-domain Routing. Outline. Border Gateway Protocol

Strategies, approaches and ethical considerations

Network Layer (Routing)

Networking Review & Grand Challenges

Interdomain Routing. Networked Systems (H) Lecture 11

Chapter 4: outline. Network Layer 4-1

CS Networks and Distributed Systems. Lecture 8: Inter Domain Routing

Lecture outline. Internet Routing Security Issues. Previous lecture: Effect of MinRouteAdver Timer. Recap of previous lecture

Introduction to Computer Networks

IT Certification Exams Provider! Weofferfreeupdateserviceforoneyear! h ps://

Organization of Product Documentation... xi

Lessons learnt from the Beijing Olympic Games

Multihoming with BGP and NAT

CS 268: Computer Networking. Next Lecture: Interdomain Routing

Transcription:

Internet Measurement Huaiyu Zhu, Rim Kaddah CS538 Fall 2011

OUTLINE California Fault Lines: Understanding the Causes and Impact of Network Failures. Feng Wang, Zhuoqing Morley MaoJia Wang3, Lixin Gao and Randy Bush A Measurement Study on the Impact of Routing Events on End to End Internet Path Performance Daniel Turner, Kirill Levchenko, Alex C. Snoeren, and Stefan Savage.

OUTLINE California Fault Lines: Understanding the Causes and Impact of Network Failures. Feng Wang, Zhuoqing Morley MaoJia Wang3, Lixin Gao and Randy Bush A Measurement Study on the Impact of Routing Events on End to End Internet Path Performance Daniel Turner, Kirill Levchenko, Alex C. Snoeren, and Stefan Savage.

Why Study Failure Failure is a reality for large network Achieving high availability requires engineering the network to be robust to failure Designing mechanisms to effectively mitigate failures requires deep understanding of real failures

CENIC Network Serving California educational institutions Over 200 routers 5 years of data Three Types of Components: The Digital California (DC) network The High-Performance Research (HPR) network Customer-premises equipment (CPE)

Contribution Methodology to reconstruct historical failure events of CENIC network Using only commonly available data, No need for additional instrumentation Analyze the network based on failure measurement

Reconstruction What data are available to reconstruct a failure 4 years later? Syslog Describes interface state changes Router Configuration Files Maps interfaces to Links Operation announcements on mailing list Data are not intended for failure reconstruction!

Validation Internal consistency Using the administrator announcements to validate the event history reconstructed. External consistency CAIDA Skitter project (now Ark) validating UP. Route Views project validating DOWN.

Overview of Link Failures

Overview of Link Failures

Overview of Link Failures Vertical banding V1: a network-wide IS-IS configuration change requiring a router restart V2: a network-wide software upgrade V3: a network-wide configuration change in preparation for IPv6 Horizontal banding H1: a series of failures on a link between a core router and a County of Education office (hardware) H2: this link experienced over 33,000 short-duration failures (fiber cut)

CDFs of Individual Failure Events

Various Link Hardware Types

Cause of Failure

Failure Events

Summary Engineering for failure requires real data - Data has historically been difficult to obtain Methodology to perform historical failure analysis with low-quality data sources Shared our findings in the CENIC network - Reliability of individual components - Causes of failures - Impact of failure

OUTLINE California Fault Lines: Understanding the Causes and Impact of Network Failures. Feng Wang, Zhuoqing Morley MaoJia Wang3, Lixin Gao and Randy Bush A Measurement Study on the Impact of Routing Events on End to End Internet Path Performance Daniel Turner, Kirill Levchenko, Alex C. Snoeren, and Stefan Savage.

Key Questions How could routing events cause degraded end-to-end path performance? How topological properties and routing policies affect performance degradation?

Approach Study end-to-end performance under realistic topologies. Investigate several metrics to characterize the end-to-end loss, delay, and out-of-order packets. Characterize the kinds of routing changes that impact end-to-end path performance. Analyze the impact of topology, routing policies, MRAI timer and ibgp configurations on end-to-end path performance.

Experiment Methodology A multi-homed prefix BGP Beacon prefix: 192.83.230.0/24 Controlled Routing Changes Failover events: Beacon changes from the state of being connected to both providers to the state of being connected to a single provider. Recovery events: Beacon changes from the state of being connected to a single provider to the state of being connected to both providers. ISP1 ISP 2 ISP 1 ISP 2 ISP 1 ISP 2 Failover event Recovery event Beacon Beacon Beacon

Controlled Routing Changes 12 routing events every day 8 for beacon events: o Failover events o Recovery events 4 for resetting the Beacon Connectivity. Time schedule (GMT) for BGP Beacon routing transitions

Active Probing Goal: capture the impact of routing changes on the end-toend performance. From 37 PlanetLab hosts to the Beacon host (a host within the Beacon prefix). host B host A Internet ISP 1 ISP 2 Beacon host host C Three probing methods: - Back-to-back traceroutes - Back-to-back pings - UDP probing (50msec interval) Data Plane Performance metrics Pack loss Delay Out-of-order Active probing traceroute ping UDP probing

Packet Loss Loss burst: consecutive UDP probing packets lost during a routing change event. Failover Recovery

Packet Delay Roundtrip delays from the probe host to the Beacon host (clock skews problem when using one-way delays). Failover Recovery

Out-of-order Packets Number of reordering (number of packets out of order) Reordering offset Failover Recovery

How Routing Failures Occur (Failover)? Prefer-customer routing policy: routes received from a provider s customers are always preferred over those received from its peers. 0 R2 Provider 1 Provider 2 Peer link 0 R1 R3 0 2 0 0 1 0 R4 R6 0 R5 0 Customer link Beacon AS 0

How Routing Failures Occur (Failover)? (contd.) No-valley routing policy: peers do not transit traffic from one peer to another. Peer link R7 1 0 R8 Provider 3 1 0 2 0 R9 2 0 1 0 0 R2 0 R1 R3 0 2 0 Peer link R4 0 1 0 R6 0 R5 0 Provider 1 Provider 2 Beacon AS 0

How Routing Failures Occur? (Recovery) ibgp constraint: a route received from an ibgp router cannot be transited to another ibgp router 1. Path 0 R3 recovery. 2. R3 sends the path to R2 3. R2 sends a withdrawal R1 path (0) to R1 4. R3 sends the recovery path to R1 5. R1 regains its connection to the Beacon Withdraw (2 0) Provider 1 R3 Path (0) R2 0 Beacon AS 0 Provider 2 R4

Summary During failover and recovery events Routing events impact packet loss significantly. Routing failures contribute to end-to-end packet loss significantly. Routing events can lead to long packet round-trip delays and reordering Routing policies and ibgp configuration play a major role in causing packet loss during routing events.

Discussion How could we prevent packet loss during path exploration? Would storing an alternative path in each router be a good idea? What are the downsides? How could we exploit the previous results to improve endto-end performance? How realistic could we consider the topology in the second paper?

References Feng Wang, Zhuoqing Morley MaoJia Wang, Lixin Gao and Randy Bush. A Measurement Study on the Impact of Routing Events on End-to-End Internet Path Performance. SIGCOMM 2006. Feng Wang, Zhuoqing Morley MaoJia Wang, Lixin Gao and Randy Bush. Presentation on SIGCOMM 2006. Daniel Turner, Kirill Levchenko, Alex C. Snoeren, and Stefan Savage. California Fault Lines. SIGCOMM 2010. Daniel Turner, Kirill Levchenko, Alex C. Snoeren, and Stefan Savage. Presentation on SIGCOMM 2010.