User-level Internet Path Diagnosis

Similar documents
Available Bandwidth Estimation. Probing Packet Train in Pathneck. Transmission of RPT. Choke Point Detection. Packet train probing

User-level Internet Path Diagnosis

Table of Contents 1 System Maintaining and Debugging 1-1

ECE 697J Advanced Topics in Computer Networks

Reverse Traceroute. NSDI, April 2010 This work partially supported by Cisco, Google, NSF

TVA: A DoS-limiting Network Architecture L

A Suite of Schemes for User-level Network Diagnosis without Infrastructure

Table of Contents 1 System Maintaining and Debugging Commands 1-1

TCP Nice: A Mechanism for Background Transfers

Table of Contents 1 System Maintenance and Debugging Commands 1-1

Configuring Cisco IOS IP SLA Operations

Path Diagnosis with IPMP

ELEC / COMP 177 Fall Some slides from Kurose and Ross, Computer Networking, 5 th Edition

Configure IP SLA Tracking for IPv4 Static Routes on an SG550XG Switch

Problems of IP. Unreliable connectionless service. Cannot acquire status information from routers and other hosts

Configuring Cisco IOS IP SLAs Operations

Lecture 16: Network Layer Overview, Internet Protocol

ICMP (Internet Control Message Protocol)

Lecture 19: Network Layer Routing in the Internet

Network Layer (4): ICMP

MPLS Ping and Traceroute for BGP and IGP Prefix-SID

Measuring and Characterizing IPv6 Router Availability

Lessons learnt from the Beijing Olympic Games

Configuring Cisco IOS IP SLAs Operations

Ping, tracert and system debugging commands

DoS Attacks. Network Traceback. The Ultimate Goal. The Ultimate Goal. Overview of Traceback Ideas. Easy to launch. Hard to trace.

Using Segment Routing OAM

ICMP (Internet Control Message Protocol)

A simple coordination mechanism for interdomain routing

Assignment 1 (2011/12)

Xiapu Luo, Edmond W. W. Chan, Rocky K. C. Chang Department of Computing The Hong Kong Polytechnic University

Programming Assignment 3: Transmission Control Protocol

Network layer: Overview. Network layer functions IP Routing and forwarding NAT ARP IPv6 Routing

CPSC 826 Internetworking. The Network Layer: Routing & Addressing Outline. The Network Layer

Network layer: Overview. Network Layer Functions

Network Working Group Request for Comments: Juniper Networks D. Meyer Sprint September 2003

ICMP Traceback Messages

CS 5114 Network Programming Languages Control Plane. Nate Foster Cornell University Spring 2013

Introduction Problems and Improvements RR DLP DisCarte Results. DisCarte. A Disjunctive Internet Cartographer. Sebastian Schlunke

Lessons learnt from the Beijing Olympic Games Website Measurement

HP 5120 SI Switch Series

Active Measurement of Data-Path Quality in a Non-cooperative Internet

Cisco CCNA Basic IP Routing Part I

Contents. Ping, tracert, and system debugging commands 1 debugging 1 display debugging 2 ping 2 ping ipv6 5 tracert 7 tracert ipv6 9

Basic Cisco Router Configuration: Multiple Routers

MESSAGES error-reporting messages and query messages. problems processes IP packet specific information

Using ping, tracert, and system debugging

The Internet Measurement Toolbox. Justine Sherry, University of University of Puget Sound April 12, 2010

Each ICMP message contains three fields that define its purpose and provide a checksum. They are TYPE, CODE, and CHECKSUM fields.

CS 457 Lecture 11 More IP Networking. Fall 2011

CS459 Internet Measurements

Segment Routing MPLS OAM Support

IP SLAs QFP Time Stamping

Contents. Ping, tracert, and system debugging commands 1. debugging 1 display debugging 1 ping 2 ping ipv6 5 tracert 7 tracert ipv6 10

A Non-cooperative Active Measurement Technique for Estimating the Average and Variance of the One-way Delay

I Commands. iping, page 2 iping6, page 4 itraceroute, page 5 itraceroute6 vrf, page 6. itraceroute vrf encap vxlan, page 12

Measurement: Techniques, Strategies, and Pitfalls. David Andersen CMU

Lecture 10: Addressing

The Internetworking Problem. Internetworking. A Translation-based Solution

Operational Security Capabilities for IP Network Infrastructure

Outline. Internet. Router. Network Model. Internet Protocol (IP) Design Principles

Table of Contents 1 Static Routing Configuration 1-1

Flexible NetFlow - Top N Talkers Support

Shim6: Reference Implementation and Optimization

HP 5820X & 5800 Switch Series Network Management and Monitoring. Configuration Guide. Abstract

Lab Troubleshooting Using traceroute Instructor Version 2500

PfR Voice Traffic Optimization Using Active Probes

Segment Routing MPLS OAM Support

Reverse traceroutes. Presented by F.E. Bustamante

Lab Assignment 4 ECE374 Spring Posted: 03/22/18 Due: 03/29/18. In this lab, we ll take a quick look at the ICMP and IP protocols.

Module 7 Internet And Internet Protocol Suite

Internet Protocol. Outline Introduction to Internet Protocol Header and address formats ICMP Tools CS 640 1

Design and development of the reactive BGP peering in softwaredefined routing exchanges

HP A6600 Routers Network Management and Monitoring. Command Reference. Abstract

CS 640 Introduction to Computer Networks Spring 2009

Outline. SC/CSE 3213 Winter Sebastian Magierowski York University. ICMP ARP DHCP NAT (not a control protocol) L9: Control Protocols

IP - The Internet Protocol. Based on the slides of Dr. Jorg Liebeherr, University of Virginia

Experiment No: Group B_4

Communications Software. CSE 123b. CSE 123b. Spring Lecture 2: Internet architecture and. Internetworking. Stefan Savage

Efficient Internet Routing with Independent Providers

Introduction to Computer Networks

Mohammad Hossein Manshaei 1393

Network Layer. The Network Layer. Contents Connection-Oriented and Connectionless Service. Recall:

Network Layer. Recall: The network layer is responsible for the routing of packets The network layer is responsible for congestion control

Internet Measurement Huaiyu Zhu, Rim Kaddah CS538 Fall 2011

Operation Manual IP Addressing and IP Performance H3C S5500-SI Series Ethernet Switches. Table of Contents

CS268: Beyond TCP Congestion Control

Chapter 4 Network Layer: The Data Plane. Part A. Computer Networking: A Top Down Approach

Our Narrow Focus Computer Networking Security Vulnerabilities. Outline Part II

SCRIPT: An Architecture for IPFIX Data Distribution

Chapter 12 Network Protocols

Table of Contents 1 Static Routing Configuration 1-1

Introduction to Internetworking

Computer Networks CS 552

Computer Networks CS 552

CSE 461. TCP and network congestion

6.033 Computer System Engineering

Diagnosis of TCP Overlay Connection Failures using Bayesian Networks

Configuring IP SLAs LSP Health Monitor Operations

CSC 401 Data and Computer Communications Networks

Transcription:

User-level Internet Path Diagnosis Ratul Mahajan Neil Spring David Wetherall Thomas Anderson University of Washington

Diagnosing performance of Internet paths is hard X X Multiple administrative domains operators may be equally clueless Policy routing The Internet as a black box asymmetric paths (round trip tools such as ping don t work well) path to intermediate routers may not be a prefix of the end-toend path to the destination Performance may depend on the application packet size, inter-packet spacing, protocol, port number,.. ratul sosp 2003 user-level internet path diagnosis 2

Our goal is user-level diagnosis Diagnosis: identify and localize performance faults that impact applications loss, reordering, queuing delay, User-level: without privileged access to routers useful for both end users and network operators Diagnosis is useful (even if you cannot fix yourself) transparency will lead to faster problem resolution intelligently route around the fault ratul sosp 2003 user-level internet path diagnosis 3

Existing diagnosis tools have limitations ping X X reverse path conflation implicates the wrong link ping/traceroute/pathchar measure round trip path to routers path asymmetry conflates forward and reverse paths Effective diagnosis requires router support beyond packet reflection ratul sosp 2003 user-level internet path diagnosis 4

Approach and outline Architecture what minimal support is needed to enable user-level diagnosis in Internet-like networks? Build practical tools tulip Explore Internet evolution to improve diagnostic support ratul sosp 2003 user-level internet path diagnosis 5

An architecture for path diagnosis Start with an ideal solution routers log all packets they forward users diagnose their paths through trace analysis complete but impractical Reduce to a practical architecture 1. all routers on the path embed diagnostic info in packets timing, flow counters, and path information 2. the source samples one router to embed diagnostic info ratul sosp 2003 user-level internet path diagnosis 6

An architecture for path diagnosis (2) Lightweight, in-band packet marking almost as powerful as the complete path trace field sampler timestamp flow counter path signature comments selects the sampling router local time at the sampling router # of pkts processed for this flow to detect path changes Timing, flow counters and path information provide effective diagnostic support ratul sosp 2003 user-level internet path diagnosis 7

Approach and outline Architecture what minimal support is needed to enable user-level diagnosis in Internet-like networks? Build practical tools tulip Explore Internet evolution to improve diagnostic support ratul sosp 2003 user-level internet path diagnosis 8

Overview of tulip tulip Measuring forward path to routers is the basic building block Localizes reordering, queuing and loss (so far) single-ended: works from a host to an arbitrary IP address Infers link properties by subtracting path properties path to router should be a prefix of the end-to-end forward path ratul sosp 2003 user-level internet path diagnosis 9

Queuing on the forward path ICMP timestamp request ICMP timestamp reply Source time Router ICMP timestamps are used to access router s clock [cing] 1 ms resolution; supported by over 90% routers prefix path property may not hold Queuing inferred from delay variation Engineering clock calibration, response generation time ratul sosp 2003 user-level internet path diagnosis 10

Control Data Control Loss on the forward path X X id id+1 id+2 no loss Loss measurements use the IP identifier field in IP packets over 70% of routers implement IP-ID as a counter common counter for all probing sources Unambiguous detection of forward path loss for data packets when control responses get consecutive IP-IDs Robust to response rate-limiting at the routers id id+1 unambiguous forward data loss id X id+x ambiguous data loss direction ratul sosp 2003 user-level internet path diagnosis 11

Experimental evaluation of tulip What is the resolution of fault localization? diagnosis granularity Is it accurate? end-to-end correctness consistency (monotonic increase along the path) ratul sosp 2003 user-level internet path diagnosis 12

Diagnosis granularity of tulip tulip X Granularity: uncertainty in the location of the fault when a router does not support the required features when probes take a non-prefix path to a router ratul sosp 2003 user-level internet path diagnosis 13

Diagnosis granularity of tulip (2) Median is 2 hops for loss and 4 hops for queuing ICMP timestamp probes do not have the prefix path property tulip X Round trip probing can further improve diagnosis granularity ratul sosp 2003 user-level internet path diagnosis 14

Experimental evaluation of tulip What is the resolution of fault localization? diagnosis granularity Is it accurate? end-to-end correctness internal consistency (monotonic increase along the path) ratul sosp 2003 user-level internet path diagnosis 15

Consistency along the path (queuing) median queuing delay to intermediate routers in an example path Tulip s one-way measurements are consistent Round trip measurements are polluted by reverse path conflation ratul sosp 2003 user-level internet path diagnosis 16

Consistency along the path (queuing) one-way (tulip) round trip (ping) queuing delta = delay at the far end delay at the near end Tulip s one-way measurements are consistent Round trip measurements are polluted by reverse path conflation ratul sosp 2003 user-level internet path diagnosis 17

Tulip in action Washington Michigan Harvard LBL Stanford Northwestern Virginia Equinix-Ashburn Texas Tulip can help build more scalable network monitoring and overlay routing systems ratul sosp 2003 user-level internet path diagnosis 18

Approach and outline Architecture what minimal support is needed to enable user-level diagnosis in Internet-like networks? Build practical tools tulip a tool to diagnose reordering, loss, and queuing delay Explore Internet evolution to improve diagnostic support ratul sosp 2003 user-level internet path diagnosis 19

Recall: an architecture for path diagnosis Lightweight, in-band packet marking almost as powerful as the complete path trace field sampler timestamp flow counter path signature comments selects the sampling router local time at the sampling router # of pkts processed for this flow to detect path changes Timing, flow counters and path information provide effective diagnostic support ratul sosp 2003 user-level internet path diagnosis 20

Tulip approximates the architecture in the Internet sampler timestamp flow counter path signature TTL or router s IP addr ICMP timestamps IP-ID counter hop count Approximations (and tulip) have limitations measurement probes are out-of-band ICMP timestamp issues (next slide) IP-ID counter is shared path changes can go undetected Moving the Internet towards the architecture improves diagnostic support identify small changes with big benefits ratul sosp 2003 user-level internet path diagnosis 21

Better timing information Problems: timing information is separate from flow counters ICMP timestamps require directly addressing the router routing issues reduces their value Simple fix: timestamp TTL-expired messages backwards compatible, incrementally deployable use 32 unused bits in the TTL-expired messages ratul sosp 2003 user-level internet path diagnosis 22

Better counter support Problem: IP-ID is a shared counter what if all of you start using tulip? the architecture suggests per-flow counters Simple fix: maintain N (constant) counters hash source address and probe IP-ID to pick the counter backwards compatible, incrementally deployable (today, N=1) ratul sosp 2003 user-level internet path diagnosis 23

Summary Tulip enables end users to diagnose Internet paths co-opts router support by exploiting well-deployed router features http://www.cs.washington.edu/research/networking/tulip Architectural arguments: features used by tulip approximate a lightweight architecture for user-level path diagnosis approximations suggest evolutionary changes to improve Internet s diagnostic support Future work: extend tulip with tomography to improve diagnosis granularity higher layer protocol diagnosis ratul sosp 2003 user-level internet path diagnosis 24