CS 557 BGP Convergence

Similar documents
Destination Reachability and BGP Convergence Time. Beichuan Zhang (UCLA) Dan Massey (Colorado State) Lixia Zhang (UCLA)

An Analysis of Convergence Delay in Path-Vector Routing Protocols

On the Impact of Route Processing and MRAI Timers on BGP Convergence Times

BGP-RCN: improving BGP convergence through root cause notification q

Preventing the unnecessary propagation of BGP withdraws

Quantifying Path Exploration in the Internet

The Internet now-a-days has become indispensable to each and everyone. It is vulnerable to

Bringing Order To BGP: Decreasing Time and Message Complexity

Quantifying Path Exploration in the Internet

Computer Networks 53 (2009) Contents lists available at ScienceDirect. Computer Networks. journal homepage:

THE INTERNET S inter-domain routing protocol, the

CS557 Spring 2013 Midterm Exam

Bringing Order To BGP: Decreasing Time and Message Complexity

Inter-Domain Routing: BGP

BGP Convergence in Virtual Private Networks

Introduction to IP Routing. Geoff Huston

Top-Down Network Design

Reducing BGP convergence time by fine tuning the MRAI timer on different topologies

APT Incremental Deployment

CS Networks and Distributed Systems. Lecture 8: Inter Domain Routing

CS 43: Computer Networks. 24: Internet Routing November 19, 2018

Lecture 19: Network Layer Routing in the Internet

Table of Contents. Cisco Introduction to EIGRP

BGP Routing and BGP Policy. BGP Routing. Agenda. BGP Routing Information Base. L47 - BGP Routing. L47 - BGP Routing

A Technique for Reducing BGP Update Announcements through Path Exploration Damping

PART III. Implementing Inter-Network Relationships with BGP

Differentiated BGP Update Processing for Improved Routing Convergence

CS4450. Computer Networks: Architecture and Protocols. Lecture 15 BGP. Spring 2018 Rachit Agarwal

CS4450. Computer Networks: Architecture and Protocols. Lecture 11 Rou+ng: Deep Dive. Spring 2018 Rachit Agarwal

Broadcast: Befo re 1

BGP Path Exploration Damping (PED)

COMP/ELEC 429 Introduction to Computer Networks

Quantifying Path Exploration in the Internet

Inter-AS routing and BGP. Network Layer 4-1

Network Protocols. Routing. TDC375 Autumn 03/04 John Kristoff - DePaul University 1

Network Layer: Routing

CS4700/CS5700 Fundamentals of Computer Networks

CSCE 463/612 Networks and Distributed Processing Spring 2018

Department of Computer and IT Engineering University of Kurdistan. Computer Networks II Border Gateway protocol (BGP) By: Dr. Alireza Abdollahpouri

This appendix contains supplementary Border Gateway Protocol (BGP) information and covers the following topics:

More Routing. EE122 Fall 2012 Scott Shenker

Software-Defined Inter-Domain Routing Revisited

TDC 363 Introduction to LANs

Outline. Organization of the global Internet. BGP basics Routing policies The Border Gateway Protocol How to prefer some routes over others

Topics for This Week

CS555, Spring /5/2005. April 12, 2005 No classes attend Senior Design Projects conference. Chapter 4 roadmap. Internet AS Hierarchy

Network Routing - II Failures, Recovery, and Change

Taming BGP. An incremental approach to improving the dynamic properties of BGP. Geoff Huston. CAIA Seminar 18 August

CS BGP v4. Fall 2014

CS 640: Introduction to Computer Networks. Intra-domain routing. Inter-domain Routing: Hierarchy. Aditya Akella

Advanced Topics in Routing

Lecture 4: Intradomain Routing. CS 598: Advanced Internetworking Matthew Caesar February 1, 2011

Examination. ANSWERS IP routning på Internet och andra sammansatta nät, DD2491 IP routing in the Internet and other complex networks, DD2491

Timer Interaction in Route Flap Damping

Routing, Routers, Switching Fabrics

Important Lessons From Last Lecture Computer Networking. Outline. Routing Review. Routing hierarchy. Internet structure. External BGP (E-BGP)

FINAL EXAM - SLOT 2 TCP/IP NETWORKING Duration: 90 min. With Solutions

BGP Route Flap Damping Algorithms

Improving Packet Delivery Performance of BGP During Large-Scale Failures

The fundamentals of Ethernet!

EECS 122, Lecture 16. Link Costs and Metrics. Traffic-Sensitive Metrics. Traffic-Sensitive Metrics. Static Cost Metrics.

Homework Assignment #3b Due 11/22 at 3:50pm EE122 Fall 2010

Other Developments: CIDR

PHAS: A Prefix Hijack Alert System

Distance vector and RIP

Lecture 13: Routing in multihop wireless networks. Mythili Vutukuru CS 653 Spring 2014 March 3, Monday

Chapter 4: Network Layer

Routing Protocols --- Exterior Gateway Protocol

Computer Networking. Intra-Domain Routing. RIP (Routing Information Protocol) & OSPF (Open Shortest Path First)

Outline Computer Networking. Inter and Intra-Domain Routing. Internet s Area Hierarchy Routing hierarchy. Internet structure

Interdomain Routing Reading: Sections K&R EE122: Intro to Communication Networks Fall 2007 (WF 4:00-5:30 in Cory 277)

APT: A Practical Transit-Mapping Service Overview and Comparisons

State of routing research

Chapter IV: Network Layer

BGP. Autonomous system (AS) BGP version 4

The Impact of Router Outages on the AS-Level Internet

Achieving Sub-50 Milliseconds Recovery Upon BGP Peering Link Failures

On the Impact of Filters on Analyzing Prefix Reachability in the Internet. Ravish Khosla, Sonia Fahmy, Y. Charlie Hu Purdue University ICCCN 2009

Communication Networks

802.1AS Fast Master Clock Selection

Effect of Minimal Route Advertisement Interval Timers on Border Gateway Protocol Convergence

Routing in the Internet

DATA COMMUNICATOIN NETWORKING

BGP. Autonomous system (AS) BGP version 4

Internet Routing Dynamics

Routers & Routing : Computer Networking. Binary Search on Ranges. Speeding up Prefix Match - Alternatives

Simulating Internet Scale Topologies with Metarouting

The Global-ISP Paradigm

BGP. Autonomous system (AS) BGP version 4. Definition (AS Autonomous System)

Announcements. CS 5565 Network Architecture and Protocols. Project 2B. Project 2B. Project 2B: Under the hood. Routing Algorithms

On the structure and application of BGP Policy Atoms. Tel-Aviv University

Ravi Chandra cisco Systems Cisco Systems Confidential

internet technologies and standards

LOUP: The Principles and Practice of Intra-Domain Route Dissemination. Nikola Gvozdiev, Brad Karp, Mark Handley

Network Protocols. Routing. TDC375 Winter 2002 John Kristoff - DePaul University 1

CS 43: Computer Networks Internet Routing. Kevin Webb Swarthmore College November 16, 2017

Routing on the Internet. Routing on the Internet. Hierarchical Routing. Computer Networks. Lecture 17: Inter-domain Routing and BGP

LARGE SCALE IP ROUTING LECTURE BY SEBASTIAN GRAF

Backbone Networks. Networking Case Studies. Backbone Networks. Backbone Topology. Mike Freedman COS 461: Computer Networks.

Border Gateway Protocol (an introduction) Karst Koymans. Monday, March 10, 2014

Transcription:

CS 557 BGP Convergence Improved BGP Convergence via Ghost Flushing Bremler-Barr, fek, Schwarz, 2003 BGP-RCN: Improving Convergence Through Root Cause Notification Pei, zuma, Massey, Zhang, 2005 Spring 2013

Z s Candidate paths: () (B ) (C () B ) (E () D B ) (H (I H G G F E F ) BGP Path Exploration Z ( ) ( ) ( ) C B F E D ( ) ( ) ( ) dest. I H G n Obsolete paths (C B ) and (E D B ) explored before converging on valid path (I H G F )

Potential to Explore N! Paths Paths Explored by D B C S,C,S Link C,S fails,b,c,s,d,c,s,b,d,c,s,d,b,c,s. No route Theoretically can explore N! paths before no route

Some Routing Terminology Tup = route to previously unreachable prefix is announced. Tdown = route to current reachable prefix is withdrawn and no replacement exists Tshort = route to current reachable prefix switches to shorter path Tlong = route to current reachable prefix switches to a longer path Other terminology Tdown = fail-down Tlong = fail-over

BGP MRI Time and Convergence n Minimum Route dvertisement Interval (MRI) timer: n Within M=30 seconds, at most one announcement from to B n not for the first announcement, not for the withdrawal n Impact: a. suppress transient changes b. delay convergence s path changes: P 1 w P 2 P 3 P 4 P 5 time=0 Msgs from to B: P 1 w time=30 time=60 P 4 P 5

[BS03] Improving BGP Convergence Objective: Improve convergence time after a legitimate route change. pproach: Flush out ghost information that is blocked by the MRI timer P4 in previous slide is ghost information Contributions: Simple, easily deployed, and clever approach to improve convergence Theoretical understanding of convergence behavior Improves on 2002 result from Pei et al.

Basic Model Each S is treated as one node Though not strictly required in ghost flushing Routers use shortest path routing policy Helps with analysis, but not strictly required SPVP (simple path vector protocol) approximates BGP MRI timer between updates Minimum Route dvertisement Interval Two consecutive updates must be at least MRI time apart.

Ghost Information Obsolete path information stored at node Could be preferred route or backup route stored at a node. MRI timer can block removal of ghost information Router cannot announce its current choice of paths because it recently announced a different path. Typical MRI value is 30 seconds Can lead to increased convergence time and increased chance of selecting ghost paths.

Ghost Flushing Very Simple Rule for BGP Routers When route to P is updated to a worse path and MRI timer is delaying path announcement send withdraw(p) (no route to P)

Path Length and Time ssume Tdown Event Let H = message passing time Claim at time K*H, every message or node has SPath length > K By induction. True at time H since neighboring routers received withdraw ssume true at time KH, all paths longer than KH. Suppose K or less path exists at time (K+1)H Must have come from some peer P with path length KH. Path must have been removed prior to time KH Withdraw or longer path announced prior to time KH Must be received prior to time (K+1)H (contradiction)

Implications of Time/Length Shown that at the K*H, every message or node has SPath length > K Implications: Longest possible path has length N t time N*H, all paths are longer than longest possible path By time N*H, all routers know that path is withdrawn Convergence time is (N*H) Reduced from N*MRI

Message Complexity Claim at most 2 messages sent during each MRI timer interval Resulting complexity Number of MRI rounds is NH/(MRI) Updates per round is 2E Complexity is O(2ENH/MRI) (BGP complexity is EN)

Tlong (fail-over) Complexity Expect good results, but no theoretical results presented here Simulations show solid improvement Other simulations (not shown here) show some surprises Theoretical results later determined by Pei et al. Covered next week.

[P+05] Improving BGP Convergence Objective: Improve convergence time after a legitimate route change. pproach: Signal the cause of the path failure Contributions: Dramatic reduction in convergence time plus ability to improve other parts of BGP Theoretical understanding of convergence behavior

BGP Path Exploration Revisited Z s Candidate paths: () (B ) (C () B ) (E () D B ) (H (I H G G F E F ) Z ( ) ( ) ( ) C B F E D ( ) ( ) ( ) dest. I H G n Observation: if Z know [B ] failed, it could ve avoided the obsolete paths

Root Cause Notification The node who detects the failure attaches root cause to msg Other nodes copy the root cause to outgoing messages n the first msg is enough for Z to remove all the obsolete paths Z s Candidate paths: () (B ) (C B ) (E D B ) ) (H (I H G G F E F ) Z ( ), [B ] failure ( ), [B ] failure C E D B ( ), [B ] failure F I H G

Overlapping Events nother topology change happens before the previous change s convergence finishes. [B ] failure Z B E D [B ] failure dest. Propagation along lower path is slower than upper path

Overlapping Events Path: (B ) [B ] recovery Z B E D [B ] recovery dest. [B ] failure

Wrong! Path: (B ) Overlapping Events Observation: need to order the relative timing of the root causes Z B E D dest. [B ] failure [B ] recovery

Solution: adding sequence number Node B maintains a sequence number for link [B ] Incremented each time the link status changes [B ] failure, seqnum=1 Z B E D [B ] failure, seqnum=1 dest.

Solution: adding sequence number (B ), [B ] recovery, seqnum=2 Path: (C B ), seqnum of [B ]=2 Z B E D dest. [B ] failure, seqnum=1

Solution: adding sequence number Sequence number orders the relative timing of the root causes Path: (B ), seqnum of [B ]=2 Z B E [B ] failure, seqnum=1 D dest.

Evaluation: analysis and simulation n Two types of topology changes: dest. n Fail-over: nodes switch to worse paths Z I H B G F n Fail-down: destination becomes unreachable dest.

Z Fail-down convergence delay (worst case) bound Withdrawals are not delayed by MRI! w C h seconds w long shortest path: it takes at most d*h seconds B h seconds d: network diameter w nodal processing delay dest. d << N-1 and h <<M RCN BGP d * h (N-1) * (h+m) Length of the longest possible path(n) MRI value

Fail-down simulation results n 2-3 orders of magnitudes reduction Convergence Time 1000 Seconds 100 10 BGP RCN 1 14 28 56 112 Number of nodes

Border nodes in fail-over convergence unaffected nodes Z s eventual path has always been available H I J D Z ffected nodes B C dest. Border node Z: connected to an unaffected node H its eventual path is through H

RCN s fail-over delay bound H First message unaffected is nodes not delayed by MRI! D Phase 2: (M+h)* d affected ffected nodes Node D s convergence: Z Phase 1: h*d affected B C dest. Phase 1: Z receives the root cause Phase 2: Z s path is propagated to D (MRI delay applies in this phase) diameter of the sub-graph of affected nodes RCN (M + 2*h)*d affected

BGP s fail-over delay bound H unaffected nodes D Phase 2: (M+h)* d affected ffected nodes Node D s convergence: Z B C dest. Phase 1: Z explores paths shorter than Z s eventual path Phase 2: the same as in RCN BGP (M+h) * min{d - J, V affected + d affected -1}

Fail-over simulation results n BGP does fine : <(M+h) * d n d : 2~6 d : length of the longest path from any affected node to the destination Seconds 25 20 15 10 5 0 14 28 56 112 Number of nodes BGP RCN Constructed topologies with large d : RCN has much more pronounced improvement

RCN Overhead n Transmission & storage of a path: doubled path:seqnum (Z C B ):(3 2 2 1) n Storage overhead in the routing table: n doubled n Transmission overhead reduced n 1~2 orders of magnitudes reduction in msg counts

Related Work n Reducing negative impact of MRI: n [Griffin:ICNP01], Ghost-Flushing [Bremler-Barr:Infocom03] n don t deal with path exploration n Reducing path exploration n Consistency ssertion [Pei:Infocom02] n path exploration still exists n Explicitly signaling failure n RCO [Luo:Globecom02], BGP-CT [Wattenhofer:talkslides03]: may result in wrong routing decision n EPIC [Chandrasheka:Infocom05]: encoding difference