BGP Convergence in much less than a second

Similar documents
RFC2547 Convergence Characterization and Optimization

BGP Routing and BGP Policy. BGP Routing. Agenda. BGP Routing Information Base. L47 - BGP Routing. L47 - BGP Routing

RFC2547 Convergence: Characterization and Optimization

ibgp Multipath Load Sharing

FC LS Slovakia Expo Cisco Systems, Inc. All rights reserved. Cisco Public

Internet Routing Basics

Routing Basics. SANOG July, 2017 Gurgaon, INDIA

BGP Diverse Path Using a Diverse-Path Route Reflector

Back to basics J. Addressing is the key! Application (HTTP, DNS, FTP) Application (HTTP, DNS, FTP) Transport. Transport (TCP/UDP) Internet (IPv4/IPv6)

BGP Support for Next-Hop Address Tracking

Vendor: Alcatel-Lucent. Exam Code: 4A Exam Name: Alcatel-Lucent Border Gateway Protocol. Version: Demo

BGP Event-Based VPN Import

Routing Resiliency Latest Enhancements

How BGP Routers Use the Multi Exit Discriminator for Best Path Selection

COM-208: Computer Networks - Homework 6

BGP Best External. Finding Feature Information

LARGE SCALE IP ROUTING LECTURE BY SEBASTIAN GRAF

Module 1b IS-IS. Prerequisites: The setup section of Module 1. The following will be the common topology used for the first series of labs.

Chapter 4: outline. Network Layer 4-1

Juniper JN0-647 Exam. Volume: 65 Questions. Question: 1 Which protocol is a multicast routing protocol? A. OSPF B. BGP C. PIM D. IS-IS.

Segment Routing MPLS data plane. Clarence Filsfils Kris Michielsen

BGP Support for Next-Hop Address Tracking

Multi Topology Routing Truman Boyes

Operation Manual MCE H3C S3610&S5510 Series Ethernet Switches. Table of Contents

MPLS VPN--Inter-AS Option AB

BGP Cost Community. Prerequisites for the BGP Cost Community Feature

Configure Segment Routing for BGP

BGP Next Hop Unchanged

Deploy MPLS L3 VPN. APNIC Technical Workshop October 23 to 25, Selangor, Malaysia Hosted by:

Routing Basics ISP/IXP Workshops

Examination. ANSWERS IP routning på Internet och andra sammansatta nät, DD2491 IP routing in the Internet and other complex networks, DD2491

Inter-Domain Routing: BGP

BGP mvpn BGP safi IPv4

BGP Commands: M through N

BGP Attributes and Path Selection

Evaluating the Benefits of the Locator/Identifier Separation

MPLS L3VPN. The MPLS L3VPN model consists of three kinds of devices: PE CE Site 2. Figure 1 Network diagram for MPLS L3VPN model

MPLS VPN Inter-AS Option AB

Routing Basics. Routing Concepts. IPv4. IPv4 address format. A day in a life of a router. What does a router do? IPv4 Routing

BGP. Autonomous system (AS) BGP version 4

MPLS VPN Carrier Supporting Carrier IPv4 BGP Label Distribution

internet technologies and standards

BGP. Autonomous system (AS) BGP version 4

UNDERSTANDING CONVERGENCE IN MPLS VPN NETWORKS. Mukhtiar A. Shaikh Moiz Moizuddin

Protecting an EBGP peer when memory usage reaches level 2 threshold 66 Configuring a large-scale BGP network 67 Configuring BGP community 67

Dynamics of Hot-Potato Routing in IP Networks

Ravi Chandra cisco Systems Cisco Systems Confidential

MPLS VPN Carrier Supporting Carrier IPv4 BGP Label Distribution

BGP Additional Paths. Finding Feature Information. Information About BGP Additional Paths. Problem That Additional Paths Can Solve

BGP. Autonomous system (AS) BGP version 4

Inter-AS routing and BGP. Network Layer 4-1

Configuring MPLS L3VPN

BGP Scaling (RR & Peer Group)

Routing Concepts. IPv4 Routing Forwarding Some definitions Policy options Routing Protocols

HPE FlexFabric 5940 Switch Series

Interdomain Routing and Connectivity

Network Layer (Routing)

Nexus 9000/3000 Graceful Insertion and Removal (GIR)

Introduction to IS-IS

IOS Routing Internals

Deploying MPLS-based IP VPNs

Deploying MPLS L3VPN. Apricot Cisco and/or its affiliates. All rights reserved. Cisco Public

ECMP Load Balancing. MPLS: Layer 3 VPNs Configuration Guide, Cisco IOS XE Release 3S (Cisco ASR 900 Series) 1

IBGP scaling: Route reflectors and confederations

Achieving Sub-50 Milliseconds Recovery Upon BGP Peering Link Failures

Introduction. Keith Barker, CCIE #6783. YouTube - Keith6783.

IPv6 Module 6 ibgp and Basic ebgp

Initial motivation: 32-bit address space soon to be completely allocated. Additional motivation:

Routing Protocols. Technology Description BGP CHAPTER

Cisco Evolved Programmable Network Implementation Guide for Large Network with End-to-End Segment Routing, Release 5.0

BGP Attributes and Policy Control

ibgp Multipath Load Sharing

MPLS VPN Carrier Supporting Carrier Using LDP and an IGP

Chapter 4: Network Layer. Lecture 12 Internet Routing Protocols. Chapter goals: understand principles behind network layer services:

HP 5920 & 5900 Switch Series

Cisco CISCO Configuring BGP on Cisco Routers Exam. Practice Test. Version

BGP. Autonomous system (AS) BGP version 4. Definition (AS Autonomous System)

LARGE SCALE IP ROUTING LECTURE BY SEBASTIAN GRAF

IPv6 Switching: Provider Edge Router over MPLS

Implementing BGP. BGP Functional Overview. Border Gateway Protocol (BGP) is an Exterior Gateway Protocol (EGP) that allows you to create loop-free

Configuring IPv6 Provider Edge over MPLS (6PE)

Egress Protection (draft-shen-mpls-egress-protection-framework) Presented by Krzysztof G. Szarkowicz NANOG71 October 4, 2017

BGP Event-Based VPN Import

Configuring MPLS L3VPN

Chapter 4: Network Layer

Implementing Static Routes

MPLS VPN C H A P T E R S U P P L E M E N T. BGP Advertising IPv4 Prefixes with a Label

CS 43: Computer Networks. 24: Internet Routing November 19, 2018

BGP Enhancements for IPv6. ISP Training Workshops

Last time. Transitioning to IPv6. Routing. Tunneling. Gateways. Graph abstraction. Link-state routing. Distance-vector routing. Dijkstra's Algorithm

CS 457 Networking and the Internet. The Global Internet (Then) The Global Internet (And Now) 10/4/16. Fall 2016

Connecting to a Service Provider Using External BGP

BGP Part-1.

IBGP internals. BGP Advanced Topics. Agenda. BGP Continuity 1. L49 - BGP Advanced Topics. L49 - BGP Advanced Topics

Request for Comments: 3277 Category: Informational April Intermediate System to Intermediate System (IS-IS) Transient Blackhole Avoidance

MPLS VPN Carrier Supporting Carrier Using LDP and an IGP

! Distance vector routing! Link state routing.! Path vector routing! BGP: Border Gateway Protocol! Route aggregation

Lecture 4: Intradomain Routing. CS 598: Advanced Internetworking Matthew Caesar February 1, 2011

UniNets MPLS LAB MANUAL MPLS. UNiNets Multiprotocol label Switching MPLS LAB MANUAL. UniNets MPLS LAB MANUAL

Examination. IP routning på Internet och andra sammansatta nät, DD2491 IP routing in the Internet and other complex networks, DD2491

Transcription:

BGP Convergence in much less than a second Clarence Filsfils - cf@cisco.com Presented by Martin Winter - mwinter@cisco.com 1

Down Convergence T1 Down Convergence T2 Default metric = 1 Src R R 20 F Dst Link L Assume a flow from Src to Dest T1: when L dies, the best path is impacted loss of traffic T2: when the network converges, a next best path is computed traffic reaches the destination again Loss of Connectivity: T2 T1, called Down convergence hereafter Analyzed for streams going to IGP and BGP learned prefixes 2

Up Convergence T1 Up Convergence T2 Default metric = 1 Src R R 20 F Dst Link L Assume a flow from Src to Dest T1: when L comes up, a better best path is available there is no traffic loss T2: the new bestpath is applied there is no traffic loss T2-T1 is called UP convergence. This does not imply any loss of connectivity 3

Convergence Down versus UP Down convergence impacts customer service (there is loss between T2 and T1) UP Convergence does not impact customer service (there is no loss between T2 and T1 because modern router implementation supports lossless switch-over from valid to valid path) This paper focuses on BGP Convergence upon Down transition 4

BGP PIC Core 5

BGP Prefix-Independent Convergence Core failure PE1 s RIB: 10.1.1/24 via PE2 10.1.2/24 via PE2. 99.99.99/24 via PE2 When? PE1 PoS1 PoS2 PE2 PE2 is reachable via PoS1 PoS2 Upon a core failure, the IGP finds an alternate path to the BGP next-hop PE2 in a few 100 s of msec Requirement: the BGP prefixes depending on reachability to PE2 must leverage the new ISIS path as soon as it is updated in the FIB 6

The right behavior is BGP PIC/core iox-crs1-p-isis-bgpipv4-dpi2i-021307_154358-l-nlb 400 Agilent measurements The measured convergence to the BGP nhop traffic loss (msec) 350 300 250 200 150 100 50 0% 50% 90% 100% 0% 50% 90% 100% The measured convergence to the first BGP dependent to this BGP nhop The measured convergence to the middle (175k) BGP dependent to this BGP nhop The measured convergence to the last (350k) BGP dependent to this BGP nhop 0 0 1000 2000 3000 4000 5000 6000 7000 prefix nr BGP PIC effect Prefix 1 to 5000 are ISIS prefixes. The BGP nhop is an important ISIS prefix and hence is prioritized to the very start of the convergence Testbed: Tier1 ISP topology, CRS1, IOX3.5, 5000 ISIS prefixes, 350k IPv4 BGP dependents to impacted BGP nhop same behavior for VPNv4 as of 3.5 When ISIS converges, all the BGP dependents immediately leverage the ISIS convergence 7

The right architecture: hierarchical FIB BGP net BGP net BGP LDI IGP LDI oif BGP nexthop(s) IGP nexthop(s) Output Interface BGP net Pointer Indirection between BGP and IGP entries allow for immediate leveraging of the IGP convergence 8

The unoptimal way: flattened FIB BGP net BGP net oif oif BGP net IGP LDI oif oif Control Plane flattens the recursion such that any BGP FIB entry has its own local oif information 9

Hierarchical FIB - Advantages Routing Convergence: BGP PIC Core the BGP dependents converge at IGP convergence of their nhop Scaling and Robustness Smaller FIB Memory (thanks to sharing) Much less CPU requirement (no need to reflatten all BGP prefixes upon IGP change) Commercially available proven 10

BGP PIC Edge 11

BGP PIC Edge ibgp R1 ebgp B1 R3 ibgp ibgp R2 ebgp B2 Z prefixes Eg. N/n Two ISP s peer at multiple locations and exchange 350k BGP prefixes. R3, a typical PE within the grey ISP, installs each of these 350k BGP prefixes as multipath pair entries (BGP nhop B1, BGP nhop B2) We send traffic from R3 to each of the 350k IPv4 BGP prefixes of the yellow ISP. We measure the loss of connectivity upon a peering node failure (R1) The UUT (R3) is a 12k running IOX 3.3 12

350-crs1-mp-ipv4-350k-pic-edge Results 180msec! LoC (ms) 200 180 160 140 120 100 80 60 40 20 0 1 25000 Agilent measurements In 180msec, as a consequence of its IGP convergence, R3 has updated its FIB tree such that all the 350k IPv4 BGP prefixes be forwarded via the alternate path through R2 50000 The normal BGP convergence will then take place which involves R2 sending 350k of withdraws and R3 running 350k of bestpath s and RIB modifications. This process will likely take multiple tens of seconds but has no impact on the service thanks to the FIB fixup provided by BGP PIC at IGP convergence time. 75000 13 100000 100001 125000 150000 175000 200000 prefix nr 225000 225001 250000 275000 300000 325000 350000 1 2 3 4 5 6 7 8 9 10

If Non BGP PIC edge R3 R1 B1 Z prefixes Eg. N/n R2 B2 R3 receives a NHT/down and trigger on a per-prefix basis bestpath (eg. R2 is now best as R1 is gone) RIB/FIB update peer update Results: for Z = 100.000 IPv4 BGP prefixes 12k/Eng3/32S: 30 seconds Other vendors: 30 seconds as well Impossible to perform good here due to the prefix-dependency 14

Another advantage of Hierarchical FIB BGP net BGP net IGP LDI oif BGP LDI BGP net IGP LDI oif Pointer Indirection between BGP and IGP entries allow for immediate update of the multipath BGP LDI at IGP convergence 15

Assumption The failure must result into an IGP-based deletion of the BGP nhop PE node failure: its IGP neighbors detect the loss of adjacency and trigger a convergence at all remote PE s which concludes with a deletion of this node from the topology and hence a deletion of any BGP nhop on that node (its /32) or on peering links behind that node. Peering link failure: next-hop-self should not be used (the BGP nhop is on the peering link) and the IGP should run passive on the peering link. Upon peering link failure, the IGP converges and concludes with a deletion of the BGP nhop The BGP prefixes impacted by the failure must have alternate paths installed as multipath BGP entries 16

Conclusion Hierarchical FIB allows for better FIB memory scaling much lower CPU consumption higher robustness And especially BGP PIC Core BGP PIC Edge 17

Availability BGP PIC CORE 12k: IOX 3.3 (ipv4, vpnv4, ipv6) CRS: IOX 3.5 (ipv4, vpnv4, ipv6) BGP PIC EDGE - Multipath assumption E3/E5: IPv4 & IPv6 = IOX 3.3, VPNv4 = IOX 3.5 CRS: IPv4 & IPv6 & VPNv4 = IOX 3.5 18