Achieving Efficient Bandwidth Utilization in Wide-Area Networks While Minimizing State Changes

Similar documents
KULFI. Robust Traffic Engineering Using Semi- Oblivious Routing TASTES GREAT, NO CHURN!

SMORE: Semi-Oblivious Traffic Engineering

Semi-Oblivious Traffic Engineering:

From Routing to Traffic Engineering

arxiv: v1 [cs.ni] 3 Mar 2016

Hardware Evolution in Data Centers

Semi-Oblivious Traffic Engineering: The Road Not Taken

Chapter 7 Slicing and Dicing

Application-Aware SDN Routing for Big-Data Processing

Routing Domains in Data Centre Networks. Morteza Kheirkhah. Informatics Department University of Sussex. Multi-Service Networks July 2011

CONGA: Distributed Congestion-Aware Load Balancing for Datacenters

Application of SDN: Load Balancing & Traffic Engineering

An Approach Using Ant-like Agents. Alice Forehand Robert Pienta

SPARTA: Scalable Per-Address RouTing Architecture

in Oblivious Routing

Robust validation of network designs under uncertain demands and failures

T9: SDN and Flow Management: DevoFlow

Knowledge-Defined Network Orchestration in a Hybrid Optical/Electrical Datacenter Network

DCRoute: Speeding up Inter-Datacenter Traffic Allocation while Guaranteeing Deadlines

Mellanox Virtual Modular Switch

Cisco CloudCenter Solution with Cisco ACI: Common Use Cases

SpecPaxos. James Connolly && Harrison Davis

Advanced Computer Networks Exercise Session 7. Qin Yin Spring Semester 2013

Cisco Extensible Network Controller

Unicast Routing. Information About Layer 3 Unicast Routing CHAPTER

Basic Switch Organization

Lecture 10.1 A real SDN implementation: the Google B4 case. Antonio Cianfrani DIET Department Networking Group netlab.uniroma1.it

White Paper. OCP Enabled Switching. SDN Solutions Guide

Enabling Efficient and Scalable Zero-Trust Security

SOFTWARE DEFINED NETWORKING/ OPENFLOW: A PATH TO PROGRAMMABLE NETWORKS

Expeditus: Congestion-Aware Load Balancing in Clos Data Center Networks

EC 513 Computer Architecture

Enhancing Quality of Service in Software-Defined Networks

Interconnection Networks: Routing. Prof. Natalie Enright Jerger

Fast-Response Multipath Routing Policy for High-Speed Interconnection Networks

Utilizing Datacenter Networks: Centralized or Distributed Solutions?

Topic 6: SDN in practice: Microsoft's SWAN. Student: Miladinovic Djordje Date:

Forwarding Architecture

Introduction to Segment Routing

Managing and Securing Computer Networks. Guy Leduc. Chapter 2: Software-Defined Networks (SDN) Chapter 2. Chapter goals:

GUIDE. Optimal Network Designs with Cohesity

NETWORK OVERLAYS: AN INTRODUCTION

Internet Technology. 15. Things we didn t get to talk about. Paul Krzyzanowski. Rutgers University. Spring Paul Krzyzanowski

Bit Index Explicit Replication (BIER) Multicasting in Transport Networks

A Policy-aware Switching Layer for Data Centers

Plexxi Theory of Operations White Paper

DARD: A Practical Distributed Adaptive Routing Architecture for Datacenter Networks

Software-Defined Networking (Continued)

Advanced Computer Networks Data Center Architecture. Patrick Stuedi, Qin Yin, Timothy Roscoe Spring Semester 2015

Adaptive Routing. Claudio Brunelli Adaptive Routing Institute of Digital and Computer Systems / TKT-9636

Data Center Network Topologies II

Lecture 7: Data Center Networks

Flow Analyzer 1.0 Help Guide FLOW ANALYZER 1.0. By Nuviso

Informatica Universiteit van Amsterdam. Distributed Load-Balancing of Network Flows using Multi-Path Routing. Kevin Ouwehand. September 20, 2015

Lecture 6: Overlay Networks. CS 598: Advanced Internetworking Matthew Caesar February 15, 2011

New OVS instrumentation features aimed at real-time monitoring of virtual networks

Lecture 14 SDN and NFV. Antonio Cianfrani DIET Department Networking Group netlab.uniroma1.it

The Nostrum Network on Chip

Transport Protocols for Data Center Communication. Evisa Tsolakou Supervisor: Prof. Jörg Ott Advisor: Lect. Pasi Sarolahti

Cengiz Alaettinoglu. SDN Traffic Engineering A Natural Evolution

TCEP: Traffic Consolidation for Energy-Proportional High-Radix Networks

CS644 Advanced Networks

Top-Down Network Design

ENTERPRISE MPLS. Kireeti Kompella

GIAN Course on Distributed Network Algorithms. Network Topologies and Local Routing

Scalable Enterprise Networks with Inexpensive Switches

Best Practices for Validating the Performance of Data Center Infrastructure. Henry He Ixia

MUD: Send me your top 1 3 questions on this lecture

A High-Performance Storage and Ultra- High-Speed File Transfer Solution for Collaborative Life Sciences Research

ANALYSIS AND IMPROVEMENT OF VALIANT ROUTING IN LOW- DIAMETER NETWORKS

Data Center Networking

Practical MU-MIMO User Selection on ac Commodity Networks

THE increasing reliance on streaming mobile video from

TDT Appendix E Interconnection Networks

MIND: Machine Learning based Network Dynamics. Dr. Yanhui Geng Huawei Noah s Ark Lab, Hong Kong

DevoFlow: Scaling Flow Management for High-Performance Networks

Chapter 1. Introduction

COM-208: Computer Networks - Homework 6

CS 498 Hot Topics in High Performance Computing. Networks and Fault Tolerance. 9. Routing and Flow Control

ONOS: TOWARDS AN OPEN, DISTRIBUTED SDN OS. Chun Yuan Cheng

THE increasing reliance on streaming mobile video from

What is SD WAN and should I know or care about it? Ken LaMere Ecessa

SDN-based Network Obfuscation. Roland Meier PhD Student ETH Zürich

Data Center Virtualization: Open vswitch

MULTILAYER REQUIREMENTS FOR NBI OF IP/MPLS DOMAIN CONTROLLERS

Centralization of Network using Openflow Protocol

Let It Flow Resilient Asymmetric Load Balancing with Flowlet Switching

Competitive and Deterministic Embeddings of Virtual Networks

Message Passing Models and Multicomputer distributed system LECTURE 7

How Emerging Optical Technologies will affect the Future Internet

VMware vsan Network Design-OLD November 03, 2017

Not all SD-WANs are Created Equal: Performance Matters

Implementing VXLAN. Prerequisites for implementing VXLANs. Information about Implementing VXLAN

ProAc&ve Rou&ng In Scalable Data Centers with PARIS

ProgrammableFlow: OpenFlow Network Fabric

Modeling UGAL routing on the Dragonfly topology

The Round Complexity of Distributed Sorting

Be Fast, Cheap and in Control with SwitchKV Xiaozhou Li

DESIGN GUIDE. VMware NSX for vsphere (NSX-v) and F5 BIG-IP Design Guide

Designing Distributed Systems using Approximate Synchrony in Data Center Networks

Transcription:

1 Achieving Efficient Bandwidth Utilization in Wide-Area Networks While Minimizing State Changes

2 WAN Traffic Engineering Maintaining private WAN infrastructure is expensive Must balance latency-sensitive customer traffic with high-volume, operational traffic Many competing goals: Distribute load Improve utilization Operate more efficiently

3 Example Network West East

3 Example Network Sporadic shortcuts West East

3 Example Network Sporadic shortcuts Sparse bisection West East

4 Multi-Commodity Flow 1. Continuously monitor flows, build a traffic matrix 2. Encode as constraint problem 3. Map solution to physical paths 4. Update forwarding tables 5. Repeat West

4 Multi-Commodity Flow 1. Continuously monitor flows, build a traffic matrix 2. Encode as constraint problem 3. Map solution to physical paths 4. Update forwarding tables 5. Repeat West

5 Outline of This Talk Motivation Case for Randomized Routing Randomized Routing Evaluation Conclusions

6 The Case for Randomized Routing

7 Constantly Changing Routing State With optimal MCF, forwarding state must constantly be changed.

8 Increasing Number of Unique Paths With optimal MCF, the number of paths used increases over time.

9 Expensive Optimization Problem Solving MCF is computationally expensive.

Randomized Routing 10

11 Equal-Cost Multi-Path Routing 1. Pre-compute a set of least-cost paths 2. Identify flows by hashing packet header fields 3. Randomly forward along least cost paths West

11 Equal-Cost Multi-Path Routing 1. Pre-compute a set of least-cost paths 2. Identify flows by hashing packet header fields 3. Randomly forward along least cost paths West

12 Valiant Load Balancing 1. Choose a random intermediate node 2. Route from source to intermediate node 3. Route from intermediate node to destination West

12 Valiant Load Balancing 1. Choose a random intermediate node 2. Route from source to intermediate node 3. Route from intermediate node to destination West

13 Valiant Load Balancing West East

13 Valiant Load Balancing West East

14 Randomized Routing Trees A routing tree is a logical tree topology whose leaves correspond to physical topology A randomized routing tree is probability distribution over routing trees West Over-provisioning a network by Räcke's factor of O(log n) is usually not acceptable

15 Computing Low-Stretch Trees Stretch is the ratio of the weighted length of the path in logical tree over the weighted length of the path in the physical graph Key idea: reduce the oblivious routing problem to the low-stretch routing tree problem Fakcharoenphol, Rao, and Talwar (FRT) algorithm finds trees with average stretch O(log n)

16 Constructing RRTs Räcke s algorithm iteratively constructs a sequence of instances of the low-stretch tree problem At each iteration, it penalizes edges that have been heavily utilized in previous trees Balances load among edges to ensure O(log n) congestion ratio regardless of the traffic matrix (i.e. it is oblivious)

17 Rate Adaptation Congestion and churn minimization are conflicting goals Semi-oblivious routing is a potentially attractive solution Fixed set of paths, dynamically adjusted sending rates Con: Hajiaghayi et al. proved Ω(log(n)/log (log(n))) congestion Pro: Realistic workloads are different from worst-case Requires additional infrastructure (e.g., monitoring, updating weights)

18 Kulfi A framework for evaluating different traffic engineering policies Abstraction: users write modules that can modify path selection algorithm, or probability distribution over set of paths Example modules include: MCF, Raeke, AK, VLB, ECMP, and more. https://github.com/merlin-lang/kulfi

Evaluation 19

20 Implementation Linux End Host Linux Kernel User- Space Agent Netfilter Module Traffic matrix + Path-to-tag mapping OpenFlow rules VLAN- tagged packets SDN Controller SDN Switch Traffic statistics

20 Implementation Linux End Host Linux Kernel User- Space Agent Netfilter Module Traffic matrix + Path-to-tag mapping OpenFlow rules VLAN- tagged packets SDN Controller SDN Switch Historical Data Traffic statistics Traffic statistics

21 Oblivious Routing on Facebook Traffic Oblivious routing performs much better in practice than theory worst-case predictions

21 Oblivious Routing on Facebook Traffic Constant factor, not O (log n) Oblivious routing performs much better in practice than theory worst-case predictions

22 Oblivious Congestion on Facebook Network On a majority of links, the flow rate demanded by oblivious routing is at most 2x the link capacity

23 Abilene Network Experiment S10 S8 S11 S4 S5 S7 S3 S6 S2 S1 S12 S9 Emulated Abilene topology in hardware test bed Used real-world and worst case traffic scenarios Compared shortest-path, ECMP, MCF, oblivious, and semi-oblivious

23 Abilene Network Experiment S10 S8 S11 Artificial traffic S4 S5 S7 S3 S6 S2 S1 S12 S9 Emulated Abilene topology in hardware test bed Used real-world and worst case traffic scenarios Compared shortest-path, ECMP, MCF, oblivious, and semi-oblivious

24 Abilene Network, Various Workloads 1 Artificial Traffic 1 Abilene Observed Traffic Link congestion 0.8 0.6 0.4 0.2 0 SPF max ECMP max Obliv max Semi Obliv max MCF max 0 20 40 60 80 100 120 140 160 180 Time (minutes) SPF median ECMP median Obliv median Semi Obliv median MCF median Link congestion 0.8 0.6 0.4 0.2 0 0 20 40 60 80 100 120 140 160 180 Time (minutes) 1 Abilene Gravity Traffic 1 Abilene Gravity + Artificial Traffic Link congestion 0.8 0.6 0.4 0.2 Link congestion 0.8 0.6 0.4 0.2 0 0 20 40 60 80 100 120 140 160 180 Time (minutes) 0 0 20 40 60 80 100 120 140 160 180 Time (minutes)

25 Abilene Summary SPF and ECMP achieve slightly better median congestion MCF and semi-oblivious achieve better maximum congestion Max congestion with oblivious was never worse than 50% greater than optimal MCF, despite theory prediction No significant difference between semi-oblivious and optimal MCF, despite prior theory results

26 Conclusions Oblivious routing performs much better in practice than expected, avoids problems associated with churn, and load-balances better Semi-oblivious routing can provide close to optimal bandwidth utilization on real workloads Randomization can dramatically simplify traffic engineering infrastructure, balance competing requirements for traffic engineering goals

27