Flexplane: An Experimenta0on Pla3orm for Resource Management in Datacenters

Size: px
Start display at page:

Download "Flexplane: An Experimenta0on Pla3orm for Resource Management in Datacenters"

Transcription

1 Flexplane: An Experimenta0on Pla3orm for Resource Management in Datacenters Amy Ousterhout 1, Jonathan Perry 1, Hari Balakrishnan 1, Petr Lapukhov 2 1 MIT, 2 Facebook

2 Datacenter Networks Applica0ons have diverse requirements Dozens of new resource management schemes Low latency: DCTCP Min FCT: PDQ, RCP, pfabric, PERC Deadlines: D 3, D 2 TCP Difficult to experiment with schemes in real networks Requires changes to hardware routers

3 Experimenta0on with Resource Management Experimenta0on in real networks Programmable hardware - limited flexibility SoVware routers - limited throughput Emula0on limited throughput By Altera Corpora0on - Altera Corpora0on, CC BY 3.0

4 Experimenta0on with Resource Management Experimenta0on in simula0on (e.g., ns, OMNeT++) Does not accurately model real network stacks, NICs, and distributed applica0ons Does not run in real 0me No exis0ng approach to experimenta0on provides accuracy, flexibility, and high throughput

5 Our Contribu0ons Key idea: whole- network emula0on Flexplane: a pla3orm for faithful experimenta0on with resource management schemes Accurate predicts behavior of hardware Flexible express schemes in C++ High throughput 761 Gbits/s

6 Approach: Whole- Network Emula0on Real Network Emulated Network Emulator class MyScheduler { } class MyAQM { } packet

7 Abstract Packets Resource management schemes are data- independent Concise representa0on of one MTU Source, des0na0on, flow, ID Custom per- scheme fields Transmiged over the real network Consume 1-2% of network bandwidth

8 Emulator Real- 0me network simulator Faster than standard network simulators Time divided into abstract- packet- sized 0meslots Omits endpoint sovware Emulated Network Real Network Emulator

9 Sending Real Packets Drop packet if dropped in emula0on Otherwise, modify and send ECN marks, rates (e.g., in RCP, PDQ, PERC)

10 Accuracy Goal: predict behavior of a hardware network Hardware latency: unloaded delay + queuing delay Added latency of Flexplane: RTT to emulator Unloaded delay Queuing delay in real network Real Network Emulator Emulated Network

11 Flexplane API Decouples schemes from framework int route(abstractpkt *pkt) Emulator int classify(abstractpkt *pkt, int port) Emulator (pagern) enqueue(abstractpkt *pkt, int port, int queue) AbstractPkt *schedule(int output_port) Emulator (generic) Endpoints input(abstractpkt **pkts, int n) output(abstractpkt **pkts, int n) prepare_request(sk_buff *skb, char *request_data) prepare_to_send(sk_buff *skb, char *alloca0on_data) incoming packets outgoing packets route classify enqueue schedule

12 Mul0core Emulator Architecture Common approaches are ill- suited for Flexplane Parallel (RouteBricks, IX) Pipeline (Fastpass) CPU core abstract packet 4 - > 3 aggrega0on 1 - > 3 ToR endpoints 8 - > > 8 endpoints 1-6 ToR, aggrega0on endpoints 7-12

13 Mul0core Emulator Architecture Pin network components to cores, connect in a graph Router state not shared across cores Loose synchroniza0on, batching, prefetching aggrega0on ToR endpoints CPU core FIFO queues

14 Implementa0on Emulator uses Intel DPDK for low- latency NIC access Endpoints run a Linux qdisc

15 Accuracy U0lity Emulator throughput Evalua0on

16 Flexplane is Accurate Bulk TCP: 5 senders, 1 receiver Throughput Gbits/s vs. 9.4 Gbits/s in hardware Similar queue occupancies Median Queue Occupancies (MTUs) Hardware Flexplane DropTail RED DCTCP : Flexplane achieves similar queue occupancie

17 Flexplane is Accurate Average Normalized FCT RPC web search workload DCTCP Average Normalized FCT (0, 100KB] Load Average Normalized FCT Load Load 250 Average Normalized FCT Load (b) DCTCP, (0, 100KB] 0 (b) DCTCP, (0, (c) 100KB] DropTail, (10MB, ) (d) DCTCP, (10MB, ) Accurate 0.2 to within % 0.8 for loads up to 60% Load es the average normalized flow completion times of hardware for DropTail and DCTCP. The left ow sizes; the right two graphs show results for large flow sizes. ic Scheduler { Average Normalized FCT Flexplane 5 Hardware Observe behavior not visible in simula0ons 250 Priority Queueing DCTCP (10MB, )

18 Flexplane is Easy to Use Implemented several schemes in dozens of lines of code scheme LOC drop tail queue manager 39 RED queue manager 125 DCTCP queue manager 43 priority queueing scheduler 29 round robin scheduler 40 HULL scheduler 60 pfabric QM, queues, scheduler 251

19 Example: DCTCP in Flexplane void enqueue(abstractpkt *pkt, uint32_t port, uint32_t queue) { uint32_t qlen = m_bank- >occupancy(port, queue); if (qlen >= m_dctcp_params.q_capacity) { /* no space to enqueue, drop this packet */ drop(pkt, port); return; } /* mark if queue occupancy is greater than K */ if (qlen > m_dctcp_params.k_threshold) { /* Set ECN mark on packet */ mark_ecn(pkt, port); } } m_bank- >enqueue(port, queue, pkt);

20 Flexplane Enables Experimenta0on Evalua0ng trade- offs between resource management schemes High priority FCT (us) HULL DCTCP DropTail HULL better Priority Queueing RED Round Robin Priority Queueing DropTail pfabric Low priority FCT (ms) 35 30

21 Flexplane Enables Experimenta0on Experiment with real distributed applica0ons such as Spark % Change in Completion Time Relative to DropTail Coordinate descent Sort DCTCP +4.4% -4.8% HULL +29.4% -2.6% e 4: Percent change in completion time of two Spark Performance depends on network and CPU

22 Emulator Throughput Emulator provides 761 Gbits/s of aggregate throughput with 10 total cores Router throughput (Gbits/s) racks connected with Agg isolated racks Emulation cores

23 Emulator Throughput Throughput varies by 29% across schemes Throughput (Gbits/s) DCTCP DropTail Priority Queueing Round Robin Resource management scheme HULL RED pfabric 81x as much throughput per clock cycle as RouteBricks

24 Flexplane: an Experimenta0on Pla3orm Whole- network emula0on Flexplane: a pla3orm for faithful experimenta0on with resource management schemes Accuracy, flexibility, and high throughput hgps://github.com/aousterh/flexplane

Flexplane: An Experimenta0on Pla3orm for Resource Management in Datacenters. Amy Ousterhout, Jonathan Perry, Hari Balakrishnan, Petr Lapukhov

Flexplane: An Experimenta0on Pla3orm for Resource Management in Datacenters. Amy Ousterhout, Jonathan Perry, Hari Balakrishnan, Petr Lapukhov Flexplane: An Experimenta0on Pla3orm for Resource Management in Datacenters Amy Ousterhout, Jonathan Perry, Hari Balakrishnan, Petr Lapukhov Datacenter Networks Applica0ons have diverse requirements Dozens

More information

Fastpass A Centralized Zero-Queue Datacenter Network

Fastpass A Centralized Zero-Queue Datacenter Network Fastpass A Centralized Zero-Queue Datacenter Network Jonathan Perry Amy Ousterhout Hari Balakrishnan Devavrat Shah Hans Fugal Ideal datacenter network properties No current design satisfies all these properties

More information

Packet Scheduling in Data Centers. Lecture 17, Computer Networks (198:552)

Packet Scheduling in Data Centers. Lecture 17, Computer Networks (198:552) Packet Scheduling in Data Centers Lecture 17, Computer Networks (198:552) Datacenter transport Goal: Complete flows quickly / meet deadlines Short flows (e.g., query, coordination) Large flows (e.g., data

More information

Episode 5. Scheduling and Traffic Management

Episode 5. Scheduling and Traffic Management Episode 5. Scheduling and Traffic Management Part 3 Baochun Li Department of Electrical and Computer Engineering University of Toronto Outline What is scheduling? Why do we need it? Requirements of a scheduling

More information

An Implementation of the Homa Transport Protocol in RAMCloud. Yilong Li, Behnam Montazeri, John Ousterhout

An Implementation of the Homa Transport Protocol in RAMCloud. Yilong Li, Behnam Montazeri, John Ousterhout An Implementation of the Homa Transport Protocol in RAMCloud Yilong Li, Behnam Montazeri, John Ousterhout Introduction Homa: receiver-driven low-latency transport protocol using network priorities HomaTransport

More information

6.888 Lecture 5: Flow Scheduling

6.888 Lecture 5: Flow Scheduling 6.888 Lecture 5: Flow Scheduling Mohammad Alizadeh Spring 2016 1 Datacenter Transport Goal: Complete flows quickly / meet deadlines Short flows (e.g., query, coordina1on) Large flows (e.g., data update,

More information

EXPERIENCES EVALUATING DCTCP. Lawrence Brakmo, Boris Burkov, Greg Leclercq and Murat Mugan Facebook

EXPERIENCES EVALUATING DCTCP. Lawrence Brakmo, Boris Burkov, Greg Leclercq and Murat Mugan Facebook EXPERIENCES EVALUATING DCTCP Lawrence Brakmo, Boris Burkov, Greg Leclercq and Murat Mugan Facebook INTRODUCTION Standard TCP congestion control, which only reacts to packet losses has many problems Can

More information

Information-Agnostic Flow Scheduling for Commodity Data Centers

Information-Agnostic Flow Scheduling for Commodity Data Centers Information-Agnostic Flow Scheduling for Commodity Data Centers Wei Bai, Li Chen, Kai Chen, Dongsu Han (KAIST), Chen Tian (NJU), Hao Wang Sing Group @ Hong Kong University of Science and Technology USENIX

More information

Fastpass: A Centralized Zero-Queue Datacenter Network

Fastpass: A Centralized Zero-Queue Datacenter Network Fastpass: A Centralized Zero-Queue Datacenter Network Jonathan Perry, Amy Ousterhout, Hari Balakrishnan, Devavrat Shah, Hans Fugal M.I.T. Computer Science & Artificial Intelligence Lab Facebook http://fastpass.mit.edu/

More information

DeTail Reducing the Tail of Flow Completion Times in Datacenter Networks. David Zats, Tathagata Das, Prashanth Mohan, Dhruba Borthakur, Randy Katz

DeTail Reducing the Tail of Flow Completion Times in Datacenter Networks. David Zats, Tathagata Das, Prashanth Mohan, Dhruba Borthakur, Randy Katz DeTail Reducing the Tail of Flow Completion Times in Datacenter Networks David Zats, Tathagata Das, Prashanth Mohan, Dhruba Borthakur, Randy Katz 1 A Typical Facebook Page Modern pages have many components

More information

DIBS: Just-in-time congestion mitigation for Data Centers

DIBS: Just-in-time congestion mitigation for Data Centers DIBS: Just-in-time congestion mitigation for Data Centers Kyriakos Zarifis, Rui Miao, Matt Calder, Ethan Katz-Bassett, Minlan Yu, Jitendra Padhye University of Southern California Microsoft Research Summary

More information

Linux Plumbers Conference TCP-NV Congestion Avoidance for Data Centers

Linux Plumbers Conference TCP-NV Congestion Avoidance for Data Centers Linux Plumbers Conference 2010 TCP-NV Congestion Avoidance for Data Centers Lawrence Brakmo Google TCP Congestion Control Algorithm for utilizing available bandwidth without too many losses No attempt

More information

Towards a Software Defined Data Plane for Datacenters

Towards a Software Defined Data Plane for Datacenters Towards a Software Defined Data Plane for Datacenters Arvind Krishnamurthy Joint work with: Antoine Kaufmann, Ming Liu, Naveen Sharma Tom Anderson, Kishore Atreya, Changhoon Kim, Jacob Nelson, Simon Peter

More information

Handles all kinds of traffic on a single network with one class

Handles all kinds of traffic on a single network with one class Handles all kinds of traffic on a single network with one class No priorities, no reservations required Quality? Throw bandwidth at the problem, hope for the best 1000x increase in bandwidth over 2 decades

More information

Lecture 15: Datacenter TCP"

Lecture 15: Datacenter TCP Lecture 15: Datacenter TCP" CSE 222A: Computer Communication Networks Alex C. Snoeren Thanks: Mohammad Alizadeh Lecture 15 Overview" Datacenter workload discussion DC-TCP Overview 2 Datacenter Review"

More information

A Generalized Blind Scheduling Policy

A Generalized Blind Scheduling Policy A Generalized Blind Scheduling Policy Hanhua Feng 1, Vishal Misra 2,3 and Dan Rubenstein 2 1 Infinio Systems 2 Columbia University in the City of New York 3 Google TTIC SUMMER WORKSHOP: DATA CENTER SCHEDULING

More information

Information-Agnostic Flow Scheduling for Commodity Data Centers. Kai Chen SING Group, CSE Department, HKUST May 16, Stanford University

Information-Agnostic Flow Scheduling for Commodity Data Centers. Kai Chen SING Group, CSE Department, HKUST May 16, Stanford University Information-Agnostic Flow Scheduling for Commodity Data Centers Kai Chen SING Group, CSE Department, HKUST May 16, 2016 @ Stanford University 1 SING Testbed Cluster Electrical Packet Switch, 1G (x10) Electrical

More information

Investigating the Use of Synchronized Clocks in TCP Congestion Control

Investigating the Use of Synchronized Clocks in TCP Congestion Control Investigating the Use of Synchronized Clocks in TCP Congestion Control Michele Weigle (UNC-CH) November 16-17, 2001 Univ. of Maryland Symposium The Problem TCP Reno congestion control reacts only to packet

More information

RoGUE: RDMA over Generic Unconverged Ethernet

RoGUE: RDMA over Generic Unconverged Ethernet RoGUE: RDMA over Generic Unconverged Ethernet Yanfang Le with Brent Stephens, Arjun Singhvi, Aditya Akella, Mike Swift RDMA Overview RDMA USER KERNEL Zero Copy Application Application Buffer Buffer HARWARE

More information

Appendix B. Standards-Track TCP Evaluation

Appendix B. Standards-Track TCP Evaluation 215 Appendix B Standards-Track TCP Evaluation In this appendix, I present the results of a study of standards-track TCP error recovery and queue management mechanisms. I consider standards-track TCP error

More information

Router s Queue Management

Router s Queue Management Router s Queue Management Manages sharing of (i) buffer space (ii) bandwidth Q1: Which packet to drop when queue is full? Q2: Which packet to send next? FIFO + Drop Tail Keep a single queue Answer to Q1:

More information

Data Center TCP (DCTCP)

Data Center TCP (DCTCP) Data Center TCP (DCTCP) Mohammad Alizadeh, Albert Greenberg, David A. Maltz, Jitendra Padhye Parveen Patel, Balaji Prabhakar, Sudipta Sengupta, Murari Sridharan Microsoft Research Stanford University 1

More information

Advanced Computer Networks. Datacenter TCP

Advanced Computer Networks. Datacenter TCP Advanced Computer Networks 263 3501 00 Datacenter TCP Spring Semester 2017 1 Oriana Riva, Department of Computer Science ETH Zürich Today Problems with TCP in the Data Center TCP Incast TPC timeouts Improvements

More information

Titan: Fair Packet Scheduling for Commodity Multiqueue NICs. Brent Stephens, Arjun Singhvi, Aditya Akella, and Mike Swift July 13 th, 2017

Titan: Fair Packet Scheduling for Commodity Multiqueue NICs. Brent Stephens, Arjun Singhvi, Aditya Akella, and Mike Swift July 13 th, 2017 Titan: Fair Packet Scheduling for Commodity Multiqueue NICs Brent Stephens, Arjun Singhvi, Aditya Akella, and Mike Swift July 13 th, 2017 Ethernet line-rates are increasing! 2 Servers need: To drive increasing

More information

IX: A Protected Dataplane Operating System for High Throughput and Low Latency

IX: A Protected Dataplane Operating System for High Throughput and Low Latency IX: A Protected Dataplane Operating System for High Throughput and Low Latency Belay, A. et al. Proc. of the 11th USENIX Symp. on OSDI, pp. 49-65, 2014. Reviewed by Chun-Yu and Xinghao Li Summary In this

More information

Speeding up Linux TCP/IP with a Fast Packet I/O Framework

Speeding up Linux TCP/IP with a Fast Packet I/O Framework Speeding up Linux TCP/IP with a Fast Packet I/O Framework Michio Honda Advanced Technology Group, NetApp michio@netapp.com With acknowledge to Kenichi Yasukata, Douglas Santry and Lars Eggert 1 Motivation

More information

Tales of the Tail Hardware, OS, and Application-level Sources of Tail Latency

Tales of the Tail Hardware, OS, and Application-level Sources of Tail Latency Tales of the Tail Hardware, OS, and Application-level Sources of Tail Latency Jialin Li, Naveen Kr. Sharma, Dan R. K. Ports and Steven D. Gribble February 2, 2015 1 Introduction What is Tail Latency? What

More information

Enabling ECN over Generic Packet Scheduling

Enabling ECN over Generic Packet Scheduling Enabling ECN over Generic Packet Scheduling Wei Bai Kai Chen Li Chen Changhoon Kim Haitao Wu 3 SING Group, Hong Kong University of Science and Technology Barefoot Networks 3 Microsoft ABSTRACT Explicit

More information

Project Computer Networking. Resource Management Approaches. Start EARLY Tomorrow s recitation. Lecture 20 Queue Management and QoS

Project Computer Networking. Resource Management Approaches. Start EARLY Tomorrow s recitation. Lecture 20 Queue Management and QoS Project 3 15-441 Computer Networking Start EARLY Tomorrow s recitation Lecture 20 Queue Management and QoS Lecture 20: QOS (c) CMU, 2005-10 2 Traffic and Resource Management Resource Management Approaches

More information

Advanced Computer Networks. Flow Control

Advanced Computer Networks. Flow Control Advanced Computer Networks 263 3501 00 Flow Control Patrick Stuedi Spring Semester 2017 1 Oriana Riva, Department of Computer Science ETH Zürich Last week TCP in Datacenters Avoid incast problem - Reduce

More information

Coflow. Recent Advances and What s Next? Mosharaf Chowdhury. University of Michigan

Coflow. Recent Advances and What s Next? Mosharaf Chowdhury. University of Michigan Coflow Recent Advances and What s Next? Mosharaf Chowdhury University of Michigan Rack-Scale Computing Datacenter-Scale Computing Geo-Distributed Computing Coflow Networking Open Source Apache Spark Open

More information

Got Loss? Get zovn! Daniel Crisan, Robert Birke, Gilles Cressier, Cyriel Minkenberg, and Mitch Gusat. ACM SIGCOMM 2013, August, Hong Kong, China

Got Loss? Get zovn! Daniel Crisan, Robert Birke, Gilles Cressier, Cyriel Minkenberg, and Mitch Gusat. ACM SIGCOMM 2013, August, Hong Kong, China Got Loss? Get zovn! Daniel Crisan, Robert Birke, Gilles Cressier, Cyriel Minkenberg, and Mitch Gusat ACM SIGCOMM 2013, 12-16 August, Hong Kong, China Virtualized Server 1 Application Performance in Virtualized

More information

EECS 122: Introduction to Computer Networks Switch and Router Architectures. Today s Lecture

EECS 122: Introduction to Computer Networks Switch and Router Architectures. Today s Lecture EECS : Introduction to Computer Networks Switch and Router Architectures Computer Science Division Department of Electrical Engineering and Computer Sciences University of California, Berkeley Berkeley,

More information

15-744: Computer Networking. Overview. Queuing Disciplines. TCP & Routers. L-6 TCP & Routers

15-744: Computer Networking. Overview. Queuing Disciplines. TCP & Routers. L-6 TCP & Routers TCP & Routers 15-744: Computer Networking RED XCP Assigned reading [FJ93] Random Early Detection Gateways for Congestion Avoidance [KHR02] Congestion Control for High Bandwidth-Delay Product Networks L-6

More information

pfabric: Minimal Near-Optimal Datacenter Transport

pfabric: Minimal Near-Optimal Datacenter Transport pfabric: Minimal Near-Optimal Datacenter Transport Mohammad Alizadeh,ShuangYang, Milad Sharif, Sachin Katti, Nick McKeown, Balaji Prabhakar,andScottShenker Stanford University Insieme Networks U.C. Berkeley

More information

Data Center TCP (DCTCP)

Data Center TCP (DCTCP) Data Center TCP (DCTCP) Mohammad Alizadeh, Albert Greenberg, David A. Maltz, Jitendra Padhye Parveen Patel, Balaji Prabhakar, Sudipta Sengupta, Murari Sridharan Stanford University MicrosoD Research Case

More information

Congestion Control in Datacenters. Ahmed Saeed

Congestion Control in Datacenters. Ahmed Saeed Congestion Control in Datacenters Ahmed Saeed What is a Datacenter? Tens of thousands of machines in the same building (or adjacent buildings) Hundreds of switches connecting all machines What is a Datacenter?

More information

Multimedia-unfriendly TCP Congestion Control and Home Gateway Queue Management

Multimedia-unfriendly TCP Congestion Control and Home Gateway Queue Management Multimedia-unfriendly TCP Congestion Control and Home Gateway Queue Management Lawrence Stewart α, David Hayes α, Grenville Armitage α, Michael Welzl β, Andreas Petlund β α Centre for Advanced Internet

More information

TX bulking and qdisc layer

TX bulking and qdisc layer Session: Linux packet processing performance improvements TX bulking and qdisc layer Jesper Dangaard Brouer (Red Hat) John Fastabend (Intel) John Ronciak (Intel) 1/16 Linux Plumbers Conference 16 th Oct

More information

Data center Networking: New advances and Challenges (Ethernet) Anupam Jagdish Chomal Principal Software Engineer DellEMC Isilon

Data center Networking: New advances and Challenges (Ethernet) Anupam Jagdish Chomal Principal Software Engineer DellEMC Isilon Data center Networking: New advances and Challenges (Ethernet) Anupam Jagdish Chomal Principal Software Engineer DellEMC Isilon Bitcoin mining Contd Main reason for bitcoin mines at Iceland is the natural

More information

The Network Layer and Routers

The Network Layer and Routers The Network Layer and Routers Daniel Zappala CS 460 Computer Networking Brigham Young University 2/18 Network Layer deliver packets from sending host to receiving host must be on every host, router in

More information

6.888 Lecture 8: Networking for Data Analy9cs

6.888 Lecture 8: Networking for Data Analy9cs 6.888 Lecture 8: Networking for Data Analy9cs Mohammad Alizadeh ² Many thanks to Mosharaf Chowdhury (Michigan) and Kay Ousterhout (Berkeley) Spring 2016 1 Big Data Huge amounts of data being collected

More information

RESTRUCTURING ENDPOINT CONGESTION CONTROL

RESTRUCTURING ENDPOINT CONGESTION CONTROL RESTRUCTURING ENDPOINT CONGESTION CONTROL Akshay Narayan, Frank Cangialosi, Deep5 Raghavan, Prateesh Goyal, Srinivas Narayana, Radhika Mi>al, Mohammad Alizadeh, Hari Balakrishnan CONGESTION CONTROL APPLICATION

More information

Systems (th)at Scale. Jon Crowcroft,

Systems (th)at Scale. Jon Crowcroft, Systems (th)at Scale Jon Crowcroft, http://www.cl.cam.ac.uk/~jac22 Cloud, Data Center, Networks 1. New Cloud OS to meet new workloads Includes programming language Collabs incl REMS (w/ P.Gardner/Imperial)

More information

Transport Protocols for Data Center Communication. Evisa Tsolakou Supervisor: Prof. Jörg Ott Advisor: Lect. Pasi Sarolahti

Transport Protocols for Data Center Communication. Evisa Tsolakou Supervisor: Prof. Jörg Ott Advisor: Lect. Pasi Sarolahti Transport Protocols for Data Center Communication Evisa Tsolakou Supervisor: Prof. Jörg Ott Advisor: Lect. Pasi Sarolahti Contents Motivation and Objectives Methodology Data Centers and Data Center Networks

More information

CS 268: Computer Networking

CS 268: Computer Networking CS 268: Computer Networking L-6 Router Congestion Control TCP & Routers RED XCP Assigned reading [FJ93] Random Early Detection Gateways for Congestion Avoidance [KHR02] Congestion Control for High Bandwidth-Delay

More information

CSC 4900 Computer Networks: Network Layer

CSC 4900 Computer Networks: Network Layer CSC 4900 Computer Networks: Network Layer Professor Henry Carter Fall 2017 Villanova University Department of Computing Sciences Review What is AIMD? When do we use it? What is the steady state profile

More information

Flowtune: Flowlet Control for Datacenter Networks

Flowtune: Flowlet Control for Datacenter Networks Flowtune: Flowlet Control for Datacenter Networks Jonathan Perry, Hari Balakrishnan, and Devavrat Shah, Massachusetts Institute of Technology https://www.usenix.org/conference/nsdi17/technical-sessions/presentation/perry

More information

Chapter 6 Queuing Disciplines. Networking CS 3470, Section 1

Chapter 6 Queuing Disciplines. Networking CS 3470, Section 1 Chapter 6 Queuing Disciplines Networking CS 3470, Section 1 Flow control vs Congestion control Flow control involves preventing senders from overrunning the capacity of the receivers Congestion control

More information

Generic Architecture. EECS 122: Introduction to Computer Networks Switch and Router Architectures. Shared Memory (1 st Generation) Today s Lecture

Generic Architecture. EECS 122: Introduction to Computer Networks Switch and Router Architectures. Shared Memory (1 st Generation) Today s Lecture Generic Architecture EECS : Introduction to Computer Networks Switch and Router Architectures Computer Science Division Department of Electrical Engineering and Computer Sciences University of California,

More information

MUD: Send me your top 1 3 questions on this lecture

MUD: Send me your top 1 3 questions on this lecture Administrivia Review 1 due tomorrow Email your reviews to me Office hours on Thursdays 10 12 MUD: Send me your top 1 3 questions on this lecture Guest lectures next week by Prof. Richard Martin Class slides

More information

TCP Incast problem Existing proposals

TCP Incast problem Existing proposals TCP Incast problem & Existing proposals Outline The TCP Incast problem Existing proposals to TCP Incast deadline-agnostic Deadline-Aware Datacenter TCP deadline-aware Picasso Art is TLA 1. Deadline = 250ms

More information

IX: A Protected Dataplane Operating System for High Throughput and Low Latency

IX: A Protected Dataplane Operating System for High Throughput and Low Latency IX: A Protected Dataplane Operating System for High Throughput and Low Latency Adam Belay et al. Proc. of the 11th USENIX Symp. on OSDI, pp. 49-65, 2014. Presented by Han Zhang & Zaina Hamid Challenges

More information

Data Center TCP (DCTCP)

Data Center TCP (DCTCP) Data Center Packet Transport Data Center TCP (DCTCP) Mohammad Alizadeh, Albert Greenberg, David A. Maltz, Jitendra Padhye Parveen Patel, Balaji Prabhakar, Sudipta Sengupta, Murari Sridharan Cloud computing

More information

Flowtune: Flowlet Control for Datacenter Networks Jonathan Perry, Hari Balakrishnan, and Devavrat Shah

Flowtune: Flowlet Control for Datacenter Networks Jonathan Perry, Hari Balakrishnan, and Devavrat Shah Computer Science and Artificial Intelligence Laboratory Technical Report MIT-CSAIL-TR-2016-011 August 15, 2016 Flowtune: Flowlet Control for Datacenter Networks Jonathan Perry, Hari Balakrishnan, and Devavrat

More information

CS268: Beyond TCP Congestion Control

CS268: Beyond TCP Congestion Control TCP Problems CS68: Beyond TCP Congestion Control Ion Stoica February 9, 004 When TCP congestion control was originally designed in 1988: - Key applications: FTP, E-mail - Maximum link bandwidth: 10Mb/s

More information

6.033 Spring Lecture #12. In-network resource management Queue management schemes Traffic differentiation spring 2018 Katrina LaCurts

6.033 Spring Lecture #12. In-network resource management Queue management schemes Traffic differentiation spring 2018 Katrina LaCurts 6.033 Spring 2018 Lecture #12 In-network resource management Queue management schemes Traffic differentiation 1 Internet of Problems How do we route (and address) scalably, while dealing with issues of

More information

Gateware Defined Networking (GDN) for Ultra Low Latency Trading and Compliance

Gateware Defined Networking (GDN) for Ultra Low Latency Trading and Compliance Gateware Defined Networking (GDN) for Ultra Low Latency Trading and Compliance STAC Summit: Panel: FPGA for trading today: December 2015 John W. Lockwood, PhD, CEO Algo-Logic Systems, Inc. JWLockwd@algo-logic.com

More information

CS551 Router Queue Management

CS551 Router Queue Management CS551 Router Queue Management Bill Cheng http://merlot.usc.edu/cs551-f12 1 Congestion Control vs. Resource Allocation Network s key role is to allocate its transmission resources to users or applications

More information

Announcements. Congestion Control. A few words from Panda. Congestion Control Review. Load and Delay. Caveat: In this lecture

Announcements. Congestion Control. A few words from Panda. Congestion Control Review. Load and Delay. Caveat: In this lecture Announcements Project 3 is out! Congestion Control EE Fall 0 Scott Shenker http://inst.eecs.berkeley.edu/~ee/ Materials with thanks to Jennifer Rexford, Ion Stoica, Vern Paxson and other colleagues at

More information

Overview Computer Networking What is QoS? Queuing discipline and scheduling. Traffic Enforcement. Integrated services

Overview Computer Networking What is QoS? Queuing discipline and scheduling. Traffic Enforcement. Integrated services Overview 15-441 15-441 Computer Networking 15-641 Lecture 19 Queue Management and Quality of Service Peter Steenkiste Fall 2016 www.cs.cmu.edu/~prs/15-441-f16 What is QoS? Queuing discipline and scheduling

More information

Circuit Switching Under the Radar with REACToR

Circuit Switching Under the Radar with REACToR Circuit Switching Under the Radar with REACToR He Liu, Feng Lu, Alex Forencich, Rishi Kapoor Malveeka Tewari, Geoffrey M. Voelker George Papen, Alex C. Snoeren, George Porter 1 2 How to build 100G datacenter

More information

Low-Latency Datacenters. John Ousterhout Platform Lab Retreat May 29, 2015

Low-Latency Datacenters. John Ousterhout Platform Lab Retreat May 29, 2015 Low-Latency Datacenters John Ousterhout Platform Lab Retreat May 29, 2015 Datacenters: Scale and Latency Scale: 1M+ cores 1-10 PB memory 200 PB disk storage Latency: < 0.5 µs speed-of-light delay Most

More information

TCP EX MACHINA: COMPUTER-GENERATED CONGESTION CONTROL KEITH WINSTEIN AND HARI BALAKRISHNAN. Presented by: Angela Jiang

TCP EX MACHINA: COMPUTER-GENERATED CONGESTION CONTROL KEITH WINSTEIN AND HARI BALAKRISHNAN. Presented by: Angela Jiang TCP EX MACHINA: COMPUTER-GENERATED CONGESTION CONTROL KEITH WINSTEIN AND HARI BALAKRISHNAN Presented by: Angela Jiang Network congestion Cause: Sources trying to send data faster than the network can process

More information

Congestion Control. EE122 Fall 2012 Scott Shenker

Congestion Control. EE122 Fall 2012 Scott Shenker Congestion Control EE122 Fall 2012 Scott Shenker http://inst.eecs.berkeley.edu/~ee122/ Materials with thanks to Jennifer Rexford, Ion Stoica, Vern Paxson and other colleagues at Princeton and UC Berkeley

More information

Coupled Conges,on Control for RTP Media. Safiqul Islam, Michael Welzl, Stein Gjessing and Naeem Khademi Department of Informa,cs University of Oslo

Coupled Conges,on Control for RTP Media. Safiqul Islam, Michael Welzl, Stein Gjessing and Naeem Khademi Department of Informa,cs University of Oslo Coupled Conges,on Control for RTP Media Safiqul Islam, Michael Welzl, Stein Gjessing and Naeem Khademi Department of Informa,cs University of Oslo Problem statement Each Flow has its own Conges@on Control

More information

Better Never than Late: Meeting Deadlines in Datacenter Networks

Better Never than Late: Meeting Deadlines in Datacenter Networks Better Never than Late: Meeting Deadlines in Datacenter Networks Christo Wilson, Hitesh Ballani, Thomas Karagiannis, Ant Rowstron Microsoft Research, Cambridge User-facing online services Two common underlying

More information

Investigating the Use of Synchronized Clocks in TCP Congestion Control

Investigating the Use of Synchronized Clocks in TCP Congestion Control Investigating the Use of Synchronized Clocks in TCP Congestion Control Michele Weigle Dissertation Defense May 14, 2003 Advisor: Kevin Jeffay Research Question Can the use of exact timing information improve

More information

Queuing. Congestion Control and Resource Allocation. Resource Allocation Evaluation Criteria. Resource allocation Drop disciplines Queuing disciplines

Queuing. Congestion Control and Resource Allocation. Resource Allocation Evaluation Criteria. Resource allocation Drop disciplines Queuing disciplines Resource allocation Drop disciplines Queuing disciplines Queuing 1 Congestion Control and Resource Allocation Handle congestion if and when it happens TCP Congestion Control Allocate resources to avoid

More information

Chapter 4. Routers with Tiny Buffers: Experiments. 4.1 Testbed experiments Setup

Chapter 4. Routers with Tiny Buffers: Experiments. 4.1 Testbed experiments Setup Chapter 4 Routers with Tiny Buffers: Experiments This chapter describes two sets of experiments with tiny buffers in networks: one in a testbed and the other in a real network over the Internet2 1 backbone.

More information

DAQ: Deadline-Aware Queue Scheme for Scheduling Service Flows in Data Centers

DAQ: Deadline-Aware Queue Scheme for Scheduling Service Flows in Data Centers DAQ: Deadline-Aware Queue Scheme for Scheduling Service Flows in Data Centers Cong Ding and Roberto Rojas-Cessa Abstract We propose a scheme to schedule the transmission of data center traffic to guarantee

More information

6.033 Computer System Engineering

6.033 Computer System Engineering MIT OpenCourseWare http://ocw.mit.edu 6.033 Computer System Engineering Spring 2009 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms. L13: Sharing in network

More information

CS557: Queue Management

CS557: Queue Management CS557: Queue Management Christos Papadopoulos Remixed by Lorenzo De Carli 1 Congestion Control vs. Resource Allocation Network s key role is to allocate its transmission resources to users or applications

More information

Hardware Evolution in Data Centers

Hardware Evolution in Data Centers Hardware Evolution in Data Centers 2004 2008 2011 2000 2013 2014 Trend towards customization Increase work done per dollar (CapEx + OpEx) Paolo Costa Rethinking the Network Stack for Rack-scale Computers

More information

15-744: Computer Networking. Data Center Networking II

15-744: Computer Networking. Data Center Networking II 15-744: Computer Networking Data Center Networking II Overview Data Center Topology Scheduling Data Center Packet Scheduling 2 Current solutions for increasing data center network bandwidth FatTree BCube

More information

Computer Networking. Queue Management and Quality of Service (QOS)

Computer Networking. Queue Management and Quality of Service (QOS) Computer Networking Queue Management and Quality of Service (QOS) Outline Previously:TCP flow control Congestion sources and collapse Congestion control basics - Routers 2 Internet Pipes? How should you

More information

Congestion Control and Resource Allocation

Congestion Control and Resource Allocation Congestion Control and Resource Allocation Lecture material taken from Computer Networks A Systems Approach, Third Edition,Peterson and Davie, Morgan Kaufmann, 2007. Advanced Computer Networks Congestion

More information

High bandwidth, Long distance. Where is my throughput? Robin Tasker CCLRC, Daresbury Laboratory, UK

High bandwidth, Long distance. Where is my throughput? Robin Tasker CCLRC, Daresbury Laboratory, UK High bandwidth, Long distance. Where is my throughput? Robin Tasker CCLRC, Daresbury Laboratory, UK [r.tasker@dl.ac.uk] DataTAG is a project sponsored by the European Commission - EU Grant IST-2001-32459

More information

Implemen'ng IPv6 Segment Rou'ng in the Linux Kernel

Implemen'ng IPv6 Segment Rou'ng in the Linux Kernel Implemen'ng IPv6 Segment Rou'ng in the Linux Kernel David Lebrun, Olivier Bonaventure ICTEAM, UCLouvain Work supported by ARC grant 12/18-054 (ARC-SDN) and a Cisco grant Agenda IPv6 Segment Rou'ng Implementa'on

More information

BSDCan 2015 June 13 th Extensions to FreeBSD Datacenter TCP for Incremental Deployment Support. Midori Kato

BSDCan 2015 June 13 th Extensions to FreeBSD Datacenter TCP for Incremental Deployment Support. Midori Kato BSDCan 2015 June 13 th Extensions to FreeBSD Datacenter TCP for Incremental Deployment Support Midori Kato DCTCP has been available since FreeBSD 11.0!! 2 FreeBSD DCTCP highlight

More information

Overview. TCP & router queuing Computer Networking. TCP details. Workloads. TCP Performance. TCP Performance. Lecture 10 TCP & Routers

Overview. TCP & router queuing Computer Networking. TCP details. Workloads. TCP Performance. TCP Performance. Lecture 10 TCP & Routers Overview 15-441 Computer Networking TCP & router queuing Lecture 10 TCP & Routers TCP details Workloads Lecture 10: 09-30-2002 2 TCP Performance TCP Performance Can TCP saturate a link? Congestion control

More information

phost: Distributed Near-Optimal Datacenter Transport Over Commodity Network Fabric

phost: Distributed Near-Optimal Datacenter Transport Over Commodity Network Fabric phost: Distributed Near-Optimal Datacenter Transport Over Commodity Network Fabric Peter X. Gao petergao@berkeley.edu Rachit Agarwal ragarwal@berkeley.edu Akshay Narayan akshay@berkeley.edu Sylvia Ratnasamy

More information

Homa: A Receiver-Driven Low-Latency Transport Protocol Using Network Priorities

Homa: A Receiver-Driven Low-Latency Transport Protocol Using Network Priorities : A Receiver-Driven Low-Latency Transport Protocol Using Network Priorities Behnam Montazeri, Yilong Li, Mohammad Alizadeh, and John Ousterhout Stanford University, MIT ABSTRACT is a new transport protocol

More information

Advanced Computer Networks. Datacenter TCP

Advanced Computer Networks. Datacenter TCP Advanced Computer Networks 263 3501 00 Datacenter TCP Patrick Stuedi, Qin Yin, Timothy Roscoe Spring Semester 2015 1 Oriana Riva, Department of Computer Science ETH Zürich Last week Datacenter Fabric Portland

More information

Congestion Control In the Network

Congestion Control In the Network Congestion Control In the Network Brighten Godfrey cs598pbg September 9 2010 Slides courtesy Ion Stoica with adaptation by Brighten Today Fair queueing XCP Announcements Problem: no isolation between flows

More information

Quality of Service in the Internet. QoS Parameters. Keeping the QoS. Leaky Bucket Algorithm

Quality of Service in the Internet. QoS Parameters. Keeping the QoS. Leaky Bucket Algorithm Quality of Service in the Internet Problem today: IP is packet switched, therefore no guarantees on a transmission is given (throughput, transmission delay, ): the Internet transmits data Best Effort But:

More information

OpenDataPlane: network packet journey

OpenDataPlane: network packet journey OpenDataPlane: network packet journey Maxim Uvarov This presentation describes what is ODP. Touches common data types and APIs to work with network packets. Answers the question why ODP is needed and how

More information

New Lecture Schedule. Advanced Topics in Congestion Control. Office Hours This Week. Announcements. Project 3: Ask Panda

New Lecture Schedule. Advanced Topics in Congestion Control. Office Hours This Week. Announcements. Project 3: Ask Panda New Lecture Schedule Advanced Topics in Congestion Control EE122 Fall 2012 Scott Shenker http://inst.eecs.berkeley.edu/~ee122/ Materials with thanks to Jennifer Rexford, Ion Stoica, Vern Paxson and other

More information

NetSlices: Scalable Mul/- Core Packet Processing in User- Space

NetSlices: Scalable Mul/- Core Packet Processing in User- Space NetSlices: Scalable Mul/- Core Packet Processing in - Space Tudor Marian, Ki Suh Lee, Hakim Weatherspoon Cornell University Presented by Ki Suh Lee Packet Processors Essen/al for evolving networks Sophis/cated

More information

Advanced Topics in Congestion Control

Advanced Topics in Congestion Control Advanced Topics in Congestion Control EE122 Fall 2012 Scott Shenker http://inst.eecs.berkeley.edu/~ee122/ Materials with thanks to Jennifer Rexford, Ion Stoica, Vern Paxson and other colleagues at Princeton

More information

Network Processors and their memory

Network Processors and their memory Network Processors and their memory Network Processor Workshop, Madrid 2004 Nick McKeown Departments of Electrical Engineering and Computer Science, Stanford University nickm@stanford.edu http://www.stanford.edu/~nickm

More information

Overview. Lecture 22 Queue Management and Quality of Service (QoS) Queuing Disciplines. Typical Internet Queuing. FIFO + Drop tail Problems

Overview. Lecture 22 Queue Management and Quality of Service (QoS) Queuing Disciplines. Typical Internet Queuing. FIFO + Drop tail Problems Lecture 22 Queue Management and Quality of Service (QoS) Overview Queue management & RED Fair queuing Khaled Harras School of Computer Science niversity 15 441 Computer Networks Based on slides from previous

More information

ADVANCED TOPICS FOR CONGESTION CONTROL

ADVANCED TOPICS FOR CONGESTION CONTROL ADVANCED TOPICS FOR CONGESTION CONTROL Congestion Control The Internet only functions because TCP s congestion control does an effective job of matching traffic demand to available capacity. TCP s Window

More information

Resource allocation in networks. Resource Allocation in Networks. Resource allocation

Resource allocation in networks. Resource Allocation in Networks. Resource allocation Resource allocation in networks Resource Allocation in Networks Very much like a resource allocation problem in operating systems How is it different? Resources and jobs are different Resources are buffers

More information

Core-Stateless Fair Queueing: Achieving Approximately Fair Bandwidth Allocations in High Speed Networks. Congestion Control in Today s Internet

Core-Stateless Fair Queueing: Achieving Approximately Fair Bandwidth Allocations in High Speed Networks. Congestion Control in Today s Internet Core-Stateless Fair Queueing: Achieving Approximately Fair Bandwidth Allocations in High Speed Networks Ion Stoica CMU Scott Shenker Xerox PARC Hui Zhang CMU Congestion Control in Today s Internet Rely

More information

Fairness, Queue Management, and QoS

Fairness, Queue Management, and QoS Fairness, Queue Management, and QoS 15-441 Fall 2017 Profs Peter Steenkiste & Justine Sherry Slides borrowed from folks at CMU, Berkeley, and elsewhere. YINZ I AM GETTING T-SHIRTS If you TA for me next

More information

The Effects of Asymmetry on TCP Performance

The Effects of Asymmetry on TCP Performance The Effects of Asymmetry on TCP Performance Hari Balakrishnan Venkata N. Padmanabhan Randy H. Katz University of California at Berkeley Daedalus/BARWAN Retreat June 1997 Outline Overview Bandwidth asymmetry

More information

Friends, not Foes Synthesizing Existing Transport Strategies for Data Center Networks

Friends, not Foes Synthesizing Existing Transport Strategies for Data Center Networks Friends, not Foes Synthesizing Existing Transport Strategies for Data Center Networks Ali Munir, Ghufran Baig 2, Syed M. Irteza 2, Ihsan A. Qazi 2, Alex X. Liu, Fahad R. Dogar 3 Michigan State University,

More information

Computer Networks Spring 2017 Homework 2 Due by 3/2/2017, 10:30am

Computer Networks Spring 2017 Homework 2 Due by 3/2/2017, 10:30am 15-744 Computer Networks Spring 2017 Homework 2 Due by 3/2/2017, 10:30am (please submit through e-mail to zhuoc@cs.cmu.edu and srini@cs.cmu.edu) Name: A Congestion Control 1. At time t, a TCP connection

More information

Optical Packet Switching

Optical Packet Switching Optical Packet Switching DEISNet Gruppo Reti di Telecomunicazioni http://deisnet.deis.unibo.it WDM Optical Network Legacy Networks Edge Systems WDM Links λ 1 λ 2 λ 3 λ 4 Core Nodes 2 1 Wavelength Routing

More information