Attaining the Promise and Avoiding the Pitfalls of TCP in the Datacenter. Glenn Judd Morgan Stanley

Similar documents
Transport Protocols for Data Center Communication. Evisa Tsolakou Supervisor: Prof. Jörg Ott Advisor: Lect. Pasi Sarolahti

Cloud e Datacenter Networking

Data Center TCP(DCTCP)

Data Center TCP (DCTCP)

TCP Incast problem Existing proposals

DIBS: Just-in-time congestion mitigation for Data Centers

EXPERIENCES EVALUATING DCTCP. Lawrence Brakmo, Boris Burkov, Greg Leclercq and Murat Mugan Facebook

TCP LoLa Toward Low Latency and High Throughput Congestion Control

Problem 7. Problem 8. Problem 9

TCP so far Computer Networking Outline. How Was TCP Able to Evolve

Lecture 15: Datacenter TCP"

Data Center TCP (DCTCP)

CS 268: Lecture 7 (Beyond TCP Congestion Control)

CS268: Beyond TCP Congestion Control

Data center Networking: New advances and Challenges (Ethernet) Anupam Jagdish Chomal Principal Software Engineer DellEMC Isilon

Advanced Computer Networks. Datacenter TCP

Overview. TCP & router queuing Computer Networking. TCP details. Workloads. TCP Performance. TCP Performance. Lecture 10 TCP & Routers

Router s Queue Management

Network Management & Monitoring

A Hybrid Systems Modeling Framework for Fast and Accurate Simulation of Data Communication Networks. Motivation

Wireless TCP Performance Issues

Toward a Reliable Data Transport Architecture for Optical Burst-Switched Networks

WRED Explicit Congestion Notification

Hybrid Control and Switched Systems. Lecture #17 Hybrid Systems Modeling of Communication Networks

One More Bit Is Enough

CS 5520/ECE 5590NA: Network Architecture I Spring Lecture 13: UDP and TCP

Computer Networking. Queue Management and Quality of Service (QOS)

TCP EX MACHINA: COMPUTER-GENERATED CONGESTION CONTROL KEITH WINSTEIN AND HARI BALAKRISHNAN. Presented by: Angela Jiang

BSDCan 2015 June 13 th Extensions to FreeBSD Datacenter TCP for Incremental Deployment Support. Midori Kato

Congestion Control for High-Bandwidth-Delay-Product Networks: XCP vs. HighSpeed TCP and QuickStart

Packet Scheduling in Data Centers. Lecture 17, Computer Networks (198:552)

On the Effectiveness of CoDel in Data Centers

Congestion Control 3/16/09

Congestion Control for High Bandwidth-delay Product Networks. Dina Katabi, Mark Handley, Charlie Rohrs

a) Adding the two bytes gives Taking the one s complement gives

Better Never than Late: Meeting Deadlines in Datacenter Networks

Congestion / Flow Control in TCP

XCo: Explicit Coordination for Preventing Congestion in Data Center Ethernet

CS 349/449 Internet Protocols Final Exam Winter /15/2003. Name: Course:

CS644 Advanced Networks

Computer Networks Spring 2017 Homework 2 Due by 3/2/2017, 10:30am

TCP Congestion Control

TCP Congestion Control

The Network Layer and Routers

Preprint. Until published, please cite as:

ADVANCED TOPICS FOR CONGESTION CONTROL

Mahout: Low-Overhead Datacenter Traffic Management using End-Host-Based Elephant Detection. Vasileios Dimitrakis

Probe or Wait : Handling tail losses using Multipath TCP

Low Latency Low Loss Scalable throughput (L4S) and RACK an opportunity to remove HoL blocking from links

2.993: Principles of Internet Computing Quiz 1. Network

Impact of transmission errors on TCP performance. Outline. Random Errors

Tuning RED for Web Traffic

Congestion Control In the Network

15-744: Computer Networking TCP

Computer Networking

Congestion Control for High Bandwidth-delay Product Networks

15-744: Computer Networking. Overview. Queuing Disciplines. TCP & Routers. L-6 TCP & Routers

Answers to Sample Questions on Transport Layer

Principles of congestion control

! Network bandwidth shared by all users! Given routing, how to allocate bandwidth. " efficiency " fairness " stability. !

TCP Sendbuffer Advertising. Costin Raiciu University Politehnica of Bucharest

Ultra high-speed transmission technology for wide area data movement

A report on a few steps in the evolution of congestion control. Sally Floyd June 10, 2002 IPAM Program in Large Scale Communication Networks

Random Early Detection (RED) gateways. Sally Floyd CS 268: Computer Networks

CS 268: Computer Networking

ICS 451: Today's plan. Sliding Window Reliable Transmission Acknowledgements Windows and Bandwidth-Delay Product Retransmission Timers Connections

Congestion Control in Datacenters. Ahmed Saeed

Internet-Draft. G. Judd Morgan Stanley July 8, 2016

The War Between Mice and Elephants

Lecture 14: Congestion Control"

XCP: explicit Control Protocol

TRANSMISSION CONTROL PROTOCOL

Congestion control in TCP

Congestion Control In The Internet Part 2: How it is implemented in TCP. JY Le Boudec 2015

Congestion Control In The Internet Part 2: How it is implemented in TCP. JY Le Boudec 2014

8. TCP Congestion Control

WRED-Explicit Congestion Notification

WB-RTO: A Window-Based Retransmission Timeout. Ioannis Psaras, Vassilis Tsaoussidis Demokritos University of Thrace, Xanthi, Greece

Fast Retransmit. Problem: coarsegrain. timeouts lead to idle periods Fast retransmit: use duplicate ACKs to trigger retransmission

Computer Networks. Course Reference Model. Topic. Congestion What s the hold up? Nature of Congestion. Nature of Congestion 1/5/2015.

Chapter 3 Review Questions

Transport Layer Congestion Control

Congestion Control In The Internet Part 2: How it is implemented in TCP. JY Le Boudec 2014

Transport Layer Protocols TCP

Performance Analysis of TCP Variants

Network Performance: Queuing

Router participation in Congestion Control. Techniques Random Early Detection Explicit Congestion Notification

Appendix B. Standards-Track TCP Evaluation

Equation-Based Congestion Control for Unicast Applications. Outline. Introduction. But don t we need TCP? TFRC Goals

RPT: Re-architecting Loss Protection for Content-Aware Networks

CMPE 257: Wireless and Mobile Networking

TCP and BBR. Geoff Huston APNIC

CS519: Computer Networks. Lecture 5, Part 4: Mar 29, 2004 Transport: TCP congestion control

TCP. CSU CS557, Spring 2018 Instructor: Lorenzo De Carli (Slides by Christos Papadopoulos, remixed by Lorenzo De Carli)

Improving the Robustness of TCP to Non-Congestion Events

Multi-Hop ARQ 林咨銘 2006/8/24.

TCP on High-Speed Networks

Investigating the Use of Synchronized Clocks in TCP Congestion Control

Chapter 24 Congestion Control and Quality of Service 24.1

TCP and BBR. Geoff Huston APNIC

Transcription:

Attaining the Promise and Avoiding the Pitfalls of TCP in the Datacenter Glenn Judd Morgan Stanley 1

Introduction Datacenter computing pervasive Beyond the Internet services domain BigData, Grid Computing, Cloud etc. Communication is essential Demand is increasing TCP Leverage existing base of TCP applications Avoid re-inventing flow control, congestion control, etc. Not designed with datacenter computing in mind Internet: propagation delay >>> queueing delay Datacenter: propagation delay <<< queueing delay 2

Outline Background TCP Datacenter Challenges Latency & Application Coupling Incast DCTCP Deployment Challenges Performance Receive Buffer Tuning Related Work Conclusion 3

Application Types Large-scale computation Monte Carlo simulation Data analysis Latency-sensitive computation End-user-facing Multiple groups sharing the datacenter 4

Fraction Complete Task Completion PDF Motivation Workarounds Rerun slow tasks Overlap computation Underlying Cause I/O Network Cross-application Interference Time 5

Outline Background TCP Datacenter Challenges Latency & Application Coupling Incast DCTCP Deployment Challenges Performance Receive Buffer Tuning Related Work Conclusion 6

Latency & Application Coupling Sender Sender Switch Receiver All traffic transiting queue incurs latency By design, TCP fills queues Even simple congestion Latency microseconds milliseconds Application coupling Shared congested link Buffer pressure Disparate groups coupled through network 7

Incast Sender L = Link bandwidth Sender n * L Switch L Receiver Sender Sender n senders Discards Triggers timeouts Exacerbated by synchronized requests Common in distributed storage 8

Impact of Timeouts Majority of delay eliminated by Reducing RTOmin But: Zero delay preferred Latency remains 9

Outline Background TCP Datacenter Challenges Latency & Application Coupling Incast DCTCP Deployment Challenges Performance Receive Buffer Tuning Related Work Conclusion 10

DCTCP Key idea: reduce queue occupancy Reduced loss Reduce timeouts Reduced queueing delay Mechanism Leverage ECN RED/AQM capabilities Available in current technology Can it work in production? At scale? 11

Coexistence with TCP TCP DCTCP 10 Gbps 10 Gbps Switch 10 Gbps Receiver 12

Coexistence with TCP TCP DCTCP 13

RED-ECN AQM Implementation Marking Threshold TCP TCP DCTCP CE DCTCP CE TCP DCTCP DCTCP packets marked TCP packets discarded Solution: Segregate DCTCP traffic using QoS 14

Connection Establishment SYNs ECT? Paper:? Implementation: No 15

Non-technical Challenges Network administrators are necessarily risk averse You want to do what?! Keys to deploying DCTCP Demonstrated reduction in coupling Primum non nocere Support for TCP and DCTCP coeixistence Timing 16

Outline Background TCP Datacenter Challenges Latency & Application Coupling Incast DCTCP Deployment Challenges Performance Receive Buffer Tuning Related Work Conclusion 17

Throughput, Fairness, & Latency 19 senders Sender Sender Sender 190 Gbps aggregate Switch 10 Gbps Receiver Sender Latency measurements 18

Throughout (Mbps) Throughput and Fairness 800 700 600 500 400 300 200 100 0 TCP Mean throughput Error bars show maximum and minimum DCTCP 19

Latency Latency in ms DCTCP latency significantly lower than TCP 20

Stability, Convergence, & Fairness: TCP Flows Introduced one at a time Removed one at a time 21

Stability, Convergence, & Fairness: DCTCP DCTCP is better! Contradicts analytical results? 22

Stability, Convergence, & Fairness: TCP, 2 Flows Digging deeper: TCP is performing very poorly 23

Receive Buffer Tuning Off: TCP 24

Scale Retransmissions Min. Load Senders Total (Mbps) RTT (ms) Total % Timeouts (Gbps) 100 9,901 1.6 0 0 0 3.27 200 9,900 3.11 0 0 0 6.55 300 9,901 4.38 3 0 0 9.82 400 9,894 4.42 702 4.6 274 13.09 500 9,895 4.44 1,110 8.7 655 16.36 25

Conclusion TCP has significant performance problems in datacenters Latency & Application Coupling Incast Lowest feasible RTOmin DCTCP Receive buffer tuning DCTCP Requires changes to work in production Coexistence Connection establishment Has scaling limits, but significantly improves performance In production Coming soon to your favorite OS 26

Questions? 27