CS 655 System and Network Architectures and Implementation. Module 3 - Transport

Similar documents
Chapter 3 outline. 3.5 connection-oriented transport: TCP segment structure reliable data transfer flow control connection management

CS 3516: Advanced Computer Networks

CMSC 332 Computer Networks Reliable Data Transfer

CSC 4900 Computer Networks: Reliable Data Transport

Transport services and protocols. Chapter 3 outline. Internet transport-layer protocols Chapter 3 outline. Multiplexing/demultiplexing

Chapter 3 Transport Layer

CC451 Computer Networks

Data Communications & Networks. Session 6 Main Theme Reliable Data Transfer. Dr. Jean-Claude Franchitti

Chapter 3 Transport Layer

Transport layer. Our goals: Understand principles behind transport layer services: Learn about transport layer protocols in the Internet:

Rdt2.0: channel with packet errors (no loss!)

Transport layer: Outline

CMPE 150/L : Introduction to Computer Networks. Chen Qian Computer Engineering UCSC Baskin Engineering Lecture 8

Chapter 3 Transport Layer

TDTS06: Computer Networks

Chapter III: Transport Layer

Chapter 3 Transport Layer

Internet transport-layer protocols. Transport services and protocols. Sending and receiving. Connection-oriented (TCP) Connection-oriented

CS 3516: Computer Networks

Lecture 07 The Transport Layer (TCP & UDP) Dr. Anis Koubaa

CSC 8560 Computer Networks: Transport Layer

Chapter 3 outline. TDTS06 Computer networks. Principles of Reliable data transfer. Reliable data transfer: getting started

Computer Networks & Security 2016/2017

CSC 401 Data and Computer Communications Networks

CSC358 Week 4. Adapted from slides by J.F. Kurose and K. W. Ross. All material copyright J.F Kurose and K.W. Ross, All Rights Reserved

Lecture 10: Transpor Layer Principles of Reliable Data Transfer

Last time. Mobility in Cellular networks. Transport Layer. HLR, VLR, MSC Handoff. Introduction Multiplexing / demultiplexing UDP 14-1

COSC4377. Useful Linux Tool: screen

CSCE 463/612 Networks and Distributed Processing Spring 2018

Chapter 2: outline. 2.1 principles of network applications app architectures app requirements

rdt2.0 has a fatal flaw!

CSC 401 Data and Computer Communications Networks

Chapter 3 Transport Layer

Distributed Systems. 5. Transport Protocols. Werner Nutt

CSCD 330 Network Programming

Distributed Systems. 5. Transport Protocols

Announcement. Homework 1 due last night, how is that? Will discuss some problems in the lecture next week

Lecture 11: Transport Layer Reliable Data Transfer and TCP

Chapter 3: Transport Layer

Chapter 3: Transport Layer. Chapter 3 Transport Layer. Chapter 3 outline. Transport services and protocols

CSCE 463/612 Networks and Distributed Processing Spring 2018

Lecture 5. Transport Layer. Transport Layer 1-1

Chapter 3: Transport Layer Part A

Chapter 3 outline. Chapter 3: Transport Layer. Transport vs. network layer. Transport services and protocols. Internet transport-layer protocols

Chapter 3 Transport Layer

Chapter 3 Transport Layer

Chapter 3 Transport Layer

Chapter 3 Transport Layer

Computer Networks 1 (Mạng Máy Tính 1) Lectured by: Dr. Phạm Trần Vũ

Computer Networks. Transport Layer

Chapter 3 Transport Layer

Chapter 3: Transport Layer. Chapter 3 Transport Layer. Chapter 3 outline. Transport services and protocols

Chapter 3 Transport Layer

Chapter 3: Transport Layer

Transport Layer. CMPS 4750/6750: Computer Networks

Chapter 3. Transport Layer. Computer Networking: A Top Down Approach 5th edition. Jim Kurose, Keith Ross Addison-Wesley, April 2009.

Course on Computer Communication and Networks. Lecture 4 Chapter 3; Transport Layer, Part A

Course on Computer Communication and Networks. Lecture 4 Chapter 3; Transport Layer, Part A

Chapter 3 Transport Layer

Chapter 3: Transport Layer. Computer Networks. Transport Layer. Transport services and protocols. Chapter 3 outline. Bu-Ali Sina University, Hamedan

CSCI Computer Networks Fall 2016

Chapter 3: Transport Layer. Chapter 3 Transport Layer. Chapter 3 outline. Transport services and protocols

Chapter 3 Transport Layer. Chapter 3: Transport Layer. Chapter 3 outline. Our goals: understand principles behind transport layer services:

Chapter 3: Transport Layer

CSCI Computer Networks Spring 2017

Chapter 3 Transport Layer. Chapter 3: Transport Layer. Chapter 3 outline

Chapter 3: Transport Layer

Chapter 3 Transport Layer

COMP211 Chapter 3 Transport Layer

Transport services and protocols. Chapter 3 Transport Layer. Chapter 3: Transport Layer. Transport vs. network layer

Chapter III: Transport Layer

Chapter 3 Transport Layer

CSE 3214: Computer Network Protocols and Applications Transport Layer (Part 2) Chapter 3 outline. UDP checksum. Port numbers

Go-Back-N. Pipelining: increased utilization. Pipelined protocols. GBN: sender extended FSM

Chapter 3 Transport Layer

Chapter 3. Kultida Rojviboonchai, Ph.D. Dept. of Computer Engineering Faculty of Engineering Chulalongkorn University

Transport Layer. Chapter 3. Computer Networking: A Top Down Approach

Chapter 3 Transport Layer

Chapter 3 Transport Layer

Chapter 3: Transport Layer

Architettura di Reti

The Transport Layer Multiplexing, Error Detection, & UDP

Chapter 3 Transport Layer

Chapter 3: Transport Layer

Chapter 3 Transport Layer

Chapter 3 Transport Layer

Chapter 3 Transport Layer

EC441 Fall 2018 Introduction to Computer Networking Chapter 3: Transport Layer

CS/ECE 438: Communication Networks Fall Transport Layer

Chapter 3: Transport Layer. Chapter 3 Transport Layer. Transport Services and Protocols. Chapter 3 Outline

Chapter 3 Transport Layer

Chapter 3 Transport Layer

Chapter 3 Transport Layer

Chapter 3 Transport Layer

Chapter 3 Transport Layer

Lecture 3 The Transport Control Protocol (TCP) Antonio Cianfrani DIET Department Networking Group netlab.uniroma1.it

Transport Layer. Dr Ahmad Al-Zubi. Transport Layer 3-1

Chapter 3: Transport Layer. Chapter 3 Transport Layer. Transport layer. Position of transport layer. Transport layer.

Computer Networks. 3.Transport Layer. Transport Layer - 1

Chapter 3 Transport Layer

Transcription:

CS 655 System and Network Architectures and Implementation Module 3 - Transport Martin Karsten mkarsten@uwaterloo.ca 3-1

Notice Some slides and elements of slides are taken from third-party slide sets. In this module, parts are taken from the Kurose/Ross slide set. See detailed statement on next slide... 3-2

A note on the use of these ppt slides: We re making these slides freely available to all (faculty, students, readers). They re in PowerPoint form so you can add, modify, and delete slides (including this one) and slide content to suit your needs. They obviously represent a lot of work on our part. In return for use, we only ask the following: If you use these slides (e.g., in a class) in substantially unaltered form, that you mention their source (after all, we d like people to use our book!) If you post any slides in substantially unaltered form on a www site, that you note that they are adapted from (or perhaps identical to) our slides, and note our copyright of this material. Thanks and enjoy! JFK/KWR Computer Networking: A Top Down Approach 5 th edition. Jim Kurose, Keith Ross Addison-Wesley, April 2009. All material copyright 1996-2009 J.F Kurose and K.W. Ross, All Rights Reserved 3-3

Overview multiplexing, virtual channel reliable transmission flow and congestion control connection management and semantics 3-4

Networks Review network graph goal: facilitate any-to-any communication main concerns routing addressing scalability virtualization 3-5

Transport virtual channel end-to-end transmission performance reliable transmission rate control connection management 3-6

Hop by Hop vs. End to End? some services only hop by hop delay control throughput guarantees others also end to end multiplexing loss control reliable transmission rate control pinciple: if possible, use end to end 3-7

Multiplexing multiple logical sessions over same channel here: IP connectivity provides virtual channel transport session also provides virtual channel multiplexing encapsulation / stacking of multiplex label demultiplexing forwarding according to multiplex label decapsulation / remove multiplex label 3-8

Multiplexing Internet addressing convention: IP address network node address transport port transport session identifier Other Approaches virtual circuit integrated with network layer hop-by-hop transport service session identified locally by virtual circuit identifier 3-9

Layered Service app transport network data link physical logical end-end transport app transport network data link physical logical end-end transport network data link physical network data link physical app transport network data link physical network data link physical network data link physical network data link physical network data link physical app transport network data link physical 3-10

Operating System Integration OS implements transport protocol handles asynchronous execution provides send/receive queue at socket socket: named communication endpoint OS system calls create/remove sockets establish names, connections send/receive data more details later... 3-11

Multiplexing Multiple Sockets = socket = process application P3 P1 application P2 P4 application transport transport transport network network network link link link physical physical physical host 1 host 2 host 3 3-12

Multiplexing Single Socket P2 P3 P1 SP: 6428 DP: 9157 SP: 6428 DP: 5775 SP: 9157 SP: 5775 client IP: A DP: 6428 server IP: C DP: 6428 Client IP:B SP provides return address 3-13

Multiplexing Multiple Connections P1 P5 P2 P1P3 SP: 5775 DP: 80 S-IP: B D-IP:C SP: 9157 SP: 9157 client IP: A DP: 80 S-IP: A D-IP:C server IP: C DP: 80 S-IP: B D-IP:C Client IP:B 3-14

Reliable Transmission use acknowledgements to indicate receipt sender knows data has been received BUT: two-army problem look at functionality first, then performance 3-15

Principles of Reliable data transfer important in app., transport, link layers top-10 list of important networking topics! characteristics of unreliable channel will determine complexity of reliable data transfer protocol (rdt) 3-16

Principles of Reliable data transfer important in app., transport, link layers top-10 list of important networking topics! characteristics of unreliable channel will determine complexity of reliable data transfer protocol (rdt) 3-17

Principles of Reliable data transfer important in app., transport, link layers top-10 list of important networking topics! characteristics of unreliable channel will determine complexity of reliable data transfer protocol (rdt) 3-18

Reliable data transfer: getting started rdt_send(): called from above, (e.g., by app.). Passed data to deliver to receiver upper layer deliver_data(): called by rdt to deliver data to upper send side receive side udt_send(): called by rdt, to transfer packet over unreliable channel to receiver rdt_rcv(): called when packet arrives on rcv-side of channel 3-19

We ll: Reliable data transfer: getting started incrementally develop sender, receiver sides of reliable data transfer protocol (rdt) consider only unidirectional data transfer but control info will flow on both directions! use finite state machines (FSM) to specify sender, receiver event causing state transition actions taken on state transition state: when in this state next state uniquely determined by next event state 1 event actions state 2 3-20

Rdt1.0: reliable transfer over a reliable channel underlying channel perfectly reliable no bit errors no loss of packets separate FSMs for sender, receiver: sender sends data into underlying channel receiver read data from underlying channel Wait for call from above rdt_send(data) packet = make_pkt(data) udt_send(packet) Wait for call from below rdt_rcv(packet) extract (packet,data) deliver_data(data) sender receiver 3-21

Rdt2.0: channel with bit errors underlying channel may flip bits in packet checksum to detect bit errors the question: how to recover from errors: acknowledgements (ACKs): receiver explicitly tells sender that pkt received OK negative acknowledgements (NAKs): receiver explicitly tells sender that pkt had errors sender retransmits pkt on receipt of NAK new mechanisms in rdt2.0 (beyond rdt1.0): error detection receiver feedback: control msgs (ACK,NAK) rcvr->sender 3-22

rdt2.0: FSM specification rdt_send(data) snkpkt = make_pkt(data, checksum) udt_send(sndpkt) Wait for call from above Wait for ACK or NAK rdt_rcv(rcvpkt) && isnak(rcvpkt) udt_send(sndpkt) receiver rdt_rcv(rcvpkt) && corrupt(rcvpkt) udt_send(nak) rdt_rcv(rcvpkt) && isack(rcvpkt) Λ sender Wait for call from below rdt_rcv(rcvpkt) && notcorrupt(rcvpkt) extract(rcvpkt,data) deliver_data(data) udt_send(ack) 3-23

rdt2.0: operation with no errors rdt_send(data) snkpkt = make_pkt(data, checksum) udt_send(sndpkt) Wait for call from above Wait for ACK or NAK rdt_rcv(rcvpkt) && isnak(rcvpkt) udt_send(sndpkt) rdt_rcv(rcvpkt) && corrupt(rcvpkt) udt_send(nak) rdt_rcv(rcvpkt) && isack(rcvpkt) Λ Wait for call from below rdt_rcv(rcvpkt) && notcorrupt(rcvpkt) extract(rcvpkt,data) deliver_data(data) udt_send(ack) 3-24

rdt2.0: error scenario rdt_send(data) snkpkt = make_pkt(data, checksum) udt_send(sndpkt) Wait for call from above Wait for ACK or NAK rdt_rcv(rcvpkt) && isnak(rcvpkt) udt_send(sndpkt) rdt_rcv(rcvpkt) && corrupt(rcvpkt) udt_send(nak) rdt_rcv(rcvpkt) && isack(rcvpkt) Λ Wait for call from below rdt_rcv(rcvpkt) && notcorrupt(rcvpkt) extract(rcvpkt,data) deliver_data(data) udt_send(ack) 3-25

rdt2.0 has a fatal flaw! What happens if ACK/NAK corrupted? sender doesn t know what happened at receiver! can t just retransmit: possible duplicate Handling duplicates: sender retransmits current pkt if ACK/NAK garbled sender adds sequence number to each pkt receiver discards (doesn t deliver up) duplicate pkt stop and wait Sender sends one packet, then waits for receiver response 3-26

rdt2.1: sender, handles garbled ACK/NAKs rdt_rcv(rcvpkt) && notcorrupt(rcvpkt) && isack(rcvpkt) Λ rdt_rcv(rcvpkt) && ( corrupt(rcvpkt) isnak(rcvpkt) ) udt_send(sndpkt) rdt_send(data) sndpkt = make_pkt(0, data, checksum) udt_send(sndpkt) Wait for call 0 from above Wait for ACK or NAK 1 rdt_send(data) Wait for ACK or NAK rdt_rcv(rcvpkt) && ( corrupt(rcvpkt) isnak(rcvpkt) ) 0 udt_send(sndpkt) Wait for call 1 from above sndpkt = make_pkt(1, data, checksum) udt_send(sndpkt) rdt_rcv(rcvpkt) && notcorrupt(rcvpkt) && isack(rcvpkt) Λ 3-27

rdt2.1: receiver, handles garbled ACK/NAKs rdt_rcv(rcvpkt) && notcorrupt(rcvpkt) && has_seq0(rcvpkt) rdt_rcv(rcvpkt) && (corrupt(rcvpkt) sndpkt = make_pkt(nak, chksum) udt_send(sndpkt) rdt_rcv(rcvpkt) && not corrupt(rcvpkt) && has_seq1(rcvpkt) sndpkt = make_pkt(ack, chksum) udt_send(sndpkt) extract(rcvpkt,data) deliver_data(data) sndpkt = make_pkt(ack, chksum) udt_send(sndpkt) Wait for 0 from below Wait for 1 from below rdt_rcv(rcvpkt) && notcorrupt(rcvpkt) && has_seq1(rcvpkt) extract(rcvpkt,data) deliver_data(data) sndpkt = make_pkt(ack, chksum) udt_send(sndpkt) rdt_rcv(rcvpkt) && (corrupt(rcvpkt) sndpkt = make_pkt(nak, chksum) udt_send(sndpkt) rdt_rcv(rcvpkt) && not corrupt(rcvpkt) && has_seq0(rcvpkt) sndpkt = make_pkt(ack, chksum) udt_send(sndpkt) 3-28

rdt2.1: discussion Sender: seq # added to pkt two seq. # s (0,1) will suffice. Why? must check if received ACK/NAK corrupted twice as many states state must remember whether current pkt has 0 or 1 seq. # Receiver: must check if received packet is duplicate state indicates whether 0 or 1 is expected pkt seq # note: receiver can not know if its last ACK/NAK received OK at sender 3-29

rdt2.2: a NAK-free protocol same functionality as rdt2.1, using ACKs only instead of NAK, receiver sends ACK for last pkt received OK receiver must explicitly include seq # of pkt being ACKed duplicate ACK at sender results in same action as NAK: retransmit current pkt 3-30

rdt2.2: sender, receiver fragments rdt_rcv(rcvpkt) && (corrupt(rcvpkt) has_seq1(rcvpkt)) udt_send(sndpkt) rdt_send(data) sndpkt = make_pkt(0, data, checksum) udt_send(sndpkt) Wait for call 0 from above Wait for 0 from below sender FSM fragment receiver FSM fragment Wait for ACK 0 rdt_rcv(rcvpkt) && ( corrupt(rcvpkt) isack(rcvpkt,1) ) udt_send(sndpkt) rdt_rcv(rcvpkt) && notcorrupt(rcvpkt) && isack(rcvpkt,0) Λ rdt_rcv(rcvpkt) && notcorrupt(rcvpkt) && has_seq1(rcvpkt) extract(rcvpkt,data) deliver_data(data) sndpkt = make_pkt(ack1, chksum) udt_send(sndpkt) 3-31

rdt3.0: channels with errors and loss New assumption: underlying channel can also lose packets (data or ACKs) checksum, seq. #, ACKs, retransmissions will be of help, but not enough Approach: sender waits reasonable amount of time for ACK retransmits if no ACK received in this time if pkt (or ACK) just delayed (not lost): retransmission will be duplicate, but use of seq. # s already handles this receiver must specify seq # of pkt being ACKed requires countdown timer 3-32

rdt3.0 sender rdt_rcv(rcvpkt) && notcorrupt(rcvpkt) && isack(rcvpkt,1) stop_timer timeout rdt_rcv(rcvpkt) Λ udt_send(sndpkt) start_timer rdt_rcv(rcvpkt) && ( corrupt(rcvpkt) isack(rcvpkt,0) ) Λ Wait for call 0from above Wait for ACK1 rdt_send(data) sndpkt = make_pkt(0, data, checksum) udt_send(sndpkt) start_timer rdt_send(data) Wait for ACK0 Wait for call 1 from above sndpkt = make_pkt(1, data, checksum) udt_send(sndpkt) start_timer rdt_rcv(rcvpkt) && ( corrupt(rcvpkt) isack(rcvpkt,1) ) timeout udt_send(sndpkt) start_timer rdt_rcv(rcvpkt) && notcorrupt(rcvpkt) && isack(rcvpkt,0) stop_timer Λ rdt_rcv(rcvpkt) Λ 3-33

rdt3.0 in action 3-34

rdt3.0 in action 3-35

Performance of rdt3.0 rdt3.0 works, but performance stinks ex: 1 Gbps link, 15 ms prop. delay, 8000 bit packet: d trans = R L = 8000bits 10 bps 9 = 8microseconds U sender : utilization fraction of time sender busy sending U sender = L / R RTT +L / R =.008 30.008 = 0.00027 microsec 1KB pkt every 30 msec -> 33kB/sec thruput over 1 Gbps link network protocol limits use of physical resources! 3-36

rdt3.0: stop-and-wait operation sender receiver first packet bit transmitted, t = 0 last packet bit transmitted, t = L / R RTT first packet bit arrives last packet bit arrives, send ACK ACK arrives, send next packet, t = RTT + L / R U sender = L / R RTT +L / R =.008 30.008 = 0.00027 microsec 3-37

Pipelined protocols Pipelining: sender allows multiple, in-flight, yet-to-beacknowledged pkts range of sequence numbers must be increased buffering at sender and/or receiver Two generic forms of pipelined protocols: go-back-n, selective repeat 3-38

Pipelining: increased utilization sender receiver first packet bit transmitted, t = 0 last bit transmitted, t = L / R RTT ACK arrives, send next packet, t = RTT + L / R first packet bit arrives last packet bit arrives, send ACK last bit of 2 nd packet arrives, send ACK last bit of 3 rd packet arrives, send ACK Increase utilization by a factor of 3! U sender = 3 * L / R RTT +L / R =.024 30.008 = 0.0008 microsecon 3-39

Pipelining Protocols Go-back-N: overview sender: up to N unacked pkts in pipeline receiver: only sends cumulative ACKs doesn t ACK pkt if there s a gap sender: has timer for oldest unacked pkt if timer expires: retransmit all unacked packets Selective Repeat: overview sender: up to N unacked packets in pipeline receiver: ACKs individual pkts sender: maintains timer for each unacked pkt if timer expires: retransmit only unacked packet 3-40

Go-Back-N Sender: k-bit seq # in pkt header window of up to N, consecutive unacked pkts allowed ACK(n): ACKs all pkts up to, including seq # n - cumulative ACK may receive duplicate ACKs (see receiver) timer for each in-flight pkt timeout(n): retransmit pkt n and all higher seq # pkts in window 3-41

GBN: sender extended FSM rdt_send(data) Λ base=1 nextseqnum=1 rdt_rcv(rcvpkt) && corrupt(rcvpkt) if (nextseqnum < base+n) { sndpkt[nextseqnum] = make_pkt(nextseqnum,data,chksum) udt_send(sndpkt[nextseqnum]) if (base == nextseqnum) start_timer nextseqnum++ } else refuse_data(data) Wait timeout start_timer udt_send(sndpkt[base]) udt_send(sndpkt[base+1]) udt_send(sndpkt[nextseqnum-1]) rdt_rcv(rcvpkt) && notcorrupt(rcvpkt) base = getacknum(rcvpkt)+1 If (base == nextseqnum) stop_timer else start_timer 3-42

GBN: receiver extended FSM default udt_send(sndpkt) Λ Wait expectedseqnum=1 sndpkt = make_pkt(expectedseqnum,ack,chksum) rdt_rcv(rcvpkt) && notcurrupt(rcvpkt) && hasseqnum(rcvpkt,expectedseqnum) extract(rcvpkt,data) deliver_data(data) sndpkt = make_pkt(expectedseqnum,ack,chksum) udt_send(sndpkt) expectedseqnum++ ACK-only: always send ACK for correctly-received pkt with highest in-order seq # may generate duplicate ACKs need only remember expectedseqnum out-of-order pkt: discard (don t buffer) -> no receiver buffering! Re-ACK pkt with highest in-order seq # 3-43

GBN in action 3-44

Selective Repeat receiver individually acknowledges all correctly received pkts buffers pkts, as needed, for eventual in-order delivery to upper layer sender only resends pkts for which ACK not received sender timer for each unacked pkt sender window N consecutive seq # s again limits seq #s of sent, unacked pkts 3-45

Selective repeat: sender, receiver windows 3-46

Selective repeat sender data from above : if next available seq # in window, send pkt timeout(n): resend pkt n, restart timer ACK(n) in [sendbase,sendbase+n]: mark pkt n as received if n smallest unacked pkt, advance window base to next unacked seq # pkt n in [rcvbase, rcvbase+n-1] send ACK(n) out-of-order: buffer in-order: deliver (also deliver buffered, in-order pkts), advance window to next not-yet-received pkt pkt n in [rcvbase-n,rcvbase-1] ACK(n) otherwise: receiver ignore 3-47

Selective repeat in action 3-48

Selective repeat: dilemma Example: seq # s: 0, 1, 2, 3 window size=3 receiver sees no difference in two scenarios! incorrectly passes duplicate data as new in (a) Q: what relationship between seq # size and window size? 3-49

Flow Control: TCP flow control receive side of TCP connection has a receive buffer: sender won t overflow receiver s buffer by transmitting too much, too fast IP datagrams (currently) unused buffer space TCP data (in buffer) application process speed-matching service: matching send rate to receiving application s drain rate app process may be slow at reading from buffer 3-50

TCP Flow Control: how it works IP datagrams (suppose TCP receiver discards out-of-order segments) unused buffer space: = rwnd (currently) unused buffer space rwnd RcvBuffer TCP data (in buffer) = RcvBuffer-[LastByteRcvd - LastByteRead] application process receiver: advertises unused buffer space by including rwnd value in segment header sender: limits # of unacked bytes to rwnd guarantees receiver s buffer doesn t overflow 3-51

Congestion Control decoupled network and transport service: multiple senders might overwhelm routers => packet delay and loss certain situations: congestion collapse another goal: fair sharing of network resources 3-52

Overload without Reliability one router, infinite buffers no retransmission Host B Host A λ in : original data unlimited shared output link buffers λ out large delays maximum throughput 3-53

Overload with Reliability one router, finite buffers retransmission of lost packets Host A λ in : original data λ' in : original data, plus retransmitted data λ out Host B finite shared output link buffers 3-54

Overload with Reliability a) perfect send rate b) finite buffer -> loss & retransmission c) retransmission too eager (timeout to small) R/2 R/2 R/2 λ out λ out λ out R/4 λ in R/2 λ in R/2 λ in R/2 a. b. c. 3-55

Circular Bottlenecks Host A λ in : original data λ' in : original data, plus retransmitted data finite shared output link buffers λ out Host B 3-56

Congestion Collapse H o s t A λ o u t H o s t B packet drop after upstream bottleneck => upstream capacity wasted 3-57

Congestion Control senders must control rate to avoid permanent network overload input signals? direct feedback from network? overhead! indirect feedback through receiver? reaction time! TCP congestion control use drop or packet marking to indicate overload 3-58

TCP Congestion Control limit depth of pipeline congestion window sending rate (roughly): CongWin / RTT adjust CongWin based on network feedback sender infers packet loss duplicate ACK -> assume light overload timeout -> assume severe overload adaptation regimes/phases slow start: start at very small rate, increase fast congestion avoidance: hold rate, increase slow 3-59

TCP Slow Start start with small fixed CongWin increase exponentially until first loss double CongWin every RTT, i.e.: increment CongWin for every ACK start slow, but increase fast RTT Host A Host B one segment two segments four segments time 3-60

TCP Congestion Avoidance regular operation (no loss): increase CongWin by fixed amount per RTT receiver detects missing segment -> send duplicate ACK for previous one sender receives 3 duplicate ACKs -> reduce CongWin in half but after sender timeout: -> restart Slow Start procedure 3-61

TCP Rate Control Slow Start -> Congestion Avoidance based on threshold (CongWin/2 of last timeout) 3-62

TCP Fairness Two competing sessions: Additive increase gives slope of 1, as throughout increases multiplicative decrease decreases throughput proportionally R equal bandwidth share Connection 2 throughput Connection 1 throughput loss: decrease window by factor of 2 congestion avoidance: additive increase loss: decrease window by factor of 2 congestion avoidance: additive increase R 3-63

TCP Discussion hybrid of Go-Back-N and Selective Repeat SACK: more precise acknowledgements (limited) reduction of CongWin -> pause sending until ACKs catch up with outstanding data refinements fast retransmit & fast recovery -> resume sending faster during dupack losses keep sending at ACK-clocked pace 3-64

TCP Discussion TCP fairness relies on configuration values initial window for slow start additive increase during congestion avoidance -> problem with highspeed / long-delay links more agile congestion control -> robustness? per-session fairness? lots of other approaches in the literature very little real-world adoption 3-65

Connection Management: TCP Connection Establishment: 3-Way Handshake Step 1: initiator sends SYN to responder sets up initial variables, e.g., sequence number Step 2: responder responds with SYNACK sets up initial variables, e.g., sequence number responder allocates internal buffer Step 3: initiator responds with ACK might already send data along 3-66

Connection Management: TCP Connection Teardown Be Aware of Reliability! Step 1: client sends TCP FIN to server Step 2: server responds with ACK & sends FIN FIN -> no more data ACK -> all data received close timed wait closed client FIN ACK FIN ACK server close 3-67

Connection Management: TCP Step 3: client responds with ACK enters TIMED WAIT closing client FIN server in case ACK is lost Step 4: server receives ACK connection closed timed wait closed ACK FIN ACK closing closed 3-68

Interface Semantics message interface message boundaries preserved across transport byte-stream interface message boundaries not preserved simpler and more flexible for implementation 3-69