Evaluation of inter-cluster data transfer on grid environment

Similar documents
Exercises TCP/IP Networking With Solutions

Hybrid Control and Switched Systems. Lecture #17 Hybrid Systems Modeling of Communication Networks

No, the bogus packet will fail the integrity check (which uses a shared MAC key).!

Multi-class Applications for Parallel Usage of a Guaranteed Rate and a Scavenger Service

NET ID. CS519, Prelim (March 17, 2004) NAME: You have 50 minutes to complete the test. 1/17

UNIT IV -- TRANSPORT LAYER

STUDY OF SOCKET PROGRAMMING AND CLIENT SERVER MODEL

Performance Analysis of TCP Variants

Congestion Control In The Internet Part 2: How it is implemented in TCP. JY Le Boudec 2014

Reliable Transport II: TCP and Congestion Control

Congestion Control In The Internet Part 2: How it is implemented in TCP. JY Le Boudec 2014

Computer Networking

Congestion. Can t sustain input rate > output rate Issues: - Avoid congestion - Control congestion - Prioritize who gets limited resources

TCP Congestion Control

Congestion Control In The Internet Part 2: How it is implemented in TCP. JY Le Boudec 2015

Outline Computer Networking. TCP slow start. TCP modeling. TCP details AIMD. Congestion Avoidance. Lecture 18 TCP Performance Peter Steenkiste

Bandwidth Allocation & TCP

Chapter 3 Transport Layer

Reasons not to Parallelize TCP Connections for Fast Long-Distance Networks

Answers to Sample Questions on Transport Layer

IJSRD - International Journal for Scientific Research & Development Vol. 2, Issue 03, 2014 ISSN (online):

ECEN Final Exam Fall Instructor: Srinivas Shakkottai

The War Between Mice and Elephants

CS 557 Congestion and Complexity

CMSC 417. Computer Networks Prof. Ashok K Agrawala Ashok Agrawala. October 30, 2018

The Effects of Asymmetry on TCP Performance

2. Traffic. Contents. Offered vs. carried traffic. Characterisation of carried traffic

Computer Networked games

TCP so far Computer Networking Outline. How Was TCP Able to Evolve

2. Traffic lect02.ppt S Introduction to Teletraffic Theory Spring

Overview. TCP & router queuing Computer Networking. TCP details. Workloads. TCP Performance. TCP Performance. Lecture 10 TCP & Routers

Wireless TCP Performance Issues

end systems, access networks, links circuit switching, packet switching, network structure

Transport Layer PREPARED BY AHMED ABDEL-RAOUF

Edge versus Host Pacing of TCP Traffic in Small Buffer Networks

Congestion control in TCP

Investigating the Use of Synchronized Clocks in TCP Congestion Control

Solution to Question 1: ``Quickies'' (25 points, 15 minutes)

Variable Step Fluid Simulation for Communication Network

Chapter 3 outline. 3.5 Connection-oriented transport: TCP. 3.6 Principles of congestion control 3.7 TCP congestion control

Computer Network Fundamentals Spring Week 10 Congestion Control Andreas Terzis

Question Points Score total 100

Q23-5 In a network, the size of the receive window is 1 packet. Which of the follow-ing protocols is being used by the network?

Chapter 3 Review Questions

CS519: Computer Networks. Lecture 5, Part 4: Mar 29, 2004 Transport: TCP congestion control

Chapter III: Transport Layer

CSE/EE 461 Lecture 16 TCP Congestion Control. TCP Congestion Control

Switching Networks (Fall 2010) EE 586 Communication and. August 30, Lecture 3. (modified by Cheung for EE586; based on K&R original) 1-1

Solutions to Select Exercises

Congestion Control In The Internet Part 2: How it is implemented in TCP. JY Le Boudec 2015

The Present and Future of Congestion Control. Mark Handley

Student ID: CS457: Computer Networking Date: 3/20/2007 Name:

Chapter III. congestion situation in Highspeed Networks

PERFORMANCE COMPARISON OF THE DIFFERENT STREAMS IN A TCP BOTTLENECK LINK IN THE PRESENCE OF BACKGROUND TRAFFIC IN A DATA CENTER

Department of EECS - University of California at Berkeley EECS122 - Introduction to Communication Networks - Spring 2005 Final: 5/20/2005

Chapter 3 Transport Layer

Communication Networks

TCP congestion control:

10 Reasons your WAN is Broken

CMSC 417. Computer Networks Prof. Ashok K Agrawala Ashok Agrawala. October 11, 2018

Computer Network. Direct Link Networks Reliable Transmission. rev /2/2004 1

ECE 610: Homework 4 Problems are taken from Kurose and Ross.

Homework #4. Due: December 2, 4PM. CWND (#pkts)

CS3600 SYSTEMS AND NETWORKS

6.033 Computer System Engineering

Transmission Control Protocol. ITS 413 Internet Technologies and Applications

Evaluation of Advanced TCP Stacks on Fast Long-Distance Production Networks p. 1

Data transfer over the wide area network with a large round trip time

Multiple unconnected networks

Communication Networks

Congestion Control In The Internet Part 2: How it is implemented in TCP. JY Le Boudec 2017

End-to-End Mechanisms for QoS Support in Wireless Networks

Fast Retransmit. Problem: coarsegrain. timeouts lead to idle periods Fast retransmit: use duplicate ACKs to trigger retransmission

TCP/IP Performance ITL

XCo: Explicit Coordination for Preventing Congestion in Data Center Ethernet

CUBIC. Qian HE (Steve) CS 577 Prof. Bob Kinicki

Advanced Computer Networks

CS CS COMPUTER NETWORKS CS CS CHAPTER 6. CHAPTER 6 Congestion Control

Today s Agenda. Today s Agenda 9/8/17. Networking and Messaging

Improving Application Performance and Predictability using Multiple Virtual Lanes in Modern Multi-Core InfiniBand Clusters

CSE 461. TCP and network congestion

CS4700/CS5700 Fundamentals of Computer Networks

Congestion Control In the Network

Circuit Breakers for Multimedia Congestion Control

There are 10 questions in total. Please write your SID on each page.

A Survey on the Performance of Parallel Downloading. J. L. Chiang April 7, 2005

Transport Protocols for Networked Games

Chapter I: Introduction

A TRANSPORT PROTOCOL FOR DEDICATED END-TO-END CIRCUITS

Competing Tahoe Senders Adam Stambler Brian Do Group 7 12/7/2010. Computer and Communication Networks : 14:332:423

Transmission Control Protocol (TCP)

Maelstrom: An Enterprise Continuity Protocol for Financial Datacenters

Congestion Control. Resource allocation and congestion control problem

Lecture 11. Transport Layer (cont d) Transport Layer 1

Homework 1. Question 1 - Layering. CSCI 1680 Computer Networks Fonseca

CS 349/449 Internet Protocols Final Exam Winter /15/2003. Name: Course:

CS 421: COMPUTER NETWORKS SPRING FINAL May 24, minutes. Name: Student No: TOT

CS644 Advanced Networks

Lecture 21: Congestion Control" CSE 123: Computer Networks Alex C. Snoeren

Transport Layer TCP / UDP

Transcription:

Evaluation of inter-cluster data transfer on grid environment Shoji Ogura *1, Satoshi Matsuoka *1,*2, Hidemoto Nakada *3,*1 *1 :Tokyo Institute of Technology *2 :National Institute of Informatics *3 :National Institute of Advanced Industrial Science and Technology

Background Large-scale cluster nodes are becoming mainstream platform for Grid computing CERN DataGrid Project Construct a data processing fabric Processing petascale data Large scale transfer of massive data MPI jobs running across multiple clusters on the Grid Demands on inter-cluster data transfer Tier 0 Tier 1 Tier 2 Tier 3 CERN Computer Center Regional Center Regional Center Regional Center 2.4Gbits/sec Tier2 Center Tier2 Center Tier2 Center Tier2 Center Tier2 Center 622Mbits/sec Institute Institute Institute InstituteInstitute

Inter-cluster data transfer No tools that optimize inter-cluster data transfer (or we don t know) Can adapt the peer-to-peer method of optimizing data transfer to inter-cluster data transfer

Overview Our goal Propose an efficient transfer scheme when the peers are clusters Overview of this paper Propose a method of coordinated transfer that determines parameters statically by simple model Validate the model by simulating inter-cluster data transfer The model is improved insufficient Propose an efficient data transfer method that adjusts parameters dynamically

TCP s problems on Grid environment Behavior of TCP window size Slow start phase When window size is minimum value Increase of TCP window size exponentially until a packet loss occurs Congestion avoidance phase Increase of TCP window size one by one TCP window size is halved when a packet loss occurs Upper limit of TCP window size Limited to a constant value (in the early TCP, the value is 64KB) bandwidth the window size limitation / RTT A packet loss Increase exponentially 512Kbits Halve window size Increase one by one Transmitter 10Mbps x 100ms = 1Mbit Receiver

Existing methods of exploiting TCP bandwidth of peer-to-peer transfer Window size tuning Change the upper limit of TCP s window size For extremely high-bandwidth networks, the TCP window size becomes substantial To recover the appropriate window size will take considerable amount of time because of packet losses Using UDP Compensate for the lost packets later Devastate most competing TCP transmissions SABUL, FOBS, Tsunami, Network Striping Transfer data in parallel, using multiple sockets PSockets, GridFTP,

Adapting P2P data transfer methods to cluster-to-cluster data transfer All peers attempt to optimize the transfer individually TCP s window size may become excessively large The network pipe may be stuffed with excessive packets Packet losses increase P2P data transfer optimization methods may be insufficient

Coordinated striping for peer-topeer transfer between clusters Coordinated transfers are needed Use Network Striping Able to develop at the application level Less greedy than Using UDP Propose the use of striping properly coordinated across the cluster nodes Method of determining the number of stripes Determine the parameters statically, using simple model Simple model with static network parameters (e.g., bandwidth, RTT) Able to utilize the current existing method Probably less overhead Determine the parameters dynamically

Simple modeling of optimal parameters Simple modeling Available data on the link RTT x Bandwidth The amount of data transferred The number of stripes x Maximum window size Optimal number of stripes (RTT x Bandwidth) / Maximum window size stripes bandwidth RTT Maximum window size The formula denotes that the optimal number of stripes is proportional to RTT Confirm the validity of this simple modeling by simulations

Simulating data transfer between clusters on the Grid NS Network Simulator ver.2.1b8 A discrete event simulator dedicated to simulating various network behaviors Providing various protocols for different networking layers Low-level unicast/multicast protocols Standard TCP protocols HTTP,FTP, Our settings The sizes of transmitted data are identical among the nodes(8mbytes) The data on each node is striped to be the same size The number of nodes are varied 116, stripes 164 Between the node to router: latency 0.3ms, bandwidth 100Mbps Between the routers: latency 10ms320ms, bandwidth 1.5Mbps1Gbps When the bandwidths of the network link between routers are 100Mbps, 1Gbps, the bottleneck link is between the node to router

Single-Node Transfers Increase in transfer time The transfer time initially decreases as we increase the number of stripes Increase in transfer time at some point (approximately 50 stripes) But these changes are slight

The amount of data transmitted versus amount of data retransmitted Significant rise in the retransmitted data Packet losses cause decrease of performance

Multiple node transfers over low bandwidth link 10Mbps, 8nodes Catastrophic loss in performance Even with striping, uncoordinated transfer that would saturate the network will have unacceptable performance due to explosive increase in packet loss

Optimal number of stripes over low bandwidth link Very irregular Difficult to determine the trend given the network bandwidth/delay and number of nodes

Multiple node transfers over high bandwidth link (100Mbps, 8/16nodes) Overstriping causes catastrophic bandwidth loss Data transmission time at 16nodes data transfers is almost same to one at 8nodes, twice number of stripes When the bottleneck resides in the network between routers, the product of optimal number of stripes and nodes may be constant

Optimal number of stripes over high bandwidth links (100M/1Gbps, 1/8/16nodes) The optimal number of stripes increases as the delay-bandwidth product grows In 100Mbps-1node 1000Mbps-1node 1000Mbps-8node graphs, the bottleneck of the link is the same (between node router links)

Comparison with simple modeling The formula denotes that the optimal number of stripes is proportional to latency, but is not so in the simulation

Proposal for dynamic parameter tuning Difficult to apply the simple model above to determining the optimal number of stripes Need to determine the parameters dynamically Adjust the maximum stripe size dynamically A coordinated fashion across the nodes

Dynamic parameter tuning Previously opening multiple sockets How to determine the parameters 1. Using observable dynamic values 2. By running simulations during transfer 3. Using periodic observable dynamic values

1. Using observable dynamic values Tune the parameters such as the number of nodes or stripes dynamically during transfer, using observable dynamic values of the transfer (e.g., packet loss ratio) Apply an extended and corrected model Using dynamic parameters

2. By running simulations during transfer Tune the parameters such as the number of nodes dynamically by running simulations More accurate and faster compliance with the optimal setting difficult to manage Transmitter Network simulator Receiver Network information Optimal parameters

3. Using periodic observable dynamic values Determine the parameters Using periodic observable dynamic values of the transfer Searching the database that indicates the optimal values The database may be constructed from real observation or from simulation accuracy and lightweight difficult to construct the database DB Network information Optimal parameters

Conclusion Propose a method of coordinated transfer that determines parameters statically by simple model Validate the model by simulating intercluster data transfer The model is improved insufficient Propose an efficient data transfer method that adjusts parameters dynamically

Future work Conduct experiments on real environment to verify the validity of our simulation Develop a coordinated data transfer tool that tunes a parameters dynamically

end