Monitoring and Accelera/ng GridFTP

Similar documents
End- Site Control Plane Service (ESCaPeS) Monitoring with. Perisc pe. Mar8n Swany U. Delaware. (Indiana University effec8ve 8.1.

SDN Peering with XSP. Ezra Kissel Indiana University. Internet2 Joint Techs / TIP2013 January 2013

Monitoring in GENI with Periscope (and other related updates)

November 1 st 2010, Internet2 Fall Member Mee5ng Jason Zurawski Research Liaison

ANSE: Advanced Network Services for [LHC] Experiments

Extending dynamic Layer-2 services to campuses

LHC and LSST Use Cases

Zhengyang Liu! Oct 25, Supported by NSF Grant OCI

DYNES: DYnamic NEtwork System

Programmable Information Highway (with no Traffic Jams)

Traffic engineering and GridFTP log analysis. Jan 17, 2013 Project web site:

Abstract Topology and Cost Maps for So5ware- Defined Inter- Domain Circuits

Enabling a SuperFacility with Software Defined Networking

CMS Data Transfer Challenges and Experiences with 40G End Hosts

5 August 2010 Eric Boyd, Internet2 Deputy CTO

SLIDE 1 - COPYRIGHT 2015 ELEPHANT FLOWS IN THE ROOM: SCIENCEDMZ NATIONALLY DISTRIBUTED

Parallel File Systems. John White Lawrence Berkeley National Lab

Embedded Network Systems. Internet2 Technology Exchange 2018 October, 2018 Eric Boyd Ed Colone

Towards Network Awareness in LHC Computing

Zhengyang Liu University of Virginia. Oct 29, 2012

Arrakis: The Operating System is the Control Plane

Understanding Your Network Using NetSage

Securing Grid Data Transfer Services with Active Network Portals

OSiRIS. Project and participants overview Structural overview and site details Orchestration, monitoring and visualization Networking, NMAL, SDN

A GridFTP Transport Driver for Globus XIO

Analysis of CPU Pinning and Storage Configuration in 100 Gbps Network Data Transfer

Beyond Petascale. Roger Haskin Manager, Parallel File Systems IBM Almaden Research Center

Globus Online and HPSS. KEK, Tsukuba Japan October 16 20, 2017 Guangwei Che

Profiling Grid Data Transfer Protocols and Servers. George Kola, Tevfik Kosar and Miron Livny University of Wisconsin-Madison USA

Brent Sweeny GRNOC at Indiana University APAN 32 (Delhi), 25 August 2011

Globus XIO Compression Driver: Enabling On-the-fly Compression in GridFTP

DICE Diagnostic Service

Maelstrom: An Enterprise Continuity Protocol for Financial Datacenters

Engagement With Scientific Facilities

Evolution of OSCARS. Chin Guok, Network Engineer ESnet Network Engineering Group. Winter 2012 Internet2 Joint Techs. Baton Rouge, LA.

Internet2 Technology Update. Eric Boyd Deputy Technology Officer

Securing Grid Data Transfer Services with Active Network Portals

On using virtual circuits for GridFTP transfers

perfsonar: A Look Ahead Andrew Lake, ESnet Mark Feit, Internet2 October 16, 2017

Database Services at CERN with Oracle 10g RAC and ASM on Commodity HW

The Software Journey: from networks to visualization

Cisco Wide Area Application Services (WAAS) Mobile

Data Staging: Moving large amounts of data around, and moving it close to compute resources

Magellan Project. Jeff Broughton NERSC Systems Department Head October 7, 2009

Introduction to. Network Startup Resource Center. Partially adopted from materials by

Cognitive radio technology and GENI project

Internet2 DCN and Dynamic Circuit GOLEs. Eric Boyd Deputy Technology Officer Internet2 GLIF Catania March 5, 2009

HIDDEN SLIDE Summary These slides are meant to be used as is to give an upper level view of perfsonar for an audience that is not familiar with the

Introduction to Grid Computing

The Grid Architecture

Advanced Computer Networks. End Host Optimization

Network Virtualization Based on Flows

Distributed File System Support for Virtual Machines in Grid Computing

RDMA and Hardware Support

PIRE ExoGENI ENVRI preparation for Big Data science

Scaling Internet TV Content Delivery ALEX GUTARIN DIRECTOR OF ENGINEERING, NETFLIX

Distributing weather data via multipoint layer-2 paths using DYNES

CERN openlab Summer 2006: Networking Overview

International Big Science Coming to Your Campus Soon (Sooner Than You Think )

GridMonitor: Integration of Large Scale Facility Fabric Monitoring with Meta Data Service in Grid Environment

Performance assessment of CORBA for the transport of userplane data in future wideband radios. Piya Bhaskar Lockheed Martin

Data Staging: Moving large amounts of data around, and moving it close to compute resources

Mark Falco Oracle Coherence Development

Investigating the Use of Synchronized Clocks in TCP Congestion Control

ESnet s (100G) SDN Testbed

Vess A2000 Series NVR Storage Appliance

DiPerF: automated DIstributed PERformance testing Framework

Traffic Characteristics of Bulk Data Transfer using TCP/IP over Gigabit Ethernet

OPENFLOW & SOFTWARE DEFINED NETWORKING. Greg Ferro EtherealMind.com and PacketPushers.net

MPI History. MPI versions MPI-2 MPICH2

Building a Digital Bridge from Ohio

Pacific Wave: Building an SDN Exchange

Solutions for Scalable HPC

ASPERA HIGH-SPEED TRANSFER. Moving the world s data at maximum speed

Production Petascale Climate Data Replication at NCI Lustre and our engagement with the Earth Systems Grid Federation (ESGF)

BNL Dimitrios Katramatos Sushant Sharma Dantong Yu

Data transfer over the wide area network with a large round trip time

The OSiRIS Project: Collaborative Access to Data CNSECCS Symposium May 18, 2017

Building Security Services on top of SDN

TransPAC3- Asia US High Performance International Networking (Award # ) Quarterly Report 1-March-2014 through 31-May-2014

A Performance Evaluation of WS-MDS in the Globus Toolkit

NASPInet 2.0 The Evolu4on of Synchrophasor Networks

DevoFlow: Scaling Flow Management for High Performance Networks

A TRANSPORT PROTOCOL FOR DEDICATED END-TO-END CIRCUITS

Packet-Optical SDN Field Trial for Multi-Layer Network Optimization. Jim Theodoras May 2016

A Virtual Circuit Multicast Transport Protocol (VCMTP) for Scientific Data Distribution

A New Key-value Data Store For Heterogeneous Storage Architecture Intel APAC R&D Ltd.

2008 International ANSYS Conference

High-Performance Remote File I/O for the XT3 Nathan Stone Pittsburgh Supercomputing Center a.k.a. PITTSC

Networks & protocols research in Grid5000 DAS3

The Future of Interconnect Technology

NetFlow Configuration Guide

Functional Requirements for Grid Oriented Optical Networks

International Climate Network Working Group (ICNWG) Meeting

These circuits are used in production to support a wide variety of science applications and demonstrations of advanced networking technologies.

Wide-Area InfiniBand RDMA: Experimental Evaluation

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved.

Research Cyberinfrastructure Upgrade Proposal - CITI

STORAGE CONSOLIDATION WITH IP STORAGE. David Dale, NetApp

Software-Defined Networking (Continued)

Transcription:

Monitoring and Accelera/ng GridFTP Ezra Kissel & Mar/n Swany Indiana University Dan Gunter Lawrence Berkeley Na/onal Laboratory Jason Zurawski Internet2 GlobusWORLD 2013

Globus XIO Modular drivers that can be used within the Globus Toolkit GlobusXIO User API We have focused on extending features for GridFTP XSP for monitoring and path signaling Framework Driver Stack Transform Transform Transport Phoebus for WAN accelera/on Source: Globus XIO developer guide

XSP - extensible Session Protocol Session layer (Layer 5 in the OSI model) We can also think of a session in the most literal sense: a period of *me devoted to a par*cular ac*vity xio- xsp implemented as a Globus XIO driver Loadable with dcstack and fsstack Use cases: Monitoring with NetLogger, Calipers, and Periscope Dynamic network provisioning (e.g., OSCARS, OpenFlow) WAN Accelera/on with Phoebus Gateways

End- to- end measurement perspec/ve NetLogger instruments read and write system calls, and Calipers summarizes these in memory The XSP collector daemon collates and forwards to Periscope RESTful Measurement Store and Unified Network Informa/on Service (UNIS) BLiPP: Basic Lightweight Periscope Probes GridFTP Periscope XIO driver libxsp Calipers XSP Daemon perfsonar Network Monitoring XSP Daemon GridFTP XSP Daemon libxsp Periscope XIO driver Calipers Host / Disk TCP stats BLiPP WAN BLiPP Host / Disk TCP stats

Metadata { } 'eventtype': 'nl:tools:calipers:summary:write', 'id': '8d83ccd4-a708-11e2-b6fa-33d5436b6f2a', 'parameters': { 'collectioninterval': 5, 'datumschema': 'http://unis.incntre.iu.edu/schema/ 20120709/datum#'}, 'subject': { 'dport': '41186', 'dst': '127.0.0.1', 'sport': '60512', 'src': '127.0.0.1', 'stream_id': '60512:41186', 'task_id': 'xsp_test', 'type': 'network', 'xfer_id': '8d83a8a8-a708-11e2-b6fa-33d5436b6f2a'}

Data { } 'data': [ 1366166610.043317, 2, 5866638858, 262161, 262161, 262161, 0, 52064.35928057134, 0.1114156396090098, 3.318493670886076, 2.32658679419838, 0.4392224221097757, 9945.430479743363, 0.3013415420295162, 8.975400612600653, 0.4444289248254251, 0.09869011618422469, 22378, 5.000102, 2.607303999999963], ts _sample sum_v min_v max_v mean_v sd_v sum_r min_r max_r mean_r sd_r sum_g min_g max_g mean_g sd_g count dur dur_inst 'mid': '8d83ccd4-a708-11e2-b6fa-33d5436b6f2a

Average Rate Mean (MB/s) instantaneous throughput (Mb/s) Log 10 scale 100000 10^5 disk read disk write (a) 10G TCP P1 MEMDISK 100_LAT Bottleneck = network 0 20 40 60 80 100 (c) 10G TCP P1 DISK NO_LAT 36GB disk nl.read.summary disk nl.write.summary network nl.read.summary network nl.write.summary network read (b) 10G TCP P1 MEM NO_LAT Bottleneck = network network write 0 20 40 60 80 100 120 (d) 10G TCP P4 MEMDISK 100_LAT 01_LOSS 10^5 100000 10^4 10000 10^3 1000 10^2 100 10^1 10000 10^4 10^3 1000 10^2 100 Bottleneck = disk read Bottleneck = disk write.01% loss event 10^1 0 100 200 300 TCP throughput Time (s) 0 20 40 60 80 100 Time series of throughput for representa/ve TCP experiments: (a) 1 stream memory- to- disk with 100ms latency, (b) 1 stream memory- to- memory with no latency, (c) 1 stream disk- to- disk with no latency, (d) 4 streams memory- to- disk with 100ms latency and 1% loss added at 60 seconds. 1. Ezra Kissel, Dan Gunter, et. al. Scalable Integrated Performance Analysis of MulA- Gigabit Networks. In 5 th Interna/onal Workshop on Distributed Autonomous Network Management Systems (DANMS), April 2012.

Total CPU time for all processors 0.01% loss System time User time Legend: Time (s) stdev[instantaneous throughput] (Mb/s) 50 100 150 200 0 20 40 60 80 100 Time(s) Number of successful reads per sample 200 400 600 800 0 20 40 60 80 100 Half as many read()s. Others return zero, not counted Variance Less work being done

Topology- aware monitoring

XSP and Dynamic Circuits Common interface for path provisioning ESnet OSCARS, Internet2 ION, OpenFlow, Linux end- hosts, OS 3 E/NDDI Prototype installa/on for DYNES DYNES is an NSF MRI that is distribu/ng storage and OSCARS IDCs for ION to various sites Internet2, Caltech, Vanderbilt, U. Michigan Recent tes/ng with mul/- path GridFTP at SC12 (Science Research Sandbox) 4/22/13 Internet2 Spring Member Mee/ng 2012 10

SC12 SRS experiments Mul/- path GridFTP transfers to DYNES end- sites xio- xsp and OpenFlow used to dynamically redirect parallel streams over circuit services Many lessons learned at SC 4 parallel streams

GridFTP+XSP using Phoebus Phoebus is an open source WAN accelerator funded by the DOE and now the NSF Phoebus uses XSP to communicate via gateways that can tune, adapt and translate protocols TCP tuning, UDP, RDMA over the WAN globus-url-copy -vb -p 4 -dcstack xsp:"xsp_hop=<host/5006>;xsp_net_path=<type>, phoebus:"phoebus_path=<gw1>/5006#<gw2>/5006 " ftp://<src host>:2811/dev/zero ftp://<dst host>:2811/dev/null

IU- Tokyo with and without Phoebus Single TCP stream (CUBIC), 185ms, well- connected end- hosts 13

Conclusion Quite a few topics focused on the performance of GridFTP Flexible and scalable monitoring for troubleshoo/ng Adapt performance using emerging network technologies and protocols Ongoing work: early prototype of xio- xsp for GlobusOnline (GCMU installs)

Thank you for your /me Thanks to our colleagues at IU, Internet2, ESnet Support: NSF OCI- 0910812, OCI- 1127349, and CNS- 105011, DOE DE- SC0001421 hlps://damsl.cs.indiana.edu/projects/phoebus Email: ezkissel@indiana.edu Ques/ons?