How PERTs Can Help With Network Performance Issues

Similar documents
FEDERICA Federated E-infrastructure Dedicated to European Researchers Innovating in Computing network Architectures

Introduction to perfsonar. RIPE SEE5, Tirana, Albania Szymon Trocha Poznań Supercomputing and Networking Center, Poland April 2016

perfsonar Update Jason Zurawski Internet2 March 5, 2009 The 27th APAN Meeting, Kaohsiung, Taiwan

EGEE (JRA4) Loukik Kudarimoti DANTE. RIPE 51, Amsterdam, October 12 th, 2005 Enabling Grids for E-sciencE.

Network Virtualization for Future Internet Research

Federated E-infrastructure Dedicated to European Researchers Innovating in Computing network Architectures

Outline. Infrastructure and operations architecture. Operations. Services Monitoring and management tools

Ethernet Operation Administration and Maintenance Deliverable 2010

GÉANT2 Security: Year 1 (aka JRA2)

EGI-InSPIRE RI NGI_IBERGRID ROD. G. Borges et al. Ibergrid Operations Centre LIP IFCA CESGA

GÉANT Support for Research Within and Beyond Europe

The New Infrastructure Virtualization Paradigm, What Does it Mean for Campus?

Connect. Communicate. Collaborate. Click to edit Master title style. Using the perfsonar Visualisation Tools

IPv6 Deployment Survey. Based on responses from the RIPE community during June 2009 Maarten Botterman RIPE 59, Lisbon, 6 October 2009

GÉANT Mission and Services

e-infrastructure for Research and Education in Georgia

Presentation of the LHCONE Architecture document

WLCG Network Throughput WG

A collaboration overview: From TF-VSS to GN2 SA6

GÉANT: Supporting R&E Collaboration

SaaS Providers. ThousandEyes for. Summary

Multi Domain Service Architecture for Heterogonous Networks A view from GÉANT 3 - SA2: Task 1

Balancing the pressures of a healthcare SQL Server DBA

Introduction to FEDERICA

6Net WP6 final report

GÉANT : e-infrastructure connectivity for the data deluge

Connectivity Services, Autobahn and New Services

Funet Activities. GN3 Campus best practise kick-off meeting Trondheim May 27th, 2009

Big Computing and the Mitchell Institute for Fundamental Physics and Astronomy. David Toback

RESEARCH NETWORKS & THEIR ROLE IN e-infrastructures

ORIENT/ORIENTplus - Connecting Academic Networks in China and Europe. Jennifer(Jie) An CERNET Center/Tsinghua University 14 Feb.

ONOS: SDN gets real. The future of telco is software-defined; it s also open-source. Open Network Operating

CS193D Handout 15 Winter 2005/2006 February 8, 2006 Software Engineering Methodologies and XP. See also: Chapter 6. The Stagewise Model

EGEE II - Network Service Level Agreement (SLA) Implementation

EUMEDCONNECT3 and European R&E Developments

GÉANT and other projects Update

Note. Some History 8/8/2011. TECH 6 Approaches in Network Monitoring ip/f: A Novel Architecture for Programmable Network Visibility

Using Wireshark as an Applica1on Support Engineer Tim Poth. Senior Priority Response Analyst Bentley Systems, Inc.

ThousandEyes for. Application Delivery White Paper

LHC Open Network Environment. Artur Barczyk California Institute of Technology Baton Rouge, January 25 th, 2012

AutoBAHN Provisioning guaranteed capacity circuits across networks

DICE Network Diagnostic Services

WELCOME. to the 1 st online DG CONNECT NIPS Study workshop. July 25, 2013

IPv6 deployment, European Commission involvement. RIPE 60 Prague 4May Per Blixt

Maintenance Renewal Process for SolarWinds Partners STREAMLINING RENEWALS, ENSURING CUSTOMER SATISFACTION

Web Hosting. Important features to consider

Administrivia. Added 20 more so far. Software Process. Only one TA so far. CS169 Lecture 2. Start thinking about project proposal

GÉANT3 Services. Ann Harding, SWITCH TNC Connectivity and Monitoring Services by and for NRENs. connect communicate collaborate

Wide-Area Networking at SLAC. Warren Matthews and Les Cottrell (SCS Network Group) Presented at SLAC, April

DICE Diagnostic Service

WebRTC: Possible? Don McGregor Research Associate MOVES Institute.

HOME GOLE Status. David Foster CERN Hawaii GLIF January CERN - IT Department CH-1211 Genève 23 Switzerland

Options for Joining edugain. Lukas Hämmerle, SWITCH DARIAH Workshop, Köln 18 October 2013

D Demonstration of integrated BoD testing and validation

The European DataGRID Production Testbed

Fermilab WAN Performance Analysis Methodology. Wenji Wu, Phil DeMar, Matt Crawford ESCC/Internet2 Joint Techs July 23, 2008

6DEPLOY: IPv6 Deployment Support

GÉANT Community Programme

Advancing European R&E through collaboration

521262S Computer Networks 2 (fall 2007) Laboratory exercise #4: Multimedia, QoS and testing

RIPE. Réseaux IP Européens. Rob Blokzijl. RIPE Chairman. Rob Blokzijl. RIPE NCC, 7-9 December 2003, Dubai.

GÉANT Open Service Description. High Performance Interconnectivity to Support Advanced Research

APNIC Update. RIPE 59 October 2009

Ch. 13: Measuring Performance

GEANT testbed service (GTS) for R&E community Based on cloud technologies

GÉANT perspective of Virtual Networks and Implementation

Diagnosing the cause of poor application performance

Calcanas 1. Edgar Calcanas Dr. Narayanan CST March 2010 CST 412 Mid-Term. Workstations

DESIGN AS RISK MINIMIZATION

For those who might be worried about the down time during Lync Mobility deployment, No there is no down time required

EXAM - HP0-Y52. Applying HP FlexNetwork Fundamentals. Buy Full Product.

TERENA 2 nd School-NET Workshop

Internet data transfer record between CERN and California. Sylvain Ravot (Caltech) Paolo Moroni (CERN)

Creating an Intranet using Lotus Web Content Management. Part 2 Project Planning

Virtualization. A very short summary by Owen Synge

SAFARICOM MANAGED WIDE AREA NETWORK. Safaricom MWAN CUSTOMER SERVICE MANAGEMENT: OR

CERN network overview. Ryszard Jurga

Assignment #2. Csci4211 Spring Due on March 6th, Notes: There are five questions in this assignment. Each question has 10 points.

Marko Bonač, Tomi Dolenc Academic and Research Network of Slovenia

Developing Networking and Human Expertise in Support of International Science

TCP/IP Clearly Explained, Fourth Edition (The Morgan Kaufmann Series In Networking) Ebooks Free

The 6DEPLOY Project for IPv6 Training and Support for Deployments

Hands-On Activity. Firewall Simulation. Simulated Network. Firewall Simulation 3/19/2010. On Friday, February 26, we will be meeting in

Assignment #1. Csci4211 Spring Due on Feb. 13th, Notes: There are five questions in this assignment. Each question has 10 points.

Net Edu Romanian Education Network

Yealink VCS Network Deployment Solution

Manually Installing Windows Updates Server 2008 R2 On Vmware 8 >>>CLICK HERE<<<

TCP Performance Analysis Based on Packet Capture

TERENA, the NRENs, GÉANT & promoting Campus Best Practice

GN2 JRA5: Roaming and Authorisation

NORDUnet GN3. Next Generation Network in Europe. Click to edit Master subtitle style. Lars Fischer SUNET TREFFpunkt

2017 DNSSEC KSK Rollover. Guillermo Cicileo LACNIC March 22, 2017

SURFnet network developments 10th E-VLBI workshop 15 Nov Wouter Huisman SURFnet

MPA (Marker PDU Aligned Framing for TCP)

Deliverable D8.4 Certificate Transparency Log v2.0 Production Service

Tempora mutantur. Niall O Reilly - RIPE 50 - Stockholm, SE - May 2005

WebRTC video-conferencing facilities for research, educational and art societies

The perfsonar Project at 10 Years: Status and Trajectory

A large-scale International IPv6 Network. A large-scale International IPv6 Network.

Most underused MikroTik hardware and software features. The path between fastpath and advanced features. MUM, Mexico 2018

Transcription:

How PERTs Can Help With Network Performance Issues Simon Leinen (SWITCH) <simon@limmat.switch.ch> 2006 SWITCH

End-to-End Performance Issues Discrepancy between user expectations and perceived performance You tell me we have gigabit networks, yet I only get <low number> Mb/s? Networks span many administrative domains Finger pointing loops my network is fine, see [ping shows no loss] so it must be that other network our network is also fine [all lights are green!] so it must be...... is fine so it must be you! Performance results from multi-layer interactions Application and protocol design End-system issues (OS, I/O, other internal bottlenecks) Network path (delay, loss, reordering) The interactions are often difficult to understand! 2006 SWITCH 2

Approaches to Improve E2E Performance Training (users, programmers, admins, network eng/ops) Examples: Internet2 Performance Workshops Instrumentation (network & end-system stacks) Examples: RIPE TTM, PerfSONAR, Web100, NDT, NPAD Support infrastructure: PERT (Performance Enhancement/Response Team) 2006 SWITCH 3

PERT History 2001: Thought experiment in the Internet2 community 2003: Trial PERT (some European NRENs) 2004: GN2 Pilot PERT Since March 2005: GN2 PERT as production service Several individual NRENs have PERT groups and services emerging. 2006 SWITCH 4

GN2 PERT Organization GN2 is a (partly) EU-funded project that includes the GEANT2 network. The SA3 (Service Activity 3) work package on End-to-end Quality of Service includes PERT activities. Participants include DANTE and many NRENs: GARR (IT), RENATER (FR), RCCN (PT), SWITCH (CH), PSNC (PL), CESNET (CZ),... Case Managers are responsible for progress with open issues (cases): Duty Case Managers change weekly, handle new cases, track progress Special Case Managers adopt a specific case and handle it until closure Subject Matter Experts support the PERT with their expertise Mostly by commenting on the pert-discuss mailing list Support systems: PERT Ticket System (PTS) on https://www.pert.geant2.net/pert/ Mailing lists: pert-report@geant2.net, pert-discuss@geant2.net PERT Knowledge Base Wiki on http://kb.pert.switch.ch/ 2006 SWITCH 5

GN2 PERT Ticket System (PTS) Custom system developed for the needs of the GN2 PERT J2EE Web application with SQL back-end database Access through X.509 client certificate for Cms/SMEs Username/password administration for cust^h^h^h^husers Developed at PSNC (Poznań, PL) Operational since February 2005, PTSv2 deployed in August 2006 2006 SWITCH 6

GN2 PERT Ticket System (PTS) 2006 SWITCH 7

PERT Knowledge Base Tool for managing knowledge gathered by the PERT Currently implemented as a Wiki (open registration, anti-spam) http://kb.pert.switch.ch/ aka http://pace.geant2.net/cgibin/twiki/view/pertkb/webhome Should become useful resource for network users/admins Content from the PERT KB sometimes extracted to deliverables DS3.3.3 (August 2006) - 2006 SWITCH 8

PERT Knowledge Base 2006 SWITCH 9

Example Cases 2006 SWITCH 10

Downloads from microsoft.com are slow Customer claims they get 30-40 kb/s from download.microsoft.com Customer has a 100 Mb/s connection First tests from backbone: $ wget http://download.microsoft.com/.../dotnetfx.exe --13:44:16-- http://download.microsoft.com/.../dotnetfx.exe => `dotnetfx.exe' Resolving download.microsoft.com... 195.176.255.136, 195.176.255.135 Connecting to download.microsoft.com[195.176.255.136]:80... connected. HTTP request sent, awaiting response... 200 OK Length: 24,265,736 [application/octet-stream] 100%[====================================>] 24,265,736 4.49M/s 13:44:22 (3.54 MB/s) - `dotnetfx.exe' saved [24265736/24265736] 2006 SWITCH 11

Downloads from microsoft.com are slow Obviously the customer was doing something wrong here......think of possible problems before calling them. But let's do another quick test: $ wget http://download.microsoft.com/.../dotnetfx.exe --13:53:29-- http://download.microsoft.com/.../dotnetfx.exe => `dotnetfx.exe.1' Resolving download.microsoft.com... 212.162.0.30 Connecting to download.microsoft.com[212.162.0.30]:80... connected. HTTP request sent, awaiting response... 200 OK Length: 24,265,736 [application/x-msdownload] 12% [===> ] 2,968,483 7.37K/s ETA 51:32 2006 SWITCH 12

Downloads from microsoft.com are slow Oops... what happened? It turns out Microsoft uses not just Akamai for content distribution, but another layer of load distribution which sometimes selects Akamai, sometimes other servers. It doesn't always do a good job at choosing these servers, at least when throughput is concerned. Passed the explanation to the customer, with two suggestions: Complain to Microsoft as a customer (recommended, but not quick fix) Hack local DNS to hardwire download.microsoft.com to Akamai (evil hack) Customer seems to be happy. 2006 SWITCH 13

FNAL-DESY Throughput Limitation Fermilab-DESY (7100km GC) TCP transfers only reach 150 Mb/s DESY-CERN (860km GC) reaches >900 Mb/s Initial investigation focused on TCP settings... After a lot of measurement, we found a problem in the backbone: The London-Paris GEANT2 link was losing ~0.003% packets (SDH misconf.) Several months after that was fixed, another problem appeared: Underprovisioned LAN switch card at DESY What we learned: It is very hard to locate packet loss in the network We managed using custom ACLs and ACL counters takes a lot of coordination Tiny amounts of packet loss (as in: nobody notices for months) prevent TCP from going fast over an LFN (Long Fat Network) 2006 SWITCH 14

General PERT Issues Is this a good idea at all? Yes, I think so... The potential of the (high-speed parts of the) Internet isn't used well I believe that a shared network is a better substrate for research than dedicated networks for each researcher/project/... (the lightpath approach) It works well as a vehicle for learning/teaching about network performance But to be useful, the PERT has to improve Measurement infrastructure Promising developments (PerfSONAR for Research Networks, RIPE TTM etc.) Communication (PERT-customers, PERT-netadmins, intra-pert) Can this be applied in a competitive multi-provider world? Probably not all of it... Breaks the telco-like concepts (illusions?) of service levels etc. 2006 SWITCH 15

If you're interested in the PERT... Test us if you suspect problems with GEANT etc. Contribute to the PERT Knowledge Base wiki Provide feedback 2006 SWITCH 16