October 5 th 2010 NANOG 50 Jason Zurawski Internet2

Similar documents
HIDDEN SLIDE Summary These slides are meant to be used as is to give an upper level view of perfsonar for an audience that is not familiar with the

The new perfsonar: a global tool for global network monitoring

perfsonar Update Jason Zurawski Internet2 March 5, 2009 The 27th APAN Meeting, Kaohsiung, Taiwan

Con$nuous Integra$on Development Environment. Kovács Gábor

RAD, Rules, and Compatibility: What's Coming in Kuali Rice 2.0

globus online The Galaxy Project and Globus Online

CLOUD SERVICES. Cloud Value Assessment.

November 1 st 2010, Internet2 Fall Member Mee5ng Jason Zurawski Research Liaison

Introduction to. Network Startup Resource Center. Partially adopted from materials by

The perfsonar Project at 10 Years: Status and Trajectory

Best Practices and Pitfalls for Building Products out of OpenDaylight

perfsonar Deployment on ESnet

NASPInet 2.0 The Evolu4on of Synchrophasor Networks

Gigabit Broadband, Interconnec1on proposi1ons, and the Challenge of Managing Expecta1ons

Today s Objec4ves. Data Center. Virtualiza4on Cloud Compu4ng Amazon Web Services. What did you think? 10/23/17. Oct 23, 2017 Sprenkle - CSCI325

Oracle VM Workshop Applica>on Driven Virtualiza>on

Monitoring & Analy.cs Working Group Ini.a.ve PoC Setup & Guidelines

Distributed Systems INF Michael Welzl

Direc>ons in Distributed Compu>ng

ESnet s (100G) SDN Testbed

DICE Diagnostic Service

Shepherd s Presentation Draft Policy Allocation of IPv4 and IPv6 Address Space to Out-of-region Requestors

WLCG Network Throughput WG

Elas%c Load Balancing, Amazon CloudWatch, and Auto Scaling Sco) Linder

perfsonar psui in a multi-domain federated environment

DICE Network Diagnostic Services

Composite Compliance: Demonstra1ng Suitability of Cloud Layering for Sensi1ve and Regulated Workloads

Crea%ng and U%lizing Linked Open Sta%s%cal Data for the Development of Advanced Analy%cs Services E. Kalampokis, A. Karamanou, A. Nikolov, P.

perfsonar: A Look Ahead Andrew Lake, ESnet Mark Feit, Internet2 October 16, 2017

An introduc/on to Sir0i

Measuring and Managing Power Consump2on

CoG: The NEW ESGF WEB USER INTERFACE

ThinManager and FactoryTalk View SE. John Ter8n; ESE, Inc.

REsources linkage for E-scIence - RENKEI -

IRODS USER GROUP 2014 CAMBRIDGE,MA John Burns. 6/25/14 Archive Analy3cs Solu3ons 1

Cyber Security and Power System Communica4ons Essen4al Parts of a Smart Grid Infrastructure. Talal El Awar

perfsonar Going Forward Eric Boyd, Internet2 Internet2 Technology Exchange September 27 th 2016

CISC327 - So*ware Quality Assurance

Monitoring IPv6 Content Accessibility and Reachability. Contact: R. Guerin University of Pennsylvania

Infrastructure Analy=cs: Driving Outcomes through Prac=cal Uses and Applied Data Science at Cisco

How Cloud is working as a disruptor to shake up middleware design EVOLVE OR DIE! Billy Newport

ProAc&ve Rou&ng In Scalable Data Centers with PARIS

Overcoming the Barriers of Graphs on GPUs: Delivering Graph Analy;cs 100X Faster and 40X Cheaper

Developing an IDE4L Grid

Senate Subcommi-ee on Flooding & Evacua5ons. Presenta5on by Commission on State Emergency Communica5ons August 24, 2010 Houston, Texas

Connectivity Services, Autobahn and New Services

CAREER PATH FOR THE NEXT GENERATION RECORDS MANAGER

Preliminary ACTL-SLOW Design in the ACS and OPC-UA context. G. Tos? (19/04/2016)

November 2 nd 2010, Internet2 Fall Member Mee5ng Eric Boyd Deputy CTO

Virtualization. Introduction. Why we interested? 11/28/15. Virtualiza5on provide an abstract environment to run applica5ons.

Abstrac(ons for Middleboxes. à StonyBrook

TECHNICAL SESSIONS OFFICE DEVELOPMENT

EGEE (JRA4) Loukik Kudarimoti DANTE. RIPE 51, Amsterdam, October 12 th, 2005 Enabling Grids for E-sciencE.

Autonomous Threat Hun?ng With Niddel And Splunk Enterprise Security: Mars Inc. Customer Case Study

John Hicks Internet2 - Network Research Engineer perfsonar

Symantec Data Loss Preven2on 12.5 Demo Presenta2on

csc444h: so(ware engineering I matt medland

IPv6 in the DREN III acquisi4on. Ron Broersma DREN Chief Engineer

Best Prac:ces + New Feature Overview for the Latest Version of Splunk Deployment Server

Introduction to iscsi

COSC 310: So*ware Engineering. Dr. Bowen Hui University of Bri>sh Columbia Okanagan

New World BGP. Geoff Huston January2010 APNIC

Top five Docker performance tips

Architectures, and Protocol Design Issues for Mobile Social Networks: A Survey

A Step Towards Automated Event Diagnosis Stanford Linear Accelerator Center. Adnan Iqbal, Yee-Ting Li, Les Cottrell Connie A. Log.

October 3, 2011 ADVANCED NETWORK SERVICES TODAY

Business Case Components

Barry D. Lamkin Executive IT Specialist Capitalware's MQ Technical Conference v

@ COUCHBASE CONNECT. Using Couchbase. By: Carleton Miyamoto, Michael Kehoe Version: 1.1w LinkedIn Corpora3on

Philippe Laurens, Michigan State University, for USATLAS. Atlas Great Lakes Tier 2 collocated at MSU and the University of Michigan

Tightly Integrated: Mike Cormier Bill Thackrey. Achieving Fast Time to Value with Splunk. Managing Directors Splunk Architects Concanon LLC

Turning the Network NANOG 70. Andrew Gray IP Engineer IV Cox Communica7ons (AS22773)

Embracing Failure. Fault Injec,on and Service Resilience at Ne6lix. Josh Evans Director of Opera,ons Engineering, Ne6lix

Cisco SAN Analytics and SAN Telemetry Streaming

Storwize in IT Environments Market Overview

Distributed Systems. Communica3on and models. Rik Sarkar Spring University of Edinburgh

TPP On The Cloud. Joe Slagel

Introduction to perfsonar. RIPE SEE5, Tirana, Albania Szymon Trocha Poznań Supercomputing and Networking Center, Poland April 2016

Cloud Adop)on, Risks & Security & GDPR An Ac)on Guide

Understanding Opera.onal Rou.ng (part II) Geoffrey Xie Naval Postgraduate School

DNSSEC Activities In North America: Comcast

Kaseya Fundamentals Workshop DAY TWO. Developed by Kaseya University. Powered by IT Scholars

Web applica*on security for dynamic

Toward Wireless Clinical Monitoring

CS 152 Computer Architecture and Engineering CS252 Graduate Computer Architecture. Lecture 9 Virtual Memory

Peraso Corporate Presenta5on CES 2017

MapReduce. Cloud Computing COMP / ECPE 293A

Modifying an Exis.ng Commercial Product for Cryptographic Module Evalua.on

PhD in Computer And Control Engineering XXVII cycle. Torino February 27th, 2015.

CONTAINERIZING JOBS ON THE ACCRE CLUSTER WITH SINGULARITY

Architecture of So-ware Systems Massively Distributed Architectures Reliability, Failover and failures. Mar>n Rehák

Performance Update. Jeff Boote Senior Network Software Engineer Internet2 Martin Swany Assistant Professor University of Delaware

Overview of XSEDE for HPC Users Victor Hazlewood XSEDE Deputy Director of Operations

Kaseya Fundamentals Workshop DAY FOUR. Developed by Kaseya University. Powered by IT Scholars

The balancing act: Research vs. Produc7on. Kobus Van der Merwe

Distributed Systems. Communica3on and models. Rik Sarkar 2015/2016. University of Edinburgh

Assessing Medical Device. Cyber Risks in a Healthcare. Environment

From Continuous Integration To Continuous Delivery With Jenkins

5 August 2010 Eric Boyd, Internet2 Deputy CTO

Embedded Network Systems. Internet2 Technology Exchange 2018 October, 2018 Eric Boyd Ed Colone

Transcription:

October 5 th 2010 NANOG 50 Jason Zurawski Internet2

Overview Introduc=on and Mo=va=on Design & Func=onality Current Deployment Footprint Conclusion 2 10/5/10, 2010 Internet2

Why Worry About Network Performance? Why does it mamer? Flows geong larger (e.g. Science datasets in the R&E world) Special requirements (e.g. Streaming media is sensi=ve to jimer) Number of users/devices is increasing Everything is cross domain Where are the problems? Network Core? Everything is well connected, well provisioned, and flawlessly configured, RIGHT? End Systems? Properly tuned for op=mal TCP performance (no mamer the opera=ng system), proper drivers installed and func=oning op=mally, RIGHT? Performance Debugging = Difficult Problem Which Tools to Use? Availability and Loca=on of Tools Facilitate the Sharing of Results could be useful to others 3 10/5/10, 2010 Internet2

Why Worry About Network Performance? Most network design lends itself to the introduc=on of flaws: Heterogeneous equipment Cost factors heavily into design e.g. Get what you pay for Design heavily favors protec3on and availability over performance Communica=on protocols are not advancing as fast as networks TCP/IP is s=ll the king of the protocol stack Guarantees reliable transfers Adjusts to failures in the network Adjusts speed to be fair for all End Systems Good enough for web and email? User Expecta=ons The Network is Slow/Broken Empower users to be more informed/more helpful 4 10/5/10, 2010 Internet2

Underlying Assump=on When problems exist, it s the networks fault! Easy to blame a resource, but where is the real problem? Host (Disk, CPU, Kernel, NIC Drivers) Network Interface Cards (Host and Network Device) Routers/Switches Fabric, Rou=ng Algorithms, Configura=ons Physical Infrastructure (Fibers, Connectors) Protocols The network is viewed as a single resource in many cases Reality complex series of components Mul=ple vendors/technologies Mul=ple configura=on op=ons Crossing administra=ve domains 5 10/5/10, 2010 Internet2

Network View (Layman s Terms) Bob s Host The Internets Carol s Host 6 10/5/10, 2010 Internet2

Network View (Actual) Bob s Host Switch 1 Other hosts Other hosts Switch 2 Switch 3 R5 R4 R8 R1 R3 R6 R2 R7 Switch 4 Other hosts R9 Carol s Host 7 10/5/10, 2010 Internet2

A New Way Forward Encourage use of exis=ng tools Don t need new tools, lots of fine measurement sopware already Work with what others are using, because it probably works Facilitate use of remote tools and sharing of resources Single Step Deployment Set it and forget it! Ease of installa=on = wider deployment Debugging problems end to end Instrumenta=on and monitoring on all parts of the path Data views that correlate results automa=cally, suggest solu=ons 8 10/5/10, 2010 Internet2

Possible Solu=ons Finding a solu=on to network performance problems can be broken into two dis=nct steps: Use of Diagnos?c Tools to locate problems Tools that ac=vely measure performance (e.g. Latency, Available Bandwidth) Tools that passively observe performance (e.g. error counters) Regular Monitoring to establish performance baselines and alert when expecta=on drops. Using diagnos=c tools in a structured manner Visualiza=ons and alarms to analyze the collected data Incorpora=on of either of these techniques must be: ubiquitous, e.g. the solu=on works best when it is available everywhere seamless (e.g. federated) in presen=ng informa=on from different resources and domains 9 10/5/10, 2010 Internet2

Everyone should participate in the monitoring infrastructure user performance GUI measurement archive measurement archive Analysis tool measurement archive m1 m3 m4 measurement archive m1 m3 m4 measurement archive m1 m3 m4 Internet2 (AS11537) [US] m1 m4 ATT (AS7132) [US] m1 m3 m4 CNN (AS5662) [US] m3 10 10/5/10, 2010 Internet2 MERIT (AS237) [US] ATDN (AS1668) [US] 10"

Concept is still possible with partial information user performance GUI measurement archive measurement archive Analysis tool measurement archive m1 m3 m4 X m1 m3 m4 measurement archive m1 m3 m4 Internet2 (AS11537) [US] m1 m4 ATT (AS7132) [US] m1 m3 m4 CNN (AS5662) [US] m3 11 10/5/10, 2010 Internet2 MERIT (AS237) [US] ATDN (AS1668) [US] 11"

Possible Solu=ons Desirable design features for any solu=on Component Based Func=onality should be split into logical units Each func=on (e.g. visualiza=on) should func=on through well defined communica=on with other components (e.g. data storage) Modular Monolithic designs rarely work Components allow choice of how to operate a customized end solu=on. Accessible Well defined interfaces (e.g. APIs) Easily installed and configured Ini=al design should facilitate future expansion 12 10/5/10, 2010 Internet2

What is perfsonar? A collabora=on Produc=on network operators focused on designing and building tools that they will deploy and use on their networks to provide monitoring and diagnos=c capabili=es to themselves and their user communi=es. An architecture & set of communica=on protocols Web Services (WS) Architecture Protocols established in the Open Grid Forum Network Measurement Working Group (NM- WG) Network Measurement Control Working Group (NMC- WG) Network Markup Language Working Group (NML- WG) Several interoperable sopware implementa=ons perfsonar- MDM perfsonar- PS A Deployed Measurement infrastructure 13 10/5/10, 2010 Internet2

perfsonar Incep=on perfsonar originated from discussions between Internet2 s End- to- End Performance Ini=a=ve (E2Epi), and the Géant2 project in September 2004. Members of the OGF s (then GGF) NM- WG provided guidance on the encoding of network measurement data. Addi=onal network partners, including ESnet and RNP provided development resources and served as early adopters. The first release of perfsonar branded sopware was available in July 2006 (Java based sopware). All perfsonar branded sopware is open source All products looking to be labeled as perfsonar compliant must establish protocol compliance based on the public standards of the OGF 14 10/5/10, 2010 Internet2

perfsonar Architecture Overview Interoperable network measurement middleware designed as a Service Oriented Architecture (SOA): Each component is modular All are Web Services (WS) based The global perfsonar framework as well as individual deployments are decentralized All perfsonar tools are Locally controlled All perfsonar tools are capable of federa=ng locally and globally perfsonar Integrates: Network measurement tools and archives (e.g. stored measurement results) Data manipula=on Informa=on Services Discovery Topology Authen=ca=on and authoriza=on 15 10/5/10, 2010 Internet2

perfsonar Architecture Overview The key concept of perfsonar is that each en=ty (e.g. services ) performs a func=on Each service provides a limited set of func=onality Collec=ng measurements between arbitrary points Managing the registra=on and loca=on of distributed services Services interact through exchanges Standardized message formats and protocols A collec=on of perfsonar services within a domain is a deployment Deploying perfsonar can be done À la carte, or through a complete solu=on Services federate with each other, locally and globally Services are designed to automa=cally discover the presence of other perfsonar components Clients are designed with this distributed paradigm in mind 16 10/5/10, 2010 Internet2

Example perfsonar Use Case perfsonar should be used to diagnose an end- to- end performance problem User is amemp=ng to download a remote resource Resource and user are separated by distance Both are assumed to be connected to high speed networks Opera=on does not go as planned, where to start? 17 10/5/10, 2010 Internet2

Example perfsonar Use Case Simple tools like traceroute can be used to determine the path traveled There could be a performance problem anywhere in here The problem may be something we could fix, but the chances are greater that it is not 18 10/5/10, 2010 Internet2

Example perfsonar Use Case Each segment of the path is controlled by a different domain. Each domain will have network staff that could help fix the problem, but how to contact them? All we really want is some informa=on regarding performance 19 10/5/10, 2010 Internet2

Example perfsonar Use Case Each domain has made measurement data available via perfsonar The user was able to discover this automa=cally Automated tools such as visualiza=ons and analyzers can be powered by this network data 20 10/5/10, 2010 Internet2

Example perfsonar Use Case In the end, the problem is isolated based on tes=ng. May have gone unno=ced in some cases (e.g. a sop failure ) Could have been observed by many others that didn t think to report it The user (or opera=ons staff) can contact the domain in ques=on to inquire about this performance problem When fixed the transfer should progress as intended 21 10/5/10, 2010 Internet2

Overview Introduc=on and Mo=va=on Design & Func=onality Current Deployment Footprint Conclusion 22 10/5/10, 2010 Internet2

Complete Network Monitoring Solu=on Wide Array of Tools Hit some major metrics (network u=liza=on, latency, bandwidth) Use what others are using (Cac=, ping, iperf) Installa=on One Step package management or all in one appliance Linux Live CD Popular way to try out before commiong to a distro Func=onality One off diagnos=c tes=ng Regular monitoring Aler=ng capability Visualiza=ons 23 10/5/10, 2010 Internet2

ps Performance Toolkit The ps stands for perfsonar hmp://psps.perfsonar.net/toolkit Current Version: 3.2 CentOS 5.5 Based Previous genera=ons: based on Knoppix Available Tools: Latency: PingER/OWAMP/Reverse Traceroute Bandwidth: BWCTL (Iperf/NuMcp), NDT, NPAD Passive Measurements: (Cac=) Purpose: Install in any free machine Diagnos=c = non destruc=ve to host Regular Monitoring = store results on host Live vs Disk Based Installs 24 10/5/10, 2010 Internet2

Support Structure Open Source Development Team comprised of R&E Network operators, Universi=es, Federal Labs All source code publically available though anonymous SVN Open Source Support Mailing list(s) Public bug tracking Upgrade Path Burn a new CD Use of Package Managers (YUM) Migra=on tools to backup historical measurement data 25 10/5/10, 2010 Internet2

ps- Performance Toolkit On Deck 6-12 Month =me horizon New Tools (Nagios, Ganglia) in the works Improved GUIs for data analysis Integrated data views Network health views Alert infrastructure for services and data No=fy if bad results are seen Improved support for mul=layer measurement correla=on Authen=ca=on and Authoriza=on Protect access to tools and data Beyond Web interface for managing groups of ps- Performance Nodes More diagnos=c tools and archives Integra=on of outside research and produc=on tools 26 10/5/10, 2010 Internet2

ps Performance Toolkit Web Interface 27 10/5/10, 2010 Internet2

ps Performance Toolkit Scheduling Interface 28 10/5/10, 2010 Internet2

ps Performance Toolkit Data View 29 10/5/10, 2010 Internet2

ps Performance Toolkit Graphing Results 30 10/5/10, 2010 Internet2

ps Performance Toolkit Graphing Results 31 10/5/10, 2010 Internet2

ps Performance Toolkit Cac= Integra=on 32 10/5/10, 2010 Internet2

ps Performance Toolkit Reverse Traceroute 33 10/5/10, 2010 Internet2

Overview Introduc=on and Mo=va=on Design Current Deployment Footprint Conclusion 34 10/5/10, 2010 Internet2

Current Deployment Footprint perfsonar is gaining trac=on as an interoperable and extensible monitoring solu=on Adop=on has progressed in the following areas: R&E networks including backbone, regional, and exchange points Universi=es on a na=onal and interna=onal basis Federal labs and agencies in the United States (e.g. JET nets) Scien=fic Virtual Organiza=ons, notably the LHC project Recent interest has also accrued from: Interna=onal R&E network partners and exchange points Commercial Providers in the United States Hardware manufactures Live ps Status: Services: hmp://www.perfsonar.net/ac=veservices Loca=ons: hmp://www.perfsonar.net/ac=veservices/is 35 10/5/10, 2010 Internet2

Deployment Footprint - Targets Commercial ISPs Success with measurement projects like MLab Would like to see wider adop=on of toolkit as a test playorm Internal performance tes=ng Tes=ng from the home router to an aggrega=on point For Profit Companies Major content providers Use on VPNs R&E/Federal Space Con=nued adop=on for Campuses/Backbones/Labs Monitoring of commercial peering as well as R&E (with the help of the commercial provider) 36 10/5/10, 2010 Internet2

Overview Introduc=on and Mo=va=on Design Current Deployment Footprint Conclusion 37 10/5/10, 2010 Internet2

Conclusion Performance Problems Exist Networks expanding, growing in size and capability Number of users and end devices growing Solving Problems Takes Time Tools also Exist to Help Deployment of a ps Performance Toolkit is simple Availability of Diagnos=cs and Regular Monitoring Upgrade path = burn a new CD Open Source Growing Deployments R&E Networks Inroads into Commercial Ventures 38 10/5/10, 2010 Internet2

For more informa=on General informa=on: hmp://www.perfsonar.net More about what you heard today: hmp://psps.perfsonar.net/toolkit perfsonar- PS tools and sopware: hmp://sopware.internet2.edu A hook to the global lookup service: hmp://www.perfsonar.net/ac=veservices/is/ More human- readable list of services: hmp://www.perfsonar.net/ac=veservices/ 39 10/5/10, 2010 Internet2

Mailing Lists Development (by approval of the project) hmps://lists.internet2.edu/sympa/subscribe/perfsonar- dev User Support hmps://lists.internet2.edu/sympa/subscribe/perfsonar- ps- users hmps://lists.internet2.edu/sympa/subscribe/performance- node- users Announcements hmps://lists.internet2.edu/sympa/subscribe/perfsonar- ps- announce hmps://lists.internet2.edu/sympa/subscribe/performance- node- announce Working Groups hmps://lists.internet2.edu/sympa/subscribe/performance- wg hmps://lists.internet2.edu/sympa/subscribe/is- wg hmp://www.ogf.org/mailman/lis=nfo/nm- wg hmp://www.ogf.org/mailman/lis=nfo/nmc- wg hmp://www.ogf.org/mailman/lis=nfo/nml- wg 40 10/5/10, 2010 Internet2

Thanks! Ques=ons or Comments? zurawski@internet2.edu 41 10/5/10, 2010 Internet2

October 5 th 2010, NANOG 50 Jason Zurawski Internet2 For more informa=on, visit hmp://psps.perfsonar.net/toolkit 42 10/5/10, 2010 Internet2