GRID monitoring with NetSaint

Similar documents
Monitoring a HPC Cluster with Nagios

The European DataGRID Production Testbed

CHAPTER 3 GRID MONITORING AND RESOURCE SELECTION

The EU DataGrid Fabric Management

op5 Monitor 4.1 Manual

MONITORING OF GRID RESOURCES

Auto-Notify Setup - Begin from Contacts (for Contacts/Searches Copied from Tempo/Fusion)

Graphing and statistics with Cacti. AfNOG 11, Kigali/Rwanda

The Art of Container Monitoring. Derek Chen

High bandwidth, Long distance. Where is my throughput? Robin Tasker CCLRC, Daresbury Laboratory, UK

MONITOR YOUR ENTIRE IT INFRASTRUCTURE WITH NAGIOS XI

SYSLOG and SUPERVISOR S WORKSHOP Knowledge Module for PATROL - Data Sheet Version Made by AXIVIA Conseil

Feedback:

Automatic Customer Group Switching Magento Extension

HPE Operations Agent. Concepts Guide. Software Version: For the Windows, HP-UX, Linux, Solaris, and AIX operating systems

Nagios XI Host and Service Details Overview

Learning Nagios 4. Wojciech Kocjan. Chapter No.1 "Introducing Nagios"

Job-Oriented Monitoring of Clusters

The Cacti Graphing Solution

Manage Contact Center, Agent Settings, and Scheduled Reports

Setting up the Collaboration Center for a Client

ElasterStack 3.2 User Administration Guide - Advanced Zone

Understanding StoRM: from introduction to internals

Centerity Monitor Standard V3.8.4 USER GUIDE VERSION 2.15

Multi Mediation 18 Training Programs. Catalog of Course Descriptions

A self-configuring control system for storage and computing departments at INFN-CNAF Tierl

Configuring Communication Services

Linux Clusters Institute: Monitoring

Performance Monitor Administrative Options

VM Power Prediction in Distributed Systems for Maximizing Renewable Energy Usage

Important DevOps Technologies (3+2+3days) for Deployment

Rhapsody Interface Management and Administration

VMware vrealize Log Insight Getting Started Guide

Network Management & Monitoring Overview

McAfee Network Security Platform 8.3

Centerity Monitor 4.0. Administration Guide

A Survey on Open Source Tools - for Server Monitoring using SNMP

Performance Monitoring and SiteScope

TECHNICAL DESCRIPTION

CrossGrid testbed status

RRDTool: A Round Robin Database for Network Monitoring

COSC 301 Network Management

TELE 301 Network Management

Configuring Network-based IDS and IPS Devices

VMware vsphere. Administration VMware Inc. All rights reserved

Stager. A Web Based Application for Presenting Network Statistics. Arne Øslebø

R5: Configuring Windows Server 2008 R2 Network Infrastructure

& ESCALA NOVASCALE BSM 1.2. User's Guide REFERENCE 86 A2 55FA 02

Network and Server Statistics Using Cacti

New Features in ehealth Release 5.7

ISTITUTO NAZIONALE DI FISICA NUCLEARE

Get Success in Passing Your Certification Exam at first attempt!

Toward Scalable Monitoring on Large-Scale Storage for Software Defined Cyberinfrastructure

Monitoring (and) IPv6

The power of centralized computing at your fingertips

Consumer Broadband Monitoring: A Proof-of-Concept

Acronis Monitoring Service

MONitoring Agents using a Large Integrated Services Architecture. Iosif Legrand California Institute of Technology

FastTrack to Red Hat Linux System Administrator Course Overview

SPINOSO Vincenzo. Optimization of the job submission and data access in a LHC Tier2

Architecture Proposal

You can administer Policy and Distribution Services by using the following:

VMWARE VREALIZE OPERATIONS MANAGEMENT PACK FOR. Nagios. User Guide

Portnox CORE. On-Premise. Technology Introduction AT A GLANCE. Solution Overview

Systemwalker Service Quality Coordinator. Technical Guide. Windows/Solaris/Linux

The EU DataGrid Testbed

Product Inquiry for Magento 2.X

Systemwalker Service Quality Coordinator. Technical Guide. Windows/Solaris/Linux

McAfee Red and Greyscale

Lanka Education and Research Network. Network Monitoring LEARN. 28 th November IT Center, University of Peradeniya Dilum Samarasinhe (LEARN)

HPC Metrics in OSCAR based on Ganglia

Network Management & Monitoring Overview

Utilizing Slurm and Passive Nagios Plugins for Scalable KNL Compute Node Monitoring

IBM Security QRadar Deployment Intelligence app IBM

SysAidTM. Monitoring Guide

IBM Tivoli Netcool Service Quality Manager V4.1.1

CMS HLT production using Grid tools

2 Click RoomWizard Setup.

<Insert Picture Here> Lustre Development

Application Guide. Connection Broker. Advanced Connection and Capacity Management For Hybrid Clouds

Network and Server Statistics using Cacti

Pinnacle3 Professional

BlackBerry Enterprise Server for Microsoft Office 365. Version: 1.0. Administration Guide

How to use Genetec Omnicast with Hikvision devices V1.0.0 ( )

Overview. SUSE OpenStack Cloud Monitoring

WhatsUp Gold. Evaluation Guide

Network Monitoring & Management Using Cacti

KRISHNA KANTA HANDIQUI STATE OPEN UNIVERSITY Hiranya Kumar Bhuyan School of Science and Technology

GridMonitor: Integration of Large Scale Facility Fabric Monitoring with Meta Data Service in Grid Environment

Network and Server Statistics using Cacti

Migration and Building of Data Centers in IBM SoftLayer

OPMANTEK NETWORK MANAGEMENT AND IT AUDIT SOFTWARE. Troubleshooting Open-AudIT Discoveries v1 January 2019

Report Submission User s Manual

Using Trend Reports. Understanding Reporting Options CHAPTER

MUM - Beirut, Lebanon - June 14 th 2016

SSIM Collection & Archiving Infrastructure Scaling & Performance Tuning Guide

B2B Wholesale Private Shop

Plugin Monitoring for GLPI

QRM+ Tutorials. QNAP s Remote Server Management Solution. rev

Application Level Protocols

Transcription:

GRID monitoring with NetSaint Roberto Barbera [barbera@ct.infn.it] Paolo Lo Re [lore@na.infn.it] Giuseppe Sava [sava@ct.infn.it] Gennaro Tortone [tortone@na.infn.it] Bologna - Datagrid WP7 meeting January 2002

Index NetSaint overview Role of NetSaint in GRID monitoring Interesting features of NetSaint for GRID monitoring Addons developed by INFNGRID WP7 group Conclusions A short demo on INFN implementation

NetSaint overview 1/2 NetSaint (www.netsaint.org) is a network monitoring tool developed by Ethan Galstad and designed to run under Linux. Some of its features include: simple plugins design that allows users to easily develop their own service checks monitoring of network services (FTP, HTTP, SSH, ) monitoring of host resources (CPU load, disk usage, ) ability to define network host (or device) hierarchy using parent host, allowing detection and distinction between host that are down and those that are unreachable distributed monitoring: a central NetSaint server obtains check results from one or more NetSaint distributed servers

NetSaint overview 2/2 contact notifications when service or host problems occour (via email or user defined method) ability to define event handlers to be run during service or host events for proactive problem resolution logging mechanism and automatic log-file rotation optional plugins to send SNMP queries to host or network devices (router, switches, ); web interface for view current network status, notifications and problem history, logfile,

Role of NetSaint in GRID monitoring (from Requirement of network monitoring for the GRID - by Robin Tasker) Immediate network monitoring: a single view/access point of the available tools needs to be produced to allow a GRID user access to determine the "health" of the network. Such a snapshot of the network will likely include route information between specified end points; the characterisation of the network using, for example, pathchar; and the means of measuring throughput... The pre-testbed sites are encouraged to develop this concept to demonstrate capability and to allow WP7 to further refine the ideas based upon their experience and input from the users of these products. The idea is to use NetSaint: to view a snapshot of the GRID/Testbed resources status, services availability and network measurements to receive notifications on host or service faults to view graphs of resource monitoring results or network measurements

NetSaint is the official choice of INFN Testbed Technical Board for monitoring of INFN Testbed 1 Presently a NetSaint server is installed in Catania and checks approximately 120 services on 35 hosts http://infngrid.ct.infn.it (user: infn-tb - pass: guest)

Interesting features of NetSaint for GRID monitoring 1/3 notifications: it s possible to define a group of users (site admins) to notify when a service (or host) is in critical state event handlers: they are optional commands that are executed whenever a host or service state change occours; an obvious use of event handlers is the ability for NetSaint to proactively fix problems before anyone is notified; another use is to log service or host events to an external database; plugin architecture: NetSaint does not include any internal mechanism to check the status of services (or hosts); instead, NetSaint relies on external programs (plugins) to do all the monitoring activity; this feature allows users to easily develop their own service checks;

Interesting features of NetSaint for GRID monitoring 2/3 remote service checks - NRPEP addon: this addon is designed to provide a way for executing plugins on a remote host. The check_nrpe plugin runs on the NetSaint server and is used to send plugin execution requests to the NRPEP agent on the remote host. The nrpe agent will then run an appropriate plugin on the remote host and return the plugin output and return code to the check_nrpe plugin on the NetSaint server. The check_nrpe plugin then passes the remote plugin's output and return code back to NetSaint as if it were its own. All data in transit are in TripleDES encription format; passive checks : NetSaint can process service check results that are submitted by remote hosts through a daemon that runs on the NetSaint server and a client that is executed on remote hosts;

Interesting features of NetSaint for GRID monitoring 3/3 distributed monitoring - scalability: a possible usage of NetSaint is to install one NetSaint sensor (in barebone configuration) for each site to collect monitoring results from resources and one main NetSaint collector (in full configuration) to collect groups of monitoring results from sensors; this feature shows the functionality overlap that exists between NetSaint distributed architecture and GIIS/MDS GRID architecture; NetSaint collector site A host monitoring results NetSaint sensor NetSaint sensor monitoring results site B host

Addons developed by INFNGRID WP7 group graphs of resources (or network) monitoring results: we have developed a wrapper that parses the output of a plugin execution and insert monitoring values into a RRD (Round Robin Database - www.rrdtool.org). An user, from NetSaint web interface, can view daily, weekly, monthly or yearly graphs for a selected resource LDAP based plugins: another thread of development activities is the implementation of plugins that pull information from a MDS server, instead than from resources

Conclusions Work in progress: integration of NetSaint layout with MDS information providers of Datagrid WP7 (edg-pinger, edg-iperf, edg-udpmon); the idea is to pull from the MDS (Ftree) server all the network measurements pushed by WP7 information providers and enable graphs and notifications on these attributes; development of LDAP based plugins for all resource monitoring parameters (CPU load, disk usage, ); RPM packaging of NetSaint distribution with all configuration files and plugins for automatic installation by LCFG WP4 tool; possible integration of MapCenter as frontend of NetSaint to display various views of GRIDs with a link (for each host) to NetSaint status page for that host; Ethan Galstad, the author of the program, has been already contacted and he is willing to introduce any change in the NetSaint core logic our developments should need; We are open for any kind of collaboration