Information and monitoring

Similar documents
Client tools know everything

The ARC Information System

Introduction to Programming and Computing for Scientists

Introduction to Programming and Computing for Scientists

EGI-InSPIRE. ARC-CE IPv6 TESTBED. Barbara Krašovec, Jure Kranjc ARNES. EGI-InSPIRE RI

The Grid Monitor. Usage and installation manual. Oxana Smirnova

The NorduGrid/ARC Information System

PoS(EGICF12-EMITC2)074

Introduction to Programming and Computing for Scientists

Grid services. Enabling Grids for E-sciencE. Dusan Vudragovic Scientific Computing Laboratory Institute of Physics Belgrade, Serbia

Resource Allocation in computational Grids

ARC NOX AND THE ROADMAP TO THE UNIFIED EUROPEAN MIDDLEWARE

MONITORING OF GRID RESOURCES

Scientific data management

The NorduGrid Architecture and Middleware for Scientific Applications

glite Grid Services Overview

ARC-XWCH bridge: Running ARC jobs on the XtremWeb-CH volunteer

ARC middleware. The NorduGrid Collaboration

PoS(EGICF12-EMITC2)081

Architecture Proposal

QosCosGrid Middleware

Grid Experiment and Job Management

EGEE and Interoperation

g-eclipse A Framework for Accessing Grid Infrastructures Nicholas Loulloudes Trainer, University of Cyprus (loulloudes.n_at_cs.ucy.ac.

Interoperating AliEn and ARC for a distributed Tier1 in the Nordic countries.

30 Nov Dec Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy

NorduGrid Tutorial. Client Installation and Job Examples

Job Usage Reporter of ARC JURA

ARC Clients User Manual for ARC (client versions 1.0.0) and above

Exam : JN Title : Juniper Networks Certified Internet Assoc(JNCIA-SSL) Exam. Version : Demo

LDAP Configuration Guide

Obtaining the LDAP Search string (Distinguished Name)?

Xton Access Manager GETTING STARTED GUIDE

Lessons Learned in the NorduGrid Federation

Securing ArcGIS Services

Monitoring ARC services with GangliARC

LCG-2 and glite Architecture and components

Troubleshooting Grid authentication from the client side

ISBG May LDAP: It s Time. Gabriella Davis - Technical Director The Turtle Partnership

Grids and Security. Ian Neilson Grid Deployment Group CERN. TF-CSIRT London 27 Jan

European Grid Infrastructure

Usage statistics and usage patterns on the NorduGrid: Analyzing the logging information collected on one of the largest production Grids of the world

ArcGIS for Server: Administration and Security. Amr Wahba

LDAP Directory Integration

Ivane Javakhishvili Tbilisi State University High Energy Physics Institute HEPI TSU

Grid Computing Systems: A Survey and Taxonomy

Administering Your ArcGIS Enterprise Portal Bill Major Craig Cleveland

CLI users are not listed on the Cisco Prime Collaboration User Management page.

Grid Computing with NorduGrid-ARC

LDAP Directory Integration

The LHC Computing Grid

Java Development and Grid Computing with the Globus Toolkit Version 3

Setup Desktop Grids and Bridges. Tutorial. Robert Lovas, MTA SZTAKI

Regional SEE-GRID-SCI Training for Site Administrators Institute of Physics Belgrade March 5-6, 2009

Configuring a glideinwms factory

Installation of CMSSW in the Grid DESY Computing Seminar May 17th, 2010 Wolf Behrenhoff, Christoph Wissing

Active Directory Synchronisation

EUROPEAN MIDDLEWARE INITIATIVE

Cisco Next Generation Firewall Services

Grid Computing Fall 2005 Lecture 5: Grid Architecture and Globus. Gabrielle Allen

Configuring the Oracle Network Environment. Copyright 2009, Oracle. All rights reserved.

Managing External Identity Sources

Users. LDAP Synchronization Overview

ARC integration for CMS

WP3 Final Activity Report

Cisco Expressway Authenticating Accounts Using LDAP

NorduGrid Tutorial Exercises

CYAN SECURE WEB HOWTO. SSL Intercept

Molecular dynamics simulations in the MolDynGrid Virtual Laboratory by means of ARC between Grid and Cloud

Grid Interoperation and Regional Collaboration

Realms and Identity Policies

LCG User Registration & VO management

Cisco TelePresence Authenticating Cisco VCS Accounts Using LDAP

SAS Environment Manager A SAS Viya Administrator s Swiss Army Knife

Towards sustainability: An interoperability outline for a Regional ARC based infrastructure in the WLCG and EGEE infrastructures

Understanding ACS 5.4 Configuration

An LDAP server may implement its own schema or a standard schema defined as in RFC Mainstream implementations of LDAP include Netscape

CIOC API User Guide. Release Online Resources 3.7 / Client Tracker 3.2. Katherine Lambacher, KCL Software Solutions Inc.

Authenticating and Importing Users with AD and LDAP

Realms and Identity Policies

vrealize Operations Manager Customization and Administration Guide vrealize Operations Manager 6.4

Hortonworks Data Platform

Authenticating Cisco VCS accounts using LDAP

Enterprise Manager: Scalable Oracle Management

The CORAL Project. Dirk Düllmann for the CORAL team Open Grid Forum, Database Workshop Barcelona, 4 June 2008

Enterprise Workflow Resource Management

Novell Access Manager

A New Grid Manager for NorduGrid

Entrust. Discovery 2.4. Administration Guide. Document issue: 3.0. Date of issue: June 2014

DISTRIBUTED SYSTEMS [COMP9243] Lecture 9a: Naming WHAT IS NAMING? Name: Entity: Slide 3. Slide 1. Address: Identifier:

WMS overview and Proposal for Job Status

Pure Storage FlashArray Management Pack for VMware vrealize Operations Manager User Guide. (Version with Purity 4.9.

A Survey Paper on Grid Information Systems

Linux Operating System Environment Computadors Grau en Ciència i Enginyeria de Dades Q2

Identity Policies. Identity Policy Overview. Establishing User Identity through Active Authentication

Secure Web Appliance. SSL Intercept

Advanced Network and System Administration. Accounts and Namespaces

SchoolBooking LDAP Integration Guide

NetIQ Identity Manager Jobs Guide. February 2017

Authenticating and Importing Users with Active Directory and LDAP

Transcription:

Information and monitoring

Information is essential Application database Certificate Certificate Authorised users directory Certificate Certificate Grid tools Researcher Certificate Policies Information services Grid job management service Certificate Data Certificate Small but most important component: List of Grid services and their properties Necessary for service discovery and status monitoring Certificate Data Grid job management service Researcher Grid tools 2014 Oxana Smirnova, Dept. of Physics 114

Grid bookkeeping We now can submit jobs to different clusters, e.g.: > arcsub -c cluster1.lab.org myjob.xrsl > arcsub -c bigcluster.inst.org myjob.xrsl Doing this, we benefit from 2 Grid advantages: Common login using proxy credentials Common job description But how do we know that these clusters exist? Keeping a list on a piece of paper sounds silly Should Grid client keep such a list? Who should keep an eye on changes? What if these clusters do not fit my job description? Cluster 1 PC Grid client List of clusters?? Cluster 2 2014 Oxana Smirnova, Dept. of Physics 115

Information system concept Each Grid service must provide information about itself on request True for all services, but currently implemented only for computing services, and partially for storage A request can be anonymous or authorised Anonymous requests are useful for overall monitoring Authorised requests allow to provide user-specific information» Details of user s or VO s jobs In practice, only anonymous requests are fully implemented» You can check status of any service or anybody s job on the Grid» You don t even need a proxy to do this In what follows, only clusters information is considered 2014 Oxana Smirnova, Dept. of Physics 116

Purposes of information system Discover available clusters Enable matching of job description to cluster s properties Check status of clusters Check status of jobs 2014 Oxana Smirnova, Dept. of Physics 117

Types of information published by clusters Information published by clusters can be used for matchmaking and monitoring Each of these can be either semi-static or dynamic, for example: Matchmaking Semi-static Type of interface Maximum memory Performance benchmark Dynamic Free CPU count Free storage Queue length Monitoring Cluster name Cluster location Total CPU count Occupied CPU count Number of jobs Queue status (up/down) 2014 Oxana Smirnova, Dept. of Physics 118

Many dilemmas Minimum vs maximum Centralised vs distributed Generic vs specific Job descriptions state minimal requirements Distributed dynamic information is wrong for everybody (users are unaware of each other) Generic information is not enough for specific needs Clusters advertise maximal capacity Centralised databases have synchronisation and scalability issues Specific information changes from user to user and from cluster to cluster 2014 Oxana Smirnova, Dept. of Physics 119

Common approach: LDAP LDAP: Lightweight Directory Access Protocol A hierarchical database (directory) and a protocol to interact with it (ldap:// ) Perfect for storing information like phone directories» Widely used to store personal information, even in LU Clusters have hierarchical structure, hence LDAP is a perfect fit» Each cluster has queues, each queue has jobs and users, etc. LDAP is a subset of the X.500 standard The directory consists of objects (entries), each of which has a number of attributes» Attributes are defined in a schema» Several schemas exist on the Grid Objects are organised in trees and sub-trees Each object has a Distinguished Name (DN)» LDAP DNs have same structure as X.509 certificate DNs, which is no coincidence» DN is not an attribute it is just a name 2014 Oxana Smirnova, Dept. of Physics 120

ARC schema Schema defines a set of rules defining the structure of a Directory Information Tree (DIT) ARC comes with an own schema and DIT Not a standard, but quite logical ARC cluster DIT an LDAP subtree representing a computing resource 2014 Oxana Smirnova, Dept. of Physics 121

Some ARC cluster attributes Distinguished Name Certificate Authority { Ordered by hand: LDAP doesn t care much about ordering dn: nordugrid-cluster-name=arc-iridium.lunarc.lu.se,mds-vo-name=local,o=grid nordugrid-cluster-totalcpus: 192 nordugrid-cluster-name: arc-iridium.lunarc.lu.se nordugrid-cluster-aliasname: Iridum Cluster ARC nordugrid-cluster-lrms-type: SLURM nordugrid-cluster-lrms-version: 2.6.6-2 nordugrid-cluster-middleware: nordugrid-arc-4.0.0 nordugrid-cluster-cpudistribution: 16cpu:12 nordugrid-cluster-issuerca: /C=NL/O=TERENA/CN=TERENA escience SSL CA nordugrid-cluster-sessiondir-total: 1330688 nordugrid-cluster-architecture: x86_64 nordugrid-cluster-prelrmsqueued: 0 nordugrid-cluster-cache-free: 4194167 nordugrid-cluster-contactstring: gsiftp://arc-iridium.lunarc.lu.se:2811/jobs nordugrid-cluster-totaljobs: 0 nordugrid-cluster-usedcpus: 0 nordugrid-cluster-sessiondir-free: 1303773 objectclass: Mds objectclass: nordugrid-cluster Mds-validfrom: 20140407234414Z Mds-validto: 20140407235414Z 2014 Oxana Smirnova, Dept. of Physics 122

Standard in works: GLUE2 GLUE2 is a common Grid information standard Takes *long* time for all the players to agree Can be implemented as LDAP, or as XML etc A proposed ARC GLUE2 LDAP tree 2014 Oxana Smirnova, Dept. of Physics 123

Shared file system arcinfo Information publishing for an ARC cluster Session dir RTE Head Node Control dir SLURM info A-REX ARIS DTR JURA certs gridftp/http A-REX periodically launches information providers which: Collect all details defined by relevant information schemas, such as» Hardware details» LRMS details cache RP map users» Available application software (RTE)» Authorised users (DNs)» etc slurmd Worker Node arcinfo Create formatted output ready to be served on request Populate LDAP databases of ARIS slurmd Worker Node ARIS serves information when queried via LDAP 2014 Oxana Smirnova, Dept. of Physics 124

Information indexing So far, so good: each cluster provides a list of its properties One cluster is not really a Grid, even two clusters is not too impressive What provides the list of clusters when there are hundreds of them? Answer: each cluster regularly announces itself to information indexing servers Certain (minimal) amount of information is being sent» Most importantly, cluster s contact details Information indexing servers usually make use of LDAP, too Indexing servers can even store entire set of information from each cluster, but it does not scale well 2014 Oxana Smirnova, Dept. of Physics 125

Information indexing hierarchy Index can register to another index and so on 2014 Oxana Smirnova, Dept. of Physics 126

Shared file system Registration Process on an ARC cluster Session dir RTE cache Head Node Control dir SLURM info A-REX ARIS RP DTR JURA certs gridftp/http map users Information Registration Process periodically sends pre-configured information to one or more preconfigured information registries Service type (cluster in this case) Service contact details (contact string, port etc) slurmd slurmd Worker Node Worker Node EGIIS EMIR BDII Currently all such data are communicated via LDAP Other technologies are possible, as primary data are stored internally as regular files 2014 Oxana Smirnova, Dept. of Physics 127

Extended information Many users and VOs want to know additional information, e.g. E-mail of a security officer Planned downtimes Performance history for specific kind of tasks Details of particular software Such information is often user-specific, and often can not be automatically generated Several additional approaches are used: VOs create their own information systems» Harvest information from standard Grid channels» Add own details Jobs are instructed to collect information themselves, and change workflow depending on the result» Pilot/agent jobs: empty jobs that collect information and pull in matching tasks from VO databases 2014 Oxana Smirnova, Dept. of Physics 128

Monitoring Current status of the Grid can be inspected via the native information system Historical details need an additional database Various VO-specific dashboards exist to monitor changes of status Some information can only be collected by special monitoring jobs probes A probe can test whether a specific environment is set up in a way that suits a VO Results of such jobs can be used by standard monitoring systems, e.g. Nagios 2014 Oxana Smirnova, Dept. of Physics 129

Exercises Discover the local information tree published by the LDAP server running on the CE using the ldapsearch CLI Read the ldapsearch man pages, then connect to the root entries of the LDAP trees on the Iridium computing service: ldapsearch h arc-iridium.lunarc.lu.se p 2135 x b mds-vo-name=local,o=grid s base Or try the GLUE2 tree: ldapsearch h arc-iridium.lunarc.lu.se p 2135 x b o=glue s base Get the child elements containing all the attributes ldapsearch h arc-iridium.lunarc.lu.se p 2135 x b o=glue s one Or the full tree skeleton only with the DNs: ldapsearch h arc-iridium.lunarc.lu.se p 2135 x b mds-vo-name=local,o=grid dn Check the dynamic behaviour of the information system. Run the following ldapsearch for fetching information about Grid jobs BEFORE and AFTER submitting a couple of hello grid jobs: ldapsearch h arc-iridium.lunarc.lu.se p 2135 x b mds-vo-name=local,o=grid objectclass=nordugrid-job Can you find out e.g. the submission time of certain job? (hint: check the nordugrid-job-submissiontime attribute value) 2014 Oxana Smirnova, Dept. of Physics 130

Exercises Browse the local information LDAP Trees with a Graphical LDAP browser On Windows, install either ldapadmin (preferred) or jxplorer (requires Java): http://www.ldapadmin.org/download/ldapadmin.html http://jxplorer.org/downloads/users.html On Linux, install the jxplorer (can be installed without root privileges) http://sourceforge.net/projects/jxplorer/files/jxplorer/version%203.3.1/jxplorer- 3.3.1-linux-installer.run/download Define a new connection with following settings: server: arc-iridium.lunarc.lu.se port: 2135, base: mds-voname=local, o=grid or base: o=glue Walk through the tree, check the different LDAP entries and their attributes 2014 Oxana Smirnova, Dept. of Physics 131

Exercises Discover the hierarchical registration system implemented by the NorduGrid Index Services (top and country EGIISes): Check the content of a Top NorduGrid EGIIS: ldapsearch h index1.nordugrid.org p 2135 x b mds-voname=nordugrid, o=grid s base giisregistrationstatus Check the content e.g of the Country EGIIS of Norway: ldapsearch h <host name obtained from the top GIIS> p 2135 x b mds-vo-name=norway, o=grid s base giisregistrationstatus Unfortunately, there are no graphical browsers for the EGIIS hierarchy 2014 Oxana Smirnova, Dept. of Physics 132

Exercises Monitoring & dashboards Fire up the Grid monitor from www.nordugrid.org Browse the ATLAS dashboard: http://dashboard.cern.ch/atlas/ SAM: http://operations-portal.in2p3.fr/ ( other tools - > ) Yet another site monitoring database, the GOCDB: http://goc.egi.eu/ 2014 Oxana Smirnova, Dept. of Physics 133