COD DECH giving feedback on their initial shifts

Similar documents
Monitoring tools in EGEE

EGI-InSPIRE RI NGI_IBERGRID ROD. G. Borges et al. Ibergrid Operations Centre LIP IFCA CESGA

Geographical failover for the EGEE-WLCG Grid collaboration tools. CHEP 2007 Victoria, Canada, 2-7 September. Enabling Grids for E-sciencE

Computing for LHC in Germany

Regional SEE-GRID-SCI Training for Site Administrators Institute of Physics Belgrade March 5-6, 2009

Alfresco Content Services 5.2. Getting Started Guide

Welcome to Hitachi Vantara Customer Support for Pentaho

Assigns a ticket to you. A ticket has to be assigned to you in order for you to work on it and see the full list of actions.

Outline. Infrastructure and operations architecture. Operations. Services Monitoring and management tools

On the EGI Operational Level Agreement Framework

EGEE and Interoperation

Security Service Challenge & Security Monitoring

EGI-InSPIRE COD F2F. Ron Trompert. 11/14/12 1 EGI-InSPIRE RI

ITSM Training for Field Service

Service Level Agreement Metrics

E-course Manual. Instruction manual for the participants of the Eurogentest online courses. 8/9/2012 EuroGentest2

[MS20414]: Implementing an Advanced Server Infrastructure

EELA Operations: A standalone regional dashboard implementation

Technology Background Development environment, Skeleton and Libraries

Cisdem ContactsMate Tutorial

The EGEE-III Project Towards Sustainable e-infrastructures

Installation of CMSSW in the Grid DESY Computing Seminar May 17th, 2010 Wolf Behrenhoff, Christoph Wissing

Insight s Remote Technical Assistance Center: A User s Guide to RTAC

Acronis Data Cloud plugin for ConnectWise Automate

MANAGE YOUR CONSTRUCTION21 COMMUNITY

How can you get help? Ticketing system: Service-Now. SMB-SMS Group

Presenting Online in Elluminate Live!

Integrating with your Third Party Monitoring Software

2018 End-of-Year Guidelines

Failover procedure for Grid core services

NotifyMDM Device Application User Guide Installation and Configuration for Android

EGEE II - Network Service Level Agreement (SLA) Implementation

Track: EXTOL Business Integrator (EBI)

Visual Guide to Online Campus

Skype for Business Features Overview

IWR GridKa 6. July GridKa Site Report. Holger Marten

Acronis Data Cloud plugin for ConnectWise Automate

DESY. Andreas Gellrich DESY DESY,

Getting Started with the Severe Weather Shelter Scheduling Software

Connect Support Request Guide

Service Description: Identity Services Engine Implementation-Subscription Service

Chapter 11: Editorial Workflow

MAPS Agent Contact Types

Phire Frequently Asked Questions - FAQs

The LCG 3D Project. Maria Girone, CERN. The 23rd Open Grid Forum - OGF23 4th June 2008, Barcelona. CERN IT Department CH-1211 Genève 23 Switzerland

Getting Started with Blackboard A Guide for Students

MyUni - Discussion Boards, Blogs, Wikis & Journals

Campus Community Guide October 2012

EXPLORING COURSE TOOLS

From EGEE OPerations Portal towards EGI OPerations Portal

RC View DAT Dispatch: Quick Sheet

HEP replica management

Andrea Sciabà CERN, Switzerland

Microsoft SharePoint is provided by Information Services for staff in Aberystwyth University.

1. Create a customer account.

Blackboard User Guide for Participants

Session Administration System

Introduction to Google Calendar and Google Keep

Dynamics 365 for Customer Service - User's Guide

Magellan HUB: Making the Most of Our Collaboration Tools CAYLAN HORD DIRECTOR, END USER COMPUTE DAWN ROMBERG VP, CORPORATE COMMUNICATIONS

FACETs. Technical Report 05/19/2010

Forschungszentrum Karlsruhe in der Helmholtz-Gemeinschaft. Presented by Manfred Alef Contributions of Jos van Wezel, Andreas Heiss

BUSINESS CONTINUITY TOOLKIT

In this presentation you will get a feature overview of the forums in SAP NetWeaver Portal 7.3.

Receiving and Acknowledging Automated Dispatch Notifications

ATLAS operations in the GridKa T1/T2 Cloud

Accreditation & Certification Supplier Guide

Invoices associated with one customer can be combined into one PDF. Navigate to Transactions > Invoices > Combine Invoices to start the process.

RUSSIAN DATA INTENSIVE GRID (RDIG): CURRENT STATUS AND PERSPECTIVES TOWARD NATIONAL GRID INITIATIVE

InLoox PM 7 User Manual

Implementing an Advanced Server Infrastructure

HC3 Move Powered by Double-Take Quick Start Guide

E u r o p e a n G r i d I n i t i a t i v e

Kaseya IT Services KASEYA IT SERVICES PROGRAM CATALOG 2014 Q3

You can participate in Learning Catalytics with any device that has a browser (laptop, smartphone, or tablet).

Complaint Handling Procedure and Escalation Policy

INFORMATION TECHNOLOGIES & SERVICES

Case study on PhoneGap / Apache Cordova

Introduction... 4 I-Learn Introduction (Video)... 5 A Successful Start in Pathway (Video)... 6

MY MEDIASITE.

Update Manual Ios 7 Ipad 1 Won't >>>CLICK HERE<<<

Working in the MeetMe Conference Window

CANVAS STUDENT QUICKSTART GUIDE

GradLeaders Service Request Process (for schools using GradLeaders Career Center)

ARMS FORMS AND BEST PRACTICES. Department of Athletics and Recreation Athletics Compliance Office Monday s with your Manual September 112, 2016

Atlassian JIRA Introduction to JIRA Issue and Project Tracking Software Tutorial 1

The LHC Computing Grid

Monitoring System for the GRID Monte Carlo Mass Production in the H1 Experiment at DESY

Service Availability Monitor tests for ATLAS

feel free to poke around and change things. It's hard to break anything in a Moodle course, and even if you do it's usually easy to fix it.

Grid Services Security Vulnerability and Risk Analysis

Dealing with & Chat

OTRS Quick Reference

Lasell College s Moodle 3 Student User Guide. Access to Moodle

CollabNet TeamForge 6.2 User Guide

2012 Microsoft Corporation. All rights reserved. Microsoft, Active Directory, Excel, Lync, Outlook, SharePoint, Silverlight, SQL Server, Windows,

Zendesk II End User Updates

Atlassian Confluence 5 Essentials

A MANUAL ON HOW TO MANAGE PECB TRAINING EVENTS PROCESSES

Customer Support Procedures Sage X3 North America

Transcription:

COD DECH giving feedback on their initial shifts Clemens Koerdt, Victor Penso, Sven Hermann www.eu-egee.org

Centres in DECH contributing to the infrastructure Enabling Grids for E-sciencE U Dortmund DESY U Wuppertal DESY Zeuthen RWTH Aachen FhG SCAI TU Karlsruhe, IEKP FhG ITWM GSI FZK U Freiburg LRZ München MPI München CSCS Dist. ROC (EGEE2) RC RC to be certified 2

COD-DECH People Single-Point-of-contact (restricted to COD people) egee-dech_gridoperator-on-duty@savannah.fzk.de COD DECH discussion forum: egee-dech_gridoperator-on-duty https://savannah.fzk.de/forum/forum.php?forum_id=248 Team A Clemens Koerdt FZK Christian Peter FhG-ITWM Kläre Cassirer FhG-SCAI Team B Victor Penso GSI Christoph Wissing DESY Peter Kunszt CSCS Coordination&duty backup: Sven Hermann FZK 3

Project Week 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 Enabling Grids for E-sciencE Calendar Week Week start date 18 2006 01/05/2006 19 2006 08/05/2006 20 2006 15/05/2006 21 2006 22/05/2006 22 2006 29/05/2006 23 2006 05/06/2006 24 2006 12/06/2006 25 2006 19/06/2006 26 2006 26/06/2006 27 2006 03/07/2006 28 2006 10/07/2006 29 2006 17/07/2006 30 2006 24/07/2006 31 2006 31/07/2006 32 2006 07/08/2006 33 2006 14/08/2006 34 2006 21/08/2006 35 2006 28/08/2006 36 2006 04/09/2006 Timetable COD-DECH Kick-off meeting COD-8 DECH training session at CERN DECH Team A 1st unofficial shift with I as backup COD-9 DECH Team A 2nd unofficial shift (3days) with UK/I as backup DECH Team B 1st unofficial shift with CE as backup DECH Team B 2nd unofficial shift with TW as backup DECH Team first official shift as lead team

Our experience with COD CIC dashboard & monitoring tools convenient access to the relevant monitoring tools good overview of existing tickets and of sites experiencing problems tickets/problems listed according to priorities convenient and efficient notification tool (problem specific email templates) Operations procedures Very practical guide to the task The task in general Well organized task Great community 'Cod spirit' reinforced by regular COD meetings

What is happening at that site? Enabling Grids for E-sciencE Problem: sometimes missing information on a ticket/site Example: Ticket history not complete Consequently add support@ggus.org when replying from your email client Better: allow sending followup emails from the dashboard (without escalating the ticket) Example: Site has specific (and recurring) problems add a notepad per site to include comments (already raised by Alessandro) use it also in dealing with sites for which special agreements exist (US-sites?) use it to gather experience on some site's special way of reacting to tickets

Anything new to keep in mind? Enabling Grids for E-sciencE Problem: What were the recent 'orders'? Not everything is in the operations manual Nobody wants to ask the same questions at the handover again and again Decisions are taken at different meetings (COD, ROC, OpsMeet,..) Orders are given, adapted and revoked,... Example: how to handle ops VO tickets? collect recent instructions and publish them on the dashboard's handover tap should be kept up to date by someone who participates in most of the mentioned meetings

Multiple tickets for one site Problem: Handling multiple tickets at a site Examples: Importance and priority of tickets (don't bother sites) Tickets having changed character over time Dependencies of tickets Sometimes sites refer to GGUS tickets not appearing on the dashboard Include the tips and tricks section of Marcin's Quick Tutorial Synchronize expiration dates to the ticket with the highest priority Close tickets with multiple problems and open a clean one Watch for tickets of a site that exist only outside the dashboard Implement a tool that gives the supporter this overview

Operations procedures Problem: How does dashboard and GGUS interact? Examples: What is the relationship of their respective ticket states (GGUS <-> dashboard) How it the synchronization working? Site OK 'bug' Interaction of dashboard should be explained in the operations manual In particular the ticket states should be clearly defined Do not allow external people to control ticket state

Problem: Sites not always accept a ticket well Acceptance Examples: very short term problems and fast shooting of COD agent known issues at a site and COD agent unaware of that The site has other more serious problems and COD does not know include some general rules in the operations manual to be considered before opening/escalating a ticket Collect notes on how best to deal with a specific site (notepad) Be aware of a the details of a ticket (history, emails,..) Gather feedback from site administrators?

Central problems and relevance of tests Enabling Grids for E-sciencE Problem: How to detect (efficiently) central problems? Is the critical test indeed critical for the site and their users? Examples: Sometimes sites receive tickets for problems that should be addressed centrally There are occasions where a site has more serious problems than the one addressed in a COD ticket Need for additional monitoring tools (network,..)? Mark sites hosting central services like BDII, RB Start a discussion on that issue by doing some brainstorming Make it continuous by creating a specific wiki page Discuss relevance of tests in a similar fashion

Additional recommendations Response from discussion list (cic-on-duty) not always guaranteed (needs a moderator) Improve dashboard stability (Lavoisier, ) (already addressed with move to SAM) Reduce dashboard synchronization time (it takes a while before changes to tickets show up in the overview page) change links to GGUS tickets from user view to supporter view (to see the internal comments) Solve conflict of interest between ROC and COD role by relying on the backup team role for the backup teams could be substantiated (in the operations manual)

Conclusion Presented the structure of our distributed ROC Showed the various stages of DECH joining COD Found a well organized task with convenient tools Presented our list of recommendations Wanted to convince you that COD DECH is now ready to join the various working groups!