On the EGI Operational Level Agreement Framework

Similar documents
Outline. Infrastructure and operations architecture. Operations. Services Monitoring and management tools

E u r o p e a n G r i d I n i t i a t i v e

EGI Operations and Best Practices

E GI - I ns P I R E EU DELIVERABLE: D4.1.

European Grid Infrastructure

SA1 and JRA1 Operations and Operational Tools

Open Science Commons: A Participatory Model for the Open Science Cloud

Regional e-infrastructures

EGI, LSGC: What s in it for you?

The Role and Functions of European Grid Infrastructure

EGI: Linking digital resources across Eastern Europe for European science and innovation

GRIDS INTRODUCTION TO GRID INFRASTRUCTURES. Fabrizio Gagliardi

High Performance Computing from an EU perspective

On-demand Research Computing: the European Grid Infrastructure

e-infrastructures in FP7: Call 7 (WP 2010)

Interconnected NRENs in Europe & GÉANT: Mission & Governance Issues

Provisioning of Grid Middleware for EGI in the framework of EGI InSPIRE

Some aspect of research and development in ICT in Bulgaria. Authors Kiril Boyanov and Stefan Dodunekov

EGI federated e-infrastructure, a building block for the Open Science Commons

The CHAIN-REDS Project

EU Projects. Christoph Witzig

Travelling securely on the Grid to the origin of the Universe

Coupled Computing and Data Analytics to support Science EGI Viewpoint Yannick Legré, EGI.eu Director

GÉANT and other projects Update

EGI-InSPIRE. Cloud Services. Steven Newhouse, EGI.eu Director. 23/05/2011 Cloud Services - ASPIRE - May EGI-InSPIRE RI

Scientific data processing at global scale The LHC Computing Grid. fabio hernandez

The LHC Computing Grid

GÉANT Mission and Services

The EPIKH, GILDA and GISELA Projects

A European Cloud federation

Conference The Data Challenges of the LHC. Reda Tafirout, TRIUMF

PROJECT FINAL REPORT. Tel: Fax:

REPORT 2015/186 INTERNAL AUDIT DIVISION

Introduction to EGI. Mission Services Governance International collaborations Contribution to DSM and EOSC.

e-infrastructure: objectives and strategy in FP7

EUROPEAN MIDDLEWARE INITIATIVE

3.4 NON-DOMESTIC SERVICES (L )

EU Liaison Update. General Assembly. Matthew Scott & Edit Herczog. Reference : GA(18)021. Trondheim. 14 June 2018

INSPIRE and Service Level Management Why it matters and how to implement it

REPORT 2015/149 INTERNAL AUDIT DIVISION

The Africa Utilities Telecom Council Johannesburg CC, South Africa 1 st December, 2015

EGI-InSPIRE RI NGI_IBERGRID ROD. G. Borges et al. Ibergrid Operations Centre LIP IFCA CESGA

First Session of the Asia Pacific Information Superhighway Steering Committee, 1 2 November 2017, Dhaka, Bangladesh.

Pan-European Grid einfrastructure for LHC Experiments at CERN - SCL's Activities in EGEE

EUMEDCONNECT3 and European R&E Developments

Petaflop Computing in the European HPC Ecosystem

3.4 NON-DOMESTIC SERVICES (L )(C.2.1.9)

EGI Strategy Enabling collaborative data- and compute-intensive science

Uptime and Proactive Support Services

Challenges in the field of Research Infrastructures, ESFRI

Progress towards the EOSC

e-infrastructures in FP7 INFO DAY - Paris

AARC Overview. Licia Florio, David Groep. 21 Jan presented by David Groep, Nikhef.

Federated Identities and Services: the CHAIN-REDS vision

Helix Nebula The Science Cloud

CEF e-invoicing. Presentation to the European Multi- Stakeholder Forum on e-invoicing. DIGIT Directorate-General for Informatics.

Service Level Agreement Metrics

Parallel Computing in EGI

Accountability Conceptual Framework

eplus Managed Services eplus. Where Technology Means More.

First Experience with LCG. Board of Sponsors 3 rd April 2009

WP JRA1: Architectures for an integrated and interoperable AAI

Senior Manager Information Technology (India) Duration of job

Helix Nebula Science Cloud Pre-Commercial Procurement pilot. 5 April 2016 Bob Jones, CERN

A business model for the establishment of the European grid infrastructure

PROGRAM GUIDE RED HAT CONNECT FOR TECHNOLOGY PARTNERS

The changing world of Distributed Computing Infrastructures (DCI) - role and contributions of EU e-infrastructure programme

The EGEE-III Project Towards Sustainable e-infrastructures

We are also organizational home of the Internet Engineering Task Force (IETF), the premier Internet standards-setting body.

Arkadin helps you achieve more at work: The voice expert for Microsoft Skype for Business and Office 365 For Large Enterprises

Interconnection of Armenian e- Infrastructures with the pan- Euroepan Integrated Environments

The National Research and Education Network. Problems and Solutions

The 6DISS Project: IPv6 Dissemination and Exploitation. IPv6 Summit, Beijing, China 14/04/2006

COMMUNITY OR ENTERPRISE? Choosing between JBoss community projects and Red Hat JBoss Middleware

Accelerate Your Enterprise Private Cloud Initiative

Agenda: Alberto dimeglio (AdM) Balazs Konya (BK) Helmut Heller (HH) Steve Crouch (SC)

EUROPEAN ICT PROFESSIONAL ROLE PROFILES VERSION 2 CWA 16458:2018 LOGFILE

GÉANT Community Programme

ICANN Policy Update & KSK Rollover

SLHC-PP DELIVERABLE REPORT EU DELIVERABLE: Document identifier: SLHC-PP-D v1.1. End of Month 03 (June 2008) 30/06/2008

Avanade s Approach to Client Data Protection

Europe and its Open Science Cloud: the Italian perspective. Luciano Gaido Plan-E meeting, Poznan, April

CGM and Energy Data Architecture a TSO Perspective

Federated Services and Data Management in PRACE

Italy - Information Day: 2012 FP7 Space WP and 5th Call. Peter Breger Space Research and Development Unit

IPR implementation EEA perspective. Road map to AQ e-reporting March 2012

Global Research and Education Networking Ecosystem

Reporting on EOSC matters. Onur Temizsoylu TÜBİTAK ULAKBİM

Advancing European R&E through collaboration

Cyber Risk Program Maturity Assessment UNDERSTAND AND MANAGE YOUR ORGANIZATION S CYBER RISK.

CrossGrid testbed status

GÉANT Open Service Description. High Performance Interconnectivity to Support Advanced Research

GN3plus External Advisory Committee. White Paper on the Structure of GÉANT Research & Development

Microsoft IT Leverages its Compute Service to Virtualize SharePoint 2010

A European Vision and Plan for a Common Grid Infrastructure

Final Project Report. Abstract. Document information

Globally Networked Customs Context, Concept, Rationale and Benefits - Indian Customs Perspective

Enabling BigData Workflows in Earth Observation Science

TX CIO Leadership Journey Texas CIOs Bowden Hight Texas Health and Human Services Commission Tim Jennings Texas Department of Transportation Mark

History of NERC December 2012

Transcription:

EGI-InSPIRE On the EGI Operational Level Agreement Framework Tiziana Ferrari, EGI.eu EGI Chief Operations Officer 1

Outline EGI and its ecosystem EGI Service Infrastructure Operational level agreements Scope Negotiation Reporting Service level agreements Future work and conclusions 2

European Grid Initiative A sustainable resource infrastructure supporting the computing needs of structured international research (Virtual Research Organizations) providers in Europe and worldwide With new technologies as they mature Mission Operate a production-quality infrastructure Provide operational and user support services Attract new international user communities e.g. ESFRI - European Strategic Forum on Research Infrastructures 3

EGI Status and yearly increase (April 2011) Centres 338 +6.8% Europe, Asia Pacific, North and South America 96 supporting MPI +31.5% Countries 51 (57 with integrated RPs) +18.75% Capacity 240,000 CPU cores (339,000 with integrated and peer infrastructures) 24.9% 1.89 Million HEP-SPEC 06* 102 PB disk, 89 PB tape * HEP-SPEC 06: Computing benchmark based on SPECCPU2006, 10 HEP-SPEC = 4 ksi2k 4

EGI Ecosystem Policies + Funding User Community Requirements + Feedback Services + Support European Commission National Research Councils Public Funding Bodies Policies + Funding Strategic Feedback Service & Providers Infrastructure Providers Requirements + Feedback Technology + Support Policies + Funding Requirements + Feedback Technology Providers Open Source Providers Commercial Providers 5

Layer II. EGI Participant: Infrastructure National infrastructure Grid Provider (RP) The federation Initiatives of The (NGIs), legal European organisation Centres, which responsible are for interconnected Intergovernmental by any the matter National Research that Research concerns and the Education Organisations Networks respective (NRENs) (EIROs) and GÉANT. Infrastructure EGI Infrastructure Layer III. EGI Infrastructure Infrastructure Centres Centres NGI/EIRO EGI.eu Layer I. Centre (RC) A localised or geographically distributed administration domain, where EGI resources (CPUs, data storage, instruments and digital libraries) are managed and operated to be accessed by end-users Network MoUs Provider Infrastructure Provider Infrastructure Centres Centres Peer infrastructures: Centres accessible Integrated to EGI Infrastructures: users, but relying on operated own operational by a non-egi-inspire services, e.g. partner Open Centres but Science relying on Grid EGI (USA) operational services, e.g. Latin American and Caribbean 6

EGI.eu Governance central organization for the coordination of European Grid resources and services Established 8 February 2010 Central policy & services needed to run a grid, technical services from partners EGI Council 34 National Grid Initiatives and European Intergovernmental Organizations (CERN) participants and associated participants EGI and EGI.eu: Supported by the EGI-InSPIRE project 7

EGI-InSPIRE Project Integrated Sustainable Pan-European Infrastructure for Researchers in Europe A 4-year project with 25M EC contribution Project cost 72M Total Effort ~ 330M Effort: 9261PMs Project Partners (50) EGI.eu, 38 NGIs, 2 EIROs Asia Pacific (9 unfunded partners) 8

EGI Operations Operation of a production infrastructure Validate new technology releases (tools and middleware) Support end-users and Centre administrators Service Level Management, grid oversight, documentation and procedures Operate tools, the accounting infrastructure and the EGI Helpdesk Evolve the operational tools used by the production infrastructure - Maintenance, development and support of national deployment - Accounting for the use of new resources (desktop, virtualisation, storage, data, application and billing) 9

EGI OLA and SLA framework National Providers Deployed Infrastructure Services Used by Researchers Current User Communities EGI.eu Technology Providers New Technology Assessed Gathering New Requirements New User Communities 10

EGI Service Infrastructure The service infrastructure enables secure, interoperable and reliable access to distributed resources. EGI services are provided locally by Operations Centres and globally by EGI.eu. and partners Global Services Infrastructure Infrastructure Infrastructure Infrastructure Local Services Operations Operations Centres Operations Centres Centres Service categories: I. Infrastructure Services Tools II. Technical Services Grid middleware III. Support Services Helpdesk and Support Teams IV. Human Services Service Level Management, security, documentation, coordination 11

Operational Level Agreements 1/2 1. Centre OLA - approved 2. Infrastructure Provider OLA (Local Services) in progress 3. EGI.eu OLA (Global Services) to be defined 12

Operational Level Agreements 2/2 Parties (Minimum set of ) services exchanged Service hours Metrics and the minimum service targets The reporting period (where available) Penalties (if defined) suspension 13

Negotiation The EGI OLAs can be customised by the Parties to meet local requirements, consistency must be ensured All OLAs and their updates are approved by the Operations Management Board (OMB) Centre OLA Its acceptance (including the updates) is a pre-requisite for being a certified Centre The Provider is responsible of handling the negotiation and to record the agreement Provider OLA EGI participants: discussion within the OMB, acceptance is a pre-requisite for integration Others: negotiation is part of the Infrastructure Provider MoU 14

Monitoring Monitoring and Reporting All Centre services are monitored Monitoring of local and global service is in progress Reporting on a monthly/yearly basis depending on the metric Follow-up A central support team is responsible of identifying underperforming Centres, of collecting justifications and of performing suspension as needed penalties for Providers and EGI.eu to be defined 15

Service Availability Monitoring (SAM) SAM: monitoring framework for RCs and services main data sources for the Operations Dashboard data source to generate Availability/Reliability statistics local/central components: 1. test submission framework: based on the Nagios system and customised by the Nagios Configurator Generator 2. databases for storage of information about topology (Aggregated Topology Provider), metrics (Metrics Description DataBase) and results (Metrics Results Store) 3. visualisation tool GUI: MyEGI 16

EGI AVG Availability/Reliability (%) Availability the percentage of time that the service/rc was up and running (uptime / total time) x 100 minimum RC availability: 70% Reliability the percentage of time that the service/rc was up and running, excluding periods of scheduled interventions [uptime / (total time scheduled time)] x 100 minimum RC reliability: 75% Suspension policy RC availability < 50% for 3 consecutive months 6 RCs suspended stricter policy from PY2: from 50 to 70% Service/ Centre (RC) Availability and Reliability 97.00% 96.00% 95.00% 94.00% 93.00% 92.00% 91.00% 90.00% 89.00% 88.00% Overall PY1 EGI availability: 92.73% Overall PY1 EGI reliability: 93.85% PQ1 PQ2 PQ3 PQ4 monthly availability Reporting monthly reliability monthly performance reports per RC new ticket-based procedure for monitoring of underperforming RCs 17

EGI Service Level Agreements Steering the EGI software evolution Publish the Unified Middleware Distribution (UMD) Roadmap Collect and prioritise strategic requirements Engage with external Technology Providers Provision software for the EGI community Ensure the quality of delivered software Provide a software repository for UMD and other components Provide 2 nd level support for the deployed middleware MoU SLA Technology Providers MoUs & SLAs New Technology Assessed 18

Software component delivery SLA Template release plan, release delivery and format Quality Assurance Acceptance criteria and test plans Issue management Issue management infrastructure Issue resolution Vulnerability management Performance measurement Target date Estimated time of availability Metrics Problem management and escalation 19

Agreements with software providers 4 MoUs signed with: European Middleware Initiative http://go.egi.eu/483 Initiative for Globus in Europe http://go.egi.eu/484 Simple API for Grid Applications (SAGA) http://go.egi.eu/485 StratusLab Project http://go.egi.eu/448 3 SLAs signed with: EMI http://go.egi.eu/461 IGE http://go.egi.eu/442 SAGA http://go.egi.eu/449 20

Future work Improvement of the RC performance Finalization of the OLA framework (2011) Monitoring: extension of the Service Availability Monitoring framework and of the reporting system for monitoring of new services (e.g. EGI.eu central services) customisation of service level targets Reporting: increasing automation Follow-up: automated proactive control systems fully relying on the existing incident management system and processes Development of a SLA framework involving the end-users 21

Conclusions Increasing focus on sustainability of EGI services OLA framework being consolidated A complete EGI service business model is a pre-requisite for the finalization of the SLA and OLA frameworks 22