EGEE and Interoperation

Similar documents
ARC integration for CMS

g-eclipse A Framework for Accessing Grid Infrastructures Nicholas Loulloudes Trainer, University of Cyprus (loulloudes.n_at_cs.ucy.ac.

PoS(EGICF12-EMITC2)081

Grid Computing Middleware. Definitions & functions Middleware components Globus glite

Interoperation Scenarios of Production e-science Infrastructures

30 Nov Dec Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy

Grid Scheduling Architectures with Globus

The LHC Computing Grid

The EGEE-III Project Towards Sustainable e-infrastructures

The LHC Computing Grid

The impact and adoption of GLUE 2.0 in the LCG/EGEE production Grid

The glite middleware. Presented by John White EGEE-II JRA1 Dep. Manager On behalf of JRA1 Enabling Grids for E-sciencE

Operating the Distributed NDGF Tier-1

Introduction to Grid Infrastructures

glite Grid Services Overview

Integration of Cloud and Grid Middleware at DGRZR

ALICE Grid Activities in US

A short introduction to the Worldwide LHC Computing Grid. Maarten Litmaath (CERN)

Grid Interoperation and Regional Collaboration

SZDG, ecom4com technology, EDGeS-EDGI in large P. Kacsuk MTA SZTAKI

Enabling Grids for E-sciencE. EGEE security pitch. Olle Mulmo. EGEE Chief Security Architect KTH, Sweden. INFSO-RI

Towards sustainability: An interoperability outline for a Regional ARC based infrastructure in the WLCG and EGEE infrastructures

ARC NOX AND THE ROADMAP TO THE UNIFIED EUROPEAN MIDDLEWARE

Austrian Federated WLCG Tier-2

Grid Programming: Concepts and Challenges. Michael Rokitka CSE510B 10/2007

Grid services. Enabling Grids for E-sciencE. Dusan Vudragovic Scientific Computing Laboratory Institute of Physics Belgrade, Serbia

The glite middleware. Ariel Garcia KIT

EUROPEAN MIDDLEWARE INITIATIVE

Andrea Sciabà CERN, Switzerland

The INFN Tier1. 1. INFN-CNAF, Italy

Grid Architectural Models

I Tier-3 di CMS-Italia: stato e prospettive. Hassen Riahi Claudio Grandi Workshop CCR GRID 2011

Easy Access to Grid Infrastructures

Interoperating AliEn and ARC for a distributed Tier1 in the Nordic countries.

Distributing storage of LHC data - in the nordic countries

The Role and Functions of European Grid Infrastructure

Evolution of the ATLAS PanDA Workload Management System for Exascale Computational Science

THEBES: THE GRID MIDDLEWARE PROJECT Project Overview, Status Report and Roadmap

Grids and Security. Ian Neilson Grid Deployment Group CERN. TF-CSIRT London 27 Jan

CHAPTER 2 LITERATURE REVIEW AND BACKGROUND

The EPIKH, GILDA and GISELA Projects

Interconnect EGEE and CNGRID e-infrastructures

Moving e-infrastructure into a new era the FP7 challenge

Grid Computing. MCSN - N. Tonellotto - Distributed Enabling Platforms

What s new in HTCondor? What s coming? HTCondor Week 2018 Madison, WI -- May 22, 2018

Overview of HEP software & LCG from the openlab perspective

Monitoring tools in EGEE

Travelling securely on the Grid to the origin of the Universe

WLCG Lightweight Sites

Advanced School in High Performance and GRID Computing November Introduction to Grid computing.

Independent Software Vendors (ISV) Remote Computing Usage Primer

The University of Oxford campus grid, expansion and integrating new partners. Dr. David Wallom Technical Manager

Interoperable job submission and management with GridSAM, JMEA, and UNICORE

AMGA metadata catalogue system

Considerations for a grid-based Physics Analysis Facility. Dietrich Liko

Leveraging the InCommon Federation to access the NSF TeraGrid

SLCS and VASH Service Interoperability of Shibboleth and glite

Grid Services Security Vulnerability and Risk Analysis

Sustainable DCI Operations scalable in size, time, and complexity

ATLAS NorduGrid related activities

Grid Infrastructure For Collaborative High Performance Scientific Computing

Application of Virtualization Technologies & CernVM. Benedikt Hegner CERN

Resource Allocation in computational Grids

A distributed tier-1. International Conference on Computing in High Energy and Nuclear Physics (CHEP 07) IOP Publishing. c 2008 IOP Publishing Ltd 1

Grid Computing at Ljubljana and Nova Gorica

Introducing the HTCondor-CE

Michigan Grid Research and Infrastructure Development (MGRID)

INITIATIVE FOR GLOBUS IN EUROPE. Dr. Helmut Heller Leibniz Supercomputing Centre (LRZ) Munich, Germany IGE Project Coordinator

Gergely Sipos MTA SZTAKI

where the Web was born Experience of Adding New Architectures to the LCG Production Environment

CHIPP Phoenix Cluster Inauguration

The LHC Computing Grid. Slides mostly by: Dr Ian Bird LCG Project Leader 18 March 2008

Eclipse Technology Project: g-eclipse

Abstract. 1. Introduction. M. Marzolla. L. Zangrando INFN Padova, Italy

VMs at a Tier-1 site. EGEE 09, Sander Klous, Nikhef

L3.4. Data Management Techniques. Frederic Desprez Benjamin Isnard Johan Montagnat

REsources linkage for E-scIence - RENKEI -

Grid Computing Security hack.lu 2006 :: Security in Grid Computing :: Lisa Thalheim 1

Implementing GRID interoperability

How to build Scientific Gateways with Vine Toolkit and Liferay/GridSphere framework

Deploying the TeraGrid PKI

Ivane Javakhishvili Tbilisi State University High Energy Physics Institute HEPI TSU

Pan-European Grid einfrastructure for LHC Experiments at CERN - SCL's Activities in EGEE

Ganga - a job management and optimisation tool. A. Maier CERN

Introduction, History

Deploying virtualisation in a production grid

LCG-2 and glite Architecture and components

PoS(ACAT2010)029. Tools to use heterogeneous Grid schedulers and storage system. Mattia Cinquilli. Giuseppe Codispoti

Grid Challenges and Experience

Scientific data processing at global scale The LHC Computing Grid. fabio hernandez

Open Grid Forum. OGF s Role in the Community

Interfacing Operational Grid Security to Site Security. Eileen Berman Fermi National Accelerator Laboratory

Lessons Learned in the NorduGrid Federation

Status of KISTI Tier2 Center for ALICE

Outline. Infrastructure and operations architecture. Operations. Services Monitoring and management tools

Parallel Computing in EGI

Concepts and Design of an Interoperability Reference Model for Scientific- and Grid Computing Infrastructures

Overview of ATLAS PanDA Workload Management

Bob Jones. EGEE and glite are registered trademarks. egee EGEE-III INFSO-RI

Chapter 2 Introduction to the WS-PGRADE/gUSE Science Gateway Framework

Transcription:

EGEE and Interoperation Laurence Field CERN-IT-GD ISGC 2008 www.eu-egee.org EGEE and glite are registered trademarks

Overview The grid problem definition GLite and EGEE The interoperability problem The interoperation problem Interoperation activities in EGEE Grid Interoperability Now! The need for standards 2

What is a Grid? Campus Grids Volunteer Computing Cross-organizational Grids Clusters Data Centers Intra-organizational Grids Virtualization Cloud Computing Vaporware? 3

What is the problem? Organization A Organization B Organization A and B are administrative domains Independent policies, systems and authentication mechanisms Users have local access to their local system using local methods Users from A wish to collaborate with users from B Pool the resources Split tasks by specialty Share common frameworks 4

The Solution Organization A Virtual Organization Organization B The Users from A and B create a Virtual Organization Users have a unique identify but also the identity of the VO Organizations A and B support the Virtual Organization Place grid interfaces at the organizational boundary These map the generic grid functions/information/credentials To the local security functions/information/credentials Multi-institutional e-science Infrastructures 5

What is glite GLite is an integrated middleware distribution that provides the abstract interfaces required for building a grid infrastructure which enables resource sharing across administrative domains. The distribution consists of software repositories containing validated components from multiple software providers, including components from glite, with the documentation and tools required for deploying this as a production quality service. The release procedure for the glite distribution follows the same release methodology used by many Linux distributions; A major baseline release to which updates are continually added. The standard tools for the reference operating system are leveraged to create the software repositories which are logical separated by service to allow them to evolve independently. The latest major release is 3.1, which is available for the reference OS SL4 in both 32 and 64 bit flavors. Availability for other OS is a high priority and the order the priority is driven by demand. http://glite.web.cern.ch/glite/ 6

What is EGEE? The Enabling Grids for E-SciencE (EGEE) Project 139 partners institutes from over 32 countries Providing a service grid infrastructure of ~50000 CPUs and ~ 5 PB disk (5 million Gigabytes) of disk storage + tape MSS Distributed across 260+ sites in 48 countries Which is available to more than 7500 users Organized over 200 Virtual Organizations across 10 applications domains Who run are running more than 190K jobs per day 24 hours-a-day, 7 Days a week, 365 days a year 7

The Solutions EGEE CREAM LSF ARC Nordugrid OSG GRAM v2 PBS/Torque Sun Grid Engine Load Leveler Condor GRAM Unicore DEISA v4 NAREGI Teragrid Naregi 8

The New Problem Grid A Virtual Organization Grid B Multiple grid infrastructures have evolved Using different interfaces at the organizational boundary Users have grid access to their grid systems using grid methods A grid itself can be seen as an organizational domain Independent policies, systems and authentication mechanisms VOs from Grid A wish to use resources in grid B Pool the resources Split task by specialty Share common frameworks 9

Why? Required common interfaces Now have multiple common interfaces Tried to solve one problem, but created another Reasons: The infrastructures were developed independently Funding based on regions and application domains Grid infrastructures are based on different middleware Experimentation with different approaches Initially there were no standards Standards take time to mature We need to build the infrastructures now! The infrastructures outpaced standardization Good standards require experience 10

What can we do? Interoperability: The ability to exchange information and to use what has been exchanged (software) Interoperation The use of interoperable systems (Infrastructures) 11

How to Start Understanding the differences Compatibility matrix Domains that have to be linked for interoperability Security Information Services Job Management Data Management For interoperation you have to add Monitoring Accounting Operational links and joint policies Trouble ticket systems Operational security 12

Interoperability Matrix 1. Understand both middleware stacks 2. Identify the common interfaces 3. Create an interoperability matrix ARC OSG EGEE Job Submission GridFTP GRAM GRAM Service Discovery LDAP/GIIS LDAP/GIIS LDAP/BDII Schema ARC GLUE v1 GLUE v1.2 Storage Transfer Protocol GridFTP GridFTP GridFTP Storage Control Protocol SRM SRM SRM Security GSI/VOMS GSI/VOMS GSI/VOMS 13

Select Strategy Long term solution Common interfaces Standards Medium term solutions Gateways Adaptors and Translators Short term solutions Parallel Infrastructures User driven Site driven 14

Parallel Infrastructures User Driven The user joins both grids Uses different clients Depending on which interface More work for the User Required for each infrastructure Keyhole approach Restricts functionality Method initially used by ATLAS Split workload between grids 15

Parallel Infrastructures Site Driven The site joins both grids Deploys both interfaces User only sees their grid interface More work for the site Can only be supported by large sites Reduced resources Use By FZK Participating in EGEE, Nordugrid and D-grid 16

Gateway A gateway is a bridge between grid infrastructures Single point of failure Gateway breaks, grid disappears Scalability bottleneck All the load through one service Useful as a proof concept and to demonstrate the need NAREGI approach using glite-ce Gateway 17

Adaptors and Translators Adaptors allow connection Translators understand/modify information They are built into the middleware The middleware can then work with both interfaces Useful feature even when using standards! Requires modification to the grid middleware Existing service interfaces can still be used Using in the GIN information System API Plugin Plugin 18

Bilateral Activities within EGEE Enabling Grids for E-sciencE EGEE / OSG Already interoperating since Autumn 2005 EGEE /NDGF Working on interoperability since Summer 2005 Anticipated completion May 2008 EGEE/Unicore Started Summer 2006 Prototype components available EGEE/Naregi Working on interoperability with EGEE since winter 2006 Interoperable components available EUCHINAGrid Separate project EGEE/Garuda See talk in the next session! 19

Grid Interoperability Now Building upon the many bi-lateral activities Started at GGF-16 (now OGF) in Feb 2006 Demonstrate what we can for SC 2006 Applications, Security, Job Management Information Systems, Data Management GIN 20

GIN Information System Glue GIN BDII ARC BDII Generic Information Provider Translators Provider EGEE Provider OSG Provider NDGF Provider Naregi Provider Teragrid Provider Pragma EGEE OSG NDGF Naregi Teragrid Pragma Site Site Site Grid Grid Grid 21

Google Earth Demo EGEE OSG Naregi Teragrid Pragma Nordugrid 22

The Need For Standards Identified areas where standards are needed From the various interoperation activities Common interfaces Critical interfaces at the organizational boundary Security Information Computing Storage Standards are less important for higher level services Problem constrained within the VO Chose one solution and somewhere to host it. 23

Security Security is the fundamental aspect Users belong to a VO and do work on behalf of the VO Their identity is their experiment, not their institution Require a common security mechanism All other standards will inherit from this one Most grids use X509 credentials Already an existing standard This has significantly reduced interoperability problems Roots of trust, CAs, coordinated by the IGTF Require common methods for VO policy management Groups and roles within a VO Capabilities etc. 24

Information Service Separate content and interface Schema defines the content. Glue Schema created to facilitate interoperation Currently v1.3 Now and OGF working group Draft of v2.0 ready now! LDAP is the dominant interface 55% grids, 95% sites provide an LDAP interface Grids and sites participating in GIN Various web service interfaces These all have problems with large query results 25

Data Management GridFTP Supported in most grid infrastructures Reduced interoperability problems Storage Resource Manager Is proposed interface to storage Problems with different interpretations of the specification Incompatible implementations With a huge amount of effort it has taken 18 months to get right The Storage Resource Broker (SRB) An alternative which is widely used. 26

Job Management Job Description Language JSDL as defined by the OGF Computing Interface As many interfaces as batch systems! Need to agree on a common interface OGSA-BES is the current candidate OGSA-BES V1.0 draft document A number of prototypes exist but unproven in production Cream CE and KnowARC CE will implement BES Need to think about accounting 27

Final Thoughts The problem of grid interoperation A second attempt at the original problem The solution is common interfaces Most crucially at the site boundary The only way forward is real standards The most important part is to agree Production feedback will ensure it works! The initial choice only select the starting point Interoperability can be overcome short term But only standards are sustainable in the long term 28

Summary We need to put Grids into context What problem you are addressing? Multi-institutional e-science Infrastructures Grid Interoperability is an avoidable problem Grid Interoperation is not! More focus is needed on the interfaces Less focus required on specific implantations Standards are critical for the future It doesn t matter what they are as long a we agree Existing use cases will ensure the standards work 29