Architecture Proposal

Similar documents
Grid Architectural Models

NorduGrid Tutorial. Client Installation and Job Examples

Grid services. Enabling Grids for E-sciencE. Dusan Vudragovic Scientific Computing Laboratory Institute of Physics Belgrade, Serbia

30 Nov Dec Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy

The NorduGrid Architecture and Middleware for Scientific Applications

Layered Architecture

Day 1 : August (Thursday) An overview of Globus Toolkit 2.4

glite Grid Services Overview

The EU DataGrid Testbed

A New Grid Manager for NorduGrid

ARC middleware. The NorduGrid Collaboration

LCG-2 and glite Architecture and components

A Distributed Media Service System Based on Globus Data-Management Technologies1

Grid Computing. MCSN - N. Tonellotto - Distributed Enabling Platforms

Scientific data management

The Grid Monitor. Usage and installation manual. Oxana Smirnova

Usage statistics and usage patterns on the NorduGrid: Analyzing the logging information collected on one of the largest production Grids of the world

EGI-InSPIRE. ARC-CE IPv6 TESTBED. Barbara Krašovec, Jure Kranjc ARNES. EGI-InSPIRE RI

GridNEWS: A distributed Grid platform for efficient storage, annotating, indexing and searching of large audiovisual news content

Data Management for the World s Largest Machine

g-eclipse A Framework for Accessing Grid Infrastructures Nicholas Loulloudes Trainer, University of Cyprus (loulloudes.n_at_cs.ucy.ac.

ALHAD G. APTE, BARC 2nd GARUDA PARTNERS MEET ON 15th & 16th SEPT. 2006

Globus Toolkit Firewall Requirements. Abstract

ARC-XWCH bridge: Running ARC jobs on the XtremWeb-CH volunteer

The glite middleware. Ariel Garcia KIT

On the employment of LCG GRID middleware

Troubleshooting Grid authentication from the client side

Grid Computing with NorduGrid-ARC

Cloud Computing. Up until now

WMS overview and Proposal for Job Status

Using the MyProxy Online Credential Repository

Monitoring the Usage of the ZEUS Analysis Grid

Grid Computing Fall 2005 Lecture 5: Grid Architecture and Globus. Gabrielle Allen

The University of Oxford campus grid, expansion and integrating new partners. Dr. David Wallom Technical Manager

Outline. ASP 2012 Grid School

Gridbus Portlets -- USER GUIDE -- GRIDBUS PORTLETS 1 1. GETTING STARTED 2 2. AUTHENTICATION 3 3. WORKING WITH PROJECTS 4

Overview of HEP software & LCG from the openlab perspective

FileCruiser. Administrator Portal Guide

First evaluation of the Globus GRAM Service. Massimo Sgaravatto INFN Padova

Grid Compute Resources and Job Management

Grid Infrastructure For Collaborative High Performance Scientific Computing

Globus GTK and Grid Services

Understanding StoRM: from introduction to internals

Introduction to SRM. Riccardo Zappi 1

Course 6231A: Maintaining a Microsoft SQL Server 2008 Database

Introduction to Programming and Computing for Scientists

The NorduGrid/ARC Information System

Grid Authentication and Authorisation Issues. Ákos Frohner at CERN

Course 6231A: Maintaining a Microsoft SQL Server 2008 Database

Globus Toolkit 4 Execution Management. Alexandra Jimborean International School of Informatics Hagenberg, 2009

Monitoring tools in EGEE

Agent Teamwork Research Assistant. Progress Report. Prepared by Solomon Lane

Using MATLAB on the TeraGrid. Nate Woody, CAC John Kotwicki, MathWorks Susan Mehringer, CAC

Integrating SGE and Globus in a Heterogeneous HPC Environment

Using Resources of Multiple Grids with the Grid Service Provider. Micha?Kosiedowski

Information and monitoring

Grid Computing Middleware. Definitions & functions Middleware components Globus glite

SPGrid Efforts in Italy

Grid Examples. Steve Gallo Center for Computational Research University at Buffalo

Managing GSS Devices from the GUI

Grid Compute Resources and Grid Job Management

Data Movement and Storage. 04/07/09 1

ATLAS NorduGrid related activities

NorduGrid Tutorial Exercises

Grid Programming: Concepts and Challenges. Michael Rokitka CSE510B 10/2007

The GridWay. approach for job Submission and Management on Grids. Outline. Motivation. The GridWay Framework. Resource Selection

Resource Allocation in computational Grids

The NorduGrid production Grid infrastructure, status and plans

Maintaining a Microsoft SQL Server 2008 Database (Course 6231A)

VMs at a Tier-1 site. EGEE 09, Sander Klous, Nikhef

A Federated Grid Environment with Replication Services

By Ian Foster. Zhifeng Yun

Grid Computing. Olivier Dadoun LAL, Orsay. Introduction & Parachute method. Socle 2006 Clermont-Ferrand Orsay)

Introduction to Grid Computing

High Performance Computing Course Notes Grid Computing I

McAfee VirusScan and McAfee epolicy Orchestrator Administration Course

Introduction to Programming and Computing for Scientists

GRID COMPANION GUIDE

Grid Data Management

Atlas Data-Challenge 1 on NorduGrid

Note: Who is Dr. Who? You may notice that YARN says you are logged in as dr.who. This is what is displayed when user

The Problem of Grid Scheduling

Developing Microsoft Azure Solutions (70-532) Syllabus

Client tools know everything

GRID Stream Database Managent for Scientific Applications

igrid: a Relational Information Service A novel resource & service discovery approach

The Integration of Grid Technology with OGC Web Services (OWS) in NWGISS for NASA EOS Data

Future Developments in the EU DataGrid

Developing Microsoft Azure Solutions (70-532) Syllabus

Grid Computing Fall 2005 Lecture 10 and 12: Globus V2. Gabrielle Allen

A Simple Mass Storage System for the SRB Data Grid

GT 4.2.0: Community Scheduler Framework (CSF) System Administrator's Guide

History of SURAgrid Deployment

Molecular dynamics simulations in the MolDynGrid Virtual Laboratory by means of ARC between Grid and Cloud

Introduction to Programming and Computing for Scientists

Chapter 4:- Introduction to Grid and its Evolution. Prepared By:- NITIN PANDYA Assistant Professor SVBIT.

An Evaluation of Alternative Designs for a Grid Information Service

Grid Middleware and Globus Toolkit Architecture

2779 : Implementing a Microsoft SQL Server 2005 Database

Application Guide. Connection Broker. Advanced Connection and Capacity Management For Hybrid Clouds

Transcription:

Nordic Testbed for Wide Area Computing and Data Handling NORDUGRID-TECH-1 19/02/2002 Architecture Proposal M.Ellert, A.Konstantinov, B.Kónya, O.Smirnova, A.Wäänänen

Introduction The document describes the minimal architecture, necessary to enable a basic Grid functionality, and suggests ways to implement it. Globus Toolkit is used as the basis; all modifications should preserve backward compatibility as much as possible Terms BR Broker CA Authentication service (Certificate Authority) CN Computing Node (one machine in a cluster) IS Information service MDS Globus MDS (GIIS and GRIS) RC Replica Catalog RM Replication Manager SE Storage Element VO Authorization service UI User interface WN Worker Node = CN SD Session Directory GM Grid Manager GWD Grid Working Directory Architecture In what follows, a description of components is given and their functionality outlined Cluster At present, a PBS cluster WN's are not required to access external network Application software is installed on the front-end and exported to the nodes; application software installation on WN's is not required Recommended shared Grid Working Directory (e.g. /scratch, or /job) Services (e.g. data movement, IS) are run only from the front-end Storage Element A separate (stand-alone) machine for flat file storage and/or database server Can be "local" to a cluster (i.e. exported disks) Runs GridFTP server Produces entries to the MDS (if a separate machine, runs its own GRIS) Has Grid certificate-based authorization and quotas Mirroring and replication is done by services running on the SEs Front-end Authorization and authentication is done by the gatekeeper Provides pre- and post-job file/data transfers via GridFTP (runs the server) and/or 1

symbolic linking in case of local SE Provides job submission services, using standard Globus interface The Grid Manager (GM) is introduced, which: - creates the Session Directory (SD) - creates the job ID (to appear in the MDS) - pre-stages input files from SE's - initializes PBS job submission - moves requested output files to SE's and registers them in RC - manages job cancellation - maintains the set of job status files - sends an (optional) e-mail notification to a job owner, both at the start and end of a job (including staging in/out) Has a Grid Working Directory (GWD) for stage in/out, shared with the rest of the cluster Maintains SD's: - a new SD is created in GWD per job - only the actual owner of the job can access the SD contents (using the job handle) - the SD should be locked for deletion during the course of the job; files should be removed/overwritten only by the GM - SD is erased after users' retrieval of its contents; otherwise it is erased after its the pre-defined lifetime expires Information System Based only on Globus MDS Hierarchic Providers run on clusters and on SEs Contains cluster information, both static and dynamic: - total/available disk space - cluster and jobs status - authorization information Contains SE information User interface Integrated with the broker (aka decision engine) Does not run any server (i.e., client-only) 2

Figure 1. NorduGrid task flow (see explanations in Table 1) 3

1 User Interface queries GIIS and the Replica Catalog, and selects the site; session directory name and the job contact string are created by the Broker 2 Job request: xrsl file is submitted to the Gatekeeper, along with the session directory name and the [resolved by the Broker] physical file names 3 Gatekeeper parses the job request to the Grid Manager 4 Grid Manager: - creates the Session Directory - downloads/links files from Storage Elements to the Session Directory 5 User Interface uploads input files and executables via GridFTP 6 After all files are pre-staged, the Grid Manager submits the job to PBS 7 PBS schedules the job on the Working Nodes 8 Information Providers collect job and queue information, disk usage information and other static parameters (pull); write it to MDS 9 User Interface monitors the job status by querying the MDS 10 User Interface may cancel jobs (Session Directory should be removed) 11 Grid Manager moves requested output results to Storage Elements (initially, Internal only) and registers in the Replica Catalog 12 GIIS queries GRIS Table 1: NorduGrid task flow (numbering corresponds to Figure 1) 1. Job submission Makes use of extended RSL syntax for the job options file Performs matching (using the MDS) of job options to a resource, e.g.: - required CPU time - required system - required disk space - required runtime environment - required memory - required data from SE's Creates SD name (to be parsed along with the job options) Resolves logical file names for input files Triggers the GM: parses the job options along with resolved file URL's and the SD name to the matching cluster Transfers (GridFTP push) input files and binaries to the matching cluster's SD: either uploads local files, or initializes 3d-party transfer of remote files (or both) Job monitoring Query job status (stored in the MDS) on the cluster via the contact string (contains job handle) Capture snapshot of any user-specified file in the SD 4

Job output User initializes download of output files from the stage area Other features Proxy initialization List of matching resources (w/o actual job submission, dryrun) Job cancellation Possibility to submit jobs to explicitly specified resources Data movement All the data are moved before the job submission to the clusters stage area - pushed from the user interface machine - downloaded from an external SE - linked from a local SE - job can not request new data sets during execution - in the future, a Grid file system can remove such a restriction Output is downloaded by the user interface from the SD of the GWD of a cluster, or is written to a SE Proposed architecture and the task flow is shown schematically in Figure 1 and Table 1. 5