Gergely Sipos MTA SZTAKI

Similar documents
Advanced Job Submission on the Grid

Multiple Broker Support by Grid Portals* Extended Abstract

glite Grid Services Overview

The glite middleware. Ariel Garcia KIT

Workflow applications on EGI with WS-PGRADE. Peter Kacsuk and Zoltan Farkas MTA SZTAKI

SZDG, ecom4com technology, EDGeS-EDGI in large P. Kacsuk MTA SZTAKI

Chapter 2 Introduction to the WS-PGRADE/gUSE Science Gateway Framework

Grid services. Enabling Grids for E-sciencE. Dusan Vudragovic Scientific Computing Laboratory Institute of Physics Belgrade, Serbia

GRID COMPANION GUIDE

g-eclipse A Framework for Accessing Grid Infrastructures Nicholas Loulloudes Trainer, University of Cyprus (loulloudes.n_at_cs.ucy.ac.

International Collaboration to Extend and Advance Grid Education. glite WMS Workload Management System

Problemi di schedulazione distribuita su Grid

30 Nov Dec Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy

Grid Scheduling Architectures with Globus

Architecture of the WMS

How to use computing resources at Grid

LCG-2 and glite Architecture and components

Setup Desktop Grids and Bridges. Tutorial. Robert Lovas, MTA SZTAKI

Parallel Computing in EGI

glite Middleware Usage

Introduction to Grid Infrastructures

E UFORIA G RID I NFRASTRUCTURE S TATUS R EPORT

A Login Shell interface for INFN-GRID

The LHC Computing Grid

Interconnect EGEE and CNGRID e-infrastructures

Integration of the guse/ws-pgrade and InSilicoLab portals with DIRAC

Grid Interoperation and Regional Collaboration

PoS(ISGC 2011 & OGF 31)119

EGEE and Interoperation

Grid Computing. Olivier Dadoun LAL, Orsay. Introduction & Parachute method. Socle 2006 Clermont-Ferrand Orsay)

How to build Scientific Gateways with Vine Toolkit and Liferay/GridSphere framework

On the employment of LCG GRID middleware

Grid Computing Middleware. Definitions & functions Middleware components Globus glite

Grid Infrastructure For Collaborative High Performance Scientific Computing

EUROPEAN MIDDLEWARE INITIATIVE

Pegasus Workflow Management System. Gideon Juve. USC Informa3on Sciences Ins3tute

A Practical Approach for a Workflow Management System

Bookkeeping and submission tools prototype. L. Tomassetti on behalf of distributed computing group

Grid Programming: Concepts and Challenges. Michael Rokitka CSE510B 10/2007

glite/egee in Practice

Juliusz Pukacki OGF25 - Grid technologies in e-health Catania, 2-6 March 2009

Grid Compute Resources and Job Management

The glite middleware. Presented by John White EGEE-II JRA1 Dep. Manager On behalf of JRA1 Enabling Grids for E-sciencE

FREE SCIENTIFIC COMPUTING

WMS overview and Proposal for Job Status

Gatlet - a Grid Portal Framework

Dr. Giuliano Taffoni INAF - OATS

BOINC extensions in the SZTAKI DesktopGrid system

Access the power of Grid with Eclipse

Overview of HEP software & LCG from the openlab perspective

Grid Computing Fall 2005 Lecture 5: Grid Architecture and Globus. Gabrielle Allen

DIRAC Documentation. Release integration. DIRAC Project. 09:29 20/05/2016 UTC

Workflow Support for Complex Grid Applications: Integrated and Portal Solutions 1

CMS Tier-2 Program for user Analysis Computing on the Open Science Grid Frank Würthwein UCSD Goals & Status

UNIT IV PROGRAMMING MODEL. Open source grid middleware packages - Globus Toolkit (GT4) Architecture, Configuration - Usage of Globus

Easy Access to Grid Infrastructures

Programming the Grid with glite

J O B D E S C R I P T I O N L A N G U A G E A T T R I B U T E S S P E C I F I C A T I O N

Deploying virtualisation in a production grid

Monitoring System for the GRID Monte Carlo Mass Production in the H1 Experiment at DESY

The University of Oxford campus grid, expansion and integrating new partners. Dr. David Wallom Technical Manager

XRAY Grid TO BE OR NOT TO BE?

DataGrid. Document identifier: Date: 24/11/2003. Work package: Partner: Document status. Deliverable identifier:

Grid Computing. Olivier Dadoun LAL, Orsay Introduction & Parachute method. APC-Grid February 2007

ARC integration for CMS

THE WIDE AREA GRID. Architecture

Grid Compute Resources and Grid Job Management

Garuda : The National Grid Computing Initiative Of India. Natraj A.C, CDAC Knowledge Park, Bangalore.

AGATA Analysis on the GRID

Eclipse Technology Project: g-eclipse

R-GMA (Relational Grid Monitoring Architecture) for monitoring applications

Cloud Computing. Up until now

glite Advanced Job Management

Authors: Josh Nedelka & Matthew Reiter

Gridbus Portlets -- USER GUIDE -- GRIDBUS PORTLETS 1 1. GETTING STARTED 2 2. AUTHENTICATION 3 3. WORKING WITH PROJECTS 4

CHAPTER 2 LITERATURE REVIEW AND BACKGROUND

Job submission and management through web services: the experience with the CREAM service

CREAM-WMS Integration

Introduction of NAREGI-PSE implementation of ACS and Replication feature

DESY. Andreas Gellrich DESY DESY,

User Tools and Languages for Graph-based Grid Workflows

Improved 3G Bridge scalability to support desktop grid executions

DataGrid D EFINITION OF ARCHITECTURE, TECHNICAL PLAN AND EVALUATION CRITERIA FOR SCHEDULING, RESOURCE MANAGEMENT, SECURITY AND JOB DESCRIPTION

QosCosGrid Middleware

NAREGI PSE with ACS. S.Kawata 1, H.Usami 2, M.Yamada 3, Y.Miyahara 3, Y.Hayase 4, S.Hwang 2, K.Miura 2. Utsunomiya University 2

Advanced School in High Performance and GRID Computing November Introduction to Grid computing.

Future of Grid parallel exploitation

Grid Architectural Models

The Problem of Grid Scheduling

Globus GTK and Grid Services

Programming the Grid with glite *

WMS Application Program Interface: How to integrate them in your code

MPI SUPPORT ON THE GRID. Kiril Dichev, Sven Stork, Rainer Keller. Enol Fernández

PARALLEL PROGRAM EXECUTION SUPPORT IN THE JGRID SYSTEM

Implementing GRID interoperability

Monitoring the Usage of the ZEUS Analysis Grid

UCLA Grid Portal (UGP) A Globus Incubator Project

Architecture Proposal

I Tier-3 di CMS-Italia: stato e prospettive. Hassen Riahi Claudio Grandi Workshop CCR GRID 2011

ISTITUTO NAZIONALE DI FISICA NUCLEARE Sezione di Padova

Transcription:

Application development on EGEE with P-GRADE Portal Gergely Sipos MTA SZTAKI sipos@sztaki.hu EGEE Training and Induction EGEE Application Porting Support www.lpds.sztaki.hu/gasuc www.portal.p-grade.hu 1

User Interface Write JDL, Submit workload Running jobs in EGEE from command line Retrieve status & (small) output files Workload Management System EGEE VO Workload: Single job One executable & one or more files Sequential or MPI Workflow DAG of jobs Parameter study Independent, similar jobs 2

Defining a single job in JDL $ vi OR nano OR my.jdl JobType = "Normal"; Executable = gridtest ; StdError = stderr.log ; StdOutput = stdout.log ; InputSandbox = { /home/sipos/gridtest }; OutputSandbox = { result.txt, stderr.log, stdout.log }; $ glite-wms-job-submit my.jdl JobID $ glite-wms-job-status JobID $ glite-wms-job-output JobID 3

[ Type = dag"; InputSandbox = { sharedfile1 ;...; sharedfilen }; nodes = [ job1 = [ descpription = [ JobType = "Normal";...; ], ];...... ]; dependencies = { }; }; ] Defining a workflow in JDL JDL of 1 st job Job 1 Job 2 Job 3 Job 4 {job1,{job2,job3}}, {job2,job4}, {job3,job4} Graph structure 4

Defining a parameter study in JDL [ Type = Parametric";... ParameterStart = 0; ParameterStep = 1; Parameters= 6; _PARAM_ run from 0 to 5 ] Arguments = figure_param_.jpg"; StdOutput = transformed_param_.jpg"; OutputSandbox = {" transformed_param_.jpg ", };... Figure0.jpg Figure1.jpg Figure5.jpg Job 0 Job 1... Job 5 Transformed0.jpg Transformed1.jpg Transformed5.jpg 5

User Interface Managing data in EGEE from command line Run file management commands Upload, transfer, register files EGEE VO Download files Delete files File catalog & Storage Elements $ lfc-mkdir mycatalogentry $ lfc-rmdir mycatalogentry $ lcg-cr vo voname d SEName l mycatalogentry file... 6

Short History of P-GRADE portal Parallel Grid Application Development Environment Initial development started in the Hungarian SuperComputing Grid project in 2003 Motivation: Simplify grid usage for application developers and end users It has been continuously developed since 2003 Around 30 manyear development + training + user support Detailed information: http://portal.p-grade.hu/ Open Source community since January 2008: https://sourceforge.net/projects/pgportal/ Current version: 2.8 7

Current P-GRADE Portal related projects GGF GIN (Since 2006) Providing the GIN Resource Testing portal EU EGEE-II, EGEE-III (2006-2010) Tool recommended for application development Intensively used in new users training EU SEE-GRID-SCI (2008-2010) Interfacing to DSpace-based workflow storage Infrastructure testing workflows EU CancerGrid (2007-2009) Development of new generation P-GRADE (guse and WS-PGRADE) Integration with desktop grids EU EDGeS (2008-2009) Transparent access to Desktop Grid systems 8

Portal installations P-GRADE Portal services: Several VOs of EGEE: Biomed, Astronomy, Central European, NA4,... GILDA: Training VO of EGEE Southern Eastern European Grid Several national Grids (UK National Grid Service, HunGrid, Turkish Grid, etc.) US Open Science Grid, TeraGrid OGF Grid Interoperability Now (GIN) VO Under development: KnowledgeGrid of Malaysia MIMOS & SZTAKI Portal services and account request: http://portal.p-grade.hu/index.php?m=3&s=0 Account request form on portal login page 9

Design principles of P-GRADE portal P-GRADE Portal is not only a user interface, it is a General purpose Application Development and Execution Environment P-GRADE Portal includes a high-level middleware layer for orchestrating jobs on grid resources inside a grid among several different grids and several VOs P-GRADE Portal is grid-neutral: It is not tailored to any particular middleware grid type Can be connected to various grids based on different grid middleware LCG-2, glite, GT2, GT4, ARC, Unicore, etc. Implementation for GT2, GT4, glite, ARC, LSF GUI is the same no matter which type of grid is connected to it 10

What is a P-GRADE Portal workflow? A directed acyclic graph where Nodes represent jobs (batch programs to be executed on a computing element) Ports represent input/output files the jobs expect/produce Arcs represent file transfer operations semantics of the workflow: A job can be executed if all of its input files are available 11

Three levels of parallelism Multiple instances of the same workflow process different data files Job level: Parallel execution inside a workflow node (MPI job as workflow component) Workflow level: Parallel execution among workflow nodes (WF branch parallelism) PS workflow level: Parameter study execution of the workflow Multiple jobs run parallel Each job can be a parallel program 12

Example: Computational Chemistry ~100 independent jobs to run Department of Chemistry, University of Perugia SOLUTION OF SCHRODINGER EQUATION FOR TRIATOMIC SYSTEMS USING TIME- DEPENDENT (RWAVEPR) OR TIME INDEPENDENT (ABC) METHOD A single execution can be between 5 hours and 10 hours Many simulations at the same time SEQUENTIAL FORTRAN 90 13

Typical user scenario Job compilation phase Client UPLOAD JOB SOURCE(S) Portal server COMPILE EDIT Grid services DOWNLOAD BINARI(ES) 14

Typical user scenario Workflow development phase Client SAVE WORKFLOW Portal server Grid services OPEN & EDIT WORKFLOW ADD BINARIES START EDITOR DSpace WF repository IMPORT WORKFLOW 15

MyProxy Certificate servers Typical user scenarios Workflow execution phase DOWNLOAD PROXY CERTIFICATES TRANSFER FILES, SUBMIT JOBS Client VISUALIZE JOBS and WORKFLOW PROGRESS Portal server MONITOR JOBS Grid services DOWNLOAD (SMALL) RESULTS DOWNLOAD (SMALL) RESULTS 16

Accessing local and remote files Use legacy executables with Grid files without touching the code Grid services LOCAL INPUT FILES & EXECUTABLES LOCAL OUTPUT FILES Portal server Only the permanent files! LOCAL INPUT FILES & EXECUTABLES LOCAL OUTPUT FILES REMOTE INPUT FILES Storage elements and File catalogs Computing elements REMOTE OUTPUT FILES 17

P-GRADE Portal structural overview Source: Jia Yu and Rajkumar Buyya: A Taxonomy of Workflow Management Systems for Grid Computing, Journal of Grid Computing, Volume 3, Numbers 3-4 / September, 2005 Web browser Java Webstart workflow editor DSpace workflow repository Workflow language: Extended DAGMan Globus and glite information systems (Globus GIIS, glite BDII) Workflow scheduler: Extended DAGMan Grid interface: command line clients + scripts EGEE, Globus (and ARC) Grid services (glite WMS, LFC, ; Globus GRAM, ) 18

Web interface - Portlets 19

Email notifications NOTIFY 20

Workflow portlet WORKFLOW EDITOR 21

Settings Portal administrator can connect the portal to several grids register default resources of the connected grids 22

Settings User can customize the connected grids by adding and removing resources 23

Graphical workflow editing To define a graph: 1. Drag & drop components: jobs and ports 2. Define their properties 3. Connect ports by channels (no cycles, no loops) System generates JDL for each job automatically 24

Workflow Editor Properties of a job Properties of a job: Executable file Type of executable (Sequential / Parallel) Command line parameters Which resource to use? Which VO? Broker or Computing element? 25

Workflow Editor Defining input-output files File properties Type: input: the executable reads output: the executable generates File type: local: comes from my desktop remote: comes from an SE File: location of the file Internal file name: Executable uses this e.g. fopen( file.in, ) File storage type (output files only): Permanent: final result Volatile: temp. data channel 26

Scaling up a workflow to a parameter study Complete workflow Grid files (files in the same EGEE catalog entry) Results stored in grid files 27

Advanced parameter studies Initial input data Generator component(s) Generate or cut input into smaller pieces Complete workflow Collector component(s) Aggregate result 28

Concept of parameter study workflows GEN Generator part generates the input parameter space SEQ SEQ SEQ SEQ Parameter study part Collector part evaluates and integrates the results COLL 29

Example: Image processing /grid/gilda/sipos/inputimages Image.0 Image.1 Image.2 /grid/gilda/sipos/xcoordinates XCoordinate.0 XCoordinate.1 /grid/gilda/sipos/ycoordinates YCoordinate.0 YCoordinate.1 3 x 2 x 2 = 12 execution of the whole workflow CROSS PRODUCT of data items /grid/gilda/sipos/output ImagePart.0 ImagePart.1... 30

How to refer to an I/O file? Input file Local file Output file Client side location: c:\experiments\11-04.dat LFC logical file name (LFC file catalog is required EGEE VOs) lfn:/grid/gilda/sipos/11-04.dat GridFTP address (in Globus Grids): gsiftp://somengshost.ac.uk/mydir/11-04.dat Client side location: result.dat LFC logical file name (LFC file catalog is required EGEE VOs) lfn:/grid/gilda/sipos/11-04_-_result.dat GridFTP address (in Globus Grids): gsiftp://somengshost.ac.uk/mydir/result.dat Remote file 31

Upload a workflow from client side or from FTP server UPLOAD STORED on FTP server 32

Importing an application INCOMPLETE WORKFLOW Open it in editor and save it again 33

Import a workflow from DSpace repository 34

External access to DSpace http://pgrade-dspace.sztaki.hudspace.sztaki.hu 35

User Interface (UI) or Portal Write JDL, Submit job (executable + small inputs) Proxy creation: The first step of grid usage Workload Management System (WMS) query Information System (BDII) create proxy Retrieve status & (small) output files Job status Logging Retrieve output Submit job Site X of YOUR VO Computing Element (CE) publish state Storage Element (SE) VO Management Service (VOMS) Logging and bookkeeping (LB) Job status process Read/write files 36

Certificate and proxy management Portlet 37

OGF GIN interoperability portal by P-GRADE Acccessing Globus, glite and ARC based grids/vos simultaneously Proxy 1 Proxy 6 P-GRADE P-GRADE GEMLCA portal Portal Proxy 2 Proxy 5 GEMLCA Repository Proxy 3 Proxy 4 38

Application execution 39

Detailed view of a PS workflow Generator job(s) Overall statistics of workflow instances Workflow instances Collector job(s) 40

Fault-tolerant tolerant execution Utilizing Condor DAGMan s rescue mechanism Resubmit only failed jobs, not completed jobs! If the EGEE broker leaves a job stuck in a CEs queue, the portal automatically kills the job on this site and resubmits the job to the broker by prohibiting this site. As a result the portal guarantees the correct submission of a job as long as there exists at least one matching resource job submission is reliable even in an unreliable grid 41

Information system visualization 42

LFC-SE file browser portlet 43

Compilation support 44

Learn once, use everywhere Develop once, execute anywhere Thank you! www.portal.p-grade.hu pgportal@lpds.sztaki.hu 45