Integrating SGE and Globus in a Heterogeneous HPC Environment

Similar documents
Update on EZ-Grid. Priya Raghunath University of Houston. PI : Dr Barbara Chapman

The University of Oxford campus grid, expansion and integrating new partners. Dr. David Wallom Technical Manager

Grid Architectural Models

Installation and Administration

Introduction to GT3. Overview. Installation Pre-requisites GT3.2. Overview of Installing GT3

Layered Architecture

Architecture Proposal

The Team. Using Sun Grid Engine and Globus to Schedule Jobs Across a Combination of Local and Remote Machines

ICENI: An Open Grid Service Architecture Implemented with Jini Nathalie Furmento, William Lee, Anthony Mayer, Steven Newhouse, and John Darlington

GT 4.2.0: Community Scheduler Framework (CSF) System Administrator's Guide

Grid Computing Fall 2005 Lecture 5: Grid Architecture and Globus. Gabrielle Allen

GT-OGSA Grid Service Infrastructure

University College London. Department of Computer Science

S.No QUESTIONS COMPETENCE LEVEL UNIT -1 PART A 1. Illustrate the evolutionary trend towards parallel distributed and cloud computing.

Grid Services and the Globus Toolkit

Workflow, Planning and Performance Information, information, information Dr Andrew Stephen M c Gough

CSF4:A WSRF Compliant Meta-Scheduler

NUSGRID a computational grid at NUS

Grid Compute Resources and Job Management

rid Computing: An Indistry View

Running Climate Models On The NERC Cluster Grid Using G-Rex

and how to use TORQUE & Maui Piero Calucci

First evaluation of the Globus GRAM Service. Massimo Sgaravatto INFN Padova

Globus GTK and Grid Services

Introduction to parallel computing

Java Development and Grid Computing with the Globus Toolkit Version 3

Batch Systems. Running calculations on HPC resources

Component-based Grid Programming Using the HOC-Service Architecture

Grid Scheduling Architectures with Globus

Clusters & Grid Computing

Agent Teamwork Research Assistant. Progress Report. Prepared by Solomon Lane

CS6703 GRID AND CLOUD COMPUTING. Question Bank Unit-I. Introduction

WS-Resource Framework: Globus Alliance Perspectives

Dynamic Virtual Clusters in a Grid Site Manager

Tutorial 1: Introduction to Globus Toolkit. John Watt, National e-science Centre

Introduction to GT3. Introduction to GT3. What is a Grid? A Story of Evolution. The Globus Project

How to install Condor-G

Introduction. Software Trends. Topics for Discussion. Grid Technology. GridForce:

MPI History. MPI versions MPI-2 MPICH2

Why You Should Consider Grid Computing

GROWL Scripts and Web Services

UNICORE Globus: Interoperability of Grid Infrastructures

The GridWay. approach for job Submission and Management on Grids. Outline. Motivation. The GridWay Framework. Resource Selection

DiPerF: automated DIstributed PERformance testing Framework

High performance Linux clusters - design/configuration/maintenance

Using Resources of Multiple Grids with the Grid Service Provider. Micha?Kosiedowski

Managing CAE Simulation Workloads in Cluster Environments

Grid Compute Resources and Grid Job Management

Managing MPICH-G2 Jobs with WebCom-G

High Performance Computing (HPC) Using zcluster at GACRC

Day 1 : August (Thursday) An overview of Globus Toolkit 2.4

The GAT Adapter to use GT4 RFT

Online Steering of HEP Grid Applications

A Hands-On Tutorial: RNA Sequencing Using High-Performance Computing

Introduction to HPC Using zcluster at GACRC

Globus Toolkit Firewall Requirements. Abstract

Task Management Service

Discovery Net : A UK e-science Pilot Project for Grid-based Knowledge Discovery Services. Patrick Wendel Imperial College, London

Designing a Java-based Grid Scheduler using Commodity Services

Customized way of Resource Discovery in a Campus Grid

History of SURAgrid Deployment

Grid Computing Middleware. Definitions & functions Middleware components Globus glite

Queue systems. and how to use Torque/Maui. Piero Calucci. Scuola Internazionale Superiore di Studi Avanzati Trieste

Introduction to High Performance Computing (HPC) Resources at GACRC

Visual Design Flows for Faster Debug and Time to Market FlowTracer White Paper

HPC Metrics in OSCAR based on Ganglia

Sensor Grid: Integration of Wireless Sensor Networks and the Grid

Edinburgh (ECDF) Update

Tutorial 4: Condor. John Watt, National e-science Centre

ICENI Dataflow and Workflow: Composition and Scheduling in Space and Time

The SweGrid Accounting System

By Ian Foster. Zhifeng Yun

OPEN SOURCE GRID MIDDLEWARE PACKAGES

Introduction to HPC Using zcluster at GACRC

Providing Dynamic Virtualized Access to Grid Resources via the Web 2.0 Paradigm

The EU DataGrid Fabric Management

Cloud Computing. Up until now

Introduction to HPC Using zcluster at GACRC

QUESTION BANK 10IS845: Clouds, Grids, and Clusters

Gatlet - a Grid Portal Framework

MPI versions. MPI History

MERCED CLUSTER BASICS Multi-Environment Research Computer for Exploration and Discovery A Centerpiece for Computational Science at UC Merced

ARC BATCH SYSTEM BACK-END INTERFACE GUIDE WITH SUPPORT FOR GLUE 2

Scientific Computing with UNICORE

EPCC Sun Data and Compute Grids Project Update

A Web-Services Based Architecture for Dynamic- Service Deployment

Batch Systems. Running your jobs on an HPC machine

CloudBATCH: A Batch Job Queuing System on Clouds with Hadoop and HBase. Chen Zhang Hans De Sterck University of Waterloo

Interconnect EGEE and CNGRID e-infrastructures

Extensible Job Managers for Grid Computing

Introduction to Grid Technology

Grid Examples. Steve Gallo Center for Computational Research University at Buffalo

NorduGrid Tutorial. Client Installation and Job Examples

Grid Computing Security

Infrastructure Underpinnings of the GFDL Workflow

Batch system usage arm euthen F azo he Z J. B T

C IBM. IBM WebSphere Application Server Network Deployment V8.0 Core Administrati

Corral: A Glide-in Based Service for Resource Provisioning

Production Grids. Outline

MPI 1. CSCI 4850/5850 High-Performance Computing Spring 2018

Transcription:

Integrating SGE and Globus in a Heterogeneous HPC Environment David McBride <dwm99@doc.ic.ac.uk> London e-science Centre, Department of Computing, Imperial College

Presentation Outline Overview of Centre resources A brief description of Globus Integration of SGE Deployment issues Further information

LeSC Hardware Saturn E6800 GT2 SGEEE Viking Viking Linux Viking Linux SMP Linux P4 P4 P4 GT2 GT3 x133 LeSC Firewall Imperial Firewall Codon V880 Sunshine V880 SGE Pioneer Pioneer Pioneer Linux Linux Athlon Linux AM AM GT2 x20 Rhea V880 GT2 GT3 RMS Atlas Pioneer Tru64 Pioneer Linux Linux AM AM SMP Alpha x16 SGE Mariner Mariner Linux Linux P4 SMP P4 GT2 GT3 x2

Sun Grid Engine Deployment SGEEE v5.3 deployed on each cluster Uses reserved port-based authentication Single scheduler for all production resources Multiple queues exist with different constraints A subset of nodes have restrictions on maximum runtime to allow a quick response time for short tasks Multiple environments are available which specify how many nodes a job may use. Some are restricted. Myrinet-equipped nodes share a primary queue for parallel jobs and a secondary, restricted queue for sequential tasks

The Globus Toolkit Open-source project managed by the Globus Alliance (formerly Globus Project) Provides a hosting environment for OGSA Grid s Also provides an implementation of core services and client-side tools

Major Globus version differences Globus 2.x implements bespoke standards Job execution framework uses a seperate gatekeeper process Globus 3.x implements OGSI standards Job execution framework uses a grid service to provide gatekeeper functions. Both use a modular Perl job-manager to handle scheduler interaction

Globus 3.x Job Execution Architecture Globus Server Master Hosting Environment (MHE) User jbloggs Master Managed Job Factory (MMJFS) Suid-root wrapper globus Proxy User Hosting Environment (UHE) User Hosting Environment (UHE) User Hosting Environment (UHE) Managed Managed Job Managed Factory Job Managed Factory Job Job Factory Job Job Factory Factory Managed Job Managed Job Managed Job jbloggs hsimspon iceni Job Manager Fork See http://www-unix.globus.org/developer/gram-architecture.html for more details

Globus and SGE integration Globus jobs are specified using RSL, the Resource Specification Language Used to construct the local execution environment and specfies the program to be executed. job-manager executes parsed RSL instructions and monitors running jobs LeSC Perl module provides SGE-specific backend within the Globus framework

Globus 3.x and SGE integration Globus Server Master Hosting Environment (MHE) User jbloggs Master Managed Job Factory (MMJFS) Suid-root wrapper globus Proxy User Hosting Environment (UHE) User Hosting Environment (UHE) User Hosting Environment (UHE) Managed Managed Job Managed Factory Job Managed Factory Job Job Factory Job Job Factory Factory Managed Job Managed Job Managed Job jbloggs hsimspon iceni Job Manager Fork SGE Scheduler Sun Grid Engine qsub qstat

Deployment Issues Globus Toolkit is an evolving codebase OGSI-capable 3.x series is relatively new Not quite production quality UHE's are intended to be short lived but are sometimes not reaped Components initialised on demand sometimes exceed timeouts set for their response

Further information Globus Alliance: http://www.globus.org/ SGE and Globus integration: http://www.lesc.ic.ac.uk/projects/epic-gt-sge.html GPT packages and documentation freely available