Grid Architectural Models

Similar documents
Grid Computing. MCSN - N. Tonellotto - Distributed Enabling Platforms

Layered Architecture

Day 1 : August (Thursday) An overview of Globus Toolkit 2.4

30 Nov Dec Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy

Grid Computing Fall 2005 Lecture 5: Grid Architecture and Globus. Gabrielle Allen

Architecture Proposal

Grid Computing Middleware. Definitions & functions Middleware components Globus glite

The University of Oxford campus grid, expansion and integrating new partners. Dr. David Wallom Technical Manager

Cloud Computing. Up until now

Grid Scheduling Architectures with Globus

Grid services. Enabling Grids for E-sciencE. Dusan Vudragovic Scientific Computing Laboratory Institute of Physics Belgrade, Serbia

glite Grid Services Overview

High Performance Computing Course Notes Grid Computing I

Clusters & Grid Computing

The GridWay. approach for job Submission and Management on Grids. Outline. Motivation. The GridWay Framework. Resource Selection

An Example Grid Middleware - The Globus Toolkit. MCSN N. Tonellotto Complements of Distributed Enabling Platforms

Knowledge Discovery Services and Tools on Grids

EPCC Sun Data and Compute Grids Project Update

Grid Compute Resources and Job Management

Globus Toolkit Firewall Requirements. Abstract

Installation and Administration

Introduction to Grid Computing

Introduction to Programming and Computing for Scientists

First evaluation of the Globus GRAM Service. Massimo Sgaravatto INFN Padova

Grid Programming: Concepts and Challenges. Michael Rokitka CSE510B 10/2007

A Distributed Media Service System Based on Globus Data-Management Technologies1

Grids and Security. Ian Neilson Grid Deployment Group CERN. TF-CSIRT London 27 Jan

Chapter 4:- Introduction to Grid and its Evolution. Prepared By:- NITIN PANDYA Assistant Professor SVBIT.

Grid Computing Systems: A Survey and Taxonomy

Globus GTK and Grid Services

EGEE and Interoperation

CMS HLT production using Grid tools

igrid: a Relational Information Service A novel resource & service discovery approach

MOC 6232A: Implementing a Microsoft SQL Server 2008 Database

Grid Computing Fall 2005 Lecture 10 and 12: Globus V2. Gabrielle Allen

GEMS: A Fault Tolerant Grid Job Management System

On the employment of LCG GRID middleware

GT 4.2.0: Community Scheduler Framework (CSF) System Administrator's Guide

Introduction to Programming and Computing for Scientists

Michigan Grid Research and Infrastructure Development (MGRID)

GT-OGSA Grid Service Infrastructure

The Problem of Grid Scheduling

The Globus Toolkit Lecture of the course of Complements of Enabling Platforms

Grid Middleware and Globus Toolkit Architecture

Integrating SGE and Globus in a Heterogeneous HPC Environment

UNIT IV PROGRAMMING MODEL. Open source grid middleware packages - Globus Toolkit (GT4) Architecture, Configuration - Usage of Globus

Grid-Based Data Mining and the KNOWLEDGE GRID Framework

Dynamic fault tolerant grid workflow in the water threat management project

Classification and Characterization of Core Grid Protocols for Global Grid Computing

Juliusz Pukacki OGF25 - Grid technologies in e-health Catania, 2-6 March 2009

By Ian Foster. Zhifeng Yun

CS6703 GRID AND CLOUD COMPUTING. Question Bank Unit-I. Introduction

DESY. Andreas Gellrich DESY DESY,

Design of Distributed Data Mining Applications on the KNOWLEDGE GRID

The glite middleware. Ariel Garcia KIT

Globus Toolkit Manoj Soni SENG, CDAC. 20 th & 21 th Nov 2008 GGOA Workshop 08 Bangalore

The EU DataGrid Testbed

Introduction to Grid Technology

Grid Computing Security hack.lu 2006 :: Security in Grid Computing :: Lisa Thalheim 1

IMAGE: An approach to building standards-based enterprise Grids

Using the MyProxy Online Credential Repository

Grid Compute Resources and Grid Job Management

A New Grid Manager for NorduGrid

[MS20414]: Implementing an Advanced Server Infrastructure

Grid Authentication and Authorisation Issues. Ákos Frohner at CERN

EUROPEAN MIDDLEWARE INITIATIVE

Based on: Grid Intro and Fundamentals Review Talk by Gabrielle Allen Talk by Laura Bright / Bill Howe

S.No QUESTIONS COMPETENCE LEVEL UNIT -1 PART A 1. Illustrate the evolutionary trend towards parallel distributed and cloud computing.

Functional Requirements for Grid Oriented Optical Networks

Deploying the TeraGrid PKI

Extensible Job Managers for Grid Computing

Introduction to GT3. Introduction to GT3. What is a Grid? A Story of Evolution. The Globus Project

Implementing an Advanced Server Infrastructure

Distributed Data Management with Storage Resource Broker in the UK

Programming Environment Oct 9, Grid Programming (1) Osamu Tatebe University of Tsukuba

GSI-based Security for Web Services

I Tier-3 di CMS-Italia: stato e prospettive. Hassen Riahi Claudio Grandi Workshop CCR GRID 2011

Grid Computing. Lectured by: Dr. Pham Tran Vu Faculty of Computer and Engineering HCMC University of Technology

The NorduGrid Architecture and Middleware for Scientific Applications

Design The way components fit together

UNICORE Globus: Interoperability of Grid Infrastructures

Grid Computing Training Courseware v-1.0

Design The way components fit together

Introduction to SRM. Riccardo Zappi 1

Resource Allocation in computational Grids

CSF4:A WSRF Compliant Meta-Scheduler

Microsoft Implementing an Advanced Server Infrastructure

MCSE Server Infrastructure. This Training Program prepares and enables learners to Pass Microsoft MCSE: Server Infrastructure exams

DHANALAKSHMI COLLEGE OF ENGINEERING, CHENNAI

Introduction to GT3. Overview. Installation Pre-requisites GT3.2. Overview of Installing GT3

Grid Infrastructure For Collaborative High Performance Scientific Computing

Problems for Resource Brokering in Large and Dynamic Grid Environments

Leveraging the InCommon Federation to access the NSF TeraGrid

DATA MANAGEMENT FOR PHYSICS ANALYSIS IN PHENIX (BNL, RHIC)

2 Model. 2.1 Introduction

Administering Cloud Pod Architecture in Horizon 7. Modified on 4 JAN 2018 VMware Horizon 7 7.4

THE WIDE AREA GRID. Architecture

A Simulation Model for Large Scale Distributed Systems

N. Marusov, I. Semenov

The EU DataGrid Fabric Management

Transcription:

Grid Architectural Models Computational Grids - A computational Grid aggregates the processing power from a distributed collection of systems - This type of Grid is primarily composed of low powered computers with minimal application logic awareness and minimal storage capacity - The primary benefits of computational Grids are a reduced Total Cost of Ownership (TCO), and shorter deployment life cycles - Example: SETI@Home DataGrids - Data Grids focus on providing secure access to distributed, heterogeneous pools of data - Through collaboration, data Grids can also include a new concept such as a federated database - Data Grids also harness data, storage, and network resources located in distinct administrative domains, respect local and global policies governing how data can be used, schedule resources efficiently, again subject to local and global constraints, and provide high speed and reliable access to data - Example: CERN s LHC Data Grid 19

Computational Grid topologies 20

Data Grid topologies 21

Grid 5000 Example 1000 nodes 8000 cores 500+ users http://grid5000.fr 22

TeraGrid Example http://xsede.org 23

European Data Grid Example http://eu-datagrid.web.cern.ch/eu-datagrid/ 24

An example Grid middleware Globus Toolkit http://www.globus.org/toolkit/ A software toolkit addressing key technical problems in the development of Grid enabled tools, services, and applications - Offer a modular bag of technologies - Enable incremental development of Grid-enabled tools and applications - Implement standard Grid protocols and APIs (the core of the hourglass) - Is available under liberal open source license Now is evolving to Cloud middleware 25

Key Protocols 26

GT2 Protocols and Services Resource Management Information Services Data Management process process process process process resource process process file jobmanager gatekeeper GASS GRIS GIIS GISS GISS GridFTP server Grid Security GSI client tools User 27

Grid Security Resources being used may be valuable & the problems being solved sensitive Resources are often located in distinct administrative domains - Each resource has own policies & procedures Set of resources used by a single computation may be large, dynamic, and unpredictable - Not just client/server, requires delegation It must be broadly available & applicable Standard, well-tested, well-understood protocols; integrated with wide variety of tools 28

Public Key Infrastructure PKI allows you to know that a given public key belongs to a given user PKI builds upon asymmetric encryption: - Each entity has two keys: public and private - Data encrypted with one key can only be decrypted with the other - The private key is known only to the owner The public key is given to the world encapsulated in a X.509 certificate - Similar to passport or driver s license 29

Certificate Authority Your private key Grid User/Machine CA s public key ➀ send Certificate Authority CA s public key GSI ➁ create Your certificate signing request Your certificate ➂ send ➄ copy Your certificate signing request Your certificate CA s private key ➃ sign 30

Authentication & Authorization Your private key ➄ encrypt & send ➅ Grid Host A Your certificate random number xah3:4daa ➀ send Your certificate ➃ create & send random number xah3:4daa decrypt & identify random number Grid Host B ➁ decrypt CA s public key ➂ extract Your public key ➆ mapping Your subject name grid-mapfile Local user name 31

Grid Information Service Provide access to static and dynamic information regarding system components A basis for configuration and adaptation in heterogeneous, dynamic environments Resource Description Services - Supplies information about a specific resource Aggregate Directory Services - Supplies collection of information which was gathered from multiple resource description services - Customized naming and indexing 32

MDS Protocols and Services Grid Host C Grid Host A request/response resource(s) information Grid Host B register GIIS MDS client GIIS register GRIS local resource information IP IP IP local resource information resource resource resource 33

Grid Resource Management Grid Resource Management System consists of : - Local resource management system (Resource Layer) Basic resource management unit Provide a standard interface for using remote resources Grid Resource Allocation Manager (GRAM) - Global resource management system (Collective Layer) Coordinate all Local resource management system within multiple or distributed Virtual Organizations (VOs) Provide high-level functionalities to efficiently use all of resources Job Submission Resource Discovery and Selection Scheduling Co-allocation Job Monitoring, etc. e.g. Meta-scheduler, Resource Broker, etc. 34

GRAM Protocols and Services GRAM client GSI client Job Request HTTP/1.1 RSL Message Gatekeeper GSI server callbacks/job status (opt) commands/status query Job Manager GASS client stdin/file transfer stdout/file transfer GASS server (opt) RSL parser RSL Message input input file input file file output input file file input file Scripts fork psub qsub Process PBS Job Torque Job Local Resource Managers 35

Grid Scheduling Levels Resource-level scheduler - low-level scheduler, local scheduler, local resource manager - scheduler close to the resource, controlling a supercomputer, cluster, or network of workstations, on the same local area network - Examples: Open PBS, PBS Pro, LSF, SGE Enterprise-level scheduler - Scheduling across multiple local schedulers belonging to the same organization - Examples: PBS Pro peer scheduling, LSF Multicluster Grid-level scheduler - Also known as super-scheduler, broker, community scheduler - Discovers resources that can meet a job s requirements - Schedules across lower level schedulers 36

Grid Scheduler Grid- User/Application Grid- Scheduler Information Services Monitoring Services Security Services Grid Middleware Grid Middleware Grid Middleware Grid Middleware Accounting/Billing Local RMS Local RMS Local RMS Local RMS Other Grid Services time time time time Schedule Schedule Schedule Schedule Compute Resources Data Resources Network Resources Other Resources/ Services A Grid scheduler allows the user to specify the required resources and environment of the job without having to indicate the exact location of the resources 37

Grid Scheduler Activities Phase One-Resource Discovery 1. Authorization Filtering 2. Application Definition 3. Min. Requirement Filtering Phase Three- Job Execution 6. Advance Reservation 7. Job Submission 8. Preparation Tasks Phase Two - System Selection 4. Information Gathering 5. System Selection 9. Monitoring Progress 10 Job Completion 11. Clean-up Tasks 38

Issues Resources may dynamically join and leave the Grid Not all currently unused resources are available to grid jobs Resource owner local policies can restrict maximum number of grid jobs allowed Hard to predict how long jobs will wait in a queue User information not accurate as mentioned before New jobs arrive that may surpass current queue entries due to higher priority Local jobs have typically higher priority than Grid jobs Limited information about the local schedulers is available (privacy) Data Management Network Management 39

Example of Grid Scheduling Process Scheduling Service: 1. receives job description 2. queries Information Service for static resource information 3. prioritizes and pre-selects resources 4. queries for dynamic information about resource availability 5. queries Data and Network Management Services 6. generates schedule for job 7. reserves allocation if possible otherwise selects another allocation 8. delegates job monitoring to Job Supervisor Job Supervisor/Network and Data Management: service, monitor and initiate allocation Example: 40 resources of requested type are found. 12 resources are selected. 8 resources are available. Network and data dependencies are detected. Utility function is evaluated. 6 th tried allocation is confirmed. Data/network provided and job is started 40

Data access and transfer Grid Data Management - GASS: Simple multi-protocol tool to transfer normal files; integrated in GRAM - GridFTP: Reliable and high-performance file transfer protocol for big files in computer networks Replica Management - Replica Catalog: Service to keep updated information on sets of replicated data - Replica Management: Service to create and manage sets of replicated data 41

GridFTP 42