Cloud Computing. Up until now

Similar documents
Grid Middleware and Globus Toolkit Architecture

Grid Architectural Models

By Ian Foster. Zhifeng Yun

Globus GTK and Grid Services

Grid Compute Resources and Job Management

Grid Computing. MCSN - N. Tonellotto - Distributed Enabling Platforms

Cloud Computing. Summary

Grid Scheduling Architectures with Globus

Grid Programming: Concepts and Challenges. Michael Rokitka CSE510B 10/2007

Grid Computing Fall 2005 Lecture 5: Grid Architecture and Globus. Gabrielle Allen

The Problem of Grid Scheduling

Grid services. Enabling Grids for E-sciencE. Dusan Vudragovic Scientific Computing Laboratory Institute of Physics Belgrade, Serbia

Layered Architecture

Grid Computing Middleware. Definitions & functions Middleware components Globus glite

Introduction to Grid Computing

30 Nov Dec Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy

Chapter 4:- Introduction to Grid and its Evolution. Prepared By:- NITIN PANDYA Assistant Professor SVBIT.

Gridbus Portlets -- USER GUIDE -- GRIDBUS PORTLETS 1 1. GETTING STARTED 2 2. AUTHENTICATION 3 3. WORKING WITH PROJECTS 4

The University of Oxford campus grid, expansion and integrating new partners. Dr. David Wallom Technical Manager

UNIT IV PROGRAMMING MODEL. Open source grid middleware packages - Globus Toolkit (GT4) Architecture, Configuration - Usage of Globus

Day 1 : August (Thursday) An overview of Globus Toolkit 2.4

The EU DataGrid Fabric Management

Globus Toolkit Manoj Soni SENG, CDAC. 20 th & 21 th Nov 2008 GGOA Workshop 08 Bangalore

CSF4:A WSRF Compliant Meta-Scheduler

Pegasus Workflow Management System. Gideon Juve. USC Informa3on Sciences Ins3tute

From Web Services Toward Grid Services

The GridWay. approach for job Submission and Management on Grids. Outline. Motivation. The GridWay Framework. Resource Selection

Database Assessment for PDMS

Grid Compute Resources and Grid Job Management

Data Management 1. Grid data management. Different sources of data. Sensors Analytic equipment Measurement tools and devices

Juliusz Pukacki OGF25 - Grid technologies in e-health Catania, 2-6 March 2009

GT 4.2.0: Community Scheduler Framework (CSF) System Administrator's Guide

NUSGRID a computational grid at NUS

Grid Infrastructure For Collaborative High Performance Scientific Computing

Design The way components fit together

Introduction to GT3. Introduction to GT3. What is a Grid? A Story of Evolution. The Globus Project

Design The way components fit together

The GAT Adapter to use GT4 RFT

XSEDE Software and Services Table For Service Providers and Campus Bridging

Architecture Proposal

GEMS: A Fault Tolerant Grid Job Management System

WSRF Services for Composing Distributed Data Mining Applications on Grids: Functionality and Performance

Globus Toolkit 4 Execution Management. Alexandra Jimborean International School of Informatics Hagenberg, 2009

Independent Software Vendors (ISV) Remote Computing Usage Primer

Grid Data Management

glite Grid Services Overview

Grid Computing Security

HEP replica management

Grid-enabled Probabilistic Model Checking with PRISM

Work Queue + Python. A Framework For Scalable Scientific Ensemble Applications

Chapter 3. Design of Grid Scheduler. 3.1 Introduction

Grid Computing Training Courseware v-1.0

GT-OGSA Grid Service Infrastructure

Globus Online: File Transfer Made Easy!

Using MATLAB on the TeraGrid. Nate Woody, CAC John Kotwicki, MathWorks Susan Mehringer, CAC

GridNEWS: A distributed Grid platform for efficient storage, annotating, indexing and searching of large audiovisual news content

WMS overview and Proposal for Job Status

OPEN SOURCE GRID MIDDLEWARE PACKAGES

BOSCO Architecture. Derek Weitzel University of Nebraska Lincoln

Using the MyProxy Online Credential Repository

Gatlet - a Grid Portal Framework

Sphinx: A Scheduling Middleware for Data Intensive Applications on a Grid

S.No QUESTIONS COMPETENCE LEVEL UNIT -1 PART A 1. Illustrate the evolutionary trend towards parallel distributed and cloud computing.

M. Roehrig, Sandia National Laboratories. Philipp Wieder, Research Centre Jülich Nov 2002

Task Management Service

Knowledge Discovery Services and Tools on Grids

Customized way of Resource Discovery in a Campus Grid

An Example Grid Middleware - The Globus Toolkit. MCSN N. Tonellotto Complements of Distributed Enabling Platforms

Installation and Administration

Advanced School in High Performance and GRID Computing November Introduction to Grid computing.

Andrea Sciabà CERN, Switzerland

Research and Design Application Platform of Service Grid Based on WSRF

GROWL Scripts and Web Services

Introduction to Grid Computing

A RESOURCE MANAGEMENT FRAMEWORK FOR INTERACTIVE GRIDS

XSEDE Software and Services Table For Service Providers and Campus Bridging

Inca as Monitoring. Kavin Kumar Palanisamy Indiana University Bloomington

Globus Toolkit Firewall Requirements. Abstract

Federated Services for Scientists Thursday, December 9, p.m. EST

The glite middleware. Ariel Garcia KIT

Accelerating the Scientific Exploration Process with Kepler Scientific Workflow System

Grid Authentication and Authorisation Issues. Ákos Frohner at CERN

Index Introduction Setting up an account Searching and accessing Download Advanced features

A Globus Toolkit Primer

CMS HLT production using Grid tools

Troubleshooting Grid authentication from the client side

GridWay interoperability through BES

Grid Computing. Lectured by: Dr. Pham Tran Vu Faculty of Computer and Engineering HCMC University of Technology

EGEE and Interoperation

UCLA Grid Portal (UGP) A Globus Incubator Project

DHANALAKSHMI COLLEGE OF ENGINEERING, CHENNAI

Building Campus HTC Sharing Infrastructures. Derek Weitzel University of Nebraska Lincoln (Open Science Grid Hat)

Architectural Tradeoffs for Unifying Campus Grid Resources

Climate Data Management using Globus

Grid Computing Security: A Survey

Designing a Resource Broker for Heterogeneous Grids

DSpace Fedora. Eprints Greenstone. Handle System

Opal: Wrapping Scientific Applications as Web Services

enanos Grid Resource Broker

[workshop welcome graphics]

Transcription:

Cloud Computing Lecture 4 and 5 Grid: 2012-2013 Introduction. Up until now Definition of Cloud Computing. Grid Computing: Schedulers: Condor SGE 1

Summary Core Grid: Toolkit Condor-G Grid: Conceptual Architecture Tools and Applications Aplications Descovery, negotiation, diagnostics& monitoring Secure access to resources Colective Services Communication and Resource Management Protocols Resources: CPU, storage, networking Fabric 2

What is Toolkit? Toolkit is a set of tools that solve common issues in distributed application development specially in the context of grids: Heterogeneity. Complexity and security issues. Lack of standardization and inter-operability. It began in 1997 and is developed by the Alliance. Tookit: Core Grid Infra-Structure Uses existing resources: Clusters with scheduling, distributed file systems, networks, security systems Layers it s own services on top of them. Provides XML configuration. In order to create a generic service-centred interaction model based on: Command line utilities. Web Services. 3

Examples of the benefits of using Toolkit in a Grid Replace login in different systems by a single sign-on. Group system information into a single repository. Submit remote tasks at: Remote machines. CPU sharing portals. Workflow engines. Provide high bandwidth data transfers. Provide managed data transfers. Core features: Toolkit v.4 (GT4): Scope Infra-structure for building new services. Security: Applying a uniform policy among different systems. Execution management: Manage the lifecycle of the application and its jobs/processes. Data management: Locate, transfer and access data. Monitoring: Monitoring dynamic grid systems. 4

GT4: Base Toolkit Java GSI- OpenSSH MyProxy Data Rep C Delegation GridFTP Replica Location Python CAS GridWay Reliable File MDS4 Base Segurança Security Execution Execução Dados Data Monitoring Monit. Web Services with: WS-Resource Framework: records service call state and provides information. Security using WS-Security Tools for compiling and starting Web Services in C, Java e Python. GT4: Base 5

WS-Resource Framework EPR EPR EPR Service Resource RPs GetRP GetMultRPs SetRP QueryRPs Subscribe SetTermTime Destroy Service State representation: Resource Property State identification: Endpoint Reference State interfaces: GetRP, QueryRPs, GetMultipleRPs, SetRP Service lifecycle management: SetTerminationTime ImmediateDestruction Notification interfaces: Subscribe Notify GT4: Security Toolkit Java GSI- OpenSSH MyProxy Data Rep C Delegation GridFTP Replica Location Python CAS GridWay Reliable File MDS4 Base Security Execution Data Monitoring 6

GT4: Security Delegation: mechanism to ensure that certificates are valid across different systems. CAS: Community Authorization Service. Provides authorization for groups of users. GSI-OpenSSH: sshwith support for credentials avoiding multiple logins. MyProxy: Credential server. Users keep certificates at the server and provide a single key to read all needed certificates. GT4: Monitoring Toolkit Java GSI- OpenSSH MyProxy Data Rep C Delegation GridFTP Replica Location Python CAS GridWay Reliable File MDS4 Base Segurança Security Execution Execução Dados Data Monitoring Monit. 7

MDS4: GT4: Monitoring MDS-Index: Gathers monitoring information (e.g. GetRP). MDS-Trigger: Compares gathered information with management rules and sends alerts (e.g. scripts). MDS-Archive: Manages monitoring information archive. GT4: Data Management Toolkit Java GSI- OpenSSH MyProxy Data Rep C Delegation GridFTP Replica Location Python CAS GridWay Reliable File MDS4 Base Segurança Security Execution Execução Dados Data Monitoring Monit. 8

GT4: Data Management GridFTP: Efficient data transfer. Reliable File : Manages GridFTP. Data Replication. Replica Location. GridFTP FTP service optimized for high debit in large scale networks: FTP with added extensions. The channels use security. Multiple transfer channels. of partial files. Server-server transfers. Basic Server-Server 9

Striped GridFTP GridFTPsupports striped (multi-node) transfers because often is highbandwidth networks the local file system is a bottleneck: A control channel. Multiple channel on each of several nodes. Requires a shared FS on all nodes. RFT Reliable File A manager for data transfer request with: Server-server transfers. Monitoring for restarts. Database to tolerate failures. Allows clients to submit a request and disconnect. 26 10

RLS - Replica Location Service Data repository manager: Consistent local state stored in the Local Replica Catalogs(LRCs). Global state stored in the Replica Location Indices (RLIs) without consistency guarantees. Configurable topology. GT4 Tools Toolkit Java GSI- OpenSSH MyProxy Data Rep C Delegation GridFTP Replica Location Python CAS GridWay Reliable File MDS4 Base Segurança Security Execution Execução Dados Data Monitoring Monit. 11

GT4: Execution : Job Management. GridWay: Meta Scheduler. : Remote Job Management Triggers data transfers and manages jobs. Keeps persistent state of the jobs. Uses security services. s user credentials. It s not a scheduler. Used as an interface for schedulers and meta-schedulers. Applications Workflow, Meta-schedulers, Batch Jobs, Parameter Sweep Schedulers Condor, SGE, LSF, PBS, Loadleveler, Fork 12

Scalability: 4 Receives a job file with the executable name, input filenames, output filenames and destination machine in RSL (Resource Specification Language). Interacts with schedulers and send jobs to them (SGE, Condor, etc.) Manages job and file information with high scalability: Manages up to 32k active jobs. Monitors node load. Handles bursts of up to 50 jobs. Processes a job every 2 seconds. GridWay: Meta-Scheduler Users GridWay Portal Comand Line Applications Services: MDS,, GridFTP Scheduler Middleware SGE Cluster PBS Cluster LSF Cluster May be heterogeneous and distributed. Infra-structure 13

GridWay Architecture Portal GridWay Core Job Pool Host Pool Command Line Request Manager Dispatch Manager Submission Submission Monitoring Monitoring Control Control Scheduler Job Preparation and Job Manager GridFTP RFT Grid Data Services Execution Manager pre-ws WS Grid Job Execution Services MDS2 Information Manager MDS2 GLUE MDS4 Resource Discovery and Monitoring Grid Monitoring Services GridWay Scheduling Relevant Information: About jobs: Fixed priority Urgency flag User quota Deadline Waiting time About resources: Rank (Preferences). Fixed priority. Past use history. Failure history. 14

GridWay: Meta-Scheduler Dynamic scheduler: Set of pre-defined policies: priority-based, proportional, by waiting time, by deadlines, etc.) Interface for user scheduling code. Based on services. Job resubmission when better nodes become available. Detects violations of advertised characteristics. Able to handle dynamic changes in job execution requests. Security in Grids. Grid case studies. Next time 15