High Performance Computing Course Notes Grid Computing I

Similar documents
Grid Computing. MCSN - N. Tonellotto - Distributed Enabling Platforms

30 Nov Dec Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy

Introduction to Grid Technology

Grid Architectural Models

Grid Computing. Lectured by: Dr. Pham Tran Vu Faculty of Computer and Engineering HCMC University of Technology

Introduction to Grid Infrastructures

Knowledge Discovery Services and Tools on Grids

DIRAC distributed secure framework

Grids and Security. Ian Neilson Grid Deployment Group CERN. TF-CSIRT London 27 Jan

Chapter 4:- Introduction to Grid and its Evolution. Prepared By:- NITIN PANDYA Assistant Professor SVBIT.

Access the power of Grid with Eclipse

A Federated Grid Environment with Replication Services

Grid Computing Fall 2005 Lecture 5: Grid Architecture and Globus. Gabrielle Allen

UNICORE Globus: Interoperability of Grid Infrastructures

Scientific Computing with UNICORE

Introduction to Grid Computing

Distributed Data Management with Storage Resource Broker in the UK

DIRAC Distributed Secure Framework

Grid-Based Data Mining and the KNOWLEDGE GRID Framework

Grid Middleware and Globus Toolkit Architecture

THE GLOBUS PROJECT. White Paper. GridFTP. Universal Data Transfer for the Grid

On the employment of LCG GRID middleware

Advanced School in High Performance and GRID Computing November Introduction to Grid computing.

Troubleshooting Grid authentication from the client side

The GENIUS Grid Portal

A Distributed Media Service System Based on Globus Data-Management Technologies1

Design of Distributed Data Mining Applications on the KNOWLEDGE GRID

2. HDF AAI Meeting -- Demo Slides

glite Grid Services Overview

Grid Technologies & Applications: Architecture & Achievements

ATLAS NorduGrid related activities

Development of new security infrastructure design principles for distributed computing systems based on open protocols

Day 1 : August (Thursday) An overview of Globus Toolkit 2.4

Problems for Resource Brokering in Large and Dynamic Grid Environments

Grid Computing a new tool for science

Grid Computing Security hack.lu 2006 :: Security in Grid Computing :: Lisa Thalheim 1

P2P technologies, PlanetLab, and their relevance to Grid work. Matei Ripeanu The University of Chicago

Accounting in Grid Environments

Automatic Job Resubmission in the Nordugrid Middleware

Scientific data management

HEP replica management

Layered Architecture

Grid Infrastructure For Collaborative High Performance Scientific Computing

Kepler and Grid Systems -- Early Efforts --

IMPROVING THE CURRENCY OF INFORMATION IN LARGE-SCALE GRID INFORMATION SYSTEMS

Functional Requirements for Grid Oriented Optical Networks

Data Replication: Automated move and copy of data. PRACE Advanced Training Course on Data Staging and Data Movement Helsinki, September 10 th 2013

Grid Challenges and Experience

Leveraging the InCommon Federation to access the NSF TeraGrid

Application of Virtualization Technologies & CernVM. Benedikt Hegner CERN

IEPSAS-Kosice: experiences in running LCG site

By Ian Foster. Zhifeng Yun

Storage Virtualization. Eric Yen Academia Sinica Grid Computing Centre (ASGC) Taiwan

Grid Computing: dealing with GB/s dataflows

Wireless Java-enabled MIDP devices as peers in Grid infrastructure

Monitoring the Usage of the ZEUS Analysis Grid

Report. Middleware Proxy: A Request-Driven Messaging Broker For High Volume Data Distribution

Web Services for Visualization

DHANALAKSHMI COLLEGE OF ENGINEERING, CHENNAI

Classification and Characterization of Core Grid Protocols for Global Grid Computing

A Data-Aware Resource Broker for Data Grids

Grid services. Enabling Grids for E-sciencE. Dusan Vudragovic Scientific Computing Laboratory Institute of Physics Belgrade, Serbia

EUROPEAN MIDDLEWARE INITIATIVE

Federated Services for Scientists Thursday, December 9, p.m. EST

UNIT IV PROGRAMMING MODEL. Open source grid middleware packages - Globus Toolkit (GT4) Architecture, Configuration - Usage of Globus

AMAZON S3 FOR SCIENCE GRIDS: A VIABLE SOLUTION?

Globus Toolkit 4 Execution Management. Alexandra Jimborean International School of Informatics Hagenberg, 2009

A Resource Discovery Algorithm in Mobile Grid Computing Based on IP-Paging Scheme

A Simulation Model for Large Scale Distributed Systems

High-Energy Physics Data-Storage Challenges

The NASA/GSFC Advanced Data Grid: A Prototype for Future Earth Science Ground System Architectures

CernVM-FS beyond LHC computing

EGEE and Interoperation

Data Management 1. Grid data management. Different sources of data. Sensors Analytic equipment Measurement tools and devices

Data Management for the World s Largest Machine

Cyberinfrastructure Framework for 21st Century Science & Engineering (CIF21)

Globus Platform Services for Data Publication. Greg Nawrocki University of Chicago & Argonne National Lab GeoDaRRS August 7, 2018

Introduction Data Management Jan Just Keijser Nikhef Grid Tutorial, November 2008

WHEN the Large Hadron Collider (LHC) begins operation

Globus GTK and Grid Services

The Grid Authentication System for Mobile Grid Environment

LCG User Registration & VO management

Framework for Interactive Parallel Dataset Analysis on the Grid

Design The way components fit together

Big Computing and the Mitchell Institute for Fundamental Physics and Astronomy. David Toback

XtreemFS a case for object-based storage in Grid data management. Jan Stender, Zuse Institute Berlin

Conference The Data Challenges of the LHC. Reda Tafirout, TRIUMF

Data Management for Distributed Scientific Collaborations Using a Rule Engine

Design The way components fit together

Heterogeneous Grid Computing: Issues and Early Benchmarks

Architecture Proposal

FREE SCIENTIFIC COMPUTING

The EU DataGrid Testbed

Delivering Data Management for Engineers on the Grid 1

Data Intensive Science Impact on Networks

Worldwide Production Distributed Data Management at the LHC. Brian Bockelman MSST 2010, 4 May 2010

Grid Computing Systems: A Survey and Taxonomy

Knowledge-based Grids

Introduction to Programming and Computing for Scientists

Distributed Repository for Biomedical Applications

Transcription:

High Performance Computing Course Notes 2008-2009 2009 Grid Computing I

Resource Demands Even as computer power, data storage, and communication continue to improve exponentially, resource capacities are failing to keep up with what scientists demand of them A personal computer in 2001 is as fast as a supercomputer of 1990, but 10 years ago, biologists were happy to compute a single molecular structure. Now, they want to simulate the activities of human brain Personal computers now ship with more than 100 gigabytes (GB) of storage--as much as an entire 1990 supercomputer center. But by 2006, several physics projects, CERN's Large Hadron Collider (LHC) among them, will produce multiple petabytes (10 15 byte) of data per year Some wide area networks can now operate at 100 megabits per second (Mb/s), three orders of magnitude faster than the state-of-the-art 56 kilobits per second (Kb/s) that connected US supercomputer centers in 1985. But to work with colleagues across the world on petabyte data sets, scientists now demand tens of gigabits per second (Gb/s). 2

Technology trends Computer power doubles every 18 month Storage capacity doubles every 12 month Performance of wide area networks doubles every 9 months If the trend continues, the communication will become essentially free, which changes how we think about and undertake collaboration, that is, using remote resources to do things that we cannot do easily at home Not just conventional C/S model, but a large-scale, heterogeneous, dynamic infrastructure; we needs to deal with Different administrative domains Different architectures Different security and access control policies 3

Grid Infrastructure Built on the Internet and the World Wide Web, the Grid is a new class of infrastructure providing scalable, secure, high-performance mechanisms for discovering and negotiating access to remote resources make it possible for scientific collaborations to share resources on an unprecedented scale, and for geographically distributed groups to work together in ways that were previously impossible 4

What Grid can do? Using remote resources involves: Discover the resource Negotiate access Configuration of local hardware and software protect my own security and respect the security of the remote resources Implementing these steps requires uniform mechanisms for: creating and managing services on remote resources supporting single sign-on to distributed resources transferring large volume of data at high speeds forming large distributed virtual organisations Maintaining information about status and usage policies of the resources in the virtual organisations 5

What Grid can do? Today's Internet and Web technologies address basic communication requirements, but not the tasks just outlined The aim of Grid is to provide the infrastructure and tools that make large-scale, secure resource sharing possible and straightforward Grid tools are concerned with resource discovery, data management, Job scheduling, security, and so forth. 6

How Grid works The users enter grid by using the software interface running on their own computers Perform security validation Interact with resource broker Resource broker inquires information service to know which resources and services are available to process your program Resource broker inquires replica catalog to know all existing data Once the suitable resources are allocated, the resource broker assigns the job to the resources for execution 7

Grid Infrastructure 8

Grid Infrastructure Fabric layer contains the physical devices or resources that Grid users want to share and access Connectivity and Resource layer Connectivity layer Contains core communication and authentication protocols Communication protocols enable the exchange of data between resources authentication protocols provide cryptographically secure mechanisms Resource layer contains protocols that exploit communication and authentication protocols to enable the secure submission, initiation, monitoring, and control of resource-sharing operations Implementation: Globus 9

Grid Infrastructure collective layer contains protocols, services, and APIs that implement interactions across collections of resources. Exemplar services include brokering and information services for resource discovery and allocation; data replication services; diagnostic services; membership and policy services for keeping track of who in a community is allowed to access resources. user applications Call on the components in any other layer For example obtaining authentication credentials (connectivity layer protocols) querying an information system and replica catalog (collective services) submitting requests to appropriate resources (resource protocols) monitoring the progress of the various computations (resource protocols). 10

Security in Grid environments Traditional security technologies are concerned primarily with securing the interactions between clients and servers a client and a server mutually authenticate each other's identity the server determines whether to authorize requests issued by the client In Grid environments, the situation is more complex The distinction between client and server tends to disappear Single sign-on a user authenticate once and then assign to the computation the right to operate on his or her behalf achieved through the creation of a proxy credential Mapping to local security mechanisms Delegation Community authorization and policy It is infeasible for each resource to keep track of identity of every user Instead, users can be verified using group membership, which can be achieved with a cryptographic credential issued by a trusted third party (a community representative) 11

Security in Grid environments 12

Reference Ian Foster, The Grid: A new Infrastructure for 21st Century Science, http://www.aip.org/pt/vol-55/iss- 2/p42.html 13