Managing Scientific Computations in Grid Systems

Similar documents
Chelonia. a lightweight self-healing distributed storage

Dedicated to my Family

Empowering a Flexible Application Portal with a SOA-based Grid Job Management Framework

Performance and Stability of the Chelonia Storage System

arxiv: v1 [cs.dc] 3 Feb 2010

Scientific data management

Chelonia User s manual

Lessons Learned in the NorduGrid Federation

A Composable Service-Oriented Architecture for Middleware-Independent and Interoperable Grid Job Management

Usage statistics and usage patterns on the NorduGrid: Analyzing the logging information collected on one of the largest production Grids of the world

Data Management 1. Grid data management. Different sources of data. Sensors Analytic equipment Measurement tools and devices

EGEE and Interoperation

Interoperating AliEn and ARC for a distributed Tier1 in the Nordic countries.

ARC NOX AND THE ROADMAP TO THE UNIFIED EUROPEAN MIDDLEWARE

Multiple Broker Support by Grid Portals* Extended Abstract

I Tier-3 di CMS-Italia: stato e prospettive. Hassen Riahi Claudio Grandi Workshop CCR GRID 2011

ARC integration for CMS

Chelonia - Self-healing distributed storage system

EUROPEAN MIDDLEWARE INITIATIVE

DISTRIBUTED COMPUTER SYSTEMS ARCHITECTURES

The LHC Computing Grid

Interconnect EGEE and CNGRID e-infrastructures

Operating the Distributed NDGF Tier-1

Introduction to Grid Computing

NORDUGRID NORDUGRID-MANUAL-15 18/2/2013. The Hopi manual. Zsombor Nagy.

TDP3471 Distributed and Parallel Computing

CloudBATCH: A Batch Job Queuing System on Clouds with Hadoop and HBase. Chen Zhang Hans De Sterck University of Waterloo

A Grid Portal Implementation for Genetic Mapping of Multiple QTL

GRID Stream Database Managent for Scientific Applications

<Insert Picture Here> Enterprise Data Management using Grid Technology

Monitoring ARC services with GangliARC

Distributing storage of LHC data - in the nordic countries

GRID Application Portal

First European Globus Community Forum Meeting

Understanding StoRM: from introduction to internals

Boundary control : Access Controls: An access control mechanism processes users request for resources in three steps: Identification:

Requirements for TINA Platform towards Information Sharing Business. Long-term Trend of Telephone Business

Knowledge Discovery Services and Tools on Grids

High Throughput WAN Data Transfer with Hadoop-based Storage

Advanced School in High Performance and GRID Computing November Introduction to Grid computing.

MDHIM: A Parallel Key/Value Store Framework for HPC

Performance of the NorduGrid ARC and the Dulcinea Executor in ATLAS Data Challenge 2

Andrea Sciabà CERN, Switzerland

Grid Scheduling Architectures with Globus

High Performance Computing Course Notes Grid Computing I

Introduction to Programming and Computing for Scientists

Managing CAE Simulation Workloads in Cluster Environments

PNUTS: Yahoo! s Hosted Data Serving Platform. Reading Review by: Alex Degtiar (adegtiar) /30/2013

PoS(EGICF12-EMITC2)081

Outline. Definition of a Distributed System Goals of a Distributed System Types of Distributed Systems

Maths for Signals and Systems Linear Algebra in Engineering. Some problems by Gilbert Strang

IOS: A Middleware for Decentralized Distributed Computing

UNICORE Globus: Interoperability of Grid Infrastructures

Introduction to The Storage Resource Broker

Architecture Proposal

Data Management for the World s Largest Machine

Introduction to Programming and Computing for Scientists

The Grid Architecture

Semantic SOA - Realization of the Adaptive Services Grid

Grid Compute Resources and Job Management

Gatlet - a Grid Portal Framework

Exploiting peer group concept for adaptive and highly available services

Assignment 5. Georgia Koloniari

Storage and Compute Resource Management via DYRE, 3DcacheGrid, and CompuStore Ioan Raicu, Ian Foster

Functional Requirements for Grid Oriented Optical Networks

CUDA GPGPU Workshop 2012

ARC-XWCH bridge: Running ARC jobs on the XtremWeb-CH volunteer

Task Management Service

Grid-Based Data Mining and the KNOWLEDGE GRID Framework

L3S Research Center, University of Hannover

News / Outlook / Visions

Distributed Systems. Overview. Distributed Systems September A distributed system is a piece of software that ensures that:

The LHC Computing Grid

XBS Application Development Platform

T-Systems Solutions for Research. Data Management and Security. T-Systems Solutions for Research GmbH

The Problem of Grid Scheduling

Architectures for Distributed Systems

Technology for a better society. SINTEF ICT, Applied Mathematics, Heterogeneous Computing Group

REDCAP INTRODUCTION CLASS. November 9, 2017

PoS(EGICF12-EMITC2)074

Case-Study for Different Models of Resource Brokering in Grid Systems

Workshop 4.4: Lessons Learned and Best Practices from GI-SDI Projects II

DISCRETE TIME ADAPTIVE LINEAR CONTROL

MONET MONET. Mechanisms for Optimization of hybrid ad-hoc and satellite NETworks FP7 INFSO-ICT

Swedish National Storage Infrastructure for Academic Research with irods

PROOF-Condor integration for ATLAS

A Distributed Media Service System Based on Globus Data-Management Technologies1

Design and Implementation of High Performance and Availability Java RMI Server Group

Client tools know everything

A MODEL FOR MANAGING MEDICAL IMAGE DATA ON THE CLOUD

An agent-based peer-to-peer grid computing architecture

Write a technical report Present your results Write a workshop/conference paper (optional) Could be a real system, simulation and/or theoretical

XtreemFS a case for object-based storage in Grid data management. Jan Stender, Zuse Institute Berlin

MATH 423 Linear Algebra II Lecture 17: Reduced row echelon form (continued). Determinant of a matrix.

Scientific Computing with UNICORE

Cluster Setup and Distributed File System

Towards Generic Models

CAS 703 Software Design

AGORA: A Dependable High-Performance Coordination Service for Multi-Cores

Introduction to SRM. Riccardo Zappi 1

Transcription:

Managing Scientific Computations in Grid Systems Salman Toor Division of Scientific Computing Department of Information Technology Uppsala University November 11, 2008

Motivation Integration architecture Major components Architecture System functionality Current status Problem description Example of Mathematical IR

Motivation Integration architecture Example-1:

Motivation Integration architecture A joint project between Lund, Umeå, and Uppsala Universities. The aim is to build a: Robust Reliable User friendly Middleware independent grid application portal by combining already available building blocks.

Motivation Integration architecture Motivation A grid application portal should be user-friendly and based on a reliable job management system. LUNARC Application Portal (LAP) has a user-friendly web interface, that is tightly connected with the underlying middleware (ARC). Grid Job Management Framework (GJMF) has a very strong job management framework, which can be used as a robust interface. This integration project is the natural extension of these projects to jointly build a more reliable and easy-to-use framework for grid applications.

Motivation Integration architecture Integration architecture The integration architecture is based of the following building blocks: A 3-tier, loosely coupled model. The proposed architecture relies on: Application management by LAP. Grid job management by GJMF. The notification-based state monitoring with the fall-back option of polling.

Motivation Integration architecture Detailed picture of integration architecture

Motivation Integration architecture Advantages of integration architecture File staging is completely independent from the logic layer. GJMF introduces: A fault tolerant and robust submission for single and multiple jobs. Middleware independent job management.

Motivation Integration architecture Based on this architecture, our paper was accepted in the 9th International Workshop on State-of-the-Art in Scientific and Parallel Computing 2008. Title: Empowering a Flexible Application Portal with a SOA-based Grid Job Management Framework. Authors: E. Elmroth, S. Holmgren, J. Lindemann, S. Toor, and P-O Östberg.

Major components Architecture System functionality Current status Example-2: KnowARC storage system

Major components Architecture System functionality Current status What is KnowARC? The KnowARC (Grid-enabled Know-how Sharing Technology Based on ARC Services and Open Standards) is a project within the NorduGrid consortium. Goal: Improve and extend the existing state-of-the-art technology found in the Advanced Resource Connector (ARC) middleware. Major components: HED (Hosting Environment Daemon) with novel security framework A-REX, a standard compliant (BES/JSDL) job execution service Storage system Client tools...

Major components Architecture System functionality Current status KnowARC storage system The goal is to implement services which can form a reliable, robust, scalable, high-performance, and consistent data storage system by using the components provided in the KnowARC framework. Consists of following services: Bartender Librarian Shepherd A-Hash Gateway Also provides client tools using FUSE model.

Major components Architecture System functionality Current status Bartender: Provides a high-level interface for the users by communicating with the Librarian and the Shepherd services. Librarian: Handles the global namespace and registering the Shepherd services. Shepherd: Use for storing the replicas and the files on different storage solutions. Gateway: Use for communicating with external storage system using different protocols like gridftp and srm. Currently available for the Dcache. A-Hash: A database service which is used by the Librarian services to store all the data. Currently it is centralized but will be distributed in the final release.

Download file from internal store Major components Architecture System functionality Current status

Download file from external store Major components Architecture System functionality Current status

Current status Major components Architecture System functionality Current status Recently, we completed the first prototype version of storage system with the basic level of security and centralized A-Hash service. By the start of next year, production version of KnowARC storage system will be available with full security implementation and with the distributed A-Hash service. Project partners: Bjarte Mohn (Uppsala University) Jon Kerr Nilsen (Oslo University) Salman Toor (Uppsala University) Zsombor Nagy (NIIF, Hungary)

Problem description Example of Mathematical IR Example-3: Brokering in storage systems using Linear Algebra

Problem description Example of Mathematical IR Problem description How to design an improved model for load balancing or brokering component for several geographically distributed storage nodes? The functionality of the resource brokering or load-balancing depends on: 1. The amount of information available about the resources. 2. How efficient is the response of the broker? 3. How reliable the decision is?

Problem description Example of Mathematical IR Proposed idea In the heterogeneous environment of the grid, it is more realistic to have an estimation model of finding the resources that may or may not be the best of its kind, but should fulfill the requirement.

Problem description Example of Mathematical IR Proposed idea In the heterogeneous environment of the grid, it is more realistic to have an estimation model of finding the resources that may or may not be the best of its kind, but should fulfill the requirement. For this purpose we want to use the technique of Mathematical Information Retrieval (Math IR).

Problem description Example of Mathematical IR Math IR example Dimension of the matrix is: Terms x Documents Each document is represented as a four dimensional vector, where each dimension is the required term in the document. Query: cooking help or help on cooking

Problem description Example of Mathematical IR Math IR example: Possible solutions Check the similarity of cosine angles(naive approach): aj T q a j 2 q j = 2 P t i=1 a ij q i cos(θ j ) = Pt i=1 a 2 P t ij i=1 q i 2 Where a ij : Elements of the Term x Document matrix. q i : Elements of the Query vector. Use the Reduced Rank Approximation (advanced approach) by using QR factorization or some other method. Source: Matrix Rank Reduction for Data Analysis and Feature Extraction, Haesun Park, Lars Elden Matrices, Vector Spaces, and Information Retrieval, by Michael W. Berry, Zlatko Drmat and Elizabeth R. Jessup

Problem description Example of Mathematical IR Aim is to use reduced rank approximation or some other method to improve the resource brokering in the grid systems.

Problem description Example of Mathematical IR This approach can be useful for handling the requests of uploading files and creating multiple replicas in the system.

Problem description Example of Mathematical IR This approach can be useful for handling the requests of uploading files and creating multiple replicas in the system. Main Challenges: Matrix need to be updated recurrently. cannot help to handle downloading requests. Need a separate matrix. Which method to use?