Modeling User Submission Strategies on Production Grids

Size: px
Start display at page:

Download "Modeling User Submission Strategies on Production Grids"

Transcription

1 Modeling User Submission Strategies on Production Grids Diane Lingrand, Johan Montagnat, Tristan Glatard Sophia-Antipolis, FRANCE University of Lyon, FRANCE HPDC 2009 München, Germany

2 The EGEE production grid Huge computing power and data storage facility: > 80,000 CPUs > 250 computing centers worldwide > 200,000 jobs/day > 9,000 registered users Toll: latency and faults 2

3 Variable latencies RDRC Job term. succ. RDR Job term. succ. RDRC RDR RDR - warnings RDRC - warnings Heavy-tailed, multimodal Based on 33 millions of EGEE jobs, [Lingrand et al, JSSPP'09] 3

4 Faults RA - No compatible resource RA RA Job proxy expired RA cannot retrieve previous matches 35% of faults, various distributions Based on 33 millions of EGEE jobs, [Lingrand et al, JSSPP'09] 4

5 EGEE: a complex system Meta-scheduling (no global view) latency R Pre-scheduling (subm. params) Scheduling (heterogeneous algorithms) 5

6 Quality of Service for grid applications Assumption: infrastructure only provides best-effort Analogy with TCP/IP model Grid stack Local Infrastructure Network stack Application (jobs) Application (HTTP)? Transport (TCP) Workload Management System Network (IP) to computing resources to network resources [Meng et al IPDPS'09] => Need for user-level submission strategies 6

7 Outline Introduction Model and evaluation of 3 strategies Single resubmission after timeout [CCGrid'07] Multiple submissions [Casanova, JGC 07] Delayed resubmission Cost criterion Conclusion 7

8 Modeling & evaluation method Infrastructure behavior Latency R is a random variable with c.d.f FR Some jobs (outliers, fraction ρ) have infinite latency ~ FR Submission strategies modeling FR Expectation & stdev of total latency J w.r.t. R and ρ Evaluation Based on 11,000 probe jobs (Sept. 06 Feb 08) 8

9 Single resubmission: modeling Job timed-out => canceled and resubmitted Expectation of total latency time: ~ EJ has a min <=> FR is heavy-tailed EJ(tꝏ) 9

10 Single resubmission: evaluation Total latencies Without With outliers outliers 104s 104s Without outliers 104s Conclusions Manages to filter out outliers Reduces standard-deviation 10

11 Multiple resubmission: modeling Submit b copies of the job With timeout on the collection Expectation of total latency time ~ ~ FR(t) replaced by 1 (1 - FR(t))b b jobs have latency > t At least 1 job has latency < t 11

12 Multiple resubmission: evaluation EJ has a minimum for all values of b Slope after minimum decreases as b increases 12

13 Multiple resubmission: evaluation Stdev minimal EJ standard deviation σj Mean number of job copies (b) number of job copies (b) Strong improvements of mean and stdev Goes smoother as b increases 13

14 Delayed resubmission strategy Goal: limit the number of simultaneous job copies Job submitted at time t with timeout tꝏ Copy submitted at t+t0 0 Job 1 Job 2 (copy 1) Job 3 (copy 2)... t0 tꝏ 2t0 t0+tꝏ Constraints: 3t0 2t0+tꝏ 0 < t0 < t ꝏ tꝏ < 2t0 Expectation of total latency: 14

15 Delayed resubmission strategy Minimal value of EJ Single resubmission: EJ = 471s Multiple resubmissions (b=2): EJ = 314s Delayed resubmission: EJ = 431s 15

16 Total latency of delayed VS mult. subm. 16

17 Outline Introduction Model and evaluation of 3 strategies Single resubmission after timeout Multiple submissions Delayed resubmission Cost criterion Conclusion 17

18 Cost of the strategies Based on total middleware time 0 T/4 T/2 T single submission => middleware time T double submission => middleware time T/2 When N// copies are submitted in parallel: 18

19 Cost of delayed VS multiple submission 19 delayed resubmission strategy multiple resubmission strategy

20 Cost of delayed VS multiple submission 20

21 Conclusion Need for local user-level submission strategies Modeling Evaluation Studied 3 submission strategies Single, multiple, delayed Cost metric based on total middleware time Conclusions Multiple resubmission reduces total latency at a high cost (b=2 => cost = 1.3) Delayed strategy improves total latency at a lower cost than single resubmission Future work: do it live on real applications 21

22 Thank you! Questions? 22

23 23

24 Back slides 24

25 Future work Implementations in real applications Need for latency estimates Using data from the Grid Observatory non sparse data non limited to the biomed VO 25

26 Job's life cycle on EGEE 26

27 EGEE Workload Management System 27

28 Data probe jobs: /bin/hostname periodically submitted to maintain constant load 11,000 traces acquired on the biomed VO between September 2006 and February

N. Marusov, I. Semenov

N. Marusov, I. Semenov GRID TECHNOLOGY FOR CONTROLLED FUSION: CONCEPTION OF THE UNIFIED CYBERSPACE AND ITER DATA MANAGEMENT N. Marusov, I. Semenov Project Center ITER (ITER Russian Domestic Agency N.Marusov@ITERRF.RU) Challenges

More information

Grid Scheduling Architectures with Globus

Grid Scheduling Architectures with Globus Grid Scheduling Architectures with Workshop on Scheduling WS 07 Cetraro, Italy July 28, 2007 Ignacio Martin Llorente Distributed Systems Architecture Group Universidad Complutense de Madrid 1/38 Contents

More information

Multiple Sclerosis Brain MRI Segmentation Workflow deployment on the EGEE grid

Multiple Sclerosis Brain MRI Segmentation Workflow deployment on the EGEE grid Multiple Sclerosis Brain MRI Segmentation Workflow deployment on the EGEE grid Erik Pernod 1, Jean-Christophe Souplet 1, Javier Rojas Balderrama 2, Diane Lingrand 2, Xavier Pennec 1 Speaker: Grégoire Malandain

More information

AMGA metadata catalogue system

AMGA metadata catalogue system AMGA metadata catalogue system Hurng-Chun Lee ACGrid School, Hanoi, Vietnam www.eu-egee.org EGEE and glite are registered trademarks Outline AMGA overview AMGA Background and Motivation for AMGA Interface,

More information

g-eclipse A Framework for Accessing Grid Infrastructures Nicholas Loulloudes Trainer, University of Cyprus (loulloudes.n_at_cs.ucy.ac.

g-eclipse A Framework for Accessing Grid Infrastructures Nicholas Loulloudes Trainer, University of Cyprus (loulloudes.n_at_cs.ucy.ac. g-eclipse A Framework for Accessing Grid Infrastructures Trainer, University of Cyprus (loulloudes.n_at_cs.ucy.ac.cy) EGEE Training the Trainers May 6 th, 2009 Outline Grid Reality The Problem g-eclipse

More information

Scientific data processing at global scale The LHC Computing Grid. fabio hernandez

Scientific data processing at global scale The LHC Computing Grid. fabio hernandez Scientific data processing at global scale The LHC Computing Grid Chengdu (China), July 5th 2011 Who I am 2 Computing science background Working in the field of computing for high-energy physics since

More information

Towards Production-level Cardiac Image Analysis with Grids

Towards Production-level Cardiac Image Analysis with Grids Towards Production-level Cardiac Image Analysis with Grids Ketan MAHESHWARI a,1, Tristan GLATARD b, Joël SCHAERER b, Bertrand DELHAY b, Sorina CAMARASU-POP b, Patrick CLARYSSE b and Johan MONTAGNAT a a

More information

Expériences d'imagerie médicale sur EGEE

Expériences d'imagerie médicale sur EGEE Expériences d'imagerie médicale sur EGEE Tristan Glatard CNRS CREATIS, Lyon Journée d'information grille de calcul CPPM Luminy Marseille 27 avril 2010 http://www.creatis.insa-lyon.fr 1 EGEE medical imaging

More information

The Elasticity and Plasticity in Semi-Containerized Colocating Cloud Workload: a view from Alibaba Trace

The Elasticity and Plasticity in Semi-Containerized Colocating Cloud Workload: a view from Alibaba Trace The Elasticity and Plasticity in Semi-Containerized Colocating Cloud Workload: a view from Alibaba Trace Qixiao Liu* and Zhibin Yu Shenzhen Institute of Advanced Technology Chinese Academy of Science @SoCC

More information

The LHC Computing Grid. Slides mostly by: Dr Ian Bird LCG Project Leader 18 March 2008

The LHC Computing Grid. Slides mostly by: Dr Ian Bird LCG Project Leader 18 March 2008 The LHC Computing Grid Slides mostly by: Dr Ian Bird LCG Project Leader 18 March 2008 The LHC Computing Grid February 2008 Some precursors Computing for HEP means data handling Fixed-target experiments

More information

Energy Consumption in Mobile Phones: A Measurement Study and Implications for Network Applications (IMC09)

Energy Consumption in Mobile Phones: A Measurement Study and Implications for Network Applications (IMC09) Energy Consumption in Mobile Phones: A Measurement Study and Implications for Network Applications (IMC09) Niranjan Balasubramanian Aruna Balasubramanian Arun Venkataramani University of Massachusetts

More information

The LCG 3D Project. Maria Girone, CERN. The 23rd Open Grid Forum - OGF23 4th June 2008, Barcelona. CERN IT Department CH-1211 Genève 23 Switzerland

The LCG 3D Project. Maria Girone, CERN. The 23rd Open Grid Forum - OGF23 4th June 2008, Barcelona. CERN IT Department CH-1211 Genève 23 Switzerland The LCG 3D Project Maria Girone, CERN The rd Open Grid Forum - OGF 4th June 2008, Barcelona Outline Introduction The Distributed Database (3D) Project Streams Replication Technology and Performance Availability

More information

From gridified scripts to workflows: the FSL Feat case

From gridified scripts to workflows: the FSL Feat case From gridified scripts to workflows: the FSL Feat case Tristan Glatard and Sílvia D. Olabarriaga Academic Medical Center Informatics Institute University of Amsterdam MICCAI-G workshop September 6 th 2008

More information

L3.4. Data Management Techniques. Frederic Desprez Benjamin Isnard Johan Montagnat

L3.4. Data Management Techniques. Frederic Desprez Benjamin Isnard Johan Montagnat Grid Workflow Efficient Enactment for Data Intensive Applications L3.4 Data Management Techniques Authors : Eddy Caron Frederic Desprez Benjamin Isnard Johan Montagnat Summary : This document presents

More information

Software composition for scientific workflows

Software composition for scientific workflows Software composition for scientific workflows Johan Montagnat CNRS / UNS, I3S, MODALIS April 17, 2009 http://gwendia.polytech.unice.fr Grid infrastructures Scientific production Europe: EGEE (www.eu-egee.org),

More information

Advanced Job Submission on the Grid

Advanced Job Submission on the Grid Advanced Job Submission on the Grid Antun Balaz Scientific Computing Laboratory Institute of Physics Belgrade http://www.scl.rs/ 30 Nov 11 Dec 2009 www.eu-egee.org Scope User Interface Submit job Workload

More information

Controlling fairness and task granularity in distributed, online, non-clairvoyant workflow executions

Controlling fairness and task granularity in distributed, online, non-clairvoyant workflow executions CONCURRENCY AND COMPUTATION: PRACTICE AND EXPERIENCE Concurrency Computat.: Pract. Exper. 0000; 00:1 20 Published online in Wiley InterScience (www.interscience.wiley.com). Controlling fairness and task

More information

A short introduction to the Worldwide LHC Computing Grid. Maarten Litmaath (CERN)

A short introduction to the Worldwide LHC Computing Grid. Maarten Litmaath (CERN) A short introduction to the Worldwide LHC Computing Grid Maarten Litmaath (CERN) 10-15 PetaByte/year The LHC challenge Data analysis requires at least ~100k typical PC processor cores Scientists in tens

More information

The LHC Computing Grid

The LHC Computing Grid The LHC Computing Grid Visit of Finnish IT Centre for Science CSC Board Members Finland Tuesday 19 th May 2009 Frédéric Hemmer IT Department Head The LHC and Detectors Outline Computing Challenges Current

More information

An Adaptive Online System for Efficient Processing of Hierarchical Data

An Adaptive Online System for Efficient Processing of Hierarchical Data Dimitrios Tsoumakos Nectarios Koziris {nasia, dtsouma, nkoziris}@cslab.ece.ntua.gr Motivation (1) Efficient, on-line processing of bulk data Organized in concept hierarchies Over one or more dimensions

More information

Reliability Support in Virtual Infrastructures

Reliability Support in Virtual Infrastructures Reliability Support in Virtual Infrastructures 2 nd IEEE International Conference on Cloud Computing Technology and Science, Indianapolis, 2010 RESO Guilherme Koslovski (INRIA University of Lyon) Wai-Leong

More information

Monitoring System for the GRID Monte Carlo Mass Production in the H1 Experiment at DESY

Monitoring System for the GRID Monte Carlo Mass Production in the H1 Experiment at DESY Journal of Physics: Conference Series OPEN ACCESS Monitoring System for the GRID Monte Carlo Mass Production in the H1 Experiment at DESY To cite this article: Elena Bystritskaya et al 2014 J. Phys.: Conf.

More information

Towards Production-level Cardiac Image Analysis with Grids

Towards Production-level Cardiac Image Analysis with Grids Author manuscript, published in "HealthGrid 2009, Berlin : Germany (2009)" DOI : 10.3233/978-1-60750-027-8-31 Towards Production-level Cardiac Image Analysis with Grids Ketan MAHESHWARI a,1, Tristan GLATARD

More information

glite Grid Services Overview

glite Grid Services Overview The EPIKH Project (Exchange Programme to advance e-infrastructure Know-How) glite Grid Services Overview Antonio Calanducci INFN Catania Joint GISELA/EPIKH School for Grid Site Administrators Valparaiso,

More information

EGI, LSGC: What s in it for you?

EGI, LSGC: What s in it for you? EGI, LSGC: What s in it for you? Sílvia Delgado Olabarriaga e-bioscience Group Bioinformatics Laboratory Dept of Epidemiology, Biostatistics and Bioinformatics 2 Summary Before EGI EGI Life Science Grid

More information

HammerCloud: A Stress Testing System for Distributed Analysis

HammerCloud: A Stress Testing System for Distributed Analysis HammerCloud: A Stress Testing System for Distributed Analysis Daniel C. van der Ster 1, Johannes Elmsheuser 2, Mario Úbeda García 1, Massimo Paladin 1 1: CERN, Geneva, Switzerland 2: Ludwig-Maximilians-Universität

More information

Low Latency via Redundancy

Low Latency via Redundancy Low Latency via Redundancy Ashish Vulimiri, Philip Brighten Godfrey, Radhika Mittal, Justine Sherry, Sylvia Ratnasamy, Scott Shenker Presenter: Meng Wang 2 Low Latency Is Important Injecting just 400 milliseconds

More information

Application of Virtualization Technologies & CernVM. Benedikt Hegner CERN

Application of Virtualization Technologies & CernVM. Benedikt Hegner CERN Application of Virtualization Technologies & CernVM Benedikt Hegner CERN Virtualization Use Cases Worker Node Virtualization Software Testing Training Platform Software Deployment }Covered today Server

More information

Monitoring the Usage of the ZEUS Analysis Grid

Monitoring the Usage of the ZEUS Analysis Grid Monitoring the Usage of the ZEUS Analysis Grid Stefanos Leontsinis September 9, 2006 Summer Student Programme 2006 DESY Hamburg Supervisor Dr. Hartmut Stadie National Technical

More information

Multiple Sclerosis Brain MRI Segmentation Workflow deployment on the EGEE Grid

Multiple Sclerosis Brain MRI Segmentation Workflow deployment on the EGEE Grid Multiple Sclerosis Brain MRI Segmentation Workflow deployment on the EGEE Grid Erik Pernod 1, Jean-Christophe Souplet 1, Javier Rojas Balderrama 2, Diane Lingrand 2 and Xavier Pennec 1 July 18, 2008 1

More information

CHAPTER 3 GRID MONITORING AND RESOURCE SELECTION

CHAPTER 3 GRID MONITORING AND RESOURCE SELECTION 31 CHAPTER 3 GRID MONITORING AND RESOURCE SELECTION This chapter introduces the Grid monitoring with resource metrics and network metrics. This chapter also discusses various network monitoring tools and

More information

Self-healing of workflow activity incidents on distributed computing infrastructures

Self-healing of workflow activity incidents on distributed computing infrastructures Self-healing of workflow activity incidents on distributed computing infrastructures Rafael Ferreira da Silva a, Tristan Glatard a, Frédéric Desprez b a University of Lyon, CNRS, INSERM, CREATIS, Villeurbanne,

More information

EGEE and Interoperation

EGEE and Interoperation EGEE and Interoperation Laurence Field CERN-IT-GD ISGC 2008 www.eu-egee.org EGEE and glite are registered trademarks Overview The grid problem definition GLite and EGEE The interoperability problem The

More information

A Non-Parametric Approach to Generation and Validation of Synthetic Network Traffic

A Non-Parametric Approach to Generation and Validation of Synthetic Network Traffic The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL A Non-Parametric Approach to Generation and Validation of Synthetic Network Traffic Félix Hernández-Campos ndez-campos Kevin Jeffay Don Smith Department

More information

glite Middleware Usage

glite Middleware Usage glite Middleware Usage Dusan Vudragovic dusan@phy.bg.ac.yu Scientific Computing Laboratory Institute of Physics Belgrade, Serbia Nov. 18, 2008 www.eu-egee.org EGEE and glite are registered trademarks Usage

More information

Status of KISTI Tier2 Center for ALICE

Status of KISTI Tier2 Center for ALICE APCTP 2009 LHC Physics Workshop at Korea Status of KISTI Tier2 Center for ALICE August 27, 2009 Soonwook Hwang KISTI e-science Division 1 Outline ALICE Computing Model KISTI ALICE Tier2 Center Future Plan

More information

Corral: A Glide-in Based Service for Resource Provisioning

Corral: A Glide-in Based Service for Resource Provisioning : A Glide-in Based Service for Resource Provisioning Gideon Juve USC Information Sciences Institute juve@usc.edu Outline Throughput Applications Grid Computing Multi-level scheduling and Glideins Example:

More information

Preemptive, Low Latency Datacenter Scheduling via Lightweight Virtualization

Preemptive, Low Latency Datacenter Scheduling via Lightweight Virtualization Preemptive, Low Latency Datacenter Scheduling via Lightweight Virtualization Wei Chen, Jia Rao*, and Xiaobo Zhou University of Colorado, Colorado Springs * University of Texas at Arlington Data Center

More information

Rapid Deployment of VS Workflows. Meta Scheduling Service

Rapid Deployment of VS Workflows. Meta Scheduling Service Rapid Deployment of VS Workflows on PHOSPHORUS using Meta Scheduling Service M. Shahid, Bjoern Hagemeier Fraunhofer Institute SCAI, Research Center Juelich. (TNC 2009) Outline Introduction and Motivation

More information

Performance Monitors Setup Guide

Performance Monitors Setup Guide Performance Monitors Setup Guide Version 1.0 2017 EQ-PERF-MON-20170530 Equitrac Performance Monitors Setup Guide Document Revision History Revision Date May 30, 2017 Revision List Initial Release 2017

More information

Monitoring tools in EGEE

Monitoring tools in EGEE Monitoring tools in EGEE Piotr Nyczyk CERN IT/GD Joint OSG and EGEE Operations Workshop - 3 Abingdon, 27-29 September 2005 www.eu-egee.org Kaleidoscope of monitoring tools Monitoring for operations Covered

More information

Part 12 殷亚凤. Homepage: Room 301, Building of Computer Science and Technology

Part 12 殷亚凤.   Homepage:   Room 301, Building of Computer Science and Technology Part 12 殷亚凤 Email: yafeng@nju.edu.cn Homepage: http://cs.nju.edu.cn/yafeng/ Room 301, Building of Computer Science and Technology Distributed Web-based systems The WWW is a huge client-server system with

More information

Jobs Resource Utilization as a Metric for Clusters Comparison and Optimization. Slurm User Group Meeting 9-10 October, 2012

Jobs Resource Utilization as a Metric for Clusters Comparison and Optimization. Slurm User Group Meeting 9-10 October, 2012 Jobs Resource Utilization as a Metric for Clusters Comparison and Optimization Joseph Emeras Cristian Ruiz Jean-Marc Vincent Olivier Richard Slurm User Group Meeting 9-10 October, 2012 INRIA - MESCAL Jobs

More information

Managing CAE Simulation Workloads in Cluster Environments

Managing CAE Simulation Workloads in Cluster Environments Managing CAE Simulation Workloads in Cluster Environments Michael Humphrey V.P. Enterprise Computing Altair Engineering humphrey@altair.com June 2003 Copyright 2003 Altair Engineering, Inc. All rights

More information

European Grid Infrastructure

European Grid Infrastructure EGI-InSPIRE European Grid Infrastructure A pan-european Research Infrastructure supporting the digital European Research Area Michel Drescher Technical Manager, EGI.eu Michel.Drescher@egi.eu TPDL 2013

More information

An Introduction to Virtualization and Cloud Technologies to Support Grid Computing

An Introduction to Virtualization and Cloud Technologies to Support Grid Computing New Paradigms: Clouds, Virtualization and Co. EGEE08, Istanbul, September 25, 2008 An Introduction to Virtualization and Cloud Technologies to Support Grid Computing Distributed Systems Architecture Research

More information

Topics in P2P Networked Systems

Topics in P2P Networked Systems 600.413 Topics in P2P Networked Systems Week 4 Measurements Andreas Terzis Slides from Stefan Saroiu Content Delivery is Changing Thirst for data continues to increase (more data & users) New types of

More information

Introduction. Distributed Systems IT332

Introduction. Distributed Systems IT332 Introduction Distributed Systems IT332 2 Outline Definition of A Distributed System Goals of Distributed Systems Types of Distributed Systems 3 Definition of A Distributed System A distributed systems

More information

Fault tolerance in Grid and Grid 5000

Fault tolerance in Grid and Grid 5000 Fault tolerance in Grid and Grid 5000 Franck Cappello INRIA Director of Grid 5000 fci@lri.fr Fault tolerance in Grid Grid 5000 Applications requiring Fault tolerance in Grid Domains (grid applications

More information

The PanDA System in the ATLAS Experiment

The PanDA System in the ATLAS Experiment 1a, Jose Caballero b, Kaushik De a, Tadashi Maeno b, Maxim Potekhin b, Torre Wenaus b on behalf of the ATLAS collaboration a University of Texas at Arlington, Science Hall, PO Box 19059, Arlington, TX

More information

Overview Job Management OpenVZ Conclusions. XtreemOS. Surbhi Chitre. IRISA, Rennes, France. July 7, Surbhi Chitre XtreemOS 1 / 55

Overview Job Management OpenVZ Conclusions. XtreemOS. Surbhi Chitre. IRISA, Rennes, France. July 7, Surbhi Chitre XtreemOS 1 / 55 XtreemOS Surbhi Chitre IRISA, Rennes, France July 7, 2009 Surbhi Chitre XtreemOS 1 / 55 Surbhi Chitre XtreemOS 2 / 55 Outline What is XtreemOS What features does it provide in XtreemOS How is it new and

More information

FROM HPC TO THE CLOUD WITH AMQP AND OPEN SOURCE SOFTWARE

FROM HPC TO THE CLOUD WITH AMQP AND OPEN SOURCE SOFTWARE FROM HPC TO THE CLOUD WITH AMQP AND OPEN SOURCE SOFTWARE Carl Trieloff cctrieloff@redhat.com Red Hat Lee Fisher lee.fisher@hp.com Hewlett-Packard High Performance Computing on Wall Street conference 14

More information

Update on EZ-Grid. Priya Raghunath University of Houston. PI : Dr Barbara Chapman

Update on EZ-Grid. Priya Raghunath University of Houston. PI : Dr Barbara Chapman Update on EZ-Grid Priya Raghunath University of Houston PI : Dr Barbara Chapman chapman@cs.uh.edu Outline Campus Grid at the University of Houston (UH) Functionality of EZ-Grid Design and Implementation

More information

RUSSIAN DATA INTENSIVE GRID (RDIG): CURRENT STATUS AND PERSPECTIVES TOWARD NATIONAL GRID INITIATIVE

RUSSIAN DATA INTENSIVE GRID (RDIG): CURRENT STATUS AND PERSPECTIVES TOWARD NATIONAL GRID INITIATIVE RUSSIAN DATA INTENSIVE GRID (RDIG): CURRENT STATUS AND PERSPECTIVES TOWARD NATIONAL GRID INITIATIVE Viacheslav Ilyin Alexander Kryukov Vladimir Korenkov Yuri Ryabov Aleksey Soldatov (SINP, MSU), (SINP,

More information

Jithendar Paladugula, Ming Zhao, Renato Figueiredo

Jithendar Paladugula, Ming Zhao, Renato Figueiredo Support for Data-Intensive, Variable- Granularity Grid Applications via Distributed File System Virtualization: A Case Study of Light Scattering Spectroscopy Jithendar Paladugula, Ming Zhao, Renato Figueiredo

More information

12d Synergy Requirements

12d Synergy Requirements This document outlines the requirements for a 12d Synergy implementation. The server requirements may differ based on the number of users. Operating below these minimum requirements will put you in an

More information

30 Nov Dec Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy

30 Nov Dec Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy Advanced School in High Performance and GRID Computing Concepts and Applications, ICTP, Trieste, Italy Why the Grid? Science is becoming increasingly digital and needs to deal with increasing amounts of

More information

Evolution of the ATLAS PanDA Workload Management System for Exascale Computational Science

Evolution of the ATLAS PanDA Workload Management System for Exascale Computational Science Evolution of the ATLAS PanDA Workload Management System for Exascale Computational Science T. Maeno, K. De, A. Klimentov, P. Nilsson, D. Oleynik, S. Panitkin, A. Petrosyan, J. Schovancova, A. Vaniachine,

More information

Computing grids, a tool for international collaboration and against digital divide Guy Wormser Director of CNRS Institut des Grilles (CNRS, France)

Computing grids, a tool for international collaboration and against digital divide Guy Wormser Director of CNRS Institut des Grilles (CNRS, France) Computing grids, a tool for international collaboration and against digital divide Guy Wormser Director of CNRS Institut des Grilles (CNRS, France) www.eu-egee.org EGEE and glite are registered trademarks

More information

CMS Tier-2 Program for user Analysis Computing on the Open Science Grid Frank Würthwein UCSD Goals & Status

CMS Tier-2 Program for user Analysis Computing on the Open Science Grid Frank Würthwein UCSD Goals & Status CMS Tier-2 Program for user Analysis Computing on the Open Science Grid Frank Würthwein UCSD Goals & Status High Level Requirements for user analysis computing Code Development Environment Compile, run,

More information

Petri net modeling and simulation of a distributed computing system

Petri net modeling and simulation of a distributed computing system Petri net modeling and simulation of a distributed computing system Jan Magne Tjensvold October 19, 2007 1 Introduction This Petri net model, simulation and analysis project has been inspired by the author

More information

Introduction to Grid Infrastructures

Introduction to Grid Infrastructures Introduction to Grid Infrastructures Stefano Cozzini 1 and Alessandro Costantini 2 1 CNR-INFM DEMOCRITOS National Simulation Center, Trieste, Italy 2 Department of Chemistry, Università di Perugia, Perugia,

More information

Improving Grid User's Privacy with glite Pseudonymity Service

Improving Grid User's Privacy with glite Pseudonymity Service Improving Grid User's Privacy with glite Pseudonymity Service Henri Mikkonen, Joni Hahkala and John White 5 th EGEE User Forum 12-16 April 2010 Uppsala, Sweden www.eu-egee.org EGEE and glite are registered

More information

Gergely Sipos MTA SZTAKI

Gergely Sipos MTA SZTAKI Application development on EGEE with P-GRADE Portal Gergely Sipos MTA SZTAKI sipos@sztaki.hu EGEE Training and Induction EGEE Application Porting Support www.lpds.sztaki.hu/gasuc www.portal.p-grade.hu

More information

PARALLEL PROGRAM EXECUTION SUPPORT IN THE JGRID SYSTEM

PARALLEL PROGRAM EXECUTION SUPPORT IN THE JGRID SYSTEM PARALLEL PROGRAM EXECUTION SUPPORT IN THE JGRID SYSTEM Szabolcs Pota 1, Gergely Sipos 2, Zoltan Juhasz 1,3 and Peter Kacsuk 2 1 Department of Information Systems, University of Veszprem, Hungary 2 Laboratory

More information

A Fault Tolerant Scheduler with Dynamic Replication in Desktop Grid Environment

A Fault Tolerant Scheduler with Dynamic Replication in Desktop Grid Environment A Fault Tolerant Scheduler with Dynamic Replication in Desktop Grid Environment Jyoti Bansal 1, Dr. Shaveta Rani 2, Dr. Paramjit Singh 3 1 Research Scholar,PTU, Kapurthala 2,3 Punjab Technical University

More information

S. Narravula, P. Balaji, K. Vaidyanathan, H.-W. Jin and D. K. Panda. The Ohio State University

S. Narravula, P. Balaji, K. Vaidyanathan, H.-W. Jin and D. K. Panda. The Ohio State University Architecture for Caching Responses with Multiple Dynamic Dependencies in Multi-Tier Data- Centers over InfiniBand S. Narravula, P. Balaji, K. Vaidyanathan, H.-W. Jin and D. K. Panda The Ohio State University

More information

Project GRACE: A grid based search tool for the global digital library

Project GRACE: A grid based search tool for the global digital library Project GRACE: A grid based search tool for the global digital library Frank Scholze 1, Glenn Haya 2, Jens Vigen 3, Petra Prazak 4 1 Stuttgart University Library, Postfach 10 49 41, 70043 Stuttgart, Germany;

More information

VIRTUAL DOMAIN SHARING IN E-SCIENCE BASED ON USAGE SERVICE LEVEL AGREEMENTS

VIRTUAL DOMAIN SHARING IN E-SCIENCE BASED ON USAGE SERVICE LEVEL AGREEMENTS VIRTUAL DOMAIN SHARING IN E-SCIENCE BASED ON USAGE SERVICE LEVEL AGREEMENTS Cǎtǎlin L. Dumitrescu CoreGRID Institute on Programming Models Mathematics and Computer Science Department, The University of

More information

Multimedia Streaming. Mike Zink

Multimedia Streaming. Mike Zink Multimedia Streaming Mike Zink Technical Challenges Servers (and proxy caches) storage continuous media streams, e.g.: 4000 movies * 90 minutes * 10 Mbps (DVD) = 27.0 TB 15 Mbps = 40.5 TB 36 Mbps (BluRay)=

More information

Replicate It! Scalable Content Delivery: Why? Scalable Content Delivery: How? Scalable Content Delivery: How? Scalable Content Delivery: What?

Replicate It! Scalable Content Delivery: Why? Scalable Content Delivery: How? Scalable Content Delivery: How? Scalable Content Delivery: What? Accelerating Internet Streaming Media Delivery using Azer Bestavros and Shudong Jin Boston University http://www.cs.bu.edu/groups/wing Scalable Content Delivery: Why? Need to manage resource usage as demand

More information

Performance Characterization of a Commercial Video Streaming Service. Mojgan Ghasemi, Akamai Technologies - Princeton University

Performance Characterization of a Commercial Video Streaming Service. Mojgan Ghasemi, Akamai Technologies - Princeton University Performance Characterization of a Commercial Video Streaming Service Mojgan Ghasemi, Akamai Technologies - Princeton University MGhasemi,PKanuparthy,AMansy,TBenson,andJRexford ACM IMC 2016 1 2 First study

More information

Grid Compute Resources and Grid Job Management

Grid Compute Resources and Grid Job Management Grid Compute Resources and Job Management March 24-25, 2007 Grid Job Management 1 Job and compute resource management! This module is about running jobs on remote compute resources March 24-25, 2007 Grid

More information

Andrea Sciabà CERN, Switzerland

Andrea Sciabà CERN, Switzerland Frascati Physics Series Vol. VVVVVV (xxxx), pp. 000-000 XX Conference Location, Date-start - Date-end, Year THE LHC COMPUTING GRID Andrea Sciabà CERN, Switzerland Abstract The LHC experiments will start

More information

The ATLAS Distributed Analysis System

The ATLAS Distributed Analysis System The ATLAS Distributed Analysis System F. Legger (LMU) on behalf of the ATLAS collaboration October 17th, 2013 20th International Conference on Computing in High Energy and Nuclear Physics (CHEP), Amsterdam

More information

WIBDirect Corporate Uploading a payment file

WIBDirect Corporate Uploading a payment file WIBDirect Corporate Uploading a payment file Document issue: 2.1 Date of issue: November 2015 Contents Uploading a Payment file... 3 Page 2 Uploading a Payment file Important: Based on your feedback we

More information

MySQL As A Service. Operationalizing 19 Years of Infrastructure at GoDaddy

MySQL As A Service. Operationalizing 19 Years of Infrastructure at GoDaddy MySQL As A Service Operationalizing 19 Years of Infrastructure at GoDaddy WHOAMI Nathan Northcutt Senior Software Engineer MySQL DevOps ~10 years performance engineering & distributed data services. Email:

More information

The LHC Computing Grid

The LHC Computing Grid The LHC Computing Grid Gergely Debreczeni (CERN IT/Grid Deployment Group) The data factory of LHC 40 million collisions in each second After on-line triggers and selections, only 100 3-4 MB/event requires

More information

Designing Next Generation Data-Centers with Advanced Communication Protocols and Systems Services

Designing Next Generation Data-Centers with Advanced Communication Protocols and Systems Services Designing Next Generation Data-Centers with Advanced Communication Protocols and Systems Services P. Balaji, K. Vaidyanathan, S. Narravula, H. W. Jin and D. K. Panda Network Based Computing Laboratory

More information

Non-Uniform Memory Access (NUMA) Architecture and Multicomputers

Non-Uniform Memory Access (NUMA) Architecture and Multicomputers Non-Uniform Memory Access (NUMA) Architecture and Multicomputers Parallel and Distributed Computing Department of Computer Science and Engineering (DEI) Instituto Superior Técnico February 29, 2016 CPD

More information

LCG-2 and glite Architecture and components

LCG-2 and glite Architecture and components LCG-2 and glite Architecture and components Author E.Slabospitskaya www.eu-egee.org Outline Enabling Grids for E-sciencE What are LCG-2 and glite? glite Architecture Release 1.0 review What is glite?.

More information

Workflow, Planning and Performance Information, information, information Dr Andrew Stephen M c Gough

Workflow, Planning and Performance Information, information, information Dr Andrew Stephen M c Gough Workflow, Planning and Performance Information, information, information Dr Andrew Stephen M c Gough Technical Coordinator London e-science Centre Imperial College London 17 th March 2006 Outline Where

More information

Investigating the Use of Synchronized Clocks in TCP Congestion Control

Investigating the Use of Synchronized Clocks in TCP Congestion Control Investigating the Use of Synchronized Clocks in TCP Congestion Control Michele Weigle (UNC-CH) November 16-17, 2001 Univ. of Maryland Symposium The Problem TCP Reno congestion control reacts only to packet

More information

ATLAS NorduGrid related activities

ATLAS NorduGrid related activities Outline: NorduGrid Introduction ATLAS software preparation and distribution Interface between NorduGrid and Condor NGlogger graphical interface On behalf of: Ugur Erkarslan, Samir Ferrag, Morten Hanshaugen

More information

MOHA: Many-Task Computing Framework on Hadoop

MOHA: Many-Task Computing Framework on Hadoop Apache: Big Data North America 2017 @ Miami MOHA: Many-Task Computing Framework on Hadoop Soonwook Hwang Korea Institute of Science and Technology Information May 18, 2017 Table of Contents Introduction

More information

Chapter 3. Design of Grid Scheduler. 3.1 Introduction

Chapter 3. Design of Grid Scheduler. 3.1 Introduction Chapter 3 Design of Grid Scheduler The scheduler component of the grid is responsible to prepare the job ques for grid resources. The research in design of grid schedulers has given various topologies

More information

Non-Uniform Memory Access (NUMA) Architecture and Multicomputers

Non-Uniform Memory Access (NUMA) Architecture and Multicomputers Non-Uniform Memory Access (NUMA) Architecture and Multicomputers Parallel and Distributed Computing Department of Computer Science and Engineering (DEI) Instituto Superior Técnico September 26, 2011 CPD

More information

The Grid. Processing the Data from the World s Largest Scientific Machine II Brazilian LHC Computing Workshop

The Grid. Processing the Data from the World s Largest Scientific Machine II Brazilian LHC Computing Workshop The Grid Processing the Data from the World s Largest Scientific Machine II Brazilian LHC Computing Workshop Patricia Méndez Lorenzo (IT-GS/EIS), CERN Abstract The world's largest scientific machine will

More information

IBM InfoSphere Streams v4.0 Performance Best Practices

IBM InfoSphere Streams v4.0 Performance Best Practices Henry May IBM InfoSphere Streams v4.0 Performance Best Practices Abstract Streams v4.0 introduces powerful high availability features. Leveraging these requires careful consideration of performance related

More information

Enabling Grids for E-sciencE. EGEE security pitch. Olle Mulmo. EGEE Chief Security Architect KTH, Sweden. INFSO-RI

Enabling Grids for E-sciencE. EGEE security pitch. Olle Mulmo. EGEE Chief Security Architect KTH, Sweden.  INFSO-RI EGEE security pitch Olle Mulmo EGEE Chief Security Architect KTH, Sweden www.eu-egee.org Project PR www.eu-egee.org EGEE EGEE is the largest Grid infrastructure project in the World? : 70 leading institutions

More information

Executing distributed applications on virtualized infrastructures specified with the VXDL language and managed by the HIPerNET framework

Executing distributed applications on virtualized infrastructures specified with the VXDL language and managed by the HIPerNET framework Executing distributed applications on virtualized infrastructures specified with the VXDL language and managed by the HIPerNET framework Guilherme Koslovski, Tram Truong Huu, Johan Montagnat, Pascale Vicat-Blanc

More information

The EGEE-III Project Towards Sustainable e-infrastructures

The EGEE-III Project Towards Sustainable e-infrastructures The EGEE-III Project Towards Sustainable e-infrastructures Erwin Laure EGEE-III Technical Director Erwin.Laure@cern.ch www.eu-egee.org EGEE-II INFSO-RI-031688 EGEE and glite are registered trademarks EGEE

More information

A Library and Proxy for SPDY

A Library and Proxy for SPDY A Library and Proxy for SPDY Interdisciplinary Project Andrey Uzunov Chair for Network Architectures and Services Department of Informatics Technische Universität München April 3, 2013 Andrey Uzunov (TUM)

More information

Sphinx: A Scheduling Middleware for Data Intensive Applications on a Grid

Sphinx: A Scheduling Middleware for Data Intensive Applications on a Grid Sphinx: A Scheduling Middleware for Data Intensive Applications on a Grid Richard Cavanaugh University of Florida Collaborators: Janguk In, Sanjay Ranka, Paul Avery, Laukik Chitnis, Gregory Graham (FNAL),

More information

Reprocessing DØ data with SAMGrid

Reprocessing DØ data with SAMGrid Reprocessing DØ data with SAMGrid Frédéric Villeneuve-Séguier Imperial College, London, UK On behalf of the DØ collaboration and the SAM-Grid team. Abstract The DØ experiment studies proton-antiproton

More information

Box s 1 minute Bio l B. Eng (AE 1983): Khon Kean University

Box s 1 minute Bio l B. Eng (AE 1983): Khon Kean University CSC469/585: Winter 2011-12 High Availability and Performance Computing: Towards non-stop services in HPC/HEC/Enterprise IT Environments Chokchai (Box) Leangsuksun, Associate Professor, Computer Science

More information

QCN: Quantized Congestion Notification. Rong Pan, Balaji Prabhakar, Ashvin Laxmikantha

QCN: Quantized Congestion Notification. Rong Pan, Balaji Prabhakar, Ashvin Laxmikantha QCN: Quantized Congestion Notification Rong Pan, Balaji Prabhakar, Ashvin Laxmikantha Overview Description the QCN scheme Pseudocode available with Rong Pan: ropan@cisco.com Basic simulations Infinitely

More information

Medical Image Content-Based Queries using the Grid

Medical Image Content-Based Queries using the Grid Medical Image Content-Based Queries using the Grid Johan Montagnat, Hector Duque, Jean-Marc Pierson, Vincent Breton, Lionel Brunie, Isabelle Magnin To cite this version: Johan Montagnat, Hector Duque,

More information

COMMUNICATION PROTOCOLS

COMMUNICATION PROTOCOLS COMMUNICATION PROTOCOLS Index Chapter 1. Introduction Chapter 2. Software components message exchange JMS and Tibco Rendezvous Chapter 3. Communication over the Internet Simple Object Access Protocol (SOAP)

More information

SLCS and VASH Service Interoperability of Shibboleth and glite

SLCS and VASH Service Interoperability of Shibboleth and glite SLCS and VASH Service Interoperability of Shibboleth and glite Christoph Witzig, SWITCH (witzig@switch.ch) www.eu-egee.org NREN Grid Workshop Nov 30th, 2007 - Malaga EGEE and glite are registered trademarks

More information