High Availability irods System (HAIRS)

Similar documents
High Availability irods System (HAIRS)

Failover procedure for Grid core services

Proceedings. irods User Group Meeting Policy-Based Data Management Sharing and Preservation

AMGA metadata catalogue system

MetaData Management Control of Distributed Digital Objects using irods. Venkata Raviteja Vutukuri

AMGA Metadata catalogue service for managing mass data of high-energy physics

Database Replication in Tashkent. CSEP 545 Transaction Processing Sameh Elnikety

Part2: Let s pick one cloud IaaS middleware: OpenStack. Sergio Maffioletti

Comparative evaluation of software tools accessing relational databases from a (real) grid environments

Metadaten Workshop 26./27. März 2007 Göttingen. Chimera. a new grid enabled name-space service. Martin Radicke. Tigran Mkrtchyan

Grid services. Enabling Grids for E-sciencE. Dusan Vudragovic Scientific Computing Laboratory Institute of Physics Belgrade, Serbia

irods and Objectstorage UGM 2016, Chapel Hill / Othmar Weber, Bayer Business Services / v0.2

TECHNICAL OVERVIEW irods Technical Overview 2016 edition RCI_iROD_Report_final2.indd 1-2 5/26/16 9:21 AM

glite Grid Services Overview

Oracle Database and Application Solutions

WHEN the Large Hadron Collider (LHC) begins operation

Academic Workflow for Research Repositories Using irods and Object Storage

Auditing with the Pluggable Rule Engine and AMQP

Operating the Distributed NDGF Tier-1

DSpace Fedora. Eprints Greenstone. Handle System

Pgpool-II Development Status Updates. Pgpool-II Global Development Group

PoS(EGICF12-EMITC2)106

IRODS: the Integrated Rule- Oriented Data-Management System

Xen and CloudStack. Ewan Mellor. Director, Engineering, Open-source Cloud Platforms Citrix Systems

CI-BER tutorial. Richard Marciano Chien-Yi Hou 11/14/2013. Title

Data Intensive processing with irods and the middleware CiGri for the Whisper project Xavier Briand

DIRAC File Replica and Metadata Catalog

Deploying virtualisation in a production grid

Web Object Scaler. WOS and IRODS Data Grid Dave Fellinger

Oracle Cloud Metric Reference for Oracle Infrastructure Monitoring

An overview of virtual machine architecture

Analysis of Derby Performance

Application of Virtualization Technologies & CernVM. Benedikt Hegner CERN

Enhancement of Real Time EPICS IOC PV Management for Data Archiving System. Jae-Ha Kim

Distributed KIDS Labs 1

PostgreSQL in Mission-Critical Financial Systems May 20th 2010

ASG-Rochade Reconciliation Toolkit Release Notes

Toward Scalable Monitoring on Large-Scale Storage for Software Defined Cyberinfrastructure

Transcontinental Persistent Archive Prototype

Oracle Enterprise Manager. 1 Before You Install. System Monitoring Plug-in for Oracle Unified Directory User's Guide Release 1.0

Swedish National Storage Infrastructure for Academic Research with irods

Distributed Computing Framework. A. Tsaregorodtsev, CPPM-IN2P3-CNRS, Marseille

Database Assessment for PDMS

At Course Completion Prepares you as per certification requirements for AWS Developer Associate.

Introduction to Database Systems. Motivation. Werner Nutt

Monte Carlo Production on the Grid by the H1 Collaboration

<Insert Picture Here> Introducing Oracle WebLogic Server on Oracle Database Appliance

Data transfer over the wide area network with a large round trip time

Outline. Definition of a Distributed System Goals of a Distributed System Types of Distributed Systems

The LCG 3D Project. Maria Girone, CERN. The 23rd Open Grid Forum - OGF23 4th June 2008, Barcelona. CERN IT Department CH-1211 Genève 23 Switzerland

Distributing BaBar Data using the Storage Resource Broker (SRB)

The glite middleware. Ariel Garcia KIT

External Database Requirements

OPENSTACK: THE OPEN CLOUD

Design of the Advanced Metadata Service System with AMGA for the Belle II Experiment

Oracle WebCenter Portal Performance Tuning

External Database Requirements

System Description. System Architecture. System Architecture, page 1 Deployment Environment, page 4

The Project Presentation April 28, 2006

Distributed Systems COMP 212. Lecture 18 Othon Michail

Copyright 2012 EMC Corporation. All rights reserved.

CIT 668: System Architecture. Amazon Web Services

Using MVCC for Clustered Databases

Understanding StoRM: from introduction to internals

Beob Kyun KIM, Christophe BONNAUD {kyun, NSDC / KISTI

Scientific data processing at global scale The LHC Computing Grid. fabio hernandez

Oracle E-Business Suite 11i with Cisco ACE Series Application Control Engine Deployment Guide, Version 1.0

Chapter 12: File System Implementation

CC-IN2P3 activity. irods in production: irods developpements in Lyon: SRB to irods migration. Hardware setup. Usage. Prospects.

A High Availability Solution for GRID Services

Cisco Enterprise Cloud Suite Overview Cisco and/or its affiliates. All rights reserved.

Let's Play... Try to name the databases described on the following slides...

Dr Markus Hagenbuchner CSCI319. Distributed Systems Chapter 3 - Processes

Remote Visualization, Analysis and other things

DTM a lightweight computing virtualization system based on irods. Yonny Cardenas, Pascal Calvat, Jean-Yves Nief, Thomas Kachelhoffer

Richard Marciano Alexandra Chassanoff David Pcolar Bing Zhu Chien-Yi Hu. March 24, 2010

where the Web was born Experience of Adding New Architectures to the LCG Production Environment

Using AWS Data Migration Service with RDS

XRAY Grid TO BE OR NOT TO BE?

Real Life Web Development. Joseph Paul Cohen

irods Scalable Architecture

Oracle Secure Backup: Achieve 75 % Cost Savings with Your Tape Backup

Virtual Connect Support Utility Version User Guide 1.5.2

Layer-4 to Layer-7 Services

ParaFS: A Log-Structured File System to Exploit the Internal Parallelism of Flash Devices

Grid and Cloud Activities in KISTI

Deep Dive on Amazon Elastic File System

Bottom line: A database is the data stored and a database system is the software that manages the data. COSC Dr.

irods User Group integrated Rule Oriented Data System Reagan Moore

Simplifying Collaboration in the Cloud

Unit 5: Distributed, Real-Time, and Multimedia Systems

Paperspace. Architecture Overview. 20 Jay St. Suite 312 Brooklyn, NY Technical Whitepaper

ActiveNET. #202, Manjeera Plaza, Opp: Aditya Park Inn, Ameerpetet HYD

EventStore A Data Management System

Difference Between Oracle Database Instance Vs

The Architecture of Inspur Cloud Storage

goals monitoring, fault tolerance, auto-recovery (thousands of low-cost machines) handle appends efficiently (no random writes & sequential reads)

ECE 650 Systems Programming & Engineering. Spring 2018

WHY SHOULD I USE OVM. For Oracle Databases By Francisco Munoz Alvarez Oracle Professional Services Manager

Utilizing Databases in Grid Engine 6.0

Transcription:

KEK HIGH ENERGY ACCELERATOR RESEARCH ORGANIZATION High Availability irods System (HAIRS) Yutaka Kawai, KEK Adil Hasan, ULiv March 25th, 2010 irods User Meeting @ UNC in Chapel Hill, USA -- Yutaka (KEK), Adil (ULiv)

Outline Introduction irods HA system with Director Large File Transfer Speed Performance Summary March 25th, 2010 irods User Meeting @ UNC in Chapel Hill, USA -- Yutaka (KEK), Adil (ULiv)

Introduction Replication enables high availability (HA) system for catalog service Replicate by back-end, i.e. irods Replicate by front-end; i.e. AMGA (ARDA [1] Metadata Grid Application) Metadata Catalogue of EGEE s glite 3.1 Middleware Back-end : Oracle, PostgreSQL, MySQL, SQLite http://amga.web.cern.ch/amga/ The current irods HA is implemented by replicating ICAT DB with PgPool tool [2] A problem when irods server fails Solve the problem by using Director March 25th, 2010 irods User Meeting @ UNC in Chapel Hill, USA -- Yutaka (KEK), Adil (ULiv)

The Current irods HA ICAT DB replication by Pgpool A ICAT irods Client Change the server info in.irodenv B ICAT Pgpool irods Server PostgreSQL March 25th, 2010 irods User Meeting @ UNC in Chapel Hill, USA -- Yutaka (KEK), Adil (ULiv)

Problem of the current HA Even if the irods server fails, clients still continue to access the same server without noticing the failure.? A ICAT irods Client Need to change server info in.irodenv B ICAT Pgpool irods Server PostgreSQL March 25th, 2010 irods User Meeting @ UNC in Chapel Hill, USA -- Yutaka (KEK), Adil (ULiv)

Solution by using Director Place a Director between Client and Server Monitor the irods server statuses Load balance to the irods servers A ICAT irods Client B Pgpool Director ICAT irods Server PostgreSQL March 25th, 2010 irods User Meeting @ UNC in Chapel Hill, USA -- Yutaka (KEK), Adil (ULiv)

How to Implement Director? UltraMonkey [3] Linux based director Low cost but not so high speed Need some steps to setup Hardware Director High cost and high speed Easy to setup (?) Cisco, HP, etc. March 25th, 2010 irods User Meeting @ UNC in Chapel Hill, USA -- Yutaka (KEK), Adil (ULiv)

UltraMonkey UltraMonkey consists of 3 components Linux Virtual Server (LVS) : Load balancing ldirectord : Monitoring real servers Linux-HA (LHA) : Monitoring directors LVS and ldirectord are used here LVS : Provide Virtual IP for load balance ldirectord : Monitoring irods service LHA : Future use for director redundancy March 25th, 2010 irods User Meeting @ UNC in Chapel Hill, USA -- Yutaka (KEK), Adil (ULiv)

Virtual IP for load balance irods Client.100 192.168.1.0/24 irods Client can specify only this VIP in.irodsenv.240.101.240 VIP 192.168.1.200 Linux Director 192.168.2.0/24.102 irods Real Severs Gateway of Real Servers is Director March 25th, 2010 irods User Meeting @ UNC in Chapel Hill, USA -- Yutaka (KEK), Adil (ULiv)

Monitoring irods service ldirector monitors irods real servers Polling server status via irods control port req: any string Port# 1247 Director ack: irods MsgHeader <MsgHeader_PI> <type>rods_version</type> <msglen>182</msglen> <errorlen>0</errorlen> <bslen>0</bslen> <intinfo>0</intinfo> </MsgHeader_PI> <Version_PI> <status>-4000</status> <relversion>rods2.1</relversion> <apiversion>d</apiversion> <reconnport>0</reconnport> <reconnaddr></reconnaddr> <cookie>0</cookie> </Version_PI> irods Server

Outline Introduction irods HA system with Director Large File Transfer Speed Performance Summary

Large File Transfer irods uses parallel ports to transfer a large file. Smaller than 32MB file is transferred through irods control port #1247. irods catalog server directs a server to open parallel ports to transfer a large file irods clients can directly connect with the server through the parallel ports.

Process of Large File Transfer Steps to transfer a large file in irods iput a large file (1) Find physical location to store A (2) ICAT irods Client File Transfer via Parallel I/O (4) irods Server w/o ICAT C Physical Data (3) irods Server Start service for Parallel I/O PostgreSQL

Large File Transfer w/ Director Need to confirm whether Director interferes in transferring a large file or not The physical storage should be located out of the local network of irods real servers Director handles only irods catalog server IP Director cannot manage all of the parallel ports

Process using Director Works as same as normal case Only one additional step between (1) and (2) (1) A ICAT irods Client (1) B (2) Pgpool (4) C Director (3) irods Server ICAT PostgreSQL irods Server w/o ICAT Physical Data

Outline Introduction irods HA system with Director Large File Transfer Speed Performance Summary

Speed Performance Test Program concurrent-test in irods package iput, imeta, iget, imv 1000 entries Servers are VMs (Xen) on same physical machine Client is located on the different machine Test Case Case1: Normal case. Client directly accesses one irods server. Case2: Using a director. Client accesses one irods server via Director. Case3: Load sharing case. Client accesses two irods servers via Director.

Speed Performance (cont d) Using a Director (Case2) About 10% slower than no Director (Case1) Reasonable to consider tradeoff between speed and availability

Speed Performance (cont d) Load sharing case (Case3) About 5% slower than Case2 The concurrent-test is not suitable under such a Load balanced system. Need a program using multi-clients or multi-threading methods.

Opinions in this study Network limitation Director works as NAT. Difficult to place irods catalog servers in different subnets. But the problem depends on NAT technology. We hope some NAT vender can implement extensions. Speed Performance The concurrent-test consumes overhead. The result 10% slow is in one of the worst cases. We may see less than 10% in actual uses. PostgreSQL only? How about other DB services? They have the same tools as PgPool? Back-end replication is enough? Front-end replication should be considered for irods?

Summary irods HA system The current approach using only PgPool The new approach using Director The new one can solve the current problem Large File Transfer irods large file transfer works well when using Director Speed Performance Director results in the speed performance of concurrenttest getting slower 10% Future works Apply this solution to other catalog services

References [1] : ARDA is A Realization of Distributed Analysis for LHC, http://lcg.web.cern.ch/lcg/activities/arda/arda.html [2] : irods High Avaliability, https://www.irods.org/index.php/irods_high_avaliability [3] : Ultra Monkey project, http://www.ultramonkey.org/ [4] : citation from abstract of Resource Namespace Service Specification, https://forge.gridforum.org/sf/go/doc8272 [5] : http://www.cs.virginia.edu/~vcgr/wiki/index.php/ Understanding_Your_Genesis_II_Distribution#RNS_Namespa ce

Back up

Outline Introduction irods HA system with Director Large File Transfer Speed Performance Future works (apply to RNS application) Summary

Outline Introduction irods HA system with Director Large File Transfer Speed Performance Future works (apply to RNS application) Summary

Outline Introduction irods HA system with Director Large File Transfer Speed Performance Future works (apply to RNS application) Summary

Outline Introduction irods HA system with Director Large File Transfer Speed Performance Future works (apply to RNS application) Summary

What is RNS? RNS : Resource Namespace Service RNS offers a simple standard way of mapping names to endpoints within a grid or distributed network [4] The latest version is available here; https://forge.gridforum.org/sf/go/doc8272 Java based RNS application is being developed by Osaka University and Tsukuba University This application is similar to irods The other kind of RNS application is Grid Shell of Genesis II by The Virginia Center for Grid Research (VCGR) [5].

Apply to RNS application?? Derby can do replication? http://wiki.apache.org/db-derby/replicationwriteup No load-sharing in the above example DB RNS Client? Replication Director DB RNS Server Derby

Issues in RNS application Several issues to be solved Derby is not enough to work replication as same as using PostgreSQL w/pgpool Need some developments to replace Derby by PostgreSQL The catalog implementation in the current RNS application has specific IP addresses