Supporting Application- Tailored Grid File System Sessions with WSRF-Based Services

Similar documents
Distributed File System Support for Virtual Machines in Grid Computing

A User-level Secure Grid File System

Supporting Application-Tailored Grid File System Sessions with WSRF-Based Services

Jithendar Paladugula, Ming Zhao, Renato Figueiredo

Distributed File System Virtualization Techniques Supporting On-Demand Virtual Machine Environments for Grid Computing

FILE SYSTEM VIRTUALIZATION AND SERVICE FOR GRID DATA MANAGEMENT

Science gateways made easy: the In-VIGO approach

Virtual Machine Systems

NFSv4 as the Building Block for Fault Tolerant Applications

Experimental Study of Virtual Machine Migration in Support of Reservation of Cluster Resources

Distributed System. Gang Wu. Spring,2018

Chapter 4:- Introduction to Grid and its Evolution. Prepared By:- NITIN PANDYA Assistant Professor SVBIT.

IOS: A Middleware for Decentralized Distributed Computing

Service and Cloud Computing Lecture 10: DFS2 Prof. George Baciu PQ838

iscsi Target Usage Guide December 15, 2017

An Empirical Study of High Availability in Stream Processing Systems

Introduction to Grid Computing

Chapter 10 DISTRIBUTED OBJECT-BASED SYSTEMS

Ming Zhao ACIS Laboratory University of Florida Jithendar Paladugula ACIS Laboratory University of Florida

Hedvig as backup target for Veeam

High Throughput WAN Data Transfer with Hadoop-based Storage

Single Sign-On in In-VIGO: Role-based Access via Delegation Mechanisms Using Short-lived User Identities

Day 1 : August (Thursday) An overview of Globus Toolkit 2.4

Chapter 18: Database System Architectures.! Centralized Systems! Client--Server Systems! Parallel Systems! Distributed Systems!

Distributed Systems 16. Distributed File Systems II

THE GLOBUS PROJECT. White Paper. GridFTP. Universal Data Transfer for the Grid

Chapter 20: Database System Architectures

The Google File System

Science Computing Clouds.

DISTRIBUTED SYSTEMS Principles and Paradigms Second Edition ANDREW S. TANENBAUM MAARTEN VAN STEEN. Chapter 1. Introduction

Current Topics in OS Research. So, what s hot?

Lecture 23 Database System Architectures

Write a technical report Present your results Write a workshop/conference paper (optional) Could be a real system, simulation and/or theoretical

Sky Computing on FutureGrid and Grid 5000 with Nimbus. Pierre Riteau Université de Rennes 1, IRISA INRIA Rennes Bretagne Atlantique Rennes, France

Today CSCI Coda. Naming: Volumes. Coda GFS PAST. Instructor: Abhishek Chandra. Main Goals: Volume is a subtree in the naming space

Distributed File Systems. CS432: Distributed Systems Spring 2017

MySQL Multi-Site/Multi-Master Done Right

Disaster Recovery-to-the- Cloud Best Practices

ECE 7650 Scalable and Secure Internet Services and Architecture ---- A Systems Perspective

GFS: The Google File System

Cloud Computing CS

Concepts of Distributed Systems 2006/2007

Advanced Topics in Operating Systems

ECE 7650 Scalable and Secure Internet Services and Architecture ---- A Systems Perspective

Support for Urgent Computing Based on Resource Virtualization

Understanding StoRM: from introduction to internals

Distributed File Systems II

Introduction. Distributed Systems IT332

SQL Azure. Abhay Parekh Microsoft Corporation

Outline. ASP 2012 Grid School

A Case for High Performance Computing with Virtual Machines

! Replication comes with consistency cost: ! Reasons for replication: Better performance and availability. ! Replication transforms client-server

Google File System, Replication. Amin Vahdat CSE 123b May 23, 2006

Database Assessment for PDMS

Maestro-VC: On-Demand Secure Cluster Computing Using Virtualization

Send me up to 5 good questions in your opinion, I ll use top ones Via direct message at slack. Can be a group effort. Try to add some explanation.

Fault Tolerance. Goals: transparent: mask (i.e., completely recover from) all failures, or predictable: exhibit a well defined failure behavior

Bigtable. A Distributed Storage System for Structured Data. Presenter: Yunming Zhang Conglong Li. Saturday, September 21, 13

CloudBATCH: A Batch Job Queuing System on Clouds with Hadoop and HBase. Chen Zhang Hans De Sterck University of Waterloo

DISTRIBUTED SYSTEMS. Second Edition. Andrew S. Tanenbaum Maarten Van Steen. Vrije Universiteit Amsterdam, 7'he Netherlands PEARSON.

By Ian Foster. Zhifeng Yun

Grid Computing Middleware. Definitions & functions Middleware components Globus glite

DISTRIBUTED FILE SYSTEMS & NFS

MCSA Windows Server 2012 Configuring Advanced Services

Atrium Webinar- What's new in ADDM Version 10

Administrative Details. CS 140 Final Review Session. Pre-Midterm. Plan For Today. Disks + I/O. Pre-Midterm, cont.

Adaptive Cluster Computing using JavaSpaces

High Availability/ Clustering with Zend Platform

Distributed File Systems Issues. NFS (Network File System) AFS: Namespace. The Andrew File System (AFS) Operating Systems 11/19/2012 CSC 256/456 1

The Google File System. Alexandru Costan

Layered Architecture

Design The way components fit together

Distributed File Systems. Case Studies: Sprite Coda

Overview. Prerequisites. VMware vsphere 6.5 Optimize, Upgrade, Troubleshoot

University of Wisconsin-Madison

VioCluster: Virtualization for Dynamic Computational Domains

Staggeringly Large Filesystems

Managing Update Conflicts in Bayou. Lucy Youxuan Jiang, Hiwot Tadese Kassa

Designing Next Generation Data-Centers with Advanced Communication Protocols and Systems Services

Application-Transparent Checkpoint/Restart for MPI Programs over InfiniBand

Globus Toolkit 4 Execution Management. Alexandra Jimborean International School of Informatics Hagenberg, 2009

Chapter 18 Distributed Systems and Web Services

Distributed Systems. Hajussüsteemid MTAT Distributed File Systems. (slides: adopted from Meelis Roos DS12 course) 1/25

Large Scale Sky Computing Applications with Nimbus

Design The way components fit together

The Google File System (GFS)

Multiprocessor Scheduling. Multiprocessor Scheduling

Microsoft Windows Embedded Server Overview

Xen and the Art of Virtualization. CSE-291 (Cloud Computing) Fall 2016

Cycle Sharing Systems

Voldemort. Smruti R. Sarangi. Department of Computer Science Indian Institute of Technology New Delhi, India. Overview Design Evaluation

Cluster-Based Scalable Network Services

Chapter 17: Distributed Systems (DS)

HPC learning using Cloud infrastructure

Fault Tolerance Causes of failure: process failure machine failure network failure Goals: transparent: mask (i.e., completely recover from) all

IBM Tivoli Access Manager for e-business V6.1.1 Implementation

MySQL As A Service. Operationalizing 19 Years of Infrastructure at GoDaddy

Science Clouds: Early Experiences in Cloud Computing for Scientific Applications

Engineering Fault-Tolerant TCP/IP servers using FT-TCP. Dmitrii Zagorodnov University of California San Diego

GFS: The Google File System. Dr. Yingwu Zhu

Transcription:

Supporting Application- Tailored Grid File System Sessions with WSRF-Based Services Ming Zhao, Vineet Chadha, Renato Figueiredo Advanced Computing and Information Systems Electrical and Computer Engineering University of Florida Advanced Computing and Information Systems laboratory

Motivating Example Shared file system facilitates communication and synchronization between coupled applications Regional-scale, curvilinear simulation grid of Tampa Bay Curvilinear-grid Hydrodynamics 3D model CH3D Every 30 timesteps 1.5MB per exchange Every 30 timesteps 1.8MB per exchange SWAN Simulating WAves Nearshore model Tampa Bay Basin-scale, unstructured ADCIRC simulation grid Every timesteps 40KB per exchange ADCIRC ADvanced CIRCulation model for coastal waters Coastal surge coupled modeling Advanced Computing and Information Systems laboratory 2

Motivating Example Shared file system facilitates communication and synchronization between coupled applications Distributed file systems in wide-area environments? LAN file systems have shortcomings WAN file systems not widely deployed Curvilinear-grid Hydrodynamics 3D model UFL CH3D Every timesteps 40KB per exchange Every 30 timesteps 1.5MB per exchange WAN Every 30 timesteps 1.8MB per exchange ADCIRC SWAN Simulating WAves Nearshore model LSU ADvanced CIRCulation model for coastal waters VIMS Coastal surge coupled modeling Advanced Computing and Information Systems laboratory 3

Motivating Example Grid Virtual File System (GVFS) Virtualization, user-level proxy, unmodified kernel NFS Cross-domain user identity mapping Performance, security, consistency, reliability enhancements Dynamic, independent, application-tailored GVFS sessions UFL CH3D GVFS Session1 WAN SWAN LSU GVFS Session2 ADCIRC VIMS Coastal surge coupled modeling Advanced Computing and Information Systems laboratory 4

Motivating Example How to manage Grid data sessions Creation, cleanup, isolation, customization WSRF-based data management services Interoperability, flexibility, state management UFL CH3D GVFS Session1 WAN SWAN LSU Service GVFS Session2 ADCIRC VIMS Coastal surge coupled modeling Advanced Computing and Information Systems laboratory 5

Overview Goal: Seamless and high-performance data provision for applications in Grid environments Challenges: Application transparency for Grid-enabling of a wide range of applications Application-tailored enhancements on performance and reliability for diverse application needs Contributions: WSRF-based data management services Enabling of application-tailored grid data sessions Advanced Computing and Information Systems laboratory 6

Outline Introduction Architecture Data Access: Application-Tailored Sessions Control: Data Management Services Evaluation Summary Advanced Computing and Information Systems laboratory 7

Architecture Service Job Job Compute server Data plane -based data sessions Control plane Data management services Service Service data Service File server Job Job Compute server Advanced Computing and Information Systems laboratory 8

Application-Tailored Data Sessions Grid data access Implicit: GVFS proxy RPC interception Partial file transfer, block-based disk caching Configurable parameters: Capacity, associativity, read/write, write-through/-back Security mechanisms Session-key authentication, encrypted data channel Explicit: GridFTP/SFTP Full file transfer, file-based disk caching Data accessible through GVFS interface Cache consistency models Fault tolerance techniques Advanced Computing and Information Systems laboratory 9

Cache Consistency Models Per-session customization Overlaid upon native NFS client polling mechanism Reconfigurable at run-time Suitable for various scenarios Single-client sessions: Aggressive read/write caching with write delay Multiple-client sessions: Relaxed polling-based model Strong callback-based model kernel user System calls APP NFS client RPCs to localhost RPCs to server Advanced Computing and Information Systems laboratory 10

Fault Tolerance Copy-on-write file system Fail-over client failures Buffers data modifications on local stable storage Application checkpointed with file system changes consistently Session redirection Fail-over server failures Fault detected by RPC timeout Subsequent requests redirected to replica server remaps file handles transparently from kernel kernel user System calls APP NFS client RPCs to loalhost RPCs to server X COW sever RPC redirection shadow files Advanced Computing and Information Systems laboratory 11

Architecture Service Job Job Compute server Data plane -based data sessions Control plane Data management services Service Service data Service File server Job Job Compute server Advanced Computing and Information Systems laboratory 12

Data Management Services Service oriented middleware Creation, customization, management of sessions File System Service (FSS) Data Scheduler Service (DSS) Data Replication Service (DRS) Built using WS-Resource Framework Interoperability and state management Implemented with Perl-based WSRF::Lite WS-Addressing, WS-ResourceProperties, WS- ResourceLifetime, WS-BaseFaults, WS-Security etc. Advanced Computing and Information Systems laboratory 13

File System Service (FSS) Management of GVFS proxies Customization Defined in a configuration file Represented as WS-Resource Property Reconfiguration: By signaling proxy to reload configuration file Monitoring: By signaling proxy to report accumulated statistics Configuration File base_path /home/ session_key XXYYZZ a_enabled 1 d_enabled 1 wb_enabled 1 inval_enabled 1 a_size 65536 a_asso 8 a_banks 128 d_size 1048576 d_asso 16 d_banks 512 inval_min 3 inval_max 60 Advanced Computing and Information Systems laboratory 14

Data Scheduler Service (DSS) Manages Grid data sessions Interacting with client- and server-side FSS Session information Represented as WS-Resource Property Stored in MySQL database Resolves conflicts when scheduling a session If another session accesses with write caching Forces it to write back and disable write caching If another session has exclusive access Denies the new session request Advanced Computing and Information Systems laboratory 15

Data Replication Service (DRS) Manages data replication Replica information represented as WS-Resource Property, stored in MySQL database Interacts with DSS for replication and recovery Replication: requests a session for data transfer Recovery: provides replica information to client FSS Supports various consistency schemes Uses COW to avoid propagation of writes Active-style or primary-based Advanced Computing and Information Systems laboratory 16

Example FSS Job Job Compute server Data Scheduler Service (DSS) Session management File System Service (FSS) management Data Replication Service (DRS) Replication management FSS Service client e.g. In-VIGO FSS DSS File server data Job Job DRS FSS replica Compute server File server Advanced Computing and Information Systems laboratory 17

Outline Introduction Architecture Evaluation Summary Advanced Computing and Information Systems laboratory 18

Weak Consistency: Experiment I Benchmark: NanoMOS (MATLAB-based 2- D n-mosfet simulator) Scenario: Software accessed by WAN users and updated by local administrator: (a) Major: entire MATLAB (b) Minor: one MATLAB toolbox NFS vs. GVFS Observation: With warm disk GVFS Filters substantial kernel issued consistency checks Delivers performance close to local disk Runtime (seconds) Runtime (seconds) 700 600 500 400 300 200 100 0 700 600 500 400 300 200 100 0 NFS GVFS LOCAL Major update Minor update (a) (b) 1 2 3 4 5 6 7 8 Execution Iterations Advanced Computing and Information Systems laboratory 19

Weak Consistency: Experiment II Benchmark: CH1D (coupled hydrodynamics simulation and post-processing) 80 70 NFS GVFS Scenario: Real-time data accumulated onsite, and processed off-site 30 new inputs available before each run of data processing NFS vs. GVFS Runtime (seconds) 60 50 40 30 20 10 0 1 2 3 4 5 6 7 8 9 10 Observation: As input dataset grows overhead caused by consistency checks: Grows linearly in native NFS, Stays constant in GVFS Execution Iterations Advanced Computing and Information Systems laboratory 20

Checkpointing and Recovery Application: Gaussian (computational chemistry tool) Scenario: Client (a virtual machine) is checkpointed, continues to execute and later fails The program changes the state of the file server irreversibly by deleting temporary files after the checkpointing Observation: When the VM is resumed to the checkpoint: Native NFS: stale file handle error; program aborts GVFS with COW: program recovered successfully Advanced Computing and Information Systems laboratory 21

Error Detection and Data Redirection Application: SPECseis96 (seismic data processing) Scenario: File server fails during the program s execution Observation: Upon native NFS: program fails (aborts or hangs) Upon GVFS and data replica: detected the error after a RPC timeout The data request is redirected to the replica within 5 seconds Program continues successfully and is unaware of the failure Advanced Computing and Information Systems laboratory 22

Summary Problem: Application-transparent and application-tailored Grid data access Solution: WSRF-based data management services for application-tailored Grid file system sessions Evidence: Experiments based on scientific application execution demonstrate good performance and fault tolerance of GVFS Advanced Computing and Information Systems laboratory 23

Related Work Grid data management approaches GASS, GridFTP Explicit transfer via middleware or use of specialized API Condor, BAD-FS On-demand remote data access by interception of system call Control caching, consistency and fault tolerance to middleware LegionFS, Avaki s Data Grid Access Servers Access of Grid data based on NFS WSRF-based Grid middleware Globus Toolkit 4 based data management middleware WSRF.NET based (remote job execution grid) WSRF::Lite based (WEDS) Advanced Computing and Information Systems laboratory 24

Acknowledgments In-VIGO team http://invigo.acis.ufl.edu Dr. Peter Dinda Dr. Peter Sheng, SCOOP resources NSF Middleware Initiative NSF Research Resources IBM Shared University Research VMware Questions? Advanced Computing and Information Systems laboratory 25

References [FGCS 05] S. Adabala, V. Chadha, P. Chawla, R. Figueiredo, J. Fortes, I. Krsul, A. Matsunaga, M. Tsugawa, J. Zhang, M. Zhao, L. Zhu, and X. Zhu, From Virtualized Resources to Virtual Computing Grids: The In-VIGO System, special issue on Complex Problem-Solving Environments for Grid Computing, Vol 21/6, 2005. [CC 04 ] R. Figueiredo, N. Kapadia, J. Fortes, Seamless Access to Decentralized Storage Services in Computational Grids via a Virtual File System, In Cluster Computing, 2004. [HPDC 04] M. Zhao, R. Figueiredo, Distributed File System Support for Virtual Machines in Grid Computing, In Proceedings of 13th IEEE International Symposium on High Performance Distributed Computing, June 2004. WSRF::Lite: An Implementation of the Web Services Resource Framework http://www.sve.man.ac.uk/research/atoz/ilct In-VIGO: In-VIGO prototype can be accessed from http://invigo.acis.ufl.edu; courtesy accounts available. Advanced Computing and Information Systems laboratory 26

Future Work Extensive evaluation and performance tuning of service-based middleware Use of application profiling to assist the customization of Grid data sessions Fine grained replication management and load balancing schemes Advanced Computing and Information Systems laboratory 27