A time machine for the OCDB

Similar documents
ALICE ANALYSIS PRESERVATION. Mihaela Gheata DASPOS/DPHEP7 workshop

Offline Tutorial I. Małgorzata Janik Łukasz Graczykowski. Warsaw University of Technology

CedCommerce. All rights reserved.

Source Control. Deployment. What usync Does. usync Versions

Using Dell Repository Manager to Manage Your Repositories Efficiently

What s New in Laserfiche 10

DNA Evolution 4.0 Workflow Guide for Automated Third-party Preview Generation

Ios 7 Manual Restore From Icloud Slow >>>CLICK HERE<<<


BOSS and LHC computing using CernVM and BOINC

New Import/Export Features in Maximo 7.5

9 th CA 2E/CA Plex Worldwide Developer Conference 1

WOJCIECH FROELICH CTO ARGOS MULTILINGUAL

BMS Managing Users in Modelpedia V1.1

The Salesforce Migration Playbook

Localization of AOO proposal for new workflow

JBOT (Juniper Bot) Oct Pablo Sagrera Efrain Gonzalez Copyright 2014 Juniper Networks, Inc.

PHOTO GALLERY. USER GUIDE by Decima Digital. d e c i m a d i g i t a l. c o m

We offer our customers the opportunity to get 100 words translated in 30 minutes by one of our expert translators.

The CMS data quality monitoring software: experience and future prospects

CALIFORNIA EARTHQUAKE CLEARINGHOUSE. Tools for Mobile Data Collection

1. Which of these Git client commands creates a copy of the repository and a working directory in the client s workspace. (Choose one.

Data services for LHC computing

Extending Blaise Capabilities in Complex Data Collections

Migration from Metalogix Archive Manager (MAM) to Enterprise Vault (EV)

Managing Modular Infrastructure by using OpenManage Essentials (OME)

ZFS. Right Now! Jeff Bonwick Sun Fellow

ispring Suite 8. The all-in-one development tool that every e-learner must have

FAQ on Real Time Online Data connectivity to APPCB server

DVCS software and analysis tutorial

Getting Started Guide

Manual Trigger Sql Server 2008 Examples Insert Update

Validating Your PSQL Database Backups

Lab 1: Adding a Rhapsody Model to RMM

Mail & Deploy Reference Manual. Version 2.0.5

OnCommand Unified Manager 6.0

Journaling. CS 161: Lecture 14 4/4/17

Synology Alex Wang CEO, Synology America

Computing at Belle II

Data Protection Guide

Managed Backup Service VMware Console Plug-In and ESX Server Agent Version Release Notes

Data Analysis in ATLAS. Graeme Stewart with thanks to Attila Krasznahorkay and Johannes Elmsheuser

USER MANUAL. SEO Hub TABLE OF CONTENTS. Version: 0.1.1

Deccansoft Software Services

VMware Mirage Web Management Guide. VMware Mirage 5.9.1

Proposals for a New Workflow for Level-4 Content

EVALUATION COPY. Unauthorized Reproduction or Distribution Prohibited SHAREPOINT 2013 END USER

SLI Learning Search Connect For Magento 2

Welcome Back! Without further delay, let s get started! First Things First. If you haven t done it already, download Turbo Lister from ebay.

Data Protection Guide

Getting Started with Multilizer Day Evaluation

Automating Essbase Outline Extracts to a Relational Database Using the Next Generation Outline Extractor

Version is the follow-on release after version 8.1, featuring:

Accuracy Assessment of Ames Stereo Pipeline Derived DEMs Using a Weighted Spatial Dependence Model

Installation Guide Advanced Authentication - Logon Filter. Version 6.1

openlca user guide for the 1.4 beta version

A Tool for Conditions Tag Management in ATLAS

IBM Endpoint Manager. OS Deployment V3.5 User's Guide

Galaxy Pasteur. Patchwork of experiences and improvements. Olivia Doppelt-Azeroual, Sophie Créno et Fabien Mareuil CIB, Institut Pasteur, Paris

Active Directory Synchronisation

COS 318: Operating Systems. NSF, Snapshot, Dedup and Review

Agility on Steroids with a Docker-based CI/CD pipeline on AWS. Jon Christensen July 19, 2018

Project #4: Implementing NFS

Analysis of Σ 0 baryon, or other particles, or detector outputs from the grid data at ALICE

cdiscount version BoostMyShop

Brief Overview of the new features of Turbostore 7x24 TrueOnline

Datazen. Bent On-premise mobile BI. November 28, #sqlsatparma #sqlsat462

Printing Drafts in Outlook Showing Date sent Asked by: OntarioMedicalAssociatin

Integrate the Cisco Identity Services Engine

skype ID: store.belvg US phone number:

What's New New Features in Primavera P6 EPPM 18

Lessons learned while automating MySQL in the AWS cloud. Stephane Combaudon DB Engineer - Slice

Product Release Summary AVEVA LFM NetView

Using SDL Trados Studio with SDL TMS Quick Start Guide.

Polarion Enterprise Setup 17.2

MB Microsoft. Microsoft Dynamics 365 customer engagement Online Deployment

To the Cloud and Back: A Distributed Photo Processing Pipeline

Version Control. Second level Third level Fourth level Fifth level. - Software Development Project. January 17, 2018

[GSoC Proposal] Securing Airavata API

More info: Complete demo of CoZone Collaborate:

Backup and Restore Operations

- Evergreen Reports Training Session - Handouts. September 29, 2016 Hermiston Public Library

The modusgate console is composed of five configuration modules, these are described in the following table:

Polarion 18 Enterprise Setup

Transaction Management in Fully Temporal System

RADU POPESCU IMPROVING THE WRITE SCALABILITY OF THE CERNVM FILE SYSTEM WITH ERLANG/OTP

TOP Server with Telemetry Systems. Real Time and EFM Data Collection Best Practices

A Guide to Finding the Best WordPress Backup Plugin: 10 Must-Have Features

TPF Debugger / Toolkit update PUT 12 contributions!

High Availability iscsi diskless boot solution

Calibrating your monitor

Core DDI Basics NIOS 8.1

One of the fundamental kinds of websites that SharePoint 2010 allows

EGI-InSPIRE RI NGI_IBERGRID ROD. G. Borges et al. Ibergrid Operations Centre LIP IFCA CESGA

versiondog Export Module

Printer and Driver Management

Lightweight scheduling of elastic analysis containers in a competitive cloud environment: a Docked Analysis Facility for ALICE

Managing Native Snapshots

Polarion 18.2 Enterprise Setup

Configuring Artifactory

Transcription:

A time machine for the OCDB Dario Berzano ALICE Offline Week - July 19, 2017

OCDB source: AliEn Primary source of OCDB Calibration data in multiple ROOT files One XML file mapping run ranges to years Accessed by using raw:// as OCDB storage in AliRoot Multiple files to be accessed remotely: slow, causes large memory footprint No time machine feature: cannot select the OCDB status at a given moment in time!2

OCDB source: CVMFS As authoritative as AliEn: live sync upon AliEn updates, and has caching for faster access Directory structure identical to AliEn, except for Monte Carlos (with no reason ): AliEn: /alice/simulation/2008/v4-15-release/{full,ideal,residual} CVMFS: /cvmfs/alice-ocdb.cern.ch/calibration/mc/{full,ideal,residual} Fixes to make it work for real: alisw/aliroot#232 + alisw/alidpg#31 (usable from v5-09-10) Accessible as an alternative to AliEn by simply doing the following two steps together: Storage set to raw:// (same as AliEn) Environment variable exported: OCDB_PATH=/cvmfs/alice-ocdb.cern.ch Has native snapshotting feature, but it s still not usable in practice, see later on!3

OCDB source: snapshots Snapshots: make sure all jobs of same set will use the exact same OCDB A single dummy Grid job is run first to see what OCDB objects are actually used Objects used only are stored in a ROOT file This single ROOT file is the OCDB source for all jobs of the same set Solves the access problem: ROOT file is made available in the input box, no remote access!4

Current proposals for time machine snapshots Current snapshots contain only the objects, at a given moment in time, that serve the purpose of a single set of jobs Time machine snapshots: access the full OCDB as it was at a given moment in time We are file-based: just filter in only the files ones available at the selected time Three major proposals: Work from Raffaele (and an earlier implementation from me): save list of files at the time Current proposal: rely on files timestamps Native CVMFS snapshots and tagging!5

Snapshotting by saving list of files OCDB experts will start a procedure when they want the snapshot to be taken The procedure saves the result of a find command on the whole tree to a file This file is then used by AliRoot to know what files to consider Implementation problems: Requires manual intervention Requires an additional package where to store the list of files!6

Snapshotting by using files timestamps OCDB users will request with AliRoot the given timestamp It can be as easy as AliCDBManager::SetTimestamp(), and that s it The CDB manager will filter out all files newer than that date Works both on current CVMFS and AliEn Requires at most two full days of development: we already have all that s needed Relies on two pillars: File creation timestamps are reliable No OCDB file is ever deleted (a new version is uploaded instead) Only disadvantage: we would have to use timestamps instead of names But: easy workarounds can be made, if names are really important!7

Snapshotting by using CVMFS native features Whenever a CVMFS upload occurs, a new snapshot is created: no file is ever deleted Snapshots can be given a name (they can be tagged) Always possible to mount a CMVFS snapshot/tag from the past, by setting the configuration variable CVMFS_REPOSITORY_TAG currently accessible only as root user, and affects all users on the node :-( Proposal from the CVMFS team: make all snapshots, or tags, or both, accessible as special directory trees: /cvmfs/alice-ocdb.cern.ch/.tags/myspecialtag/ This seems to be the ideal solution, however time is a tyrant: Needs implementation, and testing: feature is not yet implemented Needs deployment: wait for Grid sites to use a CVMFS version supporting this feature!8

Where do we go?!9