Using CernVM-FS to deploy Euclid processing S/W on Science Data Centres

Similar documents
BIDS 2016 Santa Cruz de Tenerife

Euclid Science Ground Segment (SGS) Processing Operations Concept

THE EUCLID ARCHIVE SYSTEM: A DATA-CENTRIC APPROACH TO BIG DATA

Application of Virtualization Technologies & CernVM. Benedikt Hegner CERN

CernVM-FS beyond LHC computing

THE EUCLID ARCHIVE SYSTEM: A DATA-CENTRIC APPROACH TO BIG DATA

RADU POPESCU IMPROVING THE WRITE SCALABILITY OF THE CERNVM FILE SYSTEM WITH ERLANG/OTP

Euclid Archive Science Archive System

Global Software Distribution with CernVM-FS

Euclid Consortium The Euclid Challenges. Pierre Dubath SDC-CH Department of Astronomy University of Geneva. IAU Symposium 325.

Scientific data processing at global scale The LHC Computing Grid. fabio hernandez

CernVM-FS. Catalin Condurache STFC RAL UK

Security in the CernVM File System and the Frontier Distributed Database Caching System

Evolution of Cloud Computing in ATLAS

Using Puppet to contextualize computing resources for ATLAS analysis on Google Compute Engine

Your cloud solution for EO Data access and processing

STATUS OF PLANS TO USE CONTAINERS IN THE WORLDWIDE LHC COMPUTING GRID

Version Control for PL/SQL

ATLAS Nightly Build System Upgrade

Euclid Mission Database

The Euclid Ground Segment Design for File-based Operations

Singularity tests at CC-IN2P3 for Atlas

Conference The Data Challenges of the LHC. Reda Tafirout, TRIUMF

Developing and Testing Java Microservices on Docker. Todd Fasullo Dir. Engineering

ELFms industrialisation plans

How can the Future Internet

On the EGI Operational Level Agreement Framework

COURSE OUTLINE MOC : PLANNING AND ADMINISTERING SHAREPOINT 2016

CernVM a virtual software appliance for LHC applications

Coupled Computing and Data Analytics to support Science EGI Viewpoint Yannick Legré, EGI.eu Director

Helix Nebula, the Science Cloud

Desktop Central Managing Windows Computers in WAN

Cloud du CCIN2P3 pour l' ATLAS VO

University of Groningen

Data services for LHC computing

Team Foundation Consulting. Training. Team Member Training User training designed to cater for specific roles within your team. Developer TFVC / Git

D5.5. Open Source Client Library and Server Tools for Delegations. Document Identification. Final UBISECURE, OIX

[MS20414]: Implementing an Advanced Server Infrastructure

Moving e-infrastructure into a new era the FP7 challenge

NET EXPERT SOLUTIONS PVT LTD

Basic Concepts of the Energy Lab 2.0 Co-Simulation Platform

WLCG SOC Working Group

ThinkPalm s BreakThrough DevOps Capabilities ThinkPalm

Continuous integration & continuous delivery. COSC345 Software Engineering

Version control system (VCS)

AMM Feb/2018. Frederic Marec Embedded Engineer

Co-ReSyF Hands-on sessions

Linux System Management with Puppet, Gitlab, and R10k. Scott Nolin, SSEC Technical Computing 22 June 2017

Software installation and condition data distribution via CernVM File System in ATLAS

MONTE CARLO SIMULATION FOR RADIOTHERAPY IN A DISTRIBUTED COMPUTING ENVIRONMENT

Install your scientific software stack easily with Spack

CONTINUOUS DELIVERY IN THE ORACLE CLOUD

MCTS Guide to Microsoft Windows Server 2008 Applications Infrastructure Configuration (Exam # ) Chapter One Introducing Windows Server 2008

XSEDE s Campus Bridging Project Jim Ferguson National Institute for Computational Sciences

DevOps Course Content

Continuous Delivery of Micro Applications with Jenkins, Docker & Kubernetes at Apollo

SBB. Java User Group 27.9 & Tobias Denzler, Philipp Oser

Quick Prototyping+CI with LXC and Puppet

Troubleshoot PCRF OAM VM Recovery- Openstack

Containers, Serverless and Functions in a nutshell. Eugene Fedorenko

SEVENMENTOR TRAINING PVT.LTD

Giovanni Lamanna LAPP - Laboratoire d'annecy-le-vieux de Physique des Particules, Université de Savoie, CNRS/IN2P3, Annecy-le-Vieux, France

The Main Concepts of the European Ground Systems Common Core (EGS-CC)

MCSA: Windows Server MCSA 2016 Windows 2016 Server 2016 MCSA 2016 MCSA : Installation, Storage, and Compute with Windows Server 2016

IN2P3-CC cloud computing (IAAS) status FJPPL Feb 9-11th 2016

MDS1 SESAR. The Single European Sky Programme DG TREN

Modeling & Simulation as a Service (M&SaaS)

Virtualization. A very short summary by Owen Synge

Azure DevOps. Randy Pagels Intelligent Cloud Technical Specialist Great Lakes Region

Course : Planning and Administering SharePoint 2016

Nexus for Component Management. Nexus for Component Management

Advanced Continuous Delivery Strategies for Containerized Applications Using DC/OS

ONAP Developer Typical Setup 2017 July ONAP Virtual Developers Event

Helix Nebula The Science Cloud

DEVOPS COURSE CONTENT

Performance-related aspects in the Big Data Astronomy Era: architects in software optimization

Development and Operations: Continuous Delivery in Practice

Distributed Archive System for the Cherenkov Telescope Array

ZenHub Enterprise. ZenHub Enterprise. Features & Implementation Outline

Course Outline 20743B

Virtualization of the ATLAS Tier-2/3 environment on the HPC cluster NEMO

AEM Code Promotion and Content Synchronization Best Practices

Database Services at CERN with Oracle 10g RAC and ASM on Commodity HW

LSST software stack and deployment on other architectures. William O Mullane for Andy Connolly with material from Owen Boberg

The Long Road from Capistrano to Kubernetes

Scheduling Computational and Storage Resources on the NRP

AWS Course Syllabus. Linux Fundamentals. Installation and Initialization:

MODERN APPLICATION ARCHITECTURE DEMO. Wanja Pernath EMEA Partner Enablement Manager, Middleware & OpenShift

A: PLANNING AND ADMINISTERING SHAREPOINT 2016

Database Engineering. Percona Live, Amsterdam, September, 2015

Fosdem Feb/2018. Frederic Marec Embedded Engineer

VC3. Virtual Clusters for Community Computation. DOE NGNS PI Meeting September 27-28, 2017

Implementing the Twelve-Factor App Methodology for Developing Cloud- Native Applications

[MS20487]: Developing Windows Azure and Web Services

Oracle Linux, Virtualization & OEM12 Discussion Sahil Mahajan / Sundeep Dhall

Fuel VMware DVS plugin testing documentation

Continuous Integration and Delivery with Spinnaker

From Bare Metal to Cloud. Andy ICCLab, ZHAW Piotr Kasprzak, GWDG

Data federations and its applications

Implementing an Advanced Server Infrastructure

Transcription:

Using CernVM-FS to deploy Euclid processing S/W on Science Data Centres M. Poncet (CNES) Q. Le Boulc h (IN2P3) M. Holliman (ROE) On behalf of Euclid EC SGS System Team ADASS 2016 1

Outline Euclid Project & SGS From Euclid pipeline to SGS Architecture From source code to processing nodes CernVM-FS principles Using CernVM-FS on SDCs 2

Euclid Project & SGS From Euclid pipeline to SGS Architecture From source code to processing nodes CernVM-FS principles Using CernVM-FS on SDCs 3

Euclid project M2 mission in the framework of the ESA Cosmic Vision Programme Euclid mission objective is to map the geometry and understand the nature of the dark Universe (dark energy and dark matter) Actors in the mission: ESA and the Euclid Consortium (institutes from 15 European countries and USA, funded by their own national Space Agencies) Euclid Consortium: 15 countries 100+ labs 1200+ members One of the biggest collaboration! For more information see : O1.2, O8.5, P3.5, P8.13, P1.16, P2.13, P8.22, P2.21, P2.31, P4.18 http://sci.esa.int/science-e/www/area/index.cfm?fareaid=102 http://www.euclid-ec.org 4

Euclid Science Ground Segment (SGS) 5

Euclid Project & SGS From Euclid pipeline to SGS Architecture From source code to processing nodes CernVM-FS principles Using CernVM-FS on SDCs 6

Euclid pipeline 7

From dedicated SDCs to Federated SDCs Dedicated SDCs Whole pipeline in any SDC on quantum of data soc SIM VIS SIR NIR MER EXT SPE PHZ SHE LE3 EAS Unbalanced load Pipeline in a Box 100+ PB in grand total More Data transfer than processing? Move the code Not the data paradigm 8

Move the code not the Data SDCs are both storage and processing nodes Data storage and processing allocated by sky area 9

Euclid Project & SGS From Euclid pipeline to SGS Architecture From source code to processing nodes CernVM-FS principles Using CernVM-FS on SDCs 10

From source code to production 11

From source code to production EDEN referential 12

in collaboration with an international team How to federate? 13

in collaboration with an international team EDEN referential 14

in collaboration with an international team How to put code together? 15

in collaboration with an international team Configuration management Euclid SVN / Git 16

in collaboration with an international team How to build? Euclid SVN / Git 17

continuously integrating its code Continuous integration Euclid SVN / Git Nightly Builds Feedback on code EDEN 18

continuously integrating its code Euclid SVN / Git EDEN 19

continuously integrating its code Build Euclid SVN / Git EDEN Elements 20

continuously integrating its code Build Run Tests (Unit, smoke, pre-int. tests) Euclid SVN / Git EDEN Elements 21

continuously integrating its code Build Run Tests (Unit, smoke, pre-int. tests) Generate Documentation (Doxygen) Euclid SVN / Git EDEN Elements 22

continuously integrating its code Build Run Tests (Unit, smoke, pre-int. tests) Generate Documentation (Doxygen) Quality check (Custom) Euclid SVN / Git EDEN Elements 23

continuously integrating its code Build Run Tests (Unit, smoke, pre-int. tests) Generate Documentation (Doxygen) Quality check (Custom) Euclid SVN / Git Dashboards (SonarQube, ) Feedback on code EDEN Elements 24

continuously integrating its code Build Run Tests (Unit, smoke, pre-int. tests) Generate Documentation (Doxygen) Quality check (Custom) Packaging (Custom) Euclid SVN / Git Dashboards (SonarQube, ) Feedback on code EDEN Elements 25

continuously integrating its code Build Run Tests (Unit, smoke, pre-int. tests) Generate Documentation (Doxygen) Quality check (Custom) Packaging (Custom) Storage in repository Euclid SVN / Git Dashboards (SonarQube, ) Local deployment (yum install) YUM Repository (Nexus) Feedback on code EDEN Elements 26

able to quickly deploy it on production Euclid SVN / Git Feedback on code Nightly Builds Feedback on code Yum install FR PRODUCTION ENVIRONMENT UK CH ES DE NL FI IT US Science Data Centers SCIENTIFIC PRODUCTS EDEN 27

able to quickly deploy it on production Euclid SVN / Git Feedback on code Nightly Builds Feedback on code Continuous Deployment FR PRODUCTION ENVIRONMENT UK CH ES DE NL FI IT US Science Data Centers SCIENTIFIC PRODUCTS EDEN 28

«Dream» Solution not (too) intrusive Automatic deployment No (less) admin intervention required Security friendly No exotic protocol (easy to filter) Outgoing connexion prefered (no incoming) Mutiple versions in // EDEN versions Pipeline versions Efficient 29

Euclid Project & SGS From Euclid pipeline to SGS Architecture From source code to processing nodes CernVM-FS principles Using CernVM-FS on SDCs 30

CernVM File System (CernVM-FS) Developed by CERN (European Organization for Nuclear Research) For High Energy Physics (HEP) collaborations To deploy software on the worldwide-distributed computing infrastructure: HTTP based Pull mode: get locally only on access Cache hierarchy User space FUSE local read-only mounting point /cvmfs 31

Cvmfs principles 32

Cvmfs distribution network Central repository Ring of mirrors: scalability, reliability 33

Cvmfs distribution network Local Squid proxy: cache, security Linked to stratum 1: failover, load balancing Proxy cache (Horizontal scaling) 34

Cvmfs distribution network Cvmfs client on workers Linked to Squid proxies HTTP Local file system cache CernVM-FS client Operating System & Applications Read-only access Fuse OS Kernel Proxy cache (Horizontal scaling) 35

Cvmfs distribution network Compatible with Physical nodes (µ)vms Docker container Read-only access HTTP Local file system cache CernVM-FS client Fuse Operating System & Applications OS Kernel Proxy cache Physical Node VM (Horizontal scaling) Container 36

Cvmfs distribution network Multi-sites Local file system cache Read-only access HTTP CernVM-FS client Fuse Operating System & Applications OS Kernel Proxy cache Physical Node VM (Horizontal scaling) Container 37

Euclid Project & SGS From Euclid pipeline to SGS Architecture From source code to processing nodes CernVM-FS principles Using CernVM-FS on SDCs 38

Euclid cvmfs at a glance 39

Euclid cvmfs repository structure Multiple versions in // Any dependency along (yum install installroot) Switch through Env. Variables 40

Euclid cvmfs & continuous deployment Automated scripts GIT CODEEN Nexus CernVM-FS SDC SDC SDC X DEVELOP CODEEN chain (Build, Tests, Doc., Quality) TEST repo TEST Branch Read TEST Environment MASTER CODEEN chain (Build, Tests, Doc., Quality) Packaging (RPM) PROD repo EDEN x.y Deployment PROD Branch EDEN x.y Read PROD Environment Both Test and Production branches 15~mn latency between Installation on stratum 0 Availability on any SDC 41

Conclusions Tested through a technical SGS challenge Adopted on the whole Euclid SGS Used for ongoing Euclid Scientific SGS challenge #3 Automation scripts almost ready 42

Thank you for your attention Maurice.Poncet@cnes.fr Acknowledgments: authors are indebted to all the indididuals participating in the Euclid SGS developement inside ESA and EC, too numerous to be listed here 43

Questions 44

BACKUP SLIDES 45

Euclid at a glance 46

Euclid SGS Components 47

Euclid tools & software stack 48

Euclid Software stack 49