CernVM - a virtual software appliance for LHC applications

Similar documents
13th International Workshop on Advanced Computing and Analysis Techniques in Physics Research ACAT 2010 Jaipur, India February

CernVM a virtual software appliance for LHC applications

Application of Virtualization Technologies & CernVM. Benedikt Hegner CERN

BOSS and LHC computing using CernVM and BOINC

Distributed Systems COMP 212. Lecture 18 Othon Michail

STATUS OF PLANS TO USE CONTAINERS IN THE WORLDWIDE LHC COMPUTING GRID

Chapter 5 C. Virtual machines

Evolution of Cloud Computing in ATLAS

arxiv: v1 [cs.dc] 7 Apr 2014

What is Cloud Computing? Cloud computing is the dynamic delivery of IT resources and capabilities as a Service over the Internet.

Volunteer Computing at CERN

First Experience with LCG. Board of Sponsors 3 rd April 2009

CERN openlab II. CERN openlab and. Sverre Jarp CERN openlab CTO 16 September 2008

Status and Roadmap of CernVM

EE 660: Computer Architecture Cloud Architecture: Virtualization

The Future of Virtualization. Jeff Jennings Global Vice President Products & Solutions VMware

Experiences with OracleVM 3.3

INNOV-4: Fun With Virtualization. Or, How I learned to love computers that don t really exist...

Module 1: Virtualization. Types of Interfaces

Servicing HEP experiments with a complete set of ready integreated and configured common software components

Evolution of the ATLAS PanDA Workload Management System for Exascale Computational Science

Virtualization Introduction

ELFms industrialisation plans

Global Software Distribution with CernVM-FS

Vembu BDR Suite vs HPE VM Explorer

Docker and HPE Accelerate Digital Transformation to Enable Hybrid IT. Steven Follis Solutions Engineer Docker Inc.

Proven video conference management software for Cisco Meeting Server

The Future of Virtualization Desktop to the Datacentre. Raghu Raghuram Vice President Product and Solutions VMware

RADU POPESCU IMPROVING THE WRITE SCALABILITY OF THE CERNVM FILE SYSTEM WITH ERLANG/OTP

g-eclipse A Framework for Accessing Grid Infrastructures Nicholas Loulloudes Trainer, University of Cyprus (loulloudes.n_at_cs.ucy.ac.

CHAPTER 16 - VIRTUAL MACHINES

Nested Virtualization and Server Consolidation

Cloud & container monitoring , Lars Michelsen Check_MK Conference #4

Virtualizing a Batch. University Grid Center

Deploying virtualisation in a production grid

Copyright 2012, Oracle and/or its affiliates. All rights reserved.

How Parallels RAS Enhances Microsoft RDS. White Paper Parallels Remote Application Server

Lecture 09: VMs and VCS head in the clouds

Virtual Machines. Jinkyu Jeong Computer Systems Laboratory Sungkyunkwan University

Scientific Computing on Emerging Infrastructures. using HTCondor

OPENSTACK: THE OPEN CLOUD

Multiprocessor Scheduling. Multiprocessor Scheduling

OPS-9: Fun With Virtualization. John Harlow. John Harlow. About John Harlow

CS 470 Spring Virtualization and Cloud Computing. Mike Lam, Professor. Content taken from the following:

System Specification

Demystifying the Cloud With a Look at Hybrid Hosting and OpenStack

Explore multi core virtualization on the project

Today s Objec4ves. Data Center. Virtualiza4on Cloud Compu4ng Amazon Web Services. What did you think? 10/23/17. Oct 23, 2017 Sprenkle - CSCI325

Using MySQL in a Virtualized Environment. Scott Seighman Systems Engineer Sun Microsystems

The ATLAS Software Installation System v2 Alessandro De Salvo Mayuko Kataoka, Arturo Sanchez Pineda,Yuri Smirnov CHEP 2015

CernVM-FS beyond LHC computing

VIRTUAL CENTRAL LOCK

CS370 Operating Systems

Using Puppet to contextualize computing resources for ATLAS analysis on Google Compute Engine

Xen and CloudStack. Ewan Mellor. Director, Engineering, Open-source Cloud Platforms Citrix Systems

USING A FREE VIRTUAL MACHINE TO SIMILATE A SPECIFIC MODEL OF COMPUTER

SaaSaMe Transport Workload Snapshot Export for. Alibaba Cloud

Science Computing Clouds.

CS 350 Winter 2011 Current Topics: Virtual Machines + Solid State Drives

Cloud Computing for Science

Status of KISTI Tier2 Center for ALICE

IBM Db2 Analytics Accelerator Version 7.1

App Economy Market analysis for Economic Development

Cloud du CCIN2P3 pour l' ATLAS VO

Windows Azure Services - At Different Levels

An Introduction to Virtualization and Cloud Technologies to Support Grid Computing

ElasterStack 3.2 User Administration Guide - Advanced Zone

VMware AirWatch Content Gateway for Linux. VMware Workspace ONE UEM 1811 Unified Access Gateway

The Challenges of X86 Hardware Virtualization. GCC- Virtualization: Rajeev Wankar 36

Scientific data processing at global scale The LHC Computing Grid. fabio hernandez

Open Hybrid Cloud & Red Hat Products Announcements

The LHC Computing Grid

Building a Virtualized Desktop Grid. Eric Sedore

Virtualization. ...or how adding another layer of abstraction is changing the world. CIS 399: Unix Skills University of Pennsylvania.

Virtualization. Pradipta De

Cloud Programming. Programming Environment Oct 29, 2015 Osamu Tatebe

VMware vrealize Code Stream Reference Architecture. 16 MAY 2017 vrealize Code Stream 2.3

Cloud Computing Introduction & Offerings from IBM

Spring 2017 :: CSE 506. Introduction to. Virtual Machines. Nima Honarmand

Introduction to computing, architecture and the UNIX OS. HORT Lecture 1 Instructor: Kranthi Varala

24-vm.txt Mon Nov 21 22:13: Notes on Virtual Machines , Fall 2011 Carnegie Mellon University Randal E. Bryant.

Lecture 1: January 22

VMware AirWatch Content Gateway for Windows. VMware Workspace ONE UEM 1811 Unified Access Gateway

jamf Nation - London Roadshow

VMware AirWatch Content Gateway Guide For Linux

Conference The Data Challenges of the LHC. Reda Tafirout, TRIUMF

Getting to Know Apache CloudStack


Evolution of the HEP Content Distribution Network. Dave Dykstra CernVM Workshop 6 June 2016

BACKUP AND RECOVERY OF A HIGHLY VIRTUALIZED ENVIRONMENT

Multitiered Architectures & Cloud Services. Benoît Garbinato

Oracle Linux, Virtualization & OEM12 Discussion Sahil Mahajan / Sundeep Dhall

Reconstruyendo una Nube Privada con la Innovadora Hiper-Convergencia Infraestructura Huawei FusionCube Hiper-Convergente

Use to exploit extra CPU from busy Tier2 site

Virtualization of the MS Exchange Server Environment

DIRAC pilot framework and the DIRAC Workload Management System

Server Virtualization and Optimization at HSBC. John Gibson Chief Technical Specialist HSBC Bank plc

Lecture 1: January 23

VxRail: Level Up with New Capabilities and Powers GLOBAL SPONSORS

Distributed Systems. 31. The Cloud: Infrastructure as a Service Paul Krzyzanowski. Rutgers University. Fall 2013

Transcription:

CernVM - a virtual software appliance for LHC applications C. Aguado-Sanchez 1), P. Buncic 1), L. Franco 1), S. Klemer 1), P. Mato 1) 1) CERN, Geneva, Switzerland Predrag Buncic (CERN/PH-SFT)

Talk Outline Introduction CernVM Project Building blocks Scalability and performance User Interface and API Release status Future plans & directions Conclusions ACAT 2008 CernVM A Virtial Machine for LHC Experiments Erice, 6/11/2008-2

Introduction

Enjoying Frequency scaling Era Cluster of Clusters (GRID) Workstation & PC Clusters Mainframes Single, multi and many cores IBM-VM 360, CERNVM,1988 ACAT 2008 CernVM A Virtial Machine for LHC Experiments Erice, 6/11/2008-4

Recent Trends (h/w) Multi & many cores Software benefits from multicore architectures where code can be executed in parallel Under most common operating systems this requires code to execute in separate threads or processes. CPU Core and L1 Caches Bus Interface and L2 Caches CPU Core and L1 Caches Unfortunatelly, HEP/LHC applications were developed during period when it looked like any performance issue can be easily solved by simply waiting 2 more years Support for hardware assisted virtualization VMM can now efficiently virtualize the entire x86 instruction set Intel VT and AMD-V implementations VMware, Xen 3.x (including derivatives like Virtual Iron), Linux KVM and Microsoft Hyper-V Running Virtual Machine will benefit from adoption of multiple core architectures since each virtual machine runs independently of others and can be executed in parallel In some cases (KVM,VMware) running concurrent VM can save memory ACAT 2008 CernVM A Virtial Machine for LHC Experiments Erice, 6/11/2008-5

Building on this new CPU capabilities, the software industry giants quickly came up with Cloud computing paradigm that relies on virtualization as enabling technology densly packed and energy efficient multi core systems as a basis fo next generation data centers Utility Computing on demand Amazon Elastic Computing Cloud Blue Cloud (IBM) 13 dedicated data centres Software as a Service (SaaS) Google App Engine Delivering an application through the browser to thousands of customers Investing ~$2 billion per year in new generation data centres Microsoft Azzuro Hosted office applications & data Plans to match Google in terms of number of data centres, $600 million per site Recent Trends (s/w) ACAT 2008 CernVM A Virtial Machine for LHC Experiments Erice, 6/11/2008-6

SaaS example: eyeos http://eyeos.org ACAT 2008 CernVM A Virtial Machine for LHC Experiments Erice, 6/11/2008-7

Grid vs Cloud Localised data (Storage Element) Complicated scheduling Grid middleware in charge of abstacting complexity and differences of heterogenious resources Requires complex middleware Distributed storage Simplified scheduling Virtualization used as a tool to create a homogenious overlay for end user(s) and reduce scale of the problem Simplifies middleware ACAT 2008 CernVM A Virtial Machine for LHC Experiments Erice, 6/11/2008-8

"Cloud Computing is a paradigm in which information is permanently stored in servers on the Internet and cached temporarily on clients that include desktops, entertainment centers, table computers, notebooks, wall computers, handhelds, sensors, monitors, etc. ACAT 2008 CernVM A Virtial Machine for LHC Experiments Erice, 6/11/2008-9

CernVM Project

CernVM Project Portable Analysis Environment using Virtualization Technology (WP9) Project goal: Provide a complete, portable and easy to configure user environment for developing and running LHC data analysis locally and on the Grid independent of physical software and hardware platform (Linux, Windows, MacOS) Decouple application lifecycle from evolution of system infrastructure Reduce effort to install, maintain and keep up to date the experiment software Lower the cost of software development by reducing the number of compilerplatform combinations Approved in 2007 (2+2 years) as R&D activity, started January 2008 ACAT 2008 CernVM A Virtial Machine for LHC Experiments Erice, 6/11/2008-11

How do we want to do that? CernVM Thin Virtual Software Appliance

Key Building Blocks rpath Linux 1 (www.rpath.org) Slim Linux OS binary compatible with RH/SLC4 raa - rpath Linux Appliance Agent Web user interface XMLRPC API rbulder Build types A tool to build VM images for various virtualization platforms CVMFS - CernVM file system Read only file system optimized for software distribution Aggressive caching Operational in offline mode For as long as you stay within the cache Installable CD/DVD Stub Image Raw Filesystem Image Netboot Image Compressed Tar File Demo CD/DVD (Live CD/DVD) Raw Hard Disk Image Vmware Virtual Appliance Vmware ESX Server Virtual Appliance Microsoft VHD Virtual Apliance Xen Enterprise Virtual Appliance Virtual Iron Virtual Appliance Parallels Virtual Appliance Amazon Machine Image Update CD/DVD Appliance Installable ISO ACAT 2008 CernVM A Virtial Machine for LHC Experiments Erice, 6/11/2008-13

Easy packaging with Conary class Root(CPackageRecipe): name='root' version='5.19.02' buildrequires = ['libpng:devel', 'libpng:devellib','krb5:devel', 'libstdc++:devel,'libxml2:devel', 'openssl:devel','python:devel', 'xorg-x11:devel', 'zlib:devel', 'perl:devel', 'perl:runtime'] def setup(r): r.addarchive('ftp://root.cern.ch/root/%(name)s v%(version)s.source.tar.gz') r.environment('rootsys',%(builddir)s') r.manualconfigure('--prefix=/opt/root ') r.make() r.makeinstall() ACAT 2008 CernVM A Virtial Machine for LHC Experiments Erice, 6/11/2008-14

CernVM Components group-<experiment>-desktop (lightweight X environment) group-<experiment> (groups and extra packages required by experiment) group-cernvm-desktop X11 100 MB group-cernvm-devel (development tools) group-cernvm (core packages) group-slc4 (SLC4 compatibility libs) compat-db4 compat-openssl compat-linstdc++slc3 compat-libxml2 compat-readline compat-tcl compat-tk ACAT 2008 CernVM A Virtial Machine for LHC Experiments Erice, 6/11/2008-15

CVMFS CernVM File System (CVMFS) is derived from Parrot (http://www.cctools.org) and its GROW-FS code base and adapted to run as a FUSE kernel module adding extra features like: possibility to use multiple file catalogues on the server side transparent file compression under given size threshold dynamical expansion of environment variables embedded in symbolic links ACAT 2008 CernVM A Virtial Machine for LHC Experiments Erice, 6/11/2008-16

Scalability: Cache Hierarchy LAN/WAN (HTTP) H T T P D Software Repository ACAT 2008 CernVM A Virtial Machine for LHC Experiments Erice, 6/11/2008-17

Scalable Infrastructure ACAT 2008 CernVM A Virtial Machine for LHC Experiments Erice, 6/11/2008-18

Publishing Releases

CernVM v.s. LXPLUS Ra atio to the execution ti ime in LXPLUS CernVM for ATLAS (benchmarks by Yushu Yao) For the first time running, CernVM is 2-3 times slower, depending on the network speed, since it needs to cache files Once the files are cached, the speed on CernVM is roughly the same as on LXPLUS. CernVM is faster when reading large files (since they are local) Note: LXPLUS has a higher CPU rate than test computer (3GHz vs. 2.66GHz) ACAT 2008 CernVM A Virtial Machine for LHC Experiments Erice, 6/11/2008-20

User Interface

raa Web UI ACAT 2008 CernVM A Virtial Machine for LHC Experiments Erice, 6/11/2008-22

raa: XML-RPC API import xmlrpclib import os url = 'http://' + user + ':' + password + '@' + host + ':8004/rAA/xmlrpc' server = xmlrpclib.serverproxy(url) r = server.cernvm.config.configupdate('http://cern.ch/cernvm/config', 'ALICE', # (can be None,ATLAS, ALICE, LHCB, CMS) 'off', # (on/off - if on it will make cvmfs locally writeable) 'off', # (on/off - enable/disable grid UI) 'off') # (on/off - enable VO profile) print r["message"] #### # Here we create initial user 'test' with the same password as Web admin #### r = server.cernvm.user.userupdate('test', password,password) print r["message"] #### # Select grid UI version #### r = server.cernvm.config.configgriduiversion( 3.1.22-0") print r["message"] ACAT 2008 CernVM A Virtial Machine for LHC Experiments Erice, 6/11/2008-23

Developing CernVM

Developing CernVM JAN FEB MAR APR MAY JUN JUL AUG SEP OCT NOV DEC -Preparation Release 0.5 Kickoff Workshop - Release 0.6 Release 1.0 Available for download now! - Release 0.7 - Release 0.8 - Release 0.91 (RC1) Carlos, Leandro, Steffen, Artem -Release 0.92 (RC2) On time! ACAT 2008 CernVM A Virtial Machine for LHC Experiments Erice, 6/11/2008-25

1.0 Release Available now for download from http://rbuilder.cern.ch/project/cernvm/releases Can be run on Linux (KVM, Xen,VMware Player, VirtualBox) Windows(WMware Player, VirtualBox) Mac (Fusion, Parallels, VirtualBox) Release Notes http://cernvm.web.cern.ch/cernvm/index.cgi?page=releasenotes HowTo http://cernvm.web.cern.ch/cernvm/?page=howto Appliance can be configured and used with ALICE, LHCb, ATLAS (and CMS) software frameworks ACAT 2008 CernVM A Virtial Machine for LHC Experiments Erice, 6/11/2008-26

Future plans & directions

Removing Single Point of Failure Secondary software repository Load Balancing Reverse Proxy Reverse Proxy Primary (master) software Load repository Balancing Reverse Proxy Regional Reverse Proxy Load Balancing Reverse Proxy Secondary software repository CernVM (thin client) Reverse Proxy Site Reverse Proxy Content Delivery Network ACAT 2008 CernVM A Virtial Machine for LHC Experiments Erice, 6/11/2008-28

Bridging Grids & Clouds BOINC Open-source software for volunteer computing and grid computing http://boinc.berkeley.edu/ Summer student project in OpenLab based on LHC@HOME experience and CernVM image, aims to run ATLAS simulation using BOINC infrastructure Successfully run ATLAS PanDA job in CernVM BOINC LHC@HOM E PanDA Pilot ACAT 2008 CernVM A Virtial Machine for LHC Experiments Erice, 6/11/2008-29

Adapter CernVM CoPilot AliEn/DIRAC/PanDA 0. Send host JDL (free disk space, free memory, available packages) 1. Append framework specific information (e.g. CE name) and request a job 3. Send input files and commands for execution (packages are already there) 2. Send user job JDL from Task Quee 4. When the job is done send back the output files (and the result of validation) 5. Register output files ACAT 2008 CernVM A Virtial Machine for LHC Experiments Erice, 6/11/2008-30

Simplifying Grid deployment Nimbus (former Globus Workspace Service) Nimbus is a set of open source tools that together provide an "Infrastructure-as-a-Service" (IaaS) cloud computing solution http://workspace.globus.org/ Google Summer School (hosted at ANL) project to deploy a one-click, auto-configuring virtual Grid overlay for Alice/AliEn Successfully created virtual AliEn site for ALICE with one command ACAT 2008 CernVM A Virtial Machine for LHC Experiments Erice, 6/11/2008-31

Mailing lists cernvm-talk@cern.ch (Open list for announcements and discussion) cernvm.support@cern.ch (End-user support for the CernVM project) Savannah Portal Please submit bugs and feature requests to Savannah at http://savannah.cern.ch/projects/cernvm CernVM Home Page: http://cernvm.cern.ch rbuilder & Download Page: http://rbuilder.cern.ch CernVM Links ACAT 2008 CernVM A Virtial Machine for LHC Experiments Erice, 6/11/2008-32

Summary If you believe that it is time to jump on bandwagon and expolore directioin in which major software industry players are going and if you have one of these problems: You work for LHC experiment and its software is not compatible with your favorite hardware or s/w platform running on your laptop/desktop You do not want to spend time to manually keep software up to date Your want to profit from the latest developments in CPU technology and use your multi/many core CPU to its maximal potential without modifying your application You want to share spare CPU cores/cycles with others You want to run your software on voluntary resources beyond the current Grid then CernVM might be what you are looking for. Version 1.0 available at http://rbuilder.cern.ch/project/cernvm/releases ACAT 2008 CernVM A Virtial Machine for LHC Experiments Erice, 6/11/2008-33

The End ACAT 2008 CernVM A Virtial Machine for LHC Experiments Erice, 6/11/2008-34

Multi core performance ACAT 2008 CernVM A Virtial Machine for LHC Experiments Erice, 6/11/2008-35

Application benchmarks 180 160 140 120 100 80 60 40 20 0 Linux Dom0 DomU VMW stresshepix AliRoot simlocal AliRoot reclocal Geant 4 http://cernvm.cern.ch/cernvm/index.cgi?page=benchmarkresults ACAT 2008 CernVM A Virtial Machine for LHC Experiments Erice, 6/11/2008-36