Virtualization of the ATLAS Tier-2/3 environment on the HPC cluster NEMO


Slide 1: Virtualization of the ATLAS Tier-2/3 environment on the HPC cluster NEMO

Ulrike Schnoor (CERN)
Anton Gamel, Felix Bührer, Benjamin Rottler, Markus Schumacher (University of Freiburg)
February 02, 2018, Pre-GDB Meeting

Slide 2: Using HPC resources via virtualization

Resource: the HPC cluster NEMO at the University of Freiburg, used to extend the local Tier-3 resources (Black Forest Grid = BFG).
Job types: currently mainly local ATLAS analysis and simulation jobs, but easily extendable to any ATLAS jobs.
Setup: full virtualization of the environment, embedded into the existing OpenStack-Torque/Moab infrastructure in a way that is:
- based on demand
- fully automated
- transparent for the user

Slide 3: bwForCluster NEMO

HPC center shared by 3 research communities in Baden-Württemberg: Elementary Particle Physics, Neuroscience, Microsystems Engineering.
- 752 worker nodes, each with 2 × 10 cores, 128 GB RAM, 240 GB local SSD
- 100 Gbit/s Omni-Path interconnect
- 500 TB workspace (BeeGFS)
- TOP500: ranked 214 in June 2017
- In operation since July 2016
Hybrid of HPC and cloud approach: OpenStack orchestrates bare-metal jobs and virtual machines in parallel.

Slide 4: Virtualization of the ATLAS infrastructure on NEMO

Ingredients:
- OpenStack: management framework allowing both virtual machines and bare-metal jobs to run on NEMO
- Hypervisor: KVM
- User interface: BFG login nodes
- Access to CVMFS and Frontier via the BFG squid proxy
- Scheduler: Slurm (front-end), Torque/Moab (back-end)
- Scheduling for dynamic allocation of VMs: ROCED
- VM image (SL6, CentOS 7)
- Access to storage: dCache client, local BeeGFS
- Access to software: CVMFS client (an illustrative client configuration is sketched below)
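For illustration, a minimal CVMFS client configuration of the kind such a VM image might carry. The squid hostname and cache values here are invented placeholders, not the actual BFG settings; only the parameter names are standard CVMFS client options.

    # /etc/cvmfs/default.local -- illustrative values only
    CVMFS_REPOSITORIES=atlas.cern.ch,atlas-condb.cern.ch,sft.cern.ch
    CVMFS_HTTP_PROXY="http://squid.bfg.example.org:3128"   # hypothetical BFG squid
    CVMFS_CACHE_BASE=/var/lib/cvmfs
    CVMFS_QUOTA_LIMIT=20000                                # local cache quota in MB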

Slide 5: Virtual machine image tool chain

Requirements: Scientific Linux 6; the CernVM image uses a modified kernel and is therefore not suitable.
Setup: Packer for automated image generation
- Basis: SL6 ISO
- Output: VM template image (qcow2)
Contextualization with Puppet: install software and services (e.g. the CVMFS client), user management etc. with the BFG Puppet server; identical and modularized setup.
Important updates? Generate a new VM image. (The shape of such a Packer template is sketched below.)
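As a sketch of this tool chain, a stripped-down Packer template using the QEMU builder to turn an SL6 ISO into a qcow2 image and hand provisioning over to Puppet. The ISO URL, checksum, kickstart location, and credentials are placeholders, not the values used in Freiburg.

    {
      "builders": [{
        "type": "qemu",
        "iso_url": "http://mirror.example.org/scientific/6/SL-6-x86_64-DVD.iso",
        "iso_checksum_type": "sha256",
        "iso_checksum": "...",
        "format": "qcow2",
        "disk_size": 10240,
        "ssh_username": "root",
        "ssh_password": "changeme",
        "ssh_wait_timeout": "30m",
        "boot_command": ["<tab> text ks=http://kickstart.example.org/sl6-ks.cfg<enter>"]
      }],
      "provisioners": [{
        "type": "shell",
        "inline": ["yum install -y puppet"]
      }]
    }

Running `packer build template.json` against such a template produces the VM template image; the Puppet run that installs the CVMFS client and user management then happens at contextualization time against the BFG Puppet server.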

Slide 6: Scheduling with Slurm Elastic Computing

Slurm Elastic Computing: resume and suspend machines on demand, with adaptable resume/suspend functions and timeouts.
Challenges:
- The 3-layer system of Slurm, Torque/Moab, and OpenStack allows almost no propagation of error messages.
- Not intended for non-permanent resources (queueing in Moab): the timeouts are not sufficiently adaptable.
Solution: an intermediate layer such as ROCED. (A minimal elastic-computing configuration is sketched below.)
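For reference, the shape of a minimal Slurm elastic-computing setup of this kind. The node names, helper scripts, and timeout values are illustrative assumptions, not the BFG production configuration; the parameter names are standard slurm.conf options.

    # slurm.conf fragment -- illustrative values, not the BFG production setup
    SuspendProgram=/usr/local/sbin/vm-suspend.sh   # hypothetical script shutting a VM down
    ResumeProgram=/usr/local/sbin/vm-resume.sh     # hypothetical script booting a VM via OpenStack
    SuspendTime=600        # suspend a node after 10 min of idleness
    SuspendTimeout=120
    ResumeTimeout=900      # how long Slurm waits for a booted node to register
    NodeName=vm[001-100] State=CLOUD Sockets=2 CoresPerSocket=10 RealMemory=120000
    PartitionName=nemo-vm Nodes=vm[001-100] Default=YES MaxTime=48:00:00 State=UP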

Slide 7: ROCED - Responsive On-Demand Cloud-enabled Deployment

Tool developed by CMS colleagues at KIT in Karlsruhe: monitors demand in a batch system and dynamically manages virtual machines accordingly.
Python code with a modular structure that adapts to different schedulers, VM types, clouds, etc. The Integration and Requirement Adapters were modified for the BFG/Slurm setup and are in production.
[Architecture diagram:]
- Requirement Adapters (HTCondor, Torque, Grid Engine, Slurm): supply information about needed compute nodes, e.g. the queue size.
- ROCED Core Broker: decides which machines to boot or shut down.
- Site Adapters (hybrid HPC cluster, commercial providers, OpenStack): boot machines on various cloud computing sites.
- Integration Adapters (HTCondor, Torque, Grid Engine, Slurm): integrate booted compute nodes into the existing batch server.
(A toy requirement adapter in the spirit of this design is sketched below.)
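To convey the adapter idea, a toy Python sketch of a Slurm requirement adapter that translates queue pressure into a VM demand. This is not the actual ROCED code and its real adapter interface differs; the partition name and slots-per-VM value are invented for the example.

    # Toy sketch of a ROCED-style Requirement Adapter -- not the actual ROCED API.
    # Assumes `squeue` is on the PATH and one VM provides `slots_per_vm` job slots.
    import subprocess

    def pending_jobs(partition="nemo-vm"):
        """Count pending jobs in a Slurm partition via squeue."""
        out = subprocess.run(
            ["squeue", "--noheader", "--state=PD",
             "--partition", partition, "--format=%i"],
            capture_output=True, text=True, check=True).stdout
        return len(out.split())

    def required_machines(slots_per_vm=20, max_vms=100, partition="nemo-vm"):
        """Translate queue pressure into a number of VMs to request."""
        demand = -(-pending_jobs(partition) // slots_per_vm)  # ceiling division
        return min(demand, max_vms)

    if __name__ == "__main__":
        print(f"VMs to request: {required_machines()}")

In ROCED proper, the core broker would poll such a requirement adapter, compare the demand against running machines, and instruct a site adapter (e.g. OpenStack) to boot or shut down VMs accordingly.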

Slide 8: Summary and Outlook

- The Slurm Elastic Computing setup can be used, but it is very fragile and leads to many job failures.
- Using ROCED instead of Slurm Elastic Computing: non-elastic Slurm together with ROCED; the Requirement Adapter and Integration Adapter implementations for Slurm and the BFG are in place.
- Future possibilities: use of containers; CVMFS-distributed images instead of home-brewed ones built with Packer.

Slide 9: The Team

Anton Gamel, Felix Bührer, Benjamin Rottler, Ulrike Schnoor, Markus Schumacher
Contacts in the Computing Center (HPC team): Michael Janczyk, Bernd Wiebelt, Dirk von Suchodoletz
Formerly also: Konrad Meier

Slide 10: Backup

Slide 11: The Black Forest Grid (BFG)

- Tier-2 and Tier-3 site of the WLCG, in operation since 2005
- CPU: 260 nodes with 4700 cores in total (HT); several generations of worker node hardware
- Storage: dCache 1.35 PB (grid), Lustre parallel storage 180 TB (local users)
- Local users from physics, biodynamics, and many other groups
- Future: exclusively Tier-2 and Tier-3 WLCG workloads

Slide 12: HPC in Baden-Württemberg

bwHPC-C5 project: initiative in Baden-Württemberg providing a common framework for HPC resources at the state's universities, co-financed by the DFG.
The bwForClusters follow a federated approach: the user group is defined by research field, not by affiliation.
Freiburg hosts the bwForCluster for Elementary Particle Physics, Neuroscience, and Microsystems Engineering: NEMO.

Slide 13: How to run ATLAS jobs on NEMO?

- OS: ATLAS currently needs Scientific Linux 6; NEMO runs CentOS 7.
- Software: CVMFS (the CernVM File System), the basis for all experiment-specific software, is not installed on NEMO.
- Storage: AFS is not available on NEMO.
Solution: virtualize the environment. The virtual machine image and the orchestration/scheduling setup can be used by local jobs as well as by grid jobs.

Slide 14: Timeouts in Slurm

The elasticity of the Slurm Elastic Computing module can be tuned with several timeout parameters.
Main issue: ResumeTimeout should be long in order to absorb the waiting time in the Moab queue, but short in order to restart quickly if a VM start fails.
Other problem: VMs often stay in the COMPLETING state (after a job has terminated, before turning IDLE) for a long time.
(The fragment below illustrates the conflicting settings.)
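A sketch of the conflict, continuing the hypothetical slurm.conf fragment from slide 6; the numbers are invented to show the trade-off, not measured values from the BFG setup.

    # slurm.conf fragment -- two incompatible choices for ResumeTimeout
    # (values are illustrative, not from the BFG configuration)
    #
    # Long timeout: tolerates VMs waiting in the Moab queue before booting,
    # but a failed VM start blocks the node for the full 30 minutes:
    ResumeTimeout=1800
    #
    # Short timeout: failed VM starts are retried quickly,
    # but VMs still queued in Moab are declared DOWN prematurely:
    # ResumeTimeout=300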
