Five years of OpenStack at CERN

Similar documents
Preparing for High-Luminosity LHC. Bob Jones CERN Bob.Jones <at> cern.ch

The CERN OpenStack Cloud

The CERN OpenStack Cloud

Upcoming Services in OpenStack Rohit Agarwalla, Technical DEVNET-1102

The CERN OpenStack Cloud

Batch Services at CERN: Status and Future Evolution

The LHC computing model and its evolution. Dr Bob Jones CERN

Service withdrawal: Selected IBM ServicePac offerings

IBM offers Software Maintenance for additional Licensed Program Products

OpenStack Magnum Pike and the CERN cloud. Spyros

Distributed e-infrastructures for data intensive science

Storage and I/O requirements of the LHC experiments

OpenStack Mitaka Release Overview

IN2P3-CC cloud computing (IAAS) status FJPPL Feb 9-11th 2016

Jumpstart your Production OpenStack Deployment with

Phase II Upgrades. Eckhard Elsen. LHC RRB Meeting, Oct 29-31, Director Research and Computing

Infrastructure-as-Code and CI Infrastructure at Open Stack A look at one of the largest CI systems and system administration

International Packets

"Charting the Course... H8Q14S HPE Helion OpenStack. Course Summary

Items exceeding one or more of the maximum weight and dimensions of a flat. For maximum dimensions please see the service user guide.

OSDC.de 2013 Introduction to OpenStack. Justin Clift Open Source & Standards RH 17 April 2013

International Business Mail Rate Card

END-OF-SALE AND END-OF-LIFE ANNOUNCEMENT FOR THE CISCO MEDIA CONVERGENCE SERVER 7845H-2400

Developments in Manufacturing Technologies Research Co-operation between Riga Technical University and CERN

Huawei FusionSphere 6.0 Technical White Paper on OpenStack Integrating FusionCompute HUAWEI TECHNOLOGIES CO., LTD. Issue 01.

Introduction To OpenStack. Haim Ateya Group Manager, EMC

Red Hat OpenStack Platform 10 Product Guide

OnAudience.com I Report 2017 Ad blocking in the Internet

CERN Lustre Evaluation

Part2: Let s pick one cloud IaaS middleware: OpenStack. Sergio Maffioletti

CUSTOMER GUIDE Interoute One Bridge Outlook Plugin Meeting Invite Example Guide

Business internet start, Business internet comfort, Business internet max giga. Internet can also be purchased separately without voice.

Enterprise price plan guide Vodafone One Net Business

* Please note that recovery will only be provided free-of-charge if you hold valid cover via Honda.

AN POST SCHEDULE OF CHARGES

For: Ministry of Education From Date: 19 November 18-2 December 18 Venue: M1 Shops

E R T M S COMMUNICATION PLAN

COMMUNITY-GENERATED ROADMAP

BCS EXIN Foundation Certificate in OpenStack Software Syllabus

European Cybersecurity PPP European Cyber Security Organisation - ECSO November 2016

Automation DriveServer

European Standardization & Digital Transformation. Ashok GANESH Director Innovation ETICS Management Committee

International Roaming Critical Information Summaries JULY 2017

Minimal OpenStack Starting Your OpenStack Journey

Introduction to OpenStack

Intel, OpenStack, & Trust in the Open Cloud. Intel Introduction

ETSI Governance and Decision Making

STANDARD BROADBAND & FIBRE BROADBAND PLANS

Common European Submission Portal

Patent Portfolio Overview July The data in this presentation is current as of this date.

Cisco Extensible Provisioning and Operations Manager 4.5

White Paper The Storage System Best Suited for OpenStack FUJITSU Storage ETERNUS DX S4/S3 series and ETERNUS AF series

MINUTES AND TEXTS CUSTOMER MOBILE BOLT-ON GUIDE JUNE 2018 BOLT-ON WILL KEEP YOU IN CONTROL OF YOUR COSTS. INTERNATIONAL NUMBERS FROM YOUR MOBILE, THIS

Rolling upgrade Performance between OpenStack Deployed in VMs and Containers

STANDARD BROADBAND & FIBRE BROADBAND PLANS

CRE investment weakens in Q as investors struggle to find product in prime markets

Cisco Voice Services Provisioning Tool 2.6(1)

icims Browser & Version Support Policy

OpenStack Manila An Overview of Manila Liberty & Mitaka

Carrier Services. Intelligent telephony. for over COUNTRIES DID NUMBERS. All IP

BoR (10) 13. BEREC report on Alternative Retail Voice and SMS Roaming Tariffs and Retail Data Roaming Tariffs

International Business Parcels Rate card

Overcoming the Compliance Challenges of VAT Remittance. 12 April :55 to 16:30 (CEST)

Step 1: New Portal User User ID Created Using IdentityIQ (IIQ)

Red Hat OpenStack Platform 8 Configure firewall rules for Red Hat OpenStack Platform director

ANNEX A.1 TECHNICAL SPECIFICATIONS OPEN CALL FOR TENDERS

Map Reconfiguration Dealer Guide

Challenges and Evolution of the LHC Production Grid. April 13, 2011 Ian Fisk

1. What is Cloud Computing (CC)? What are the Pros and Cons of CC? Technologies of CC 27

BoR (11) 08. BEREC Report on Alternative Voice and SMS Retail Roaming Tariffs and Retail Data Roaming Tariffs

Disclaimer This presentation may contain product features that are currently under development. This overview of new technology represents no commitme

Digital EAGLEs. Outlook and perspectives

Patent Portfolio Overview May The data in this presentation is current as of this date.

Box 2: Voting Oct 2003 Belgium Bulgaria Croatia Cyprus Czech Republic Denmark Estonia Finland France Germany [2 folders]

STANDARD BROADBAND & FIBRE BROADBAND PLANS

This document is a preview generated by EVS

New strategies of the LHC experiments to meet the computing requirements of the HL-LHC era

STATUS OF PLANS TO USE CONTAINERS IN THE WORLDWIDE LHC COMPUTING GRID

Cisco Aironet In-Building Wireless Solutions International Power Compliance Chart

Oracle Enterprise Manager 12 c : ASH in 3D

EU Telecoms Reform package 2007 Comments/ questions

OpenStack Cloud Storage. PRESENTATION TITLE GOES HERE Sam Fineberg HP Storage

The Canadian Experience

HPC IN EUROPE. Organisation of public HPC resources

The flexible mobile subscription for Switzerland, almost all European countries, the USA and Canada (Region 1) Mobile Internet in Switzerland

Cloud Essentials for Architects using OpenStack

iclass SE multiclass SE 125kHz, 13.56MHz 125kHz, 13.56MHz

Instructions. (For 6180 Industrial Computers) Installing a Processor Upgrade

High Availability for Enterprise Clouds: Oracle Solaris Cluster and OpenStack

EUMETSAT EXPERIENCE WITH MULTICAST ACROSS GÉANT

EWF European Federation for Welding, Joining and Cutting. 4 October, 2013 Lisbon, PORTUGAL

Clouds in High Energy Physics

Human Centric. Innovation. OpenStack = Linux of the Cloud? Ingo Gering, Fujitsu Dirk Müller, SUSE

Confirming its role as Italy s leading exhibition dedicated to security and fire prevention. 333 exhibitor companies

Map Reconfiguration User Guide

EUREKA European Network in international R&D Cooperation

OpenStack Architecture and Pattern Deployment with Heat. OpenStack Architecture and Pattern Deployment using Heat Ruediger Schulze

ITU. The New Global Challenges in Cybersecurity 30 October 2017 Bucharest, Romania

Countdown to GDPR. Impact on the Security Ecosystem and How to Prepare

Customers want to transform their datacenter 80% 28% global IT budgets spent on maintenance. time spent on administrative tasks

Transcription:

Five years of OpenStack at CERN

CERN: founded in 1954: 12 European States Science for Peace Today: 22 Member States ~ 2300 staff ~ 1400 other paid personnel ~ 12500 scientific users Budget (2017) ~1000 MCHF Member States: Austria, Belgium, Bulgaria, Czech Republic, Denmark, Finland, France, Germany, Greece, Hungary, Israel, Italy, Netherlands, Norway, Poland, Portugal, Romania, Slovak Republic, Spain, Sweden, Switzerland and United Kingdom Associate Member States: Pakistan, India, Ukraine, Turkey States in accession to Membership: Cyprus, Serbia Applications for Membership or Associate Membership: Brazil, Croatia, Lithuania, Russia, Slovenia Observers to Council: India, Japan, Russia, United States of America; European Union, JINR and UNESCO 2

The Large Hadron Collider (LHC) ~700 MB/s >1 GB/s ~10 GB/s >1 GB/s

Tim.Bell@cern.ch 5

2016: 50 PB recorded on tape!

Data Centres by Numbers

Managing all this became very very very tricky

2012: Agile Infrastructure project Provisioning + Configuration + Monitoring Aim: virtualize all the machines Unless really, really, really not possible Offer Cloud endpoints to users Scale horizontally Consolidate server provisioning Yes, we use the private cloud for server consolidation usecases as well

CERN Tool Chain

CERN OpenStack Service Timeline July 2013 CERN OpenStack Production February 2014 CERN OpenStack Havana October 2014 CERN OpenStack Icehouse March2015 CERN OpenStack Juno September 2015 CERN OpenStack Kilo September 2016 CERN OpenStack Liberty March 2017 CERN OpenStack Mitaka June 2017 CERN OpenStack Newton 5 April 2012 27 September 2012 4 April 2013 17 October 2013 17 April 2014 16 October 2014 30 April 2015 15 October 2015 7 April 2016 6 Oct 2016 22 Feb 2017 28 Aug 2017 ESSEX FOLSOM GRIZZLY HAVANA ICEHOUSE JUNO KILO LIBERTY MITAKA NEWTON OCATA PIKE Nova (*) Glance (*) Horizon (*) Keystone (*) Nova (*) Glance (*) Horizon (*) Keystone (*) Quantum Cinder Nova Glance Horizon Keystone Quantum Cinder Ceilometer (*) Nova Glance Horizon Keystone Neutron Cinder Ceilometer (*) Heat Nova Glance Horizon Keystone Neutron Cinder Ceilometer Heat Ironic Trove Nova Glance Horizon Keystone Neutron Cinder Ceilometer Heat (*) Rally (*) Nova Glance Horizon Keystone Neutron Cinder Ceilometer Heat Rally Manila Nova Glance Horizon Keystone Neutron (*) Cinder Ceilometer Heat Rally EC2API Magnum (*) Barbican (*) Nova Glance Horizon Keystone Neutron Cinder Ceilometer Heat Rally EC2API Magnum Barbican Ironic (?) Mistral (?) Manila (?) Nova Nova Glance Glance Horizon Horizon Keystone Keystone Neutron Neutron Cinder Cinder Ceilometer Ceilometer Heat Heat Rally Rally EC2API EC2API Magnum Magnum Barbican Barbican Ironic (?) Ironic (?) Mistral (?) Mistral (?) Manila (?) Manila (*) Nova Glance Horizon Keystone Neutron Cinder Ceilometer Heat Rally EC2API Magnum Barbican Ironic (*) Mistral (*) Manila (*) (*) Pilot (?) Trial Retired

CERN OpenStack cloud in numbers

Rich Usage Spectrum Batch service - Physics data analysis IT Services - Sometimes built on top of other virtualised services Experiment services - E.g. build machines Engineering services - E.g. micro-electronics/chip design Infrastructure services - E.g. hostel booking, car rental, Personal VMs - Development rich requirement spectrum!

Scaling Nova Top level cell Runs API service Top cell scheduler >50 child cells run Compute nodes Scheduler Conductor Cells v2 coming Default for all

Rally

Magnum Container Engine as a Service Kubernetes, Docker, Mesos, DCOS 120 clusters, 700 nodes $ magnum cluster-create --name myswarmcluster --cluster-template swarm --node-count 100 $ magnum cluster-list +------+----------------+------------+--------------+-----------------+ uuid name node_count master_count status +------+----------------+------------+--------------+-----------------+... myswarmcluster 100 1 CREATE_COMPLETE +------+----------------+------------+--------------+-----------------+ $ $(magnum cluster-config myswarmcluster --dir magnum/myswarmcluster) $ docker info / ps /... $ docker run --volume-driver cvmfs -v atlas.cern.ch:/cvmfs/atlas -it centos /bin/bash [root@32f4cf39128d /]#

What s new? Mistral Workflow-as-a-Service used for multi-step actions, triggered by users or events Horizon dashboard for visualising results Examples Expire personal resources after 6 months Multi-step project creation Scheduled snapshot of VMs Code at https://gitlab.cern.ch/cloudinfrastructure/mistral-workflows Some more complex cases coming in the pipeline

Automate provisioning Automate routine procedures - Common place for workflows - Clean web interface - Scheduled jobs, cron-style - Traceability and auditing - Fine-grained access control - Disable compute node Disable Nova Compute Disable Neutron agent Switch Alarms OFF Update Service-Now ticket Procedures for - OpenStack project creation - OpenStack quota changes - Notifications of VM owners - Usage and health reports - Notifications Other tasks Send e-mail to VM owners Save intervention details Send calendar invitation Add remote AT job Post new message broker

Manila: Overview File Share Project in OpenStack - Provisioning of shared file systems to VMs - Cinder for file shares APIs for tenants to request shares - Fulfilled by backend drivers - Accessed from instances Support for variety of NAS protocols - NFS, CIFS, MapR-FS, GlusterFS, CephFS, Supports the notion of share types - Map features to backends User instances 1. Request share 3. Provide handle 4. Access share Manila 2. Create share Backend

LHC Incident in April 2016

Manila testing: #fouinehammer m-api m-api m-api DB m-sched m-share Driver RabbitMQ 1 500 nodes 1... 10k PODs

Operations areas going forward Further automate migrations Around 5,000 VMs / year First campaign in 2016 needed some additional scripting such as pausing very active VMs Newton live migration includes most use cases Software Defined Networking Nova network to Neutron migration to be completed In addition to flat network in use currently Introduce higher level functions such as LBaaS

Development areas going forward Nova Pre-emptible VMs Nova Cells V2 Magnum rolling upgrades Collaborations with Industry

Compute needs growing With the needs of LHC computing in future years, efficient and flexible delivery of compute resources will be key Computing needs in 2023 estimated at 60x the current capacity (HL-LHC) 5000000 4500000 4000000 3500000 3000000 2500000 2000000 1500000 1000000 500000 0 WLCG CPU Growth Tier2 Tier1 CERN 20% Growth Current state 2008-12 linear 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 160 140 120 100 80 60 40 20 0 GRID ATLAS CMS LHCb ALICE Run 1 Run 2 Run 3 Run 4 Compute: Growth > x60 What we think is affordable unless we do something differently

Commercial Clouds

Summary OpenStack has provided a strong base for scaling resources over the past 5 years Additional functionality on top of pure Infrastructure-as-a-Service is now coming to production Community and industry collaboration has been productive and inspirational for the CERN team Some big computing challenges up ahead

Thank you!

Further Information Technical details on the CERN cloud at http://openstack-in-production.blogspot.fr Custom CERN code is at https://github.com/cernops Scientific Working Group at https://wiki.openstack.org/wiki/scientific_working_group Helix Nebula details at http://www.helix-nebula.eu/ http://cern.ch/it CERN CC-BY-SA 4.0

Tuning Many hypervisors are configured for compute optimisation CPU Passthrough so VM sees identical CPU Extended Page Tables so memory page mapping is done in hardware Core pinning so scheduler keeps the cores on the underlying physical cores Huge pages to improve memory page cache utilisation Flavors are set to be NUMA aware Improvements of up to 20% in performance Impact is that the VMs cannot be live migrated so service machines are not configured this way

Pick the interesting events 40 million per second Fast, simple information Hardware trigger in a few micro seconds 100 thousand per second Fast algorithms in local computer farm Software trigger in <1 second Muon tracks Energy deposits Few 100 per second Recorded for study