The coolest place on earth

Similar documents
The coolest place on earth

Report. Middleware Proxy: A Request-Driven Messaging Broker For High Volume Data Distribution

Overview. About CERN 2 / 11

Connectivity Services, Autobahn and New Services

SOA-14: Continuous Integration in SOA Projects Andreas Gies

Scientific data processing at global scale The LHC Computing Grid. fabio hernandez

INSPECTOR, A ZERO CODE IDE FOR CONTROL SYSTEMS USER INTERFACE DEVELOPMENT

Solace JMS Broker Delivers Highest Throughput for Persistent and Non-Persistent Delivery

The CESAR Project using J2EE for Accelerator Controls

ORACLE ENTERPRISE MANAGER 10g ORACLE DIAGNOSTICS PACK FOR NON-ORACLE MIDDLEWARE

WLCG Transfers Dashboard: a Unified Monitoring Tool for Heterogeneous Data Transfers.

RED HAT JBOSS FUSE. A lightweight, flexible integration platform

THE LHC CONTROL SYSTEM

Overview of DCS Technologies. Renaud Barillère - CERN IT-CO

CSCS CERN videoconference CFD applications

GlassFish v2 AS/MQ integration changes. Presentation to AS user experience meeting 27 th Sep 2006

Red Hat AMQ 7.2 Introducing Red Hat AMQ 7

Oracle WebLogic Server 11g: Administration Essentials

Control and Monitoring of the Front-End Electronics in ALICE

High Throughput WAN Data Transfer with Hadoop-based Storage

Broker Clusters. Cluster Models

High Performance Computing Course Notes Grid Computing I

Fault Detection using Advanced Analytics at CERN's Large Hadron Collider

CC-IN2P3: A High Performance Data Center for Research

Quick Start. Scalable Deployers in SDL Web 8.5. Feb 2017 SDL Web. Document owner: Richard Hamlyn

Introduction to Protocols for Realtime Data Sharing. Deepti Nagarkar

Oracle WebLogic Diagnostics and Troubleshooting

Grid Computing a new tool for science

Smart Data for. Industrial Control Systems. CERN Technical Workshop

Real World Messaging With Apache ActiveMQ. Bruce Snyder 7 Nov 2008 New Orleans, Louisiana

Contents at a Glance. vii

The Compact Muon Solenoid Experiment. Conference Report. Mailing address: CMS CERN, CH-1211 GENEVA 23, Switzerland

Ultra Messaging Queing Edition (Version ) Guide to Queuing

Conference The Data Challenges of the LHC. Reda Tafirout, TRIUMF

BEAWebLogic. Server. Automatic and Manual Service-level Migration

a new Remote Operations Center at Fermilab

The AAL project: automated monitoring and intelligent analysis for the ATLAS data taking infrastructure

Diplomado Certificación

Quantitative and formal methods for the industrial control systems at CERN: dreams and reality

The CMS data quality monitoring software: experience and future prospects

MSG: An Overview of a Messaging System for the Grid

Global Collaboration on Accelerator Operations and Experiments

CERN openlab II. CERN openlab and. Sverre Jarp CERN openlab CTO 16 September 2008

The ATLAS EventIndex: Full chain deployment and first operation

Event cataloguing and other database applications in ATLAS

PROFINET COMMUNICATION CARD FOR THE CERN CRYOGENICS CRATE ELECTRONICS INSTRUMENTATION

ISTITUTO NAZIONALE DI FISICA NUCLEARE

Virtualizing a Batch. University Grid Center

BEAWebLogic Server. Introduction to BEA WebLogic Server and BEA WebLogic Express

RED HAT JBOSS FUSE A lightweight, lexible integration platform

Front-End Electronics Configuration System for CMS. Philippe Gras CERN - University of Karlsruhe

Administering WebLogic Server on Java Cloud Service I Ed 1 Coming Soon

IEPSAS-Kosice: experiences in running LCG site

Red Hat JBoss Fuse 6.1

Interoperating AliEn and ARC for a distributed Tier1 in the Nordic countries.

A MIMD Multi Threaded Processor

ELECTRICAL NETWORK SUPERVISOR STATUS REPORT

<Insert Picture Here> WebLogic JMS Messaging Infrastructure WebLogic Server 11gR1 Labs

Building Large Scale Distributed Systems with AMQP. Ted Ross

Architecting & Tuning IIB / extreme Scale for Maximum Performance and Reliability

ORACLE DATA SHEET KEY FEATURES AND BENEFITS ORACLE WEBLOGIC SUITE

Graphical System Design

OASIS: STATUS REPORT

High-Throughput Real-Time Network Flow Visualization

Andrea Sciabà CERN, Switzerland

MONTE CARLO SIMULATION FOR RADIOTHERAPY IN A DISTRIBUTED COMPUTING ENVIRONMENT

Create High Performance, Massively Scalable Messaging Solutions with Apache ActiveBlaze

We invented the Web. 20 years later we got Drupal.

Worldwide Production Distributed Data Management at the LHC. Brian Bockelman MSST 2010, 4 May 2010

FRONT-END SOFTWARE ARCHITECTURE [FESA]

SOFTWARE SCENARIO FOR CONTROL SYSTEM OF INDUS-2

Oracle WebLogic Server 12c: Administration I

Designing High Performance IEC61499 Applications on Top of DDS

ARES: AUTOMATIC RELEASE SERVICE

WEBSPHERE APPLICATION SERVER

Samsung SDS Enterprise Cloud

An ATCA framework for the upgraded ATLAS read out electronics at the LHC

ELFms industrialisation plans

Storage Resource Sharing with CASTOR.

Detector Control LHC

The Wuppertal Tier-2 Center and recent software developments on Job Monitoring for ATLAS

The LHC Computing Grid

Grid Computing: dealing with GB/s dataflows

Scalable Computing: Practice and Experience Volume 10, Number 4, pp

Purplefinder Enterprise Platform Messagng with ActiveMQ. Peter Potts 13 th October 2010

SCRIPT: An Architecture for IPFIX Data Distribution

X100 ARCHITECTURE REFERENCES:

3C05 - Advanced Software Engineering Thursday, April 29, 2004

Verification and Diagnostics Framework in ATLAS Trigger/DAQ

Auto Management for Apache Kafka and Distributed Stateful System in General

Big Computing and the Mitchell Institute for Fundamental Physics and Astronomy. David Toback

AMon A Monitoring System for ActiveMQ

How Cisco IT Improves Commerce User Experience by Securely Sharing Internal Business Services with Partners

Pragmatic Clustering. Mike Cannon-Brookes CEO, Atlassian Software Systems

NUCLEAIRE EUROPEAN ORGANIZATION FOR NUCLEAR RESEARCH CERN LIBRARIES, GENEVA CERN-ECP

System Administration of PTC Windchill 11.0

AutoPyFactory: A Scalable Flexible Pilot Factory Implementation

Red Hat JBoss Enterprise Application Platform 7.2

Achieving the Science DMZ

OpenIAM Identity and Access Manager Technical Architecture Overview

Transcription:

The coolest place on earth

Large Scale Messaging with ActiveMQ for Particle Accelerators at CERN 2

Overview Examples 30min Introduction to CERN Operation Usage of ActiveMQ 3

About the Speaker Member of CERN Beams Controls Group: Responsible for JMS Service Middleware Developer for Alarm and Monitoring System Previous Activities: Large Storage Systems Grid Environment felix.ehm@cern.ch 4

What is CERN? A European Organization for Nuclear Research originally : Conseil Européen pour la Recherche Nucléaire Founded in 1954 Based in Geneva, Switzerland 3 000 staff members 8 000 visiting scientists Financed by its member states Twenty Member States 5

Large Machines for High Energy Physics Particle colliders for tracing fragments 6

The CERN Campus 7

The Large Hadron Collider 8

But the LHC is not the only accelerator 9

A lot of systems to control Controls Computers Electricity Cryogenics 85 000 Devices > 2 Million I/O Endpoints Much more when including subsystems! Safety Cooling Ventilation Magnets Vacuum 10

Everything from one central point: The CERN Control Centre 11

USING ACTIVEMQ FOR ACCELERATOR CONTROLS 12

OPTICAL FIBERS WorldFIP SEGMENT (1, 2.5 MBits/sec) PROFIBUS FIP/IO Controls Architecture C++ OPERATOR CONSOLES FIXED DISPLAYS OPERATOR CONSOLES Java Communication FILE SERVERS C++ RT Lynx/OS VME Front Ends APPLICATION SERVERS WORLDFIP Front Ends C SCADA SERVERS PLC Java JMS Purpose: Reliable and scalable transport of data between Java processes T T T T T LHC MACHINE 13

History Early use of ActiveMQ already in 2005 We were looking for a free JMS solution Apache? Can t be bad! Why OpenSource? Low Costs We can read and check the code We can add / fix code 14

History First Setup Clients Java middle-tier servers and GUIs Middle-tier Servers Graphical User Interfaces Data access for Clients via Middleware API Middleware API Two interconnected Brokers Simple Setup, few projects, little data, easy to use. Did the job! 15

History - Evolution But: Service suffered by its own success More users and more data was sent around Higher QoS was requested Redundancy Queues, persistent messaging and global transactions (XA) Support for non Java clients We needed to adapt the infrastructure and needed to SCALE Production Brokers 16

Deployment Today 20 Production Brokers 10 single Brokers 5 HA Clusters No one large cluster but manageable entities Vital part of beam instrumentation and operation No JMS No Particle Physics! Dedicated single Brokers for selected applications 4 Dedicated HA Clusters for selected applications 1 HA Cluster for general usage 17

Deployment Today - Setup HA Clusters 2 equivalent broker members, no master-slave Two (real) machines with separate network links Single Broker For projects which do not require HA Deployed on same (real) machine as middle-tier server Local secured network together with Producers and Consumers Fuse ActiveMQ Distribution since 2010 Issues solved faster in service packs 18

Deployment Today Some numbers : 300 Applications 4 400 Connections 40 000 Subscriptions 85 000 Topics 68 000 Consumers 8 Million msg/h IN, 3.5M msg/h OUT Archived Uptime in 2011: 99.98% Not all data which is produced is consumed 19

Example of Data Handling during LHC Startup 2010 Output Data Handling for 1 Broker of the General Broker Cluster 150% unpredicted increase in data demand 2.5TB/day 1TB/day LHC Startup March 2010 20

Usage Cases Usage Case 1 Usage Case 2 Usage Case 3 Usage Case 4 Payload 2MByte 1 msg/sec 1 Topic 20-30 Java Clients Payload 500Bytes- 1KBytes 30-200 msg/sec 120 Queues 4 Million XAS/day 2 Clients Payload 200Byte - 10KByte 50-4500 msg/sec 10 000 Topics 2-5 Clients Payload <200 Bytes <10 msg/hour 1 Queue < 10 STOMP Clients Scalability Reliability Versatility Flexibility 21

OPERATION 22

Management of Brokers Monitoring via JMX Submission of test message Host machine monitoring Deployment Rollout scripts for deploying/rollback Configuration changes tracked in SVN 23

Diagnostic Tools Dump JMX information into SQLite database for fast, easy and intuitive access. JmxDumper tim.p_coolstpsh5 120 C2MON-SERVER-DAQ-IN-3_55 tim.p_coolmnoui5 80 C2MON-SERVER-DAQ-IN-3_145... select name, awaitack, clientid from subscriptions where awaitack > 0; 24

Diagnostic Tools Traffic Monitoring Tool Crazy Producer Collecting information by listening on Topic Message size & speed Fixed Allows history view on Average Message throughput Average Message size 25

Lessons learnt - Operation Separate usage cases from each other Easier than implementing per destination policies Broker restart does not affect others More clients, more memory Broker does not die when hitting memory limits Check if you need all features Maintenance overhead vs. failure probability & effect 26

Lessons learnt - Configuration Disable DedicatedTaskRunner: takes many threads Reduce the PrefetchLimit on clients side Couldn t get Broker discarding messages ConstantPrefetchLimit unset + brain-dead client = Out of Memory Memory is important But GC takes longer New Java GC options help -XX:+UseConcMarkSweepGC -XX:+CMSIncrementalMode -XX:+CMSIncrementalPacing -XX:CMSIncrementalDutyCycleMin=0 -XX:CMSIncrementalDutyCycle=10 -XX:+HeapDumpOnOutOfMemoryError 27

EXAMPLES 28

LHC Status Displays http://lmgtfy.com/?q=cern+vistars 29

Remote Control for Video Viewer Video Viewer Access Request Show Cam 1 Business Logic Button Panel Show Cam 1 Powered by VLC 30

Bamboo for Accelerators Run Tests for Hardware Commissioning 31

Animate Synoptic Displays Technical Infrastructure (Power, Cooling, Water, Doors, 32

Camel for Unifying Log Events Finding/Debugging a problem becomes cumbersome! Collecting and unifying log messages in one central place Easy correlation of events among many services Log Server? DB 33

Finally A great thanks to all developers! 34