The High Performance Archiver for the LHC Experiments

Similar documents
Oracle at CERN CERN openlab summer students programme 2011

Oracle for administrative, technical and Tier-0 mass storage services

WinCC OA Archiving in a Nutshell. R. Kulaga BE-ICS-FD

Front-End Electronics Configuration System for CMS. Philippe Gras CERN - University of Karlsruhe

The LCG 3D Project. Maria Girone, CERN. The 23rd Open Grid Forum - OGF23 4th June 2008, Barcelona. CERN IT Department CH-1211 Genève 23 Switzerland

An SQL-based approach to physics analysis

AN OVERVIEW OF THE LHC EXPERIMENTS' CONTROL SYSTEMS

Design specifications and test of the HMPID s control system in the ALICE experiment.

Control slice prototypes for the ALICE TPC detector

CouchDB-based system for data management in a Grid environment Implementation and Experience

Short Introduction to DCS, JCOP Framework, PVSS. PVSS Architecture and Concept. JCOP Framework concepts and tools.

ATLAS DCS Overview SCADA Front-End I/O Applications

Database Services at CERN with Oracle 10g RAC and ASM on Commodity HW

Motivation Requirements Design Examples Experiences Conclusion

Detector Control System for Endcap Resistive Plate Chambers

Experiment

Evolution of Database Replication Technologies for WLCG

Detector controls meets JEE on the web

Evolution of Database Replication Technologies for WLCG

Worldwide Production Distributed Data Management at the LHC. Brian Bockelman MSST 2010, 4 May 2010

Scientific data processing at global scale The LHC Computing Grid. fabio hernandez

Netherlands Institute for Radio Astronomy. May 18th, 2009 Hanno Holties

Virtualizing a Batch. University Grid Center

SMI++ object oriented framework used for automation and error recovery in the LHC experiments

Andrea Sciabà CERN, Switzerland

IN a system of many electronics boards of many different

CERN Tape Archive (CTA) :

COOL performance optimization using Oracle hints

ATLAS Nightly Build System Upgrade

The LHC Computing Grid. Slides mostly by: Dr Ian Bird LCG Project Leader 18 March 2008

PROOF-Condor integration for ATLAS

Storage and I/O requirements of the LHC experiments

Storage Optimization with Oracle Database 11g

2008 JINST 3 S Control System. Chapter Detector Control System (DCS) Introduction Design strategy and system architecture

THE ATLAS DATA ACQUISITION SYSTEM IN LHC RUN 2

Summary of the LHC Computing Review

Introduction. Architecture Overview

Database application research in real-time data access of accelerator control system

Ideas for evolution of replication CERN

Detector Control LHC

Overview. About CERN 2 / 11

Distributed Systems COMP 212. Lecture 18 Othon Michail

THE ATLAS experiment comprises a significant number

T HE discovery at CERN of a new particle

JCOP Workshop III. Status of JCOP Activities. 5 th & 6 th June Wayne Salter, CERN IT-CO

LHCb Computing Status. Andrei Tsaregorodtsev CPPM

CERN s Business Computing

Stefan Koestner on behalf of the LHCb Online Group ( IEEE - Nuclear Science Symposium San Diego, Oct.

The LHC Computing Grid

Storage Resource Sharing with CASTOR.

HammerCloud: A Stress Testing System for Distributed Analysis

Fast, In-Memory Analytics on PPDM. Calgary 2016

Experience and lessons learnt from running high availability databases on network attached storage

The ATLAS Conditions Database Model for the Muon Spectrometer

The ATLAS Tier-3 in Geneva and the Trigger Development Facility

Oracle Platform Performance Baseline Oracle 12c on Hitachi VSP G1000. Benchmark Report December 2014

The CMS data quality monitoring software: experience and future prospects

The CERN Detector Safety System for the LHC Experiments

The CORAL Project. Dirk Düllmann for the CORAL team Open Grid Forum, Database Workshop Barcelona, 4 June 2008

Oracle Primavera P6 Enterprise Project Portfolio Management Performance and Sizing Guide. An Oracle White Paper April 2011

Clustering and Reclustering HEP Data in Object Databases

Dell Reference Configuration for Large Oracle Database Deployments on Dell EqualLogic Storage

CERN openlab II. CERN openlab and. Sverre Jarp CERN openlab CTO 16 September 2008

The ATLAS EventIndex: Full chain deployment and first operation

First experiences with the ATLAS pixel detector control system at the combined test beam 2004

Hitachi Converged Platform for Oracle

Fault Detection using Advanced Analytics at CERN's Large Hadron Collider

Veritas InfoScale Enterprise for Oracle Real Application Clusters (RAC)

Oracle for administrative, technical and Tier-0 mass storage services

CLOUDS OF JINR, UNIVERSITY OF SOFIA AND INRNE JOIN TOGETHER

Data Sheet: Storage Management Veritas Storage Foundation for Oracle RAC from Symantec Manageability and availability for Oracle RAC databases

CIT 668: System Architecture. Amazon Web Services

Control and Monitoring of the Front-End Electronics in ALICE

Database Level 100. Rohit Rahi November Copyright 2018, Oracle and/or its affiliates. All rights reserved.

Using MySQL in a Virtualized Environment. Scott Seighman Systems Engineer Sun Microsystems

Outline. Database Tuning. Ideal Transaction. Concurrency Tuning Goals. Concurrency Tuning. Nikolaus Augsten. Lock Tuning. Unit 8 WS 2013/2014

Copyright 2011, Oracle and/or its affiliates. All rights reserved.

Veritas Storage Foundation for Oracle RAC from Symantec

CERN IT-DB Services: Deployment, Status and Outlook. Luca Canali, CERN Gaia DB Workshop, Versoix, March 15 th, 2011

Distributed e-infrastructures for data intensive science

Control and Spy the TELL1. Cédric Potterat EPFL LPHE

Improving Packet Processing Performance of a Memory- Bounded Application

Data Transfers Between LHC Grid Sites Dorian Kcira

Scale out databases for CERN use cases

Using the In-Memory Columnar Store to Perform Real-Time Analysis of CERN Data. Maaike Limper Emil Pilecki Manuel Martín Márquez

The CMS Computing Model

Batch Services at CERN: Status and Future Evolution

for the Physics Community

CMS users data management service integration and first experiences with its NoSQL data storage

ATLAS Oracle database applications and plans for use of the Oracle 11g enhancements

Database Management and Tuning

Database Solutions Engineering. Best Practices for Deploying SSDs in an Oracle OLTP Environment using Dell TM EqualLogic TM PS Series

Data Quality Monitoring Display for ATLAS experiment

The LHC computing model and its evolution. Dr Bob Jones CERN

The ATLAS EventIndex: an event catalogue for experiments collecting large amounts of data

Conference The Data Challenges of the LHC. Reda Tafirout, TRIUMF

ATLAS Data Management Accounting with Hadoop Pig and HBase

ORACLE 11g RDBMS Features: Oracle Total Recall Oracle FLEXCUBE Universal Banking Release [May] [2017]

Contents Overview of the Gateway Performance and Sizing Guide... 5 Primavera Gateway System Architecture... 7 Performance Considerations...

Accelerating Throughput from the LHC to the World

Transcription:

The High Performance Archiver for the LHC Experiments Manuel Gonzalez Berges CERN, Geneva (Switzerland) CERN - IT Department CH-1211 Genève 23 Switzerland www.cern.ch/it

Outline Context Archiving in PVSS PVSS Client Database Server Conclusions Controls CERN IT Department CH-1211 Genève 23 Switzerland www.cern.ch/it The High Performance Archiver for the LHC Experiments - 2

The LHC Experiments ALICE ATLAS CMS LHCb Worldwide collaborations One order of magnitude bigger than previous generation (~1 million i/o) Complex operation Many subdetectors and subsystems Lifetime of 20 years Common approach to controls (Joint Controls Project JCOP) Currently installing and commissioning The High Performance Archiver for the LHC Experiments - 3

LHC experiment control system Control System Back End Front End Detector hardware Back End Linux/Windows PCs (~150) PVSS + JCOP Framework Main functions Graphical user interfaces Operation as a set of Finite State Machines (FSM) Alarm handling Interface to external systems Archiving Front End Several technologies PCs, PLCs, Embbeded computers, etc Main functions Data acquisition Filtering Real time loops & FSM Interlocks The High Performance Archiver for the LHC Experiments - 4

LHC experiments archiving Purpose Debugging of the control system Operation Physics offline analysis (conditions data) Requirements Data storage Single computer peak rate of 2000 changes/s Full application sustained rate of ~150 000 changes/s (~150 computers at 1000 changes/s) Data retrieval Optimize known common queries Get a set of values for a specified time range Snapshot at a given time Other queries will come with usage The High Performance Archiver for the LHC Experiments - 5

Archiving in PVSS Commands/ Settings PVSS system Alarms Storage Commands/ Settings Monitored values Front End/External systems The High Performance Archiver for the LHC Experiments - 6

File vs Database archiver in PVSS File archiver Local hardisk of each computer Issues with managing many big files Proprietary format Database archiver Centralized server Relational database Initially developed for some specific customers Performance far from the extreme requirements of the LHC experiments Close collaboration ETM CERN to improve it CERN expertise in databases Facilities to test on very large systems The High Performance Archiver for the LHC Experiments - 7

Tables EVENTHISTORY_00000001 (f rom BTO_PVSSRDB) ELEMENT_ID : NUMBER(20, 0) TS : TIMESTAMP(9) STATUS : NUMBER(20, 0) MANAGER : NUMBER(20, 0) TEXT : VARCHAR2(4000) TYPE_ : NUMBER(20, 0) USER_ : VARCHAR2(4000) VALUE_STRING : VARCHAR2(4000) VALUE_NUMBER : NUMBER(38, 0) VALUE_TIMESTAMP : TIMESTAMP(9) CORRVALUE_STRING : VARCHAR2(4000) CORRVALUE_NUMBER : NUMBER(38, 0) CORRVALUE_TIMESTAMP : TIMESTAMP(9) OLVALUE_STRING : VARCHAR2(4000) OLVALUE_NUMBER : NUMBER(38, 0) OLVALUE_TIMESTAMP : TIMESTAMP(9) BASE : NUMBER(1, 0) <<PK>> PEVENTHISTORY_00000001() <<Index>> I3EVENTHISTORY_00000001() ALERTHISTORY_00000000 (f rom BTO_PVSSRDB ) ELEMENT_ID : NUMBER(20, 0) TS : TIMESTAMP(9) ACK_STATE : NUMBER(20, 0) ACK_TIME : TIMESTAMP(9) STATE : NUMBER(20, 0) ABBR : LANGSTRING ACK_TYPE : NUMBER(20, 0) ACK_USER : NUMBER(20, 0) ACKABLE : NUMBER(1, 0) ALERT_COLOR : VARCHAR2(4000) CLASS : VARCHAR2(4000) COMMENT_ : VARCHAR2(4000) DEST : NUMBER(20, 0) DEST_TEXT : LANGSTRING DIRECTION : NUMBER(1, 0) INACT_ACK : NUMBER(1, 0) PANEL : VARCHAR2(4000) PARTN_IDX : NUMBER(20, 0) PARTNER : TIMESTAMP(9) PRIO : NUMBER(20, 0) SINGLE_ACK : NUMBER(1, 0) TEXT : LANGSTRING TEXT0 : LANGSTRING TEXT1 : LANGSTRING TYPE_ : NUMBER(20, 0) VALUE_STRING : VARCHAR2(4000) VALUE_NUMBER : NUMBER(38, 0) VALUE_TIMESTAMP : TIMESTAMP(9) VISIBLE : NUMBER(1, 0) ALERT_FORE_COLOR : VARCHAR2(4000) ALERT_FONT_STYLE : VARCHAR2(4000) DETAIL : NUMBER(20, 0) BASE : NUMBER(1, 0) <<PK>> PALERTHISTORY_00000000() <<Index>> I1ALERTHISTORY_00000000() The High Performance Archiver for the LHC Experiments - 8

Single System Archiving (I) PVSS System User Interface Control Event Data Archive Oracle Database Server Driver Driver Simulation file Simulation file The High Performance Archiver for the LHC Experiments - 9

Single System Archiving (II) Test results ~100-300 changes/s depending on setup Bottleneck in Archive Generic interface to database Value changes sent one by one Improvements Use of Oracle native libraries (OCCI) Bulk insertion Client sends blocks to DB A function inside the DB inserts the data into the history tables Reduce number of connections to DB One permanent for storage/retrieval One temporary for configuration Possible to store 2000 changes/s continuously PVSS System User Interface Event Driver Driver Data Archive The High Performance Archiver for the LHC Experiments - 10

Test load on the database server PVSS System n PVSS System 2 PVSS System 1 User Interface Oracle Database Server Event Data Archive Driver Driver Simulation file Simulation file The High Performance Archiver for the LHC Experiments - 11

Performance of Oracle server Oracle server hardware PC with two Xeon processors, 3 GHz, 4 GBytes RAM, Red Hat Enterprise 4 Tests results Server can handle ~20-30 systems each at 1000 changes/s High CPU load Options for improvement Server with better hardware High cost, limited improvement Clustered server Low Cost (PCs) Redundancy -> high availability Easy to upgrade by adding more nodes Issues with scalability The High Performance Archiver for the LHC Experiments - 12

Test setup with RAC server PVSS System 150 PVSS System 2 Oracle Real Application Cluster (RAC) Server PVSS System 1 User Interface Event Data Archive Instance 1 Instance 2 Instance n Database Driver Driver Simulation file Simulation file The High Performance Archiver for the LHC Experiments - 13

Test results with RAC Initial test results 4 server nodes could handle ~ 50 systems Does not scale linearly The nodes interfere with each other Main improvements Direct path insertion Constraints disabled during insertion Can be inefficient in space usage History table partitioned per PVSS client Isolates clients Final performance Scalable server PVSS Systems 160 140 120 100 80 60 40 20 0 1 2 3 4 5 6 RAC nodes 6 nodes could handle 150 clients at 1000 changes/s each Possible to add more nodes Issue with allocation of new storage space (tablespace creation) The High Performance Archiver for the LHC Experiments - 14

Outlook Values Insertion Prototype for issue with new space allocation Space preallocated in advance or with background job Queries Alarms Continue with optimization of known queries Develop an API for external programs Bottlenecks identified ETM working on them Not extreme requirements Buffering to disk when DB not reachable The High Performance Archiver for the LHC Experiments - 15

Conclusions LHC experiments archiving requirements are met Improvements included in standard PVSS version since earlier this year Issues with retrieval and alarms understood New version expected for the end of the year Fruitful collaboration ETM-CERN Benefit from CERN expertise on databases Many solutions were explored with real size test bench Maintenance guaranteed by the company The archiver is currently in use for the commissioning of the LHC experiments The High Performance Archiver for the LHC Experiments - 16

Acknowledgements The following people have been involved in the work presented: CERN Chris Lambert Eric Grancher Laura Fernandez Luca Canali Milosz Hulboj Nilo Segura Pior Golonka Svetozar Kapusta ETM Ewald Sperrer Ronald Putz Oracle Lothar Flatz The High Performance Archiver for the LHC Experiments - 17

Questions? The High Performance Archiver for the LHC Experiments - 18