Development of a PCI Based Data Acquisition Platform for High Intensity Accelerator Experiments

Similar documents
50GeV KEK IPNS. J-PARC Target R&D sub gr. KEK Electronics/Online gr. Contents. Read-out module Front-end

Electronics on the detector Mechanical constraints: Fixing the module on the PM base.

A THESIS SUBMITTED TO THE GRADUATE DIVISION OF THE UNIVERSITY OF HAWAI I IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF

arxiv:physics/ v1 [physics.ins-det] 18 Dec 1998

Takeo Higuchi IPNS, KEK Belle DAQ group. 2009/01/07 SLAC Advanced Instrumentation Seminars

2008 JINST 3 S Online System. Chapter System decomposition and architecture. 8.2 Data Acquisition System

Detector Data Acquisition Hardware Designs and Features of NGC (New General Detector Controller)

Status of Control System. Hiroshi Kaji

Velo readout board RB3. Common L1 board (ROB)

Hera-B DAQ System and its self-healing abilities

Streaming Readout, the JLab perspective. Graham Heyes Data Acquisition Support Group Jefferson Lab

arxiv:cs/ v2 [cs.dc] 12 Mar 2004

Stefan Koestner on behalf of the LHCb Online Group ( IEEE - Nuclear Science Symposium San Diego, Oct.

A DAQ system for CAMAC controller CC/NET using DAQ-Middleware

SPECS : A SERIAL PROTOCOL FOR EXPERIMENT CONTROL SYSTEM IN LHCB.

USCMS HCAL FERU: Front End Readout Unit. Drew Baden University of Maryland February 2000

Construction of a compact DAQ-system using DSP-based VME modules

Data Acquisition in Particle Physics Experiments. Ing. Giuseppe De Robertis INFN Sez. Di Bari

Dominique Gigi CMS/DAQ. Siena 4th October 2006

Development of Beam Monitor DAQ system for 3NBT at J-PARC

Fast data acquisition measurement system for plasma diagnostics using GEM detectors

VXS-621 FPGA & PowerPC VXS Multiprocessor

BTeV at C0. p p. Tevatron CDF. BTeV - a hadron collider B-physics experiment. Fermi National Accelerator Laboratory. Michael Wang

Architecture Specification

Trigger Layout and Responsibilities

The ALICE TPC Readout Control Unit 10th Workshop on Electronics for LHC and future Experiments September 2004, BOSTON, USA

Development of a New TDC LSI and a VME Module

An FPGA Based General Purpose DAQ Module for the KLOE-2 Experiment

The MROD. The MDT Precision Chambers ROD. Adriaan König University of Nijmegen. 5 October nd ATLAS ROD Workshop 1

APV-25 based readout electronics for the SBS front GEM Tracker

arxiv: v2 [nucl-ex] 6 Nov 2008

MiniDAQ1 A COMPACT DATA ACQUISITION SYSTEM FOR GBT READOUT OVER 10G ETHERNET 22/05/2017 TIPP PAOLO DURANTE - MINIDAQ1 1

Artisan Technology Group is your source for quality new and certified-used/pre-owned equipment

Vertex Detector Electronics: ODE to ECS Interface

ATLAS TDAQ RoI Builder and the Level 2 Supervisor system

Signal Conversion in a Modular Open Standard Form Factor. CASPER Workshop August 2017 Saeed Karamooz, VadaTech

PCI-X Bus PCI Express Bus Variants for Portable Computers Variants for Industrial Systems

VXS-610 Dual FPGA and PowerPC VXS Multiprocessor

arxiv: v1 [nucl-ex] 26 Oct 2008

Update on PRad GEMs, Readout Electronics & DAQ

The CMS Event Builder

Modules and Front-End Electronics Developments for the ATLAS ITk Strips Upgrade

Readout Systems. Liquid Argon TPC Analog multiplexed ASICs SiPM arrays. CAEN 2016 / 2017 Product Catalog

The White Rabbit Project

SoLID GEM Detectors in US

ATLANTIS - a modular, hybrid FPGA/CPU processor for the ATLAS. University of Mannheim, B6, 26, Mannheim, Germany

16-Channel 16-Bit PMC Analog I/O Board

The Track-Finding Processor for the Level-1 Trigger of the CMS Endcap Muon System

Beam test measurements of the Belle II vertex detector modules

IBM Network Processor, Development Environment and LHCb Software

Overview of SVT DAQ Upgrades. Per Hansson Ryan Herbst Benjamin Reese

Prototyping NGC. First Light. PICNIC Array Image of ESO Messenger Front Page

S-LINK: A Prototype of the ATLAS Read-out Link

Heavy Photon Search Data Acquisition

Field Program mable Gate Arrays

CMS High Level Trigger Timing Measurements

LHC Detector Upgrades

GPS time synchronization system for T2K

Novel Intelligent I/O Architecture Eliminating the Bus Bottleneck

Detector Control LHC

LHCb Online System BEAUTY-2002

The First Integration Test of the ATLAS End-Cap Muon Level 1 Trigger System

Track-Finder Test Results and VME Backplane R&D. D.Acosta University of Florida

The Fourth Level Trigger Online Reconstruction Farm of HERA-B 1

XMC Adapter for 3U cpci/pxi

EPC -8 Highly integrated Intel486 -based VMEbus Embedded CPU

The GTPC Package: Tracking and Analysis Software for GEM TPCs

Embedded LLRF Controller with Channel Access on MicroTCA Backplane Interconnect

RT2016 Phase-I Trigger Readout Electronics Upgrade for the ATLAS Liquid-Argon Calorimeters

DESIGN OF THE DATA ACQUISITION SYSTEM FOR THE NUCLEAR PHYSICS EXPERIMENTS AT VECC

Advanced NI-DAQmx Programming Techniques with LabVIEW

PXI EXPRESS SPECIFICATION TUTORIAL

PC/104 Modules. Short Form. The complete embedded PC solution. CPUs: x86, 386SX, 486DX4, Pentium

Development of DAQ-Middleware

The Use of LabVIEW FPGA in Accelerator Instrumentation.

1. Introduction. Outline

Scintillator-strip Plane Electronics

Abstract. * Supported by U.S. D.O.E. Grant DE-FG02-96ER M.W. Bongard, APS-DPP, Denver, CO, October 2005

PC104P-24DSI Channel 24-Bit Delta-Sigma PC104-Plus Analog Input Board

64-Channel, 16-Bit Simultaneous Sampling PMC Analog Input Board

The hardware implementation of PXI/PXIe consists of a chassis, controller or computer interface, and peripheral cards.

Chapter 8: Input and Output. Principles of Computer Architecture. Principles of Computer Architecture by M. Murdocca and V.

TORCH: A large-area detector for precision time-of-flight measurements at LHCb

CPCI-16AIO Channel 16-Bit Analog I/O CPCI Board With 8 Input Channels, 8 Output Channels, and Auto calibration

PROGRESS ON ADF BOARD DESIGN

MIL-STD-1553 (T4240/T4160/T4080) 12/8/4 2 PMC/XMC 2.0 WWDT, ETR, RTC, 4 GB DDR3

VME Data Acquisition System, ADC Read

Construction of the Phase I upgrade of the CMS pixel detector

Development and test of a versatile DAQ system based on the ATCA standard

CPCI-16HSDI. 16-Bit, Six-Channel Sigma-Delta Analog Input Board. With 1.1 MSPS Sample Rate per Channel, and Two Independent Clocks.

Modular & reconfigurable common PCB-platform of FPGA based LLRF control system for TESLA Test Facility

64-Channel, 16-Bit Simultaneous Sampling PMC Analog Input Board

The Belle Silicon Vertex Detector. T. Tsuboyama (KEK) 6 Dec Workshop New Hadrons with Various Flavors 6 7 Dec Nagoya Univ.

Introduction to TDC-II and Address Map

The Front-End Driver Card for the CMS Silicon Strip Tracker Readout.

Modern State Space Representation Using ADwin DAQ Systems

New Development of EPICS-based Data Acquisition System for Millimeter-wave Interferometer in KSTAR Tokamak

A LVL2 Zero Suppression Algorithm for TRT Data

Ethernet Networks for the ATLAS Data Collection System: Emulation and Testing

Event-Synchronized Data Acquisition System of 5 Giga-bps Data Rate for User Experiment at the XFEL Facility, SACLA

Transcription:

Development of a PCI Based Data Acquisition Platform for High Intensity Accelerator Experiments T. Higuchi, H. Fujii, M. Ikeno, Y. Igarashi, E. Inoue, R. Itoh, H. Kodama, T. Murakami, M. Nakao, K. Nakayoshi, M. Saitoh, S. Shimazaki, S. Y. Suzuki, M. Tanaka, K. Tauchi, M. Yamauchi, Y. Yasu High Energy Accelerator Research Organization (KEK), Tsukuba G. Varner University of Hawaii, Honolulu, Hawaii Y. Nagasaka Hiroshima Institute of Technology, Hiroshima T. Katayama, K. Watanabe Densan Co. Ltd., Tokyo M. Ishizuka, S. Onozawa, C. J. Li Designtech Co. Ltd., Tokyo Data logging at an upgraded KEKB accelerator or the J-PARC facility, currently under commission, requires a high density data acquisition platform with integrated data reduction CPUs. To follow market trends, we have developed a DAQ platform based on the PCI bus, a choice which permits a fast DAQ and a long expected lifetime of the system. The platform is a 9U-VME motherboard consisting of four slots for signal digitization modules, readout s for data buffering, and three PMC slots, on one of which resides a data reduction CPU. We have performed long term and thermal stability tests. The readout speed on the platform has been measured up to 125 MB/s in DMA mode. 1. INTRODUCTION The Belle collaboration [1] has established the validity of the Kobayashi-Maskawa scheme of CP violation and has measured various quantities in B meson decays [2]. These measurements have been facilitated by the KEKB accelerator [3], which is an e + e collider with a remarkable luminosity greater than L > 10 34 cm 2 s 1. To extend the physics coverage to precise determination of the CKM matrix elements, as well as a search for new physics, we are planning a significant upgrade of the experiment for operating at a luminosity of L > 10 35 cm 2 s 1. Concurrently, planning for experiments at a high intensity proton beam facility, J-PARC, which utilizes fixed target [5] collisions, is maturing. This experimental complex enables us to explore physics topics such as neutrino mass difference or CP violation, as well as a precise determination of ɛ /ɛ in K meson decays. In the high-luminosity experiments the level-1 trigger rate and the data size per event will increase to 10 khz and 50-100 kb, respectively, and an overall requirement for the data acquisition system (DAQ) is typically estimated as 500-1000 MB/s. To match these boundary conditions, several DAQ components need to be developed: triggers, timing distributors, DAQ platforms, computer farms, etc. 1.1. Overview of Current Belle DAQ Figure 1 illustrates global data flow through the current Belle DAQ system [6]. Hit signals from the Belle detector are transferred to electronics hut via differential or other cables. In the electronics hut, hit signals are converted from analog to digital at the DAQ platform and are stored in an event buffer to minimize dead-time. These data are then fetched by CPUs and experience pedestal subtraction, data suppression, and data formatting. The formatted data are sent to an event builder to be combined into a single event record. Output from the event builder is then transferred to the online computer farm. At the online computer farm, data formatting, calibration, event reconstruction, and event filtering, known collectively as level-3 trigger processing, is performed. As the final step, the online computer farm transfers the data to a mass storage system. A level-1 hardware trigger module is drawn in Figure. 1. Each sub-detector generates trigger signals (known as level-0 triggers), which are fed to global decision logic, which determines whether the event should be acted upon or discarded. This trigger decision is first fed to a sequence controller that ensures a correct timing handshaking by detecting busy signals from each DAQ platform. The sequence controller routes the trigger signal to the trigger timing distributor, which then fans-out trigger signals to the DAQ platforms to initiate the data fetch by the CPUs. 1.2. Design of a New DAQ Platform To minimize bandwidth from the detector, data size should be reduced as much as possible. There are two major trends of data reduction: one is complex triggering at lower levels, while the other is a data re- 1

DAQ Platform Belle Event Builder Trigger Module Online Computer Farm Storage Figure 1: An illustration of the global data flow of the current Belle DAQ system. duction by online processors. To save limited human resources, we are encouraged to use commercial products. From the view point of the market trends, ASICs (Application-Specific Integrated Circuits) are becoming very popular; however, an evolutionary storm in the computer market is also brewing. Considering these matters, we have decided to develop a new DAQ platform equipped with CPUs for data compression. The internal bus of the platform should be fast. The PCI bus can perform up to 133 MB/s, which is faster than the VME bus, upon which the current Belle DAQ system is based. The PCI bus also has an advantage in compactness and modularity by providing a mezzanine card standard, known as PMC PCI mezzanine card. Since the data size is expected to triple, a smaller unit platform is preferred so as to keep the space requirements within the electronics hut manageable. The modularity of the PMC is especially beneficial when considering upgrade of the processor PMC module. Moreover, because the PCI bus has spread so widely throughout the world, it is unlikely to die quickly and current new technologies show little promise to improve on it. Thus we can aim for a long lifetime of the PCI-based DAQ platform, while using newest products. Higher luminosity consequently requires a pipelined DAQ system. It is necessary for the platform to be equipped with readout s to buffer events. In summary, we have chosen an on-board DAQ platform based on the PCI bus [7], which contains a CPU for data compression and readout s for event buffering. 2. COMMON DAQ PLATFORM The new DAQ platform consists of two components, the COPPER (COmmon Pipelined Platform for Electronics Readout) and the SPIGOT (Sequencing Pipeline Interface with Global Oversight and Timing). The COPPER is a 9U-VME sized card which serves as the platform for interfacing the front-end specific electronics, via a data compression CPU, to the event builder. Detector input connections are made through FINESSE (Front-end INstrumentation Entity for Sub-detector Specific Electronics) A/D conversion modules, which transfer their data to readout s instrumented on the COPPER board. The SPIGOT, which interfaces with the COPPER at its backplane (rear) side, is a 6U-VME sized board which includes a trigger module that issues trigger decisions to the FINESSE daughter-cards on the COPPER. In the following subsections, we give description of the COPPER and the SPIGOT. 2.1. COPPER Figure 2 shows a block diagram of a single COP- PER board. The COPPER is a 9U-VME sized board. It is equipped with four slots for FINESSE modules, readout s, and three PMC slots connected to the PCI bus. A processor module for the data compression resides at one of three PMC slots. Detector hit signals are first put into one of (up to) four FINESSE cards, at which the signals are digitized. Upon receipt of a level-1 trigger by the FI- NESSE from a trigger module on the SPIGOT, the FINESSE pushes data into the readout s. Each readout is connected to the local bus on the COPPER. A local-pci bus bridge, PCI9054 [8], collects data from the s and transfers them to the main memory in the processor module on the PCI bus using DMA. A local bus sequencer on the local bus counts the number of words in each readout associated to every trigger to provide the event size to the processor module and to throttle the level-1 trigger when the s get full. In the processor module, complicated data compression is performed: pedestal subtraction, zero suppression, and data formatting. Formatted data is subsequently transferred from the processor via an external link device to the event builder. 2

detector signals mezzanine (add-on) module trigger, clock local bus Bridge Bus CTRL PCI bus PMC Memory Network CPU trigger interrupt Figure 2: A block diagram of the COPPER board. Figure 3: A photograph of the COPPER board, with various functional components and interfaces indicated. The COPPER has RJ45 port so that the processor module on the COPPER can be operated through the network. The COPPER also has a VME-PCI dual port memory to be redundantly operated from the VME master module and to export the event data at an very early stage of the system development. The photograph of the COPPER is shown in Figure 3. 2.2. SPIGOT The SPIGOT is a 6U-VME sized board upon which is mounted a trigger module and up to two PMC modules, where one of two is intended to be used for PMC link device, typically an Ethernet card, connected to the event builder. The SPIGOT resides behind the COPPER, mounted in a transition module location, offset to align at the bottom of the P3 plane location. The PCI bus on the SPIGOT is electrically connected to the one on the COPPER via PCI-PCI bridge on the COPPER. The trigger module distributes a trigger timing and 3

sampling clock to each FINESSE module, the signals of which come from an upstream transmitter module. The trigger module includes a trigger, which is accessible to the processor via the PCI bus, to hold an event counter and a trigger type. This trigger information is included in a header of each event by the processor. 3. FINESSE The FINESSE is a mezzanine card with dimensions of 71.0 186.0 mm 2 and is mounted on the COPPER. Each FINESSE samples and digitizes detector hit signals according to either an external or internal clock, holds digitized signals in its synchronous buffer, and transfers the digitized data to readout s on the COPPER upon receipt of a level-1 trigger signal. A block diagram of the FINESSE is shown in Figure 4. A photograph of a prototype FINESSE is shown in Figure 5. The FINESSE receives an external sampling clock for A/D conversion continuously from the trigger module in LVDS format, and stores the digitized signals to the synchronous buffer according to this clock. Once the FINESSE receives a trigger, it freezes the relevant contents of the synchronous buffer moves these contents to a readout on the COPPER. The FINESSE shall ensure a busy handshaking with the trigger module. Upon receipt of a level-1 trigger, each FINESSE shall assert trigger busy until all data associated with the trigger are flushed to the readout. Delivery of the next trigger by the trigger module is suspended until all busy signals asserted by each of four FINESSEs are cleared. A/D conversion can be as accurate as 16 bits. To utilize full PCI bandwidth, two data words are fetched by the processor in parallel simultaneously. The first 16-bit data word appears in the lower 16-bits of the local data bus and the trailing 16-bit word does in the higher. Write clocks to the readout s during busy assertion are spied upon by a local bus sequencer to count the number of stored words in the readout. 4. PMC PROCESSOR MODULE The PMC processor module on the COPPER receives data from the readout s, reduces the data size, and sends this formatted data to the event builder. To develop the data reduction software easily, the processor should be capable of handling a familiar operating system such as Linux. It is also important for us to choose a CPU that is on a concrete upgrade path. Our answer for the processor module is to use Intel processor based on a PC architecture. Another constraint comes from the power consumption. Because of the limited power supply capacity in a standard VME crate and because of the limits imposed by air cooling, the power consumption by the processor module should be less than 20 W. We purchased EPC-6315 CPUs provided by RadiSys Corp. [9], equipped with Intel Pentium III processors operating with an 800 MHz clock. In this configuration the processor consumes up to 17 W of electrical power. Figure 6 is a photograph of the EPC- 6315. We evaluated the performance of the EPC-6315 by means of a pseudo-data-compression software. We assume that each FINESSE converts 16-channels of analog signal into 8-bit digital signals. If we set the sampling rate and sampling window as 100 MHz and 500 ns, respectively, the required data processing rate by the processor should be more than 32 MB/s at a 10 khz trigger rate. The test software works with Monte-Carlo-generated hit signals. It copies data from a virtual readout (memory) to a work area, subtracts pedestals, suppresses channels without hits, and compresses data into hit timing and induced charge. At 20% occupancy, the processing speed is estimated 94 MB/s, which is marginal for our application. We are developing a network based boot-up mechanism to handle the large number of processors. 5. PERFORMANCE STUDY We have studied long-term stability, thermal stability, and readout performance of the new DAQ platform. Figure 7 shows a block diagram of the test setup. Data are generated on the prototype FINESSEs according to a pre-defined table and are read by the processor module (EPC-6315) on the COPPER. After a data integrity test on the processor, the data are transferred to the data receiver process running on the Linux PC through a network interface module on the SPIGOT. In the receiver process, event header/footer, event counter, and the contents are checked. The event counter should be same for the four FINESSE modules and increase by 1 for each trigger. The data contents should be consistent with the pre-defined table. A 10 Hz random trigger is generated by the trigger module on the SPIGOT and fed to the prototype FINESSEs. Messages from the processor module are monitored via a serial terminal. We operated the DAQ platform in the room temperature for 48 hours and in a thermostatic oven for 77 hours, varying the temperature from 10 Cto50 C. We detected no errors in the receiver process. We con- 4

FINESSE Setup Register Local Bus Detector Signals ADC/TDC Buffer 16-bit data 2 Event Clock Trigger Trigger Busy Trigger Timing Distributor Figure 4: A block diagram of the FINESSE. Digitized signals from the ADCs or TDCs are stored in a synchronous buffer to minimize dead-time due to trigger latency. 6. SUMMARY Figure 5: A photograph of a prototype FINESSE. A new DAQ platform is mandated for high-rate experiments to operate at an upgraded KEKB and a new J-PARC accelerator. To realize a long lifetime of a DAQ platform while utilizing the newest products, we have designed a platform motherboard based on the PCI bus and with an on-board data compression CPU, equipped with a moderate computational capability. Signal digitization modules, to be housed on the motherboard, and a trigger distribution scheme are also designed. The platform is measured to be stable and has achieved DMA data transfer speeds of 125 MB/s. Acknowledgments This work is supported by Grant-in-Aid for Scientific Research (C) and Grant-in-Aid for Scientific Research on Priority Areas (B) from the Japan Society for the Promotion of Science and the Ministry of Education, Culture, Sports, Science and Technology. Figure 6: A photograph of the PMC processor board, a RadiSys EPC-6315. clude the platform works stably for extended periods and is tolerant of changes in the thermal environment. We measured the data transfer speed on the COP- PER from the readout s to the CPU main memory using DMA. We determined a transfer rate of 125 MB/s without DMA setup overhead, which corresponds to 94% of full PCI-32 perfomance. We will continue to evaluate the system performance. References [1] S. Mori et al. [BELLE Collaboration], Nucl. Instrum. Meth. A 479, 117 (2002). [2] A. Abashian et al. [BELLE Collaboration], Phys. Rev. Lett. 86, 2509 (2001); K. Abe et al. [Belle Collaboration], Phys. Rev. Lett. 87, 091802 (2001); [3] B. Kikutani ed., KEK Preprint 2001-157 (2001), to appear in Nucl. Instrum. Meth. A. [4] Belle Collaboration, Expression of Interest in A High Luminosity Upgrade of the KEKB Collider and the Belle Detector, http://belle.kek.jp/ yamauchi/eoi.ps.gz. 5

Figure 7: A schematic view of the test setup for the new DAQ platform. Orange arrows show the data flow. Data generated on the prototype FINESSEs are read by the processor module (a RadiSys EPC-6315) on the COPPER. After a data integrity check by the processor, the data are transferred to the data receiver process running on a Linux PC through a network interface module on the SPIGOT. A 10 Hz random trigger is generated by the trigger module on the SPIGOT and fed to the prototype FINESSEs. Messages from the processor module are monitored on a serial terminal. [5] The Joint Project Team of JAERI and KEK, The Joint Project for High-Intensity Proton Accelerators, KEK Report 99-4, July 1999. [6] S. Y. Suzuki et al., Belle DAQ system upgrade at 2001, to appear in Nucl. Instrum. Meth. A. [7] The specifications and standards for the PCI bus are maintained by the PCI-SIG (PCI Special Interests Group), a commercial consortium of computer vendors. Documentation is available for purchase on-line: http://www.psisig.com. [8] PLX Technology Inc., PCI 9054 data book version 2.1, (2000). [9] RadiSys Co., EPC-6315 Hardware Reference, (2002). 6