PCIe40 output interface 01/08/2017 LHCB MINIDAQ2 WORKSHOP - PCIE - PAOLO DURANTE 1

Size: px
Start display at page:

Download "PCIe40 output interface 01/08/2017 LHCB MINIDAQ2 WORKSHOP - PCIE - PAOLO DURANTE 1"

Transcription

1 PCIe40 output interface LHCB MINIDAQ2 WORKSHOP 01/08/2017 LHCB MINIDAQ2 WORKSHOP - PCIE - PAOLO DURANTE 1

2 First of all MINIDAQ1 (AMC40) MINIDAQ2 (PCIE40) GBT GBT 10GbE PCIe 01/08/2017 LHCB MINIDAQ2 WORKSHOP - PCIE - PAOLO DURANTE 2

3 Outline Terminology Firmware side Firmware structure and data flow Data processing interface and bandwidth Multichannel DMA and data streams Driver side Driver structure Driver usage Software side Data management Performance Performance Performance Reminder: all software releases are tagged by release date and are available here: The list of released tags can be accessed here: 01/08/2017 LHCB MINIDAQ2 WORKSHOP - PCIE - PAOLO DURANTE 3

4 Terminology Interface ID: unique (within a server) PCIe40 driver identifier eg: 2 PCIe40 boards in a server -> 4 interfaces (0, 1, 2, 3) Link ID: local identifier (within a board) of a given interface eg: optical links > pcie link 0, optical links > pcie link 1 Interface address: topological address (bus/device/function) of an interface eg: 05:00.0 Communication controller: CERN/ECP/EDU Device ce40 (rev 01) Event ID: unique identifier (within an entire experiment/run) of a collision Fragment: the output data from a single TELL40 instance for a given Event ID eg: 1 PCIe40 board -> 2 fragments per Event ID Event: the output data from the full event builder for a given Event ID Host: the server housing a given PCIe40 board Stream: an independent, simplex (FPGA->HOST) data communication channel 01/08/2017 LHCB MINIDAQ2 WORKSHOP - PCIE - PAOLO DURANTE 4

5 Firmware dataflow Event Fragments DMA ctrl (x2) Stream parser TFC banks 250 MHz 40 MHz DMA stream (meta data) DMA stream (main data) DMA stream (odin data) optional MSI generator Descriptor scheduler PCIe Hard IP Gen3 x8 main meta = ~ /08/2017 LHCB MINIDAQ2 WORKSHOP - PCIE - PAOLO DURANTE 5

6 Data processing interface 01/08/2017 LHCB MINIDAQ2 WORKSHOP - PCIE - PAOLO DURANTE 6

7 Data streams Generic FPGA->PC communication mechanism implemented for the LHCb upgrade FPGA: custom DMA controller Host: custom Linux kernel driver and userspace software Three stream types in current design, more could be added if you have ideas: MAIN data stream Fully implemented (firmware and software) Transports all the frontend data aggregated by the TELL40 firmware META data stream Implemented in firmware, not fully integrated in current software Transports blocks of event metadata into the network event builder ODIN data stream Not yet instantiated, will be used to transport ODIN banks out of the SODIN boards Today we will be using the first stream only 01/08/2017 LHCB MINIDAQ2 WORKSHOP - PCIE - PAOLO DURANTE 7

8 Hard IP ECS kernel driver Arria 10 PCIe interface BAR0 (MMM) BAR2 (MMM) x8 HSSI libpcie40_ecs.so /dev/pcie40_0_bar0 /dev/pcie40_0_bar2 ecs_mmap() ecs_mmap() Kernel driver 01/08/2017 LHCB MINIDAQ2 WORKSHOP - PCIE - PAOLO DURANTE 8

9 Demo time! Read / write registers (eg: 0x700004) from the command line pcie40_ecs b {BAR} a {ADDRESS} r pcie40_ecs b {BAR} a {ADDRESS} w {VALUE} (reminder: BAR 0 = user code (you!), BAR 1 = hidden, BAR 2 = low level interface) or using the library: #include <lhcb/pcie40/ecs.h> -lpcie40_ecs p40_ecs_open(interface_id, bar_number, handle) p40_ecs_w32(handle, address, value) p40_ecs_r32(handle, address) 01/08/2017 LHCB MINIDAQ2 WORKSHOP - PCIE - PAOLO DURANTE 9

10 PCIe Hard IP DMA kernel driver Input data Memory descriptors (read) S Virtual memory map (Pseudo-IOMMU) Event builder (TODO) Data readout Board acceptance test > M BAR S M libpcie40_daq.so M S DMA stream DMA data (write) DMA buffer > > < M S Commands Status Control DMA data (read) < S M x8 HSSI /dev/pcie40_?_ctrl Kernel driver: device discovery stream discovery memory management interrupt management ioctls etc /dev/pcie40_?_xyz Virtual circular buffer W R 01/08/2017 LHCB MINIDAQ2 WORKSHOP - PCIE - PAOLO DURANTE 10

11 Demo time! Locate the device files ls /dev/pcie40_* Understand the reload procedure pcie40_reload Understand the output of the low-level DAQ command line pcie40_daq Run a data integrity test pcie40_daq -vrfegc Run a performance test pcie40_daq vrego Understand how to reset an interface pcie40_daq rt pcie40_daq Rt 01/08/2017 LHCB MINIDAQ2 WORKSHOP - PCIE - PAOLO DURANTE 11

12 CVP kernel driver Allows reprogramming the FPGA over PCIe bus Based on Altera reference implementation Will be necessary to upgrade firmware in the field when we scale out Partial success on Nallatech FPGA fw.core.rbf /dev/pcie40_0_cvp Kernel CvP helper PCIE40 cfg space Future work: create flash bootstrap image for PCIe40 and integrate support in driver Altera CvP VSEC 01/08/2017 LHCB MINIDAQ2 WORKSHOP - PCIE - PAOLO DURANTE 12

13 Demo time! Not yet, the CvP driver is still in development However, time for a few words about driver updates: Driver releases are managed in parallel with DAQ40 software releases eg: lhcb-pcie40-tools is always released together with lhcb-pcie40-driver There are a package version ( ), a core version (FPGA only, you don t care) and a register map version (FPGA & Driver) PCIe firmware and driver are versioned together register map version compatibility is checked at runtime Linux drivers are packaged with DKMS, this means: Kernel upgrades/downgrades are transparent Driver upgrades/downgrades are (mostly) transparent yum erase lhcb-pcie40-driver; yum install lhcb-pcie40-driver-yyyymmdd Driver releases in the stable channel track the master firmware branch 01/08/2017 LHCB MINIDAQ2 WORKSHOP - PCIE - PAOLO DURANTE 13

14 MiniDAQ2 readout Similar philosophy between MiniDAQ1 and MiniDAQ2 Common output file format (.frg) Contains fragments, not events! It s not.mdf, but serves similar purpose Comes with a C++ API MiniDAQ2 status Stable firmware interface Kernel drivers Command line tools Storage layer DIM interface WinCC integration TODO Emulation driver Event builder integration (longer term) FPGA Kernel User space DAQ streams MAIN.frg on-chip buffer host buffer consumer META To event builder (TODO) 01/08/2017 LHCB MINIDAQ2 WORKSHOP - PCIE - PAOLO DURANTE 14

15 Demo time! Capture some data using only the command line, check the FSM transitions pcie40_frgwriter D Check the data file: pcie40_frgreader V file.frg Examine the data file contents: pcie40_frgreader file.frg less Do the same using the C++ library: lhcb-daq40-software/daq40_frgreader/example Note: {pcie40,daq40,amc40}_frgreader are the same program 01/08/2017 LHCB MINIDAQ2 WORKSHOP - PCIE - PAOLO DURANTE 15

16 PCIe DIM interface Top Interfaces Streams COMMANDS STATE & STATUS Writers Stream FSM disk pcie_0 FSM MAIN META Buffer FSM Storage FSM Controller FSM ODIN MAIN pcie_1 FSM META ODIN Stream FSM Buffer FSM DIM Storage FSM 01/08/2017 LHCB MINIDAQ2 WORKSHOP - PCIE - PAOLO DURANTE 16

17 Demo time! Use the PCIe WinCC panel to understand PCIe40 readout performance: Observe buffer occupancy in performance mode Observe buffer occupancy in data integrity mode Observe buffer occupancy while going through the DAQ software Metadata acceleration to the rescue! 01/08/2017 LHCB MINIDAQ2 WORKSHOP - PCIE - PAOLO DURANTE 17

18 Metadata acceleration We see the software spends a lot of CPU and memory bandwidth to parse the fragment sequence and split it into blocks. Using metadata acceleration, all of this work is done in the FPGA, and the ready blocks are sent to the event builder using a dedicated stream. This function is already implemented in the firmware, but not yet fully in the software, will be necessary in order to run the event builder efficiently. 01/08/2017 LHCB MINIDAQ2 WORKSHOP - PCIE - PAOLO DURANTE 18

19 DQMP integration pcie40_stream_dma amc40_stream_udp daq40_blockbuffer daq40_frgwriter daq40_dqmonitor So far used successfully by RICH test beam with MiniDAQ1 SHMEM IPC 01/08/2017 LHCB MINIDAQ2 WORKSHOP - PCIE - PAOLO DURANTE 19

20 DQMP at work (F. Cindolo) 01/08/2017 LHCB MINIDAQ2 WORKSHOP - PCIE - PAOLO DURANTE 20

21 Questions? Comments? THANK YOU 01/08/2017 LHCB MINIDAQ2 WORKSHOP - PCIE - PAOLO DURANTE 21

MiniDAQ1 A COMPACT DATA ACQUISITION SYSTEM FOR GBT READOUT OVER 10G ETHERNET 22/05/2017 TIPP PAOLO DURANTE - MINIDAQ1 1

MiniDAQ1 A COMPACT DATA ACQUISITION SYSTEM FOR GBT READOUT OVER 10G ETHERNET 22/05/2017 TIPP PAOLO DURANTE - MINIDAQ1 1 MiniDAQ1 A COMPACT DATA ACQUISITION SYSTEM FOR GBT READOUT OVER 10G ETHERNET 22/05/2017 TIPP 2017 - PAOLO DURANTE - MINIDAQ1 1 Overview LHCb upgrade Optical frontend readout Slow control implementation

More information

A generic firmware core to drive the Front-End GBT-SCAs for the LHCb upgrade

A generic firmware core to drive the Front-End GBT-SCAs for the LHCb upgrade Journal of Instrumentation OPEN ACCESS A generic firmware core to drive the Front-End GBT-SCAs for the LHCb upgrade Recent citations - The Versatile Link Demo Board (VLDB) R. Martín Lesma et al To cite

More information

A generic firmware core to drive the Front-End GBT-SCAs for the LHCb upgrade

A generic firmware core to drive the Front-End GBT-SCAs for the LHCb upgrade A generic firmware core to drive the Front-End GBT-SCAs for the LHCb upgrade F. Alessio 1, C. Caplan, C. Gaspar 1, R. Jacobsson 1, K. Wyllie 1 1 CERN CH-, Switzerland CBPF Rio de Janeiro, Brazil Corresponding

More information

GitLab-CI for FPGA development at LHCb 21/11/2018 CERN ELECTRONICS USERS GROUP - GITLAB CI FOR FPGAS 1

GitLab-CI for FPGA development at LHCb 21/11/2018 CERN ELECTRONICS USERS GROUP - GITLAB CI FOR FPGAS 1 GitLab-CI for FPGA development at LHCb 21/11/2018 CERN ELECTRONICS USERS GROUP - GITLAB CI FOR FPGAS 1 The EP/LBC (Online) group 21/11/2018 CERN ELECTRONICS USERS GROUP - GITLAB CI FOR FPGAS 2 GitLab-CI

More information

Centre de Physique des Particules de Marseille. The PCIe-based readout system for the LHCb experiment

Centre de Physique des Particules de Marseille. The PCIe-based readout system for the LHCb experiment The PCIe-based readout system for the LHCb experiment K.Arnaud, J.P. Duval, J.P. Cachemiche, Cachemiche,P.-Y. F. Réthoré F. Hachon, M. Jevaud, R. Le Gac, Rethore Centre de Physique des Particules def.marseille

More information

2008 JINST 3 S Online System. Chapter System decomposition and architecture. 8.2 Data Acquisition System

2008 JINST 3 S Online System. Chapter System decomposition and architecture. 8.2 Data Acquisition System Chapter 8 Online System The task of the Online system is to ensure the transfer of data from the front-end electronics to permanent storage under known and controlled conditions. This includes not only

More information

Improving Packet Processing Performance of a Memory- Bounded Application

Improving Packet Processing Performance of a Memory- Bounded Application Improving Packet Processing Performance of a Memory- Bounded Application Jörn Schumacher CERN / University of Paderborn, Germany jorn.schumacher@cern.ch On behalf of the ATLAS FELIX Developer Team LHCb

More information

FPGA Augmented ASICs: The Time Has Come

FPGA Augmented ASICs: The Time Has Come FPGA Augmented ASICs: The Time Has Come David Riddoch Steve Pope Copyright 2012 Solarflare Communications, Inc. All Rights Reserved. Hardware acceleration is Niche (With the obvious exception of graphics

More information

The new detector readout system for the ATLAS experiment

The new detector readout system for the ATLAS experiment LInk exange The new detector readout system for the ATLAS experiment Soo Ryu Argonne National Laboratory On behalf of the ATLAS Collaboration ATLAS DAQ for LHC Run2 (2015-2018) 40MHz L1 trigger 100kHz

More information

Introduction to the OpenCAPI Interface

Introduction to the OpenCAPI Interface Introduction to the OpenCAPI Interface Brian Allison, STSM OpenCAPI Technology and Enablement Speaker name, Title Company/Organization Name Join the Conversation #OpenPOWERSummit Industry Collaboration

More information

FPGA Manager. State of the Union. Moritz Fischer, National Instruments

FPGA Manager. State of the Union. Moritz Fischer, National Instruments FPGA Manager State of the Union Moritz Fischer, National Instruments $whoami Embedded Software Engineer at National Instruments Other stuff I do: U-Boot, OE, Linux Kernel Co-Maintainer of FPGA Manager

More information

FELI. : the detector readout upgrade of the ATLAS experiment. Soo Ryu. Argonne National Laboratory, (on behalf of the FELIX group)

FELI. : the detector readout upgrade of the ATLAS experiment. Soo Ryu. Argonne National Laboratory, (on behalf of the FELIX group) LI : the detector readout upgrade of the ATLAS experiment Soo Ryu Argonne National Laboratory, sryu@anl.gov (on behalf of the LIX group) LIX group John Anderson, Soo Ryu, Jinlong Zhang Hucheng Chen, Kai

More information

arxiv: v1 [physics.ins-det] 16 Oct 2017

arxiv: v1 [physics.ins-det] 16 Oct 2017 arxiv:1710.05607v1 [physics.ins-det] 16 Oct 2017 The ALICE O 2 common driver for the C-RORC and CRU read-out cards Boeschoten P and Costa F for the ALICE collaboration E-mail: pascal.boeschoten@cern.ch,

More information

New slow-control FPGA IP for GBT based system and status update of the GBT-FPGA project

New slow-control FPGA IP for GBT based system and status update of the GBT-FPGA project New slow-control FPGA IP for GBT based system and status update of the GBT-FPGA project 1 CERN Geneva CH-1211, Switzerland E-mail: julian.mendez@cern.ch Sophie Baron a, Pedro Vicente Leitao b CERN Geneva

More information

Validation of the front-end electronics and firmware for LHCb vertex locator.

Validation of the front-end electronics and firmware for LHCb vertex locator. Validation of the front-end electronics and firmware for LHCb vertex locator. Antonio Fernández Prieto Universidade de santiago de compostela, Spain E-mail: antonio.fernandez.prieto@cern.ch Pablo Vázquez

More information

Database Acceleration Solution Using FPGAs and Integrated Flash Storage

Database Acceleration Solution Using FPGAs and Integrated Flash Storage Database Acceleration Solution Using FPGAs and Integrated Flash Storage HK Verma, Xilinx Inc. August 2017 1 FPGA Analytics in Flash Storage System In-memory or Flash storage based DB reduce disk access

More information

Update on PRad GEMs, Readout Electronics & DAQ

Update on PRad GEMs, Readout Electronics & DAQ Update on PRad GEMs, Readout Electronics & DAQ Kondo Gnanvo University of Virginia, Charlottesville, VA Outline PRad GEMs update Upgrade of SRS electronics Integration into JLab DAQ system Cosmic tests

More information

Do you need MEV PCIe remote update on Cyclone GX FPGA?

Do you need MEV PCIe remote update on Cyclone GX FPGA? MEV Ltd Building 67 Europa Business Park Bird Hall Lane Stockport Cheshire SK3 0XA United Kingdom 0161 477 1898 www.mev.co.uk info@mev.co.uk Do you need MEV PCIe remote update on Cyclone GX FPGA? Would

More information

TFC update and TFC simulation testbench

TFC update and TFC simulation testbench TFC update and TFC simulation testbench LHCb Electronics Upgrade Meeting 14 February 2012 Federico Alessio Richard Jacobsson Outline Modifications in SOL40-TELL40 protocol First version of S-ODIN firmware

More information

Read-out of High Speed S-LINK Data Via a Buffered PCI Card

Read-out of High Speed S-LINK Data Via a Buffered PCI Card Read-out of High Speed S-LINK Data Via a Buffered PCI Card A. Guirao Talk for the 4 th PCaPAC International Workshop - This is the paper copy version of the presentation- Slide 9th is repeated due to an

More information

Introduction to PCI Express

Introduction to PCI Express Introduction to PCI Express Paolo Durante (CERN EP-LBC) 14/02/2017 ISOTDAQ 2018 - Introduction to PCIe 1 Where will you find? PCI (Peripheral Component Interconnect) Express is a popular standard for high-speed

More information

Demystifying Network Cards

Demystifying Network Cards Demystifying Network Cards Paul Emmerich December 27, 2017 Chair of Network Architectures and Services About me PhD student at Researching performance of software packet processing systems Mostly working

More information

I/O Handling. ECE 650 Systems Programming & Engineering Duke University, Spring Based on Operating Systems Concepts, Silberschatz Chapter 13

I/O Handling. ECE 650 Systems Programming & Engineering Duke University, Spring Based on Operating Systems Concepts, Silberschatz Chapter 13 I/O Handling ECE 650 Systems Programming & Engineering Duke University, Spring 2018 Based on Operating Systems Concepts, Silberschatz Chapter 13 Input/Output (I/O) Typical application flow consists of

More information

Frontend Control Electronics for the LHCb upgrade Hardware realization and test

Frontend Control Electronics for the LHCb upgrade Hardware realization and test First Prototype of the muon Frontend Control Electronics for the LHCb upgrade Hardware realization and test V. Bocci, G. Chiodi, P. Fresch et al. International Conference on Technology and Instrumentation

More information

VGA Assignment Using VFIO. Alex Williamson October 21 st, 2013

VGA Assignment Using VFIO. Alex Williamson October 21 st, 2013 VGA Assignment Using VFIO alex.williamson@redhat.com October 21 st, 2013 Agenda Introduction to PCI & PCIe IOMMUs VFIO VGA VFIO VGA support Quirks, quirks, quirks Status and future Performance 2 A brief

More information

Detector Control LHC

Detector Control LHC Detector Control Systems @ LHC Matthias Richter Department of Physics, University of Oslo IRTG Lecture week Autumn 2012 Oct 18 2012 M. Richter (UiO) DCS @ LHC Oct 09 2012 1 / 39 Detectors in High Energy

More information

FELIX the new detector readout system for the ATLAS experiment

FELIX the new detector readout system for the ATLAS experiment FrontEnd LInk exchange LIX the new detector readout system for the ATLAS experiment Julia Narevicius Weizmann Institute of Science on behalf of the ATLAS Collaboration Introduction to ATLAS readout: today

More information

Configuration via Protocol (CvP) Implementation in V-series FPGA Devices User Guide

Configuration via Protocol (CvP) Implementation in V-series FPGA Devices User Guide Configuration via Protocol (CvP) Implementation in V-series FPGA Devices User Guide Subscribe UG-01101 101 Innovation Drive San Jose, CA 95134 www.altera.com TOC-2 CvP Overview Contents Overview... 1-1

More information

MicroTCA / AMC Solutions for Real-Time Data Acquisition

MicroTCA / AMC Solutions for Real-Time Data Acquisition THE MAGAZINE OF RECORD FOR THE EMBEDDED COMPUTING INDUSTRY May 2013 TECHNOLOGY IN SYSTEMS MicroTCA / AMC Solutions for Real-Time Data Acquisition MicroTCA has evolved out of the world of ATCA to become

More information

ANIC Host CPU Offload Features Overview An Overview of Features and Functions Available with ANIC Adapters

ANIC Host CPU Offload Features Overview An Overview of Features and Functions Available with ANIC Adapters ANIC Host CPU Offload Features Overview An Overview of Features and Functions Available with ANIC Adapters ANIC Adapters Accolade s ANIC line of FPGA-based adapters/nics help accelerate security and networking

More information

BES-III off-detector readout electronics for the GEM detector: an update

BES-III off-detector readout electronics for the GEM detector: an update BES-III off-detector readout electronics for the GEM detector: an update The CGEM off-detector collaboration ( INFN/Univ. FE, INFN LNF, Univ. Uppsala ) 1 Outline Reminder Update on development status Off-detector

More information

Deferred High Level Trigger in LHCb: A Boost to CPU Resource Utilization

Deferred High Level Trigger in LHCb: A Boost to CPU Resource Utilization Deferred High Level Trigger in LHCb: A Boost to Resource Utilization The use of periods without beam for online high level triggers Introduction, problem statement Realization of the chosen solution Conclusions

More information

Stefan Koestner on behalf of the LHCb Online Group ( IEEE - Nuclear Science Symposium San Diego, Oct.

Stefan Koestner on behalf of the LHCb Online Group (  IEEE - Nuclear Science Symposium San Diego, Oct. Stefan Koestner on behalf of the LHCb Online Group (email: Stefan.Koestner@cern.ch) IEEE - Nuclear Science Symposium San Diego, Oct. 31 st 2006 Dedicated to B-physics : single arm forward spectrometer

More information

RT2016 Phase-I Trigger Readout Electronics Upgrade for the ATLAS Liquid-Argon Calorimeters

RT2016 Phase-I Trigger Readout Electronics Upgrade for the ATLAS Liquid-Argon Calorimeters RT2016 Phase-I Trigger Readout Electronics Upgrade for the ATLAS Liquid-Argon Calorimeters Nicolas Chevillot (LAPP/CNRS-IN2P3) on behalf of the ATLAS Liquid Argon Calorimeter Group 1 Plan Context Front-end

More information

Creating PCI Express Links in Intel FPGAs

Creating PCI Express Links in Intel FPGAs Creating PCI Express Links in Intel FPGAs Course Description This course provides all necessary theoretical and practical know how to create PCI Express links in Intel FPGAs. The course goes into great

More information

HKG net_mdev: Fast-path userspace I/O. Ilias Apalodimas Mykyta Iziumtsev François-Frédéric Ozog

HKG net_mdev: Fast-path userspace I/O. Ilias Apalodimas Mykyta Iziumtsev François-Frédéric Ozog HKG18-110 net_mdev: Fast-path userspace I/O Ilias Apalodimas Mykyta Iziumtsev François-Frédéric Ozog Why userland I/O Time sensitive networking Developed mostly for Industrial IOT, automotive and audio/video

More information

Maximizing heterogeneous system performance with ARM interconnect and CCIX

Maximizing heterogeneous system performance with ARM interconnect and CCIX Maximizing heterogeneous system performance with ARM interconnect and CCIX Neil Parris, Director of product marketing Systems and software group, ARM Teratec June 2017 Intelligent flexible cloud to enable

More information

Embedded Tech Trends 2014 Rodger H. Hosking Pentek, Inc. VPX for Rugged, Conduction-Cooled Software Radio Virtex-7 Applications

Embedded Tech Trends 2014 Rodger H. Hosking Pentek, Inc. VPX for Rugged, Conduction-Cooled Software Radio Virtex-7 Applications Embedded Tech Trends 2014 Rodger H. Hosking Pentek, Inc. VPX for Rugged, Conduction-Cooled Software Radio Virtex-7 Applications System Essentials: Rugged Software Radio Industry Standard Open Architectures

More information

Signal Conversion in a Modular Open Standard Form Factor. CASPER Workshop August 2017 Saeed Karamooz, VadaTech

Signal Conversion in a Modular Open Standard Form Factor. CASPER Workshop August 2017 Saeed Karamooz, VadaTech Signal Conversion in a Modular Open Standard Form Factor CASPER Workshop August 2017 Saeed Karamooz, VadaTech At VadaTech we are technology leaders First-to-market silicon Continuous innovation Open systems

More information

Pactron FPGA Accelerated Computing Solutions

Pactron FPGA Accelerated Computing Solutions Pactron FPGA Accelerated Computing Solutions Intel Xeon + Altera FPGA 2015 Pactron HJPC Corporation 1 Motivation for Accelerators Enhanced Performance: Accelerators compliment CPU cores to meet market

More information

NVM Express 1.3 Delivering Continuous Innovation

NVM Express 1.3 Delivering Continuous Innovation Architected for Performance NVM Express 1.3 Delivering Continuous Innovation June 2017 Jonmichael Hands, Product Marketing Manager Intel, NVM Express Marketing Co-Chair View recorded webcast NVMe 1.3 -

More information

ClearSpeed Visual Profiler

ClearSpeed Visual Profiler ClearSpeed Visual Profiler Copyright 2007 ClearSpeed Technology plc. All rights reserved. 12 November 2007 www.clearspeed.com 1 Profiling Application Code Why use a profiler? Program analysis tools are

More information

An Intelligent NIC Design Xin Song

An Intelligent NIC Design Xin Song 2nd International Conference on Advances in Mechanical Engineering and Industrial Informatics (AMEII 2016) An Intelligent NIC Design Xin Song School of Electronic and Information Engineering Tianjin Vocational

More information

Configuration via Protocol (CvP) Implementation in Altera FPGAs User Guide

Configuration via Protocol (CvP) Implementation in Altera FPGAs User Guide Configuration via Protocol (CvP) Implementation in Altera FPGAs User Guide Configuration via Protocol (CvP) Implementation in Altera FPGAs User Guide 101 Innovation Drive San Jose, CA 95134 www.altera.com

More information

CS330: Operating System and Lab. (Spring 2006) I/O Systems

CS330: Operating System and Lab. (Spring 2006) I/O Systems CS330: Operating System and Lab. (Spring 2006) I/O Systems Today s Topics Block device vs. Character device Direct I/O vs. Memory-mapped I/O Polling vs. Interrupts Programmed I/O vs. DMA Blocking vs. Non-blocking

More information

In-Place Associative Computing:

In-Place Associative Computing: In-Place Associative Computing: 1 Page Abstract... 3 Overview... 3 Associative Processing Unit (APU) Card... 3 Host-Device interface... 4 The APU Card Controller... 4 Host to Device Interactions... 5 APU

More information

OS Structure. Kevin Webb Swarthmore College January 25, Relevant xkcd:

OS Structure. Kevin Webb Swarthmore College January 25, Relevant xkcd: OS Structure Kevin Webb Swarthmore College January 25, 2018 Relevant xkcd: One of the survivors, poking around in the ruins with the point of a spear, uncovers a singed photo of Richard Stallman. They

More information

27 March 2018 Mikael Arguedas and Morgan Quigley

27 March 2018 Mikael Arguedas and Morgan Quigley 27 March 2018 Mikael Arguedas and Morgan Quigley Separate devices: (prototypes 0-3) Unified camera: (prototypes 4-5) Unified system: (prototypes 6+) USB3 USB Host USB3 USB2 USB3 USB Host PCIe root

More information

Intel Acceleration Stack for Intel Xeon CPU with FPGAs Version 1.2 Release Notes

Intel Acceleration Stack for Intel Xeon CPU with FPGAs Version 1.2 Release Notes Intel Acceleration Stack for Intel Xeon CPU with FPGAs Version 1.2 Updated for Intel Acceleration Stack for Intel Xeon CPU with FPGAs: 1.2 Subscribe Latest document on the web: PDF HTML Contents Contents

More information

CIS Operating Systems File Systems. Professor Qiang Zeng Fall 2017

CIS Operating Systems File Systems. Professor Qiang Zeng Fall 2017 CIS 5512 - Operating Systems File Systems Professor Qiang Zeng Fall 2017 Previous class I/O subsystem: hardware aspect Terms: controller, bus, port Addressing: port-mapped IO and memory-mapped IO I/O subsystem:

More information

CIS Operating Systems File Systems. Professor Qiang Zeng Spring 2018

CIS Operating Systems File Systems. Professor Qiang Zeng Spring 2018 CIS 3207 - Operating Systems File Systems Professor Qiang Zeng Spring 2018 Previous class I/O subsystem: hardware aspect Terms: controller, bus, port Addressing: port-mapped IO and memory-mapped IO I/O

More information

Methods to protect proprietary components in device drivers

Methods to protect proprietary components in device drivers Methods to protect proprietary components in device drivers Matt Porter Embedded Alley Solutions, Inc. Introduction Why the interest in closed drivers on Linux? Competition Advantage perception Upsell

More information

Nitro240/260 CPU Board Scalable 680x0 VME board for I/O intensive applications

Nitro240/260 CPU Board Scalable 680x0 VME board for I/O intensive applications Nitro240/260 CPU Board Scalable 680x0 VME board for I/O intensive applications Nitro260 features a 50 MHz MC68060 CISC processor with superscalar pipeline architecture for maximum integer and floating

More information

ADQ14 Development Kit

ADQ14 Development Kit ADQ14 Development Kit Documentation : P Devices PD : ecurity Class: : Release : P Devices Page 2(of 21) ecurity class Table of Contents 1 Tools...3 2 Overview...4 2.1 High-level block overview...4 3 How

More information

Spring 2017 :: CSE 506. Device Programming. Nima Honarmand

Spring 2017 :: CSE 506. Device Programming. Nima Honarmand Device Programming Nima Honarmand read/write interrupt read/write Spring 2017 :: CSE 506 Device Interface (Logical View) Device Interface Components: Device registers Device Memory DMA buffers Interrupt

More information

Improving DPDK Performance

Improving DPDK Performance Improving DPDK Performance Data Plane Development Kit (DPDK) was pioneered by Intel as a way to boost the speed of packet API with standard hardware. DPDK-enabled applications typically show four or more

More information

[08] IO SUBSYSTEM 1. 1

[08] IO SUBSYSTEM 1. 1 [08] IO SUBSYSTEM 1. 1 OUTLINE Input/Output (IO) Hardware Device Classes OS Interfaces Performing IO Polled Mode Interrupt Driven Blocking vs Non-blocking Handling IO Buffering & Strategies Other Issues

More information

Altera PCIe reference design testing CRU INDIA TEAM

Altera PCIe reference design testing CRU INDIA TEAM Altera PCIe reference design testing CRU INDIA TEAM We have found four example designs : Example Design :Variation 1. Stratix V Avalon-ST Interface for PCIe Solutions -- for better understanding of PCIe

More information

The Nios II Family of Configurable Soft-core Processors

The Nios II Family of Configurable Soft-core Processors The Nios II Family of Configurable Soft-core Processors James Ball August 16, 2005 2005 Altera Corporation Agenda Nios II Introduction Configuring your CPU FPGA vs. ASIC CPU Design Instruction Set Architecture

More information

DAQ-in-a-box. δράση. [transliterated drasi: 'action'] Håkan Johansson, Chalmers, Göteborg

DAQ-in-a-box. δράση. [transliterated drasi: 'action'] Håkan Johansson, Chalmers, Göteborg DAQ-in-a-box δράση [transliterated drasi: 'action'] Håkan Johansson, Chalmers, Göteborg Lichtenberghaus, Darmstadt, November 2017 NUSTAR experiments Many types of detectors Short production runs (~1 week)

More information

AN 575: PCI Express-to-DDR2 SDRAM Reference Design

AN 575: PCI Express-to-DDR2 SDRAM Reference Design AN 575: PCI Express-to-DDR2 SDRAM Reference Design April 2009 AN-575-1.0 Introduction This application note introduces the dedicated PCI Express logic block implemented in Arria II GX FPGA hardware and

More information

SDA: Software-Defined Accelerator for general-purpose big data analysis system

SDA: Software-Defined Accelerator for general-purpose big data analysis system SDA: Software-Defined Accelerator for general-purpose big data analysis system Jian Ouyang(ouyangjian@baidu.com), Wei Qi, Yong Wang, Yichen Tu, Jing Wang, Bowen Jia Baidu is beyond a search engine Search

More information

VXS-610 Dual FPGA and PowerPC VXS Multiprocessor

VXS-610 Dual FPGA and PowerPC VXS Multiprocessor VXS-610 Dual FPGA and PowerPC VXS Multiprocessor Two Xilinx Virtex -5 FPGAs for high performance processing On-board PowerPC CPU for standalone operation, communications management and user applications

More information

SE300 SWE Practices. Lecture 10 Introduction to Event- Driven Architectures. Tuesday, March 17, Sam Siewert

SE300 SWE Practices. Lecture 10 Introduction to Event- Driven Architectures. Tuesday, March 17, Sam Siewert SE300 SWE Practices Lecture 10 Introduction to Event- Driven Architectures Tuesday, March 17, 2015 Sam Siewert Copyright {c} 2014 by the McGraw-Hill Companies, Inc. All rights Reserved. Four Common Types

More information

40Gbps+ Full Line Rate, Programmable Network Accelerators for Low Latency Applications SAAHPC 19 th July 2011

40Gbps+ Full Line Rate, Programmable Network Accelerators for Low Latency Applications SAAHPC 19 th July 2011 40Gbps+ Full Line Rate, Programmable Network Accelerators for Low Latency Applications SAAHPC 19 th July 2011 Allan Cantle President & Founder www.nallatech.com Company Overview ISI + Nallatech + Innovative

More information

Software Development Using Full System Simulation with Freescale QorIQ Communications Processors

Software Development Using Full System Simulation with Freescale QorIQ Communications Processors Patrick Keliher, Simics Field Application Engineer Software Development Using Full System Simulation with Freescale QorIQ Communications Processors 1 2013 Wind River. All Rights Reserved. Agenda Introduction

More information

Flash Controller Solutions in Programmable Technology

Flash Controller Solutions in Programmable Technology Flash Controller Solutions in Programmable Technology David McIntyre Senior Business Unit Manager Computer and Storage Business Unit Altera Corp. dmcintyr@altera.com Flash Memory Summit 2012 Santa Clara,

More information

PCIe interface firmware and software

PCIe interface firmware and software PCIe interface firmware and software Filippo Costa Sanjoy Mukherjee Tuan Mate Nguyen ALICE -O 2 CERN OUTLINE PCIe interface : ) data flow 2) control interface 3) DCS interface (SC and monitoring) 2 PCIe

More information

The S6000 Family of Processors

The S6000 Family of Processors The S6000 Family of Processors Today s Design Challenges The advent of software configurable processors In recent years, the widespread adoption of digital technologies has revolutionized the way in which

More information

Data Acquisition in Particle Physics Experiments. Ing. Giuseppe De Robertis INFN Sez. Di Bari

Data Acquisition in Particle Physics Experiments. Ing. Giuseppe De Robertis INFN Sez. Di Bari Data Acquisition in Particle Physics Experiments Ing. Giuseppe De Robertis INFN Sez. Di Bari Outline DAQ systems Theory of operation Case of a large experiment (CMS) Example of readout GEM detectors for

More information

Streaming Readout, the JLab perspective. Graham Heyes Data Acquisition Support Group Jefferson Lab

Streaming Readout, the JLab perspective. Graham Heyes Data Acquisition Support Group Jefferson Lab Streaming Readout, the JLab perspective Graham Heyes Data Acquisition Support Group Jefferson Lab Introduction After the 12 GeV accelerator upgrade all four halls took production data in Spring of this

More information

IBM Network Processor, Development Environment and LHCb Software

IBM Network Processor, Development Environment and LHCb Software IBM Network Processor, Development Environment and LHCb Software LHCb Readout Unit Internal Review July 24 th 2001 Niko Neufeld, CERN 1 Outline IBM NP4GS3 Architecture A Readout Unit based on the NP4GS3

More information

Intel Accelerator Functional Unit (AFU) Simulation Environment (ASE) Quick Start User Guide

Intel Accelerator Functional Unit (AFU) Simulation Environment (ASE) Quick Start User Guide Intel Accelerator Functional Unit (AFU) Simulation Environment (ASE) Quick Start User Guide Updated for Intel Acceleration Stack: 1.0 Production Subscribe Send Feedback Latest document on the web: PDF

More information

KeyStone C665x Multicore SoC

KeyStone C665x Multicore SoC KeyStone Multicore SoC Architecture KeyStone C6655/57: Device Features C66x C6655: One C66x DSP Core at 1.0 or 1.25 GHz C6657: Two C66x DSP Cores at 0.85, 1.0, or 1.25 GHz Fixed and Floating Point Operations

More information

Input / Output. Kevin Webb Swarthmore College April 12, 2018

Input / Output. Kevin Webb Swarthmore College April 12, 2018 Input / Output Kevin Webb Swarthmore College April 12, 2018 xkcd #927 Fortunately, the charging one has been solved now that we've all standardized on mini-usb. Or is it micro-usb? Today s Goals Characterize

More information

Construction of the Phase I upgrade of the CMS pixel detector

Construction of the Phase I upgrade of the CMS pixel detector Forward Pixel Barrel Pixel TECHNOLOGY AND INSTRUMENTATION IN PARTICLE PHYSICS 2017, May 22-26, 2017 Construction of the Phase I upgrade of the CMS pixel detector Satoshi Hasegawa Fermi National Accelerator

More information

Michael Adler 2017/09

Michael Adler 2017/09 Michael Adler 2017/09 Outline System overview Core Cache Interface (CCI-P) abstraction Application-specific memory hierarchies (Memory Properties Factory MPF) Clocking Simulation (ASE) GitHub open source

More information

Velo readout board RB3. Common L1 board (ROB)

Velo readout board RB3. Common L1 board (ROB) Velo readout board RB3 Testing... Common L1 board (ROB) Specifying Federica Legger 10 February 2003 1 Summary LHCb Detectors Online (Trigger, DAQ) VELO (detector and Readout chain) L1 electronics for VELO

More information

A software platform to support dynamically reconfigurable Systems-on-Chip under the GNU/Linux operating system

A software platform to support dynamically reconfigurable Systems-on-Chip under the GNU/Linux operating system A software platform to support dynamically reconfigurable Systems-on-Chip under the GNU/Linux operating system 26th July 2005 Alberto Donato donato@elet.polimi.it Relatore: Prof. Fabrizio Ferrandi Correlatore:

More information

Network Interface Architecture and Prototyping for Chip and Cluster Multiprocessors

Network Interface Architecture and Prototyping for Chip and Cluster Multiprocessors University of Crete School of Sciences & Engineering Computer Science Department Master Thesis by Michael Papamichael Network Interface Architecture and Prototyping for Chip and Cluster Multiprocessors

More information

ProtoDUNE SP DAQ Overview

ProtoDUNE SP DAQ Overview ProtoDUNE SP DAQ Overview K. Hennessy, G. Lehmann Miotto Description Introduction The design of the ProtoDUNE SP DAQ relies on the main principles of the DUNE DAQ design and on experience gained while

More information

The Price of Safety: Evaluating IOMMU Performance

The Price of Safety: Evaluating IOMMU Performance The Price of Safety: Evaluating IOMMU Performance Muli Ben-Yehuda 1 Jimi Xenidis 2 Michal Ostrowski 2 Karl Rister 3 Alexis Bruemmer 3 Leendert Van Doorn 4 1 muli@il.ibm.com 2 {jimix,mostrows}@watson.ibm.com

More information

GFS: The Google File System

GFS: The Google File System GFS: The Google File System Brad Karp UCL Computer Science CS GZ03 / M030 24 th October 2014 Motivating Application: Google Crawl the whole web Store it all on one big disk Process users searches on one

More information

9. PIO Core. Core Overview. Functional Description

9. PIO Core. Core Overview. Functional Description 9. PIO Core NII51007-9.0.0 Core Overview The parallel input/output (PIO) core with Avalon interface provides a memory-mapped interface between an Avalon Memory-Mapped (Avalon-MM) slave port and general-purpose

More information

SPECS : A SERIAL PROTOCOL FOR EXPERIMENT CONTROL SYSTEM IN LHCB.

SPECS : A SERIAL PROTOCOL FOR EXPERIMENT CONTROL SYSTEM IN LHCB. 10th ICALEPCS Int. Conf. on Accelerator & Large Expt. Physics Control Systems. Geneva, 10-14 Oct 2005, WE1.5-4O (2005) : A SERIAL PROTOCOL FOR EXPERIMENT CONTROL SYSTEM IN LHCB. D.Breton, 1 D.Charlet,

More information

Implementation and verification of PCI express interface in a SoC

Implementation and verification of PCI express interface in a SoC Implementation and verification of PCI express interface in a SoC Vinay Kumar Pamula a) and Sai Raghavendra Mantripragada b) Department of ECE, University College of Engineering Kakinada, JNTUK, Kakinada

More information

Matrox Imaging White Paper

Matrox Imaging White Paper Reliable high bandwidth video capture with Matrox Radient Abstract The constant drive for greater analysis resolution and higher system throughput results in the design of vision systems with multiple

More information

S32 SDK for Power Architecture Release Notes Version EAR

S32 SDK for Power Architecture Release Notes Version EAR S32 SDK for Power Architecture Release Notes Version 0.8.0 EAR 2017 NXP Contents 1. DESCRIPTION...3 2. SOFTWARE CONTENTS...4 3. DOCUMENTATION...4 4. EXAMPLES...5 5. SUPPORTED HARDWARE AND COMPATIBLE SOFTWARE...6

More information

3-D Accelerator on Chip

3-D Accelerator on Chip 3-D Accelerator on Chip Third Prize 3-D Accelerator on Chip Institution: Participants: Instructor: Donga & Pusan University Young-Hee Won, Jin-Sung Park, Woo-Sung Moon Sam-Hak Jin Design Introduction Recently,

More information

Vertex Detector Electronics: ODE to ECS Interface

Vertex Detector Electronics: ODE to ECS Interface Vertex Detector Electronics: ODE to ECS Interface LHCb Technical Note Issue: 1 Revision: 0 Reference: LHCb 2000-012 VELO Created: 1 February 2000 Last modified: 20 March 2000 Prepared By: Yuri Ermoline

More information

FINALTERM EXAMINATION Fall 2008 CS501- Advance Computer Architecture (Session - 1) Marks: 75

FINALTERM EXAMINATION Fall 2008 CS501- Advance Computer Architecture (Session - 1) Marks: 75 FINALTERM EXAMINATION Fall 2008 CS501- Advance Computer Architecture (Session - 1) Marks: 75 Question No: 1 ( Marks: 1 ) - Please choose one Which one of the following is the memory organization of SRC

More information

Introduction to High Performance Parallel I/O

Introduction to High Performance Parallel I/O Introduction to High Performance Parallel I/O Richard Gerber Deputy Group Lead NERSC User Services August 30, 2013-1- Some slides from Katie Antypas I/O Needs Getting Bigger All the Time I/O needs growing

More information

VXS-621 FPGA & PowerPC VXS Multiprocessor

VXS-621 FPGA & PowerPC VXS Multiprocessor VXS-621 FPGA & PowerPC VXS Multiprocessor Xilinx Virtex -5 FPGA for high performance processing On-board PowerPC CPU for standalone operation, communications management and user applications Two PMC/XMC

More information

INT G bit TCP Offload Engine SOC

INT G bit TCP Offload Engine SOC INT 10011 10 G bit TCP Offload Engine SOC Product brief, features and benefits summary: Highly customizable hardware IP block. Easily portable to ASIC flow, Xilinx/Altera FPGAs or Structured ASIC flow.

More information

AN 690: PCI Express DMA Reference Design for Stratix V Devices

AN 690: PCI Express DMA Reference Design for Stratix V Devices AN 690: PCI Express DMA Reference Design for Stratix V Devices an690-1.0 Subscribe The PCI Express Avalon Memory-Mapped (Avalon-MM) DMA Reference Design highlights the performance of the Avalon-MM 256-Bit

More information

A Prototype Storage Subsystem based on PCM

A Prototype Storage Subsystem based on PCM PSS A Prototype Storage Subsystem based on IBM Research Zurich Ioannis Koltsidas, Roman Pletka, Peter Mueller, Thomas Weigold, Evangelos Eleftheriou University of Patras Maria Varsamou, Athina Ntalla,

More information

ROM Status Update. U. Marconi, INFN Bologna

ROM Status Update. U. Marconi, INFN Bologna ROM Status Update U. Marconi, INFN Bologna Drift Chamber ~ 35 L1 processor EMC ~ 80 L1 processor? SVT L1 processor L3 to L5 ~15 Radiation wall Clk, L1, Sync Cmds Global Level1 Trigger (GLT) Raw L1 FCTS

More information

APV-25 based readout electronics for the SBS front GEM Tracker

APV-25 based readout electronics for the SBS front GEM Tracker APV-25 based readout electronics for the SBS front GEM Tracker Authors: Evaristo Cisbani, Paolo Musico Date: 26/June/2014 Version: 1.0 APV-25 based readout electronics for the SBS front GEM Tracker...

More information

Simplify System Complexity

Simplify System Complexity 1 2 Simplify System Complexity With the new high-performance CompactRIO controller Arun Veeramani Senior Program Manager National Instruments NI CompactRIO The Worlds Only Software Designed Controller

More information

10Gbps TCP/IP streams from the FPGA

10Gbps TCP/IP streams from the FPGA TWEPP 2013 10Gbps TCP/IP streams from the FPGA for the CMS DAQ Eventbuilder Network Petr Žejdl, Dominique Gigi on behalf of the CMS DAQ Group 26 September 2013 Outline CMS DAQ Readout System TCP/IP Introduction,

More information