A FPGA-based architecture for real-time cluster finding in the LHCb
silicon pixel detector
- URL: http://arxiv.org/abs/2302.03972v3
- Date: Mon, 19 Jun 2023 12:05:09 GMT
- Title: A FPGA-based architecture for real-time cluster finding in the LHCb
silicon pixel detector
- Authors: G. Bassi, L. Giambastiani, K. Hennessy, F. Lazzari, M. J. Morello, T.
Pajero, A. Fernandez Prieto, G. Punzi
- Abstract summary: This article describes a custom VHDL firmware implementation of a two-dimensional cluster-finder architecture for reconstructing hit positions in the new VELO detector.
The pre-processing allows the first level of the software trigger to accept a 11% higher rate of events.
It additionally allows the raw pixel data to be dropped at the readout level, thus saving approximately 14% of the DAQ bandwidth.
- Score: 0.8431877864777444
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This article describes a custom VHDL firmware implementation of a
two-dimensional cluster-finder architecture for reconstructing hit positions in
the new vertex pixel detector (VELO) that is part of the LHCb Upgrade. This
firmware has been deployed to the existing FPGA cards that perform the readout
of the VELO, as a further enhancement of the DAQ system, and will run in real
time during physics data taking, reconstructing VELO hits coordinates
on-the-fly at the LHC collision rate. This pre-processing allows the first
level of the software trigger to accept a 11% higher rate of events, as the
ready-made hits coordinates accelerate the track reconstruction and consumes
significantly less electrical power. It additionally allows the raw pixel data
to be dropped at the readout level, thus saving approximately 14% of the DAQ
bandwidth. Detailed simulation studies have shown that the use of this
real-time cluster finding does not introduce any appreciable degradation in the
tracking performance in comparison to a full-fledged software implementation.
This work is part of a wider effort aimed at boosting the real-time processing
capability of HEP experiments by delegating intensive tasks to dedicated
computing accelerators deployed at the earliest stages of the data acquisition
chain.
Related papers
- Embedded Graph Convolutional Networks for Real-Time Event Data Processing on SoC FPGAs [0.815557531820863]
Event cameras find significant relevance for their integration into embedded real-time systems.
One effective approach to ensure the necessary throughput and latency for event processing systems is through the utilisation of graph convolutional networks (GCNs)
We introduce a series of hardware-aware optimisations tailored for PointNet++, a GCN architecture designed for point cloud processing.
arXiv Detail & Related papers (2024-06-11T14:47:36Z) - FPGA-QHAR: Throughput-Optimized for Quantized Human Action Recognition
on The Edge [0.6254873489691849]
This paper proposed an integrated end-to-end HAR scalable HW/SW accelerator co-design based on an enhanced 8-bit quantized Two-Stream SimpleNet-PyTorch CNN architecture.
Our development uses partially streaming dataflow architecture to achieve higher throughput versus network design and resource utilization trade-off.
Our proposed methodology achieved nearly 81% prediction accuracy with an approximately 24 FPS real-time inference throughput at 187MHz on ZCU104.
arXiv Detail & Related papers (2023-11-04T10:38:21Z) - SATAY: A Streaming Architecture Toolflow for Accelerating YOLO Models on
FPGA Devices [48.47320494918925]
This work tackles the challenges of deploying stateof-the-art object detection models onto FPGA devices for ultralow latency applications.
We employ a streaming architecture design for our YOLO accelerators, implementing the complete model on-chip in a deeply pipelined fashion.
We introduce novel hardware components to support the operations of YOLO models in a dataflow manner, and off-chip memory buffering to address the limited on-chip memory resources.
arXiv Detail & Related papers (2023-09-04T13:15:01Z) - Implementation of a framework for deploying AI inference engines in
FPGAs [0.0]
The goal is to ensure the highest possible framerate while keeping the maximum latency constrained to the needs of the experiment.
The ability to reduce the precision of the implemented networks through quantization is necessary to optimize the use of both DSP and memory resources in the FPGA.
arXiv Detail & Related papers (2023-05-30T23:37:51Z) - Spatiotemporal Attention-based Semantic Compression for Real-time Video
Recognition [117.98023585449808]
We propose a temporal attention-based autoencoder (STAE) architecture to evaluate the importance of frames and pixels in each frame.
We develop a lightweight decoder that leverages a 3D-2D CNN combined to reconstruct missing information.
Experimental results show that ViT_STAE can compress the video dataset H51 by 104x with only 5% accuracy loss.
arXiv Detail & Related papers (2023-05-22T07:47:27Z) - Communication-Efficient Graph Neural Networks with Probabilistic
Neighborhood Expansion Analysis and Caching [59.8522166385372]
Training and inference with graph neural networks (GNNs) on massive graphs has been actively studied since the inception of GNNs.
This paper is concerned with minibatch training and inference with GNNs that employ node-wise sampling in distributed settings.
We present SALIENT++, which extends the prior state-of-the-art SALIENT system to work with partitioned feature data.
arXiv Detail & Related papers (2023-05-04T21:04:01Z) - Degradation-Aware Unfolding Half-Shuffle Transformer for Spectral
Compressive Imaging [142.11622043078867]
We propose a principled Degradation-Aware Unfolding Framework (DAUF) that estimates parameters from the compressed image and physical mask, and then uses these parameters to control each iteration.
By plugging HST into DAUF, we establish the first Transformer-based deep unfolding method, Degradation-Aware Unfolding Half-Shuffle Transformer (DAUHST) for HSI reconstruction.
arXiv Detail & Related papers (2022-05-20T11:37:44Z) - Eventor: An Efficient Event-Based Monocular Multi-View Stereo
Accelerator on FPGA Platform [11.962626341154609]
Event cameras are bio-inspired vision sensors that asynchronously represent pixel-level brightness changes as event streams.
EMVS is a technique that exploits the event streams to estimate semi-dense 3D structure with known trajectory.
In this paper, Eventor is proposed as a fast and efficient EMVS accelerator by realizing the most critical and time-consuming stages.
arXiv Detail & Related papers (2022-03-29T11:13:36Z) - Reinforcement Learning with Latent Flow [78.74671595139613]
Flow of Latents for Reinforcement Learning (Flare) is a network architecture for RL that explicitly encodes temporal information through latent vector differences.
We show that Flare recovers optimal performance in state-based RL without explicit access to the state velocity.
We also show that Flare achieves state-of-the-art performance on pixel-based challenging continuous control tasks within the DeepMind control benchmark suite.
arXiv Detail & Related papers (2021-01-06T03:50:50Z) - Accelerated Charged Particle Tracking with Graph Neural Networks on
FPGAs [0.0]
We develop and study FPGA implementations of algorithms for charged particle tracking based on graph neural networks.
We find a considerable speedup over CPU-based execution is possible, potentially enabling such algorithms to be used effectively in future computing.
arXiv Detail & Related papers (2020-11-30T18:17:43Z) - Transfer Learning for Motor Imagery Based Brain-Computer Interfaces: A
Complete Pipeline [54.73337667795997]
Transfer learning (TL) has been widely used in motor imagery (MI) based brain-computer interfaces (BCIs) to reduce the calibration effort for a new subject.
This paper proposes that TL could be considered in all three components (spatial filtering, feature engineering, and classification) of MI-based BCIs.
arXiv Detail & Related papers (2020-07-03T23:44:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.