Lightweight Jet Reconstruction and Identification as an Object Detection
Task
- URL: http://arxiv.org/abs/2202.04499v1
- Date: Wed, 9 Feb 2022 15:01:53 GMT
- Title: Lightweight Jet Reconstruction and Identification as an Object Detection
Task
- Authors: Adrian Alan Pol, Thea Aarrestad, Ekaterina Govorkova, Roi Halily, Anat
Klempner, Tal Kopetz, Vladimir Loncar, Jennifer Ngadiuba, Maurizio Pierini,
Olya Sirkin, Sioni Summers
- Abstract summary: We apply convolutional techniques to end-to-end jet identification and reconstruction tasks encountered at the CERN Large Hadron Collider.
PFJet-SSD performs simultaneous localization, classification and regression tasks to cluster jets and reconstruct their features.
We show that the ternary network closely matches the performance of its full-precision equivalent and outperforms the state-of-the-art rule-based algorithm.
- Score: 5.071565475111431
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We apply object detection techniques based on deep convolutional blocks to
end-to-end jet identification and reconstruction tasks encountered at the CERN
Large Hadron Collider (LHC). Collision events produced at the LHC and
represented as an image composed of calorimeter and tracker cells are given as
an input to a Single Shot Detection network. The algorithm, named PFJet-SSD
performs simultaneous localization, classification and regression tasks to
cluster jets and reconstruct their features. This all-in-one single
feed-forward pass gives advantages in terms of execution time and an improved
accuracy w.r.t. traditional rule-based methods. A further gain is obtained from
network slimming, homogeneous quantization, and optimized runtime for meeting
memory and latency constraints of a typical real-time processing environment.
We experiment with 8-bit and ternary quantization, benchmarking their accuracy
and inference latency against a single-precision floating-point. We show that
the ternary network closely matches the performance of its full-precision
equivalent and outperforms the state-of-the-art rule-based algorithm. Finally,
we report the inference latency on different hardware platforms and discuss
future applications.
Related papers
- Collaborative Edge AI Inference over Cloud-RAN [37.3710464868215]
A cloud radio access network (Cloud-RAN) based collaborative edge AI inference architecture is proposed.
Specifically, geographically distributed devices capture real-time noise-corrupted sensory data samples and extract the noisy local feature vectors.
We allow each RRH receives local feature vectors from all devices over the same resource blocks simultaneously by leveraging an over-the-air computation (AirComp) technique.
These aggregated feature vectors are quantized and transmitted to a central processor for further aggregation and downstream inference tasks.
arXiv Detail & Related papers (2024-04-09T04:26:16Z) - Edge-Enabled Real-time Railway Track Segmentation [0.0]
We propose an edge-enabled real-time railway track segmentation algorithm.
It is optimized to be suitable for edge applications by optimizing the network structure and quantizing the model after training.
Experimental results demonstrate that our enhanced algorithm achieves an accuracy level of 83.3%.
arXiv Detail & Related papers (2024-01-21T13:45:52Z) - Task-Oriented Over-the-Air Computation for Multi-Device Edge AI [57.50247872182593]
6G networks for supporting edge AI features task-oriented techniques that focus on effective and efficient execution of AI task.
Task-oriented over-the-air computation (AirComp) scheme is proposed in this paper for multi-device split-inference system.
arXiv Detail & Related papers (2022-11-02T16:35:14Z) - MAPLE-Edge: A Runtime Latency Predictor for Edge Devices [80.01591186546793]
We propose MAPLE-Edge, an edge device-oriented extension of MAPLE, the state-of-the-art latency predictor for general purpose hardware.
Compared to MAPLE, MAPLE-Edge can describe the runtime and target device platform using a much smaller set of CPU performance counters.
We also demonstrate that unlike MAPLE which performs best when trained on a pool of devices sharing a common runtime, MAPLE-Edge can effectively generalize across runtimes.
arXiv Detail & Related papers (2022-04-27T14:00:48Z) - AEGNN: Asynchronous Event-based Graph Neural Networks [54.528926463775946]
Event-based Graph Neural Networks generalize standard GNNs to process events as "evolving"-temporal graphs.
AEGNNs are easily trained on synchronous inputs and can be converted to efficient, "asynchronous" networks at test time.
arXiv Detail & Related papers (2022-03-31T16:21:12Z) - ZippyPoint: Fast Interest Point Detection, Description, and Matching
through Mixed Precision Discretization [71.91942002659795]
We investigate and adapt network quantization techniques to accelerate inference and enable its use on compute limited platforms.
ZippyPoint, our efficient quantized network with binary descriptors, improves the network runtime speed, the descriptor matching speed, and the 3D model size.
These improvements come at a minor performance degradation as evaluated on the tasks of homography estimation, visual localization, and map-free visual relocalization.
arXiv Detail & Related papers (2022-03-07T18:59:03Z) - Faster object tracking pipeline for real time tracking [0.0]
Multi-object tracking (MOT) is a challenging practical problem for vision based applications.
This paper showcases a generic pipeline which can be used to speed up detection based object tracking methods.
arXiv Detail & Related papers (2020-11-08T06:33:48Z) - Distance-Weighted Graph Neural Networks on FPGAs for Real-Time Particle
Reconstruction in High Energy Physics [11.125632758828266]
We discuss how to design distance-weighted graph networks that can be executed with a latency of less than 1$mumathrms$ on an FPGA.
We consider a representative task associated to particle reconstruction and identification in a next-generation calorimeter operating at a particle collider.
We convert the compressed models into firmware to be implemented on an FPGA.
arXiv Detail & Related papers (2020-08-08T21:26:31Z) - AQD: Towards Accurate Fully-Quantized Object Detection [94.06347866374927]
We propose an Accurate Quantized object Detection solution, termed AQD, to get rid of floating-point computation.
Our AQD achieves comparable or even better performance compared with the full-precision counterpart under extremely low-bit schemes.
arXiv Detail & Related papers (2020-07-14T09:07:29Z) - Depthwise Non-local Module for Fast Salient Object Detection Using a
Single Thread [136.2224792151324]
We propose a new deep learning algorithm for fast salient object detection.
The proposed algorithm achieves competitive accuracy and high inference efficiency simultaneously with a single CPU thread.
arXiv Detail & Related papers (2020-01-22T15:23:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.