A Hybrid Approach To Real-Time Multi-Object Tracking
- URL: http://arxiv.org/abs/2308.01248v1
- Date: Wed, 2 Aug 2023 16:02:42 GMT
- Title: A Hybrid Approach To Real-Time Multi-Object Tracking
- Authors: Vincenzo Mariano Scarrica, Ciro Panariello, Alessio Ferone, Antonino
Staiano
- Abstract summary: Multi-Object Tracking, also known as Multi-Target Tracking, is a significant area of computer vision.
This paper proposes a hybrid strategy for real-time multi-target tracking that combines effectively a classical optical flow algorithm with a deep learning architecture.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Multi-Object Tracking, also known as Multi-Target Tracking, is a significant
area of computer vision that has many uses in a variety of settings. The
development of deep learning, which has encouraged researchers to propose more
and more work in this direction, has significantly impacted the scientific
advancement around the study of tracking as well as many other domains related
to computer vision. In fact, all of the solutions that are currently
state-of-the-art in the literature and in the tracking industry, are built on
top of deep learning methodologies that produce exceptionally good results.
Deep learning is enabled thanks to the ever more powerful technology
researchers can use to handle the significant computational resources demanded
by these models. However, when real-time is a main requirement, developing a
tracking system without being constrained by expensive hardware support with
enormous computational resources is necessary to widen tracking applications in
real-world contexts. To this end, a compromise is to combine powerful deep
strategies with more traditional approaches to favor considerably lower
processing solutions at the cost of less accurate tracking results even though
suitable for real-time domains. Indeed, the present work goes in that
direction, proposing a hybrid strategy for real-time multi-target tracking that
combines effectively a classical optical flow algorithm with a deep learning
architecture, targeted to a human-crowd tracking system exhibiting a desirable
trade-off between performance in tracking precision and computational costs.
The developed architecture was experimented with different settings, and
yielded a MOTA of 0.608 out of the compared state-of-the-art 0.549 results, and
about half the running time when introducing the optical flow phase, achieving
almost the same performance in terms of accuracy.
Related papers
- Real-time Monitoring and Analysis of Track and Field Athletes Based on Edge Computing and Deep Reinforcement Learning Algorithm [3.4836781982613534]
This research focuses on real-time monitoring and analysis of track and field athletes.
We propose an IoT-optimized system that integrates edge computing and deep learning algorithms.
arXiv Detail & Related papers (2024-11-11T05:12:15Z) - Efficient Human Pose Estimation: Leveraging Advanced Techniques with MediaPipe [5.439359582541082]
This study presents significant enhancements in human pose estimation using the MediaPipe framework.
The research focuses on improving accuracy, computational efficiency, and real-time processing capabilities.
The advancements have wide-ranging applications in augmented reality, sports analytics, and healthcare.
arXiv Detail & Related papers (2024-06-21T21:00:45Z) - PNAS-MOT: Multi-Modal Object Tracking with Pareto Neural Architecture Search [64.28335667655129]
Multiple object tracking is a critical task in autonomous driving.
As tracking accuracy improves, neural networks become increasingly complex, posing challenges for their practical application in real driving scenarios due to the high level of latency.
In this paper, we explore the use of the neural architecture search (NAS) methods to search for efficient architectures for tracking, aiming for low real-time latency while maintaining relatively high accuracy.
arXiv Detail & Related papers (2024-03-23T04:18:49Z) - Computation-efficient Deep Learning for Computer Vision: A Survey [121.84121397440337]
Deep learning models have reached or even exceeded human-level performance in a range of visual perception tasks.
Deep learning models usually demand significant computational resources, leading to impractical power consumption, latency, or carbon emissions in real-world scenarios.
New research focus is computationally efficient deep learning, which strives to achieve satisfactory performance while minimizing the computational cost during inference.
arXiv Detail & Related papers (2023-08-27T03:55:28Z) - Multi-Fidelity Active Learning with GFlowNets [65.91555804996203]
We propose a multi-fidelity active learning algorithm with GFlowNets as a sampler, to efficiently discover diverse, high-scoring candidates.
Our evaluation on molecular discovery tasks shows that multi-fidelity active learning with GFlowNets can discover high-scoring candidates at a fraction of the budget of its single-fidelity counterpart.
arXiv Detail & Related papers (2023-06-20T17:43:42Z) - NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision
Research [96.53307645791179]
We introduce the Never-Ending VIsual-classification Stream (NEVIS'22), a benchmark consisting of a stream of over 100 visual classification tasks.
Despite being limited to classification, the resulting stream has a rich diversity of tasks from OCR, to texture analysis, scene recognition, and so forth.
Overall, NEVIS'22 poses an unprecedented challenge for current sequential learning approaches due to the scale and diversity of tasks.
arXiv Detail & Related papers (2022-11-15T18:57:46Z) - Scalable and Real-time Multi-Camera Vehicle Detection,
Re-Identification, and Tracking [58.95210121654722]
We propose a real-time city-scale multi-camera vehicle tracking system that handles real-world, low-resolution CCTV instead of idealized and curated video streams.
Our method is ranked among the top five performers on the public leaderboard.
arXiv Detail & Related papers (2022-04-15T12:47:01Z) - Improving the Accuracy of Early Exits in Multi-Exit Architectures via
Curriculum Learning [88.17413955380262]
Multi-exit architectures allow deep neural networks to terminate their execution early in order to adhere to tight deadlines at the cost of accuracy.
We introduce a novel method called Multi-Exit Curriculum Learning that utilizes curriculum learning.
Our method consistently improves the accuracy of early exits compared to the standard training approach.
arXiv Detail & Related papers (2021-04-21T11:12:35Z) - Weakly-Supervised Domain Adaptation of Deep Regression Trackers via
Reinforced Knowledge Distillation [27.00282405409842]
We present the first methodology for domain adaption of such a class of trackers.
We propose a weakly-supervised adaptation strategy, in which reinforcement learning is used to express weak supervision.
Experiments on five different robotic vision domains demonstrate the relevance of our methodology.
arXiv Detail & Related papers (2021-03-26T14:37:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.