Real-time Multi-Object Tracking Based on Bi-directional Matching
- URL: http://arxiv.org/abs/2303.08444v1
- Date: Wed, 15 Mar 2023 08:38:08 GMT
- Title: Real-time Multi-Object Tracking Based on Bi-directional Matching
- Authors: Huilan Luo, Zehua Zeng
- Abstract summary: This study offers a bi-directional matching algorithm for multi-object tracking.
A stranded area is used in the matching algorithm to temporarily store the objects that fail to be tracked.
In the MOT17 challenge, the proposed algorithm achieves 63.4% MOTA, 55.3% IDF1, and 20.1 FPS tracking speed.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In recent years, anchor-free object detection models combined with matching
algorithms are used to achieve real-time muti-object tracking and also ensure
high tracking accuracy. However, there are still great challenges in
multi-object tracking. For example, when most part of a target is occluded or
the target just disappears from images temporarily, it often leads to tracking
interruptions for most of the existing tracking algorithms. Therefore, this
study offers a bi-directional matching algorithm for multi-object tracking that
makes advantage of bi-directional motion prediction information to improve
occlusion handling. A stranded area is used in the matching algorithm to
temporarily store the objects that fail to be tracked. When objects recover
from occlusions, our method will first try to match them with objects in the
stranded area to avoid erroneously generating new identities, thus forming a
more continuous trajectory. Experiments show that our approach can improve the
multi-object tracking performance in the presence of occlusions. In addition,
this study provides an attentional up-sampling module that not only assures
tracking accuracy but also accelerates training speed. In the MOT17 challenge,
the proposed algorithm achieves 63.4% MOTA, 55.3% IDF1, and 20.1 FPS tracking
speed.
Related papers
- CORT: Class-Oriented Real-time Tracking for Embedded Systems [46.3107850275261]
This work proposes a new approach to multi-class object tracking.
It allows achieving smaller and more predictable execution times, without penalizing the tracking performance.
The proposed solution was evaluated in complex urban scenarios with several objects of different types.
arXiv Detail & Related papers (2024-07-20T09:12:17Z) - SparseTrack: Multi-Object Tracking by Performing Scene Decomposition
based on Pseudo-Depth [84.64121608109087]
We propose a pseudo-depth estimation method for obtaining the relative depth of targets from 2D images.
Secondly, we design a depth cascading matching (DCM) algorithm, which can use the obtained depth information to convert a dense target set into multiple sparse target subsets.
By integrating the pseudo-depth method and the DCM strategy into the data association process, we propose a new tracker, called SparseTrack.
arXiv Detail & Related papers (2023-06-08T14:36:10Z) - SCTracker: Multi-object tracking with shape and confidence constraints [11.210661553388615]
This paper proposes a multi-object tracker based on shape constraint and confidence named SCTracker.
Intersection of Union distance with shape constraints is applied to calculate the cost matrix between tracks and detections.
The Kalman Filter based on the detection confidence is used to update the motion state to improve the tracking performance when the detection has low confidence.
arXiv Detail & Related papers (2023-05-16T15:18:42Z) - ByteTrackV2: 2D and 3D Multi-Object Tracking by Associating Every
Detection Box [81.45219802386444]
Multi-object tracking (MOT) aims at estimating bounding boxes and identities of objects across video frames.
We propose a hierarchical data association strategy to mine the true objects in low-score detection boxes.
In 3D scenarios, it is much easier for the tracker to predict object velocities in the world coordinate.
arXiv Detail & Related papers (2023-03-27T15:35:21Z) - Tracking by Associating Clips [110.08925274049409]
In this paper, we investigate an alternative by treating object association as clip-wise matching.
Our new perspective views a single long video sequence as multiple short clips, and then the tracking is performed both within and between the clips.
The benefits of this new approach are two folds. First, our method is robust to tracking error accumulation or propagation, as the video chunking allows bypassing the interrupted frames.
Second, the multiple frame information is aggregated during the clip-wise matching, resulting in a more accurate long-range track association than the current frame-wise matching.
arXiv Detail & Related papers (2022-12-20T10:33:17Z) - QDTrack: Quasi-Dense Similarity Learning for Appearance-Only Multiple
Object Tracking [73.52284039530261]
We present Quasi-Dense Similarity Learning, which densely samples hundreds of object regions on a pair of images for contrastive learning.
We find that the resulting distinctive feature space admits a simple nearest neighbor search at inference time for object association.
We show that our similarity learning scheme is not limited to video data, but can learn effective instance similarity even from static input.
arXiv Detail & Related papers (2022-10-12T15:47:36Z) - CAMO-MOT: Combined Appearance-Motion Optimization for 3D Multi-Object
Tracking with Camera-LiDAR Fusion [34.42289908350286]
3D Multi-object tracking (MOT) ensures consistency during continuous dynamic detection.
It can be challenging to accurately track the irregular motion of objects for LiDAR-based methods.
We propose a novel camera-LiDAR fusion 3D MOT framework based on the Combined Appearance-Motion Optimization (CAMO-MOT)
arXiv Detail & Related papers (2022-09-06T14:41:38Z) - Learning to Track with Object Permanence [61.36492084090744]
We introduce an end-to-end trainable approach for joint object detection and tracking.
Our model, trained jointly on synthetic and real data, outperforms the state of the art on KITTI, and MOT17 datasets.
arXiv Detail & Related papers (2021-03-26T04:43:04Z) - DEFT: Detection Embeddings for Tracking [3.326320568999945]
We propose an efficient joint detection and tracking model named DEFT.
Our approach relies on an appearance-based object matching network jointly-learned with an underlying object detection network.
DEFT has comparable accuracy and speed to the top methods on 2D online tracking leaderboards.
arXiv Detail & Related papers (2021-02-03T20:00:44Z) - Appearance-free Tripartite Matching for Multiple Object Tracking [6.165592821539306]
Multiple Object Tracking (MOT) detects the trajectories of multiple objects given an input video.
Most existing algorithms depend on the uniqueness of the object's appearance, and the dominating bipartite matching scheme ignores the speed smoothness.
We propose an appearance-free tripartite matching to avoid the irregular velocity problem of the bipartite matching.
arXiv Detail & Related papers (2020-08-09T02:16:44Z) - Tracking-by-Counting: Using Network Flows on Crowd Density Maps for
Tracking Multiple Targets [96.98888948518815]
State-of-the-art multi-object tracking(MOT) methods follow the tracking-by-detection paradigm.
We propose a new MOT paradigm, tracking-by-counting, tailored for crowded scenes.
arXiv Detail & Related papers (2020-07-18T19:51:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.