Occlusion-Aware Detection and Re-ID Calibrated Network for Multi-Object
Tracking
- URL: http://arxiv.org/abs/2308.15795v1
- Date: Wed, 30 Aug 2023 06:56:53 GMT
- Title: Occlusion-Aware Detection and Re-ID Calibrated Network for Multi-Object
Tracking
- Authors: Yukun Su, Ruizhou Sun, Xin Shu, Yu Zhang, Qingyao Wu
- Abstract summary: Occlusion-Aware Attention (OAA) module in the detector highlights the object features while suppressing the occluded background regions.
OAA can serve as a modulator that enhances the detector for some potentially occluded objects.
We design a Re-ID embedding matching block based on the optimal transport problem.
- Score: 38.36872739816151
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Multi-Object Tracking (MOT) is a crucial computer vision task that aims to
predict the bounding boxes and identities of objects simultaneously. While
state-of-the-art methods have made remarkable progress by jointly optimizing
the multi-task problems of detection and Re-ID feature learning, yet, few
approaches explore to tackle the occlusion issue, which is a long-standing
challenge in the MOT field. Generally, occluded objects may hinder the detector
from estimating the bounding boxes, resulting in fragmented trajectories. And
the learned occluded Re-ID embeddings are less distinct since they contain
interferer. To this end, we propose an occlusion-aware detection and Re-ID
calibrated network for multi-object tracking, termed as ORCTrack. Specifically,
we propose an Occlusion-Aware Attention (OAA) module in the detector that
highlights the object features while suppressing the occluded background
regions. OAA can serve as a modulator that enhances the detector for some
potentially occluded objects. Furthermore, we design a Re-ID embedding matching
block based on the optimal transport problem, which focuses on enhancing and
calibrating the Re-ID representations through different adjacent frames
complementarily. To validate the effectiveness of the proposed method,
extensive experiments are conducted on two challenging VisDrone2021-MOT and
KITTI benchmarks. Experimental evaluations demonstrate the superiority of our
approach, which can achieve new state-of-the-art performance and enjoy high
run-time efficiency.
Related papers
- Renormalized Connection for Scale-preferred Object Detection in Satellite Imagery [51.83786195178233]
We design a Knowledge Discovery Network (KDN) to implement the renormalization group theory in terms of efficient feature extraction.
Renormalized connection (RC) on the KDN enables synergistic focusing'' of multi-scale features.
RCs extend the multi-level feature's divide-and-conquer'' mechanism of the FPN-based detectors to a wide range of scale-preferred tasks.
arXiv Detail & Related papers (2024-09-09T13:56:22Z) - Multiple Object Tracking based on Occlusion-Aware Embedding Consistency
Learning [46.726678333518066]
Occlusion Prediction Module (OPM) and Occlusion-Aware Association Module (OAAM)
OPM predicts occlusion information for each true detection, facilitating the selection of valid samples for consistency learning of the track's visual embedding.
OAAM generates two separate embeddings for each track, guaranteeing consistency in both unoccluded and occluded detections.
arXiv Detail & Related papers (2023-11-05T06:08:58Z) - Small Object Detection via Coarse-to-fine Proposal Generation and
Imitation Learning [52.06176253457522]
We propose a two-stage framework tailored for small object detection based on the Coarse-to-fine pipeline and Feature Imitation learning.
CFINet achieves state-of-the-art performance on the large-scale small object detection benchmarks, SODA-D and SODA-A.
arXiv Detail & Related papers (2023-08-18T13:13:09Z) - CAMO-MOT: Combined Appearance-Motion Optimization for 3D Multi-Object
Tracking with Camera-LiDAR Fusion [34.42289908350286]
3D Multi-object tracking (MOT) ensures consistency during continuous dynamic detection.
It can be challenging to accurately track the irregular motion of objects for LiDAR-based methods.
We propose a novel camera-LiDAR fusion 3D MOT framework based on the Combined Appearance-Motion Optimization (CAMO-MOT)
arXiv Detail & Related papers (2022-09-06T14:41:38Z) - Online Multi-Object Tracking with Unsupervised Re-Identification
Learning and Occlusion Estimation [80.38553821508162]
Occlusion between different objects is a typical challenge in Multi-Object Tracking (MOT)
In this paper, we focus on online multi-object tracking and design two novel modules to handle these problems.
The proposed unsupervised re-identification learning module does not require any (pseudo) identity information nor suffer from the scalability issue.
Our study shows that, when applied to state-of-the-art MOT methods, the proposed unsupervised re-identification learning is comparable to supervised re-identification learning.
arXiv Detail & Related papers (2022-01-04T18:59:58Z) - Multi-object Tracking with a Hierarchical Single-branch Network [31.680667324595557]
We propose an online multi-object tracking framework based on a hierarchical single-branch network.
Our novel iHOIM loss function unifies the objectives of the two sub-tasks and encourages better detection performance.
Experimental results on MOT16 and MOT20 datasets show that we can achieve state-of-the-art tracking performance.
arXiv Detail & Related papers (2021-01-06T12:14:58Z) - IA-MOT: Instance-Aware Multi-Object Tracking with Motion Consistency [40.354708148590696]
"instance-aware MOT" (IA-MOT) can track multiple objects in either static or moving cameras.
Our proposed method won the first place in Track 3 of the BMTT Challenge in CVPR 2020 workshops.
arXiv Detail & Related papers (2020-06-24T03:53:36Z) - FairMOT: On the Fairness of Detection and Re-Identification in Multiple
Object Tracking [92.48078680697311]
Multi-object tracking (MOT) is an important problem in computer vision.
We present a simple yet effective approach termed as FairMOT based on the anchor-free object detection architecture CenterNet.
The approach achieves high accuracy for both detection and tracking.
arXiv Detail & Related papers (2020-04-04T08:18:00Z) - Progressive Object Transfer Detection [84.48927705173494]
We propose a novel Progressive Object Transfer Detection (POTD) framework.
First, POTD can leverage various object supervision of different domains effectively into a progressive detection procedure.
Second, POTD consists of two delicate transfer stages, i.e., Low-Shot Transfer Detection (LSTD), and Weakly-Supervised Transfer Detection (WSTD)
arXiv Detail & Related papers (2020-02-12T00:16:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.