AutoSelect: Automatic and Dynamic Detection Selection for 3D
Multi-Object Tracking
- URL: http://arxiv.org/abs/2012.05894v1
- Date: Thu, 10 Dec 2020 18:55:51 GMT
- Title: AutoSelect: Automatic and Dynamic Detection Selection for 3D
Multi-Object Tracking
- Authors: Xinshuo Weng, Kris Kitani
- Abstract summary: 3D multi-object tracking is an important component in robotic perception systems such as self-driving vehicles.
Recent work follows a tracking-by-detection pipeline, which aims to match past tracklets with detections in the current frame.
Finding a proper threshold is non-trivial, which requires extensive manual search via ablation study.
We propose to automatically select high-quality detections and remove the efforts needed for manual threshold search.
- Score: 25.744696682209934
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: 3D multi-object tracking is an important component in robotic perception
systems such as self-driving vehicles. Recent work follows a
tracking-by-detection pipeline, which aims to match past tracklets with
detections in the current frame. To avoid matching with false positive
detections, prior work filters out detections with low confidence scores via a
threshold. However, finding a proper threshold is non-trivial, which requires
extensive manual search via ablation study. Also, this threshold is sensitive
to many factors such as target object category so we need to re-search the
threshold if these factors change. To ease this process, we propose to
automatically select high-quality detections and remove the efforts needed for
manual threshold search. Also, prior work often uses a single threshold per
data sequence, which is sub-optimal in particular frames or for certain
objects. Instead, we dynamically search threshold per frame or per object to
further boost performance. Through experiments on KITTI and nuScenes, our
method can filter out $45.7\%$ false positives while maintaining the recall,
achieving new S.O.T.A. performance and removing the need for manually threshold
tuning.
Related papers
- RobMOT: Robust 3D Multi-Object Tracking by Observational Noise and State Estimation Drift Mitigation on LiDAR PointCloud [11.111388829965103]
This work addresses limitations in recent 3D tracking-by-detection methods.
We propose a novel online track validity mechanism that temporally distinguishes between legitimate and ghost tracks.
We also introduce a refinement to the Kalman filter that enhances noise mitigation in trajectory drift, leading to more robust state estimation for occluded objects.
arXiv Detail & Related papers (2024-05-19T12:49:21Z) - RTracker: Recoverable Tracking via PN Tree Structured Memory [71.05904715104411]
We propose a recoverable tracking framework, RTracker, that uses a tree-structured memory to dynamically associate a tracker and a detector to enable self-recovery.
Specifically, we propose a Positive-Negative Tree-structured memory to chronologically store and maintain positive and negative target samples.
Our core idea is to use the support samples of positive and negative target categories to establish a relative distance-based criterion for a reliable assessment of target loss.
arXiv Detail & Related papers (2024-03-28T08:54:40Z) - Unsupervised Domain Adaptation for Self-Driving from Past Traversal
Features [69.47588461101925]
We propose a method to adapt 3D object detectors to new driving environments.
Our approach enhances LiDAR-based detection models using spatial quantized historical features.
Experiments on real-world datasets demonstrate significant improvements.
arXiv Detail & Related papers (2023-09-21T15:00:31Z) - SalienDet: A Saliency-based Feature Enhancement Algorithm for Object
Detection for Autonomous Driving [160.57870373052577]
We propose a saliency-based OD algorithm (SalienDet) to detect unknown objects.
Our SalienDet utilizes a saliency-based algorithm to enhance image features for object proposal generation.
We design a dataset relabeling approach to differentiate the unknown objects from all objects in training sample set to achieve Open-World Detection.
arXiv Detail & Related papers (2023-05-11T16:19:44Z) - You Only Need Two Detectors to Achieve Multi-Modal 3D Multi-Object Tracking [9.20064374262956]
The proposed framework can achieve robust tracking by using only a 2D detector and a 3D detector.
It is proven more accurate than many of the state-of-the-art TBD-based multi-modal tracking methods.
arXiv Detail & Related papers (2023-04-18T02:45:18Z) - CAMO-MOT: Combined Appearance-Motion Optimization for 3D Multi-Object
Tracking with Camera-LiDAR Fusion [34.42289908350286]
3D Multi-object tracking (MOT) ensures consistency during continuous dynamic detection.
It can be challenging to accurately track the irregular motion of objects for LiDAR-based methods.
We propose a novel camera-LiDAR fusion 3D MOT framework based on the Combined Appearance-Motion Optimization (CAMO-MOT)
arXiv Detail & Related papers (2022-09-06T14:41:38Z) - Beyond Greedy Search: Tracking by Multi-Agent Reinforcement
Learning-based Beam Search [103.53249725360286]
Existing trackers usually select a location or proposal with the maximum score as tracking result for each frame.
We propose a novel multi-agent reinforcement learning based beam search strategy (termed BeamTracking) to address this issue.
arXiv Detail & Related papers (2022-05-19T16:35:36Z) - DSRRTracker: Dynamic Search Region Refinement for Attention-based
Siamese Multi-Object Tracking [13.104037155691644]
We propose an end-to-end MOT method, with a Gaussian filter-inspired dynamic search region refinement module.
Our method can achieve the state-of-the-art performance with reasonable speed.
arXiv Detail & Related papers (2022-03-21T04:14:06Z) - Distractor-Aware Fast Tracking via Dynamic Convolutions and MOT
Philosophy [63.91005999481061]
A practical long-term tracker typically contains three key properties, i.e. an efficient model design, an effective global re-detection strategy and a robust distractor awareness mechanism.
We propose a two-task tracking frame work (named DMTrack) to achieve distractor-aware fast tracking via Dynamic convolutions (d-convs) and Multiple object tracking (MOT) philosophy.
Our tracker achieves state-of-the-art performance on the LaSOT, OxUvA, TLP, VOT2018LT and VOT 2019LT benchmarks and runs in real-time (3x faster
arXiv Detail & Related papers (2021-04-25T00:59:53Z) - Detecting Invisible People [58.49425715635312]
We re-purpose tracking benchmarks and propose new metrics for the task of detecting invisible objects.
We demonstrate that current detection and tracking systems perform dramatically worse on this task.
Second, we build dynamic models that explicitly reason in 3D, making use of observations produced by state-of-the-art monocular depth estimation networks.
arXiv Detail & Related papers (2020-12-15T16:54:45Z) - A Video Analysis Method on Wanfang Dataset via Deep Neural Network [8.485930905198982]
We describe the new function for real-time multi-object detection in sports competition and pedestrians flow detection in public based on deep learning.
Based on the proposed algorithm, we adopt wanfang sports competition dataset as the main test dataset.
Our work also can used for pedestrians flow detection and pedestrian alarm tasks.
arXiv Detail & Related papers (2020-02-28T04:09:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.