3DIoUMatch: Leveraging IoU Prediction for Semi-Supervised 3D Object
Detection
- URL: http://arxiv.org/abs/2012.04355v2
- Date: Wed, 7 Apr 2021 13:31:55 GMT
- Title: 3DIoUMatch: Leveraging IoU Prediction for Semi-Supervised 3D Object
Detection
- Authors: He Wang, Yezhen Cong, Or Litany, Yue Gao, Leonidas J. Guibas
- Abstract summary: 3DIoUMatch is a novel semi-supervised method for 3D object detection applicable to both indoor and outdoor scenes.
We leverage a teacher-student mutual learning framework to propagate information from the labeled to the unlabeled train set in the form of pseudo-labels.
Our method consistently improves state-of-the-art methods on both ScanNet and SUN-RGBD benchmarks by significant margins under all label ratios.
- Score: 76.42897462051067
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: 3D object detection is an important yet demanding task that heavily relies on
difficult to obtain 3D annotations. To reduce the required amount of
supervision, we propose 3DIoUMatch, a novel semi-supervised method for 3D
object detection applicable to both indoor and outdoor scenes. We leverage a
teacher-student mutual learning framework to propagate information from the
labeled to the unlabeled train set in the form of pseudo-labels. However, due
to the high task complexity, we observe that the pseudo-labels suffer from
significant noise and are thus not directly usable. To that end, we introduce a
confidence-based filtering mechanism, inspired by FixMatch. We set confidence
thresholds based upon the predicted objectness and class probability to filter
low-quality pseudo-labels. While effective, we observe that these two measures
do not sufficiently capture localization quality. We therefore propose to use
the estimated 3D IoU as a localization metric and set category-aware
self-adjusted thresholds to filter poorly localized proposals. We adopt VoteNet
as our backbone detector on indoor datasets while we use PV-RCNN on the
autonomous driving dataset, KITTI. Our method consistently improves
state-of-the-art methods on both ScanNet and SUN-RGBD benchmarks by significant
margins under all label ratios (including fully labeled setting). For example,
when training using only 10\% labeled data on ScanNet, 3DIoUMatch achieves 7.7
absolute improvement on mAP@0.25 and 8.5 absolute improvement on mAP@0.5 upon
the prior art. On KITTI, we are the first to demonstrate semi-supervised 3D
object detection and our method surpasses a fully supervised baseline from 1.8%
to 7.6% under different label ratios and categories.
Related papers
- TrajSSL: Trajectory-Enhanced Semi-Supervised 3D Object Detection [59.498894868956306]
Pseudo-labeling approaches to semi-supervised learning adopt a teacher-student framework.
We leverage pre-trained motion-forecasting models to generate object trajectories on pseudo-labeled data.
Our approach improves pseudo-label quality in two distinct manners.
arXiv Detail & Related papers (2024-09-17T05:35:00Z) - Reliable Student: Addressing Noise in Semi-Supervised 3D Object Detection [3.6035734305775518]
Semi-supervised 3D object detection can benefit from the promising pseudo-labeling technique when labeled data is limited.
Recent approaches have overlooked the impact of noisy pseudo-labels during training.
We propose the Reliable Student framework, which incorporates two complementary approaches to mitigate errors.
arXiv Detail & Related papers (2024-04-27T13:38:45Z) - HyperMODEST: Self-Supervised 3D Object Detection with Confidence Score
Filtering [9.14477900515147]
Current LiDAR-based 3D object detectors for autonomous driving are almost entirely trained on human-annotated data.
MODEST is the first work to train 3D object detectors without any labels.
We propose a universal method that can largely accelerate the self-training process and does not require tuning on a specific dataset.
arXiv Detail & Related papers (2023-04-27T18:12:11Z) - DDS3D: Dense Pseudo-Labels with Dynamic Threshold for Semi-Supervised 3D
Object Detection [15.440609044002722]
We present a simple yet effective semi-supervised 3D object detector named3D.
Benefiting from these two components, our3D outperforms the state-of-the-art semi-supervised 3d object detection with mAP of 3.1% on the dataset and 2.1% on the cyclist.
arXiv Detail & Related papers (2023-03-09T07:30:53Z) - Semi-supervised 3D Object Detection with Proficient Teachers [114.54835359657707]
Dominated point cloud-based 3D object detectors in autonomous driving scenarios rely heavily on the huge amount of accurately labeled samples.
Pseudo-Labeling methodology is commonly used for SSL frameworks, however, the low-quality predictions from the teacher model have seriously limited its performance.
We propose a new Pseudo-Labeling framework for semi-supervised 3D object detection, by enhancing the teacher model to a proficient one with several necessary designs.
arXiv Detail & Related papers (2022-07-26T04:54:03Z) - ST3D++: Denoised Self-training for Unsupervised Domain Adaptation on 3D
Object Detection [78.71826145162092]
We present a self-training method, named ST3D++, with a holistic pseudo label denoising pipeline for unsupervised domain adaptation on 3D object detection.
We equip the pseudo label generation process with a hybrid quality-aware triplet memory to improve the quality and stability of generated pseudo labels.
In the model training stage, we propose a source data assisted training strategy and a curriculum data augmentation policy.
arXiv Detail & Related papers (2021-08-15T07:49:06Z) - Semi-supervised 3D Object Detection via Adaptive Pseudo-Labeling [18.209409027211404]
3D object detection is an important task in computer vision.
Most existing methods require a large number of high-quality 3D annotations, which are expensive to collect.
We propose a novel semi-supervised framework based on pseudo-labeling for outdoor 3D object detection tasks.
arXiv Detail & Related papers (2021-08-15T02:58:43Z) - ST3D: Self-training for Unsupervised Domain Adaptation on 3D
ObjectDetection [78.71826145162092]
We present a new domain adaptive self-training pipeline, named ST3D, for unsupervised domain adaptation on 3D object detection from point clouds.
Our ST3D achieves state-of-the-art performance on all evaluated datasets and even surpasses fully supervised results on KITTI 3D object detection benchmark.
arXiv Detail & Related papers (2021-03-09T10:51:24Z) - SESS: Self-Ensembling Semi-Supervised 3D Object Detection [138.80825169240302]
We propose SESS, a self-ensembling semi-supervised 3D object detection framework. Specifically, we design a thorough perturbation scheme to enhance generalization of the network on unlabeled and new unseen data.
Our SESS achieves competitive performance compared to the state-of-the-art fully-supervised method by using only 50% labeled data.
arXiv Detail & Related papers (2019-12-26T08:48:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.