Not Every Side Is Equal: Localization Uncertainty Estimation for
Semi-Supervised 3D Object Detection
- URL: http://arxiv.org/abs/2312.10390v1
- Date: Sat, 16 Dec 2023 09:08:03 GMT
- Title: Not Every Side Is Equal: Localization Uncertainty Estimation for
Semi-Supervised 3D Object Detection
- Authors: ChuXin Wang, Wenfei Yang, Tianzhu Zhang
- Abstract summary: Semi-supervised 3D object detection from point cloud aims to train a detector with a small number of labeled data and a large number of unlabeled data.
Existing methods treat each pseudo bounding box as a whole and assign equal importance to each side during training.
We propose a side-aware framework for semi-supervised 3D object detection consisting of three key designs.
- Score: 38.77989138502667
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Semi-supervised 3D object detection from point cloud aims to train a detector
with a small number of labeled data and a large number of unlabeled data. The
core of existing methods lies in how to select high-quality pseudo-labels using
the designed quality evaluation criterion. However, these methods treat each
pseudo bounding box as a whole and assign equal importance to each side during
training, which is detrimental to model performance due to many sides having
poor localization quality. Besides, existing methods filter out a large number
of low-quality pseudo-labels, which also contain some correct regression values
that can help with model training. To address the above issues, we propose a
side-aware framework for semi-supervised 3D object detection consisting of
three key designs: a 3D bounding box parameterization method, an uncertainty
estimation module, and a pseudo-label selection strategy. These modules work
together to explicitly estimate the localization quality of each side and
assign different levels of importance during the training phase. Extensive
experiment results demonstrate that the proposed method can consistently
outperform baseline models under different scenes and evaluation criteria.
Moreover, our method achieves state-of-the-art performance on three datasets
with different labeled ratios.
Related papers
- Dual-Perspective Knowledge Enrichment for Semi-Supervised 3D Object
Detection [55.210991151015534]
We present a novel Dual-Perspective Knowledge Enrichment approach named DPKE for semi-supervised 3D object detection.
Our DPKE enriches the knowledge of limited training data, particularly unlabeled data, from two perspectives: data-perspective and feature-perspective.
arXiv Detail & Related papers (2024-01-10T08:56:07Z) - Revisiting Domain-Adaptive 3D Object Detection by Reliable, Diverse and
Class-balanced Pseudo-Labeling [38.07637524378327]
Unsupervised domain adaptation (DA) with the aid of pseudo labeling techniques has emerged as a crucial approach for domain-adaptive 3D object detection.
Existing DA methods suffer from a substantial drop in performance when applied to a multi-class training setting.
We propose a novel ReDB framework tailored for learning to detect all classes at once.
arXiv Detail & Related papers (2023-07-16T04:34:11Z) - Exploring Active 3D Object Detection from a Generalization Perspective [58.597942380989245]
Uncertainty-based active learning policies fail to balance the trade-off between point cloud informativeness and box-level annotation costs.
We propose textscCrb, which hierarchically filters out the point clouds of redundant 3D bounding box labels.
Experiments show that the proposed approach outperforms existing active learning strategies.
arXiv Detail & Related papers (2023-01-23T02:43:03Z) - 360-MLC: Multi-view Layout Consistency for Self-training and
Hyper-parameter Tuning [40.93848397359068]
We present 360-MLC, a self-training method based on multi-view layout consistency for finetuning monocular room- models.
We leverage the entropy information in multiple layout estimations as a quantitative metric to measure the geometry consistency of the scene.
arXiv Detail & Related papers (2022-10-24T03:31:48Z) - Semi-supervised 3D Object Detection with Proficient Teachers [114.54835359657707]
Dominated point cloud-based 3D object detectors in autonomous driving scenarios rely heavily on the huge amount of accurately labeled samples.
Pseudo-Labeling methodology is commonly used for SSL frameworks, however, the low-quality predictions from the teacher model have seriously limited its performance.
We propose a new Pseudo-Labeling framework for semi-supervised 3D object detection, by enhancing the teacher model to a proficient one with several necessary designs.
arXiv Detail & Related papers (2022-07-26T04:54:03Z) - ST3D++: Denoised Self-training for Unsupervised Domain Adaptation on 3D
Object Detection [78.71826145162092]
We present a self-training method, named ST3D++, with a holistic pseudo label denoising pipeline for unsupervised domain adaptation on 3D object detection.
We equip the pseudo label generation process with a hybrid quality-aware triplet memory to improve the quality and stability of generated pseudo labels.
In the model training stage, we propose a source data assisted training strategy and a curriculum data augmentation policy.
arXiv Detail & Related papers (2021-08-15T07:49:06Z) - 3DIoUMatch: Leveraging IoU Prediction for Semi-Supervised 3D Object
Detection [76.42897462051067]
3DIoUMatch is a novel semi-supervised method for 3D object detection applicable to both indoor and outdoor scenes.
We leverage a teacher-student mutual learning framework to propagate information from the labeled to the unlabeled train set in the form of pseudo-labels.
Our method consistently improves state-of-the-art methods on both ScanNet and SUN-RGBD benchmarks by significant margins under all label ratios.
arXiv Detail & Related papers (2020-12-08T11:06:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.