Semi-supervised 3D Object Detection with Proficient Teachers
- URL: http://arxiv.org/abs/2207.12655v1
- Date: Tue, 26 Jul 2022 04:54:03 GMT
- Title: Semi-supervised 3D Object Detection with Proficient Teachers
- Authors: Junbo Yin, Jin Fang, Dingfu Zhou, Liangjun Zhang, Cheng-Zhong Xu,
Jianbing Shen, and Wenguan Wang
- Abstract summary: Dominated point cloud-based 3D object detectors in autonomous driving scenarios rely heavily on the huge amount of accurately labeled samples.
Pseudo-Labeling methodology is commonly used for SSL frameworks, however, the low-quality predictions from the teacher model have seriously limited its performance.
We propose a new Pseudo-Labeling framework for semi-supervised 3D object detection, by enhancing the teacher model to a proficient one with several necessary designs.
- Score: 114.54835359657707
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Dominated point cloud-based 3D object detectors in autonomous driving
scenarios rely heavily on the huge amount of accurately labeled samples,
however, 3D annotation in the point cloud is extremely tedious, expensive and
time-consuming. To reduce the dependence on large supervision, semi-supervised
learning (SSL) based approaches have been proposed. The Pseudo-Labeling
methodology is commonly used for SSL frameworks, however, the low-quality
predictions from the teacher model have seriously limited its performance. In
this work, we propose a new Pseudo-Labeling framework for semi-supervised 3D
object detection, by enhancing the teacher model to a proficient one with
several necessary designs. First, to improve the recall of pseudo labels, a
Spatialtemporal Ensemble (STE) module is proposed to generate sufficient seed
boxes. Second, to improve the precision of recalled boxes, a Clusteringbased
Box Voting (CBV) module is designed to get aggregated votes from the clustered
seed boxes. This also eliminates the necessity of sophisticated thresholds to
select pseudo labels. Furthermore, to reduce the negative influence of wrongly
pseudo-labeled samples during the training, a soft supervision signal is
proposed by considering Box-wise Contrastive Learning (BCL). The effectiveness
of our model is verified on both ONCE and Waymo datasets. For example, on ONCE,
our approach significantly improves the baseline by 9.51 mAP. Moreover, with
half annotations, our model outperforms the oracle model with full annotations
on Waymo.
Related papers
- TrajSSL: Trajectory-Enhanced Semi-Supervised 3D Object Detection [59.498894868956306]
Pseudo-labeling approaches to semi-supervised learning adopt a teacher-student framework.
We leverage pre-trained motion-forecasting models to generate object trajectories on pseudo-labeled data.
Our approach improves pseudo-label quality in two distinct manners.
arXiv Detail & Related papers (2024-09-17T05:35:00Z) - Diffusion-SS3D: Diffusion Model for Semi-supervised 3D Object Detection [77.23918785277404]
We present Diffusion-SS3D, a new perspective of enhancing the quality of pseudo-labels via the diffusion model for semi-supervised 3D object detection.
Specifically, we include noises to produce corrupted 3D object size and class label, distributions, and then utilize the diffusion model as a denoising process to obtain bounding box outputs.
We conduct experiments on the ScanNet and SUN RGB-D benchmark datasets to demonstrate that our approach achieves state-of-the-art performance against existing methods.
arXiv Detail & Related papers (2023-12-05T18:54:03Z) - Augment and Criticize: Exploring Informative Samples for Semi-Supervised
Monocular 3D Object Detection [64.65563422852568]
We improve the challenging monocular 3D object detection problem with a general semi-supervised framework.
We introduce a novel, simple, yet effective Augment and Criticize' framework that explores abundant informative samples from unlabeled data.
The two new detectors, dubbed 3DSeMo_DLE and 3DSeMo_FLEX, achieve state-of-the-art results with remarkable improvements for over 3.5% AP_3D/BEV (Easy) on KITTI.
arXiv Detail & Related papers (2023-03-20T16:28:15Z) - Exploring Active 3D Object Detection from a Generalization Perspective [58.597942380989245]
Uncertainty-based active learning policies fail to balance the trade-off between point cloud informativeness and box-level annotation costs.
We propose textscCrb, which hierarchically filters out the point clouds of redundant 3D bounding box labels.
Experiments show that the proposed approach outperforms existing active learning strategies.
arXiv Detail & Related papers (2023-01-23T02:43:03Z) - Open-Set Semi-Supervised Learning for 3D Point Cloud Understanding [62.17020485045456]
It is commonly assumed in semi-supervised learning (SSL) that the unlabeled data are drawn from the same distribution as that of the labeled ones.
We propose to selectively utilize unlabeled data through sample weighting, so that only conducive unlabeled data would be prioritized.
arXiv Detail & Related papers (2022-05-02T16:09:17Z) - Active Self-Semi-Supervised Learning for Few Labeled Samples [4.713652957384158]
Training deep models with limited annotations poses a significant challenge when applied to diverse practical domains.
We propose a simple yet effective framework, active self-semi-supervised learning (AS3L)
AS3L bootstraps semi-supervised models with prior pseudo-labels (PPL)
We develop active learning and label propagation strategies to obtain accurate PPL.
arXiv Detail & Related papers (2022-03-09T07:45:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.