Hierarchical Supervision and Shuffle Data Augmentation for 3D
Semi-Supervised Object Detection
- URL: http://arxiv.org/abs/2304.01464v1
- Date: Tue, 4 Apr 2023 02:09:32 GMT
- Title: Hierarchical Supervision and Shuffle Data Augmentation for 3D
Semi-Supervised Object Detection
- Authors: Chuandong Liu, Chenqiang Gao, Fangcen Liu, Pengcheng Li, Deyu Meng,
Xinbo Gao
- Abstract summary: State-of-the-art 3D object detectors are usually trained on large-scale datasets with high-quality 3D annotations.
A natural remedy is to adopt semi-supervised learning (SSL) by leveraging a limited amount of labeled samples and abundant unlabeled samples.
This paper introduces a novel approach of Hierarchical Supervision and Shuffle Data Augmentation (HSSDA), which is a simple yet effective teacher-student framework.
- Score: 90.32180043449263
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: State-of-the-art 3D object detectors are usually trained on large-scale
datasets with high-quality 3D annotations. However, such 3D annotations are
often expensive and time-consuming, which may not be practical for real
applications. A natural remedy is to adopt semi-supervised learning (SSL) by
leveraging a limited amount of labeled samples and abundant unlabeled samples.
Current pseudolabeling-based SSL object detection methods mainly adopt a
teacher-student framework, with a single fixed threshold strategy to generate
supervision signals, which inevitably brings confused supervision when guiding
the student network training. Besides, the data augmentation of the point cloud
in the typical teacher-student framework is too weak, and only contains basic
down sampling and flip-and-shift (i.e., rotate and scaling), which hinders the
effective learning of feature information. Hence, we address these issues by
introducing a novel approach of Hierarchical Supervision and Shuffle Data
Augmentation (HSSDA), which is a simple yet effective teacher-student
framework. The teacher network generates more reasonable supervision for the
student network by designing a dynamic dual-threshold strategy. Besides, the
shuffle data augmentation strategy is designed to strengthen the feature
representation ability of the student network. Extensive experiments show that
HSSDA consistently outperforms the recent state-of-the-art methods on different
datasets. The code will be released at https://github.com/azhuantou/HSSDA.
Related papers
- Self-supervised cross-modality learning for uncertainty-aware object detection and recognition in applications which lack pre-labelled training data [6.892494758401737]
We show how an uncertainty-aware, deep neural network can be trained to detect, recognise and localise objects in 2D RGB images.
Our method can be applied to many important industrial tasks, where labelled datasets are typically unavailable.
arXiv Detail & Related papers (2024-11-05T13:26:31Z) - Dual-Perspective Knowledge Enrichment for Semi-Supervised 3D Object
Detection [55.210991151015534]
We present a novel Dual-Perspective Knowledge Enrichment approach named DPKE for semi-supervised 3D object detection.
Our DPKE enriches the knowledge of limited training data, particularly unlabeled data, from two perspectives: data-perspective and feature-perspective.
arXiv Detail & Related papers (2024-01-10T08:56:07Z) - Semi-supervised 3D Object Detection with Proficient Teachers [114.54835359657707]
Dominated point cloud-based 3D object detectors in autonomous driving scenarios rely heavily on the huge amount of accurately labeled samples.
Pseudo-Labeling methodology is commonly used for SSL frameworks, however, the low-quality predictions from the teacher model have seriously limited its performance.
We propose a new Pseudo-Labeling framework for semi-supervised 3D object detection, by enhancing the teacher model to a proficient one with several necessary designs.
arXiv Detail & Related papers (2022-07-26T04:54:03Z) - Open-Set Semi-Supervised Learning for 3D Point Cloud Understanding [62.17020485045456]
It is commonly assumed in semi-supervised learning (SSL) that the unlabeled data are drawn from the same distribution as that of the labeled ones.
We propose to selectively utilize unlabeled data through sample weighting, so that only conducive unlabeled data would be prioritized.
arXiv Detail & Related papers (2022-05-02T16:09:17Z) - Large-Scale Pre-training for Person Re-identification with Noisy Labels [125.49696935852634]
We develop a large-scale Pre-training framework utilizing Noisy Labels (PNL)
In principle, joint learning of these three modules not only clusters similar examples to one prototype, but also rectifies noisy labels based on the prototype assignment.
This simple pre-training task provides a scalable way to learn SOTA Re-ID representations from scratch on "LUPerson-NL" without bells and whistles.
arXiv Detail & Related papers (2022-03-30T17:59:58Z) - Semi-supervised 3D Object Detection via Adaptive Pseudo-Labeling [18.209409027211404]
3D object detection is an important task in computer vision.
Most existing methods require a large number of high-quality 3D annotations, which are expensive to collect.
We propose a novel semi-supervised framework based on pseudo-labeling for outdoor 3D object detection tasks.
arXiv Detail & Related papers (2021-08-15T02:58:43Z) - SESS: Self-Ensembling Semi-Supervised 3D Object Detection [138.80825169240302]
We propose SESS, a self-ensembling semi-supervised 3D object detection framework. Specifically, we design a thorough perturbation scheme to enhance generalization of the network on unlabeled and new unseen data.
Our SESS achieves competitive performance compared to the state-of-the-art fully-supervised method by using only 50% labeled data.
arXiv Detail & Related papers (2019-12-26T08:48:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.