Understanding the Challenges When 3D Semantic Segmentation Faces Class
Imbalanced and OOD Data
- URL: http://arxiv.org/abs/2203.00214v1
- Date: Tue, 1 Mar 2022 03:53:18 GMT
- Title: Understanding the Challenges When 3D Semantic Segmentation Faces Class
Imbalanced and OOD Data
- Authors: Yancheng Pan, Fan Xie, Huijing Zhao
- Abstract summary: 3D semantic segmentation is an essential process in the creation of a safe autonomous driving system.
Deep learning models for 3D semantic segmentation often suffer from the class imbalance problem and out-of-distribution data.
In this study, we explore how the class imbalance problem affects 3DSS performance and whether the model can detect the category prediction correctness.
- Score: 4.503636381237414
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: 3D semantic segmentation (3DSS) is an essential process in the creation of a
safe autonomous driving system. However, deep learning models for 3D semantic
segmentation often suffer from the class imbalance problem and
out-of-distribution (OOD) data. In this study, we explore how the class
imbalance problem affects 3DSS performance and whether the model can detect the
category prediction correctness, or whether data is ID (in-distribution) or
OOD. For these purposes, we conduct two experiments using three representative
3DSS models and five trust scoring methods, and conduct both a confusion and
feature analysis of each class. Furthermore, a data augmentation method for the
3D LiDAR dataset is proposed to create a new dataset based on SemanticKITTI and
SemanticPOSS, called AugKITTI. We propose the wPre metric and TSD for a more
in-depth analysis of the results, and follow are proposals with an insightful
discussion. Based on the experimental results, we find that: (1) the classes
are not only imbalanced in their data size but also in the basic properties of
each semantic category. (2) The intraclass diversity and interclass ambiguity
make class learning difficult and greatly limit the models' performance,
creating the challenges of semantic and data gaps. (3) The trust scores are
unreliable for classes whose features are confused with other classes. For 3DSS
models, those misclassified ID classes and OODs may also be given high trust
scores, making the 3DSS predictions unreliable, and leading to the challenges
in judging 3DSS result trustworthiness. All of these outcomes point to several
research directions for improving the performance and reliability of the 3DSS
models used for real-world applications.
Related papers
- DyConfidMatch: Dynamic Thresholding and Re-sampling for 3D Semi-supervised Learning [4.259908158892314]
Semi-supervised learning (SSL) leverages limited labeled and abundant unlabeled data but often faces challenges with data imbalance.
This study investigates class-level confidence as an indicator of learning status in 3D SSL, proposing a novel method that utilizes dynamic thresholding.
A re-sampling strategy is also introduced to mitigate bias towards well-represented classes, ensuring equitable class representation.
arXiv Detail & Related papers (2024-11-13T05:09:28Z) - Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding [50.448520056844885]
We propose a generative Bayesian network to produce diverse synthetic scenes with real-world patterns.
A series of experiments robustly display our method's consistent superiority over existing state-of-the-art pre-training approaches.
arXiv Detail & Related papers (2024-06-17T07:43:53Z) - FILP-3D: Enhancing 3D Few-shot Class-incremental Learning with
Pre-trained Vision-Language Models [62.663113296987085]
Few-shot class-incremental learning aims to mitigate the catastrophic forgetting issue when a model is incrementally trained on limited data.
We introduce two novel components: the Redundant Feature Eliminator (RFE) and the Spatial Noise Compensator (SNC)
Considering the imbalance in existing 3D datasets, we also propose new evaluation metrics that offer a more nuanced assessment of a 3D FSCIL model.
arXiv Detail & Related papers (2023-12-28T14:52:07Z) - 3D Adversarial Augmentations for Robust Out-of-Domain Predictions [115.74319739738571]
We focus on improving the generalization to out-of-domain data.
We learn a set of vectors that deform the objects in an adversarial fashion.
We perform adversarial augmentation by applying the learned sample-independent vectors to the available objects when training a model.
arXiv Detail & Related papers (2023-08-29T17:58:55Z) - Class-Level Confidence Based 3D Semi-Supervised Learning [18.95161296147023]
We show that unlabeled data class-level confidence can represent the learning status in the 3D imbalanced dataset.
Our method significantly outperforms state-of-the-art counterparts for both 3D SSL classification and detection tasks.
arXiv Detail & Related papers (2022-10-18T20:13:28Z) - Open-Set Semi-Supervised Learning for 3D Point Cloud Understanding [62.17020485045456]
It is commonly assumed in semi-supervised learning (SSL) that the unlabeled data are drawn from the same distribution as that of the labeled ones.
We propose to selectively utilize unlabeled data through sample weighting, so that only conducive unlabeled data would be prioritized.
arXiv Detail & Related papers (2022-05-02T16:09:17Z) - Hyperspectral Classification Based on Lightweight 3-D-CNN With Transfer
Learning [67.40866334083941]
We propose an end-to-end 3-D lightweight convolutional neural network (CNN) for limited samples-based HSI classification.
Compared with conventional 3-D-CNN models, the proposed 3-D-LWNet has a deeper network structure, less parameters, and lower computation cost.
Our model achieves competitive performance for HSI classification compared to several state-of-the-art methods.
arXiv Detail & Related papers (2020-12-07T03:44:35Z) - SESS: Self-Ensembling Semi-Supervised 3D Object Detection [138.80825169240302]
We propose SESS, a self-ensembling semi-supervised 3D object detection framework. Specifically, we design a thorough perturbation scheme to enhance generalization of the network on unlabeled and new unseen data.
Our SESS achieves competitive performance compared to the state-of-the-art fully-supervised method by using only 50% labeled data.
arXiv Detail & Related papers (2019-12-26T08:48:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.