Rethinking Few-shot 3D Point Cloud Semantic Segmentation
- URL: http://arxiv.org/abs/2403.00592v1
- Date: Fri, 1 Mar 2024 15:14:47 GMT
- Title: Rethinking Few-shot 3D Point Cloud Semantic Segmentation
- Authors: Zhaochong An, Guolei Sun, Yun Liu, Fayao Liu, Zongwei Wu, Dan Wang,
Luc Van Gool, Serge Belongie
- Abstract summary: This paper revisits few-shot 3D point cloud semantic segmentation (FS-PCS)
We focus on two significant issues in the state-of-the-art: foreground leakage and sparse point distribution.
To address these issues, we introduce a standardized FS-PCS setting, upon which a new benchmark is built.
- Score: 62.80639841429669
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper revisits few-shot 3D point cloud semantic segmentation (FS-PCS),
with a focus on two significant issues in the state-of-the-art: foreground
leakage and sparse point distribution. The former arises from non-uniform point
sampling, allowing models to distinguish the density disparities between
foreground and background for easier segmentation. The latter results from
sampling only 2,048 points, limiting semantic information and deviating from
the real-world practice. To address these issues, we introduce a standardized
FS-PCS setting, upon which a new benchmark is built. Moreover, we propose a
novel FS-PCS model. While previous methods are based on feature optimization by
mainly refining support features to enhance prototypes, our method is based on
correlation optimization, referred to as Correlation Optimization Segmentation
(COSeg). Specifically, we compute Class-specific Multi-prototypical Correlation
(CMC) for each query point, representing its correlations to category
prototypes. Then, we propose the Hyper Correlation Augmentation (HCA) module to
enhance CMC. Furthermore, tackling the inherent property of few-shot training
to incur base susceptibility for models, we propose to learn non-parametric
prototypes for the base classes during training. The learned base prototypes
are used to calibrate correlations for the background class through a Base
Prototypes Calibration (BPC) module. Experiments on popular datasets
demonstrate the superiority of COSeg over existing methods. The code is
available at: https://github.com/ZhaochongAn/COSeg
Related papers
- Class-Imbalanced Semi-Supervised Learning for Large-Scale Point Cloud
Semantic Segmentation via Decoupling Optimization [64.36097398869774]
Semi-supervised learning (SSL) has been an active research topic for large-scale 3D scene understanding.
The existing SSL-based methods suffer from severe training bias due to class imbalance and long-tail distributions of the point cloud data.
We introduce a new decoupling optimization framework, which disentangles feature representation learning and classifier in an alternative optimization manner to shift the bias decision boundary effectively.
arXiv Detail & Related papers (2024-01-13T04:16:40Z) - Prototypical Kernel Learning and Open-set Foreground Perception for
Generalized Few-shot Semantic Segmentation [7.707161030443157]
Generalized Few-shot Semantic (GFSS) extends Few-shot Semantic aggregation to segment unseen classes and seen classes during evaluation.
We address the aforementioned problems by jointing the prototypical kernel learning and open-set perception.
In addition, a foreground contextual perception module cooperating with conditional bias based inference is adopted to perform class-agnostic as well as open-set foreground detection.
arXiv Detail & Related papers (2023-08-09T13:38:52Z) - Boosting Few-shot 3D Point Cloud Segmentation via Query-Guided
Enhancement [30.017448714419455]
This paper proposes a novel approach to improve point cloud few-shot segmentation (PC-FSS) models.
Unlike existing PC-FSS methods that directly utilize categorical information from support prototypes to recognize novel classes in query samples, our method identifies two critical aspects that substantially enhance model performance.
arXiv Detail & Related papers (2023-08-06T18:07:45Z) - Boosting Low-Data Instance Segmentation by Unsupervised Pre-training
with Saliency Prompt [103.58323875748427]
This work offers a novel unsupervised pre-training solution for low-data regimes.
Inspired by the recent success of the Prompting technique, we introduce a new pre-training method that boosts QEIS models.
Experimental results show that our method significantly boosts several QEIS models on three datasets.
arXiv Detail & Related papers (2023-02-02T15:49:03Z) - Prediction Calibration for Generalized Few-shot Semantic Segmentation [101.69940565204816]
Generalized Few-shot Semantic (GFSS) aims to segment each image pixel into either base classes with abundant training examples or novel classes with only a handful of (e.g., 1-5) training images per class.
We build a cross-attention module that guides the classifier's final prediction using the fused multi-level features.
Our PCN outperforms the state-the-art alternatives by large margins.
arXiv Detail & Related papers (2022-10-15T13:30:12Z) - Rethinking Semantic Segmentation: A Prototype View [126.59244185849838]
We present a nonparametric semantic segmentation model based on non-learnable prototypes.
Our framework yields compelling results over several datasets.
We expect this work will provoke a rethink of the current de facto semantic segmentation model design.
arXiv Detail & Related papers (2022-03-28T21:15:32Z) - Learn to Learn Metric Space for Few-Shot Segmentation of 3D Shapes [17.217954254022573]
We introduce a meta-learning-based method for few-shot 3D shape segmentation where only a few labeled samples are provided for the unseen classes.
We demonstrate the superior performance of our proposed on the ShapeNet part dataset under the few-shot scenario, compared with well-established baseline and state-of-the-art semi-supervised methods.
arXiv Detail & Related papers (2021-07-07T01:47:00Z) - Contrastive Prototype Learning with Augmented Embeddings for Few-Shot
Learning [58.2091760793799]
We propose a novel contrastive prototype learning with augmented embeddings (CPLAE) model.
With a class prototype as an anchor, CPL aims to pull the query samples of the same class closer and those of different classes further away.
Extensive experiments on several benchmarks demonstrate that our proposed CPLAE achieves new state-of-the-art.
arXiv Detail & Related papers (2021-01-23T13:22:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.