Boosting Few-shot 3D Point Cloud Segmentation via Query-Guided
Enhancement
- URL: http://arxiv.org/abs/2308.03177v2
- Date: Tue, 8 Aug 2023 05:26:45 GMT
- Title: Boosting Few-shot 3D Point Cloud Segmentation via Query-Guided
Enhancement
- Authors: Zhenhua Ning, Zhuotao Tian, Guangming Lu, Wenjie Pei
- Abstract summary: This paper proposes a novel approach to improve point cloud few-shot segmentation (PC-FSS) models.
Unlike existing PC-FSS methods that directly utilize categorical information from support prototypes to recognize novel classes in query samples, our method identifies two critical aspects that substantially enhance model performance.
- Score: 30.017448714419455
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Although extensive research has been conducted on 3D point cloud
segmentation, effectively adapting generic models to novel categories remains a
formidable challenge. This paper proposes a novel approach to improve point
cloud few-shot segmentation (PC-FSS) models. Unlike existing PC-FSS methods
that directly utilize categorical information from support prototypes to
recognize novel classes in query samples, our method identifies two critical
aspects that substantially enhance model performance by reducing contextual
gaps between support prototypes and query features. Specifically, we (1) adapt
support background prototypes to match query context while removing extraneous
cues that may obscure foreground and background in query samples, and (2)
holistically rectify support prototypes under the guidance of query features to
emulate the latter having no semantic gap to the query targets. Our proposed
designs are agnostic to the feature extractor, rendering them readily
applicable to any prototype-based methods. The experimental results on S3DIS
and ScanNet demonstrate notable practical benefits, as our approach achieves
significant improvements while still maintaining high efficiency. The code for
our approach is available at
https://github.com/AaronNZH/Boosting-Few-shot-3D-Point-Cloud-Segmentation-via-Query-Guided-Enhanceme nt
Related papers
- Rethinking Few-shot 3D Point Cloud Semantic Segmentation [62.80639841429669]
This paper revisits few-shot 3D point cloud semantic segmentation (FS-PCS)
We focus on two significant issues in the state-of-the-art: foreground leakage and sparse point distribution.
To address these issues, we introduce a standardized FS-PCS setting, upon which a new benchmark is built.
arXiv Detail & Related papers (2024-03-01T15:14:47Z) - Dynamic Prototype Adaptation with Distillation for Few-shot Point Cloud
Segmentation [32.494146296437656]
Few-shot point cloud segmentation seeks to generate per-point masks for previously unseen categories.
We present dynamic prototype adaptation (DPA), which explicitly learns task-specific prototypes for each query point cloud.
arXiv Detail & Related papers (2024-01-29T11:00:46Z) - Fine-Grained Prototypes Distillation for Few-Shot Object Detection [8.795211323408513]
Few-shot object detection (FSOD) aims at extending a generic detector for novel object detection with only a few training examples.
In general, methods based on meta-learning employ an additional support branch to encode novel examples into class prototypes.
New methods are required to capture the distinctive local context for more robust novel object detection.
arXiv Detail & Related papers (2024-01-15T12:12:48Z) - Prototype Adaption and Projection for Few- and Zero-shot 3D Point Cloud
Semantic Segmentation [30.18333233940194]
We address the challenging task of few-shot and zero-shot 3D point cloud semantic segmentation.
Our proposed method surpasses state-of-the-art algorithms by a considerable 7.90% and 14.82% under the 2-way 1-shot setting on S3DIS and ScanNet benchmarks, respectively.
arXiv Detail & Related papers (2023-05-23T17:58:05Z) - Position-Guided Point Cloud Panoptic Segmentation Transformer [118.17651196656178]
This work begins by applying this appealing paradigm to LiDAR-based point cloud segmentation and obtains a simple yet effective baseline.
We observe that instances in the sparse point clouds are relatively small to the whole scene and often have similar geometry but lack distinctive appearance for segmentation, which are rare in the image domain.
The method, named Position-guided Point cloud Panoptic segmentation transFormer (P3Former), outperforms previous state-of-the-art methods by 3.4% and 1.2% on Semantic KITTI and nuScenes benchmark, respectively.
arXiv Detail & Related papers (2023-03-23T17:59:02Z) - Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation [103.90033029330527]
Few-Shot Instance (FSIS) requires detecting and segmenting novel classes with limited support examples.
We introduce a unified framework, Reference Twice (RefT), to exploit the relationship between support and query features for FSIS.
arXiv Detail & Related papers (2023-01-03T15:33:48Z) - APANet: Adaptive Prototypes Alignment Network for Few-Shot Semantic
Segmentation [56.387647750094466]
Few-shot semantic segmentation aims to segment novel-class objects in a given query image with only a few labeled support images.
Most advanced solutions exploit a metric learning framework that performs segmentation through matching each query feature to a learned class-specific prototype.
We present an adaptive prototype representation by introducing class-specific and class-agnostic prototypes.
arXiv Detail & Related papers (2021-11-24T04:38:37Z) - Plug-and-Play Few-shot Object Detection with Meta Strategy and Explicit
Localization Inference [78.41932738265345]
This paper proposes a plug detector that can accurately detect the objects of novel categories without fine-tuning process.
We introduce two explicit inferences into the localization process to reduce its dependence on annotated data.
It shows a significant lead in both efficiency, precision, and recall under varied evaluation protocols.
arXiv Detail & Related papers (2021-10-26T03:09:57Z) - An Enhanced Span-based Decomposition Method for Few-Shot Sequence
Labeling [27.468499201647063]
Few-Shot Sequence Labeling (FSSL) is a canonical solution for the tagging models to generalize on an emerging, resource-scarce domain.
We propose Enhanced Span-based Decomposition method, which follows the metric-based meta-learning paradigm for FSSL.
arXiv Detail & Related papers (2021-09-27T12:59:48Z) - Prior Guided Feature Enrichment Network for Few-Shot Segmentation [64.91560451900125]
State-of-the-art semantic segmentation methods require sufficient labeled data to achieve good results.
Few-shot segmentation is proposed to tackle this problem by learning a model that quickly adapts to new classes with a few labeled support samples.
Theses frameworks still face the challenge of generalization ability reduction on unseen classes due to inappropriate use of high-level semantic information.
arXiv Detail & Related papers (2020-08-04T10:41:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.