A Closer Look at Few-Shot 3D Point Cloud Classification
- URL: http://arxiv.org/abs/2303.18210v1
- Date: Fri, 31 Mar 2023 17:01:13 GMT
- Title: A Closer Look at Few-Shot 3D Point Cloud Classification
- Authors: Chuangguan Ye, Hongyuan Zhu, Bo Zhang, Tao Chen
- Abstract summary: We propose a new network, Point-cloud Correlation Interaction ( PCIA), with three novel plug-and-play components called Salient-Part Fusion (SPF), Self-Channel Interaction Plus (SCI+) module, and Cross-Instance Fusion Plus (CIF+) module.
These modules can be inserted into most FSL algorithms with minor changes and significantly improve the performance.
Experimental results on three benchmark datasets, ModelNet40-FS, ShapeNet70-FS, and ScanObjectNN-FS, demonstrate that our method achieves state-of-the-art performance.
- Score: 21.57893885371941
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In recent years, research on few-shot learning (FSL) has been fast-growing in
the 2D image domain due to the less requirement for labeled training data and
greater generalization for novel classes. However, its application in 3D point
cloud data is relatively under-explored. Not only need to distinguish unseen
classes as in the 2D domain, 3D FSL is more challenging in terms of irregular
structures, subtle inter-class differences, and high intra-class variances
{when trained on a low number of data.} Moreover, different architectures and
learning algorithms make it difficult to study the effectiveness of existing 2D
FSL algorithms when migrating to the 3D domain. In this work, for the first
time, we perform systematic and extensive investigations of directly applying
recent 2D FSL works to 3D point cloud related backbone networks and thus
suggest a strong learning baseline for few-shot 3D point cloud classification.
Furthermore, we propose a new network, Point-cloud Correlation Interaction
(PCIA), with three novel plug-and-play components called Salient-Part Fusion
(SPF) module, Self-Channel Interaction Plus (SCI+) module, and Cross-Instance
Fusion Plus (CIF+) module to obtain more representative embeddings and improve
the feature distinction. These modules can be inserted into most FSL algorithms
with minor changes and significantly improve the performance. Experimental
results on three benchmark datasets, ModelNet40-FS, ShapeNet70-FS, and
ScanObjectNN-FS, demonstrate that our method achieves state-of-the-art
performance for the 3D FSL task. Code and datasets are available at
https://github.com/cgye96/A_Closer_Look_At_3DFSL.
Related papers
- Foundation Model-Powered 3D Few-Shot Class Incremental Learning via Training-free Adaptor [9.54964908165465]
This paper introduces a new method to tackle the Few-Shot Continual Incremental Learning problem in 3D point cloud environments.
We leverage a foundational 3D model trained extensively on point cloud data.
Our approach uses a dual cache system: first, it uses previous test samples based on how confident the model was in its predictions to prevent forgetting, and second, it includes a small number of new task samples to prevent overfitting.
arXiv Detail & Related papers (2024-10-11T20:23:00Z) - Leveraging Large-Scale Pretrained Vision Foundation Models for
Label-Efficient 3D Point Cloud Segmentation [67.07112533415116]
We present a novel framework that adapts various foundational models for the 3D point cloud segmentation task.
Our approach involves making initial predictions of 2D semantic masks using different large vision models.
To generate robust 3D semantic pseudo labels, we introduce a semantic label fusion strategy that effectively combines all the results via voting.
arXiv Detail & Related papers (2023-11-03T15:41:15Z) - Clustering based Point Cloud Representation Learning for 3D Analysis [80.88995099442374]
We propose a clustering based supervised learning scheme for point cloud analysis.
Unlike current de-facto, scene-wise training paradigm, our algorithm conducts within-class clustering on the point embedding space.
Our algorithm shows notable improvements on famous point cloud segmentation datasets.
arXiv Detail & Related papers (2023-07-27T03:42:12Z) - What Makes for Effective Few-shot Point Cloud Classification? [18.62689395276194]
We show that 3D few-shot learning is more challenging with unordered structures, high intra-class variances, and subtle inter-class differences.
We propose a novel plug-and-play component called Cross-Instance Adaptation (CIA) module, to address the high intra-class variances and subtle inter-class differences issues.
arXiv Detail & Related papers (2023-03-31T15:55:06Z) - ULIP: Learning a Unified Representation of Language, Images, and Point
Clouds for 3D Understanding [110.07170245531464]
Current 3D models are limited by datasets with a small number of annotated data and a pre-defined set of categories.
Recent advances have shown that similar problems can be significantly alleviated by employing knowledge from other modalities, such as language.
We learn a unified representation of images, texts, and 3D point clouds by pre-training with object triplets from the three modalities.
arXiv Detail & Related papers (2022-12-10T01:34:47Z) - Focal Sparse Convolutional Networks for 3D Object Detection [121.45950754511021]
We introduce two new modules to enhance the capability of Sparse CNNs.
They are focal sparse convolution (Focals Conv) and its multi-modal variant of focal sparse convolution with fusion.
For the first time, we show that spatially learnable sparsity in sparse convolution is essential for sophisticated 3D object detection.
arXiv Detail & Related papers (2022-04-26T17:34:10Z) - MetaSets: Meta-Learning on Point Sets for Generalizable Representations [100.5981809166658]
We study a new problem of 3D Domain Generalization (3DDG) with the goal to generalize the model to other unseen domains of point clouds without access to them in the training process.
We propose to tackle this problem via MetaSets, which meta-learns point cloud representations from a group of classification tasks on carefully-designed transformed point sets.
We design two benchmarks for Sim-to-Real transfer of 3D point clouds. Experimental results show that MetaSets outperforms existing 3D deep learning methods by large margins.
arXiv Detail & Related papers (2022-04-15T03:24:39Z) - LATFormer: Locality-Aware Point-View Fusion Transformer for 3D Shape
Recognition [38.540048855119004]
We propose a novel Locality-Aware Point-View Fusion Transformer (LATFormer) for 3D shape retrieval and classification.
The core component of LATFormer is a module named Locality-Aware Fusion (LAF) which integrates the local features of correlated regions across the two modalities.
In our LATFormer, we utilize the LAF module to fuse the multi-scale features of the two modalities both bidirectionally and hierarchically to obtain more informative features.
arXiv Detail & Related papers (2021-09-03T03:23:27Z) - Point Discriminative Learning for Unsupervised Representation Learning
on 3D Point Clouds [54.31515001741987]
We propose a point discriminative learning method for unsupervised representation learning on 3D point clouds.
We achieve this by imposing a novel point discrimination loss on the middle level and global level point features.
Our method learns powerful representations and achieves new state-of-the-art performance.
arXiv Detail & Related papers (2021-08-04T15:11:48Z) - Zero-Shot Learning on 3D Point Cloud Objects and Beyond [21.6491982908705]
We identify some of the challenges and apply 2D Zero-Shot Learning (ZSL) methods in the 3D domain to analyze the performance of existing models.
A novel loss function is developed that simultaneously aligns seen semantics with point cloud features.
An extensive set of experiments is carried out, establishing state-of-the-art for ZSL and GZSL on synthetic and real datasets.
arXiv Detail & Related papers (2021-04-11T10:04:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.