Related papers: Novel Class Discovery for Point Cloud Segmentation via Joint Learning of Causal Representation and Reasoning

Novel Class Discovery for Point Cloud Segmentation via Joint Learning of Causal Representation and Reasoning

URL: http://arxiv.org/abs/2510.13307v2
Date: Thu, 23 Oct 2025 01:35:00 GMT
Title: Novel Class Discovery for Point Cloud Segmentation via Joint Learning of Causal Representation and Reasoning
Authors: Yang Li, Aming Wu, Zihao Zhang, Yahong Han,
Abstract summary: We focus on Novel Class Discovery for Point Cloud (3D-NCD)<n>Key to this task is to setup the exact correlations between the point representations and their base class labels.<n>We propose a new method, i.e., Joint Learning of Causal Representation and Reasoning.
Score: 58.25418970608328
License: http://creativecommons.org/licenses/by/4.0/
Abstract: In this paper, we focus on Novel Class Discovery for Point Cloud Segmentation (3D-NCD), aiming to learn a model that can segment unlabeled (novel) 3D classes using only the supervision from labeled (base) 3D classes. The key to this task is to setup the exact correlations between the point representations and their base class labels, as well as the representation correlations between the points from base and novel classes. A coarse or statistical correlation learning may lead to the confusion in novel class inference. lf we impose a causal relationship as a strong correlated constraint upon the learning process, the essential point cloud representations that accurately correspond to the classes should be uncovered. To this end, we introduce a structural causal model (SCM) to re-formalize the 3D-NCD problem and propose a new method, i.e., Joint Learning of Causal Representation and Reasoning. Specifically, we first analyze hidden confounders in the base class representations and the causal relationships between the base and novel classes through SCM. We devise a causal representation prototype that eliminates confounders to capture the causal representations of base classes. A graph structure is then used to model the causal relationships between the base classes' causal representation prototypes and the novel class prototypes, enabling causal reasoning from base to novel classes. Extensive experiments and visualization results on 3D and 2D NCD semantic segmentation demonstrate the superiorities of our method.

Related papers

Order-Robust Class Incremental Learning: Graph-Driven Dynamic Similarity Grouping [19.168022702075774]
Class Incremental Learning (CIL) aims to enable models to learn new classes sequentially while retaining knowledge of previous ones.<n>Recent studies highlight that the performance of CIL models is highly sensitive to the order of class arrival.<n>We propose Graph-Driven Dynamic Similarity Grouping (GDDSG), a novel method that employs graph coloring algorithms to dynamically partition classes into similarity-constrained groups.
arXiv Detail & Related papers (2025-02-27T12:16:57Z)
CP-VoteNet: Contrastive Prototypical VoteNet for Few-Shot Point Cloud Object Detection [7.205000222081269]
Few-shot point cloud 3D object detection (FS3D) aims to identify and localise objects of novel classes from point clouds. We introduce contrastive semantics mining, which enables the network to extract discriminative categorical features. Through refined primitive geometric structures, the transferability of feature encoding from base to novel classes is significantly enhanced.
arXiv Detail & Related papers (2024-08-30T06:13:49Z)
Hierarchical Insights: Exploiting Structural Similarities for Reliable 3D Semantic Segmentation [4.480310276450028]
We propose a training strategy for a 3D LiDAR semantic segmentation model that learns structural relationships between classes through abstraction. This is achieved by implicitly modeling these relationships using a learning rule for hierarchical multi-label classification (HMC) Our detailed analysis demonstrates that this training strategy not only improves the model's confidence calibration but also retains additional information useful for downstream tasks such as fusion, prediction, and planning.
arXiv Detail & Related papers (2024-04-09T08:49:01Z)
Rethinking Few-shot 3D Point Cloud Semantic Segmentation [62.80639841429669]
This paper revisits few-shot 3D point cloud semantic segmentation (FS-PCS) We focus on two significant issues in the state-of-the-art: foreground leakage and sparse point distribution. To address these issues, we introduce a standardized FS-PCS setting, upon which a new benchmark is built.
arXiv Detail & Related papers (2024-03-01T15:14:47Z)
Learning from Semi-Factuals: A Debiased and Semantic-Aware Framework for Generalized Relation Discovery [12.716874398564482]
Generalized Relation Discovery (GRD) aims to identify unlabeled instances in existing pre-defined relations or discover novel relations. We propose a novel framework, SFGRD, for this task by learning from semi-factuals in two stages. SFGRD surpasses state-of-the-art models in terms of accuracy by 2.36% $sim$5.78% and cosine similarity by 32.19%$sim$ 84.45%.
arXiv Detail & Related papers (2024-01-12T02:38:55Z)
Class-level Structural Relation Modelling and Smoothing for Visual Representation Learning [12.247343963572732]
This paper presents a framework termed bfClass-level Structural Relation Modeling and Smoothing for Visual Representation Learning (CSRMS) It includes the Class-level Relation Modelling, Class-aware GraphGuided Sampling, and Graph-Guided Representation Learning modules. Experiments demonstrate the effectiveness of structured knowledge modelling for enhanced representation learning and show that CSRMS can be incorporated with any state-of-the-art visual representation learning models for performance gains.
arXiv Detail & Related papers (2023-08-08T09:03:46Z)
Contrastive Neighborhood Alignment [81.65103777329874]
We present Contrastive Neighborhood Alignment (CNA), a manifold learning approach to maintain the topology of learned features. The target model aims to mimic the local structure of the source representation space using a contrastive loss. CNA is illustrated in three scenarios: manifold learning, where the model maintains the local topology of the original data in a dimension-reduced space; model distillation, where a small student model is trained to mimic a larger teacher; and legacy model update, where an older model is replaced by a more powerful one.
arXiv Detail & Related papers (2022-01-06T04:58:31Z)
Unsupervised Part Discovery from Contrastive Reconstruction [90.88501867321573]
The goal of self-supervised visual representation learning is to learn strong, transferable image representations. We propose an unsupervised approach to object part discovery and segmentation. Our method yields semantic parts consistent across fine-grained but visually distinct categories.
arXiv Detail & Related papers (2021-11-11T17:59:42Z)
Explanation-Guided Training for Cross-Domain Few-Shot Classification [96.12873073444091]
Cross-domain few-shot classification task (CD-FSC) combines few-shot classification with the requirement to generalize across domains represented by datasets. We introduce a novel training approach for existing FSC models. We show that explanation-guided training effectively improves the model generalization.
arXiv Detail & Related papers (2020-07-17T07:28:08Z)
Fine-Grained 3D Shape Classification with Hierarchical Part-View Attentions [70.0171362989609]
We propose a novel fine-grained 3D shape classification method named FG3D-Net to capture the fine-grained local details of 3D shapes from multiple rendered views. Our results under the fine-grained 3D shape dataset show that our method outperforms other state-of-the-art methods.
arXiv Detail & Related papers (2020-05-26T06:53:19Z)

This list is automatically generated from the titles and abstracts of the papers in this site.