InOR-Net: Incremental 3D Object Recognition Network for Point Cloud
Representation
- URL: http://arxiv.org/abs/2302.09886v1
- Date: Mon, 20 Feb 2023 10:30:16 GMT
- Title: InOR-Net: Incremental 3D Object Recognition Network for Point Cloud
Representation
- Authors: Jiahua Dong, Yang Cong, Gan Sun, Lixu Wang, Lingjuan Lyu, Jun Li, and
Ender Konukoglu
- Abstract summary: We develop a novel Incremental 3D Object Recognition Network (i.e., InOR-Net) to recognize new classes of 3D objects continuously.
Specifically, a category-guided geometric reasoning is proposed to reason local geometric structures with distinctive 3D characteristics of each class.
We then propose a novel critic-induced geometric attention mechanism to distinguish which 3D geometric characteristics within each class are beneficial to overcome the catastrophic forgetting on old classes of 3D objects.
- Score: 51.121731449575776
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: 3D object recognition has successfully become an appealing research topic in
the real-world. However, most existing recognition models unreasonably assume
that the categories of 3D objects cannot change over time in the real-world.
This unrealistic assumption may result in significant performance degradation
for them to learn new classes of 3D objects consecutively, due to the
catastrophic forgetting on old learned classes. Moreover, they cannot explore
which 3D geometric characteristics are essential to alleviate the catastrophic
forgetting on old classes of 3D objects. To tackle the above challenges, we
develop a novel Incremental 3D Object Recognition Network (i.e., InOR-Net),
which could recognize new classes of 3D objects continuously via overcoming the
catastrophic forgetting on old classes. Specifically, a category-guided
geometric reasoning is proposed to reason local geometric structures with
distinctive 3D characteristics of each class by leveraging intrinsic category
information. We then propose a novel critic-induced geometric attention
mechanism to distinguish which 3D geometric characteristics within each class
are beneficial to overcome the catastrophic forgetting on old classes of 3D
objects, while preventing the negative influence of useless 3D characteristics.
In addition, a dual adaptive fairness compensations strategy is designed to
overcome the forgetting brought by class imbalance, by compensating biased
weights and predictions of the classifier. Comparison experiments verify the
state-of-the-art performance of the proposed InOR-Net model on several public
point cloud datasets.
Related papers
- Collaborative Novel Object Discovery and Box-Guided Cross-Modal Alignment for Open-Vocabulary 3D Object Detection [34.91703960513125]
CoDAv2 is a unified framework designed to tackle both the localization and classification of novel 3D objects.
CoDAv2 outperforms the best-performing method by a large margin.
Source code and pre-trained models are available at the GitHub project page.
arXiv Detail & Related papers (2024-06-02T18:32:37Z) - FILP-3D: Enhancing 3D Few-shot Class-incremental Learning with
Pre-trained Vision-Language Models [62.663113296987085]
Few-shot class-incremental learning aims to mitigate the catastrophic forgetting issue when a model is incrementally trained on limited data.
We introduce two novel components: the Redundant Feature Eliminator (RFE) and the Spatial Noise Compensator (SNC)
Considering the imbalance in existing 3D datasets, we also propose new evaluation metrics that offer a more nuanced assessment of a 3D FSCIL model.
arXiv Detail & Related papers (2023-12-28T14:52:07Z) - Uncertainty-aware 3D Object-Level Mapping with Deep Shape Priors [15.34487368683311]
We propose a framework that can reconstruct high-quality object-level maps for unknown objects.
Our approach takes multiple RGB-D images as input and outputs dense 3D shapes and 9-DoF poses for detected objects.
We derive a probabilistic formulation that propagates shape and pose uncertainty through two novel loss functions.
arXiv Detail & Related papers (2023-09-17T00:48:19Z) - Large-Vocabulary 3D Diffusion Model with Transformer [57.076986347047]
We introduce a diffusion-based feed-forward framework for synthesizing massive categories of real-world 3D objects with a single generative model.
We propose a novel triplane-based 3D-aware Diffusion model with TransFormer, DiffTF, for handling challenges via three aspects.
Experiments on ShapeNet and OmniObject3D convincingly demonstrate that a single DiffTF model achieves state-of-the-art large-vocabulary 3D object generation performance.
arXiv Detail & Related papers (2023-09-14T17:59:53Z) - Single-view 3D Mesh Reconstruction for Seen and Unseen Categories [69.29406107513621]
Single-view 3D Mesh Reconstruction is a fundamental computer vision task that aims at recovering 3D shapes from single-view RGB images.
This paper tackles Single-view 3D Mesh Reconstruction, to study the model generalization on unseen categories.
We propose an end-to-end two-stage network, GenMesh, to break the category boundaries in reconstruction.
arXiv Detail & Related papers (2022-08-04T14:13:35Z) - I3DOL: Incremental 3D Object Learning without Catastrophic Forgetting [38.7610646073842]
I3DOL is first exploration to learn new classes of 3D object continually.
An adaptive-geometric centroid module is designed to construct discriminative local geometric structures.
A geometric-aware attention mechanism is developed to quantify the contributions of local geometric structures.
arXiv Detail & Related papers (2020-12-16T15:17:51Z) - Canonical 3D Deformer Maps: Unifying parametric and non-parametric
methods for dense weakly-supervised category reconstruction [79.98689027127855]
We propose a new representation of the 3D shape of common object categories that can be learned from a collection of 2D images of independent objects.
Our method builds in a novel way on concepts from parametric deformation models, non-parametric 3D reconstruction, and canonical embeddings.
It achieves state-of-the-art results in dense 3D reconstruction on public in-the-wild datasets of faces, cars, and birds.
arXiv Detail & Related papers (2020-08-28T15:44:05Z) - Fine-Grained 3D Shape Classification with Hierarchical Part-View
Attentions [70.0171362989609]
We propose a novel fine-grained 3D shape classification method named FG3D-Net to capture the fine-grained local details of 3D shapes from multiple rendered views.
Our results under the fine-grained 3D shape dataset show that our method outperforms other state-of-the-art methods.
arXiv Detail & Related papers (2020-05-26T06:53:19Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.