3D_DEN: Open-ended 3D Object Recognition using Dynamically Expandable
Networks
- URL: http://arxiv.org/abs/2009.07213v2
- Date: Mon, 15 Mar 2021 19:41:06 GMT
- Title: 3D_DEN: Open-ended 3D Object Recognition using Dynamically Expandable
Networks
- Authors: Sudhakaran Jain and Hamidreza Kasaei
- Abstract summary: We propose a new deep transfer learning approach based on a dynamic architectural method to make robots capable of open-ended learning about new 3D object categories.
Experimental results showed that the proposed model outperformed state-of-the-art approaches with regards to accuracy and also substantially minimizes computational overhead.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Service robots, in general, have to work independently and adapt to the
dynamic changes happening in the environment in real-time. One important aspect
in such scenarios is to continually learn to recognize newer object categories
when they become available. This combines two main research problems namely
continual learning and 3D object recognition. Most of the existing research
approaches include the use of deep Convolutional Neural Networks (CNNs)
focusing on image datasets. A modified approach might be needed for continually
learning 3D object categories. A major concern in using CNNs is the problem of
catastrophic forgetting when a model tries to learn a new task. Despite various
proposed solutions to mitigate this problem, there still exist some downsides
of such solutions, e.g., computational complexity, especially when learning
substantial number of tasks. These downsides can pose major problems in robotic
scenarios where real-time response plays an essential role. Towards addressing
this challenge, we propose a new deep transfer learning approach based on a
dynamic architectural method to make robots capable of open-ended learning
about new 3D object categories. Furthermore, we make sure that the mentioned
downsides are minimized to a great extent. Experimental results showed that the
proposed model outperformed state-of-the-art approaches with regards to
accuracy and also substantially minimizes computational overhead.
Related papers
- Learning with Noisy Ground Truth: From 2D Classification to 3D Reconstruction [13.314778587751588]
We propose a formal definition unify the analysis of LNGT LNGT in the context of different machine learning tasks.
We provide in-depth analysis on memorization effect and insightful discussion about potential future research opportunities.
arXiv Detail & Related papers (2024-06-23T02:21:48Z) - FILP-3D: Enhancing 3D Few-shot Class-incremental Learning with
Pre-trained Vision-Language Models [62.663113296987085]
Few-shot class-incremental learning aims to mitigate the catastrophic forgetting issue when a model is incrementally trained on limited data.
We introduce two novel components: the Redundant Feature Eliminator (RFE) and the Spatial Noise Compensator (SNC)
Considering the imbalance in existing 3D datasets, we also propose new evaluation metrics that offer a more nuanced assessment of a 3D FSCIL model.
arXiv Detail & Related papers (2023-12-28T14:52:07Z) - Graphical Object-Centric Actor-Critic [55.2480439325792]
We propose a novel object-centric reinforcement learning algorithm combining actor-critic and model-based approaches.
We use a transformer encoder to extract object representations and graph neural networks to approximate the dynamics of an environment.
Our algorithm performs better in a visually complex 3D robotic environment and a 2D environment with compositional structure than the state-of-the-art model-free actor-critic algorithm.
arXiv Detail & Related papers (2023-10-26T06:05:12Z) - InOR-Net: Incremental 3D Object Recognition Network for Point Cloud
Representation [51.121731449575776]
We develop a novel Incremental 3D Object Recognition Network (i.e., InOR-Net) to recognize new classes of 3D objects continuously.
Specifically, a category-guided geometric reasoning is proposed to reason local geometric structures with distinctive 3D characteristics of each class.
We then propose a novel critic-induced geometric attention mechanism to distinguish which 3D geometric characteristics within each class are beneficial to overcome the catastrophic forgetting on old classes of 3D objects.
arXiv Detail & Related papers (2023-02-20T10:30:16Z) - AdaAfford: Learning to Adapt Manipulation Affordance for 3D Articulated
Objects via Few-shot Interactions [13.802675708793014]
Perceiving and interacting with 3D articulated objects, such as cabinets, doors, and faucets, pose particular challenges for future home-assistant robots.
We propose a novel framework, named AdaAfford, that learns to perform very few test-time interactions for quickly adapting the affordance priors to more accurate instance-specific posteriors.
arXiv Detail & Related papers (2021-12-01T03:00:05Z) - Lifelong 3D Object Recognition and Grasp Synthesis Using Dual Memory
Recurrent Self-Organization Networks [0.0]
Humans learn to recognize and manipulate new objects in lifelong settings without forgetting the previously gained knowledge.
In most conventional deep neural networks, this is not possible due to the problem of catastrophic forgetting.
We propose a hybrid model architecture consisting of a dual-memory recurrent neural network and an autoencoder to tackle object recognition and grasping simultaneously.
arXiv Detail & Related papers (2021-09-23T11:14:13Z) - RandomRooms: Unsupervised Pre-training from Synthetic Shapes and
Randomized Layouts for 3D Object Detection [138.2892824662943]
A promising solution is to make better use of the synthetic dataset, which consists of CAD object models, to boost the learning on real datasets.
Recent work on 3D pre-training exhibits failure when transfer features learned on synthetic objects to other real-world applications.
In this work, we put forward a new method called RandomRooms to accomplish this objective.
arXiv Detail & Related papers (2021-08-17T17:56:12Z) - Learning without Forgetting for 3D Point Cloud Objects [7.761414660999872]
When we fine-tune a well-trained deep learning model for a new set of classes, the network learns new concepts but gradually forgets the knowledge of old training.
We investigate knowledge distillation techniques on 3D data to reduce catastrophic forgetting of the previous training.
We observe that exploring the interrelation of old and new knowledge during training helps to learn new concepts without forgetting old ones.
arXiv Detail & Related papers (2021-06-27T16:39:39Z) - Unsupervised Learning of 3D Object Categories from Videos in the Wild [75.09720013151247]
We focus on learning a model from multiple views of a large collection of object instances.
We propose a new neural network design, called warp-conditioned ray embedding (WCR), which significantly improves reconstruction.
Our evaluation demonstrates performance improvements over several deep monocular reconstruction baselines on existing benchmarks.
arXiv Detail & Related papers (2021-03-30T17:57:01Z) - Secrets of 3D Implicit Object Shape Reconstruction in the Wild [92.5554695397653]
Reconstructing high-fidelity 3D objects from sparse, partial observation is crucial for various applications in computer vision, robotics, and graphics.
Recent neural implicit modeling methods show promising results on synthetic or dense datasets.
But, they perform poorly on real-world data that is sparse and noisy.
This paper analyzes the root cause of such deficient performance of a popular neural implicit model.
arXiv Detail & Related papers (2021-01-18T03:24:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.