Affordance detection with Dynamic-Tree Capsule Networks
- URL: http://arxiv.org/abs/2211.05200v1
- Date: Wed, 9 Nov 2022 21:14:08 GMT
- Title: Affordance detection with Dynamic-Tree Capsule Networks
- Authors: Antonio Rodr\'iguez-S\'anchez, Simon Haller-Seeber, David Peer, Chris
Engelhardt, Jakob Mittelberger, Matteo Saveriano
- Abstract summary: Affordance detection from visual input is a fundamental step in autonomous robotic manipulation.
We introduce the first affordance detection network based on dynamic tree-structured capsules for sparse 3D point clouds.
Our algorithm is superior to current affordance detection methods when faced with grasping previously unseen objects.
- Score: 5.847547503155588
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Affordance detection from visual input is a fundamental step in autonomous
robotic manipulation. Existing solutions to the problem of affordance detection
rely on convolutional neural networks. However, these networks do not consider
the spatial arrangement of the input data and miss parts-to-whole
relationships. Therefore, they fall short when confronted with novel,
previously unseen object instances or new viewpoints. One solution to overcome
such limitations can be to resort to capsule networks. In this paper, we
introduce the first affordance detection network based on dynamic
tree-structured capsules for sparse 3D point clouds. We show that our
capsule-based network outperforms current state-of-the-art models on viewpoint
invariance and parts-segmentation of new object instances through a novel
dataset we only used for evaluation and it is publicly available from
github.com/gipfelen/DTCG-Net. In the experimental evaluation we will show that
our algorithm is superior to current affordance detection methods when faced
with grasping previously unseen objects thanks to our Capsule Network enforcing
a parts-to-whole representation.
Related papers
- Investigating the Gestalt Principle of Closure in Deep Convolutional Neural Networks [4.406699323036466]
This study investigates the principle of closure in convolutional neural networks.
We conduct experiments using simple visual stimuli with progressively removed edge sections.
We evaluate well-known networks on their ability to classify incomplete polygons.
arXiv Detail & Related papers (2024-11-01T14:36:21Z) - Spatial-Temporal Graph Enhanced DETR Towards Multi-Frame 3D Object Detection [54.041049052843604]
We present STEMD, a novel end-to-end framework that enhances the DETR-like paradigm for multi-frame 3D object detection.
First, to model the inter-object spatial interaction and complex temporal dependencies, we introduce the spatial-temporal graph attention network.
Finally, it poses a challenge for the network to distinguish between the positive query and other highly similar queries that are not the best match.
arXiv Detail & Related papers (2023-07-01T13:53:14Z) - ALSO: Automotive Lidar Self-supervision by Occupancy estimation [70.70557577874155]
We propose a new self-supervised method for pre-training the backbone of deep perception models operating on point clouds.
The core idea is to train the model on a pretext task which is the reconstruction of the surface on which the 3D points are sampled.
The intuition is that if the network is able to reconstruct the scene surface, given only sparse input points, then it probably also captures some fragments of semantic information.
arXiv Detail & Related papers (2022-12-12T13:10:19Z) - AGO-Net: Association-Guided 3D Point Cloud Object Detection Network [86.10213302724085]
We propose a novel 3D detection framework that associates intact features for objects via domain adaptation.
We achieve new state-of-the-art performance on the KITTI 3D detection benchmark in both accuracy and speed.
arXiv Detail & Related papers (2022-08-24T16:54:38Z) - Improved Orientation Estimation and Detection with Hybrid Object
Detection Networks for Automotive Radar [1.53934570513443]
We present novel hybrid architectures that combine grid- and point-based processing to improve radar-based object detection networks.
We show that a point-based model can extract neighborhood features, leveraging the exact relative positions of points, before grid rendering.
This has significant benefits for a following convolutional detection backbone.
arXiv Detail & Related papers (2022-05-03T06:29:03Z) - CPFN: Cascaded Primitive Fitting Networks for High-Resolution Point
Clouds [51.47100091540298]
We present Cascaded Primitive Fitting Networks (CPFN) that relies on an adaptive patch sampling network to assemble detection results of global and local primitive detection networks.
CPFN improves the state-of-the-art SPFN performance by 13-14% on high-resolution point cloud datasets and specifically improves the detection of fine-scale primitives by 20-22%.
arXiv Detail & Related papers (2021-08-31T23:27:33Z) - Deformable Capsules for Object Detection [3.702343116848637]
We introduce a new family of capsule networks, deformable capsules (textitDeformCaps), to address a very important problem in computer vision: object detection.
We demonstrate that the proposed methods efficiently scale up to create the first-ever capsule network for object detection in the literature.
arXiv Detail & Related papers (2021-04-11T15:36:30Z) - PC-RGNN: Point Cloud Completion and Graph Neural Network for 3D Object
Detection [57.49788100647103]
LiDAR-based 3D object detection is an important task for autonomous driving.
Current approaches suffer from sparse and partial point clouds of distant and occluded objects.
In this paper, we propose a novel two-stage approach, namely PC-RGNN, dealing with such challenges by two specific solutions.
arXiv Detail & Related papers (2020-12-18T18:06:43Z) - Recalibration of Neural Networks for Point Cloud Analysis [3.7814216736076434]
We introduce re-calibration modules on deep neural networks for 3D point clouds.
We demonstrate the benefit and versatility of our proposed modules by incorporating them into three state-of-the-art networks for 3D point cloud analysis.
In the second set of experiments, we investigate the benefits of re-calibration blocks on Alzheimer's Disease diagnosis.
arXiv Detail & Related papers (2020-11-25T17:14:34Z) - Ventral-Dorsal Neural Networks: Object Detection via Selective Attention [51.79577908317031]
We propose a new framework called Ventral-Dorsal Networks (VDNets)
Inspired by the structure of the human visual system, we propose the integration of a "Ventral Network" and a "Dorsal Network"
Our experimental results reveal that the proposed method outperforms state-of-the-art object detection approaches.
arXiv Detail & Related papers (2020-05-15T23:57:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.