Angle Based Feature Learning in GNN for 3D Object Detection using Point
Cloud
- URL: http://arxiv.org/abs/2108.00780v1
- Date: Mon, 2 Aug 2021 10:56:02 GMT
- Title: Angle Based Feature Learning in GNN for 3D Object Detection using Point
Cloud
- Authors: Md Afzal Ansari, Md Meraz, Pavan Chakraborty and Mohammed Javed
- Abstract summary: We present new feature encoding methods for Detection of 3D objects in point clouds.
We used a graph neural network (GNN) for Detection of 3D objects namely cars, pedestrians, and cyclists.
- Score: 4.3012765978447565
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, we present new feature encoding methods for Detection of 3D
objects in point clouds. We used a graph neural network (GNN) for Detection of
3D objects namely cars, pedestrians, and cyclists. Feature encoding is one of
the important steps in Detection of 3D objects. The dataset used is point cloud
data which is irregular and unstructured and it needs to be encoded in such a
way that ensures better feature encapsulation. Earlier works have used relative
distance as one of the methods to encode the features. These methods are not
resistant to rotation variance problems in Graph Neural Networks. We have
included angular-based measures while performing feature encoding in graph
neural networks. Along with that, we have performed a comparison between other
methods like Absolute, Relative, Euclidean distances, and a combination of the
Angle and Relative methods. The model is trained and evaluated on the subset of
the KITTI object detection benchmark dataset under resource constraints. Our
results demonstrate that a combination of angle measures and relative distance
has performed better than other methods. In comparison to the baseline
method(relative), it achieved better performance. We also performed time
analysis of various feature encoding methods.
Related papers
- Inverse Neural Rendering for Explainable Multi-Object Tracking [35.072142773300655]
We recast 3D multi-object tracking from RGB cameras as an emphInverse Rendering (IR) problem.
We optimize an image loss over generative latent spaces that inherently disentangle shape and appearance properties.
We validate the generalization and scaling capabilities of our method by learning the generative prior exclusively from synthetic data.
arXiv Detail & Related papers (2024-04-18T17:37:53Z) - 3D Video Object Detection with Learnable Object-Centric Global
Optimization [65.68977894460222]
Correspondence-based optimization is the cornerstone for 3D scene reconstruction but is less studied in 3D video object detection.
We propose BA-Det, an end-to-end optimizable object detector with object-centric temporal correspondence learning and featuremetric object bundle adjustment.
arXiv Detail & Related papers (2023-03-27T17:39:39Z) - ALSO: Automotive Lidar Self-supervision by Occupancy estimation [70.70557577874155]
We propose a new self-supervised method for pre-training the backbone of deep perception models operating on point clouds.
The core idea is to train the model on a pretext task which is the reconstruction of the surface on which the 3D points are sampled.
The intuition is that if the network is able to reconstruct the scene surface, given only sparse input points, then it probably also captures some fragments of semantic information.
arXiv Detail & Related papers (2022-12-12T13:10:19Z) - AGO-Net: Association-Guided 3D Point Cloud Object Detection Network [86.10213302724085]
We propose a novel 3D detection framework that associates intact features for objects via domain adaptation.
We achieve new state-of-the-art performance on the KITTI 3D detection benchmark in both accuracy and speed.
arXiv Detail & Related papers (2022-08-24T16:54:38Z) - Learnable Online Graph Representations for 3D Multi-Object Tracking [156.58876381318402]
We propose a unified and learning based approach to the 3D MOT problem.
We employ a Neural Message Passing network for data association that is fully trainable.
We show the merit of the proposed approach on the publicly available nuScenes dataset by achieving state-of-the-art performance of 65.6% AMOTA and 58% fewer ID-switches.
arXiv Detail & Related papers (2021-04-23T17:59:28Z) - Dynamic Edge Weights in Graph Neural Networks for 3D Object Detection [0.0]
We propose an attention based feature aggregation technique in graph neural network (GNN) for detecting objects in LiDAR scan.
In each layer of the GNN, apart from the linear transformation which maps the per node input features to the corresponding higher level features, a per node masked attention is also performed.
The experiments on KITTI dataset show that our method yields comparable results for 3D object detection.
arXiv Detail & Related papers (2020-09-17T12:56:17Z) - Factor Graph based 3D Multi-Object Tracking in Point Clouds [8.411514688735183]
We propose a novel optimization-based approach that does not rely on explicit and fixed assignments.
We demonstrate its performance on the real world KITTI tracking dataset and achieve better results than many state-of-the-art algorithms.
arXiv Detail & Related papers (2020-08-12T13:34:46Z) - GNN3DMOT: Graph Neural Network for 3D Multi-Object Tracking with
Multi-Feature Learning [30.72094639797806]
3D Multi-object tracking (MOT) is crucial to autonomous systems.
We propose two techniques to improve the discriminative feature learning for MOT.
Our proposed method achieves state-of-the-art performance on KITTI and nuScenes 3D MOT benchmarks.
arXiv Detail & Related papers (2020-06-12T17:08:14Z) - DOPS: Learning to Detect 3D Objects and Predict their 3D Shapes [54.239416488865565]
We propose a fast single-stage 3D object detection method for LIDAR data.
The core novelty of our method is a fast, single-pass architecture that both detects objects in 3D and estimates their shapes.
We find that our proposed method achieves state-of-the-art results by 5% on object detection in ScanNet scenes, and it gets top results by 3.4% in the Open dataset.
arXiv Detail & Related papers (2020-04-02T17:48:50Z) - D3Feat: Joint Learning of Dense Detection and Description of 3D Local
Features [51.04841465193678]
We leverage a 3D fully convolutional network for 3D point clouds.
We propose a novel and practical learning mechanism that densely predicts both a detection score and a description feature for each 3D point.
Our method achieves state-of-the-art results in both indoor and outdoor scenarios.
arXiv Detail & Related papers (2020-03-06T12:51:09Z) - Point-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud [3.04585143845864]
We propose a graph neural network to detect objects from a LiDAR point cloud.
We encode the point cloud efficiently in a fixed radius near-neighbors graph.
In Point-GNN, we propose an auto-registration mechanism to reduce translation variance.
arXiv Detail & Related papers (2020-03-02T23:44:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.