A Novel Neural Network Training Method for Autonomous Driving Using
Semi-Pseudo-Labels and 3D Data Augmentations
- URL: http://arxiv.org/abs/2207.09869v1
- Date: Wed, 20 Jul 2022 13:04:08 GMT
- Title: A Novel Neural Network Training Method for Autonomous Driving Using
Semi-Pseudo-Labels and 3D Data Augmentations
- Authors: Tamas Matuszka, Daniel Kozma
- Abstract summary: Training neural networks to perform 3D object detection for autonomous driving requires a large amount of diverse annotated data.
We have designed a convolutional neural network for 3D object detection which can significantly increase the detection range.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Training neural networks to perform 3D object detection for autonomous
driving requires a large amount of diverse annotated data. However, obtaining
training data with sufficient quality and quantity is expensive and sometimes
impossible due to human and sensor constraints. Therefore, a novel solution is
needed for extending current training methods to overcome this limitation and
enable accurate 3D object detection. Our solution for the above-mentioned
problem combines semi-pseudo-labeling and novel 3D augmentations. For
demonstrating the applicability of the proposed method, we have designed a
convolutional neural network for 3D object detection which can significantly
increase the detection range in comparison with the training data distribution.
Related papers
- STONE: A Submodular Optimization Framework for Active 3D Object Detection [20.54906045954377]
Key requirement for training an accurate 3D object detector is the availability of a large amount of LiDAR-based point cloud data.
This paper proposes a unified active 3D object detection framework, for greatly reducing the labeling cost of training 3D object detectors.
arXiv Detail & Related papers (2024-10-04T20:45:33Z) - CatFree3D: Category-agnostic 3D Object Detection with Diffusion [63.75470913278591]
We introduce a novel pipeline that decouples 3D detection from 2D detection and depth prediction.
We also introduce the Normalised Hungarian Distance (NHD) metric for an accurate evaluation of 3D detection results.
arXiv Detail & Related papers (2024-08-22T22:05:57Z) - Generalized Few-Shot 3D Object Detection of LiDAR Point Cloud for
Autonomous Driving [91.39625612027386]
We propose a novel task, called generalized few-shot 3D object detection, where we have a large amount of training data for common (base) objects, but only a few data for rare (novel) classes.
Specifically, we analyze in-depth differences between images and point clouds, and then present a practical principle for the few-shot setting in the 3D LiDAR dataset.
To solve this task, we propose an incremental fine-tuning method to extend existing 3D detection models to recognize both common and rare objects.
arXiv Detail & Related papers (2023-02-08T07:11:36Z) - Simplified Learning of CAD Features Leveraging a Deep Residual
Autoencoder [3.567248644184455]
In computer vision, deep residual neural networks like EfficientNet have set new standards in terms of robustness and accuracy.
One key problem underlying the training of deep neural networks is the immanent lack of a sufficient amount of training data.
We present a deep residual 3D autoencoder based on the EfficientNet architecture, intended for transfer learning tasks related to 3D CAD model assessment.
arXiv Detail & Related papers (2022-02-21T10:27:55Z) - Semi-supervised 3D Object Detection via Temporal Graph Neural Networks [17.90796183565084]
3D object detection plays an important role in autonomous driving and other robotics applications.
We propose leveraging large amounts of unlabeled point cloud videos by semi-supervised learning of 3D object detectors.
Our method achieves state-of-the-art detection performance on the challenging nuScenes and H3D benchmarks.
arXiv Detail & Related papers (2022-02-01T02:06:54Z) - Advancing 3D Medical Image Analysis with Variable Dimension Transform
based Supervised 3D Pre-training [45.90045513731704]
This paper revisits an innovative yet simple fully-supervised 3D network pre-training framework.
With a redesigned 3D network architecture, reformulated natural images are used to address the problem of data scarcity.
Comprehensive experiments on four benchmark datasets demonstrate that the proposed pre-trained models can effectively accelerate convergence.
arXiv Detail & Related papers (2022-01-05T03:11:21Z) - Learnable Online Graph Representations for 3D Multi-Object Tracking [156.58876381318402]
We propose a unified and learning based approach to the 3D MOT problem.
We employ a Neural Message Passing network for data association that is fully trainable.
We show the merit of the proposed approach on the publicly available nuScenes dataset by achieving state-of-the-art performance of 65.6% AMOTA and 58% fewer ID-switches.
arXiv Detail & Related papers (2021-04-23T17:59:28Z) - ST3D: Self-training for Unsupervised Domain Adaptation on 3D
ObjectDetection [78.71826145162092]
We present a new domain adaptive self-training pipeline, named ST3D, for unsupervised domain adaptation on 3D object detection from point clouds.
Our ST3D achieves state-of-the-art performance on all evaluated datasets and even surpasses fully supervised results on KITTI 3D object detection benchmark.
arXiv Detail & Related papers (2021-03-09T10:51:24Z) - Bridging the Reality Gap for Pose Estimation Networks using Sensor-Based
Domain Randomization [1.4290119665435117]
Methods trained on synthetic data use 2D images, as domain randomization in 2D is more developed.
Our method integrates the 3D data into the network to increase the accuracy of the pose estimation.
Experiments on three large pose estimation benchmarks show that the presented method outperforms previous methods trained on synthetic data.
arXiv Detail & Related papers (2020-11-17T09:12:11Z) - Manual-Label Free 3D Detection via An Open-Source Simulator [50.74299948748722]
We propose a manual-label free 3D detection algorithm that leverages the CARLA simulator to generate a large amount of self-labeled training samples.
Domain Adaptive VoxelNet (DA-VoxelNet) can cross the distribution gap from the synthetic data to the real scenario.
Experimental results show that the proposed unsupervised DA 3D detector can achieve 76.66% and 56.64% mAP on KITTI evaluation set.
arXiv Detail & Related papers (2020-11-16T08:29:01Z) - Reinforced Axial Refinement Network for Monocular 3D Object Detection [160.34246529816085]
Monocular 3D object detection aims to extract the 3D position and properties of objects from a 2D input image.
Conventional approaches sample 3D bounding boxes from the space and infer the relationship between the target object and each of them, however, the probability of effective samples is relatively small in the 3D space.
We propose to start with an initial prediction and refine it gradually towards the ground truth, with only one 3d parameter changed in each step.
This requires designing a policy which gets a reward after several steps, and thus we adopt reinforcement learning to optimize it.
arXiv Detail & Related papers (2020-08-31T17:10:48Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.