Improving 3D Object Detection through Progressive Population Based
Augmentation
- URL: http://arxiv.org/abs/2004.00831v2
- Date: Thu, 16 Jul 2020 19:07:15 GMT
- Title: Improving 3D Object Detection through Progressive Population Based
Augmentation
- Authors: Shuyang Cheng, Zhaoqi Leng, Ekin Dogus Cubuk, Barret Zoph, Chunyan
Bai, Jiquan Ngiam, Yang Song, Benjamin Caine, Vijay Vasudevan, Congcong Li,
Quoc V. Le, Jonathon Shlens, Dragomir Anguelov
- Abstract summary: We present the first attempt to automate the design of data augmentation policies for 3D object detection.
We introduce the Progressive Population Based Augmentation (PPBA) algorithm, which learns to optimize augmentation strategies by narrowing down the search space and adopting the best parameters discovered in previous iterations.
We find that PPBA may be up to 10x more data efficient than baseline 3D detection models without augmentation, highlighting that 3D detection models may achieve competitive accuracy with far fewer labeled examples.
- Score: 91.56261177665762
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Data augmentation has been widely adopted for object detection in 3D point
clouds. However, all previous related efforts have focused on manually
designing specific data augmentation methods for individual architectures. In
this work, we present the first attempt to automate the design of data
augmentation policies for 3D object detection. We introduce the Progressive
Population Based Augmentation (PPBA) algorithm, which learns to optimize
augmentation strategies by narrowing down the search space and adopting the
best parameters discovered in previous iterations. On the KITTI 3D detection
test set, PPBA improves the StarNet detector by substantial margins on the
moderate difficulty category of cars, pedestrians, and cyclists, outperforming
all current state-of-the-art single-stage detection models. Additional
experiments on the Waymo Open Dataset indicate that PPBA continues to
effectively improve the StarNet and PointPillars detectors on a 20x larger
dataset compared to KITTI. The magnitude of the improvements may be comparable
to advances in 3D perception architectures and the gains come without an
incurred cost at inference time. In subsequent experiments, we find that PPBA
may be up to 10x more data efficient than baseline 3D detection models without
augmentation, highlighting that 3D detection models may achieve competitive
accuracy with far fewer labeled examples.
Related papers
- Study of Dropout in PointPillars with 3D Object Detection [0.0]
3D object detection is critical for autonomous driving, leveraging deep learning techniques to interpret LiDAR data.
This study provides an analysis of enhancing the performance of PointPillars model under various dropout rates.
arXiv Detail & Related papers (2024-09-01T09:30:54Z) - 3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features [70.50665869806188]
3DiffTection is a state-of-the-art method for 3D object detection from single images.
We fine-tune a diffusion model to perform novel view synthesis conditioned on a single image.
We further train the model on target data with detection supervision.
arXiv Detail & Related papers (2023-11-07T23:46:41Z) - GOOD: General Optimization-based Fusion for 3D Object Detection via
LiDAR-Camera Object Candidates [10.534984939225014]
3D object detection serves as the core basis of the perception tasks in autonomous driving.
Good is a general optimization-based fusion framework that can achieve satisfying detection without training additional models.
Experiments on both nuScenes and KITTI datasets are carried out and the results show that GOOD outperforms by 9.1% on mAP score compared with PointPillars.
arXiv Detail & Related papers (2023-03-17T07:05:04Z) - LidarAugment: Searching for Scalable 3D LiDAR Data Augmentations [55.45435708426761]
LidarAugment is a search-based data augmentation strategy for 3D object detection.
We show LidarAugment can be customized for different model architectures.
It consistently improves convolution-based UPillars/StarNet/RSN and transformer-based SWFormer.
arXiv Detail & Related papers (2022-10-24T18:00:04Z) - Cost-Aware Evaluation and Model Scaling for LiDAR-Based 3D Object
Detection [34.34668878632354]
This work is to conduct a cost-aware evaluation of LiDAR-based 3D object detectors.
Specifically, we focus on SECOND, a simple grid-based one-stage detector.
We compare the family of scaled SECOND with recent 3D detection methods, such as Voxel R-CNN and PV-RCNN++.
arXiv Detail & Related papers (2022-05-02T18:16:17Z) - ST3D: Self-training for Unsupervised Domain Adaptation on 3D
ObjectDetection [78.71826145162092]
We present a new domain adaptive self-training pipeline, named ST3D, for unsupervised domain adaptation on 3D object detection from point clouds.
Our ST3D achieves state-of-the-art performance on all evaluated datasets and even surpasses fully supervised results on KITTI 3D object detection benchmark.
arXiv Detail & Related papers (2021-03-09T10:51:24Z) - Reinforced Axial Refinement Network for Monocular 3D Object Detection [160.34246529816085]
Monocular 3D object detection aims to extract the 3D position and properties of objects from a 2D input image.
Conventional approaches sample 3D bounding boxes from the space and infer the relationship between the target object and each of them, however, the probability of effective samples is relatively small in the 3D space.
We propose to start with an initial prediction and refine it gradually towards the ground truth, with only one 3d parameter changed in each step.
This requires designing a policy which gets a reward after several steps, and thus we adopt reinforcement learning to optimize it.
arXiv Detail & Related papers (2020-08-31T17:10:48Z) - InfoFocus: 3D Object Detection for Autonomous Driving with Dynamic
Information Modeling [65.47126868838836]
We propose a novel 3D object detection framework with dynamic information modeling.
Coarse predictions are generated in the first stage via a voxel-based region proposal network.
Experiments are conducted on the large-scale nuScenes 3D detection benchmark.
arXiv Detail & Related papers (2020-07-16T18:27:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.