Quantifying Data Augmentation for LiDAR based 3D Object Detection
- URL: http://arxiv.org/abs/2004.01643v2
- Date: Fri, 29 Jul 2022 11:43:02 GMT
- Title: Quantifying Data Augmentation for LiDAR based 3D Object Detection
- Authors: Martin Hahner, Dengxin Dai, Alexander Liniger, and Luc Van Gool
- Abstract summary: In this work, we shed light on different data augmentation techniques commonly used in Light Detection and Ranging (LiDAR) based 3D Object Detection.
We investigate a variety of global and local augmentation techniques, where global augmentation techniques are applied to the entire point cloud of a scene and local augmentation techniques are only applied to points belonging to individual objects in the scene.
Our findings show that both types of data augmentation can lead to performance increases, but it also turns out, that some augmentation techniques, such as individual object translation, for example, can be counterproductive and can hurt the overall performance.
- Score: 139.64869289514525
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this work, we shed light on different data augmentation techniques
commonly used in Light Detection and Ranging (LiDAR) based 3D Object Detection.
For the bulk of our experiments, we utilize the well known PointPillars
pipeline and the well established KITTI dataset. We investigate a variety of
global and local augmentation techniques, where global augmentation techniques
are applied to the entire point cloud of a scene and local augmentation
techniques are only applied to points belonging to individual objects in the
scene. Our findings show that both types of data augmentation can lead to
performance increases, but it also turns out, that some augmentation
techniques, such as individual object translation, for example, can be
counterproductive and can hurt the overall performance. We show that these
findings transfer and generalize well to other state of the art 3D Object
Detection methods and the challenging STF dataset. On the KITTI dataset we can
gain up to 1.5% and on the STF dataset up to 1.7% in 3D mAP on the moderate car
class.
Related papers
- Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding [50.448520056844885]
We propose a generative Bayesian network to produce diverse synthetic scenes with real-world patterns.
A series of experiments robustly display our method's consistent superiority over existing state-of-the-art pre-training approaches.
arXiv Detail & Related papers (2024-06-17T07:43:53Z) - MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations [55.022519020409405]
This paper builds the first largest ever multi-modal 3D scene dataset and benchmark with hierarchical grounded language annotations, MMScan.
The resulting multi-modal 3D dataset encompasses 1.4M meta-annotated captions on 109k objects and 7.7k regions as well as over 3.04M diverse samples for 3D visual grounding and question-answering benchmarks.
arXiv Detail & Related papers (2024-06-13T17:59:30Z) - MDT3D: Multi-Dataset Training for LiDAR 3D Object Detection
Generalization [3.8243923744440926]
3D object detection models trained on a source dataset with a specific point distribution have shown difficulties in generalizing to unseen datasets.
We leverage the information available from several annotated source datasets with our Multi-Dataset Training for 3D Object Detection (MDT3D) method.
We show how we managed the mix of datasets during training and finally introduce a new cross-dataset augmentation method: cross-dataset object injection.
arXiv Detail & Related papers (2023-08-02T08:20:00Z) - Learning Object-level Point Augmentor for Semi-supervised 3D Object
Detection [85.170578641966]
We propose an object-level point augmentor (OPA) that performs local transformations for semi-supervised 3D object detection.
In this way, the resultant augmentor is derived to emphasize object instances rather than irrelevant backgrounds.
Experiments on the ScanNet and SUN RGB-D datasets show that the proposed OPA performs favorably against the state-of-the-art methods.
arXiv Detail & Related papers (2022-12-19T06:56:14Z) - LidarAugment: Searching for Scalable 3D LiDAR Data Augmentations [55.45435708426761]
LidarAugment is a search-based data augmentation strategy for 3D object detection.
We show LidarAugment can be customized for different model architectures.
It consistently improves convolution-based UPillars/StarNet/RSN and transformer-based SWFormer.
arXiv Detail & Related papers (2022-10-24T18:00:04Z) - Real3D-Aug: Point Cloud Augmentation by Placing Real Objects with
Occlusion Handling for 3D Detection and Segmentation [0.0]
We propose a data augmentation method that takes advantage of already annotated data multiple times.
We propose an augmentation framework that reuses real data, automatically finds suitable placements in the scene to be augmented.
The pipeline proves competitive in training top-performing models for 3D object detection and semantic segmentation.
arXiv Detail & Related papers (2022-06-15T16:25:30Z) - Pattern-Aware Data Augmentation for LiDAR 3D Object Detection [7.394029879643516]
We propose pattern-aware ground truth sampling, a data augmentation technique that downsamples an object's point cloud based on the LiDAR's characteristics.
We improve the performance of PV-RCNN on the car class by more than 0.7 percent on the KITTI validation split at distances greater than 25 m.
arXiv Detail & Related papers (2021-11-30T19:14:47Z) - Part-Aware Data Augmentation for 3D Object Detection in Point Cloud [33.59724834383291]
3D label has more sophisticated and rich structural information than the 2D label, so it enables more diverse and effective data augmentation.
We propose part-aware data augmentation (PA-AUG) that can better utilize rich information of 3D label.
We show that PA-AUG not only increases performance for a given dataset but also is robust to corrupted data.
arXiv Detail & Related papers (2020-07-27T08:47:19Z) - Improving 3D Object Detection through Progressive Population Based
Augmentation [91.56261177665762]
We present the first attempt to automate the design of data augmentation policies for 3D object detection.
We introduce the Progressive Population Based Augmentation (PPBA) algorithm, which learns to optimize augmentation strategies by narrowing down the search space and adopting the best parameters discovered in previous iterations.
We find that PPBA may be up to 10x more data efficient than baseline 3D detection models without augmentation, highlighting that 3D detection models may achieve competitive accuracy with far fewer labeled examples.
arXiv Detail & Related papers (2020-04-02T05:57:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.