Traffic Context Aware Data Augmentation for Rare Object Detection in
Autonomous Driving
- URL: http://arxiv.org/abs/2205.00376v1
- Date: Sun, 1 May 2022 01:45:00 GMT
- Title: Traffic Context Aware Data Augmentation for Rare Object Detection in
Autonomous Driving
- Authors: Naifan Li, Fan Song, Ying Zhang, Pengpeng Liang, Erkang Cheng
- Abstract summary: We propose a systematic study on simple Copy-Paste data augmentation for rare object detection in autonomous driving.
Specifically, local adaptive instance-level image transformation is introduced to generate realistic rare object masks.
We build a new dataset named NM10k consisting 10k training images, 4k validation images and the corresponding labels.
- Score: 5.037913689432052
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Detection of rare objects (e.g., traffic cones, traffic barrels and traffic
warning triangles) is an important perception task to improve the safety of
autonomous driving. Training of such models typically requires a large number
of annotated data which is expensive and time consuming to obtain. To address
the above problem, an emerging approach is to apply data augmentation to
automatically generate cost-free training samples. In this work, we propose a
systematic study on simple Copy-Paste data augmentation for rare object
detection in autonomous driving. Specifically, local adaptive instance-level
image transformation is introduced to generate realistic rare object masks from
source domain to the target domain. Moreover, traffic scene context is utilized
to guide the placement of masks of rare objects. To this end, our data
augmentation generates training data with high quality and realistic
characteristics by leveraging both local and global consistency. In addition,
we build a new dataset named NM10k consisting 10k training images, 4k
validation images and the corresponding labels with a diverse range of
scenarios in autonomous driving. Experiments on NM10k show that our method
achieves promising results on rare object detection. We also present a thorough
study to illustrate the effectiveness of our local-adaptive and global
constraints based Copy-Paste data augmentation for rare object detection. The
data, development kit and more information of NM10k dataset are available
online at: \url{https://nullmax-vision.github.io}.
Related papers
- Label-Efficient 3D Object Detection For Road-Side Units [10.663986706501188]
Collaborative perception can enhance the perception of autonomous vehicles via deep information fusion with intelligent roadside units (RSU)
The data-hungry nature of these methods creates a major hurdle for their real-world deployment, particularly due to the need for annotated RSU data.
We devise a label-efficient object detection method for RSU based on unsupervised object discovery.
arXiv Detail & Related papers (2024-04-09T12:29:16Z) - Transfer learning with generative models for object detection on limited datasets [1.4999444543328293]
In some fields, such as marine biology, it is necessary to have correctly labeled bounding boxes around each object.
We propose a transfer learning framework that is valid for a generic scenario.
Our results pave the way for new generative AI-based protocols for machine learning applications in various domains.
arXiv Detail & Related papers (2024-02-09T21:17:31Z) - Unsupervised Domain Adaptation for Self-Driving from Past Traversal
Features [69.47588461101925]
We propose a method to adapt 3D object detectors to new driving environments.
Our approach enhances LiDAR-based detection models using spatial quantized historical features.
Experiments on real-world datasets demonstrate significant improvements.
arXiv Detail & Related papers (2023-09-21T15:00:31Z) - Generalized Few-Shot 3D Object Detection of LiDAR Point Cloud for
Autonomous Driving [91.39625612027386]
We propose a novel task, called generalized few-shot 3D object detection, where we have a large amount of training data for common (base) objects, but only a few data for rare (novel) classes.
Specifically, we analyze in-depth differences between images and point clouds, and then present a practical principle for the few-shot setting in the 3D LiDAR dataset.
To solve this task, we propose an incremental fine-tuning method to extend existing 3D detection models to recognize both common and rare objects.
arXiv Detail & Related papers (2023-02-08T07:11:36Z) - SODA10M: Towards Large-Scale Object Detection Benchmark for Autonomous
Driving [94.11868795445798]
We release a Large-Scale Object Detection benchmark for Autonomous driving, named as SODA10M, containing 10 million unlabeled images and 20K images labeled with 6 representative object categories.
To improve diversity, the images are collected every ten seconds per frame within 32 different cities under different weather conditions, periods and location scenes.
We provide extensive experiments and deep analyses of existing supervised state-of-the-art detection models, popular self-supervised and semi-supervised approaches, and some insights about how to develop future models.
arXiv Detail & Related papers (2021-06-21T13:55:57Z) - One Million Scenes for Autonomous Driving: ONCE Dataset [91.94189514073354]
We introduce the ONCE dataset for 3D object detection in the autonomous driving scenario.
The data is selected from 144 driving hours, which is 20x longer than the largest 3D autonomous driving dataset available.
We reproduce and evaluate a variety of self-supervised and semi-supervised methods on the ONCE dataset.
arXiv Detail & Related papers (2021-06-21T12:28:08Z) - Fine-Grained Vehicle Perception via 3D Part-Guided Visual Data
Augmentation [77.60050239225086]
We propose an effective training data generation process by fitting a 3D car model with dynamic parts to vehicles in real images.
Our approach is fully automatic without any human interaction.
We present a multi-task network for VUS parsing and a multi-stream network for VHI parsing.
arXiv Detail & Related papers (2020-12-15T03:03:38Z) - Deep traffic light detection by overlaying synthetic context on
arbitrary natural images [49.592798832978296]
We propose a method to generate artificial traffic-related training data for deep traffic light detectors.
This data is generated using basic non-realistic computer graphics to blend fake traffic scenes on top of arbitrary image backgrounds.
It also tackles the intrinsic data imbalance problem in traffic light datasets, caused mainly by the low amount of samples of the yellow state.
arXiv Detail & Related papers (2020-11-07T19:57:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.