Synthesizing Annotated Image and Video Data Using a Rendering-Based
Pipeline for Improved License Plate Recognition
- URL: http://arxiv.org/abs/2209.14448v1
- Date: Wed, 28 Sep 2022 22:11:58 GMT
- Title: Synthesizing Annotated Image and Video Data Using a Rendering-Based
Pipeline for Improved License Plate Recognition
- Authors: Andreas Spruck, Maximilane Gruber, Anatol Maier, Denise Moussa,
J\"urgen Seiler, Christian Riess, Andr\'e Kaup
- Abstract summary: An insufficient number of training samples is a common problem in neural network applications.
We propose a novel rendering-based pipeline for synthesizing annotated data sets.
The pipeline is capable of generating and annotating synthetic and partly-real image and video data in a fully automatic procedure.
- Score: 8.15545354580221
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: An insufficient number of training samples is a common problem in neural
network applications. While data augmentation methods require at least a
minimum number of samples, we propose a novel, rendering-based pipeline for
synthesizing annotated data sets. Our method does not modify existing samples
but synthesizes entirely new samples. The proposed rendering-based pipeline is
capable of generating and annotating synthetic and partly-real image and video
data in a fully automatic procedure. Moreover, the pipeline can aid the
acquisition of real data. The proposed pipeline is based on a rendering
process. This process generates synthetic data. Partly-real data bring the
synthetic sequences closer to reality by incorporating real cameras during the
acquisition process. The benefits of the proposed data generation pipeline,
especially for machine learning scenarios with limited available training data,
are demonstrated by an extensive experimental validation in the context of
automatic license plate recognition. The experiments demonstrate a significant
reduction of the character error rate and miss rate from 73.74% and 100% to
14.11% and 41.27% respectively, compared to an OCR algorithm trained on a real
data set solely. These improvements are achieved by training the algorithm on
synthesized data solely. When additionally incorporating real data, the error
rates can be decreased further. Thereby, the character error rate and miss rate
can be reduced to 11.90% and 39.88% respectively. All data used during the
experiments as well as the proposed rendering-based pipeline for the automated
data generation is made publicly available under (URL will be revealed upon
publication).
Related papers
- A New Benchmark: On the Utility of Synthetic Data with Blender for Bare
Supervised Learning and Downstream Domain Adaptation [42.2398858786125]
Deep learning in computer vision has achieved great success with the price of large-scale labeled training data.
The uncontrollable data collection process produces non-IID training and test data, where undesired duplication may exist.
To circumvent them, an alternative is to generate synthetic data via 3D rendering with domain randomization.
arXiv Detail & Related papers (2023-03-16T09:03:52Z) - Minimizing the Accumulated Trajectory Error to Improve Dataset
Distillation [151.70234052015948]
We propose a novel approach that encourages the optimization algorithm to seek a flat trajectory.
We show that the weights trained on synthetic data are robust against the accumulated errors perturbations with the regularization towards the flat trajectory.
Our method, called Flat Trajectory Distillation (FTD), is shown to boost the performance of gradient-matching methods by up to 4.7%.
arXiv Detail & Related papers (2022-11-20T15:49:11Z) - Towards Real-World Video Deblurring by Exploring Blur Formation Process [53.91239555063343]
In recent years, deep learning-based approaches have achieved promising success on video deblurring task.
The models trained on existing synthetic datasets still suffer from generalization problems over real-world blurry scenarios.
We propose a novel realistic blur synthesis pipeline termed RAW-Blur by leveraging blur formation cues.
arXiv Detail & Related papers (2022-08-28T09:24:52Z) - TRoVE: Transforming Road Scene Datasets into Photorealistic Virtual
Environments [84.6017003787244]
This work proposes a synthetic data generation pipeline to address the difficulties and domain-gaps present in simulated datasets.
We show that using annotations and visual cues from existing datasets, we can facilitate automated multi-modal data generation.
arXiv Detail & Related papers (2022-08-16T20:46:08Z) - BeCAPTCHA-Type: Biometric Keystroke Data Generation for Improved Bot
Detection [63.447493500066045]
This work proposes a data driven learning model for the synthesis of keystroke biometric data.
The proposed method is compared with two statistical approaches based on Universal and User-dependent models.
Our experimental framework considers a dataset with 136 million keystroke events from 168 thousand subjects.
arXiv Detail & Related papers (2022-07-27T09:26:15Z) - Synt++: Utilizing Imperfect Synthetic Data to Improve Speech Recognition [18.924716098922683]
Machine learning with synthetic data is not trivial due to the gap between the synthetic and the real data distributions.
We propose two novel techniques during training to mitigate the problems due to the distribution gap.
We show that these methods significantly improve the training of speech recognition models using synthetic data.
arXiv Detail & Related papers (2021-10-21T21:11:42Z) - Synthetic Data for Model Selection [2.4499092754102874]
We show that synthetic data can be beneficial for model selection.
We introduce a novel method to calibrate the synthetic error estimation to fit that of the real domain.
arXiv Detail & Related papers (2021-05-03T09:52:03Z) - UnrealPerson: An Adaptive Pipeline towards Costless Person
Re-identification [102.58619642363959]
This paper presents UnrealPerson, a novel pipeline that makes full use of unreal image data to decrease the costs in both the training and deployment stages.
With 3,000 IDs and 120,000 instances, our method achieves a 38.5% rank-1 accuracy when being directly transferred to MSMT17.
arXiv Detail & Related papers (2020-12-08T08:15:30Z) - Syn2Real Transfer Learning for Image Deraining using Gaussian Processes [92.15895515035795]
CNN-based methods for image deraining have achieved excellent performance in terms of reconstruction error as well as visual quality.
Due to challenges in obtaining real world fully-labeled image deraining datasets, existing methods are trained only on synthetically generated data.
We propose a Gaussian Process-based semi-supervised learning framework which enables the network in learning to derain using synthetic dataset.
arXiv Detail & Related papers (2020-06-10T00:33:18Z) - Can Synthetic Data Improve Object Detection Results for Remote Sensing
Images? [15.466412729455874]
We propose the use of realistic synthetic data with a wide distribution to improve the performance of remote sensing image aircraft detection.
We randomly set the parameters during rendering, such as the size of the instance and the class of background images.
In order to make the synthetic images more realistic, we refine the synthetic images at the pixel level using CycleGAN with real unlabeled images.
arXiv Detail & Related papers (2020-06-09T02:23:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.