On the Cross-dataset Generalization in License Plate Recognition
- URL: http://arxiv.org/abs/2201.00267v2
- Date: Tue, 4 Jan 2022 08:00:02 GMT
- Title: On the Cross-dataset Generalization in License Plate Recognition
- Authors: Rayson Laroca, Everton V. Cardoso, Diego R. Lucio, Valter Estevam,
David Menotti
- Abstract summary: We propose a traditional-split versus leave-one-dataset-out experimental setup to empirically assess the cross-dataset generalization of 12 OCR models.
Results shed light on the limitations of the traditional-split protocol for evaluating approaches in the ALPR context.
- Score: 1.8514314381314887
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Automatic License Plate Recognition (ALPR) systems have shown remarkable
performance on license plates (LPs) from multiple regions due to advances in
deep learning and the increasing availability of datasets. The evaluation of
deep ALPR systems is usually done within each dataset; therefore, it is
questionable if such results are a reliable indicator of generalization
ability. In this paper, we propose a traditional-split versus
leave-one-dataset-out experimental setup to empirically assess the
cross-dataset generalization of 12 Optical Character Recognition (OCR) models
applied to LP recognition on nine publicly available datasets with a great
variety in several aspects (e.g., acquisition settings, image resolution, and
LP layouts). We also introduce a public dataset for end-to-end ALPR that is the
first to contain images of vehicles with Mercosur LPs and the one with the
highest number of motorcycle images. The experimental results shed light on the
limitations of the traditional-split protocol for evaluating approaches in the
ALPR context, as there are significant drops in performance for most datasets
when training and testing the models in a leave-one-dataset-out fashion.
Related papers
- Toward Enhancing Vehicle Color Recognition in Adverse Conditions: A Dataset and Benchmark [2.326743352134195]
Vehicle Color Recognition (VCR) has garnered significant research interest because color is a visually distinguishable attribute of vehicles.
Despite the success of existing methods for this task, the relatively low complexity of the datasets used in the literature has been largely overlooked.
This research addresses this gap by compiling a new dataset representing a more challenging VCR scenario.
arXiv Detail & Related papers (2024-08-21T12:54:41Z) - Rethinking Transformers Pre-training for Multi-Spectral Satellite
Imagery [78.43828998065071]
Recent advances in unsupervised learning have demonstrated the ability of large vision models to achieve promising results on downstream tasks.
Such pre-training techniques have also been explored recently in the remote sensing domain due to the availability of large amount of unlabelled data.
In this paper, we re-visit transformers pre-training and leverage multi-scale information that is effectively utilized with multiple modalities.
arXiv Detail & Related papers (2024-03-08T16:18:04Z) - A Survey on Open-Set Image Recognition [18.474539379698538]
Open-set image recognition (OSR) aims to both classify known-class samples and identify unknown-class samples in the testing set.
We introduce a new taxonomy, under which we comprehensively review the existing DNN-based OSR methods.
We compare the performances of some typical and state-of-the-art OSR methods on both coarse-grained datasets and fine-grained datasets.
arXiv Detail & Related papers (2023-12-25T00:30:23Z) - Cluster-level pseudo-labelling for source-free cross-domain facial
expression recognition [94.56304526014875]
We propose the first Source-Free Unsupervised Domain Adaptation (SFUDA) method for Facial Expression Recognition (FER)
Our method exploits self-supervised pretraining to learn good feature representations from the target data.
We validate the effectiveness of our method in four adaptation setups, proving that it consistently outperforms existing SFUDA methods when applied to FER.
arXiv Detail & Related papers (2022-10-11T08:24:50Z) - Mitigating Representation Bias in Action Recognition: Algorithms and
Benchmarks [76.35271072704384]
Deep learning models perform poorly when applied to videos with rare scenes or objects.
We tackle this problem from two different angles: algorithm and dataset.
We show that the debiased representation can generalize better when transferred to other datasets and tasks.
arXiv Detail & Related papers (2022-09-20T00:30:35Z) - A First Look at Dataset Bias in License Plate Recognition [1.8496815029347666]
dataset bias has been recognized as a severe problem in the computer vision community.
This work investigates the dataset bias problem in the License Plate Recognition context.
arXiv Detail & Related papers (2022-08-23T00:20:33Z) - An advanced combination of semi-supervised Normalizing Flow & Yolo
(YoloNF) to detect and recognize vehicle license plates [1.5208105446192792]
This paper presents a robust and efficient ALPR system based on the state-of-the-art YOLO object detector and Normalizing flows.
The model uses two new strategies. Firstly, a two-stage network using YOLO and a normalization flow-based model for normalization to detect Licenses Plates (LP) and recognize the LP with numbers and Arabic characters.
arXiv Detail & Related papers (2022-07-21T22:22:57Z) - Prefix Conditioning Unifies Language and Label Supervision [84.11127588805138]
We show that dataset biases negatively affect pre-training by reducing the generalizability of learned representations.
In experiments, we show that this simple technique improves the performance in zero-shot image recognition accuracy and robustness to the image-level distribution shift.
arXiv Detail & Related papers (2022-06-02T16:12:26Z) - Unified Chinese License Plate Detection and Recognition with High
Efficiency [3.0279719282256137]
Deep learning methods have reached an excellent performance on License Plate (LP) detection and recognition tasks.
It is still challenging to build a robust model for Chinese LPs since there are not enough large and representative datasets.
We propose a new dataset named Chinese Road Plate dataset (CRPD) that contains multi-objective Chinese LP images.
arXiv Detail & Related papers (2022-05-07T07:35:51Z) - Robust Semi-supervised Federated Learning for Images Automatic
Recognition in Internet of Drones [57.468730437381076]
We present a Semi-supervised Federated Learning (SSFL) framework for privacy-preserving UAV image recognition.
There are significant differences in the number, features, and distribution of local data collected by UAVs using different camera modules.
We propose an aggregation rule based on the frequency of the client's participation in training, namely the FedFreq aggregation rule.
arXiv Detail & Related papers (2022-01-03T16:49:33Z) - Omni-supervised Facial Expression Recognition via Distilled Data [120.11782405714234]
We propose omni-supervised learning to exploit reliable samples in a large amount of unlabeled data for network training.
We experimentally verify that the new dataset can significantly improve the ability of the learned FER model.
To tackle this, we propose to apply a dataset distillation strategy to compress the created dataset into several informative class-wise images.
arXiv Detail & Related papers (2020-05-18T09:36:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.