Iranis: A Large-scale Dataset of Farsi License Plate Characters
- URL: http://arxiv.org/abs/2101.00295v1
- Date: Fri, 1 Jan 2021 18:54:44 GMT
- Title: Iranis: A Large-scale Dataset of Farsi License Plate Characters
- Authors: Ali Tourani, Sajjad Soroori, Asadollah Shahbahrami, and Alireza
Akoushideh
- Abstract summary: This paper introduces a large-scale dataset that includes images of numbers and characters used in Iranian car license plates.
The variety of instances in terms of camera shooting angle, illumination, resolution, and contrast make the dataset a proper choice for training deep learning systems.
- Score: 2.537406035246369
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Providing huge amounts of data is a fundamental demand when dealing with Deep
Neural Networks (DNNs). Employing these algorithms to solve computer vision
problems resulted in the advent of various image datasets to feed the most
common visual imagery deep structures, known as Convolutional Neural Networks
(CNNs). In this regard, some datasets can be found that contain hundreds or
even thousands of images for license plate detection and optical character
recognition purposes. However, no publicly available image dataset provides
such data for the recognition of Farsi characters used in car license plates.
The gap has to be filled due to the numerous advantages of developing accurate
deep learning-based systems for law enforcement and surveillance purposes. This
paper introduces a large-scale dataset that includes images of numbers and
characters used in Iranian car license plates. The dataset, named Iranis,
contains more than 83,000 images of Farsi numbers and letters collected from
real-world license plate images captured by various cameras. The variety of
instances in terms of camera shooting angle, illumination, resolution, and
contrast make the dataset a proper choice for training DNNs. Dataset images are
manually annotated for object detection and image classification. Finally, and
to build a baseline for Farsi character recognition, the paper provides a
performance analysis using a YOLO v.3 object detector.
Related papers
- AddressCLIP: Empowering Vision-Language Models for City-wide Image Address Localization [57.34659640776723]
We propose an end-to-end framework named AddressCLIP to solve the problem with more semantics.
We have built three datasets from Pittsburgh and San Francisco on different scales specifically for the IAL problem.
arXiv Detail & Related papers (2024-07-11T03:18:53Z) - From Pixels to Prose: A Large Dataset of Dense Image Captions [76.97493750144812]
PixelProse is a comprehensive dataset of over 16M (million) synthetically generated captions.
To ensure data integrity, we rigorously analyze our dataset for problematic content.
We also provide valuable metadata such as watermark presence and aesthetic scores.
arXiv Detail & Related papers (2024-06-14T17:59:53Z) - Efficient Deduplication and Leakage Detection in Large Scale Image
Datasets with a focus on the CrowdAI Mapping Challenge Dataset [5.149242555705579]
We propose a drop-in pipeline that employs perceptual hashing techniques for efficient de-duplication of the dataset.
In our experiments, we demonstrate that nearly 250k($ sim $90%) images in the training split were identical.
Our analysis on the validation split demonstrates that roughly 56k of the 60k images also appear in the training split, resulting in a data leakage of 93%.
arXiv Detail & Related papers (2023-04-05T08:36:17Z) - HPointLoc: Point-based Indoor Place Recognition using Synthetic RGB-D
Images [58.720142291102135]
We present a novel dataset named as HPointLoc, specially designed for exploring capabilities of visual place recognition in indoor environment.
The dataset is based on the popular Habitat simulator, in which it is possible to generate indoor scenes using both own sensor data and open datasets.
arXiv Detail & Related papers (2022-12-30T12:20:56Z) - IR-LPR: Large Scale of Iranian License Plate Recognition Dataset [0.0]
We have prepared a complete dataset including 20,967 car images along with all the detection annotation of the whole license plate and its characters.
The largest Iranian dataset for recognizing the characters of a license plate has 5,000 images.
arXiv Detail & Related papers (2022-09-10T14:41:59Z) - A large scale multi-view RGBD visual affordance learning dataset [4.3773754388936625]
We introduce a large scale multi-view RGBD visual affordance learning dataset.
This is the first ever and the largest multi-view RGBD visual affordance learning dataset.
Several state-of-the-art deep learning networks are evaluated each for affordance recognition and segmentation tasks.
arXiv Detail & Related papers (2022-03-26T14:31:35Z) - Scaling Up Visual and Vision-Language Representation Learning With Noisy
Text Supervision [57.031588264841]
We leverage a noisy dataset of over one billion image alt-text pairs, obtained without expensive filtering or post-processing steps.
A simple dual-encoder architecture learns to align visual and language representations of the image and text pairs using a contrastive loss.
We show that the scale of our corpus can make up for its noise and leads to state-of-the-art representations even with such a simple learning scheme.
arXiv Detail & Related papers (2021-02-11T10:08:12Z) - Six-channel Image Representation for Cross-domain Object Detection [17.854940064699985]
Deep learning models are data-driven and the excellent performance is highly dependent on the abundant and diverse datasets.
Some image-to-image translation techniques are employed to generate some fake data of some specific scenes to train the models.
We propose to inspire the original 3-channel images and their corresponding GAN-generated fake images to form 6-channel representations of the dataset.
arXiv Detail & Related papers (2021-01-03T04:50:03Z) - Robust Image Retrieval-based Visual Localization using Kapture [10.249293519246478]
We present a versatile pipeline for visual localization that facilitates the use of different local and global features.
We evaluate our methods on eight public datasets where they rank top on all and first on many of them.
To foster future research, we release code, models, and all datasets used in this paper in the kapture format open source under a permissive BSD license.
arXiv Detail & Related papers (2020-07-27T21:10:35Z) - A Robust Attentional Framework for License Plate Recognition in the Wild [95.7296788722492]
We propose a robust framework for license plate recognition in the wild.
It is composed of a tailored CycleGAN model for license plate image generation and an elaborate designed image-to-sequence network for plate recognition.
We release a new license plate dataset, named "CLPD", with 1200 images from all 31 provinces in mainland China.
arXiv Detail & Related papers (2020-06-06T17:11:52Z) - Real-MFF: A Large Realistic Multi-focus Image Dataset with Ground Truth [58.226535803985804]
We introduce a large and realistic multi-focus dataset called Real-MFF.
The dataset contains 710 pairs of source images with corresponding ground truth images.
We evaluate 10 typical multi-focus algorithms on this dataset for the purpose of illustration.
arXiv Detail & Related papers (2020-03-28T12:33:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.