End-to-End License Plate Recognition Pipeline for Real-time Low Resource
Video Based Applications
- URL: http://arxiv.org/abs/2108.08339v1
- Date: Wed, 18 Aug 2021 18:31:01 GMT
- Title: End-to-End License Plate Recognition Pipeline for Real-time Low Resource
Video Based Applications
- Authors: Alif Ashrafee, Akib Mohammed Khan, Mohammad Sabik Irbaz, MD Abdullah
Al Nasim
- Abstract summary: We propose a novel two-stage detection pipeline paired with Vision API to provide real-time inference speed.
We trained our models on an image dataset and a video dataset containing license plates in the wild.
We observed reasonable detection and recognition performance with real-time processing speed (27.2 frames per second)
- Score: 0.43012765978447565
- License: http://creativecommons.org/publicdomain/zero/1.0/
- Abstract: Automatic License Plate Recognition systems aim to provide an end-to-end
solution towards detecting, localizing, and recognizing license plate
characters from vehicles appearing in video frames. However, deploying such
systems in the real world requires real-time performance in low-resource
environments. In our paper, we propose a novel two-stage detection pipeline
paired with Vision API that aims to provide real-time inference speed along
with consistently accurate detection and recognition performance. We used a
haar-cascade classifier as a filter on top of our backbone MobileNet SSDv2
detection model. This reduces inference time by only focusing on high
confidence detections and using them for recognition. We also impose a temporal
frame separation strategy to identify multiple vehicle license plates in the
same clip. Furthermore, there are no publicly available Bangla license plate
datasets, for which we created an image dataset and a video dataset containing
license plates in the wild. We trained our models on the image dataset and
achieved an AP(0.5) score of 86% and tested our pipeline on the video dataset
and observed reasonable detection and recognition performance (82.7% detection
rate, and 60.8% OCR F1 score) with real-time processing speed (27.2 frames per
second).
Related papers
- BVI-RLV: A Fully Registered Dataset and Benchmarks for Low-Light Video Enhancement [56.97766265018334]
This paper introduces a low-light video dataset, consisting of 40 scenes with various motion scenarios under two distinct low-lighting conditions.
We provide fully registered ground truth data captured in normal light using a programmable motorized dolly and refine it via an image-based approach for pixel-wise frame alignment across different light levels.
Our experimental results demonstrate the significance of fully registered video pairs for low-light video enhancement (LLVE) and the comprehensive evaluation shows that the models trained with our dataset outperform those trained with the existing datasets.
arXiv Detail & Related papers (2024-07-03T22:41:49Z) - A Dataset and Model for Realistic License Plate Deblurring [17.52035404373648]
We introduce the first large-scale license plate deblurring dataset named License Plate Blur (LPBlur)
Then, we propose a License Plate Deblurring Generative Adversarial Network (LPDGAN) to tackle the license plate deblurring.
Our proposed model outperforms other state-of-the-art motion deblurring methods in realistic license plate deblurring scenarios.
arXiv Detail & Related papers (2024-04-21T14:36:57Z) - HPointLoc: Point-based Indoor Place Recognition using Synthetic RGB-D
Images [58.720142291102135]
We present a novel dataset named as HPointLoc, specially designed for exploring capabilities of visual place recognition in indoor environment.
The dataset is based on the popular Habitat simulator, in which it is possible to generate indoor scenes using both own sensor data and open datasets.
arXiv Detail & Related papers (2022-12-30T12:20:56Z) - Deep Learning Computer Vision Algorithms for Real-time UAVs On-board
Camera Image Processing [77.34726150561087]
This paper describes how advanced deep learning based computer vision algorithms are applied to enable real-time on-board sensor processing for small UAVs.
All algorithms have been developed using state-of-the-art image processing methods based on deep neural networks.
arXiv Detail & Related papers (2022-11-02T11:10:42Z) - Spatial-Temporal Frequency Forgery Clue for Video Forgery Detection in
VIS and NIR Scenario [87.72258480670627]
Existing face forgery detection methods based on frequency domain find that the GAN forged images have obvious grid-like visual artifacts in the frequency spectrum compared to the real images.
This paper proposes a Cosine Transform-based Forgery Clue Augmentation Network (FCAN-DCT) to achieve a more comprehensive spatial-temporal feature representation.
arXiv Detail & Related papers (2022-07-05T09:27:53Z) - Benchmarking Algorithms for Automatic License Plate Recognition [0.0]
We evaluate a lightweight Convolutional Neural Network (CNN) called LPRNet for automatic License Plate Recognition (LPR)
LPRNet is an end-to-end framework and demonstrated robust performance on both datasets.
Once properly trained, LPRNet can be used to recognize characters from a specific region and dataset.
arXiv Detail & Related papers (2022-03-27T13:21:29Z) - End-to-End High Accuracy License Plate Recognition Based on Depthwise
Separable Convolution Networks [0.0]
We propose a novel segmentation-free framework for license plate recognition and introduce NP-ALPR dataset.
The proposed network model consists of the latest deep learning methods and state-of-the-art ideas, and benefits from a novel network architecture.
We evaluate the effectiveness of the proposed method on three different datasets and show a recognition accuracy of over 99% and over 70 fps.
arXiv Detail & Related papers (2022-02-21T14:45:03Z) - 2nd Place Solution for Waymo Open Dataset Challenge - Real-time 2D
Object Detection [26.086623067939605]
In this report, we introduce a real-time method to detect the 2D objects from images.
We leverage accelerationRT to optimize the inference time of our detection pipeline.
Our framework achieves the latency of 45.8ms/frame on an Nvidia Tesla V100 GPU.
arXiv Detail & Related papers (2021-06-16T11:32:03Z) - Video-based Person Re-identification without Bells and Whistles [49.51670583977911]
Video-based person re-identification (Re-ID) aims at matching the video tracklets with cropped video frames for identifying the pedestrians under different cameras.
There exists severe spatial and temporal misalignment for those cropped tracklets due to the imperfect detection and tracking results generated with obsolete methods.
We present a simple re-Detect and Link (DL) module which can effectively reduce those unexpected noise through applying the deep learning-based detection and tracking on the cropped tracklets.
arXiv Detail & Related papers (2021-05-22T10:17:38Z) - Traffic Surveillance using Vehicle License Plate Detection and
Recognition in Bangladesh [0.0]
This paper presents a YOLOv4 object detection model in which the Convolutional Neural Network (CNN) is trained and tuned for detecting the license plate of the vehicles of Bangladesh.
Here we also present a Graphical User Interface (GUI) based on Tkinter, a python package.
arXiv Detail & Related papers (2020-12-03T19:16:49Z) - A Robust Attentional Framework for License Plate Recognition in the Wild [95.7296788722492]
We propose a robust framework for license plate recognition in the wild.
It is composed of a tailored CycleGAN model for license plate image generation and an elaborate designed image-to-sequence network for plate recognition.
We release a new license plate dataset, named "CLPD", with 1200 images from all 31 provinces in mainland China.
arXiv Detail & Related papers (2020-06-06T17:11:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.