Related papers: End-to-End License Plate Recognition Pipeline for Real-time Low Resource Video Based Applications

URL: http://arxiv.org/abs/2108.08339v1
Date: Wed, 18 Aug 2021 18:31:01 GMT
Title: End-to-End License Plate Recognition Pipeline for Real-time Low Resource Video Based Applications
Authors: Alif Ashrafee, Akib Mohammed Khan, Mohammad Sabik Irbaz, MD Abdullah Al Nasim
Abstract summary: We propose a novel two-stage detection pipeline paired with Vision API to provide real-time inference speed. We trained our models on an image dataset and a video dataset containing license plates in the wild. We observed reasonable detection and recognition performance with real-time processing speed (27.2 frames per second).
Score: 0.43012765978447565
License: http://creativecommons.org/publicdomain/zero/1.0/
Abstract: Automatic License Plate Recognition systems aim to provide an end-to-end solution towards detecting, localizing, and recognizing license plate characters from vehicles appearing in video frames. However, deploying such systems in the real world requires real-time performance in low-resource environments. In our paper, we propose a novel two-stage detection pipeline paired with Vision API that aims to provide real-time inference speed along with consistently accurate detection and recognition performance. We used a Haar cascade classifier as a filter on top of our backbone MobileNet SSDv2 detection model. This reduces inference time by only focusing on high confidence detections and using them for recognition. We also impose a temporal frame separation strategy to identify multiple vehicle license plates in the same clip. Furthermore, no Bangla license plate datasets are publicly available, so we created an image dataset and a video dataset containing license plates in the wild. We trained our models on the image dataset and achieved an AP(0.5) score of 86%, then tested our pipeline on the video dataset and observed reasonable detection and recognition performance (82.7% detection rate and 60.8% OCR F1 score) with real-time processing speed (27.2 frames per second).
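
The abstract mentions keeping only high confidence detections and separating frames in time to tell multiple plates apart, but it gives no thresholds or implementation details. The following is a minimal sketch of one way such a step could look, not the authors' implementation: it drops low-confidence detections, starts a new group whenever the gap between consecutive detection frames exceeds a limit, and picks the most confident frame per group for recognition. The names `Detection`, `group_by_frame_gap`, `best_per_group`, and the default values of `min_confidence` and `max_gap` are all hypothetical.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class Detection:
    frame_index: int   # position of the frame within the clip
    confidence: float  # detector score in [0, 1]


def group_by_frame_gap(
    detections: Iterable[Detection],
    min_confidence: float = 0.5,  # hypothetical threshold, not from the paper
    max_gap: int = 15,            # hypothetical gap in frames, not from the paper
) -> List[List[Detection]]:
    """Keep confident detections and split them wherever the frame gap is large."""
    kept = sorted(
        (d for d in detections if d.confidence >= min_confidence),
        key=lambda d: d.frame_index,
    )
    groups: List[List[Detection]] = []
    for det in kept:
        # Extend the current group if this detection is close in time to the last one.
        if groups and det.frame_index - groups[-1][-1].frame_index <= max_gap:
            groups[-1].append(det)
        else:
            # A large gap is treated as a different plate entering the clip.
            groups.append([det])
    return groups


def best_per_group(groups: List[List[Detection]]) -> List[Detection]:
    """Pick the most confident detection in each group as the frame to recognize."""
    return [max(group, key=lambda d: d.confidence) for group in groups]


if __name__ == "__main__":
    clip = [
        Detection(0, 0.90),
        Detection(1, 0.40),   # dropped: below min_confidence
        Detection(2, 0.80),
        Detection(40, 0.70),  # gap of 38 frames starts a second group
        Detection(41, 0.95),
    ]
    for det in best_per_group(group_by_frame_gap(clip)):
        print(det.frame_index, det.confidence)
```

Sending only one frame per group to the recognition stage is what would keep the number of OCR calls low in a sketch like this; a real system would likely also compare plate positions or recognized text rather than rely on frame gaps alone.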

Related papers

TransLPRNet: Lite Vision-Language Network for Single/Dual-line Chinese License Plate Recognition [1.1499574149885023]
This paper proposes a unified solution that integrates a lightweight visual encoder with a text decoder. To mitigate the scarcity of double-line license plate datasets, we constructed a single/double-line license plate dataset. The proposed algorithm achieves an average recognition accuracy of 99.34% on the corrected CCPD test set under coarse localization disturbance.
arXiv Detail & Related papers (2025-07-23T09:03:01Z)
Connecting Vision and Emissions: A Behavioural AI Approach to Carbon Estimation in Road Design [0.0]
We present an enhanced YOLOv8 real-time vehicle detection and classification framework for estimating carbon emissions in urban environments. The framework incorporates a hybrid pipeline where each detected vehicle is tracked and its bounding box is cropped and passed to a deep Optical Character Recognition (OCR) module. This OCR system, composed of multiple convolutional neural network (CNN) layers, is trained specifically for character-level detection and license plate decoding.
arXiv Detail & Related papers (2025-06-18T11:50:24Z)
PatrolVision: Automated License Plate Recognition in the wild [0.0]
We propose a complete ALPR system for Singapore license plates in both single-line and double-line formats. We first detect the license plate from the full image using RFB-Net and rectify multiple distorted license plates in a single image. We evaluate the performance of our proposed system on a newly built dataset covering more than 16,000 images.
arXiv Detail & Related papers (2025-04-15T02:10:43Z)
Efficient License Plate Recognition in Videos Using Visual Rhythm and Accumulative Line Analysis [0.36832029288386137]
Video-based Automatic License Plate Recognition (ALPR) involves extracting vehicle license plate text information from video captures. Traditional systems rely heavily on high-end computing resources and utilize multiple frames to recognize license plates. We propose two methods capable of efficiently extracting exactly one frame per vehicle and recognizing its license plate characters from this single image.
arXiv Detail & Related papers (2025-01-08T16:17:05Z)
Efficient Video-Based ALPR System Using YOLO and Visual Rhythm [0.36832029288386137]
We propose a system capable of extracting exactly one frame per vehicle and recognizing its license plate characters from this singular image. Early experiments show that this methodology is viable.
arXiv Detail & Related papers (2025-01-04T12:15:58Z)
BVI-RLV: A Fully Registered Dataset and Benchmarks for Low-Light Video Enhancement [56.97766265018334]
This paper introduces a low-light video dataset, consisting of 40 scenes with various motion scenarios under two distinct low-lighting conditions. We provide fully registered ground truth data captured in normal light using a programmable motorized dolly and refine it via an image-based approach for pixel-wise frame alignment across different light levels. Our experimental results demonstrate the significance of fully registered video pairs for low-light video enhancement (LLVE) and the comprehensive evaluation shows that the models trained with our dataset outperform those trained with the existing datasets.
arXiv Detail & Related papers (2024-07-03T22:41:49Z)
A Dataset and Model for Realistic License Plate Deblurring [17.52035404373648]
We introduce the first large-scale license plate deblurring dataset, named License Plate Blur (LPBlur). Then, we propose a License Plate Deblurring Generative Adversarial Network (LPDGAN) to tackle license plate deblurring. Our proposed model outperforms other state-of-the-art motion deblurring methods in realistic license plate deblurring scenarios.
arXiv Detail & Related papers (2024-04-21T14:36:57Z)
HPointLoc: Point-based Indoor Place Recognition using Synthetic RGB-D Images [58.720142291102135]
We present a novel dataset named HPointLoc, specially designed for exploring the capabilities of visual place recognition in indoor environments. The dataset is based on the popular Habitat simulator, in which it is possible to generate indoor scenes using both one's own sensor data and open datasets.
arXiv Detail & Related papers (2022-12-30T12:20:56Z)
Deep Learning Computer Vision Algorithms for Real-time UAVs On-board Camera Image Processing [77.34726150561087]
This paper describes how advanced deep learning based computer vision algorithms are applied to enable real-time on-board sensor processing for small UAVs. All algorithms have been developed using state-of-the-art image processing methods based on deep neural networks.
arXiv Detail & Related papers (2022-11-02T11:10:42Z)
Spatial-Temporal Frequency Forgery Clue for Video Forgery Detection in VIS and NIR Scenario [87.72258480670627]
Existing face forgery detection methods based on frequency domain find that the GAN forged images have obvious grid-like visual artifacts in the frequency spectrum compared to the real images. This paper proposes a Cosine Transform-based Forgery Clue Augmentation Network (FCAN-DCT) to achieve a more comprehensive spatial-temporal feature representation.
arXiv Detail & Related papers (2022-07-05T09:27:53Z)
Benchmarking Algorithms for Automatic License Plate Recognition [0.0]
We evaluate a lightweight Convolutional Neural Network (CNN) called LPRNet for automatic License Plate Recognition (LPR). LPRNet is an end-to-end framework that demonstrated robust performance on both datasets. Once properly trained, LPRNet can be used to recognize characters from a specific region and dataset.
arXiv Detail & Related papers (2022-03-27T13:21:29Z)
End-to-End High Accuracy License Plate Recognition Based on Depthwise Separable Convolution Networks [0.0]
We propose a novel segmentation-free framework for license plate recognition and introduce NP-ALPR dataset. The proposed network model consists of the latest deep learning methods and state-of-the-art ideas, and benefits from a novel network architecture. We evaluate the effectiveness of the proposed method on three different datasets and show a recognition accuracy of over 99% and over 70 fps.
arXiv Detail & Related papers (2022-02-21T14:45:03Z)
2nd Place Solution for Waymo Open Dataset Challenge - Real-time 2D Object Detection [26.086623067939605]
In this report, we introduce a real-time method to detect 2D objects in images. We leverage TensorRT to optimize the inference time of our detection pipeline. Our framework achieves a latency of 45.8 ms/frame on an Nvidia Tesla V100 GPU.
arXiv Detail & Related papers (2021-06-16T11:32:03Z)
Video-based Person Re-identification without Bells and Whistles [49.51670583977911]
Video-based person re-identification (Re-ID) aims at matching the video tracklets with cropped video frames for identifying the pedestrians under different cameras. There exists severe spatial and temporal misalignment for those cropped tracklets due to the imperfect detection and tracking results generated with obsolete methods. We present a simple re-Detect and Link (DL) module which can effectively reduce this unexpected noise by applying deep learning-based detection and tracking to the cropped tracklets.
arXiv Detail & Related papers (2021-05-22T10:17:38Z)
Traffic Surveillance using Vehicle License Plate Detection and Recognition in Bangladesh [0.0]
This paper presents a YOLOv4 object detection model in which the Convolutional Neural Network (CNN) is trained and tuned for detecting the license plates of vehicles in Bangladesh. Here we also present a Graphical User Interface (GUI) based on Tkinter, a Python package.
arXiv Detail & Related papers (2020-12-03T19:16:49Z)
A Robust Attentional Framework for License Plate Recognition in the Wild [95.7296788722492]
We propose a robust framework for license plate recognition in the wild. It is composed of a tailored CycleGAN model for license plate image generation and an elaborately designed image-to-sequence network for plate recognition. We release a new license plate dataset, named "CLPD", with 1200 images from all 31 provinces in mainland China.
arXiv Detail & Related papers (2020-06-06T17:11:52Z)

This list is automatically generated from the titles and abstracts of the papers on this site.