Related papers: Efficient License Plate Recognition in Videos Using Visual Rhythm and Accumulative Line Analysis

Efficient License Plate Recognition in Videos Using Visual Rhythm and Accumulative Line Analysis

URL: http://arxiv.org/abs/2501.04750v1
Date: Wed, 08 Jan 2025 16:17:05 GMT
Title: Efficient License Plate Recognition in Videos Using Visual Rhythm and Accumulative Line Analysis
Authors: Victor Nascimento Ribeiro, Nina S. T. Hirata,
Abstract summary: Video-based Automatic License Plate Recognition (ALPR) involves extracting vehicle license plate text information from video captures.<n>Traditional systems rely heavily on high-end computing resources and utilize multiple frames to recognize license plates.<n>We propose two methods capable of efficiently extracting exactly one frame per vehicle and recognizing its license plate characters from this single image.
Score: 0.36832029288386137
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Video-based Automatic License Plate Recognition (ALPR) involves extracting vehicle license plate text information from video captures. Traditional systems typically rely heavily on high-end computing resources and utilize multiple frames to recognize license plates, leading to increased computational overhead. In this paper, we propose two methods capable of efficiently extracting exactly one frame per vehicle and recognizing its license plate characters from this single image, thus significantly reducing computational demands. The first method uses Visual Rhythm (VR) to generate time-spatial images from videos, while the second employs Accumulative Line Analysis (ALA), a novel algorithm based on single-line video processing for real-time operation. Both methods leverage YOLO for license plate detection within the frame and a Convolutional Neural Network (CNN) for Optical Character Recognition (OCR) to extract textual information. Experiments on real videos demonstrate that the proposed methods achieve results comparable to traditional frame-by-frame approaches, with processing speeds three times faster.

Related papers

Combining YOLO and Visual Rhythm for Vehicle Counting [0.36832029288386137]
Video-based vehicle detection and counting play a critical role in managing transport infrastructure.<n>Traditional image-based counting methods usually involve two main steps: initial detection and subsequent tracking.<n>This work presents an alternative and more efficient method for vehicle detection and counting.
arXiv Detail & Related papers (2025-01-08T14:33:47Z)
Efficient Video-Based ALPR System Using YOLO and Visual Rhythm [0.36832029288386137]
We propose a system capable of extracting exactly one frame per vehicle and recognizing its license plate characters from this singular image.<n>Early experiments show that this methodology is viable.
arXiv Detail & Related papers (2025-01-04T12:15:58Z)
Neuromorphic Synergy for Video Binarization [54.195375576583864]
Bimodal objects serve as a visual form to embed information that can be easily recognized by vision systems. Neuromorphic cameras offer new capabilities for alleviating motion blur, but it is non-trivial to first de-blur and then binarize the images in a real-time manner. We propose an event-based binary reconstruction method that leverages the prior knowledge of the bimodal target's properties to perform inference independently in both event space and image space. We also develop an efficient integration method to propagate this binary image to high frame rate binary video.
arXiv Detail & Related papers (2024-02-20T01:43:51Z)
Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models [149.1331903899298]
We propose a novel framework called BIKE, which utilizes the cross-modal bridge to explore bidirectional knowledge. We present a Temporal Concept Spotting mechanism that uses the Text-to-Video expertise to capture temporal saliency in a parameter-free manner. Our best model achieves a state-of-the-art accuracy of 88.6% on the challenging Kinetics-400 using the released CLIP model.
arXiv Detail & Related papers (2022-12-31T11:36:53Z)
Deep Learning Computer Vision Algorithms for Real-time UAVs On-board Camera Image Processing [77.34726150561087]
This paper describes how advanced deep learning based computer vision algorithms are applied to enable real-time on-board sensor processing for small UAVs. All algorithms have been developed using state-of-the-art image processing methods based on deep neural networks.
arXiv Detail & Related papers (2022-11-02T11:10:42Z)
Frozen CLIP Models are Efficient Video Learners [86.73871814176795]
Video recognition has been dominated by the end-to-end learning paradigm. Recent advances in Contrastive Vision-Language Pre-training pave the way for a new route for visual recognition tasks. We present Efficient Video Learning -- an efficient framework for directly training high-quality video recognition models.
arXiv Detail & Related papers (2022-08-06T17:38:25Z)
NSNet: Non-saliency Suppression Sampler for Efficient Video Recognition [89.84188594758588]
A novel Non-saliency Suppression Network (NSNet) is proposed to suppress the responses of non-salient frames. NSNet achieves the state-of-the-art accuracy-efficiency trade-off and presents a significantly faster (2.44.3x) practical inference speed than state-of-the-art methods.
arXiv Detail & Related papers (2022-07-21T09:41:22Z)
End-to-End License Plate Recognition Pipeline for Real-time Low Resource Video Based Applications [0.43012765978447565]
We propose a novel two-stage detection pipeline paired with Vision API to provide real-time inference speed. We trained our models on an image dataset and a video dataset containing license plates in the wild. We observed reasonable detection and recognition performance with real-time processing speed (27.2 frames per second)
arXiv Detail & Related papers (2021-08-18T18:31:01Z)
Video Corpus Moment Retrieval with Contrastive Learning [56.249924768243375]
Video corpus moment retrieval (VCMR) is to retrieve a temporal moment that semantically corresponds to a given text query. We propose a Retrieval and Localization Network with Contrastive Learning (ReLoCLNet) for VCMR. Experimental results show that ReLoCLNet encodes text and video separately for efficiency, its retrieval accuracy is comparable with baselines adopting cross-modal interaction learning.
arXiv Detail & Related papers (2021-05-13T12:54:39Z)
Rethinking and Designing a High-performing Automatic License Plate Recognition Approach [16.66787965777127]
We propose a novel automatic license plate recognition (ALPR) approach, termed VSNet. VSNet includes two CNNs, i.e., VertexNet for license plate detection and SCR-Net for license plate recognition, which is integrated in a resampling-based cascaded manner. Experimental results show that the proposed VSNet outperforms state-of-the-art methods by more than 50% relative improvement on error rate.
arXiv Detail & Related papers (2020-11-30T16:03:57Z)
Deep Learning Based Vehicle Tracking System Using License Plate Detection And Recognition [0.0]
The proposed system uses a novel approach to vehicle tracking using Vehicle License plate detection and recognition (OCR) technique. Results were obtained at a speed of 30 frames per second with accuracy close to human.
arXiv Detail & Related papers (2020-05-10T14:03:33Z)

This list is automatically generated from the titles and abstracts of the papers in this site.