Efficient Video-Based ALPR System Using YOLO and Visual Rhythm
- URL: http://arxiv.org/abs/2501.02270v2
- Date: Wed, 08 Jan 2025 13:42:02 GMT
- Title: Efficient Video-Based ALPR System Using YOLO and Visual Rhythm
- Authors: Victor Nascimento Ribeiro, Nina S. T. Hirata,
- Abstract summary: We propose a system capable of extracting exactly one frame per vehicle and recognizing its license plate characters from this singular image.
Early experiments show that this methodology is viable.
- Score: 0.36832029288386137
- License:
- Abstract: Automatic License Plate Recognition (ALPR) involves extracting vehicle license plate information from image or a video capture. These systems have gained popularity due to the wide availability of low-cost surveillance cameras and advances in Deep Learning. Typically, video-based ALPR systems rely on multiple frames to detect the vehicle and recognize the license plates. Therefore, we propose a system capable of extracting exactly one frame per vehicle and recognizing its license plate characters from this singular image using an Optical Character Recognition (OCR) model. Early experiments show that this methodology is viable.
Related papers
- Efficient License Plate Recognition in Videos Using Visual Rhythm and Accumulative Line Analysis [0.36832029288386137]
Video-based Automatic License Plate Recognition (ALPR) involves extracting vehicle license plate text information from video captures.
Traditional systems rely heavily on high-end computing resources and utilize multiple frames to recognize license plates.
We propose two methods capable of efficiently extracting exactly one frame per vehicle and recognizing its license plate characters from this single image.
arXiv Detail & Related papers (2025-01-08T16:17:05Z) - A Training-Free Framework for Video License Plate Tracking and Recognition with Only One-Shot [25.032455444204466]
OneShotLP is a training-free framework for video-based license plate detection and recognition.
It offers the ability to function effectively without extensive training data and adaptability to various license plate styles.
This highlights the potential of leveraging pre-trained models for diverse real-world applications in intelligent transportation systems.
arXiv Detail & Related papers (2024-08-11T08:42:02Z) - Bidirectional Cross-Modal Knowledge Exploration for Video Recognition
with Pre-trained Vision-Language Models [149.1331903899298]
We propose a novel framework called BIKE, which utilizes the cross-modal bridge to explore bidirectional knowledge.
We present a Temporal Concept Spotting mechanism that uses the Text-to-Video expertise to capture temporal saliency in a parameter-free manner.
Our best model achieves a state-of-the-art accuracy of 88.6% on the challenging Kinetics-400 using the released CLIP model.
arXiv Detail & Related papers (2022-12-31T11:36:53Z) - Deep Learning Computer Vision Algorithms for Real-time UAVs On-board
Camera Image Processing [77.34726150561087]
This paper describes how advanced deep learning based computer vision algorithms are applied to enable real-time on-board sensor processing for small UAVs.
All algorithms have been developed using state-of-the-art image processing methods based on deep neural networks.
arXiv Detail & Related papers (2022-11-02T11:10:42Z) - Frozen CLIP Models are Efficient Video Learners [86.73871814176795]
Video recognition has been dominated by the end-to-end learning paradigm.
Recent advances in Contrastive Vision-Language Pre-training pave the way for a new route for visual recognition tasks.
We present Efficient Video Learning -- an efficient framework for directly training high-quality video recognition models.
arXiv Detail & Related papers (2022-08-06T17:38:25Z) - An Automated Approach for the Recognition of Bengali License Plates [0.0]
This study proposes a hybrid method for detecting license plates using characters from them.
Our captured image information was used for the recognition procedure in Bangladeshi vehicles.
arXiv Detail & Related papers (2021-09-01T17:31:33Z) - End-to-End License Plate Recognition Pipeline for Real-time Low Resource
Video Based Applications [0.43012765978447565]
We propose a novel two-stage detection pipeline paired with Vision API to provide real-time inference speed.
We trained our models on an image dataset and a video dataset containing license plates in the wild.
We observed reasonable detection and recognition performance with real-time processing speed (27.2 frames per second)
arXiv Detail & Related papers (2021-08-18T18:31:01Z) - Self-Supervised Steering Angle Prediction for Vehicle Control Using
Visual Odometry [55.11913183006984]
We show how a model can be trained to control a vehicle's trajectory using camera poses estimated through visual odometry methods.
We propose a scalable framework that leverages trajectory information from several different runs using a camera setup placed at the front of a car.
arXiv Detail & Related papers (2021-03-20T16:29:01Z) - Learning from Weakly-labeled Web Videos via Exploring Sub-Concepts [89.06560404218028]
We introduce a new method for pre-training video action recognition models using queried web videos.
Instead of trying to filter out, we propose to convert the potential noises in these queried videos to useful supervision signals.
We show that SPL outperforms several existing pre-training strategies using pseudo-labels.
arXiv Detail & Related papers (2021-01-11T05:50:16Z) - A Robust Attentional Framework for License Plate Recognition in the Wild [95.7296788722492]
We propose a robust framework for license plate recognition in the wild.
It is composed of a tailored CycleGAN model for license plate image generation and an elaborate designed image-to-sequence network for plate recognition.
We release a new license plate dataset, named "CLPD", with 1200 images from all 31 provinces in mainland China.
arXiv Detail & Related papers (2020-06-06T17:11:52Z) - Deep Learning Based Vehicle Tracking System Using License Plate
Detection And Recognition [0.0]
The proposed system uses a novel approach to vehicle tracking using Vehicle License plate detection and recognition (OCR) technique.
Results were obtained at a speed of 30 frames per second with accuracy close to human.
arXiv Detail & Related papers (2020-05-10T14:03:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.