Improved Pothole Detection Using YOLOv7 and ESRGAN
- URL: http://arxiv.org/abs/2401.08588v1
- Date: Fri, 10 Nov 2023 16:22:10 GMT
- Title: Improved Pothole Detection Using YOLOv7 and ESRGAN
- Authors: Nirmal Kumar Rout, Gyanateet Dutta, Varun Sinha, Arghadeep Dey,
Subhrangshu Mukherjee, Gopal Gupta
- Abstract summary: Potholes are common road hazards that is causing damage to vehicles and posing a safety risk to drivers.
CNNs are widely used in the industry for object detection based on Deep Learning methods.
In this paper, a unique better algorithm is proposed to warrant the use of low-resolution cameras or low-resolution images and video feed for automatic pothole detection.
- Score: 1.0485739694839669
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Potholes are common road hazards that is causing damage to vehicles and
posing a safety risk to drivers. The introduction of Convolutional Neural
Networks (CNNs) is widely used in the industry for object detection based on
Deep Learning methods and has achieved significant progress in hardware
improvement and software implementations. In this paper, a unique better
algorithm is proposed to warrant the use of low-resolution cameras or
low-resolution images and video feed for automatic pothole detection using
Super Resolution (SR) through Super Resolution Generative Adversarial Networks
(SRGANs). Then we have proceeded to establish a baseline pothole detection
performance on low quality and high quality dashcam images using a You Only
Look Once (YOLO) network, namely the YOLOv7 network. We then have illustrated
and examined the speed and accuracy gained above the benchmark after having
upscaling implementation on the low quality images.
Related papers
- Accelerating Object Detection with YOLOv4 for Real-Time Applications [0.276240219662896]
Convolutional Neural Network (CNN) have emerged as a powerful tool for recognizing image content and in computer vision approach for most problems.
This paper introduces the brief introduction of deep learning and object detection framework like Convolutional Neural Network(CNN)
arXiv Detail & Related papers (2024-10-17T17:44:57Z) - Exploring Deep Learning Image Super-Resolution for Iris Recognition [50.43429968821899]
We propose the use of two deep learning single-image super-resolution approaches: Stacked Auto-Encoders (SAE) and Convolutional Neural Networks (CNN)
We validate the methods with a database of 1.872 near-infrared iris images with quality assessment and recognition experiments showing the superiority of deep learning approaches over the compared algorithms.
arXiv Detail & Related papers (2023-11-02T13:57:48Z) - SqueezerFaceNet: Reducing a Small Face Recognition CNN Even More Via
Filter Pruning [55.84746218227712]
We develop SqueezerFaceNet, a light face recognition network which less than 1M parameters.
We show that it can be further reduced (up to 40%) without an appreciable loss in performance.
arXiv Detail & Related papers (2023-07-20T08:38:50Z) - LSwinSR: UAV Imagery Super-Resolution based on Linear Swin Transformer [7.3817359680010615]
Super-resolution technology is especially beneficial for Unmanned Aerial Vehicles (UAV)
In this paper, for the super-resolution of UAV images, a novel network based on the state-of-the-art Swin Transformer is proposed with better efficiency and competitive accuracy.
arXiv Detail & Related papers (2023-03-17T20:14:10Z) - Rethinking Resolution in the Context of Efficient Video Recognition [49.957690643214576]
Cross-resolution KD (ResKD) is a simple but effective method to boost recognition accuracy on low-resolution frames.
We extensively demonstrate its effectiveness over state-of-the-art architectures, i.e., 3D-CNNs and Video Transformers.
arXiv Detail & Related papers (2022-09-26T15:50:44Z) - FOVEA: Foveated Image Magnification for Autonomous Navigation [53.69803081925454]
We propose an attentional approach that elastically magnifies certain regions while maintaining a small input canvas.
Our proposed method boosts the detection AP over standard Faster R-CNN, with and without finetuning.
On the autonomous driving datasets Argoverse-HD and BDD100K, we show our proposed method boosts the detection AP over standard Faster R-CNN, with and without finetuning.
arXiv Detail & Related papers (2021-08-27T03:07:55Z) - Improved detection of small objects in road network sequences [0.0]
We propose a new procedure for detecting small-scale objects by applying super-resolution processes based on detections performed by convolutional neural networks.
This work has been tested for a set of traffic images containing elements of different scales to test the efficiency according to the detections obtained by the model.
arXiv Detail & Related papers (2021-05-18T10:13:23Z) - D-Unet: A Dual-encoder U-Net for Image Splicing Forgery Detection and
Localization [108.8592577019391]
Image splicing forgery detection is a global binary classification task that distinguishes the tampered and non-tampered regions by image fingerprints.
We propose a novel network called dual-encoder U-Net (D-Unet) for image splicing forgery detection, which employs an unfixed encoder and a fixed encoder.
In an experimental comparison study of D-Unet and state-of-the-art methods, D-Unet outperformed the other methods in image-level and pixel-level detection.
arXiv Detail & Related papers (2020-12-03T10:54:02Z) - Anchor-free Small-scale Multispectral Pedestrian Detection [88.7497134369344]
We propose a method for effective and efficient multispectral fusion of the two modalities in an adapted single-stage anchor-free base architecture.
We aim at learning pedestrian representations based on object center and scale rather than direct bounding box predictions.
Results show our method's effectiveness in detecting small-scaled pedestrians.
arXiv Detail & Related papers (2020-08-19T13:13:01Z) - Small-Object Detection in Remote Sensing Images with End-to-End
Edge-Enhanced GAN and Object Detector Network [9.135036713000513]
A generative adversarial network (GAN)-based model called enhanced super-resolution GAN (ESRGAN) shows remarkable image enhancement performance.
We propose a new edge-enhanced super-resolution GAN (EESRGAN) to improve the image quality of remote sensing images.
arXiv Detail & Related papers (2020-03-20T03:07:30Z) - The Mertens Unrolled Network (MU-Net): A High Dynamic Range Fusion
Neural Network for Through the Windshield Driver Recognition [1.758759574398869]
Face recognition in unconstrained environments poses a number of unique challenges ranging from glare, poor illumination, driver pose and motion.
We further develop the hardware and software of a custom vehicle imaging system to better overcome these challenges.
We name the Mertens Unrolled Network (MU-Net) for the purpose of fine-tuning the HDR output of through-windshield images.
arXiv Detail & Related papers (2020-02-27T16:57:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.