Pavement Distress Detection and Segmentation using YOLOv4 and DeepLabv3
on Pavements in the Philippines
- URL: http://arxiv.org/abs/2103.06467v1
- Date: Thu, 11 Mar 2021 05:25:29 GMT
- Title: Pavement Distress Detection and Segmentation using YOLOv4 and DeepLabv3
on Pavements in the Philippines
- Authors: James-Andrew Sarmiento
- Abstract summary: This study proposed the use of deep learning for two ways of recording pavement distresses from 2D RGB images.
YOLOv4 is used for pavement distress detection while DeepLabv3 is employed for pavement distress segmentation on a small dataset of pavement images in the Philippines.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Road transport infrastructure is critical for safe, fast, economical, and
reliable mobility within the whole country that is conducive to a productive
society. However, roads tend to deteriorate over time due to natural causes in
the environment and repeated traffic loads. Pavement Distress (PD) detection is
essential in monitoring the current conditions of the public roads to enable
targeted rehabilitation and preventive maintenance. Nonetheless, distress
detection surveys are still done via manual inspection for developing countries
such as the Philippines. This study proposed the use of deep learning for two
ways of recording pavement distresses from 2D RGB images - detection and
segmentation. YOLOv4 is used for pavement distress detection while DeepLabv3 is
employed for pavement distress segmentation on a small dataset of pavement
images in the Philippines. This study aims to provide a basis to potentially
spark solutions in building a cheap, scalable, and automated end-to-end
solution for PD detection in the country.
Related papers
- M2S-RoAD: Multi-Modal Semantic Segmentation for Road Damage Using Camera and LiDAR Data [9.967263440745432]
This paper introduces M2S-RoAD, a dataset for the semantic segmentation of different classes of road damage.
M2S-RoAD was collected in various towns across New South Wales, Australia, and labelled for semantic segmentation to identify nine distinct types of road damage.
arXiv Detail & Related papers (2025-04-14T11:32:01Z) - DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments [60.69159598130235]
We present a new dataset, DAVE, designed for evaluating perception methods with high representation of Vulnerable Road Users (VRUs)
DAVE is a manually annotated dataset encompassing 16 diverse actor categories (spanning animals, humans, vehicles, etc.) and 16 action types (complex and rare cases like cut-ins, zigzag movement, U-turn, etc.)
Our experiments show that existing methods suffer degradation in performance when evaluated on DAVE, highlighting its benefit for future video recognition research.
arXiv Detail & Related papers (2024-12-28T06:13:44Z) - Advancing Pavement Distress Detection in Developing Countries: A Novel Deep Learning Approach with Locally-Collected Datasets [3.378738346115004]
Road infrastructure maintenance in developing countries faces unique challenges due to resource constraints and diverse environmental factors.
This study addresses the critical need for efficient, accurate, and locally-relevant pavement distress detection methods.
We present a novel deep learning approach combining YOLO (You Only Look Once) object detection models with a Convolutional Block Attention Module.
arXiv Detail & Related papers (2024-08-10T23:20:36Z) - OpenLane-V2: A Topology Reasoning Benchmark for Unified 3D HD Mapping [84.65114565766596]
We present OpenLane-V2, the first dataset on topology reasoning for traffic scene structure.
OpenLane-V2 consists of 2,000 annotated road scenes that describe traffic elements and their correlation to the lanes.
We evaluate various state-of-the-art methods, and present their quantitative and qualitative results on OpenLane-V2 to indicate future avenues for investigating topology reasoning in traffic scenes.
arXiv Detail & Related papers (2023-04-20T16:31:22Z) - Unsupervised Adaptation from Repeated Traversals for Autonomous Driving [54.59577283226982]
Self-driving cars must generalize to the end-user's environment to operate reliably.
One potential solution is to leverage unlabeled data collected from the end-users' environments.
There is no reliable signal in the target domain to supervise the adaptation process.
We show that this simple additional assumption is sufficient to obtain a potent signal that allows us to perform iterative self-training of 3D object detectors on the target domain.
arXiv Detail & Related papers (2023-03-27T15:07:55Z) - Threatening Patch Attacks on Object Detection in Optical Remote Sensing
Images [55.09446477517365]
Advanced Patch Attacks (PAs) on object detection in natural images have pointed out the great safety vulnerability in methods based on deep neural networks.
We propose a more Threatening PA without the scarification of the visual quality, dubbed TPA.
To the best of our knowledge, this is the first attempt to study the PAs on object detection in O-RSIs, and we hope this work can get our readers interested in studying this topic.
arXiv Detail & Related papers (2023-02-13T02:35:49Z) - A Pre-study on Data Processing Pipelines for Roadside Object Detection
Systems Towards Safer Road Infrastructure [1.9967512860886603]
Single-vehicle accidents are the most common type of fatal accidents in Sweden.
This study investigates the feasibility, implementation, limitations and scaling up of data processing pipelines for roadside object detection.
The goal of this report is to investigate how to implement a scalable roadside object detection system towards safe road infrastructure and Sweden's Vision Zero.
arXiv Detail & Related papers (2022-04-17T16:27:26Z) - A Computer Vision-assisted Approach to Automated Real-Time Road
Infrastructure Management [0.0]
We propose a supervised object detection approach to detect and classify road distresses in real-time via a vehicle dashboard-mounted smartphone camera.
Our results rank in the top 5 of 121 teams that entered the IEEE's 2020 Global Road Damage Detection ("GRDC") Challenge.
arXiv Detail & Related papers (2022-02-27T04:08:00Z) - Triggering Failures: Out-Of-Distribution detection by learning from
local adversarial attacks in Semantic Segmentation [76.2621758731288]
We tackle the detection of out-of-distribution (OOD) objects in semantic segmentation.
Our main contribution is a new OOD detection architecture called ObsNet associated with a dedicated training scheme based on Local Adversarial Attacks (LAA)
We show it obtains top performances both in speed and accuracy when compared to ten recent methods of the literature on three different datasets.
arXiv Detail & Related papers (2021-08-03T17:09:56Z) - Spatio-Temporal Graph Convolutional Networks for Road Network Inundation
Status Prediction during Urban Flooding [1.376408511310322]
The objective of this study is to predict the near-future flooding status of road segments based on their own and adjacent road segments current status.
Existing studies related to near-future prediction of road network flooding status at road segment level are missing.
arXiv Detail & Related papers (2021-04-06T04:03:34Z) - Exploiting Playbacks in Unsupervised Domain Adaptation for 3D Object
Detection [55.12894776039135]
State-of-the-art 3D object detectors, based on deep learning, have shown promising accuracy but are prone to over-fit to domain idiosyncrasies.
We propose a novel learning approach that drastically reduces this gap by fine-tuning the detector on pseudo-labels in the target domain.
We show, on five autonomous driving datasets, that fine-tuning the detector on these pseudo-labels substantially reduces the domain gap to new driving environments.
arXiv Detail & Related papers (2021-03-26T01:18:11Z) - An Efficient and Scalable Deep Learning Approach for Road Damage
Detection [0.0]
This paper introduces a deep learning-based surveying scheme to analyze the image-based distress data in real-time.
A database consisting of a diverse population of crack distress types such as longitudinal, transverse, and alligator cracks is used.
Proposed models, resulted in F1-scores, ranging from 52% to 56%, and average inference time from 178-10 images per second.
arXiv Detail & Related papers (2020-11-18T23:05:41Z) - Road Curb Detection and Localization with Monocular Forward-view Vehicle
Camera [74.45649274085447]
We propose a robust method for estimating road curb 3D parameters using a calibrated monocular camera equipped with a fisheye lens.
Our approach is able to estimate the vehicle to curb distance in real time with mean accuracy of more than 90%.
arXiv Detail & Related papers (2020-02-28T00:24:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.