Road Damage Detection and Classification with Detectron2 and Faster
R-CNN
- URL: http://arxiv.org/abs/2010.15021v1
- Date: Wed, 28 Oct 2020 14:53:17 GMT
- Title: Road Damage Detection and Classification with Detectron2 and Faster
R-CNN
- Authors: Vung Pham, Chau Pham, and Tommy Dang
- Abstract summary: We evaluate Detectron2's implementation of Faster R-CNN using different base models and configurations.
We also experiment with these approaches using the Global Road Damage Detection Challenge 2020, A Track in the IEEE Big Data 2020 Big Data Cup Challenge dataset.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The road is vital for many aspects of life, and road maintenance is crucial
for human safety. One of the critical tasks to allow timely repair of road
damages is to quickly and efficiently detect and classify them. This work
details the strategies and experiments evaluated for these tasks. Specifically,
we evaluate Detectron2's implementation of Faster R-CNN using different base
models and configurations. We also experiment with these approaches using the
Global Road Damage Detection Challenge 2020, A Track in the IEEE Big Data 2020
Big Data Cup Challenge dataset. The results show that the X101-FPN base model
for Faster R-CNN with Detectron2's default configurations are efficient and
general enough to be transferable to different countries in this challenge.
This approach results in F1 scores of 51.0% and 51.4% for the test1 and test2
sets of the challenge, respectively. Though the visualizations show good
prediction results, the F1 scores are low. Therefore, we also evaluate the
prediction results against the existing annotations and discover some
discrepancies. Thus, we also suggest strategies to improve the labeling process
for this dataset.
Related papers
- DiRecNetV2: A Transformer-Enhanced Network for Aerial Disaster Recognition [4.678150356894011]
integration of Unmanned Aerial Vehicles with artificial intelligence (AI) models for aerial imagery processing in disaster assessment requires exceptional accuracy, computational efficiency, and real-time processing capabilities.
Traditionally Convolutional Neural Networks (CNNs) demonstrate efficiency in local feature extraction but are limited by their potential for global context interpretation.
Vision Transformers (ViTs) show promise for improved global context interpretation through the use of attention mechanisms, although they still remain underinvestigated in UAV-based disaster response applications.
arXiv Detail & Related papers (2024-10-17T15:25:13Z) - TrackFormers: In Search of Transformer-Based Particle Tracking for the High-Luminosity LHC Era [2.9052912091435923]
High-Energy Physics experiments are facing a multi-fold data increase with every new iteration.
One such step in need of an overhaul is the task of particle track reconstruction, a.k.a., tracking.
A Machine Learning-assisted solution is expected to provide significant improvements.
arXiv Detail & Related papers (2024-07-09T18:47:25Z) - Predicting the duration of traffic incidents for Sydney greater metropolitan area using machine learning methods [3.3373764108905446]
This research presents a comprehensive approach to predicting the duration of traffic incidents and classifying them as short-term or long-term.
We train and evaluate a variety of advanced machine learning models including Gradient Boosted Decision Trees (GBDT), Random Forest, LightGBM, and XGBoost.
XGBoost and LightGBM outperform conventional models with XGBoost achieving the lowest RMSE of 33.7 for predicting incident duration and highest classification F1 score of 0.62 for a 30-minute duration threshold.
arXiv Detail & Related papers (2024-06-27T03:16:09Z) - SIRST-5K: Exploring Massive Negatives Synthesis with Self-supervised
Learning for Robust Infrared Small Target Detection [53.19618419772467]
Single-frame infrared small target (SIRST) detection aims to recognize small targets from clutter backgrounds.
With the development of Transformer, the scale of SIRST models is constantly increasing.
With a rich diversity of infrared small target data, our algorithm significantly improves the model performance and convergence speed.
arXiv Detail & Related papers (2024-03-08T16:14:54Z) - Evaluating Graph Neural Networks for Link Prediction: Current Pitfalls
and New Benchmarking [66.83273589348758]
Link prediction attempts to predict whether an unseen edge exists based on only a portion of edges of a graph.
A flurry of methods have been introduced in recent years that attempt to make use of graph neural networks (GNNs) for this task.
New and diverse datasets have also been created to better evaluate the effectiveness of these new models.
arXiv Detail & Related papers (2023-06-18T01:58:59Z) - Robust Trajectory Prediction against Adversarial Attacks [84.10405251683713]
Trajectory prediction using deep neural networks (DNNs) is an essential component of autonomous driving systems.
These methods are vulnerable to adversarial attacks, leading to serious consequences such as collisions.
In this work, we identify two key ingredients to defend trajectory prediction models against adversarial attacks.
arXiv Detail & Related papers (2022-07-29T22:35:05Z) - Oriented R-CNN for Object Detection [61.78746189807462]
This work proposes an effective and simple oriented object detection framework, termed Oriented R-CNN.
In the first stage, we propose an oriented Region Proposal Network (oriented RPN) that directly generates high-quality oriented proposals in a nearly cost-free manner.
The second stage is oriented R-CNN head for refining oriented Regions of Interest (oriented RoIs) and recognizing them.
arXiv Detail & Related papers (2021-08-12T12:47:43Z) - CNN Model & Tuning for Global Road Damage Detection [0.0]
We assess single and multi-stage network architectures for object detection.
Data preparation for provided Road Damage training dataset, captured using smartphone camera from Czech, India and Japan is discussed.
We show a mean F1 score of 0.542 on Test2 and 0.536 on Test1 datasets using a multi-stage Faster R-CNN model, with Resnet-50 and ResnetCSP-101 backbones respectively.
arXiv Detail & Related papers (2021-03-17T09:01:23Z) - Road Damage Detection using Deep Ensemble Learning [36.24563211765782]
We present an ensemble model for efficient detection and classification of road damages.
Our solution utilizes a state-of-the-art object detector known as You Only Look Once (YOLO-v4)
It was able to achieve an F1 score of 0.628 on the test 1 dataset and 0.6358 on the test 2 dataset.
arXiv Detail & Related papers (2020-10-30T03:18:14Z) - Targeted Physical-World Attention Attack on Deep Learning Models in Road
Sign Recognition [79.50450766097686]
This paper proposes the targeted attention attack (TAA) method for real world road sign attack.
Experimental results validate that the TAA method improves the attack successful rate (nearly 10%) and reduces the perturbation loss (about a quarter) compared with the popular RP2 method.
arXiv Detail & Related papers (2020-10-09T02:31:34Z) - Cascaded Regression Tracking: Towards Online Hard Distractor
Discrimination [202.2562153608092]
We propose a cascaded regression tracker with two sequential stages.
In the first stage, we filter out abundant easily-identified negative candidates.
In the second stage, a discrete sampling based ridge regression is designed to double-check the remaining ambiguous hard samples.
arXiv Detail & Related papers (2020-06-18T07:48:01Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.