YOLOv5s-GTB: light-weighted and improved YOLOv5s for bridge crack
detection
- URL: http://arxiv.org/abs/2206.01498v1
- Date: Fri, 3 Jun 2022 10:52:59 GMT
- Title: YOLOv5s-GTB: light-weighted and improved YOLOv5s for bridge crack
detection
- Authors: Xiao Ruiqiang
- Abstract summary: This study proposes a light-weighted, high-precision, deep learning-based bridge apparent crack recognition model that can be deployed in mobile devices' scenarios.
YOLOv5 is identified as the basic framework for the light-weighted crack detection model through experiments for comparison and validation.
The improved model has 42% fewer parameters and faster inference response, but also significantly outperforms the original model in terms of accuracy and mAP.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In response to the situation that the conventional bridge crack manual
detection method has a large amount of human and material resources wasted,
this study is aimed to propose a light-weighted, high-precision, deep
learning-based bridge apparent crack recognition model that can be deployed in
mobile devices' scenarios. In order to enhance the performance of YOLOv5,
firstly, the data augmentation methods are supplemented, and then the YOLOv5
series algorithm is trained to select a suitable basic framework. The YOLOv5s
is identified as the basic framework for the light-weighted crack detection
model through experiments for comparison and validation.By replacing the
traditional DarkNet backbone network of YOLOv5s with GhostNet backbone network,
introducing Transformer multi-headed self-attention mechanism and
bi-directional feature pyramid network (BiFPN) to replace the commonly used
feature pyramid network, the improved model not only has 42% fewer parameters
and faster inference response, but also significantly outperforms the original
model in terms of accuracy and mAP (8.5% and 1.1% improvement, respectively).
Luckily each improved part has a positive impact on the result. This paper
provides a feasible idea to establish a digital operation management system in
the field of highway and bridge in the future and to implement the whole life
cycle structure health monitoring of civil infrastructure in China.
Related papers
- Benchmarking YOLOv8 for Optimal Crack Detection in Civil Infrastructure [9.147836981924016]
This study bridges the gap by rigorously evaluating YOLOv8's performance across five model scales.
YOLOv8 delivered exceptional accuracy and speed, setting a new benchmark for real-time crack detection.
This work paves the way for safer, more efficient transportation systems worldwide.
arXiv Detail & Related papers (2025-01-12T20:17:46Z) - YOLOv5, YOLOv8 and YOLOv10: The Go-To Detectors for Real-time Vision [0.6662800021628277]
This paper focuses on the evolution of the YOLO (You Only Look Once) object detection algorithm, focusing on YOLOv5, YOLOv8, and YOLOv10.
We analyze the architectural advancements, performance improvements, and suitability for edge deployment across these versions.
arXiv Detail & Related papers (2024-07-03T10:40:20Z) - YOLOv10: Real-Time End-to-End Object Detection [68.28699631793967]
YOLOs have emerged as the predominant paradigm in the field of real-time object detection.
The reliance on the non-maximum suppression (NMS) for post-processing hampers the end-to-end deployment of YOLOs.
We introduce the holistic efficiency-accuracy driven model design strategy for YOLOs.
arXiv Detail & Related papers (2024-05-23T11:44:29Z) - Fostc3net:A Lightweight YOLOv5 Based On the Network Structure Optimization [11.969138981034247]
This paper presents an enhanced lightweight YOLOv5 technique customized for mobile devices.
The proposed model achieves a 1% increase in detection accuracy, a 13% reduction in FLOPs, and a 26% decrease in model parameters compared to the existing YOLOv5.
arXiv Detail & Related papers (2024-03-20T16:07:04Z) - PREM: A Simple Yet Effective Approach for Node-Level Graph Anomaly
Detection [65.24854366973794]
Node-level graph anomaly detection (GAD) plays a critical role in identifying anomalous nodes from graph-structured data in domains such as medicine, social networks, and e-commerce.
We introduce a simple method termed PREprocessing and Matching (PREM for short) to improve the efficiency of GAD.
Our approach streamlines GAD, reducing time and memory consumption while maintaining powerful anomaly detection capabilities.
arXiv Detail & Related papers (2023-10-18T02:59:57Z) - Improvement and Enhancement of YOLOv5 Small Target Recognition Based on
Multi-module Optimization [9.125818713673366]
The performance of the model is successfully enhanced by introducing GhostNet-based convolutional module, RepGFPN-based Neck module optimization, CA and Transformer's attention mechanism, and loss function improvement using NWD.
The improved model shows significant superiority in dealing with complex backgrounds and tiny targets in real-world application tests.
arXiv Detail & Related papers (2023-10-03T05:39:36Z) - A lightweight and accurate YOLO-like network for small target detection
in Aerial Imagery [94.78943497436492]
We present YOLO-S, a simple, fast and efficient network for small target detection.
YOLO-S exploits a small feature extractor based on Darknet20, as well as skip connection, via both bypass and concatenation.
YOLO-S has an 87% decrease of parameter size and almost one half FLOPs of YOLOv3, making practical the deployment for low-power industrial applications.
arXiv Detail & Related papers (2022-04-05T16:29:49Z) - Improved YOLOv5 network for real-time multi-scale traffic sign detection [4.5598087061051755]
We propose an improved feature pyramid model, named AF-FPN, which utilize the adaptive attention module (AAM) and feature enhancement module (FEM) to reduce the information loss in the process of feature map generation.
We replace the original feature pyramid network in YOLOv5 with AF-FPN, which improves the detection performance for multi-scale targets of the YOLOv5 network.
arXiv Detail & Related papers (2021-12-16T11:02:12Z) - Benchmarking Detection Transfer Learning with Vision Transformers [60.97703494764904]
complexity of object detection methods can make benchmarking non-trivial when new architectures, such as Vision Transformer (ViT) models, arrive.
We present training techniques that overcome these challenges, enabling the use of standard ViT models as the backbone of Mask R-CNN.
Our results show that recent masking-based unsupervised learning methods may, for the first time, provide convincing transfer learning improvements on COCO.
arXiv Detail & Related papers (2021-11-22T18:59:15Z) - Anchor-free Small-scale Multispectral Pedestrian Detection [88.7497134369344]
We propose a method for effective and efficient multispectral fusion of the two modalities in an adapted single-stage anchor-free base architecture.
We aim at learning pedestrian representations based on object center and scale rather than direct bounding box predictions.
Results show our method's effectiveness in detecting small-scaled pedestrians.
arXiv Detail & Related papers (2020-08-19T13:13:01Z) - Progressively Guided Alternate Refinement Network for RGB-D Salient
Object Detection [63.18846475183332]
We aim to develop an efficient and compact deep network for RGB-D salient object detection.
We propose a progressively guided alternate refinement network to refine it.
Our model outperforms existing state-of-the-art approaches by a large margin.
arXiv Detail & Related papers (2020-08-17T02:55:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.