Dual flow fusion model for concrete surface crack segmentation
- URL: http://arxiv.org/abs/2305.05132v2
- Date: Tue, 16 May 2023 13:26:54 GMT
- Title: Dual flow fusion model for concrete surface crack segmentation
- Authors: Yuwei Duan
- Abstract summary: Cracks and other damage pose a significant threat to the safe operation of transportation infrastructure.
Deep learning models have been widely applied to practical visual segmentation tasks.
This paper proposes a crack segmentation model based on the fusion of dual streams.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The existence of cracks and other damage poses a significant threat to the
safe operation of transportation infrastructure. Traditional manual detection
and ultrasound equipment testing consume a lot of time and resources. With the
development of deep learning technology, many deep learning models have been
widely applied to practical visual segmentation tasks. The detection method
based on deep learning models has the advantages of high detection accuracy,
fast detection speed, and simple operation. However, deep learning-based crack
segmentation models are sensitive to background noise, have rough edges, and
lack robustness. Therefore, this paper proposes a crack segmentation model
based on the fusion of dual streams. The image is fed simultaneously into
two designed processing streams to independently extract long-distance
dependence and local detail features. The adaptive prediction is achieved
through the dual-headed mechanism. Meanwhile, a novel interaction fusion
mechanism is proposed to guide the complementarity of different feature layers to
achieve crack location and recognition in complex backgrounds. Finally, an edge
optimization method is proposed to improve the accuracy of segmentation.
Experiments show that the F1 value of segmentation results on the DeepCrack[1]
public dataset is 93.7% and the IOU value is 86.6%. The F1 value of
segmentation results on the CRACK500[2] dataset is 78.1%, and the IOU value is
66.0%.
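The F1 and IoU figures quoted above are standard overlap metrics for binary segmentation masks. The sketch below shows one common way to compute them from pixel-wise counts; it is not the paper's evaluation code. The function names, the toy masks, and the choice to return 1.0 when both masks are empty are assumptions made for illustration, and published results may additionally depend on thresholding and per-image averaging choices that the abstract does not specify.

```python
from typing import Sequence, Tuple

# A 2D binary mask: 1 marks a crack pixel, 0 marks background.
Mask = Sequence[Sequence[int]]


def confusion_counts(pred: Mask, target: Mask) -> Tuple[int, int, int]:
    """Count true positives, false positives, and false negatives over all pixels."""
    tp = fp = fn = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            if p and t:
                tp += 1
            elif p and not t:
                fp += 1
            elif t and not p:
                fn += 1
    return tp, fp, fn


def f1_and_iou(pred: Mask, target: Mask) -> Tuple[float, float]:
    """Return (F1, IoU) for the crack class.

    F1  = 2*TP / (2*TP + FP + FN)
    IoU =   TP / (TP + FP + FN)
    """
    tp, fp, fn = confusion_counts(pred, target)
    if tp + fp + fn == 0:
        # Assumed convention: two empty masks count as a perfect match.
        return 1.0, 1.0
    f1 = 2 * tp / (2 * tp + fp + fn)
    iou = tp / (tp + fp + fn)
    return f1, iou


# Hypothetical 3x3 example: 2 pixels agree, 1 is a false alarm, 1 is missed.
pred = [[1, 1, 0],
        [0, 1, 0],
        [0, 0, 0]]
target = [[1, 0, 0],
          [0, 1, 1],
          [0, 0, 0]]

f1, iou = f1_and_iou(pred, target)
print(f"F1 = {f1:.3f}, IoU = {iou:.3f}")
```

Because both metrics are built from the same three counts, IoU is never higher than F1 on a given mask pair, which is consistent with the ordering of the values reported for both datasets.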
Related papers
- CrackSegDiff: Diffusion Probability Model-based Multi-modal Crack Segmentation [5.534972596061796]
We propose a novel DPM-based approach for crack segmentation, named CrackSegDiff.
Our approach employs Vm-unet to efficiently capture long-range information of the original data.
CrackSegDiff outperforms state-of-the-art methods, particularly in the detection of shallow cracks.
arXiv Detail & Related papers (2024-10-10T16:44:10Z)
- Hybrid-Segmentor: A Hybrid Approach to Automated Fine-Grained Crack Segmentation in Civil Infrastructure [52.2025114590481]
We introduce Hybrid-Segmentor, an encoder-decoder based approach that is capable of extracting both fine-grained local and global crack features.
This allows the model to improve its generalization capabilities in distinguishing various types of shapes, surfaces, and sizes of cracks.
The proposed model outperforms existing benchmark models across 5 quantitative metrics (accuracy 0.971, precision 0.804, recall 0.744, F1-score 0.770, and IoU score 0.630), achieving state-of-the-art status.
arXiv Detail & Related papers (2024-09-04T16:47:16Z)
- A new method for optical steel rope non-destructive damage detection [3.195044561824979]
This paper presents a novel algorithm for non-destructive damage detection for steel ropes in high-altitude environments (aerial ropeway).
A segmentation model named RGBD-UNet is designed to accurately extract steel ropes from complex backgrounds.
A detection model named VovNetV3.5 is developed to differentiate between normal and abnormal steel ropes.
arXiv Detail & Related papers (2024-02-06T09:39:05Z)
- Augmentation is AUtO-Net: Augmentation-Driven Contrastive Multiview Learning for Medical Image Segmentation [3.1002416427168304]
This thesis focuses on retinal blood vessel segmentation tasks.
It provides an extensive literature review of deep learning-based medical image segmentation approaches.
It proposes a novel, efficient, and simple multiview learning framework.
arXiv Detail & Related papers (2023-11-02T06:31:08Z)
- CrossDF: Improving Cross-Domain Deepfake Detection with Deep Information Decomposition [53.860796916196634]
We propose a Deep Information Decomposition (DID) framework to enhance the performance of Cross-dataset Deepfake Detection (CrossDF).
Unlike most existing deepfake detection methods, our framework prioritizes high-level semantic features over specific visual artifacts.
It adaptively decomposes facial features into deepfake-related and irrelevant information, only using the intrinsic deepfake-related information for real/fake discrimination.
arXiv Detail & Related papers (2023-09-30T12:30:25Z)
- Towards Better Certified Segmentation via Diffusion Models [62.21617614504225]
Segmentation models can be vulnerable to adversarial perturbations, which hinders their use in critical-decision systems like healthcare or autonomous driving.
Recently, randomized smoothing has been proposed to certify segmentation predictions by adding Gaussian noise to the input to obtain theoretical guarantees.
In this paper, we address the problem of certifying segmentation prediction using a combination of randomized smoothing and diffusion models.
arXiv Detail & Related papers (2023-06-16T16:30:39Z)
- Detection of Pavement Cracks by Deep Learning Models of Transformer and UNet [9.483452333312373]
In recent years, the emergence and development of deep learning techniques have shown great potential to facilitate surface crack detection.
In this study, we investigated nine promising models to evaluate their performance in pavement surface crack detection by model accuracy, computational complexity, and model stability.
We find that transformer-based models generally converge more easily during training and achieve higher accuracy, but usually consume more memory and have lower processing efficiency.
arXiv Detail & Related papers (2023-04-25T06:07:49Z)
- Real-Time Scene Text Detection with Differentiable Binarization and Adaptive Scale Fusion [62.269219152425556]
Segmentation-based scene text detection methods have drawn extensive attention in the scene text detection field.
We propose a Differentiable Binarization (DB) module that integrates the binarization process into a segmentation network.
An efficient Adaptive Scale Fusion (ASF) module is proposed to improve the scale robustness by fusing features of different scales adaptively.
arXiv Detail & Related papers (2022-02-21T15:30:14Z)
- Capturing scattered discriminative information using a deep architecture in acoustic scene classification [49.86640645460706]
In this study, we investigate various methods to capture discriminative information and simultaneously mitigate the overfitting problem.
We adopt a max feature map method to replace conventional non-linear activations in a deep neural network.
Two data augmentation methods and two deep architecture modules are further explored to reduce overfitting and sustain the system's discriminative power.
arXiv Detail & Related papers (2020-07-09T08:32:06Z)
- Depthwise Non-local Module for Fast Salient Object Detection Using a Single Thread [136.2224792151324]
We propose a new deep learning algorithm for fast salient object detection.
The proposed algorithm achieves competitive accuracy and high inference efficiency simultaneously with a single CPU thread.
arXiv Detail & Related papers (2020-01-22T15:23:48Z)
This list is automatically generated from the titles and abstracts of the papers on this site.