Related papers: A Convolutional-Transformer Network for Crack Segmentation with Boundary Awareness

URL: http://arxiv.org/abs/2302.11728v3
Date: Sun, 12 Nov 2023 03:46:41 GMT
Title: A Convolutional-Transformer Network for Crack Segmentation with Boundary Awareness
Authors: Huaqi Tao, Bingxi Liu, Jinqiang Cui and Hong Zhang
Abstract summary: Cracks play a crucial role in assessing the safety and durability of manufactured buildings. We propose a novel convolutional-transformer network based on an encoder-decoder architecture for crack segmentation.
Score: 5.98717173705421
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Cracks play a crucial role in assessing the safety and durability of manufactured buildings. However, the long and sharp topological features and complex background of cracks make the task of crack segmentation extremely challenging. In this paper, we propose a novel convolutional-transformer network based on an encoder-decoder architecture to solve this challenge. In particular, we design a Dilated Residual Block (DRB) and a Boundary Awareness Module (BAM). The DRB attends to the local detail of cracks and adjusts the feature dimension for other blocks as needed, while the BAM learns boundary features from the dilated crack label. Furthermore, the DRB is combined with a lightweight transformer that captures global information to serve as an effective encoder. Experimental results show that the proposed network performs better than state-of-the-art algorithms on two typical datasets. Datasets, code, and trained models are available for research at https://github.com/HqiTao/CT-crackseg.
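As a rough illustration of what a "dilated crack label" could look like, the sketch below dilates a binary crack mask and extracts the band of pixels added by the dilation. This is not the authors' implementation: the function names `dilate` and `boundary_band`, the square structuring element, and the `radius` parameter are all assumptions made for this example, and the abstract does not say how the BAM consumes the dilated label.

```python
import numpy as np


def dilate(mask: np.ndarray, radius: int = 1) -> np.ndarray:
    """Binary dilation with a (2*radius+1) x (2*radius+1) square element.

    Hypothetical helper for illustration; not taken from the paper's code.
    """
    mask = mask.astype(bool)
    h, w = mask.shape
    # Pad with background so shifted windows never read outside the image.
    padded = np.pad(mask, radius, mode="constant", constant_values=False)
    out = np.zeros_like(mask)
    # OR together every shifted copy of the mask within the square window.
    for dy in range(2 * radius + 1):
        for dx in range(2 * radius + 1):
            out |= padded[dy:dy + h, dx:dx + w]
    return out


def boundary_band(mask: np.ndarray, radius: int = 1) -> np.ndarray:
    """Pixels gained by dilation: a thin band surrounding the crack."""
    mask = mask.astype(bool)
    return dilate(mask, radius) & ~mask


# Toy label: a 3-pixel horizontal crack in a 5x5 image.
label = np.zeros((5, 5), dtype=np.uint8)
label[2, 1:4] = 1

dilated = dilate(label)      # thickened crack label
band = boundary_band(label)  # ring of pixels around the original crack
```

Thin cracks occupy very few pixels, so a thickened label of this kind is one plausible way to give a boundary-oriented module more supervision signal near crack edges; the repository linked in the abstract is the place to check how the authors actually construct it.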

Related papers

Dual-branch Graph Feature Learning for NLOS Imaging [51.31554007495926]
Non-line-of-sight (NLOS) imaging offers the capability to reveal occluded scenes that are not directly visible. The proposed methodology integrates an albedo-focused reconstruction branch dedicated to albedo information recovery and a depth-focused reconstruction branch that extracts geometrical structure. Our method attains the highest level of performance among existing methods across synthetic and real data.
arXiv Detail & Related papers (2025-02-27T01:49:00Z)
FlexiCrackNet: A Flexible Pipeline for Enhanced Crack Segmentation with General Features Transfered from SAM [24.99233476254989]
FlexiCrackNet is a novel pipeline that seamlessly integrates traditional deep learning paradigms with the strengths of large-scale pre-trained models. Experiments show that FlexiCrackNet outperforms state-of-the-art methods and excels in zero-shot generalization, computational efficiency, and segmentation robustness. These advancements underscore the potential of FlexiCrackNet for real-world applications in automated crack detection and comprehensive structural health monitoring systems.
arXiv Detail & Related papers (2025-01-31T02:37:09Z)
CrossDiff: Diffusion Probabilistic Model With Cross-conditional Encoder-Decoder for Crack Segmentation [5.69969816883978]
We propose a novel diffusion-based model with a cross-conditional encoder-decoder, named CrossDiff. The proposed CrossDiff model achieves impressive performance, outperforming other state-of-the-art methods by 8.0% in terms of both Dice score and IoU.
arXiv Detail & Related papers (2025-01-22T13:13:41Z)
DSCformer: A Dual-Branch Network Integrating Enhanced Dynamic Snake Convolution and SegFormer for Crack Segmentation [6.898227391740093]
Current convolutional neural networks (CNNs) have demonstrated strong performance in crack segmentation tasks. Transformers excel at capturing global context but lack precision in detailed feature extraction. We introduce DSCformer, a novel hybrid model that integrates an enhanced Dynamic Snake Convolution (DSConv) with a Transformer architecture for crack segmentation.
arXiv Detail & Related papers (2024-11-14T11:25:32Z)
Topology-aware Mamba for Crack Segmentation in Structures [5.9184143707401775]
CrackMamba, a Mamba-based model, is designed for efficient and accurate crack segmentation for monitoring the structural health of infrastructure. CrackMamba addresses these challenges by utilizing the VMambaV2 with pre-trained ImageNet-1k weights as the encoder and a newly designed decoder for better performance. Experiments show that CrackMamba achieves state-of-the-art (SOTA) performance on the CrackSeg9k and SewerCrack datasets, and demonstrates competitive performance on the retinal vessel segmentation dataset CHASE_DB1.
arXiv Detail & Related papers (2024-10-25T15:17:52Z)
Hybrid-Segmentor: A Hybrid Approach to Automated Fine-Grained Crack Segmentation in Civil Infrastructure [52.2025114590481]
We introduce Hybrid-Segmentor, an encoder-decoder based approach that is capable of extracting both fine-grained local and global crack features. This allows the model to improve its generalization capabilities in distinguishing various types of shapes, surfaces, and sizes of cracks. The proposed model outperforms existing benchmark models across 5 quantitative metrics (accuracy 0.971, precision 0.804, recall 0.744, F1-score 0.770, and IoU score 0.630), achieving state-of-the-art status.
arXiv Detail & Related papers (2024-09-04T16:47:16Z)
Staircase Cascaded Fusion of Lightweight Local Pattern Recognition and Long-Range Dependencies for Structural Crack Segmentation [28.157401919910914]
We propose a staircase cascaded fusion crack segmentation network (CrackSCF) that generates high-quality crack segmentation maps using minimal computational resources. We constructed a staircase cascaded fusion module that effectively captures local patterns of cracks and long-range dependencies of pixels. To reduce the computational resources required by the model, we introduced a lightweight convolution block, which replaces all convolution operations in the network.
arXiv Detail & Related papers (2024-08-23T03:21:51Z)
Real-time High-Resolution Neural Network with Semantic Guidance for Crack Segmentation [4.651261550392625]
This paper describes HrSegNet, a high-resolution network with semantic guidance specifically designed for crack segmentation. HrSegNet guarantees real-time inference speed while preserving crack details. This approach demonstrates that there is a trade-off between high-resolution modeling and real-time detection.
arXiv Detail & Related papers (2023-07-01T08:38:18Z)
Learning-Based Defect Recognitions for Autonomous UAV Inspections [1.713291434132985]
We have implemented a deep learning framework for crack detection based on classical network architectures including Alexnet, VGG, and Resnet. Inspired by the feature pyramid network architecture, a hierarchical convolutional neural network (CNN) deep learning framework is also proposed. A framework for automatic unmanned aerial vehicle inspections is also proposed and will be established for the crack inspection tasks of various concrete structures.
arXiv Detail & Related papers (2023-02-13T04:25:05Z)
BCS-Net: Boundary, Context and Semantic for Automatic COVID-19 Lung Infection Segmentation from CT Images [83.82141604007899]
BCS-Net is a novel network for automatic COVID-19 lung infection segmentation from CT images. BCS-Net follows an encoder-decoder architecture, with most of its design effort focused on the decoder stage. In each BCSR block, the attention-guided global context (AGGC) module is designed to learn the most valuable encoder features for the decoder.
arXiv Detail & Related papers (2022-07-17T08:54:07Z)
Crosslink-Net: Double-branch Encoder Segmentation Network via Fusing Vertical and Horizontal Convolutions [58.71117402626524]
We present a novel double-branch encoder architecture for medical image segmentation. Our architecture is inspired by two observations: 1) Since the discrimination of features learned via square convolutional kernels needs to be further improved, we propose to utilize non-square vertical and horizontal convolutional kernels. The experiments validate the effectiveness of our model on four datasets.
arXiv Detail & Related papers (2021-07-24T02:58:32Z)
Boundary-Aware Segmentation Network for Mobile and Web Applications [60.815545591314915]
Boundary-Aware Network (BASNet) is integrated with a predict-refine architecture and a hybrid loss for highly accurate image segmentation. BASNet runs at over 70 fps on a single GPU, which benefits many potential real applications. Based on BASNet, we further developed two (close to) commercial applications: AR COPY & PASTE, in which BASNet is integrated with augmented reality for "COPYING" and "PASTING" real-world objects, and OBJECT CUT, which is a web-based tool for automatic object background removal.
arXiv Detail & Related papers (2021-01-12T19:20:26Z)
Suppress and Balance: A Simple Gated Network for Salient Object Detection [89.88222217065858]
We propose a simple gated network (GateNet) to solve both issues at once. With the help of multilevel gate units, the valuable context information from the encoder can be optimally transmitted to the decoder. In addition, we adopt the atrous spatial pyramid pooling based on the proposed "Fold" operation (Fold-ASPP) to accurately localize salient objects of various scales.
arXiv Detail & Related papers (2020-07-16T02:00:53Z)
End-to-End Object Detection with Transformers [88.06357745922716]
We present a new method that views object detection as a direct set prediction problem. Our approach streamlines the detection pipeline, effectively removing the need for many hand-designed components. The main ingredients of the new framework, called DEtection TRansformer or DETR, are a set-based global loss that forces unique predictions via bipartite matching, and a transformer encoder-decoder architecture.
arXiv Detail & Related papers (2020-05-26T17:06:38Z)

This list is automatically generated from the titles and abstracts of the papers in this site.