A Convolutional-Transformer Network for Crack Segmentation with Boundary
Awareness
- URL: http://arxiv.org/abs/2302.11728v3
- Date: Sun, 12 Nov 2023 03:46:41 GMT
- Title: A Convolutional-Transformer Network for Crack Segmentation with Boundary
Awareness
- Authors: Huaqi Tao, Bingxi Liu, Jinqiang Cui and Hong Zhang
- Abstract summary: Cracks play a crucial role in assessing the safety and durability of manufactured buildings.
We propose a novel convolutional-transformer network based on encoder-decoder architecture to solve this challenge.
- Score: 5.98717173705421
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Cracks play a crucial role in assessing the safety and durability of
manufactured buildings. However, the long and sharp topological features and
complex background of cracks make the task of crack segmentation extremely
challenging. In this paper, we propose a novel convolutional-transformer
network based on encoder-decoder architecture to solve this challenge.
Particularly, we designed a Dilated Residual Block (DRB) and a Boundary
Awareness Module (BAM). The DRB pays attention to the local detail of cracks
and adjusts the feature dimension for other blocks as needed. And the BAM
learns the boundary features from the dilated crack label. Furthermore, the DRB
is combined with a lightweight transformer that captures global information to
serve as an effective encoder. Experimental results show that the proposed
network performs better than state-of-the-art algorithms on two typical
datasets. Datasets, code, and trained models are available for research at
https://github.com/HqiTao/CT-crackseg.
Related papers
- CrackCLF: Automatic Pavement Crack Detection based on Closed-Loop
Feedback [14.986335013488643]
CrackCLF is a neural network model that learns to correct errors on its own.
The proposed CLF can be defined as a plug and play module, which can be embedded into different neural network models to improve their performances.
arXiv Detail & Related papers (2023-11-20T14:52:48Z) - Global Context Aggregation Network for Lightweight Saliency Detection of
Surface Defects [70.48554424894728]
We develop a Global Context Aggregation Network (GCANet) for lightweight saliency detection of surface defects on the encoder-decoder structure.
First, we introduce a novel transformer encoder on the top layer of the lightweight backbone, which captures global context information through a novel Depth-wise Self-Attention (DSA) module.
The experimental results on three public defect datasets demonstrate that the proposed network achieves a better trade-off between accuracy and running efficiency compared with other 17 state-of-the-art methods.
arXiv Detail & Related papers (2023-09-22T06:19:11Z) - Real-time High-Resolution Neural Network with Semantic Guidance for
Crack Segmentation [4.651261550392625]
This paper describes HrSegNet, a high-resolution network with semantic guidance specifically designed for crack segmentation.
HrSegNet guarantees real-time inference speed while preserving crack details.
This approach demonstrates that there is a trade-off between high-resolution modeling and real-time detection.
arXiv Detail & Related papers (2023-07-01T08:38:18Z) - Learning-Based Defect Recognitions for Autonomous UAV Inspections [1.713291434132985]
We have implemented a deep learning framework for crack detection based on classical network architectures including Alexnet, VGG, and Resnet.
Inspired by the feature pyramid network architecture, a hierarchical convolutional neural network (CNN) deep learning framework is also proposed.
A framework for automatic unmanned aerial vehicle inspections is also proposed and will be established for the crack inspection tasks of various concrete structures.
arXiv Detail & Related papers (2023-02-13T04:25:05Z) - Defect Transformer: An Efficient Hybrid Transformer Architecture for
Surface Defect Detection [2.0999222360659604]
We propose an efficient hybrid transformer architecture, termed Defect Transformer (DefT), for surface defect detection.
DefT incorporates CNN and transformer into a unified model to capture local and non-local relationships collaboratively.
Experiments on three datasets demonstrate the superiority and efficiency of our method compared with other CNN- and transformer-based networks.
arXiv Detail & Related papers (2022-07-17T23:37:48Z) - BCS-Net: Boundary, Context and Semantic for Automatic COVID-19 Lung
Infection Segmentation from CT Images [83.82141604007899]
BCS-Net is a novel network for automatic COVID-19 lung infection segmentation from CT images.
BCS-Net follows an encoder-decoder architecture, and more designs focus on the decoder stage.
In each BCSR block, the attention-guided global context (AGGC) module is designed to learn the most valuable encoder features for decoder.
arXiv Detail & Related papers (2022-07-17T08:54:07Z) - CarNet: A Lightweight and Efficient Encoder-Decoder Architecture for
High-quality Road Crack Detection [21.468229247797627]
We present a lightweight encoder-decoder architecture, CarNet, for efficient and high-quality crack detection.
In particular, we propose that the ideal encoder should present an olive-type distribution about the number of convolutional layers at different stages.
In the decoder, we introduce a lightweight up-sampling feature pyramid module to learn rich hierarchical features for crack detection.
arXiv Detail & Related papers (2021-09-13T05:01:34Z) - Crosslink-Net: Double-branch Encoder Segmentation Network via Fusing
Vertical and Horizontal Convolutions [58.71117402626524]
We present a novel double-branch encoder architecture for medical image segmentation.
Our architecture is inspired by two observations: 1) Since the discrimination of features learned via square convolutional kernels needs to be further improved, we propose to utilize non-square vertical and horizontal convolutional kernels.
The experiments validate the effectiveness of our model on four datasets.
arXiv Detail & Related papers (2021-07-24T02:58:32Z) - Boundary-Aware Segmentation Network for Mobile and Web Applications [60.815545591314915]
Boundary-Aware Network (BASNet) is integrated with a predict-refine architecture and a hybrid loss for highly accurate image segmentation.
BASNet runs at over 70 fps on a single GPU which benefits many potential real applications.
Based on BASNet, we further developed two (close to) commercial applications: AR COPY & PASTE, in which BASNet is augmented reality for "COPY" and "PASTING" real-world objects, and OBJECT CUT, which is a web-based tool for automatic object background removal.
arXiv Detail & Related papers (2021-01-12T19:20:26Z) - Suppress and Balance: A Simple Gated Network for Salient Object
Detection [89.88222217065858]
We propose a simple gated network (GateNet) to solve both issues at once.
With the help of multilevel gate units, the valuable context information from the encoder can be optimally transmitted to the decoder.
In addition, we adopt the atrous spatial pyramid pooling based on the proposed "Fold" operation (Fold-ASPP) to accurately localize salient objects of various scales.
arXiv Detail & Related papers (2020-07-16T02:00:53Z) - End-to-End Object Detection with Transformers [88.06357745922716]
We present a new method that views object detection as a direct set prediction problem.
Our approach streamlines the detection pipeline, effectively removing the need for many hand-designed components.
The main ingredients of the new framework, called DEtection TRansformer or DETR, are a set-based global loss.
arXiv Detail & Related papers (2020-05-26T17:06:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.