Segmentation method of U-net sheet metal engineering drawing based on
CBAM attention mechanism
- URL: http://arxiv.org/abs/2209.14102v2
- Date: Thu, 27 Apr 2023 06:19:51 GMT
- Title: Segmentation method of U-net sheet metal engineering drawing based on
CBAM attention mechanism
- Authors: Zhiwei Song, Hui Yao
- Abstract summary: This paper proposes a U-net-based method for the segmentation and extraction of specific units in welding engineering drawings.
Using vgg16 as the backbone network, experiments have verified that the IoU, mAP, and Accu of our model in the welding engineering drawing dataset segmentation task are 84.72%, 86.84%, and 99.42%, respectively.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In the manufacturing process of heavy industrial equipment, the specific unit
in the welding diagram is first manually redrawn and then the corresponding
sheet metal parts are cut, which is inefficient. To this end, this paper
proposes a U-net-based method for the segmentation and extraction of specific
units in welding engineering drawings. This method enables the cutting device
to automatically segment specific graphic units according to visual information
and automatically cut out sheet metal parts of corresponding shapes according
to the segmentation results. This process is more efficient than traditional
human-assisted cutting. Two weaknesses in the U-net network will lead to a
decrease in segmentation performance: first, the focus on global semantic
feature information is weak, and second, there is a large dimensional
difference between shallow encoder features and deep decoder features. Based on
the CBAM (Convolutional Block Attention Module) attention mechanism, this paper
proposes a U-net jump structure model with an attention mechanism to improve
the network's global semantic feature extraction ability. In addition, a U-net
attention mechanism model with dual pooling convolution fusion is designed, the
deep encoder's maximum pooling + convolution features and the shallow encoder's
average pooling + convolution features are fused vertically to reduce the
dimension difference between the shallow encoder and deep decoder. The
dual-pool convolutional attention jump structure replaces the traditional U-net
jump structure, which can effectively improve the specific unit segmentation
performance of the welding engineering drawing. Using vgg16 as the backbone
network, experiments have verified that the IoU, mAP, and Accu of our model in
the welding engineering drawing dataset segmentation task are 84.72%, 86.84%,
and 99.42%, respectively.
Related papers
- Prototype Learning Guided Hybrid Network for Breast Tumor Segmentation in DCE-MRI [58.809276442508256]
We propose a hybrid network via the combination of convolution neural network (CNN) and transformer layers.
The experimental results on private and public DCE-MRI datasets demonstrate that the proposed hybrid network superior performance than the state-of-the-art methods.
arXiv Detail & Related papers (2024-08-11T15:46:00Z) - Bilateral Network with Residual U-blocks and Dual-Guided Attention for
Real-time Semantic Segmentation [18.393208069320362]
We design a new fusion mechanism for two-branch architecture which is guided by attention computation.
To be precise, we use the Dual-Guided Attention (DGA) module we proposed to replace some multi-scale transformations.
Experiments on Cityscapes and CamVid dataset show the effectiveness of our method.
arXiv Detail & Related papers (2023-10-31T09:20:59Z) - Unite-Divide-Unite: Joint Boosting Trunk and Structure for High-accuracy
Dichotomous Image Segmentation [48.995367430746086]
High-accuracy Dichotomous Image rendering (DIS) aims to pinpoint category-agnostic foreground objects from natural scenes.
We introduce a novel Unite-Divide-Unite Network (UDUN) that restructures and bipartitely arranges complementary features to boost the effectiveness of trunk and structure identification.
Using 1024*1024 input, our model enables real-time inference at 65.3 fps with ResNet-18.
arXiv Detail & Related papers (2023-07-26T09:04:35Z) - Joint Channel Estimation and Feedback with Masked Token Transformers in
Massive MIMO Systems [74.52117784544758]
This paper proposes an encoder-decoder based network that unveils the intrinsic frequency-domain correlation within the CSI matrix.
The entire encoder-decoder network is utilized for channel compression.
Our method outperforms state-of-the-art channel estimation and feedback techniques in joint tasks.
arXiv Detail & Related papers (2023-06-08T06:15:17Z) - EAA-Net: Rethinking the Autoencoder Architecture with Intra-class
Features for Medical Image Segmentation [4.777011444412729]
We propose a light-weight end-to-end segmentation framework based on multi-task learning, termed Edge Attention autoencoder Network (EAA-Net)
Our approach not only utilizes the segmentation network to obtain inter-class features, but also applies the reconstruction network to extract intra-class features among the foregrounds.
Experimental results show that our method performs well in medical image segmentation tasks.
arXiv Detail & Related papers (2022-08-19T07:42:55Z) - MGAE: Masked Autoencoders for Self-Supervised Learning on Graphs [55.66953093401889]
Masked graph autoencoder (MGAE) framework to perform effective learning on graph structure data.
Taking insights from self-supervised learning, we randomly mask a large proportion of edges and try to reconstruct these missing edges during training.
arXiv Detail & Related papers (2022-01-07T16:48:07Z) - GridDehazeNet+: An Enhanced Multi-Scale Network with Intra-Task
Knowledge Transfer for Single Image Dehazing [12.982905875008214]
We propose an enhanced multi-scale network, dubbed GridDehazeNet+, for single image dehazing.
It consists of three modules: pre-processing, backbone, and post-processing.
arXiv Detail & Related papers (2021-03-25T17:35:36Z) - Multi-stage Attention ResU-Net for Semantic Segmentation of
Fine-Resolution Remote Sensing Images [9.398340832493457]
We propose a Linear Attention Mechanism (LAM) to address this issue.
LAM is approximately equivalent to dot-product attention with computational efficiency.
We design a Multi-stage Attention ResU-Net for semantic segmentation from fine-resolution remote sensing images.
arXiv Detail & Related papers (2020-11-29T07:24:21Z) - Efficient Medical Image Segmentation with Intermediate Supervision
Mechanism [48.244918515770514]
The expansion path of U-Net may ignore the characteristics of small targets, so an intermediate supervision mechanism is proposed.
Although the intermediate supervision mechanism improves the segmentation accuracy, the training time is too long due to the extra input and multiple loss functions.
To reduce the redundancy of the model, we combine shared-weight decoder module with tied-weight decoder module.
arXiv Detail & Related papers (2020-11-15T13:46:00Z) - Multi-scale Attention U-Net (MsAUNet): A Modified U-Net Architecture for
Scene Segmentation [1.713291434132985]
We propose a novel multi-scale attention network for scene segmentation by using contextual information from an image.
This network can map local features with their global counterparts with improved accuracy and emphasize on discriminative image regions.
We have evaluated our model on two standard datasets named PascalVOC2012 and ADE20k.
arXiv Detail & Related papers (2020-09-15T08:03:41Z) - Augmented Parallel-Pyramid Net for Attention Guided Pose-Estimation [90.28365183660438]
This paper proposes an augmented parallel-pyramid net with attention partial module and differentiable auto-data augmentation.
We define a new pose search space where the sequences of data augmentations are formulated as a trainable and operational CNN component.
Notably, our method achieves the top-1 accuracy on the challenging COCO keypoint benchmark and the state-of-the-art results on the MPII datasets.
arXiv Detail & Related papers (2020-03-17T03:52:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.