MCPA: Multi-scale Cross Perceptron Attention Network for 2D Medical
Image Segmentation
- URL: http://arxiv.org/abs/2307.14588v1
- Date: Thu, 27 Jul 2023 02:18:12 GMT
- Title: MCPA: Multi-scale Cross Perceptron Attention Network for 2D Medical
Image Segmentation
- Authors: Liang Xu, Mingxiao Chen, Yi Cheng, Pengfei Shao, Shuwei Shen, Peng
Yao, and Ronald X.Xu
- Abstract summary: We propose a 2D medical image segmentation model called Multi-scale Cross Perceptron Attention Network (MCPA)
The MCPA consists of three main components: an encoder, a decoder, and a Cross Perceptron.
We evaluate our proposed MCPA model on several publicly available medical image datasets from different tasks and devices.
- Score: 7.720152925974362
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The UNet architecture, based on Convolutional Neural Networks (CNN), has
demonstrated its remarkable performance in medical image analysis. However, it
faces challenges in capturing long-range dependencies due to the limited
receptive fields and inherent bias of convolutional operations. Recently,
numerous transformer-based techniques have been incorporated into the UNet
architecture to overcome this limitation by effectively capturing global
feature correlations. However, the integration of the Transformer modules may
result in the loss of local contextual information during the global feature
fusion process. To overcome these challenges, we propose a 2D medical image
segmentation model called Multi-scale Cross Perceptron Attention Network
(MCPA). The MCPA consists of three main components: an encoder, a decoder, and
a Cross Perceptron. The Cross Perceptron first captures the local correlations
using multiple Multi-scale Cross Perceptron modules, facilitating the fusion of
features across scales. The resulting multi-scale feature vectors are then
spatially unfolded, concatenated, and fed through a Global Perceptron module to
model global dependencies. Furthermore, we introduce a Progressive Dual-branch
Structure to address the semantic segmentation of the image involving finer
tissue structures. This structure gradually shifts the segmentation focus of
MCPA network training from large-scale structural features to more
sophisticated pixel-level features. We evaluate our proposed MCPA model on
several publicly available medical image datasets from different tasks and
devices, including the open large-scale dataset of CT (Synapse), MRI (ACDC),
fundus camera (DRIVE, CHASE_DB1, HRF), and OCTA (ROSE). The experimental
results show that our MCPA model achieves state-of-the-art performance. The
code is available at
https://github.com/simonustc/MCPA-for-2D-Medical-Image-Segmentation.
Related papers
- EFCNet: Every Feature Counts for Small Medical Object Segmentation [44.26196156775273]
This paper explores the segmentation of very small medical objects with significant clinical value.
CNNs, UNet-like models, and recent Transformers have shown substantial progress in image segmentation.
We propose a novel model named EFCNet for small object segmentation in medical images.
arXiv Detail & Related papers (2024-06-26T09:33:51Z) - SDR-Former: A Siamese Dual-Resolution Transformer for Liver Lesion
Classification Using 3D Multi-Phase Imaging [59.78761085714715]
This study proposes a novel Siamese Dual-Resolution Transformer (SDR-Former) framework for liver lesion classification.
The proposed framework has been validated through comprehensive experiments on two clinical datasets.
To support the scientific community, we are releasing our extensive multi-phase MR dataset for liver lesion analysis to the public.
arXiv Detail & Related papers (2024-02-27T06:32:56Z) - BEFUnet: A Hybrid CNN-Transformer Architecture for Precise Medical Image
Segmentation [0.0]
This paper proposes an innovative U-shaped network called BEFUnet, which enhances the fusion of body and edge information for precise medical image segmentation.
The BEFUnet comprises three main modules, including a novel Local Cross-Attention Feature (LCAF) fusion module, a novel Double-Level Fusion (DLF) module, and dual-branch encoder.
The LCAF module efficiently fuses edge and body features by selectively performing local cross-attention on features that are spatially close between the two modalities.
arXiv Detail & Related papers (2024-02-13T21:03:36Z) - BRAU-Net++: U-Shaped Hybrid CNN-Transformer Network for Medical Image Segmentation [11.986549780782724]
We propose a hybrid yet effective CNN-Transformer network, named BRAU-Net++, for an accurate medical image segmentation task.
Specifically, BRAU-Net++ uses bi-level routing attention as the core building block to design our u-shaped encoder-decoder structure.
Our proposed approach surpasses other state-of-the-art methods including its baseline: BRAU-Net.
arXiv Detail & Related papers (2024-01-01T10:49:09Z) - Dual-scale Enhanced and Cross-generative Consistency Learning for Semi-supervised Medical Image Segmentation [49.57907601086494]
Medical image segmentation plays a crucial role in computer-aided diagnosis.
We propose a novel Dual-scale Enhanced and Cross-generative consistency learning framework for semi-supervised medical image (DEC-Seg)
arXiv Detail & Related papers (2023-12-26T12:56:31Z) - M$^{2}$SNet: Multi-scale in Multi-scale Subtraction Network for Medical
Image Segmentation [73.10707675345253]
We propose a general multi-scale in multi-scale subtraction network (M$2$SNet) to finish diverse segmentation from medical image.
Our method performs favorably against most state-of-the-art methods under different evaluation metrics on eleven datasets of four different medical image segmentation tasks.
arXiv Detail & Related papers (2023-03-20T06:26:49Z) - MISSU: 3D Medical Image Segmentation via Self-distilling TransUNet [55.16833099336073]
We propose to self-distill a Transformer-based UNet for medical image segmentation.
It simultaneously learns global semantic information and local spatial-detailed features.
Our MISSU achieves the best performance over previous state-of-the-art methods.
arXiv Detail & Related papers (2022-06-02T07:38:53Z) - InDuDoNet+: A Model-Driven Interpretable Dual Domain Network for Metal
Artifact Reduction in CT Images [53.4351366246531]
We construct a novel interpretable dual domain network, termed InDuDoNet+, into which CT imaging process is finely embedded.
We analyze the CT values among different tissues, and merge the prior observations into a prior network for our InDuDoNet+, which significantly improve its generalization performance.
arXiv Detail & Related papers (2021-12-23T15:52:37Z) - TransAttUnet: Multi-level Attention-guided U-Net with Transformer for
Medical Image Segmentation [33.45471457058221]
This paper proposes a novel Transformer based medical image semantic segmentation framework called TransAttUnet.
In particular, we establish additional multi-scale skip connections between decoder blocks to aggregate the different semantic-scale upsampling features.
Our method consistently outperforms the state-of-the-art baselines.
arXiv Detail & Related papers (2021-07-12T09:17:06Z) - TransUNet: Transformers Make Strong Encoders for Medical Image
Segmentation [78.01570371790669]
Medical image segmentation is an essential prerequisite for developing healthcare systems.
On various medical image segmentation tasks, the u-shaped architecture, also known as U-Net, has become the de-facto standard.
We propose TransUNet, which merits both Transformers and U-Net, as a strong alternative for medical image segmentation.
arXiv Detail & Related papers (2021-02-08T16:10:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.