HST-MRF: Heterogeneous Swin Transformer with Multi-Receptive Field for
Medical Image Segmentation
- URL: http://arxiv.org/abs/2304.04614v1
- Date: Mon, 10 Apr 2023 14:30:03 GMT
- Title: HST-MRF: Heterogeneous Swin Transformer with Multi-Receptive Field for
Medical Image Segmentation
- Authors: Xiaofei Huang, Hongfang Gong, Jin Zhang
- Abstract summary: We propose a Heterogeneous Swin Transformer with Multi-Receptive Field (HST-MRF) model for medical image segmentation.
The main purpose is to solve the problem of loss of structural information caused by patch segmentation using transformer.
Experimental results show that our proposed method outperforms state-of-the-art models and can achieve superior performance.
- Score: 5.51045524851432
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The Transformer has been successfully used in medical image segmentation due
to its excellent long-range modeling capabilities. However, patch segmentation
is necessary when building a Transformer class model. This process may disrupt
the tissue structure in medical images, resulting in the loss of relevant
information. In this study, we proposed a Heterogeneous Swin Transformer with
Multi-Receptive Field (HST-MRF) model based on U-shaped networks for medical
image segmentation. The main purpose is to solve the problem of loss of
structural information caused by patch segmentation using transformer by fusing
patch information under different receptive fields. The heterogeneous Swin
Transformer (HST) is the core module, which achieves the interaction of
multi-receptive field patch information through heterogeneous attention and
passes it to the next stage for progressive learning. We also designed a
two-stage fusion module, multimodal bilinear pooling (MBP), to assist HST in
further fusing multi-receptive field information and combining low-level and
high-level semantic information for accurate localization of lesion regions. In
addition, we developed adaptive patch embedding (APE) and soft channel
attention (SCA) modules to retain more valuable information when acquiring
patch embedding and filtering channel features, respectively, thereby improving
model segmentation quality. We evaluated HST-MRF on multiple datasets for polyp
and skin lesion segmentation tasks. Experimental results show that our proposed
method outperforms state-of-the-art models and can achieve superior
performance. Furthermore, we verified the effectiveness of each module and the
benefits of multi-receptive field segmentation in reducing the loss of
structural information through ablation experiments.
Related papers
- ASPS: Augmented Segment Anything Model for Polyp Segmentation [77.25557224490075]
The Segment Anything Model (SAM) has introduced unprecedented potential for polyp segmentation.
SAM's Transformer-based structure prioritizes global and low-frequency information.
CFA integrates a trainable CNN encoder branch with a frozen ViT encoder, enabling the integration of domain-specific knowledge.
arXiv Detail & Related papers (2024-06-30T14:55:32Z) - Improved EATFormer: A Vision Transformer for Medical Image Classification [0.0]
This paper presents an improved Algorithm-based Transformer architecture for medical image classification using Vision Transformers.
The proposed EATFormer architecture combines the strengths of Convolutional Neural Networks and Vision Transformers.
Experimental results on the Chest X-ray and Kvasir datasets demonstrate that the proposed EATFormer significantly improves prediction speed and accuracy compared to baseline models.
arXiv Detail & Related papers (2024-03-19T21:40:20Z) - Dual-scale Enhanced and Cross-generative Consistency Learning for Semi-supervised Medical Image Segmentation [49.57907601086494]
Medical image segmentation plays a crucial role in computer-aided diagnosis.
We propose a novel Dual-scale Enhanced and Cross-generative consistency learning framework for semi-supervised medical image (DEC-Seg)
arXiv Detail & Related papers (2023-12-26T12:56:31Z) - PMP-Swin: Multi-Scale Patch Message Passing Swin Transformer for Retinal
Disease Classification [9.651435376561741]
We propose a new framework named Multi-Scale Patch Message Passing Swin Transformer for multi-class retinal disease classification.
Specifically, we design a Patch Message Passing (PMP) module based on the Message Passing mechanism to establish global interaction for pathological semantic features.
arXiv Detail & Related papers (2023-11-20T11:09:09Z) - Affine-Consistent Transformer for Multi-Class Cell Nuclei Detection [76.11864242047074]
We propose a novel Affine-Consistent Transformer (AC-Former), which directly yields a sequence of nucleus positions.
We introduce an Adaptive Affine Transformer (AAT) module, which can automatically learn the key spatial transformations to warp original images for local network training.
Experimental results demonstrate that the proposed method significantly outperforms existing state-of-the-art algorithms on various benchmarks.
arXiv Detail & Related papers (2023-10-22T02:27:02Z) - Curriculum-Based Augmented Fourier Domain Adaptation for Robust Medical
Image Segmentation [18.830738606514736]
This work proposes the Curriculum-based Augmented Fourier Domain Adaptation (Curri-AFDA) for robust medical image segmentation.
In particular, our curriculum learning strategy is based on the causal relationship of a model under different levels of data shift.
Experiments on two segmentation tasks of Retina and Nuclei collected from multiple sites and scanners suggest that our proposed method yields superior adaptation and generalization performance.
arXiv Detail & Related papers (2023-06-06T08:56:58Z) - Self-Supervised Correction Learning for Semi-Supervised Biomedical Image
Segmentation [84.58210297703714]
We propose a self-supervised correction learning paradigm for semi-supervised biomedical image segmentation.
We design a dual-task network, including a shared encoder and two independent decoders for segmentation and lesion region inpainting.
Experiments on three medical image segmentation datasets for different tasks demonstrate the outstanding performance of our method.
arXiv Detail & Related papers (2023-01-12T08:19:46Z) - Reliable Joint Segmentation of Retinal Edema Lesions in OCT Images [55.83984261827332]
In this paper, we propose a novel reliable multi-scale wavelet-enhanced transformer network.
We develop a novel segmentation backbone that integrates a wavelet-enhanced feature extractor network and a multi-scale transformer module.
Our proposed method achieves better segmentation accuracy with a high degree of reliability as compared to other state-of-the-art segmentation approaches.
arXiv Detail & Related papers (2022-12-01T07:32:56Z) - Class-Aware Generative Adversarial Transformers for Medical Image
Segmentation [39.14169989603906]
We present CA-GANformer, a novel type of generative adversarial transformers, for medical image segmentation.
First, we take advantage of the pyramid structure to construct multi-scale representations and handle multi-scale variations.
We then design a novel class-aware transformer module to better learn the discriminative regions of objects with semantic structures.
arXiv Detail & Related papers (2022-01-26T03:50:02Z) - TransAttUnet: Multi-level Attention-guided U-Net with Transformer for
Medical Image Segmentation [33.45471457058221]
This paper proposes a novel Transformer based medical image semantic segmentation framework called TransAttUnet.
In particular, we establish additional multi-scale skip connections between decoder blocks to aggregate the different semantic-scale upsampling features.
Our method consistently outperforms the state-of-the-art baselines.
arXiv Detail & Related papers (2021-07-12T09:17:06Z) - Cross-Modality Brain Tumor Segmentation via Bidirectional
Global-to-Local Unsupervised Domain Adaptation [61.01704175938995]
In this paper, we propose a novel Bidirectional Global-to-Local (BiGL) adaptation framework under a UDA scheme.
Specifically, a bidirectional image synthesis and segmentation module is proposed to segment the brain tumor.
The proposed method outperforms several state-of-the-art unsupervised domain adaptation methods by a large margin.
arXiv Detail & Related papers (2021-05-17T10:11:45Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.