Swin Deformable Attention Hybrid U-Net for Medical Image Segmentation
- URL: http://arxiv.org/abs/2302.14450v2
- Date: Wed, 27 Sep 2023 07:56:37 GMT
- Title: Swin Deformable Attention Hybrid U-Net for Medical Image Segmentation
- Authors: Lichao Wang, Jiahao Huang, Xiaodan Xing, Guang Yang
- Abstract summary: We propose to incorporate the Shifted Window (Swin) Deformable Attention into a hybrid architecture to improve segmentation performance.
Our proposed Swin Deformable Attention Hybrid UNet (SDAH-UNet) demonstrates state-of-the-art performance on both anatomical and lesion segmentation tasks.
- Score: 3.407509559779547
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Medical image segmentation is a crucial task in the field of medical image
analysis. Harmonizing the convolution and multi-head self-attention mechanism
is a recent research focus in this field, with various combination methods
proposed. However, the lack of interpretability of these hybrid models remains
a common pitfall, limiting their practical application in clinical scenarios.
To address this issue, we propose to incorporate the Shifted Window (Swin)
Deformable Attention into a hybrid architecture to improve segmentation
performance while ensuring explainability. Our proposed Swin Deformable
Attention Hybrid UNet (SDAH-UNet) demonstrates state-of-the-art performance on
both anatomical and lesion segmentation tasks. Moreover, we provide a direct
and visual explanation of the model focalization and how the model forms it,
enabling clinicians to better understand and trust the decision of the model.
Our approach could be a promising solution to the challenge of developing
accurate and interpretable medical image segmentation models.
Related papers
- MedUHIP: Towards Human-In-the-Loop Medical Segmentation [5.520419627866446]
Medical image segmentation is particularly complicated by inherent uncertainties.
We propose a novel approach that integrates an textbfuncertainty-aware model with textbfhuman-in-the-loop interaction
Our method showcases superior segmentation capabilities, outperforming a wide range of deterministic and uncertainty-aware models.
arXiv Detail & Related papers (2024-08-03T01:06:02Z) - Advancing Medical Image Segmentation: Morphology-Driven Learning with Diffusion Transformer [4.672688418357066]
We propose a novel Transformer Diffusion (DTS) model for robust segmentation in the presence of noise.
Our model, which analyzes the morphological representation of images, shows better results than the previous models in various medical imaging modalities.
arXiv Detail & Related papers (2024-08-01T07:35:54Z) - QUBIQ: Uncertainty Quantification for Biomedical Image Segmentation Challenge [93.61262892578067]
Uncertainty in medical image segmentation tasks, especially inter-rater variability, presents a significant challenge.
This variability directly impacts the development and evaluation of automated segmentation algorithms.
We report the set-up and summarize the benchmark results of the Quantification of Uncertainties in Biomedical Image Quantification Challenge (QUBIQ)
arXiv Detail & Related papers (2024-03-19T17:57:24Z) - Dual-scale Enhanced and Cross-generative Consistency Learning for Semi-supervised Medical Image Segmentation [49.57907601086494]
Medical image segmentation plays a crucial role in computer-aided diagnosis.
We propose a novel Dual-scale Enhanced and Cross-generative consistency learning framework for semi-supervised medical image (DEC-Seg)
arXiv Detail & Related papers (2023-12-26T12:56:31Z) - EviPrompt: A Training-Free Evidential Prompt Generation Method for
Segment Anything Model in Medical Images [14.899388051854084]
Medical image segmentation has immense clinical applicability but remains a challenge despite advancements in deep learning.
This paper introduces a novel training-free evidential prompt generation method named EviPrompt to overcome these issues.
The proposed method, built on the inherent similarities within medical images, requires only a single reference image-annotation pair.
arXiv Detail & Related papers (2023-11-10T21:22:22Z) - Performance Analysis of UNet and Variants for Medical Image Segmentation [1.5410557873153836]
This study aims to explore the application of deep learning models, particularly focusing on the UNet architecture and its variants, in medical image segmentation.
The findings reveal that the standard UNet, when extended with a deep network layer, is a proficient medical image segmentation model.
The Res-UNet and Attention Res-UNet architectures demonstrate smoother convergence and superior performance, particularly when handling fine image details.
arXiv Detail & Related papers (2023-09-22T17:20:40Z) - Introducing Shape Prior Module in Diffusion Model for Medical Image
Segmentation [7.7545714516743045]
We propose an end-to-end framework called VerseDiff-UNet, which leverages the denoising diffusion probabilistic model (DDPM)
Our approach integrates the diffusion model into a standard U-shaped architecture.
We evaluate our method on a single dataset of spine images acquired through X-ray imaging.
arXiv Detail & Related papers (2023-09-12T03:05:00Z) - Multi-task Paired Masking with Alignment Modeling for Medical
Vision-Language Pre-training [55.56609500764344]
We propose a unified framework based on Multi-task Paired Masking with Alignment (MPMA) to integrate the cross-modal alignment task into the joint image-text reconstruction framework.
We also introduce a Memory-Augmented Cross-Modal Fusion (MA-CMF) module to fully integrate visual information to assist report reconstruction.
arXiv Detail & Related papers (2023-05-13T13:53:48Z) - Ambiguous Medical Image Segmentation using Diffusion Models [60.378180265885945]
We introduce a single diffusion model-based approach that produces multiple plausible outputs by learning a distribution over group insights.
Our proposed model generates a distribution of segmentation masks by leveraging the inherent sampling process of diffusion.
Comprehensive results show that our proposed approach outperforms existing state-of-the-art ambiguous segmentation networks.
arXiv Detail & Related papers (2023-04-10T17:58:22Z) - MedSegDiff-V2: Diffusion based Medical Image Segmentation with
Transformer [53.575573940055335]
We propose a novel Transformer-based Diffusion framework, called MedSegDiff-V2.
We verify its effectiveness on 20 medical image segmentation tasks with different image modalities.
arXiv Detail & Related papers (2023-01-19T03:42:36Z) - Few-shot Medical Image Segmentation using a Global Correlation Network
with Discriminative Embedding [60.89561661441736]
We propose a novel method for few-shot medical image segmentation.
We construct our few-shot image segmentor using a deep convolutional network trained episodically.
We enhance discriminability of deep embedding to encourage clustering of the feature domains of the same class.
arXiv Detail & Related papers (2020-12-10T04:01:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.