Skin Lesion Segmentation Improved by Transformer-based Networks with
Inter-scale Dependency Modeling
- URL: http://arxiv.org/abs/2310.13604v1
- Date: Fri, 20 Oct 2023 15:53:51 GMT
- Title: Skin Lesion Segmentation Improved by Transformer-based Networks with
Inter-scale Dependency Modeling
- Authors: Sania Eskandari, Janet Lumpp, Luis Sanchez Giraldo
- Abstract summary: Melanoma is a dangerous type of skin cancer resulting from abnormal skin cell growth.
The symmetrical U-Net model's reliance on convolutional operations hinders its ability to capture long-range dependencies.
Several Transformer-based U-Net topologies have recently been created to overcome this limitation.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Melanoma, a dangerous type of skin cancer resulting from abnormal skin cell
growth, can be treated if detected early. Various approaches using Fully
Convolutional Networks (FCNs) have been proposed, with the U-Net architecture
being prominent To aid in its diagnosis through automatic skin lesion
segmentation. However, the symmetrical U-Net model's reliance on convolutional
operations hinders its ability to capture long-range dependencies crucial for
accurate medical image segmentation. Several Transformer-based U-Net topologies
have recently been created to overcome this limitation by replacing CNN blocks
with different Transformer modules to capture local and global representations.
Furthermore, the U-shaped structure is hampered by semantic gaps between the
encoder and decoder. This study intends to increase the network's feature
re-usability by carefully building the skip connection path. Integrating an
already calculated attention affinity within the skip connection path improves
the typical concatenation process utilized in the conventional skip connection
path. As a result, we propose a U-shaped hierarchical Transformer-based
structure for skin lesion segmentation and an Inter-scale Context Fusion (ISCF)
method that uses attention correlations in each stage of the encoder to
adaptively combine the contexts from each stage to mitigate semantic gaps. The
findings from two skin lesion segmentation benchmarks support the ISCF module's
applicability and effectiveness. The code is publicly available at
\url{https://github.com/saniaesk/skin-lesion-segmentation}
Related papers
- TransUKAN:Computing-Efficient Hybrid KAN-Transformer for Enhanced Medical Image Segmentation [5.280523424712006]
U-Net is currently the most widely used architecture for medical image segmentation.
We have improved the KAN to reduce memory usage and computational load.
This approach enhances the model's capability to capture nonlinear relationships.
arXiv Detail & Related papers (2024-09-23T02:52:49Z) - SkinMamba: A Precision Skin Lesion Segmentation Architecture with Cross-Scale Global State Modeling and Frequency Boundary Guidance [0.559239450391449]
Skin lesion segmentation is a crucial method for identifying early skin cancer.
We propose a hybrid architecture based on Mamba and CNN, called SkinMamba.
It maintains linear complexity while offering powerful long-range dependency modeling and local feature extraction capabilities.
arXiv Detail & Related papers (2024-09-17T05:02:38Z) - Prototype Learning Guided Hybrid Network for Breast Tumor Segmentation in DCE-MRI [58.809276442508256]
We propose a hybrid network via the combination of convolution neural network (CNN) and transformer layers.
The experimental results on private and public DCE-MRI datasets demonstrate that the proposed hybrid network superior performance than the state-of-the-art methods.
arXiv Detail & Related papers (2024-08-11T15:46:00Z) - Inter-Scale Dependency Modeling for Skin Lesion Segmentation with
Transformer-based Networks [0.0]
Melanoma is a dangerous form of skin cancer caused by the abnormal growth of skin cells.
FCN approaches, including the U-Net architecture, can automatically segment skin lesions to aid diagnosis.
The symmetrical U-Net model has shown outstanding results, but its use of a convolutional operation limits its ability to capture long-range dependencies.
arXiv Detail & Related papers (2023-10-20T16:20:25Z) - Reliable Joint Segmentation of Retinal Edema Lesions in OCT Images [55.83984261827332]
In this paper, we propose a novel reliable multi-scale wavelet-enhanced transformer network.
We develop a novel segmentation backbone that integrates a wavelet-enhanced feature extractor network and a multi-scale transformer module.
Our proposed method achieves better segmentation accuracy with a high degree of reliability as compared to other state-of-the-art segmentation approaches.
arXiv Detail & Related papers (2022-12-01T07:32:56Z) - Attention Swin U-Net: Cross-Contextual Attention Mechanism for Skin
Lesion Segmentation [4.320393382724066]
We propose Att-SwinU-Net, an attention-based Swin U-Net extension, for medical image segmentation.
We argue that the classical concatenation operation utilized in the skip connection path can be further improved by incorporating an attention mechanism.
arXiv Detail & Related papers (2022-10-30T17:41:35Z) - TransNorm: Transformer Provides a Strong Spatial Normalization Mechanism
for a Deep Segmentation Model [4.320393382724066]
convolutional neural networks (CNNs) have been the prevailing technique in the medical image processing era.
We propose Trans-Norm, a novel deep segmentation framework which consolidates a Transformer module into both encoder and skip-connections of the standard U-Net.
arXiv Detail & Related papers (2022-07-27T09:54:10Z) - MISSU: 3D Medical Image Segmentation via Self-distilling TransUNet [55.16833099336073]
We propose to self-distill a Transformer-based UNet for medical image segmentation.
It simultaneously learns global semantic information and local spatial-detailed features.
Our MISSU achieves the best performance over previous state-of-the-art methods.
arXiv Detail & Related papers (2022-06-02T07:38:53Z) - Semantic Correspondence with Transformers [68.37049687360705]
We propose Cost Aggregation with Transformers (CATs) to find dense correspondences between semantically similar images.
We include appearance affinity modelling to disambiguate the initial correlation maps and multi-level aggregation.
We conduct experiments to demonstrate the effectiveness of the proposed model over the latest methods and provide extensive ablation studies.
arXiv Detail & Related papers (2021-06-04T14:39:03Z) - Swin-Unet: Unet-like Pure Transformer for Medical Image Segmentation [63.46694853953092]
Swin-Unet is an Unet-like pure Transformer for medical image segmentation.
tokenized image patches are fed into the Transformer-based U-shaped decoder-Decoder architecture.
arXiv Detail & Related papers (2021-05-12T09:30:26Z) - TransUNet: Transformers Make Strong Encoders for Medical Image
Segmentation [78.01570371790669]
Medical image segmentation is an essential prerequisite for developing healthcare systems.
On various medical image segmentation tasks, the u-shaped architecture, also known as U-Net, has become the de-facto standard.
We propose TransUNet, which merits both Transformers and U-Net, as a strong alternative for medical image segmentation.
arXiv Detail & Related papers (2021-02-08T16:10:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.