Skin Lesion Segmentation Improved by Transformer-based Networks with
Inter-scale Dependency Modeling
- URL: http://arxiv.org/abs/2310.13604v1
- Date: Fri, 20 Oct 2023 15:53:51 GMT
- Title: Skin Lesion Segmentation Improved by Transformer-based Networks with
Inter-scale Dependency Modeling
- Authors: Sania Eskandari, Janet Lumpp, Luis Sanchez Giraldo
- Abstract summary: Melanoma is a dangerous type of skin cancer resulting from abnormal skin cell growth.
The symmetrical U-Net model's reliance on convolutional operations hinders its ability to capture long-range dependencies.
Several Transformer-based U-Net topologies have recently been created to overcome this limitation.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Melanoma, a dangerous type of skin cancer resulting from abnormal skin cell
growth, can be treated if detected early. Various approaches using Fully
Convolutional Networks (FCNs) have been proposed, with the U-Net architecture
being prominent To aid in its diagnosis through automatic skin lesion
segmentation. However, the symmetrical U-Net model's reliance on convolutional
operations hinders its ability to capture long-range dependencies crucial for
accurate medical image segmentation. Several Transformer-based U-Net topologies
have recently been created to overcome this limitation by replacing CNN blocks
with different Transformer modules to capture local and global representations.
Furthermore, the U-shaped structure is hampered by semantic gaps between the
encoder and decoder. This study intends to increase the network's feature
re-usability by carefully building the skip connection path. Integrating an
already calculated attention affinity within the skip connection path improves
the typical concatenation process utilized in the conventional skip connection
path. As a result, we propose a U-shaped hierarchical Transformer-based
structure for skin lesion segmentation and an Inter-scale Context Fusion (ISCF)
method that uses attention correlations in each stage of the encoder to
adaptively combine the contexts from each stage to mitigate semantic gaps. The
findings from two skin lesion segmentation benchmarks support the ISCF module's
applicability and effectiveness. The code is publicly available at
\url{https://github.com/saniaesk/skin-lesion-segmentation}
Related papers
- Inter-Scale Dependency Modeling for Skin Lesion Segmentation with
Transformer-based Networks [0.0]
Melanoma is a dangerous form of skin cancer caused by the abnormal growth of skin cells.
FCN approaches, including the U-Net architecture, can automatically segment skin lesions to aid diagnosis.
The symmetrical U-Net model has shown outstanding results, but its use of a convolutional operation limits its ability to capture long-range dependencies.
arXiv Detail & Related papers (2023-10-20T16:20:25Z) - Adaptive Context Selection for Polyp Segmentation [99.9959901908053]
We propose an adaptive context selection based encoder-decoder framework which is composed of Local Context Attention (LCA) module, Global Context Module (GCM) and Adaptive Selection Module (ASM)
LCA modules deliver local context features from encoder layers to decoder layers, enhancing the attention to the hard region which is determined by the prediction map of previous layer.
GCM aims to further explore the global context features and send to the decoder layers. ASM is used for adaptive selection and aggregation of context features through channel-wise attention.
arXiv Detail & Related papers (2023-01-12T04:06:44Z) - Reliable Joint Segmentation of Retinal Edema Lesions in OCT Images [55.83984261827332]
In this paper, we propose a novel reliable multi-scale wavelet-enhanced transformer network.
We develop a novel segmentation backbone that integrates a wavelet-enhanced feature extractor network and a multi-scale transformer module.
Our proposed method achieves better segmentation accuracy with a high degree of reliability as compared to other state-of-the-art segmentation approaches.
arXiv Detail & Related papers (2022-12-01T07:32:56Z) - Attention Swin U-Net: Cross-Contextual Attention Mechanism for Skin
Lesion Segmentation [4.320393382724066]
We propose Att-SwinU-Net, an attention-based Swin U-Net extension, for medical image segmentation.
We argue that the classical concatenation operation utilized in the skip connection path can be further improved by incorporating an attention mechanism.
arXiv Detail & Related papers (2022-10-30T17:41:35Z) - TransNorm: Transformer Provides a Strong Spatial Normalization Mechanism
for a Deep Segmentation Model [4.320393382724066]
convolutional neural networks (CNNs) have been the prevailing technique in the medical image processing era.
We propose Trans-Norm, a novel deep segmentation framework which consolidates a Transformer module into both encoder and skip-connections of the standard U-Net.
arXiv Detail & Related papers (2022-07-27T09:54:10Z) - MISSU: 3D Medical Image Segmentation via Self-distilling TransUNet [55.16833099336073]
We propose to self-distill a Transformer-based UNet for medical image segmentation.
It simultaneously learns global semantic information and local spatial-detailed features.
Our MISSU achieves the best performance over previous state-of-the-art methods.
arXiv Detail & Related papers (2022-06-02T07:38:53Z) - Contextual Attention Network: Transformer Meets U-Net [0.0]
convolutional neural networks (CNN) have become the de facto standard and attained immense success in medical image segmentation.
However, CNN based methods fail to build long-range dependencies and global context connections.
Recent articles have exploited Transformer variants for medical image segmentation tasks.
arXiv Detail & Related papers (2022-03-02T21:10:24Z) - Semantic Correspondence with Transformers [68.37049687360705]
We propose Cost Aggregation with Transformers (CATs) to find dense correspondences between semantically similar images.
We include appearance affinity modelling to disambiguate the initial correlation maps and multi-level aggregation.
We conduct experiments to demonstrate the effectiveness of the proposed model over the latest methods and provide extensive ablation studies.
arXiv Detail & Related papers (2021-06-04T14:39:03Z) - Swin-Unet: Unet-like Pure Transformer for Medical Image Segmentation [63.46694853953092]
Swin-Unet is an Unet-like pure Transformer for medical image segmentation.
tokenized image patches are fed into the Transformer-based U-shaped decoder-Decoder architecture.
arXiv Detail & Related papers (2021-05-12T09:30:26Z) - TransUNet: Transformers Make Strong Encoders for Medical Image
Segmentation [78.01570371790669]
Medical image segmentation is an essential prerequisite for developing healthcare systems.
On various medical image segmentation tasks, the u-shaped architecture, also known as U-Net, has become the de-facto standard.
We propose TransUNet, which merits both Transformers and U-Net, as a strong alternative for medical image segmentation.
arXiv Detail & Related papers (2021-02-08T16:10:50Z) - MA-Unet: An improved version of Unet based on multi-scale and attention
mechanism for medical image segmentation [4.082245106486775]
convolutional neural networks (CNNs) are promoting the development of medical image semantic segmentation.
In this paper, we try to eliminate semantic ambiguity in skip connection operations by adding attention gates (AGs)
Our model obtains better segmentation performance while introducing fewer parameters.
arXiv Detail & Related papers (2020-12-20T15:29:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.