D-TrAttUnet: Dual-Decoder Transformer-Based Attention Unet Architecture
for Binary and Multi-classes Covid-19 Infection Segmentation
- URL: http://arxiv.org/abs/2303.15576v1
- Date: Mon, 27 Mar 2023 20:05:09 GMT
- Title: D-TrAttUnet: Dual-Decoder Transformer-Based Attention Unet Architecture
for Binary and Multi-classes Covid-19 Infection Segmentation
- Authors: Fares Bougourzi and Cosimo Distante and Fadi Dornaika and Abdelmalik
Taleb-Ahmed
- Abstract summary: We propose a new Transformer-CNN based approach for Covid-19 infection segmentation from the CT slices.
The Transformer-CNN encoder is built using Transformer layers, UpResBlocks, ResBlocks and max-pooling layers.
The proposed D-TrAttUnet architecture is evaluated for both Binary and Multi-classes Covid-19 infection segmentation.
- Score: 18.231677739397977
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: In the last three years, the world has been facing a global crisis caused by
Covid-19 pandemic. Medical imaging has been playing a crucial role in the
fighting against this disease and saving the human lives. Indeed, CT-scans has
proved their efficiency in diagnosing, detecting, and following-up the Covid-19
infection. In this paper, we propose a new Transformer-CNN based approach for
Covid-19 infection segmentation from the CT slices. The proposed D-TrAttUnet
architecture has an Encoder-Decoder structure, where compound Transformer-CNN
encoder and Dual-Decoders are proposed. The Transformer-CNN encoder is built
using Transformer layers, UpResBlocks, ResBlocks and max-pooling layers. The
Dual-Decoder consists of two identical CNN decoders with attention gates. The
two decoders are used to segment the infection and the lung regions
simultaneously and the losses of the two tasks are joined. The proposed
D-TrAttUnet architecture is evaluated for both Binary and Multi-classes
Covid-19 infection segmentation. The experimental results prove the efficiency
of the proposed approach to deal with the complexity of Covid-19 segmentation
task from limited data. Furthermore, D-TrAttUnet architecture outperforms three
baseline CNN segmentation architectures (Unet, AttUnet and Unet++) and three
state-of-the-art architectures (AnamNet, SCOATNet and CopleNet), in both Binary
and Mutli-classes segmentation tasks.
Related papers
- D-TrAttUnet: Toward Hybrid CNN-Transformer Architecture for Generic and Subtle Segmentation in Medical Images [18.087365906278013]
D-TrAttUnet architecture is based on the observation that different diseases often target specific organs.
Our architecture includes an encoder-decoder structure with a composite-CNN encoder and dual decoders.
To validate our approach, we performed evaluations on the Covid-19 and Bone Metastasis segmentation tasks.
arXiv Detail & Related papers (2024-05-07T10:09:41Z) - Rethinking Attention Gated with Hybrid Dual Pyramid Transformer-CNN for Generalized Segmentation in Medical Imaging [17.07490339960335]
We introduce a novel hybrid CNN-Transformer segmentation architecture (PAG-TransYnet) designed for efficiently building a strong CNN-Transformer encoder.
Our approach exploits attention gates within a Dual Pyramid hybrid encoder.
arXiv Detail & Related papers (2024-04-28T14:37:10Z) - 3D TransUNet: Advancing Medical Image Segmentation through Vision
Transformers [40.21263511313524]
Medical image segmentation plays a crucial role in advancing healthcare systems for disease diagnosis and treatment planning.
The u-shaped architecture, popularly known as U-Net, has proven highly successful for various medical image segmentation tasks.
To address these limitations, researchers have turned to Transformers, renowned for their global self-attention mechanisms.
arXiv Detail & Related papers (2023-10-11T18:07:19Z) - CATS v2: Hybrid encoders for robust medical segmentation [12.194439938007672]
Convolutional Neural Networks (CNNs) have exhibited strong performance in medical image segmentation tasks.
However, due to the limited field of view of convolution kernel, it is hard for CNNs to fully represent global information.
We propose CATS v2 with hybrid encoders, which better leverage both local and global information.
arXiv Detail & Related papers (2023-08-11T20:21:54Z) - Dual Multi-scale Mean Teacher Network for Semi-supervised Infection
Segmentation in Chest CT Volume for COVID-19 [76.51091445670596]
Automated detecting lung infections from computed tomography (CT) data plays an important role for combating COVID-19.
Most current COVID-19 infection segmentation methods mainly relied on 2D CT images, which lack 3D sequential constraint.
Existing 3D CT segmentation methods focus on single-scale representations, which do not achieve the multiple level receptive field sizes on 3D volume.
arXiv Detail & Related papers (2022-11-10T13:11:21Z) - Cats: Complementary CNN and Transformer Encoders for Segmentation [13.288195115791758]
We propose a model with double encoders for 3D biomedical image segmentation.
We fuse the information from the convolutional encoder and the transformer, and pass it to the decoder to obtain the results.
Compared to the state-of-the-art models with and without transformers on each task, our proposed method obtains higher Dice scores across the board.
arXiv Detail & Related papers (2022-08-24T14:25:11Z) - BCS-Net: Boundary, Context and Semantic for Automatic COVID-19 Lung
Infection Segmentation from CT Images [83.82141604007899]
BCS-Net is a novel network for automatic COVID-19 lung infection segmentation from CT images.
BCS-Net follows an encoder-decoder architecture, and more designs focus on the decoder stage.
In each BCSR block, the attention-guided global context (AGGC) module is designed to learn the most valuable encoder features for decoder.
arXiv Detail & Related papers (2022-07-17T08:54:07Z) - MISSU: 3D Medical Image Segmentation via Self-distilling TransUNet [55.16833099336073]
We propose to self-distill a Transformer-based UNet for medical image segmentation.
It simultaneously learns global semantic information and local spatial-detailed features.
Our MISSU achieves the best performance over previous state-of-the-art methods.
arXiv Detail & Related papers (2022-06-02T07:38:53Z) - Swin-Unet: Unet-like Pure Transformer for Medical Image Segmentation [63.46694853953092]
Swin-Unet is an Unet-like pure Transformer for medical image segmentation.
tokenized image patches are fed into the Transformer-based U-shaped decoder-Decoder architecture.
arXiv Detail & Related papers (2021-05-12T09:30:26Z) - Deep ensembles based on Stochastic Activation Selection for Polyp
Segmentation [82.61182037130406]
This work deals with medical image segmentation and in particular with accurate polyp detection and segmentation during colonoscopy examinations.
Basic architecture in image segmentation consists of an encoder and a decoder.
We compare some variant of the DeepLab architecture obtained by varying the decoder backbone.
arXiv Detail & Related papers (2021-04-02T02:07:37Z) - CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image
Segmentation [95.51455777713092]
Convolutional neural networks (CNNs) have been the de facto standard for nowadays 3D medical image segmentation.
We propose a novel framework that efficiently bridges a bf Convolutional neural network and a bf Transformer bf (CoTr) for accurate 3D medical image segmentation.
arXiv Detail & Related papers (2021-03-04T13:34:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.