G-CASCADE: Efficient Cascaded Graph Convolutional Decoding for 2D
Medical Image Segmentation
- URL: http://arxiv.org/abs/2310.16175v1
- Date: Tue, 24 Oct 2023 20:41:04 GMT
- Title: G-CASCADE: Efficient Cascaded Graph Convolutional Decoding for 2D
Medical Image Segmentation
- Authors: Md Mostafijur Rahman and Radu Marculescu
- Abstract summary: We propose a new graph convolution-based decoder namely, Cascaded Graph Convolutional Attention Decoder (G-CASCADE)
G-CASCADE progressively refines multi-stage feature maps generated by hierarchical transformer encoders with an efficient graph convolution block.
Our decoder can easily be used with other hierarchical encoders for general-purpose semantic and medical image segmentation tasks.
- Score: 8.550528610846456
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: In recent years, medical image segmentation has become an important
application in the field of computer-aided diagnosis. In this paper, we are the
first to propose a new graph convolution-based decoder namely, Cascaded Graph
Convolutional Attention Decoder (G-CASCADE), for 2D medical image segmentation.
G-CASCADE progressively refines multi-stage feature maps generated by
hierarchical transformer encoders with an efficient graph convolution block.
The encoder utilizes the self-attention mechanism to capture long-range
dependencies, while the decoder refines the feature maps preserving long-range
information due to the global receptive fields of the graph convolution block.
Rigorous evaluations of our decoder with multiple transformer encoders on five
medical image segmentation tasks (i.e., Abdomen organs, Cardiac organs, Polyp
lesions, Skin lesions, and Retinal vessels) show that our model outperforms
other state-of-the-art (SOTA) methods. We also demonstrate that our decoder
achieves better DICE scores than the SOTA CASCADE decoder with 80.8% fewer
parameters and 82.3% fewer FLOPs. Our decoder can easily be used with other
hierarchical encoders for general-purpose semantic and medical image
segmentation tasks.
Related papers
- D-TrAttUnet: Toward Hybrid CNN-Transformer Architecture for Generic and Subtle Segmentation in Medical Images [18.087365906278013]
D-TrAttUnet architecture is based on the observation that different diseases often target specific organs.
Our architecture includes an encoder-decoder structure with a composite-CNN encoder and dual decoders.
To validate our approach, we performed evaluations on the Covid-19 and Bone Metastasis segmentation tasks.
arXiv Detail & Related papers (2024-05-07T10:09:41Z) - Extreme Encoder Output Frame Rate Reduction: Improving Computational
Latencies of Large End-to-End Models [59.57732929473519]
We apply multiple frame reduction layers in the encoder to compress encoder outputs into a small number of output frames.
We demonstrate that we can generate one encoder output frame for every 2.56 sec of input speech, without significantly affecting word error rate on a large-scale voice search task.
arXiv Detail & Related papers (2024-02-27T03:40:44Z) - Dual-scale Enhanced and Cross-generative Consistency Learning for Semi-supervised Medical Image Segmentation [49.57907601086494]
Medical image segmentation plays a crucial role in computer-aided diagnosis.
We propose a novel Dual-scale Enhanced and Cross-generative consistency learning framework for semi-supervised medical image (DEC-Seg)
arXiv Detail & Related papers (2023-12-26T12:56:31Z) - Dilated-UNet: A Fast and Accurate Medical Image Segmentation Approach
using a Dilated Transformer and U-Net Architecture [0.6445605125467572]
This paper introduces Dilated-UNet, which combines a Dilated Transformer block with the U-Net architecture for accurate and fast medical image segmentation.
The results of our experiments show that Dilated-UNet outperforms other models on several challenging medical image segmentation datasets.
arXiv Detail & Related papers (2023-04-22T17:20:13Z) - On the Importance of Image Encoding in Automated Chest X-Ray Report
Generation [4.843654097048771]
Chest X-ray is one of the most popular medical imaging modalities due to its accessibility and effectiveness.
There is a chronic shortage of well-trained radiologists who can interpret these images and diagnose the patient's condition.
automated radiology report generation can be a very helpful tool in clinical practice.
arXiv Detail & Related papers (2022-11-24T08:02:52Z) - HistoSeg : Quick attention with multi-loss function for multi-structure
segmentation in digital histology images [0.696194614504832]
Medical image segmentation assists in computer-aided diagnosis, surgeries, and treatment.
We proposed an generalization-Decoder Network, Quick Attention Module and a Multi Loss Function.
We evaluate the capability of our proposed network on two publicly available datasets for medical image segmentation MoNuSeg and GlaS.
arXiv Detail & Related papers (2022-09-01T21:10:00Z) - Small Lesion Segmentation in Brain MRIs with Subpixel Embedding [105.1223735549524]
We present a method to segment MRI scans of the human brain into ischemic stroke lesion and normal tissues.
We propose a neural network architecture in the form of a standard encoder-decoder where predictions are guided by a spatial expansion embedding network.
arXiv Detail & Related papers (2021-09-18T00:21:17Z) - Crosslink-Net: Double-branch Encoder Segmentation Network via Fusing
Vertical and Horizontal Convolutions [58.71117402626524]
We present a novel double-branch encoder architecture for medical image segmentation.
Our architecture is inspired by two observations: 1) Since the discrimination of features learned via square convolutional kernels needs to be further improved, we propose to utilize non-square vertical and horizontal convolutional kernels.
The experiments validate the effectiveness of our model on four datasets.
arXiv Detail & Related papers (2021-07-24T02:58:32Z) - Atrous Residual Interconnected Encoder to Attention Decoder Framework
for Vertebrae Segmentation via 3D Volumetric CT Images [1.8146155083014204]
This paper proposes a novel algorithm for automated vertebrae segmentation via 3D volumetric spine CT images.
The proposed model is based on the structure of encoder to decoder, using layer normalization to optimize mini-batch training performance.
The experimental results show that our model achieves competitive performance compared with other state-of-the-art medical semantic segmentation methods.
arXiv Detail & Related papers (2021-04-08T12:09:16Z) - Deep ensembles based on Stochastic Activation Selection for Polyp
Segmentation [82.61182037130406]
This work deals with medical image segmentation and in particular with accurate polyp detection and segmentation during colonoscopy examinations.
Basic architecture in image segmentation consists of an encoder and a decoder.
We compare some variant of the DeepLab architecture obtained by varying the decoder backbone.
arXiv Detail & Related papers (2021-04-02T02:07:37Z) - TransUNet: Transformers Make Strong Encoders for Medical Image
Segmentation [78.01570371790669]
Medical image segmentation is an essential prerequisite for developing healthcare systems.
On various medical image segmentation tasks, the u-shaped architecture, also known as U-Net, has become the de-facto standard.
We propose TransUNet, which merits both Transformers and U-Net, as a strong alternative for medical image segmentation.
arXiv Detail & Related papers (2021-02-08T16:10:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.