Multi Kernel Positional Embedding ConvNeXt for Polyp Segmentation
- URL: http://arxiv.org/abs/2301.06673v2
- Date: Thu, 15 Jun 2023 08:08:06 GMT
- Title: Multi Kernel Positional Embedding ConvNeXt for Polyp Segmentation
- Authors: Trong-Hieu Nguyen Mau, Quoc-Huy Trinh, Nhat-Tan Bui, Minh-Triet Tran,
Hai-Dang Nguyen
- Abstract summary: We propose a novel framework composed of ConvNeXt backbone and Multi Kernel Positional Embedding block.
Our model achieves the Dice coefficient of 0.8818 and the IOU score of 0.8163 on the Kvasir-SEG dataset.
- Score: 7.31341312596412
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Medical image segmentation is the technique that helps doctor view and has a
precise diagnosis, particularly in Colorectal Cancer. Specifically, with the
increase in cases, the diagnosis and identification need to be faster and more
accurate for many patients; in endoscopic images, the segmentation task has
been vital to helping the doctor identify the position of the polyps or the
ache in the system correctly. As a result, many efforts have been made to apply
deep learning to automate polyp segmentation, mostly to ameliorate the U-shape
structure. However, the simple skip connection scheme in UNet leads to
deficient context information and the semantic gap between feature maps from
the encoder and decoder. To deal with this problem, we propose a novel
framework composed of ConvNeXt backbone and Multi Kernel Positional Embedding
block. Thanks to the suggested module, our method can attain better accuracy
and generalization in the polyps segmentation task. Extensive experiments show
that our model achieves the Dice coefficient of 0.8818 and the IOU score of
0.8163 on the Kvasir-SEG dataset. Furthermore, on various datasets, we make
competitive achievement results with other previous state-of-the-art methods.
Related papers
- Enhancing Polyp Segmentation via Encoder Attention and Dynamic Kernel Update [0.0]
Polyp segmentation is a critical step in colorectal cancer detection, yet it remains challenging due to the diverse shapes, sizes, and low contrast boundaries of polyps.<n>We propose a novel framework that improves segmentation accuracy and efficiency by integrating a Dynamic Kernel (DK) mechanism with a global Attention module.
arXiv Detail & Related papers (2025-09-27T21:16:09Z) - Large Language Model Evaluated Stand-alone Attention-Assisted Graph Neural Network with Spatial and Structural Information Interaction for Precise Endoscopic Image Segmentation [16.773882069530426]
We propose FOCUS-Med, which stands for Fusion of spatial and structural graph with attentional context-aware polyp segmentation.<n> FOCUS-Med integrates a Dual Graph Convolutional Network (Dual-GCN) module to capture contextual spatial and topological structural dependencies.<n>Experiments on public benchmarks demonstrate that FOCUS-Med achieves state-of-the-art performance across five key metrics.
arXiv Detail & Related papers (2025-08-09T15:53:19Z) - Towards a Benchmark for Colorectal Cancer Segmentation in Endorectal Ultrasound Videos: Dataset and Model Development [59.74920439478643]
In this paper, we collect and annotated the first benchmark dataset that covers diverse ERUS scenarios.
Our ERUS-10K dataset comprises 77 videos and 10,000 high-resolution annotated frames.
We introduce a benchmark model for colorectal cancer segmentation, named the Adaptive Sparse-context TRansformer (ASTR)
arXiv Detail & Related papers (2024-08-19T15:04:42Z) - GCtx-UNet: Efficient Network for Medical Image Segmentation [0.2353157426758003]
GCtx-UNet is a lightweight segmentation architecture that can capture global and local image features with accuracy better than state-of-the-art approaches.
GCtx-UNet is evaluated on the Synapse multi-organ abdominal CT dataset, the ACDC cardiac MRI dataset, and several polyp segmentation datasets.
arXiv Detail & Related papers (2024-06-09T19:17:14Z) - Dual-scale Enhanced and Cross-generative Consistency Learning for Semi-supervised Medical Image Segmentation [49.57907601086494]
Medical image segmentation plays a crucial role in computer-aided diagnosis.
We propose a novel Dual-scale Enhanced and Cross-generative consistency learning framework for semi-supervised medical image (DEC-Seg)
arXiv Detail & Related papers (2023-12-26T12:56:31Z) - Lesion-aware Dynamic Kernel for Polyp Segmentation [49.63274623103663]
We propose a lesion-aware dynamic network (LDNet) for polyp segmentation.
It is a traditional u-shape encoder-decoder structure incorporated with a dynamic kernel generation and updating scheme.
This simple but effective scheme endows our model with powerful segmentation performance and generalization capability.
arXiv Detail & Related papers (2023-01-12T09:53:57Z) - Automatic Polyp Segmentation via Multi-scale Subtraction Network [100.94922587360871]
In clinical practice, precise polyp segmentation provides important information in the early detection of colorectal cancer.
Most existing methods are based on U-shape structure and use element-wise addition or concatenation to fuse different level features progressively in decoder.
We propose a multi-scale subtraction network (MSNet) to segment polyp from colonoscopy image.
arXiv Detail & Related papers (2021-08-11T07:54:07Z) - Deep ensembles based on Stochastic Activation Selection for Polyp
Segmentation [82.61182037130406]
This work deals with medical image segmentation and in particular with accurate polyp detection and segmentation during colonoscopy examinations.
Basic architecture in image segmentation consists of an encoder and a decoder.
We compare some variant of the DeepLab architecture obtained by varying the decoder backbone.
arXiv Detail & Related papers (2021-04-02T02:07:37Z) - PraNet: Parallel Reverse Attention Network for Polyp Segmentation [155.93344756264824]
We propose a parallel reverse attention network (PraNet) for accurate polyp segmentation in colonoscopy images.
We first aggregate the features in high-level layers using a parallel partial decoder (PPD)
In addition, we mine the boundary cues using a reverse attention (RA) module, which is able to establish the relationship between areas and boundary cues.
arXiv Detail & Related papers (2020-06-13T08:13:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.