Multi Kernel Positional Embedding ConvNeXt for Polyp Segmentation
- URL: http://arxiv.org/abs/2301.06673v2
- Date: Thu, 15 Jun 2023 08:08:06 GMT
- Title: Multi Kernel Positional Embedding ConvNeXt for Polyp Segmentation
- Authors: Trong-Hieu Nguyen Mau, Quoc-Huy Trinh, Nhat-Tan Bui, Minh-Triet Tran,
Hai-Dang Nguyen
- Abstract summary: We propose a novel framework composed of ConvNeXt backbone and Multi Kernel Positional Embedding block.
Our model achieves the Dice coefficient of 0.8818 and the IOU score of 0.8163 on the Kvasir-SEG dataset.
- Score: 7.31341312596412
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Medical image segmentation is the technique that helps doctor view and has a
precise diagnosis, particularly in Colorectal Cancer. Specifically, with the
increase in cases, the diagnosis and identification need to be faster and more
accurate for many patients; in endoscopic images, the segmentation task has
been vital to helping the doctor identify the position of the polyps or the
ache in the system correctly. As a result, many efforts have been made to apply
deep learning to automate polyp segmentation, mostly to ameliorate the U-shape
structure. However, the simple skip connection scheme in UNet leads to
deficient context information and the semantic gap between feature maps from
the encoder and decoder. To deal with this problem, we propose a novel
framework composed of ConvNeXt backbone and Multi Kernel Positional Embedding
block. Thanks to the suggested module, our method can attain better accuracy
and generalization in the polyps segmentation task. Extensive experiments show
that our model achieves the Dice coefficient of 0.8818 and the IOU score of
0.8163 on the Kvasir-SEG dataset. Furthermore, on various datasets, we make
competitive achievement results with other previous state-of-the-art methods.
Related papers
- Towards a Benchmark for Colorectal Cancer Segmentation in Endorectal Ultrasound Videos: Dataset and Model Development [59.74920439478643]
In this paper, we collect and annotated the first benchmark dataset that covers diverse ERUS scenarios.
Our ERUS-10K dataset comprises 77 videos and 10,000 high-resolution annotated frames.
We introduce a benchmark model for colorectal cancer segmentation, named the Adaptive Sparse-context TRansformer (ASTR)
arXiv Detail & Related papers (2024-08-19T15:04:42Z) - GCtx-UNet: Efficient Network for Medical Image Segmentation [0.2353157426758003]
GCtx-UNet is a lightweight segmentation architecture that can capture global and local image features with accuracy better than state-of-the-art approaches.
GCtx-UNet is evaluated on the Synapse multi-organ abdominal CT dataset, the ACDC cardiac MRI dataset, and several polyp segmentation datasets.
arXiv Detail & Related papers (2024-06-09T19:17:14Z) - Dual-scale Enhanced and Cross-generative Consistency Learning for Semi-supervised Medical Image Segmentation [49.57907601086494]
Medical image segmentation plays a crucial role in computer-aided diagnosis.
We propose a novel Dual-scale Enhanced and Cross-generative consistency learning framework for semi-supervised medical image (DEC-Seg)
arXiv Detail & Related papers (2023-12-26T12:56:31Z) - Lesion-aware Dynamic Kernel for Polyp Segmentation [49.63274623103663]
We propose a lesion-aware dynamic network (LDNet) for polyp segmentation.
It is a traditional u-shape encoder-decoder structure incorporated with a dynamic kernel generation and updating scheme.
This simple but effective scheme endows our model with powerful segmentation performance and generalization capability.
arXiv Detail & Related papers (2023-01-12T09:53:57Z) - Automatic Polyp Segmentation via Multi-scale Subtraction Network [100.94922587360871]
In clinical practice, precise polyp segmentation provides important information in the early detection of colorectal cancer.
Most existing methods are based on U-shape structure and use element-wise addition or concatenation to fuse different level features progressively in decoder.
We propose a multi-scale subtraction network (MSNet) to segment polyp from colonoscopy image.
arXiv Detail & Related papers (2021-08-11T07:54:07Z) - Deep ensembles based on Stochastic Activation Selection for Polyp
Segmentation [82.61182037130406]
This work deals with medical image segmentation and in particular with accurate polyp detection and segmentation during colonoscopy examinations.
Basic architecture in image segmentation consists of an encoder and a decoder.
We compare some variant of the DeepLab architecture obtained by varying the decoder backbone.
arXiv Detail & Related papers (2021-04-02T02:07:37Z) - PraNet: Parallel Reverse Attention Network for Polyp Segmentation [155.93344756264824]
We propose a parallel reverse attention network (PraNet) for accurate polyp segmentation in colonoscopy images.
We first aggregate the features in high-level layers using a parallel partial decoder (PPD)
In addition, we mine the boundary cues using a reverse attention (RA) module, which is able to establish the relationship between areas and boundary cues.
arXiv Detail & Related papers (2020-06-13T08:13:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.