M3FPolypSegNet: Segmentation Network with Multi-frequency Feature Fusion
for Polyp Localization in Colonoscopy Images
- URL: http://arxiv.org/abs/2310.05538v2
- Date: Tue, 10 Oct 2023 03:41:51 GMT
- Title: M3FPolypSegNet: Segmentation Network with Multi-frequency Feature Fusion
for Polyp Localization in Colonoscopy Images
- Authors: Ju-Hyeon Nam, Seo-Hyeong Park, Nur Suriza Syazwany, Yerim Jung, Yu-Han
Im and Sang-Chul Lee
- Abstract summary: Multi-Frequency Feature Fusion Polyp Network (M3FPolypSegNet) was proposed to decompose the input image into low/high/full-frequency components.
We used three independent multi-frequency encoders to map multiple input images into a high-dimensional feature space.
We designed three multi-task learning (i.e., region, edge, and distance) in four decoder blocks to learn the structural characteristics of the region.
- Score: 1.389360509566256
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Polyp segmentation is crucial for preventing colorectal cancer a common type
of cancer. Deep learning has been used to segment polyps automatically, which
reduces the risk of misdiagnosis. Localizing small polyps in colonoscopy images
is challenging because of its complex characteristics, such as color,
occlusion, and various shapes of polyps. To address this challenge, a novel
frequency-based fully convolutional neural network, Multi-Frequency Feature
Fusion Polyp Segmentation Network (M3FPolypSegNet) was proposed to decompose
the input image into low/high/full-frequency components to use the
characteristics of each component. We used three independent multi-frequency
encoders to map multiple input images into a high-dimensional feature space. In
the Frequency-ASPP Scalable Attention Module (F-ASPP SAM), ASPP was applied
between each frequency component to preserve scale information. Subsequently,
scalable attention was applied to emphasize polyp regions in a high-dimensional
feature space. Finally, we designed three multi-task learning (i.e., region,
edge, and distance) in four decoder blocks to learn the structural
characteristics of the region. The proposed model outperformed various
segmentation models with performance gains of 6.92% and 7.52% on average for
all metrics on CVC-ClinicDB and BKAI-IGH-NeoPolyp, respectively.
Related papers
- PSTNet: Enhanced Polyp Segmentation with Multi-scale Alignment and Frequency Domain Integration [17.1088588766663]
Polyp Network with Shunted Transformer (PSTNet) is a novel approach that integrates both RGB and frequency domain cues present in the images.
PSTNet comprises three key modules: the Frequency characterization Attention Module (FCAM) for extracting frequency cues and capturing polyp characteristics, the Feature Supplementary Alignment Module (FSAM) for aligning semantic information and reducing noise, and the Cross Perception localization Module (CPM) for synergizing frequency cues with high-level semantics to achieve efficient polyp segmentation.
arXiv Detail & Related papers (2024-09-13T02:52:25Z) - Polyp segmentation in colonoscopy images using DeepLabV3++ [3.0182171147100076]
We introduce the DeepLabv3++ model which is an enhanced version of the DeepLabv3+ architecture.
The proposed model incorporates diverse separable convolutional layers and attention mechanisms within the MSPP block, enhancing its capacity to capture multi-scale and directional features.
The experimental analysis shows that DeepLabV3++ outperforms several state-of-the-art models in polyp segmentation tasks.
arXiv Detail & Related papers (2024-07-27T19:24:55Z) - Efficient Polyp Segmentation Via Integrity Learning [14.34505893948565]
This paper introduces the integrity concept in polyp segmentation at both macro and micro levels, aiming to alleviate integrity deficiency.
Our Integrity Capturing Polyp (IC-PolypSeg) network utilizes lightweight backbones and 3 key components for integrity ameliorating.
arXiv Detail & Related papers (2023-09-15T08:11:05Z) - Lesion-aware Dynamic Kernel for Polyp Segmentation [49.63274623103663]
We propose a lesion-aware dynamic network (LDNet) for polyp segmentation.
It is a traditional u-shape encoder-decoder structure incorporated with a dynamic kernel generation and updating scheme.
This simple but effective scheme endows our model with powerful segmentation performance and generalization capability.
arXiv Detail & Related papers (2023-01-12T09:53:57Z) - Adaptive Context Selection for Polyp Segmentation [99.9959901908053]
We propose an adaptive context selection based encoder-decoder framework which is composed of Local Context Attention (LCA) module, Global Context Module (GCM) and Adaptive Selection Module (ASM)
LCA modules deliver local context features from encoder layers to decoder layers, enhancing the attention to the hard region which is determined by the prediction map of previous layer.
GCM aims to further explore the global context features and send to the decoder layers. ASM is used for adaptive selection and aggregation of context features through channel-wise attention.
arXiv Detail & Related papers (2023-01-12T04:06:44Z) - Polyp-PVT: Polyp Segmentation with Pyramid Vision Transformers [124.01928050651466]
We propose a new type of polyp segmentation method, named Polyp-PVT.
The proposed model, named Polyp-PVT, effectively suppresses noises in the features and significantly improves their expressive capabilities.
arXiv Detail & Related papers (2021-08-16T07:09:06Z) - Automatic Polyp Segmentation via Multi-scale Subtraction Network [100.94922587360871]
In clinical practice, precise polyp segmentation provides important information in the early detection of colorectal cancer.
Most existing methods are based on U-shape structure and use element-wise addition or concatenation to fuse different level features progressively in decoder.
We propose a multi-scale subtraction network (MSNet) to segment polyp from colonoscopy image.
arXiv Detail & Related papers (2021-08-11T07:54:07Z) - PSConv: Squeezing Feature Pyramid into One Compact Poly-Scale
Convolutional Layer [76.44375136492827]
Convolutional Neural Networks (CNNs) are often scale-sensitive.
We bridge this regret by exploiting multi-scale features in a finer granularity.
The proposed convolution operation, named Poly-Scale Convolution (PSConv), mixes up a spectrum of dilation rates.
arXiv Detail & Related papers (2020-07-13T05:14:11Z) - PraNet: Parallel Reverse Attention Network for Polyp Segmentation [155.93344756264824]
We propose a parallel reverse attention network (PraNet) for accurate polyp segmentation in colonoscopy images.
We first aggregate the features in high-level layers using a parallel partial decoder (PPD)
In addition, we mine the boundary cues using a reverse attention (RA) module, which is able to establish the relationship between areas and boundary cues.
arXiv Detail & Related papers (2020-06-13T08:13:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.