FLDNet: A Foreground-Aware Network for Polyp Segmentation Leveraging
Long-Distance Dependencies
- URL: http://arxiv.org/abs/2309.05987v1
- Date: Tue, 12 Sep 2023 06:32:42 GMT
- Title: FLDNet: A Foreground-Aware Network for Polyp Segmentation Leveraging
Long-Distance Dependencies
- Authors: Xuefeng Wei, Xuan Zhou
- Abstract summary: We propose FLDNet, a Transformer-based neural network that captures long-distance dependencies for accurate polyp segmentation.
Our proposed method, FLDNet, was evaluated using seven metrics on common datasets and demonstrated superiority over state-of-the-art methods on widely-used evaluation measures.
- Score: 1.7623838912231695
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Given the close association between colorectal cancer and polyps, the
diagnosis and identification of colorectal polyps play a critical role in the
detection and surgical intervention of colorectal cancer. In this context, the
automatic detection and segmentation of polyps from various colonoscopy images
has emerged as a significant problem that has attracted broad attention.
Current polyp segmentation techniques face several challenges: firstly, polyps
vary in size, texture, color, and pattern; secondly, the boundaries between
polyps and mucosa are usually blurred, existing studies have focused on
learning the local features of polyps while ignoring the long-range
dependencies of the features, and also ignoring the local context and global
contextual information of the combined features. To address these challenges,
we propose FLDNet (Foreground-Long-Distance Network), a Transformer-based
neural network that captures long-distance dependencies for accurate polyp
segmentation. Specifically, the proposed model consists of three main modules:
a pyramid-based Transformer encoder, a local context module, and a
foreground-Aware module. Multilevel features with long-distance dependency
information are first captured by the pyramid-based transformer encoder. On the
high-level features, the local context module obtains the local characteristics
related to the polyps by constructing different local context information. The
coarse map obtained by decoding the reconstructed highest-level features guides
the feature fusion process in the foreground-Aware module of the high-level
features to achieve foreground enhancement of the polyps. Our proposed method,
FLDNet, was evaluated using seven metrics on common datasets and demonstrated
superiority over state-of-the-art methods on widely-used evaluation measures.
Related papers
- PSTNet: Enhanced Polyp Segmentation with Multi-scale Alignment and Frequency Domain Integration [17.1088588766663]
Polyp Network with Shunted Transformer (PSTNet) is a novel approach that integrates both RGB and frequency domain cues present in the images.
PSTNet comprises three key modules: the Frequency characterization Attention Module (FCAM) for extracting frequency cues and capturing polyp characteristics, the Feature Supplementary Alignment Module (FSAM) for aligning semantic information and reducing noise, and the Cross Perception localization Module (CPM) for synergizing frequency cues with high-level semantics to achieve efficient polyp segmentation.
arXiv Detail & Related papers (2024-09-13T02:52:25Z) - Multi-Layer Dense Attention Decoder for Polyp Segmentation [10.141956829529859]
We propose a novel decoder architecture aimed at hierarchically aggregating locally enhanced multi-level dense features.
Specifically, we introduce a novel module named Dense Attention Gate (DAG), which adaptively fuses all previous layers' features to establish local feature relations among all layers.
Our experiments and comparisons with nine competing segmentation models demonstrate that the proposed architecture achieves state-of-the-art performance.
arXiv Detail & Related papers (2024-03-27T01:15:05Z) - Lesion-aware Dynamic Kernel for Polyp Segmentation [49.63274623103663]
We propose a lesion-aware dynamic network (LDNet) for polyp segmentation.
It is a traditional u-shape encoder-decoder structure incorporated with a dynamic kernel generation and updating scheme.
This simple but effective scheme endows our model with powerful segmentation performance and generalization capability.
arXiv Detail & Related papers (2023-01-12T09:53:57Z) - Adaptive Context Selection for Polyp Segmentation [99.9959901908053]
We propose an adaptive context selection based encoder-decoder framework which is composed of Local Context Attention (LCA) module, Global Context Module (GCM) and Adaptive Selection Module (ASM)
LCA modules deliver local context features from encoder layers to decoder layers, enhancing the attention to the hard region which is determined by the prediction map of previous layer.
GCM aims to further explore the global context features and send to the decoder layers. ASM is used for adaptive selection and aggregation of context features through channel-wise attention.
arXiv Detail & Related papers (2023-01-12T04:06:44Z) - LCPFormer: Towards Effective 3D Point Cloud Analysis via Local Context
Propagation in Transformers [60.51925353387151]
We propose a novel module named Local Context Propagation (LCP) to exploit the message passing between neighboring local regions.
We use the overlap points of adjacent local regions as intermediaries, then re-weight the features of these shared points from different local regions before passing them to the next layers.
The proposed method is applicable to different tasks and outperforms various transformer-based methods in benchmarks including 3D shape classification and dense prediction tasks.
arXiv Detail & Related papers (2022-10-23T15:43:01Z) - Polyp-PVT: Polyp Segmentation with Pyramid Vision Transformers [124.01928050651466]
We propose a new type of polyp segmentation method, named Polyp-PVT.
The proposed model, named Polyp-PVT, effectively suppresses noises in the features and significantly improves their expressive capabilities.
arXiv Detail & Related papers (2021-08-16T07:09:06Z) - Automatic Polyp Segmentation via Multi-scale Subtraction Network [100.94922587360871]
In clinical practice, precise polyp segmentation provides important information in the early detection of colorectal cancer.
Most existing methods are based on U-shape structure and use element-wise addition or concatenation to fuse different level features progressively in decoder.
We propose a multi-scale subtraction network (MSNet) to segment polyp from colonoscopy image.
arXiv Detail & Related papers (2021-08-11T07:54:07Z) - PraNet: Parallel Reverse Attention Network for Polyp Segmentation [155.93344756264824]
We propose a parallel reverse attention network (PraNet) for accurate polyp segmentation in colonoscopy images.
We first aggregate the features in high-level layers using a parallel partial decoder (PPD)
In addition, we mine the boundary cues using a reverse attention (RA) module, which is able to establish the relationship between areas and boundary cues.
arXiv Detail & Related papers (2020-06-13T08:13:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.