Transformer-Enhanced Iterative Feedback Mechanism for Polyp Segmentation
- URL: http://arxiv.org/abs/2409.05875v1
- Date: Sun, 25 Aug 2024 02:39:45 GMT
- Title: Transformer-Enhanced Iterative Feedback Mechanism for Polyp Segmentation
- Authors: Nikhil Kumar Tomar, Debesh Jha, Koushik Biswas, Tyler M. Berzin, Rajesh Keswani, Michael Wallace, Ulas Bagci,
- Abstract summary: We introduce textittextbfacFANetv2, an advanced encoder-decoder network designed to accurately segment polyps from colonoscopy images.
FANetv2 iteratively refines its binary segmentation masks through a novel feedback attention mechanism.
It integrates essential information about the number (one or many) and size (small, medium, large) of polyps to further enhance its feature representation capabilities.
- Score: 3.017482151674131
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Colorectal cancer (CRC) is the third most common cause of cancer diagnosed in the United States and the second leading cause of cancer-related death among both genders. Notably, CRC is the leading cause of cancer in younger men less than 50 years old. Colonoscopy is considered the gold standard for the early diagnosis of CRC. Skills vary significantly among endoscopists, and a high miss rate is reported. Automated polyp segmentation can reduce the missed rates, and timely treatment is possible in the early stage. To address this challenge, we introduce \textit{\textbf{\ac{FANetv2}}}, an advanced encoder-decoder network designed to accurately segment polyps from colonoscopy images. Leveraging an initial input mask generated by Otsu thresholding, FANetv2 iteratively refines its binary segmentation masks through a novel feedback attention mechanism informed by the mask predictions of previous epochs. Additionally, it employs a text-guided approach that integrates essential information about the number (one or many) and size (small, medium, large) of polyps to further enhance its feature representation capabilities. This dual-task approach facilitates accurate polyp segmentation and aids in the auxiliary classification of polyp attributes, significantly boosting the model's performance. Our comprehensive evaluations on the publicly available BKAI-IGH and CVC-ClinicDB datasets demonstrate the superior performance of FANetv2, evidenced by high dice similarity coefficients (DSC) of 0.9186 and 0.9481, along with low Hausdorff distances of 2.83 and 3.19, respectively. The source code for FANetv2 is available at https://github.com/xxxxx/FANetv2.
Related papers
- ASPS: Augmented Segment Anything Model for Polyp Segmentation [77.25557224490075]
The Segment Anything Model (SAM) has introduced unprecedented potential for polyp segmentation.
SAM's Transformer-based structure prioritizes global and low-frequency information.
CFA integrates a trainable CNN encoder branch with a frozen ViT encoder, enabling the integration of domain-specific knowledge.
arXiv Detail & Related papers (2024-06-30T14:55:32Z) - Lesion-aware Dynamic Kernel for Polyp Segmentation [49.63274623103663]
We propose a lesion-aware dynamic network (LDNet) for polyp segmentation.
It is a traditional u-shape encoder-decoder structure incorporated with a dynamic kernel generation and updating scheme.
This simple but effective scheme endows our model with powerful segmentation performance and generalization capability.
arXiv Detail & Related papers (2023-01-12T09:53:57Z) - DilatedSegNet: A Deep Dilated Segmentation Network for Polyp
Segmentation [2.6179759969345002]
Colorectal cancer (CRC) is the second leading cause of cancer-related death worldwide.
Powered by deep learning, computer-aided diagnosis (CAD) systems can detect regions in the colon overlooked by physicians during colonoscopy.
Lacking high accuracy and real-time speed are the essential obstacles to be overcome for successful clinical integration of such systems.
arXiv Detail & Related papers (2022-10-24T20:36:30Z) - TransResU-Net: Transformer based ResU-Net for Real-Time Colonoscopy
Polyp Segmentation [1.9875031133911856]
The miss rate of polyps, adenomas and advanced adenomas remains significantly high.
Deep learning-based computer-aided diagnosis (CADx) system may help gastroenterologists to identify polyps that may otherwise be missed.
TransResU-Net could be a strong benchmark for building a real-time polyp detection system.
arXiv Detail & Related papers (2022-06-17T19:36:37Z) - Automatic Polyp Segmentation with Multiple Kernel Dilated Convolution
Network [3.1374864575817214]
In this study, we introduce a novel deep learning architecture, named textbfMKDCNet, for automatic polyp segmentation.
Experiments on four publicly available polyp datasets and cell nuclei dataset show that the proposed MKDCNet outperforms the state-of-the-art methods.
MKDCNet can be a strong benchmark for building real-time systems for clinical colonoscopies.
arXiv Detail & Related papers (2022-06-13T15:47:38Z) - Self-Supervised U-Net for Segmenting Flat and Sessile Polyps [63.62764375279861]
Development of colorectal polyps is one of the earliest signs of cancer.
Early detection and resection of polyps can greatly increase survival rate to 90%.
Computer-Aided Diagnosis systems(CADx) has been proposed that detect polyps by processing the colonoscopic videos.
arXiv Detail & Related papers (2021-10-17T09:31:20Z) - Cross-Site Severity Assessment of COVID-19 from CT Images via Domain
Adaptation [64.59521853145368]
Early and accurate severity assessment of Coronavirus disease 2019 (COVID-19) based on computed tomography (CT) images offers a great help to the estimation of intensive care unit event.
To augment the labeled data and improve the generalization ability of the classification model, it is necessary to aggregate data from multiple sites.
This task faces several challenges including class imbalance between mild and severe infections, domain distribution discrepancy between sites, and presence of heterogeneous features.
arXiv Detail & Related papers (2021-09-08T07:56:51Z) - Automatic Polyp Segmentation via Multi-scale Subtraction Network [100.94922587360871]
In clinical practice, precise polyp segmentation provides important information in the early detection of colorectal cancer.
Most existing methods are based on U-shape structure and use element-wise addition or concatenation to fuse different level features progressively in decoder.
We propose a multi-scale subtraction network (MSNet) to segment polyp from colonoscopy image.
arXiv Detail & Related papers (2021-08-11T07:54:07Z) - AG-CUResNeSt: A Novel Method for Colon Polyp Segmentation [0.0]
This paper proposes a novel neural network architecture called AG-CUResNeSt, which enhances Coupled UNets using the robust ResNeSt backbone and attention gates.
We show that our proposed method achieves state-of-the-art accuracy compared to existing methods.
arXiv Detail & Related papers (2021-05-02T06:36:36Z) - PraNet: Parallel Reverse Attention Network for Polyp Segmentation [155.93344756264824]
We propose a parallel reverse attention network (PraNet) for accurate polyp segmentation in colonoscopy images.
We first aggregate the features in high-level layers using a parallel partial decoder (PPD)
In addition, we mine the boundary cues using a reverse attention (RA) module, which is able to establish the relationship between areas and boundary cues.
arXiv Detail & Related papers (2020-06-13T08:13:43Z) - Colorectal Polyp Segmentation by U-Net with Dilation Convolution [9.840695333927496]
Colorectal cancer (CRC) is one of the most commonly diagnosed cancers and a leading cause of cancer deaths in the United States.
Currently, the most common way for colorectal polyp detection and precancerous pathology is the colonoscopy.
We propose a novel end-to-end deep learning framework for the colorectal polyp segmentation.
arXiv Detail & Related papers (2019-12-26T23:27:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.