TGANet: Text-guided attention for improved polyp segmentation
- URL: http://arxiv.org/abs/2205.04280v1
- Date: Mon, 9 May 2022 13:53:26 GMT
- Title: TGANet: Text-guided attention for improved polyp segmentation
- Authors: Nikhil Kumar Tomar, Debesh Jha, Ulas Bagci, Sharib Ali
- Abstract summary: Colonoscopy is a gold standard procedure but is highly operator-dependent. Automated polyp segmentation, a precancerous precursor, can minimize missed rates and timely treatment of colon cancer at an early stage.
We exploit size-related and polyp number-related features in the form of text attention during training.
Our proposed text-guided attention network (TGANet) can generalize well to variable-sized polyps in different datasets.
- Score: 2.3293678240472517
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Colonoscopy is a gold standard procedure but is highly operator-dependent.
Automated polyp segmentation, a precancerous precursor, can minimize missed
rates and timely treatment of colon cancer at an early stage. Even though there
are deep learning methods developed for this task, variability in polyp size
can impact model training, thereby limiting it to the size attribute of the
majority of samples in the training dataset that may provide sub-optimal
results to differently sized polyps. In this work, we exploit size-related and
polyp number-related features in the form of text attention during training. We
introduce an auxiliary classification task to weight the text-based embedding
that allows network to learn additional feature representations that can
distinctly adapt to differently sized polyps and can adapt to cases with
multiple polyps. Our experimental results demonstrate that these added text
embeddings improve the overall performance of the model compared to
state-of-the-art segmentation methods. We explore four different datasets and
provide insights for size-specific improvements. Our proposed text-guided
attention network (TGANet) can generalize well to variable-sized polyps in
different datasets.
Related papers
- Polyp-E: Benchmarking the Robustness of Deep Segmentation Models via Polyp Editing [32.30835026874521]
In daily clinical practice, clinicians exhibit robustness in identifying polyps with both location and size variations.
It is uncertain if deep segmentation models can achieve comparable robustness in automated colonoscopic analysis.
We focus on evaluating the robustness of segmentation models on the polyps with various attributes and healthy samples.
arXiv Detail & Related papers (2024-10-22T06:30:37Z) - ASPS: Augmented Segment Anything Model for Polyp Segmentation [77.25557224490075]
The Segment Anything Model (SAM) has introduced unprecedented potential for polyp segmentation.
SAM's Transformer-based structure prioritizes global and low-frequency information.
CFA integrates a trainable CNN encoder branch with a frozen ViT encoder, enabling the integration of domain-specific knowledge.
arXiv Detail & Related papers (2024-06-30T14:55:32Z) - ECC-PolypDet: Enhanced CenterNet with Contrastive Learning for Automatic
Polyp Detection [88.4359020192429]
Existing methods either involve computationally expensive context aggregation or lack prior modeling of polyps, resulting in poor performance in challenging cases.
In this paper, we propose the Enhanced CenterNet with Contrastive Learning (ECC-PolypDet), a two-stage training & end-to-end inference framework.
Box-assisted Contrastive Learning (BCL) during training to minimize the intra-class difference and maximize the inter-class difference between foreground polyps and backgrounds, enabling our model to capture concealed polyps.
In the fine-tuning stage, we introduce the IoU-guided Sample Re-weighting
arXiv Detail & Related papers (2024-01-10T07:03:41Z) - A Survey on Deep Learning for Polyp Segmentation: Techniques, Challenges
and Future Trends [41.267545810720044]
Early detection and assessment of polyps play a crucial role in the prevention and treatment of colorectal cancer (CRC)
In the past, people often relied on manually extracted lower-level features such as color, texture, and shape.
With the advent of deep learning, more and more outstanding medical image segmentation algorithms have emerged.
arXiv Detail & Related papers (2023-11-30T09:14:37Z) - Lesion-aware Dynamic Kernel for Polyp Segmentation [49.63274623103663]
We propose a lesion-aware dynamic network (LDNet) for polyp segmentation.
It is a traditional u-shape encoder-decoder structure incorporated with a dynamic kernel generation and updating scheme.
This simple but effective scheme endows our model with powerful segmentation performance and generalization capability.
arXiv Detail & Related papers (2023-01-12T09:53:57Z) - Stepwise Feature Fusion: Local Guides Global [14.394421688712052]
We propose a new State-Of-The-Art model for medical image segmentation, the SSFormer, which uses a pyramid Transformer encoder to improve the generalization ability of models.
Our proposed Progressive Locality Decoder can be adapted to the pyramid Transformer backbone to emphasize local features and attention dispersion.
arXiv Detail & Related papers (2022-03-07T10:36:38Z) - Self-Supervised U-Net for Segmenting Flat and Sessile Polyps [63.62764375279861]
Development of colorectal polyps is one of the earliest signs of cancer.
Early detection and resection of polyps can greatly increase survival rate to 90%.
Computer-Aided Diagnosis systems(CADx) has been proposed that detect polyps by processing the colonoscopic videos.
arXiv Detail & Related papers (2021-10-17T09:31:20Z) - Automatic Polyp Segmentation via Multi-scale Subtraction Network [100.94922587360871]
In clinical practice, precise polyp segmentation provides important information in the early detection of colorectal cancer.
Most existing methods are based on U-shape structure and use element-wise addition or concatenation to fuse different level features progressively in decoder.
We propose a multi-scale subtraction network (MSNet) to segment polyp from colonoscopy image.
arXiv Detail & Related papers (2021-08-11T07:54:07Z) - PraNet: Parallel Reverse Attention Network for Polyp Segmentation [155.93344756264824]
We propose a parallel reverse attention network (PraNet) for accurate polyp segmentation in colonoscopy images.
We first aggregate the features in high-level layers using a parallel partial decoder (PPD)
In addition, we mine the boundary cues using a reverse attention (RA) module, which is able to establish the relationship between areas and boundary cues.
arXiv Detail & Related papers (2020-06-13T08:13:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.