Real-time automatic polyp detection in colonoscopy using feature
enhancement module and spatiotemporal similarity correlation unit
- URL: http://arxiv.org/abs/2201.10079v1
- Date: Tue, 25 Jan 2022 03:40:30 GMT
- Title: Real-time automatic polyp detection in colonoscopy using feature
enhancement module and spatiotemporal similarity correlation unit
- Authors: Jianwei Xu, Ran Zhao, Yizhou Yu, Qingwei Zhang, Xianzhang Bian, Jun
Wang, Zhizheng Ge, and Dahong Qian
- Abstract summary: State-of-the-art methods are based on convolutional neural networks (CNNs)
Our method combines the two-dimensional (2-D) CNN-based real-time object detector network withtemporal information.
It's demonstrated that our method provides a performance improvement in sensitivity, precision and specificity, and has great potential to be applied in clinical colonoscopy.
- Score: 34.28382404976628
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Automatic detection of polyps is challenging because different polyps vary
greatly, while the changes between polyps and their analogues are small. The
state-of-the-art methods are based on convolutional neural networks (CNNs).
However, they may fail due to lack of training data, resulting in high rates of
missed detection and false positives (FPs). In order to solve these problems,
our method combines the two-dimensional (2-D) CNN-based real-time object
detector network with spatiotemporal information. Firstly, we use a 2-D
detector network to detect static images and frames, and based on the detector
network, we propose two feature enhancement modules-the FP Relearning Module
(FPRM) to make the detector network learning more about the features of FPs for
higher precision, and the Image Style Transfer Module (ISTM) to enhance the
features of polyps for sensitivity improvement. In video detection, we
integrate spatiotemporal information, which uses Structural Similarity (SSIM)
to measure the similarity between video frames. Finally, we propose the
Inter-frame Similarity Correlation Unit (ISCU) to combine the results obtained
by the detector network and frame similarity to make the final decision. We
verify our method on both private databases and publicly available databases.
Experimental results show that these modules and units provide a performance
improvement compared with the baseline method. Comparison with the
state-of-the-art methods shows that the proposed method outperforms the
existing ones which can meet real-time constraints. It's demonstrated that our
method provides a performance improvement in sensitivity, precision and
specificity, and has great potential to be applied in clinical colonoscopy.
Related papers
- SSTFB: Leveraging self-supervised pretext learning and temporal self-attention with feature branching for real-time video polyp segmentation [4.027361638728112]
We propose a video polyp segmentation method that performs self-supervised learning as an auxiliary task and a spatial-temporal self-attention mechanism for improved representation learning.
Our experimental results demonstrate an improvement with respect to several state-of-the-art (SOTA) methods.
Our ablation study confirms that the choice of the proposed joint end-to-end training improves network accuracy by over 3% and nearly 10% on both the Dice similarity coefficient and intersection-over-union.
arXiv Detail & Related papers (2024-06-14T17:33:11Z) - DA-Flow: Dual Attention Normalizing Flow for Skeleton-based Video Anomaly Detection [52.74152717667157]
We propose a lightweight module called Dual Attention Module (DAM) for capturing cross-dimension interaction relationships in-temporal skeletal data.
It employs the frame attention mechanism to identify the most significant frames and the skeleton attention mechanism to capture broader relationships across fixed partitions with minimal parameters and flops.
arXiv Detail & Related papers (2024-06-05T06:18:03Z) - BetterNet: An Efficient CNN Architecture with Residual Learning and Attention for Precision Polyp Segmentation [0.6062751776009752]
This research presents BetterNet, a convolutional neural network architecture that combines residual learning and attention methods to enhance the accuracy of polyp segmentation.
BetterNet shows promise in integrating computer-assisted diagnosis techniques to enhance the detection of polyps and the early recognition of cancer.
arXiv Detail & Related papers (2024-05-05T21:08:49Z) - Domain Adaptive Synapse Detection with Weak Point Annotations [63.97144211520869]
We present AdaSyn, a framework for domain adaptive synapse detection with weak point annotations.
In the WASPSYN challenge at I SBI 2023, our method ranks the 1st place.
arXiv Detail & Related papers (2023-08-31T05:05:53Z) - Accurate Real-time Polyp Detection in Videos from Concatenation of
Latent Features Extracted from Consecutive Frames [5.2009074009536524]
Convolutional neural networks (CNNs) are vulnerable to small changes in the input image.
A CNN-based model may miss the same polyp appearing in a series of consecutive frames.
We propose an efficient feature concatenation method for a CNN-based encoder-decoder model.
arXiv Detail & Related papers (2023-03-10T11:51:22Z) - Lesion-aware Dynamic Kernel for Polyp Segmentation [49.63274623103663]
We propose a lesion-aware dynamic network (LDNet) for polyp segmentation.
It is a traditional u-shape encoder-decoder structure incorporated with a dynamic kernel generation and updating scheme.
This simple but effective scheme endows our model with powerful segmentation performance and generalization capability.
arXiv Detail & Related papers (2023-01-12T09:53:57Z) - Polyp-PVT: Polyp Segmentation with Pyramid Vision Transformers [124.01928050651466]
We propose a new type of polyp segmentation method, named Polyp-PVT.
The proposed model, named Polyp-PVT, effectively suppresses noises in the features and significantly improves their expressive capabilities.
arXiv Detail & Related papers (2021-08-16T07:09:06Z) - Multiscale Detection of Cancerous Tissue in High Resolution Slide Scans [0.0]
We present an algorithm for multi-scale tumor (chimeric cell) detection in high resolution slide scans.
Our approach modifies the effective receptive field at different layers in a CNN so that objects with a broad range of varying scales can be detected in a single forward pass.
arXiv Detail & Related papers (2020-10-01T18:56:46Z) - A Deep Convolutional Neural Network for the Detection of Polyps in
Colonoscopy Images [12.618653234201089]
We propose a deep convolutional neural network based model for the computerized detection of polyps within colonoscopy images.
Data augmentation techniques such as photometric and geometric distortions are adapted to overcome the obstacles faced in polyp detection.
arXiv Detail & Related papers (2020-08-15T13:55:44Z) - MuCAN: Multi-Correspondence Aggregation Network for Video
Super-Resolution [63.02785017714131]
Video super-resolution (VSR) aims to utilize multiple low-resolution frames to generate a high-resolution prediction for each frame.
Inter- and intra-frames are the key sources for exploiting temporal and spatial information.
We build an effective multi-correspondence aggregation network (MuCAN) for VSR.
arXiv Detail & Related papers (2020-07-23T05:41:27Z) - iffDetector: Inference-aware Feature Filtering for Object Detection [70.8678270164057]
We introduce a generic Inference-aware Feature Filtering (IFF) module that can easily be combined with modern detectors.
IFF performs closed-loop optimization by leveraging high-level semantics to enhance the convolutional features.
IFF can be fused with CNN-based object detectors in a plug-and-play manner with negligible computational cost overhead.
arXiv Detail & Related papers (2020-06-23T02:57:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.