Colon Polyps Detection from Colonoscopy Images Using Deep Learning
- URL: http://arxiv.org/abs/2508.13188v1
- Date: Thu, 14 Aug 2025 20:16:23 GMT
- Title: Colon Polyps Detection from Colonoscopy Images Using Deep Learning
- Authors: Md Al Amin, Bikash Kumar Paul,
- Abstract summary: Colon polyps are precursors to colorectal cancer, a leading cause of cancer-related mortality worldwide.<n>This study investigates the application of deep learning-based object detection for early polyp identification using colonoscopy images.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Colon polyps are precursors to colorectal cancer, a leading cause of cancer-related mortality worldwide. Early detection is critical for improving patient outcomes. This study investigates the application of deep learning-based object detection for early polyp identification using colonoscopy images. We utilize the Kvasir-SEG dataset, applying extensive data augmentation and splitting the data into training (80\%), validation (20\% of training), and testing (20\%) sets. Three variants of the YOLOv5 architecture (YOLOv5s, YOLOv5m, YOLOv5l) are evaluated. Experimental results show that YOLOv5l outperforms the other variants, achieving a mean average precision (mAP) of 85.1\%, with the highest average Intersection over Union (IoU) of 0.86. These findings demonstrate that YOLOv5l provides superior detection performance for colon polyp localization, offering a promising tool for enhancing colorectal cancer screening accuracy.
Related papers
- Investigating the Impact of Histopathological Foundation Models on Regressive Prediction of Homologous Recombination Deficiency [52.50039435394964]
We systematically evaluate foundation models for regression-based tasks.<n>We extract patch-level features from whole slide images (WSI) using five state-of-the-art foundation models.<n>Models are trained to predict continuous HRD scores based on these extracted features across breast, endometrial, and lung cancer cohorts.
arXiv Detail & Related papers (2026-01-29T14:06:50Z) - Multi-Modal Oral Cancer Detection Using Weighted Ensemble Convolutional Neural Networks [0.0]
Late diagnosis of Oral Squamous Cell Carcinoma contributes significantly to its high global mortality rate.<n>This study aims to improve early detection of OSCC by developing a multimodal deep learning framework.<n>The ensemble model demonstrated improved diagnostic robustness with an overall accuracy of 84.58% on a multimodal validation dataset of 55 samples.
arXiv Detail & Related papers (2025-10-04T17:06:46Z) - YOLO-LAN: Precise Polyp Detection via Optimized Loss, Augmentations and Negatives [0.639901418243611]
YOLO-LAN is a YOLO-based polyp detection pipeline trained using M2IoU loss, versatile data augmentations and negative data.<n>We show robustness based on polyp size and precise location detection, making it clinically relevant in AI-assisted colorectal screening.
arXiv Detail & Related papers (2025-09-23T15:41:44Z) - PolypSeg-GradCAM: Towards Explainable Computer-Aided Gastrointestinal Disease Detection Using U-Net Based Segmentation and Grad-CAM Visualization on the Kvasir Dataset [7.02937797539818]
Colorectal cancer (CRC) remains one of the leading causes of cancer-related morbidity and mortality worldwide.<n>Deep learning methods have demonstrated strong potential for automated polyp analysis, but their limited interpretability remains a barrier to clinical adoption.<n>We present PolypSeg-GradCAM, a framework that integrates the U-Net architecture with Gradient-weighted Class Activation Mapping (Grad-CAM) for transparent polyp segmentation.
arXiv Detail & Related papers (2025-09-17T02:57:33Z) - A Disease-Centric Vision-Language Foundation Model for Precision Oncology in Kidney Cancer [54.58205672910646]
RenalCLIP is a visual-language foundation model for characterization, diagnosis and prognosis of renal mass.<n>It achieved better performance and superior generalizability across 10 core tasks spanning the full clinical workflow of kidney cancer.
arXiv Detail & Related papers (2025-08-22T17:48:19Z) - A Lightweight and Robust Framework for Real-Time Colorectal Polyp Detection Using LOF-Based Preprocessing and YOLO-v11n [0.3495246564946556]
This study introduces a new, lightweight, and efficient framework for polyp detection.<n>It combines the Local Outlier Factor algorithm for filtering noisy data with the YOLO-v11n deep learning model.<n>Compared to previous YOLO-based methods, our model demonstrates enhanced accuracy and efficiency.
arXiv Detail & Related papers (2025-07-14T23:36:54Z) - Semi-Supervised Learning for Dose Prediction in Targeted Radionuclide: A Synthetic Data Study [44.18078370879369]
Targeted Radionuclide Therapy (TRT) is a modern strategy in radiation oncology that aims to administer a potent radiation dose specifically to cancer cells.<n>Deep learning holds promise for personalizing TRT doses.<n>Current methods require large time series of SPECT imaging, which is hardly achievable in routine clinical practice.
arXiv Detail & Related papers (2025-03-07T12:21:09Z) - Consisaug: A Consistency-based Augmentation for Polyp Detection in Endoscopy Image Analysis [3.716941460306804]
We introduce Consisaug, an innovative and effective methodology to augment data that leverages deep learning.
We implement our Consisaug on five public polyp datasets and at three backbones, and the results show the effectiveness of our method.
arXiv Detail & Related papers (2024-04-17T13:09:44Z) - DG-TTA: Out-of-domain Medical Image Segmentation through Augmentation and Descriptor-driven Domain Generalization and Test-Time Adaptation [43.842694540544194]
Applying pretrained medical deep learning segmentation models on out-of-domain images often yields predictions of insufficient quality.<n>In this study, we propose to use a powerful generalizing descriptor along with augmentation to enable domain-generalized pretraining and test-time adaptation.
arXiv Detail & Related papers (2023-12-11T10:26:21Z) - Osteoporosis Prescreening using Panoramic Radiographs through a Deep
Convolutional Neural Network with Attention Mechanism [65.70943212672023]
Deep convolutional neural network (CNN) with an attention module can detect osteoporosis on panoramic radiographs.
dataset of 70 panoramic radiographs (PRs) from 70 different subjects of age between 49 to 60 was used.
arXiv Detail & Related papers (2021-10-19T00:03:57Z) - Self-Supervised U-Net for Segmenting Flat and Sessile Polyps [63.62764375279861]
Development of colorectal polyps is one of the earliest signs of cancer.
Early detection and resection of polyps can greatly increase survival rate to 90%.
Computer-Aided Diagnosis systems(CADx) has been proposed that detect polyps by processing the colonoscopic videos.
arXiv Detail & Related papers (2021-10-17T09:31:20Z) - Bootstrapping Your Own Positive Sample: Contrastive Learning With
Electronic Health Record Data [62.29031007761901]
This paper proposes a novel contrastive regularized clinical classification model.
We introduce two unique positive sampling strategies specifically tailored for EHR data.
Our framework yields highly competitive experimental results in predicting the mortality risk on real-world COVID-19 EHR data.
arXiv Detail & Related papers (2021-04-07T06:02:04Z) - Automatic Polyp Segmentation using Fully Convolutional Neural Network [0.0]
The miss-rate of colorectal polyps during colonoscopy is between 6 to 27%.
The use of an automated, accurate, and real-time polyp segmentation during colonoscopy examinations can help the clinicians to eliminate missing lesions and prevent further progression of colorectal cancer.
The Medico automatic polyp segmentation challenge'' provides an opportunity to study polyp segmentation and build a fast segmentation model.
The experiments demonstrate that the model trained on the Kvasir-SEG dataset and tested on an unseen dataset achieves a dice coefficient of 0.7801, mIoU of 0.6847, recall of 0.8077, and precision of 0.8
arXiv Detail & Related papers (2021-01-11T16:20:57Z) - CovidDeep: SARS-CoV-2/COVID-19 Test Based on Wearable Medical Sensors
and Efficient Neural Networks [51.589769497681175]
The novel coronavirus (SARS-CoV-2) has led to a pandemic.
The current testing regime based on Reverse Transcription-Polymerase Chain Reaction for SARS-CoV-2 has been unable to keep up with testing demands.
We propose a framework called CovidDeep that combines efficient DNNs with commercially available WMSs for pervasive testing of the virus.
arXiv Detail & Related papers (2020-07-20T21:47:28Z) - A Comprehensive Study of Data Augmentation Strategies for Prostate
Cancer Detection in Diffusion-weighted MRI using Convolutional Neural
Networks [9.554833667156913]
We have applied five most frequently used augmentation techniques (random rotation, horizontal flip, vertical flip, random crop, and translation) to a training dataset of 217 patients.
We evaluated the effect of each method on the accuracy of prostate cancer detection.
The shallow network outperformed the deep network with the best 2D slice-based AUC of 0.85 obtained by the rotation method.
arXiv Detail & Related papers (2020-06-01T14:31:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.