Automated Bleeding Detection and Classification in Wireless Capsule   Endoscopy with YOLOv8-X
        - URL: http://arxiv.org/abs/2412.16624v1
- Date: Sat, 21 Dec 2024 13:37:11 GMT
- Title: Automated Bleeding Detection and Classification in Wireless Capsule   Endoscopy with YOLOv8-X
- Authors: Pavan C Shekar, Vivek Kanhangad, Shishir Maheshwari, T Sunil Kumar, 
- Abstract summary: This paper presents our solution to the Auto-WCEBleedGen Version V1 Challenge.<n>We developed a unified YOLOv8-X model for both detection and classification of bleeding regions.<n>Our approach achieved 96.10% classification accuracy and 76.8% mean Average Precision (mAP) at 0.5 IoU on the val idation dataset.
- Score: 2.6374023322018916
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract:   Gastrointestinal (GI) bleeding, a critical indicator of digestive system disorders, re quires efficient and accurate detection methods. This paper presents our solution to the Auto-WCEBleedGen Version V1 Challenge, where we achieved the consolation position. We developed a unified YOLOv8-X model for both detection and classification of bleeding regions in Wireless Capsule Endoscopy (WCE) images. Our approach achieved 96.10% classification accuracy and 76.8% mean Average Precision (mAP) at 0.5 IoU on the val idation dataset. Through careful dataset curation and annotation, we assembled and trained on 6,345 diverse images to ensure robust model performance. Our implementa tion code and trained models are publicly available at https://github.com/pavan98765/Auto-WCEBleedGen. 
 
      
        Related papers
        - HistoART: Histopathology Artifact Detection and Reporting Tool [37.31105955164019]
 Whole Slide Imaging (WSI) is widely used to digitize tissue specimens for detailed, high-resolution examination.<n>WSI remains vulnerable to artifacts introduced during slide preparation and scanning.<n>We propose and compare three robust artifact detection approaches for WSIs.
 arXiv  Detail & Related papers  (2025-06-23T17:22:19Z)
- Exploring Few-Shot Object Detection on Blood Smear Images: A Case Study   of Leukocytes and Schistocytes [37.440449828136586]
 Investigation focuses on a novel approach termed DE-ViT.<n>This methodology is employed in a Few-Shot paradigm, wherein training relies on a limited number of images.<n>While DE-ViT has demonstrated state-of-the-art performance on the COCO and LVIS datasets, both baseline models surpassed its performance on the Raabin-WBC dataset.
 arXiv  Detail & Related papers  (2025-03-21T12:46:49Z)
- Transformer-Based Wireless Capsule Endoscopy Bleeding Tissue Detection   and Classification [0.562479170374811]
 We design an end-to-end trainable model for the automatic detection and classification of bleeding and non-bleeding frames.
Based on the DETR model, our model uses the Resnet50 for feature extraction, the transformer encoder-decoder for bleeding and non-bleeding region detection, and a feedforward neural network for classification.
Trained in an end-to-end approach on the Auto-WCEBleedGen Version 1 challenge training set, our model performs both detection and classification tasks as a single unit.
 arXiv  Detail & Related papers  (2024-12-26T13:49:39Z)
- ColonNet: A Hybrid Of DenseNet121 And U-NET Model For Detection And   Segmentation Of GI Bleeding [1.2499537119440245]
 This study presents an integrated deep learning model for automatic detection and classification of Gastrointestinal bleeding in the frames extracted from Wireless Capsule Endoscopy (WCE) videos.<n>The dataset has been released as part of Auto-WCBleedGen Challenge Version V2 hosted by the MISAHUB team.
 arXiv  Detail & Related papers  (2024-12-06T17:48:06Z)
- Capsule Endoscopy Multi-classification via Gated Attention and Wavelet   Transformations [1.5146068448101746]
 Abnormalities in the gastrointestinal tract significantly influence the patient's health and require a timely diagnosis.<n>The work presents the process of developing and evaluating a novel model designed to classify gastrointestinal anomalies from a video frame.<n> integration of Omni Dimensional Gated Attention (OGA) mechanism and Wavelet transformation techniques into the model's architecture allowed the model to focus on the most critical areas.<n>The model's performance is benchmarked against two base models, VGG16 and ResNet50, demonstrating its enhanced ability to identify and classify a range of gastrointestinal abnormalities accurately.
 arXiv  Detail & Related papers  (2024-10-25T08:01:35Z)
- Integrating Deep Feature Extraction and Hybrid ResNet-DenseNet Model for   Multi-Class Abnormality Detection in Endoscopic Images [0.9374652839580183]
 The aim is to automate the identification of ten GI abnormality classes, including angioectasia, bleeding, and ulcers.
The proposed model achieves an overall accuracy of 94% across a well-structured dataset.
 arXiv  Detail & Related papers  (2024-10-24T06:10:31Z)
- Enhancing Diagnostic Reliability of Foundation Model with Uncertainty   Estimation in OCT Images [41.002573031087856]
 We developed a foundation model with uncertainty estimation (FMUE) to detect 11 retinal conditions on optical coherence tomography ( OCT)
FMUE achieved a higher F1 score of 96.76% than two state-of-the-art algorithms, RETFound and UIOS, and got further improvement with thresholding strategy to 98.44%.
Our model is superior to two ophthalmologists with a higher F1 score (95.17% vs. 61.93% &71.72%)
 arXiv  Detail & Related papers  (2024-06-18T03:04:52Z)
- A Robust Pipeline for Classification and Detection of Bleeding Frames in   Wireless Capsule Endoscopy using Swin Transformer and RT-DETR [1.7499351967216343]
 Solution combines the Swin Transformer for the initial classification of bleeding frames and RT-DETR for further detection of bleeding.
On the validation set, this approach achieves a classification accuracy of 98.5% compared to 91.7% without any pre-processing.
On the test set, this approach achieves a classification accuracy and F1 score of 87.0% and 89.0% respectively.
 arXiv  Detail & Related papers  (2024-06-12T09:58:42Z)
- Uncertainty-inspired Open Set Learning for Retinal Anomaly
  Identification [71.06194656633447]
 We establish an uncertainty-inspired open-set (UIOS) model, which was trained with fundus images of 9 retinal conditions.
Our UIOS model with thresholding strategy achieved an F1 score of 99.55%, 97.01% and 91.91% for the internal testing set.
UIOS correctly predicted high uncertainty scores, which would prompt the need for a manual check in the datasets of non-target categories retinal diseases, low-quality fundus images, and non-fundus images.
 arXiv  Detail & Related papers  (2023-04-08T10:47:41Z)
- Learning to diagnose cirrhosis from radiological and histological labels
  with joint self and weakly-supervised pretraining strategies [62.840338941861134]
 We propose to leverage transfer learning from large datasets annotated by radiologists, to predict the histological score available on a small annex dataset.
We compare different pretraining methods, namely weakly-supervised and self-supervised ones, to improve the prediction of the cirrhosis.
This method outperforms the baseline classification of the METAVIR score, reaching an AUC of 0.84 and a balanced accuracy of 0.75.
 arXiv  Detail & Related papers  (2023-02-16T17:06:23Z)
- Corneal endothelium assessment in specular microscopy images with Fuchs'
  dystrophy via deep regression of signed distance maps [48.498376125522114]
 This paper proposes a UNet-based segmentation approach that requires minimal post-processing.
It achieves reliable CE morphometric assessment and guttae identification across all degrees of Fuchs' dystrophy.
 arXiv  Detail & Related papers  (2022-10-13T15:34:20Z)
- CIRCA: comprehensible online system in support of chest X-rays-based
  COVID-19 diagnosis [37.41181188499616]
 Deep learning techniques can help in the faster detection of COVID-19 cases and monitoring of disease progression.
Five different datasets were used to construct a representative dataset of 23 799 CXRs for model training.
A U-Net-based model was developed to identify a clinically relevant region of the CXR.
 arXiv  Detail & Related papers  (2022-10-11T13:30:34Z)
- Self-supervised contrastive learning of echocardiogram videos enables
  label-efficient cardiac disease diagnosis [48.64462717254158]
 We developed a self-supervised contrastive learning approach, EchoCLR, to catered to echocardiogram videos.
When fine-tuned on small portions of labeled data, EchoCLR pretraining significantly improved classification performance for left ventricular hypertrophy (LVH) and aortic stenosis (AS)
 EchoCLR is unique in its ability to learn representations of medical videos and demonstrates that SSL can enable label-efficient disease classification from small, labeled datasets.
 arXiv  Detail & Related papers  (2022-07-23T19:17:26Z)
- Many-to-One Distribution Learning and K-Nearest Neighbor Smoothing for
  Thoracic Disease Identification [83.6017225363714]
 deep learning has become the most powerful computer-aided diagnosis technology for improving disease identification performance.
For chest X-ray imaging, annotating large-scale data requires professional domain knowledge and is time-consuming.
In this paper, we propose many-to-one distribution learning (MODL) and K-nearest neighbor smoothing (KNNS) methods to improve a single model's disease identification performance.
 arXiv  Detail & Related papers  (2021-02-26T02:29:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.