GastroVision: A Multi-class Endoscopy Image Dataset for Computer Aided
Gastrointestinal Disease Detection
- URL: http://arxiv.org/abs/2307.08140v2
- Date: Thu, 17 Aug 2023 18:21:30 GMT
- Title: GastroVision: A Multi-class Endoscopy Image Dataset for Computer Aided
Gastrointestinal Disease Detection
- Authors: Debesh Jha, Vanshali Sharma, Neethi Dasu, Nikhil Kumar Tomar, Steven
Hicks, M.K. Bhuyan, Pradip K. Das, Michael A. Riegler, Pål Halvorsen,
Ulas Bagci, Thomas de Lange
- Abstract summary: This dataset includes different anatomical landmarks, pathological abnormalities, polyp removal cases and normal findings from the GI tract.
It was annotated and verified by experienced GI endoscopists.
We believe our dataset can facilitate the development of AI-based algorithms for GI disease detection and classification.
- Score: 6.231109933741383
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Integrating real-time artificial intelligence (AI) systems in clinical
practices faces challenges such as scalability and acceptance. These challenges
include data availability, biased outcomes, data quality, lack of transparency,
and underperformance on unseen datasets from different distributions. The
scarcity of large-scale, precisely labeled, and diverse datasets is the major
challenge for clinical integration. This scarcity is also due to the legal
restrictions and extensive manual efforts required for accurate annotations
from clinicians. To address these challenges, we present GastroVision,
a multi-center open-access gastrointestinal (GI) endoscopy dataset that
includes different anatomical landmarks, pathological abnormalities, polyp
removal cases and normal findings (a total of 27 classes) from the GI tract.
The dataset comprises 8,000 images acquired from Bærum Hospital in Norway
and Karolinska University Hospital in Sweden and was annotated and verified by
experienced GI endoscopists. Furthermore, we validate the significance of our
dataset with extensive benchmarking based on popular deep learning
baseline models. We believe our dataset can facilitate the development of
AI-based algorithms for GI disease detection and classification. Our dataset is
available at https://osf.io/84e7f/.
Related papers
- SMILE-UHURA Challenge -- Small Vessel Segmentation at Mesoscopic Scale from Ultra-High Resolution 7T Magnetic Resonance Angiograms [60.35639972035727]
The lack of publicly available annotated datasets has impeded the development of robust, machine learning-driven segmentation algorithms.
The SMILE-UHURA challenge addresses the gap in publicly available annotated datasets by providing an annotated dataset of Time-of-Flight angiography acquired with 7T MRI.
Dice scores reached up to 0.838 ± 0.066 and 0.716 ± 0.125 on the respective datasets, with an average performance of up to 0.804 ± 0.15.
arXiv Detail & Related papers (2024-11-14T17:06:00Z)
- FedCVD: The First Real-World Federated Learning Benchmark on Cardiovascular Disease Data [52.55123685248105]
Cardiovascular diseases (CVDs) are currently the leading cause of death worldwide, highlighting the critical need for early diagnosis and treatment.
Machine learning (ML) methods can help diagnose CVDs early, but their performance relies on access to substantial data with high quality.
This paper presents the first real-world FL benchmark for cardiovascular disease detection, named FedCVD.
arXiv Detail & Related papers (2024-10-28T02:24:01Z)
- Detecting Unforeseen Data Properties with Diffusion Autoencoder Embeddings using Spine MRI data [7.757515290013924]
Deep learning has made significant strides in medical imaging, leveraging the use of large datasets to improve diagnostics and prognostics.
Large datasets often come with inherent errors through subject selection and acquisition.
We investigate the use of Diffusion Autoencoder embeddings for uncovering and understanding data characteristics and biases.
arXiv Detail & Related papers (2024-10-14T07:24:26Z)
- Multi-OCT-SelfNet: Integrating Self-Supervised Learning with Multi-Source Data Fusion for Enhanced Multi-Class Retinal Disease Classification [2.5091334993691206]
Development of a robust deep-learning model for retinal disease diagnosis requires a substantial dataset for training.
The capacity to generalize effectively on smaller datasets remains a persistent challenge.
We've combined a wide range of data sources to improve performance and generalization to new data.
arXiv Detail & Related papers (2024-09-17T17:22:35Z)
- ISLES 2024: The first longitudinal multimodal multi-center real-world dataset in (sub-)acute stroke [2.7919032539697444]
Stroke remains a leading cause of global morbidity and mortality, placing a heavy socioeconomic burden.
The aim is to develop machine learning algorithms that can extract meaningful and reproducible models of brain function from stroke images.
Our dataset is the first to offer comprehensive longitudinal stroke data, including acute CT imaging with angiography and perfusion, follow-up MRI at 2-9 days, and acute and longitudinal clinical data up to a three-month outcome.
arXiv Detail & Related papers (2024-08-20T18:59:52Z)
- A Lung Nodule Dataset with Histopathology-based Cancer Type Annotation [12.617587827105496]
This research aims to bridge the gap by providing publicly accessible datasets and reliable tools for medical diagnosis.
We curated a diverse dataset of lung Computed Tomography (CT) images, comprising 330 annotated nodules (nodules are labeled as bounding boxes) from 95 distinct patients.
These promising results demonstrate that the dataset has a feasible application and further facilitate intelligent auxiliary diagnosis.
arXiv Detail & Related papers (2024-06-26T06:39:11Z)
- Eye-gaze Guided Multi-modal Alignment for Medical Representation Learning [65.54680361074882]
Eye-gaze Guided Multi-modal Alignment (EGMA) framework harnesses eye-gaze data for better alignment of medical visual and textual features.
We conduct downstream tasks of image classification and image-text retrieval on four medical datasets.
arXiv Detail & Related papers (2024-03-19T03:59:14Z)
- Real-World Multi-Domain Data Applications for Generalizations to Clinical Settings [1.508558791031741]
Deep learning models perform well when trained on standardized datasets from artificial settings, such as clinical trials.
We show that by employing a self-supervised approach with transfer learning on a multi-domain real-world dataset, we can achieve 16% relative improvement on a standardized dataset.
arXiv Detail & Related papers (2020-07-24T17:41:23Z)
- Trajectories, bifurcations and pseudotime in large clinical datasets: applications to myocardial infarction and diabetes data [94.37521840642141]
We suggest a semi-supervised methodology for the analysis of large clinical datasets, characterized by mixed data types and missing values.
The methodology is based on application of elastic principal graphs which can address simultaneously the tasks of dimensionality reduction, data visualization, clustering, feature selection and quantifying the geodesic distances (pseudotime) in partially ordered sequences of observations.
arXiv Detail & Related papers (2020-07-07T21:04:55Z)
- Deep Mining External Imperfect Data for Chest X-ray Disease Screening [57.40329813850719]
We argue that incorporating an external CXR dataset leads to imperfect training data, which raises challenges.
We formulate the multi-label disease classification problem as weighted independent binary tasks according to the categories.
Our framework simultaneously models and tackles the domain and label discrepancies, enabling superior knowledge mining ability.
arXiv Detail & Related papers (2020-06-06T06:48:40Z)
- VerSe: A Vertebrae Labelling and Segmentation Benchmark for Multi-detector CT Images [121.31355003451152]
Large Scale Vertebrae Challenge (VerSe) was organised in conjunction with the International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI) in 2019 and 2020.
We present the results of this evaluation and further investigate the performance-variation at vertebra-level, scan-level, and at different fields-of-view.
arXiv Detail & Related papers (2020-01-24T21:09:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.