ERS: a novel comprehensive endoscopy image dataset for machine learning,
compliant with the MST 3.0 specification
- URL: http://arxiv.org/abs/2201.08746v1
- Date: Fri, 21 Jan 2022 15:39:45 GMT
- Title: ERS: a novel comprehensive endoscopy image dataset for machine learning,
compliant with the MST 3.0 specification
- Authors: Jan Cychnerski, Tomasz Dziubich, Adam Brzeski
- Abstract summary: The article presents a new multi-label comprehensive image dataset from flexible endoscopy, colonoscopy and capsule endoscopy, named ERS.
The dataset contains around 6000 precisely and 115,000 approximately labeled frames from endoscopy videos, precise and 22,600 approximate segmentation masks, and 1.23 million unlabeled frames from flexible and capsule endoscopy videos.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: The article presents a new multi-label comprehensive image dataset from
flexible endoscopy, colonoscopy and capsule endoscopy, named ERS. The
collection has been labeled according to the full medical specification of
'Minimum Standard Terminology 3.0' (MST 3.0), describing all possible findings
in the gastrointestinal tract (104 possible labels), extended with an
additional 19 labels useful in common machine learning applications.
The dataset contains around 6000 precisely and 115,000 approximately labeled
frames from endoscopy videos, 3600 precise and 22,600 approximate segmentation
masks, and 1.23 million unlabeled frames from flexible and capsule endoscopy
videos. The labeled data cover almost entirely the MST 3.0 standard. The data
came from 1520 videos of 1135 patients.
Additionally, this paper proposes and describes four exemplary experiments in
gastrointestinal image classification task performed using the created dataset.
The obtained results indicate the high usefulness and flexibility of the
dataset in training and testing machine learning algorithms in the field of
endoscopic data analysis.
Related papers
- SMILE-UHURA Challenge -- Small Vessel Segmentation at Mesoscopic Scale from Ultra-High Resolution 7T Magnetic Resonance Angiograms [60.35639972035727]
The lack of publicly available annotated datasets has impeded the development of robust, machine learning-driven segmentation algorithms.
The SMILE-UHURA challenge addresses the gap in publicly available annotated datasets by providing an annotated dataset of Time-of-Flight angiography acquired with 7T MRI.
Dice scores reached up to 0.838 $pm$ 0.066 and 0.716 $pm$ 0.125 on the respective datasets, with an average performance of up to 0.804 $pm$ 0.15.
arXiv Detail & Related papers (2024-11-14T17:06:00Z) - Domain-Adaptive Pre-training of Self-Supervised Foundation Models for Medical Image Classification in Gastrointestinal Endoscopy [0.024999074238880488]
Video capsule endoscopy has transformed gastrointestinal endoscopy (GIE) diagnostics by offering a non-invasive method for capturing detailed images of the gastrointestinal tract.
Video capsule endoscopy has transformed gastrointestinal endoscopy (GIE) diagnostics by offering a non-invasive method for capturing detailed images of the gastrointestinal tract.
However, its potential is limited by the sheer volume of images generated during the imaging procedure, which can take anywhere from 6-8 hours and often produce up to 1 million images.
arXiv Detail & Related papers (2024-10-21T22:52:25Z) - Eye-gaze Guided Multi-modal Alignment for Medical Representation Learning [65.54680361074882]
Eye-gaze Guided Multi-modal Alignment (EGMA) framework harnesses eye-gaze data for better alignment of medical visual and textual features.
We conduct downstream tasks of image classification and image-text retrieval on four medical datasets.
arXiv Detail & Related papers (2024-03-19T03:59:14Z) - Towards Unifying Anatomy Segmentation: Automated Generation of a
Full-body CT Dataset via Knowledge Aggregation and Anatomical Guidelines [113.08940153125616]
We generate a dataset of whole-body CT scans with $142$ voxel-level labels for 533 volumes providing comprehensive anatomical coverage.
Our proposed procedure does not rely on manual annotation during the label aggregation stage.
We release our trained unified anatomical segmentation model capable of predicting $142$ anatomical structures on CT data.
arXiv Detail & Related papers (2023-07-25T09:48:13Z) - GastroVision: A Multi-class Endoscopy Image Dataset for Computer Aided
Gastrointestinal Disease Detection [6.231109933741383]
This dataset includes different anatomical landmarks, pathological abnormalities, polyp removal cases and normal findings from the GI tract.
It was annotated and verified by experienced GI endoscopists.
We believe our dataset can facilitate the development of AI-based algorithms for GI disease detection and classification.
arXiv Detail & Related papers (2023-07-16T19:36:03Z) - Self-supervision for medical image classification: state-of-the-art
performance with ~100 labeled training samples per class [0.0]
We analyze the performance of self-supervised DL within the self-distillation with no labels (DINO) framework.
We achieve state-of-the-art classification performance for all three imaging modalities and data sets.
arXiv Detail & Related papers (2023-04-11T11:51:50Z) - Self-Supervised Learning as a Means To Reduce the Need for Labeled Data
in Medical Image Analysis [64.4093648042484]
We use a dataset of chest X-ray images with bounding box labels for 13 different classes of anomalies.
We show that it is possible to achieve similar performance to a fully supervised model in terms of mean average precision and accuracy with only 60% of the labeled data.
arXiv Detail & Related papers (2022-06-01T09:20:30Z) - EndoMapper dataset of complete calibrated endoscopy procedures [8.577980383972005]
This paper introduces the Endomapper dataset, the first collection of complete endoscopy sequences acquired during regular medical practice.
Data will be used to build a 3D mapping and localization systems that can perform special task like, for example, detect blind zones during exploration.
arXiv Detail & Related papers (2022-04-29T17:10:01Z) - WSSS4LUAD: Grand Challenge on Weakly-supervised Tissue Semantic
Segmentation for Lung Adenocarcinoma [51.50991881342181]
This challenge includes 10,091 patch-level annotations and over 130 million labeled pixels.
First place team achieved mIoU of 0.8413 (tumor: 0.8389, stroma: 0.7931, normal: 0.8919)
arXiv Detail & Related papers (2022-04-13T15:27:05Z) - MedMNIST v2: A Large-Scale Lightweight Benchmark for 2D and 3D
Biomedical Image Classification [59.10015984688104]
MedMNIST v2 is a large-scale MNIST-like dataset collection of standardized biomedical images.
The resulting dataset consists of 708,069 2D images and 10,214 3D images in total.
arXiv Detail & Related papers (2021-10-27T22:02:04Z) - Heidelberg Colorectal Data Set for Surgical Data Science in the Sensor
Operating Room [1.6276355161958829]
This paper introduces the Heidelberg Colorectal (HeiCo) data set - the first publicly available data set enabling comprehensive benchmarking of medical instrument detection and segmentation algorithms.
Our data set comprises 30 laparoscopic videos and corresponding sensor data from medical devices in the operating room for three different types of laparoscopic surgery.
arXiv Detail & Related papers (2020-05-07T14:04:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.