OPTIMAM Mammography Image Database: a large scale resource of
mammography images and clinical data
- URL: http://arxiv.org/abs/2004.04742v1
- Date: Thu, 9 Apr 2020 17:12:13 GMT
- Title: OPTIMAM Mammography Image Database: a large scale resource of
mammography images and clinical data
- Authors: Mark D Halling-Brown, Lucy M Warren, Dominic Ward, Emma Lewis,
Alistair Mackenzie, Matthew G Wallis, Louise Wilkinson, Rosalind M
Given-Wilson, Rita McAvinchey and Kenneth C Young
- Abstract summary: A major barrier to medical imaging research is a lack of large databases of medical images which share images with other researchers.
The OPTIMAM image database (OMI-DB) has been developed to overcome these barriers.
The database contains over 2.5 million images from 173,319 women collected from three UK breast screening centres.
- Score: 0.2600410195810869
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: A major barrier to medical imaging research and in particular the development
of artificial intelligence (AI) is a lack of large databases of medical images
which share images with other researchers. Without such databases it is not
possible to train generalisable AI algorithms, and large amounts of time and
funding is spent collecting smaller datasets at individual research centres.
The OPTIMAM image database (OMI-DB) has been developed to overcome these
barriers. OMI-DB consists of several relational databases and cloud storage
systems, containing mammography images and associated clinical and pathological
information. The database contains over 2.5 million images from 173,319 women
collected from three UK breast screening centres. This includes 154,832 women
with normal breasts, 6909 women with benign findings, 9690 women with
screen-detected cancers and 1888 women with interval cancers. Collection is
on-going and all women are followed-up and their clinical status updated
according to subsequent screening episodes. The availability of prior screening
mammograms and interval cancers is a vital resource for AI development. Data
from OMI-DB has been shared with over 30 research groups and companies, since
2014. This progressive approach has been possible through sharing agreements
between the funder and approved academic and commercial research groups. A
research dataset such as the OMI-DB provides a powerful resource for research.
Related papers
- Deep Learning Predicts Mammographic Breast Density in Clinical Breast Ultrasound Images [0.0]
Mammographic breast density, as defined by the American College of Radiology's Breast Imaging Reporting and Data System (BI-RADS), is one of the strongest risk factors for breast cancer.
The purpose of this study was to explore an artificial intelligence (AI) model to predict BI-RADS mammographic breast density category from clinical, handheld BUS imaging.
arXiv Detail & Related papers (2024-10-31T21:28:20Z) - RadGenome-Chest CT: A Grounded Vision-Language Dataset for Chest CT Analysis [56.57177181778517]
RadGenome-Chest CT is a large-scale, region-guided 3D chest CT interpretation dataset based on CT-RATE.
We leverage the latest powerful universal segmentation and large language models to extend the original datasets.
arXiv Detail & Related papers (2024-04-25T17:11:37Z) - M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models [49.5030774873328]
Previous research has primarily focused on 2D medical images, leaving 3D images under-explored, despite their richer spatial information.
We present a large-scale 3D multi-modal medical dataset, M3D-Data, comprising 120K image-text pairs and 662K instruction-response pairs.
We also introduce a new 3D multi-modal medical benchmark, M3D-Bench, which facilitates automatic evaluation across eight tasks.
arXiv Detail & Related papers (2024-03-31T06:55:12Z) - Breast Ultrasound Report Generation using LangChain [58.07183284468881]
We propose the integration of multiple image analysis tools through a LangChain using Large Language Models (LLM) into the breast reporting process.
Our method can accurately extract relevant features from ultrasound images, interpret them in a clinical context, and produce comprehensive and standardized reports.
arXiv Detail & Related papers (2023-12-05T00:28:26Z) - BMAD: Benchmarks for Medical Anomaly Detection [51.22159321912891]
Anomaly detection (AD) is a fundamental research problem in machine learning and computer vision.
In medical imaging, AD is especially vital for detecting and diagnosing anomalies that may indicate rare diseases or conditions.
We introduce a comprehensive evaluation benchmark for assessing anomaly detection methods on medical images.
arXiv Detail & Related papers (2023-06-20T20:23:46Z) - A Multi-Institutional Open-Source Benchmark Dataset for Breast Cancer
Clinical Decision Support using Synthetic Correlated Diffusion Imaging Data [82.74877848011798]
Cancer-Net BCa is a multi-institutional open-source benchmark dataset of volumetric CDI$s$ imaging data of breast cancer patients.
Cancer-Net BCa is publicly available as a part of a global open-source initiative dedicated to accelerating advancement in machine learning to aid clinicians in the fight against cancer.
arXiv Detail & Related papers (2023-04-12T05:41:44Z) - Federated Learning Enables Big Data for Rare Cancer Boundary Detection [98.5549882883963]
We present findings from the largest Federated ML study to-date, involving data from 71 healthcare institutions across 6 continents.
We generate an automatic tumor boundary detector for the rare disease of glioblastoma.
We demonstrate a 33% improvement over a publicly trained model to delineate the surgically targetable tumor, and 23% improvement over the tumor's entire extent.
arXiv Detail & Related papers (2022-04-22T17:27:00Z) - VinDr-Mammo: A large-scale benchmark dataset for computer-aided
diagnosis in full-field digital mammography [0.5452925161262461]
VinDr-Mammo is a new benchmark dataset of full-field digital mammography (FFDM)
The dataset consists of 5,000 mammography exams, each of which has four standard views and is double read with disagreement.
It is created for the assessment of Breast Imaging Reporting and Data System (BI-RADS) and density at the breast level.
arXiv Detail & Related papers (2022-03-20T18:17:42Z) - The EMory BrEast imaging Dataset (EMBED): A Racially Diverse, Granular
Dataset of 3.5M Screening and Diagnostic Mammograms [2.243792799100692]
The EMory BrEast imaging dataset contains 3650,000 2D and diagnostic mammograms for 116,000 women divided equally between White and African American patients.
Our goal is to share this dataset with research partners to aid in development and validation of breast AI models that will serve all patients fairly and help decrease bias in medical AI.
arXiv Detail & Related papers (2022-02-08T14:40:59Z) - Artificial Intelligence For Breast Cancer Detection: Trends & Directions [0.0]
This article analyzes different imaging modalities that have been exploited by researchers to automate the task of breast cancer detection.
This article then summarizes AI and computer vision based state-of-the-art methods proposed in the last decade, to detect breast cancer.
arXiv Detail & Related papers (2021-10-03T07:22:21Z) - Detection of masses and architectural distortions in digital breast
tomosynthesis: a publicly available dataset of 5,060 patients and a deep
learning model [4.3359550072619255]
We have curated and made publicly available a large-scale dataset of digital breast tomosynthesis images.
It contains 22,032 reconstructed volumes belonging to 5,610 studies from 5,060 patients.
We developed a single-phase deep learning detection model and tested it using our dataset to serve as a baseline for future research.
arXiv Detail & Related papers (2020-11-13T18:33:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.