Detection of masses and architectural distortions in digital breast
tomosynthesis: a publicly available dataset of 5,060 patients and a deep
learning model
- URL: http://arxiv.org/abs/2011.07995v3
- Date: Fri, 1 Jan 2021 21:28:21 GMT
- Title: Detection of masses and architectural distortions in digital breast
tomosynthesis: a publicly available dataset of 5,060 patients and a deep
learning model
- Authors: Mateusz Buda, Ashirbani Saha, Ruth Walsh, Sujata Ghate, Nianyi Li,
Albert \'Swi\k{e}cicki, Joseph Y. Lo, Maciej A. Mazurowski
- Abstract summary: We have curated and made publicly available a large-scale dataset of digital breast tomosynthesis images.
It contains 22,032 reconstructed volumes belonging to 5,610 studies from 5,060 patients.
We developed a single-phase deep learning detection model and tested it using our dataset to serve as a baseline for future research.
- Score: 4.3359550072619255
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Breast cancer screening is one of the most common radiological tasks with
over 39 million exams performed each year. While breast cancer screening has
been one of the most studied medical imaging applications of artificial
intelligence, the development and evaluation of the algorithms are hindered due
to the lack of well-annotated large-scale publicly available datasets. This is
particularly an issue for digital breast tomosynthesis (DBT) which is a
relatively new breast cancer screening modality. We have curated and made
publicly available a large-scale dataset of digital breast tomosynthesis
images. It contains 22,032 reconstructed DBT volumes belonging to 5,610 studies
from 5,060 patients. This included four groups: (1) 5,129 normal studies, (2)
280 studies where additional imaging was needed but no biopsy was performed,
(3) 112 benign biopsied studies, and (4) 89 studies with cancer. Our dataset
included masses and architectural distortions which were annotated by two
experienced radiologists. Additionally, we developed a single-phase deep
learning detection model and tested it using our dataset to serve as a baseline
for future research. Our model reached a sensitivity of 65% at 2 false
positives per breast. Our large, diverse, and highly-curated dataset will
facilitate development and evaluation of AI algorithms for breast cancer
screening through providing data for training as well as common set of cases
for model validation. The performance of the model developed in our study shows
that the task remains challenging and will serve as a baseline for future model
development.
Related papers
- Deep Learning Predicts Mammographic Breast Density in Clinical Breast Ultrasound Images [0.0]
mammographic breast density is one of the strongest risk factors for breast cancer.
Breast ultrasound (BUS) is an alternative breast cancer screening modality.
The purpose of this study was to explore an artificial intelligence (AI) model to predict BI-RADS mammographic breast density from BUS imaging.
arXiv Detail & Related papers (2024-10-31T21:28:20Z) - Improving Breast Cancer Grade Prediction with Multiparametric MRI Created Using Optimized Synthetic Correlated Diffusion Imaging [71.91773485443125]
Grading plays a vital role in breast cancer treatment planning.
The current tumor grading method involves extracting tissue from patients, leading to stress, discomfort, and high medical costs.
This paper examines using optimized CDI$s$ to improve breast cancer grade prediction.
arXiv Detail & Related papers (2024-05-13T15:48:26Z) - Publicly available datasets of breast histopathology H&E whole-slide
images: A scoping review [0.0]
We identified the publicly available datasets of breast H&E stained whole-slide images (WSI) that can be used to develop deep learning algorithms.
This dataset has a considerable selection bias that can impact the robustness and generalizability of the trained algorithms.
arXiv Detail & Related papers (2023-06-02T13:50:30Z) - Breast Cancer Detection and Diagnosis: A comparative study of
state-of-the-arts deep learning architectures [3.883460584034766]
The survival rates for breast cancer patients in certain third-world countries, like South Africa, are alarmingly low.
Medical specialists and researchers have turned to domain-specific AI approaches, specifically deep learning models, to develop end-to-end solutions.
This research focuses on evaluating the performance of various cutting-edge convolutional neural network (CNN) architectures in comparison to a relatively new model called the Vision Trans-former (ViT)
arXiv Detail & Related papers (2023-05-31T15:21:34Z) - Cancer-Net BCa-S: Breast Cancer Grade Prediction using Volumetric Deep
Radiomic Features from Synthetic Correlated Diffusion Imaging [82.74877848011798]
The prevalence of breast cancer continues to grow, affecting about 300,000 females in the United States in 2023.
The gold-standard Scarff-Bloom-Richardson (SBR) grade has been shown to consistently indicate a patient's response to chemotherapy.
In this paper, we study the efficacy of deep learning for breast cancer grading based on synthetic correlated diffusion (CDI$s$) imaging.
arXiv Detail & Related papers (2023-04-12T15:08:34Z) - A Multi-Institutional Open-Source Benchmark Dataset for Breast Cancer
Clinical Decision Support using Synthetic Correlated Diffusion Imaging Data [82.74877848011798]
Cancer-Net BCa is a multi-institutional open-source benchmark dataset of volumetric CDI$s$ imaging data of breast cancer patients.
Cancer-Net BCa is publicly available as a part of a global open-source initiative dedicated to accelerating advancement in machine learning to aid clinicians in the fight against cancer.
arXiv Detail & Related papers (2023-04-12T05:41:44Z) - High-resolution synthesis of high-density breast mammograms: Application
to improved fairness in deep learning based mass detection [48.88813637974911]
Computer-aided detection systems based on deep learning have shown good performance in breast cancer detection.
High-density breasts show poorer detection performance since dense tissues can mask or even simulate masses.
This study aims to improve the mass detection performance in high-density breasts using synthetic high-density full-field digital mammograms.
arXiv Detail & Related papers (2022-09-20T15:57:12Z) - Federated Learning Enables Big Data for Rare Cancer Boundary Detection [98.5549882883963]
We present findings from the largest Federated ML study to-date, involving data from 71 healthcare institutions across 6 continents.
We generate an automatic tumor boundary detector for the rare disease of glioblastoma.
We demonstrate a 33% improvement over a publicly trained model to delineate the surgically targetable tumor, and 23% improvement over the tumor's entire extent.
arXiv Detail & Related papers (2022-04-22T17:27:00Z) - The EMory BrEast imaging Dataset (EMBED): A Racially Diverse, Granular
Dataset of 3.5M Screening and Diagnostic Mammograms [2.243792799100692]
The EMory BrEast imaging dataset contains 3650,000 2D and diagnostic mammograms for 116,000 women divided equally between White and African American patients.
Our goal is to share this dataset with research partners to aid in development and validation of breast AI models that will serve all patients fairly and help decrease bias in medical AI.
arXiv Detail & Related papers (2022-02-08T14:40:59Z) - EMT-NET: Efficient multitask network for computer-aided diagnosis of
breast cancer [58.720142291102135]
We propose an efficient and light-weighted learning architecture to classify and segment breast tumors simultaneously.
We incorporate a segmentation task into a tumor classification network, which makes the backbone network learn representations focused on tumor regions.
The accuracy, sensitivity, and specificity of tumor classification is 88.6%, 94.1%, and 85.3%, respectively.
arXiv Detail & Related papers (2022-01-13T05:24:40Z) - Synthesizing lesions using contextual GANs improves breast cancer
classification on mammograms [0.4297070083645048]
We present a novel generative adversarial network (GAN) model for data augmentation that can realistically synthesize and remove lesions on mammograms.
With self-attention and semi-supervised learning components, the U-net-based architecture can generate high resolution (256x256px) outputs.
arXiv Detail & Related papers (2020-05-29T21:23:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.