BRACS: A Dataset for BReAst Carcinoma Subtyping in H&E Histology Images
- URL: http://arxiv.org/abs/2111.04740v1
- Date: Mon, 8 Nov 2021 15:04:16 GMT
- Title: BRACS: A Dataset for BReAst Carcinoma Subtyping in H&E Histology Images
- Authors: Nadia Brancati, Anna Maria Anniciello, Pushpak Pati, Daniel Riccio,
Giosu\`e Scognamiglio, Guillaume Jaume, Giuseppe De Pietro, Maurizio Di
Bonito, Antonio Foncubierta, Gerardo Botti, Maria Gabrani, Florinda Feroce,
and Maria Frucci
- Abstract summary: We introduce the BReAst Carcinoma Subtyping dataset, a large cohort of annotated Hematoxylin & Eosin (H&E)-stained images to facilitate the characterization of breast lesions.
BRACS contains 547 Whole-Slide Images (WSIs), and 4539 Regions of Interest (ROIs) extracted from the WSIs.
- Score: 4.974822167947921
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Breast cancer is the most commonly diagnosed cancer and registers the highest
number of deaths for women with cancer. Recent advancements in diagnostic
activities combined with large-scale screening policies have significantly
lowered the mortality rates for breast cancer patients. However, the manual
inspection of tissue slides by the pathologists is cumbersome, time-consuming,
and is subject to significant inter- and intra-observer variability. Recently,
the advent of whole-slide scanning systems have empowered the rapid
digitization of pathology slides, and enabled to develop digital workflows.
These advances further enable to leverage Artificial Intelligence (AI) to
assist, automate, and augment pathological diagnosis. But the AI techniques,
especially Deep Learning (DL), require a large amount of high-quality annotated
data to learn from. Constructing such task-specific datasets poses several
challenges, such as, data-acquisition level constrains, time-consuming and
expensive annotations, and anonymization of private information. In this paper,
we introduce the BReAst Carcinoma Subtyping (BRACS) dataset, a large cohort of
annotated Hematoxylin & Eosin (H&E)-stained images to facilitate the
characterization of breast lesions. BRACS contains 547 Whole-Slide Images
(WSIs), and 4539 Regions of Interest (ROIs) extracted from the WSIs. Each WSI,
and respective ROIs, are annotated by the consensus of three board-certified
pathologists into different lesion categories. Specifically, BRACS includes
three lesion types, i.e., benign, malignant and atypical, which are further
subtyped into seven categories. It is, to the best of our knowledge, the
largest annotated dataset for breast cancer subtyping both at WSI- and
ROI-level. Further, by including the understudied atypical lesions, BRACS
offers an unique opportunity for leveraging AI to better understand their
characteristics.
Related papers
- Deep BI-RADS Network for Improved Cancer Detection from Mammograms [3.686808512438363]
We introduce a novel multi-modal approach that combines textual BI-RADS lesion descriptors with visual mammogram content.
Our method employs iterative attention layers to effectively fuse these different modalities.
Experiments on the CBIS-DDSM dataset demonstrate substantial improvements across all metrics.
arXiv Detail & Related papers (2024-11-16T21:32:51Z) - Towards a Benchmark for Colorectal Cancer Segmentation in Endorectal Ultrasound Videos: Dataset and Model Development [59.74920439478643]
In this paper, we collect and annotated the first benchmark dataset that covers diverse ERUS scenarios.
Our ERUS-10K dataset comprises 77 videos and 10,000 high-resolution annotated frames.
We introduce a benchmark model for colorectal cancer segmentation, named the Adaptive Sparse-context TRansformer (ASTR)
arXiv Detail & Related papers (2024-08-19T15:04:42Z) - Post-Hoc Explainability of BI-RADS Descriptors in a Multi-task Framework
for Breast Cancer Detection and Segmentation [48.08423125835335]
MT-BI-RADS is a novel explainable deep learning approach for tumor detection in Breast Ultrasound (BUS) images.
It offers three levels of explanations to enable radiologists to comprehend the decision-making process in predicting tumor malignancy.
arXiv Detail & Related papers (2023-08-27T22:07:42Z) - A Preliminary Investigation into Search and Matching for Tumour
Discrimination in WHO Breast Taxonomy Using Deep Networks [3.3519874057464283]
We analyzed the WHO breast taxonomy spanning 35 tumour types using deep features extracted from a state-of-the-art deep learning model.
The patch similarity search within the WHO breast taxonomy data reached over 88% accuracy when validating through "majority vote"
These results show for the first time that complex relationships among common and rare breast lesions can be investigated using an indexed digital archive.
arXiv Detail & Related papers (2023-08-22T03:40:46Z) - CARE: A Large Scale CT Image Dataset and Clinical Applicable Benchmark
Model for Rectal Cancer Segmentation [8.728236864462302]
Rectal cancer segmentation of CT image plays a crucial role in timely clinical diagnosis, radiotherapy treatment, and follow-up.
These obstacles arise from the intricate anatomical structures of the rectum and the difficulties in performing differential diagnosis of rectal cancer.
To address these issues, this work introduces a novel large scale rectal cancer CT image dataset CARE with pixel-level annotations for both normal and cancerous rectum.
We also propose a novel medical cancer lesion segmentation benchmark model named U-SAM.
The model is specifically designed to tackle the challenges posed by the intricate anatomical structures of abdominal organs by incorporating prompt information.
arXiv Detail & Related papers (2023-08-16T10:51:27Z) - ACROBAT -- a multi-stain breast cancer histological whole-slide-image
data set from routine diagnostics for computational pathology [1.6619031082709266]
The analysis of FFPE tissue sections stained with haematoxylin and eosin (H&E) orchemistry (IHC) is an essential part of the pathologic assessment of surgically resected breast cancer specimens.
This data set has the potential to enable many different avenues of computational pathology research.
arXiv Detail & Related papers (2022-11-24T14:16:36Z) - Breast Cancer Classification using Deep Learned Features Boosted with
Handcrafted Features [0.0]
It is of utmost importance for the research community to come up with the framework for early detection, classification and diagnosis.
In this article, a novel framework for classification of breast cancer using mammograms is proposed.
The proposed framework combines robust features extracted from novel Convolutional Neural Network (CNN) features with handcrafted features.
arXiv Detail & Related papers (2022-06-26T07:54:09Z) - Topological Data Analysis of copy number alterations in cancer [70.85487611525896]
We explore the potential to capture information contained in cancer genomic information using a novel topology-based approach.
We find that this technique has the potential to extract meaningful low-dimensional representations in cancer somatic genetic data.
arXiv Detail & Related papers (2020-11-22T17:31:23Z) - Spatio-spectral deep learning methods for in-vivo hyperspectral
laryngeal cancer detection [49.32653090178743]
Early detection of head and neck tumors is crucial for patient survival.
Hyperspectral imaging (HSI) can be used for non-invasive detection of head and neck tumors.
We present multiple deep learning techniques for in-vivo laryngeal cancer detection based on HSI.
arXiv Detail & Related papers (2020-04-21T17:07:18Z) - Stan: Small tumor-aware network for breast ultrasound image segmentation [68.8204255655161]
We propose a novel deep learning architecture called Small Tumor-Aware Network (STAN) to improve the performance of segmenting tumors with different size.
The proposed approach outperformed the state-of-the-art approaches in segmenting small breast tumors.
arXiv Detail & Related papers (2020-02-03T22:25:01Z) - VerSe: A Vertebrae Labelling and Segmentation Benchmark for
Multi-detector CT Images [121.31355003451152]
Large Scale Vertebrae Challenge (VerSe) was organised in conjunction with the International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI) in 2019 and 2020.
We present the the results of this evaluation and further investigate the performance-variation at vertebra-level, scan-level, and at different fields-of-view.
arXiv Detail & Related papers (2020-01-24T21:09:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.