XDEEP-MSI: Explainable Bias-Rejecting Microsatellite Instability Deep
Learning System In Colorectal Cancer
- URL: http://arxiv.org/abs/2110.15350v1
- Date: Thu, 28 Oct 2021 17:58:01 GMT
- Title: XDEEP-MSI: Explainable Bias-Rejecting Microsatellite Instability Deep
Learning System In Colorectal Cancer
- Authors: Aurelia Bustos (1), Artemio Pay\'a (2 and 3), Andres Torrubia (1),
Rodrigo Jover (2 and 3), Xavier Llor (4), Xavier Bessa (5), Antoni Castells
(6), Cristina Alenda (2 and 3) ((1) AI Cancer Research Unit Medbravo, (2)
Alicante University General Hospital, Spain, (3) Alicante Institute for
Health and Biomedical Research ISABIAL, (4) Department of Medicine and Cancer
Center at Yale University, Connecticut, (5) Hospital del Mar Medical Research
Institute IMIM, Barcelona, Spain, (6) Hospital Cl\'inic University of
Barcelona IDIBAPS CIBERehd, Spain)
- Abstract summary: We present a system for the prediction of microsatellite instability (MSI) from H&E images of colorectal cancer using deep learning (DL) techniques customized for tissue microarrays (TMAs)
The system incorporates an end-to-end image preprocessing module that produces tiles at multiple magnifications in the regions of interest as guided by a tissue module, and a multiple-bias rejecting module.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We present a system for the prediction of microsatellite instability (MSI)
from H&E images of colorectal cancer using deep learning (DL) techniques
customized for tissue microarrays (TMAs). The system incorporates an end-to-end
image preprocessing module that produces tiles at multiple magnifications in
the regions of interest as guided by a tissue classifier module, and a
multiple-bias rejecting module. The training and validation TMA samples were
obtained from the EPICOLON project and further enriched with samples from a
single institution. A systematic study of biases at tile level identified three
protected (bias) variables associated with the learned representations of a
baseline model: the project of origin of samples, the patient spot and the TMA
glass where each spot was placed. A multiple bias rejecting technique based on
adversarial training is implemented at the DL architecture so to directly avoid
learning the batch effects of those variables. The learned features from the
bias-ablated model have maximum discriminative power with respect to the task
and minimal statistical mean dependence with the biases. The impact of
different magnifications, types of tissues and the model performance at tile vs
patient level is analyzed. The AUC at tile level, and including all three
selected tissues (tumor epithelium, mucine and lymphocytic regions) and 4
magnifications, was 0.87 +/- 0.03 and increased to 0.9 +/- 0.03 at patient
level. To the best of our knowledge, this is the first work that incorporates a
multiple bias ablation technique at the DL architecture in digital pathology,
and the first using TMAs for the MSI prediction task.
Related papers
- Revisiting Automatic Data Curation for Vision Foundation Models in Digital Pathology [41.34847597178388]
Vision foundation models (FMs) learn to represent histological features in highly heterogeneous tiles extracted from whole-slide images.
We investigate the potential of unsupervised automatic data curation at the tile-level, taking into account 350 million tiles.
arXiv Detail & Related papers (2025-03-24T14:23:48Z) - Fast and Accurate Gigapixel Pathological Image Classification with Hierarchical Distillation Multi-Instance Learning [51.525891360380285]
HDMIL is a hierarchical distillation multi-instance learning framework that achieves fast and accurate classification by eliminating irrelevant patches.
HDMIL consists of two key components: the dynamic multi-instance network (DMIN) and the lightweight instance pre-screening network (LIPN)
arXiv Detail & Related papers (2025-02-28T15:10:07Z) - A digital eye-fixation biomarker using a deep anomaly scheme to classify Parkisonian patterns [0.6249768559720122]
Oculomotor alterations constitute a promising biomarker to detect and characterize Parkinson's disease (PD)
Recent advances on machine learning and video analysis have encouraged novel characterizations of eye movement patterns.
This work introduces a novel video analysis scheme to quantify Parkinsonian eye fixation patterns with an anomaly detection framework.
arXiv Detail & Related papers (2025-02-25T01:34:08Z) - Transformer-Based Self-Supervised Learning for Histopathological Classification of Ischemic Stroke Clot Origin [0.0]
Identifying the thromboembolism source in ischemic stroke is crucial for treatment and secondary prevention.
This study describes a self-supervised deep learning approach in digital pathology of emboli for classifying ischemic stroke clot origin.
arXiv Detail & Related papers (2024-05-01T23:40:12Z) - Equipping Computational Pathology Systems with Artifact Processing Pipelines: A Showcase for Computation and Performance Trade-offs [0.7226586370054761]
We propose a mixture of experts (MoE) scheme for detecting five notable artifacts, including damaged tissue, blur, folded tissue, air bubbles, and histologically irrelevant blood.
We developed DL pipelines using two MoEs and two multiclass models of state-of-the-art deep convolutional neural networks (DCNNs) and vision transformers (ViTs)
The proposed MoE yields 86.15% F1 and 97.93% sensitivity scores on unseen data, retaining less computational cost for inference than MoE using ViTs.
arXiv Detail & Related papers (2024-03-12T15:22:05Z) - The effect of data augmentation and 3D-CNN depth on Alzheimer's Disease
detection [51.697248252191265]
This work summarizes and strictly observes best practices regarding data handling, experimental design, and model evaluation.
We focus on Alzheimer's Disease (AD) detection, which serves as a paradigmatic example of challenging problem in healthcare.
Within this framework, we train predictive 15 models, considering three different data augmentation strategies and five distinct 3D CNN architectures.
arXiv Detail & Related papers (2023-09-13T10:40:41Z) - Rethinking Semi-Supervised Medical Image Segmentation: A
Variance-Reduction Perspective [51.70661197256033]
We propose ARCO, a semi-supervised contrastive learning framework with stratified group theory for medical image segmentation.
We first propose building ARCO through the concept of variance-reduced estimation and show that certain variance-reduction techniques are particularly beneficial in pixel/voxel-level segmentation tasks.
We experimentally validate our approaches on eight benchmarks, i.e., five 2D/3D medical and three semantic segmentation datasets, with different label settings.
arXiv Detail & Related papers (2023-02-03T13:50:25Z) - Successive Subspace Learning for Cardiac Disease Classification with
Two-phase Deformation Fields from Cine MRI [36.044984400761535]
This work proposes a lightweight successive subspace learning framework for CVD classification.
It is based on an interpretable feedforward design, in conjunction with a cardiac atlas.
Compared with 3D CNN-based approaches, our framework achieves superior classification performance with 140$times$ fewer parameters.
arXiv Detail & Related papers (2023-01-21T15:00:59Z) - Federated Learning for the Classification of Tumor Infiltrating
Lymphocytes [5.881088147423591]
We evaluate the performance of federated learning (FL) in developing deep learning models for analysis of digitized tissue sections.
Deep learning classification model was trained using 50*50 square micron patches extracted from whole slide images.
arXiv Detail & Related papers (2022-03-30T19:10:50Z) - Multi-Scale Input Strategies for Medulloblastoma Tumor Classification
using Deep Transfer Learning [59.30734371401316]
Medulloblastoma is the most common malignant brain cancer among children.
CNN has shown promising results for MB subtype classification.
We study the impact of tile size and input strategy.
arXiv Detail & Related papers (2021-09-14T09:42:37Z) - Cross-Site Severity Assessment of COVID-19 from CT Images via Domain
Adaptation [64.59521853145368]
Early and accurate severity assessment of Coronavirus disease 2019 (COVID-19) based on computed tomography (CT) images offers a great help to the estimation of intensive care unit event.
To augment the labeled data and improve the generalization ability of the classification model, it is necessary to aggregate data from multiple sites.
This task faces several challenges including class imbalance between mild and severe infections, domain distribution discrepancy between sites, and presence of heterogeneous features.
arXiv Detail & Related papers (2021-09-08T07:56:51Z) - DeepSMILE: Contrastive self-supervised pre-training benefits MSI and HRD
classification directly from H&E whole-slide images in colorectal and breast
cancer [22.46523830554047]
We propose a Deep learning-based weak label learning method for analyzing whole slide images (WSIs) of Hematoxylin and Eosin stained tumor tissue.
We apply DeepSMILE to the task of Homologous recombination deficiency (HRD) and microsatellite instability (MSI) prediction.
arXiv Detail & Related papers (2021-07-20T11:00:16Z) - A multi-stage machine learning model on diagnosis of esophageal
manometry [50.591267188664666]
The framework includes deep-learning models at the swallow-level stage and feature-based machine learning models at the study-level stage.
This is the first artificial-intelligence-style model to automatically predict CC diagnosis of HRM study from raw multi-swallow data.
arXiv Detail & Related papers (2021-06-25T20:09:23Z) - Relational Subsets Knowledge Distillation for Long-tailed Retinal
Diseases Recognition [65.77962788209103]
We propose class subset learning by dividing the long-tailed data into multiple class subsets according to prior knowledge.
It enforces the model to focus on learning the subset-specific knowledge.
The proposed framework proved to be effective for the long-tailed retinal diseases recognition task.
arXiv Detail & Related papers (2021-04-22T13:39:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.