Development and Validation of a Deep Learning-Based Microsatellite
Instability Predictor from Prostate Cancer Whole-Slide Images
- URL: http://arxiv.org/abs/2310.08743v1
- Date: Thu, 12 Oct 2023 22:09:53 GMT
- Title: Development and Validation of a Deep Learning-Based Microsatellite
Instability Predictor from Prostate Cancer Whole-Slide Images
- Authors: Qiyuan Hu, Abbas A. Rizvi, Geoffery Schau, Kshitij Ingale, Yoni
Muller, Rachel Baits, Sebastian Pretzer, A\"icha BenTaieb, Abigail Gordhamer,
Roberto Nussenzveig, Adam Cole, Matthew O. Leavitt, Rohan P. Joshi, Nike
Beaubier, Martin C. Stumpe, Kunal Nagpal
- Abstract summary: Microsatellite instability-high (MSI-H) is a tumor agnostic biomarker for immune checkpoint inhibitor therapy.
We developed and validated an AI-based MSI-H diagnostic model on a large real-world cohort of routine H&E slides.
This algorithm has the potential to direct prostate cancer patients toward immunotherapy and to identify MSI-H cases secondary to Lynch syndrome.
- Score: 1.4942902702740595
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Microsatellite instability-high (MSI-H) is a tumor agnostic biomarker for
immune checkpoint inhibitor therapy. However, MSI status is not routinely
tested in prostate cancer, in part due to low prevalence and assay cost. As
such, prediction of MSI status from hematoxylin and eosin (H&E) stained
whole-slide images (WSIs) could identify prostate cancer patients most likely
to benefit from confirmatory testing and becoming eligible for immunotherapy.
Prostate biopsies and surgical resections from de-identified records of
consecutive prostate cancer patients referred to our institution were analyzed.
Their MSI status was determined by next generation sequencing. Patients before
a cutoff date were split into an algorithm development set (n=4015, MSI-H 1.8%)
and a paired validation set (n=173, MSI-H 19.7%) that consisted of two serial
sections from each sample, one stained and scanned internally and the other at
an external site. Patients after the cutoff date formed the temporal validation
set (n=1350, MSI-H 2.3%). Attention-based multiple instance learning models
were trained to predict MSI-H from H&E WSIs. The MSI-H predictor achieved area
under the receiver operating characteristic curve values of 0.78 (95% CI
[0.69-0.86]), 0.72 (95% CI [0.63-0.81]), and 0.72 (95% CI [0.62-0.82]) on the
internally prepared, externally prepared, and temporal validation sets,
respectively. While MSI-H status is significantly correlated with Gleason
score, the model remained predictive within each Gleason score subgroup. In
summary, we developed and validated an AI-based MSI-H diagnostic model on a
large real-world cohort of routine H&E slides, which effectively generalized to
externally stained and scanned samples and a temporally independent validation
cohort. This algorithm has the potential to direct prostate cancer patients
toward immunotherapy and to identify MSI-H cases secondary to Lynch syndrome.
Related papers
- Advanced Meta-Ensemble Machine Learning Models for Early and Accurate Sepsis Prediction to Improve Patient Outcomes [0.0]
This paper examines the limitations of traditional sepsis screening tools like Systemic Inflammatory Response Syndrome, Modified Early Warning Score, and Quick Sequential Organ Failure Assessment.
We propose using machine learning techniques - Random Forest, Extreme Gradient Boosting, and Decision Tree models - to predict sepsis onset.
Our study evaluates these models individually and in a combined meta-ensemble approach using key metrics such as Accuracy, Precision, Recall, F1 score, and Area Under the Receiver Operating Characteristic Curve.
arXiv Detail & Related papers (2024-07-11T00:51:32Z) - Using Pre-training and Interaction Modeling for ancestry-specific disease prediction in UK Biobank [69.90493129893112]
Recent genome-wide association studies (GWAS) have uncovered the genetic basis of complex traits, but show an under-representation of non-European descent individuals.
Here, we assess whether we can improve disease prediction across diverse ancestries using multiomic data.
arXiv Detail & Related papers (2024-04-26T16:39:50Z) - Clinical Melanoma Diagnosis with Artificial Intelligence: Insights from
a Prospective Multicenter Study [1.2397589403129072]
AI has proven to be helpful for enhancing melanoma detection.
Existing studies are limited by low sample sizes, too homogenous datasets, or lack of inclusion of rare melanoma subtypes.
We assessed 'All Data are Ext' (ADAE), an established open-source algorithm for detecting melanomas, by comparing its diagnostic accuracy to that of dermatologists.
arXiv Detail & Related papers (2024-01-25T14:03:54Z) - A Fully Automated Pipeline Using Swin Transformers for Deep
Learning-Based Blood Segmentation on Head CT Scans After Aneurysmal
Subarachnoid Hemorrhage [0.0]
We develop and validate an artificial intelligence-driven, fully automated blood segmentation tool for spontaneous subarachnoid hemorrhage (SAH) patients.
We retrospectively analyzed NCCT scans from patients with confirmed aneurysmal subarachnoid hemorrhage (aSAH) utilizing the Swin UNETR for segmentation.
arXiv Detail & Related papers (2023-12-29T10:57:51Z) - Quantifying Impairment and Disease Severity Using AI Models Trained on
Healthy Subjects [27.786240241494436]
COnfidence-Based chaRacterization of Anomalies (COBRA) score exploits the decrease in confidence of these models when presented with impaired or diseased patients.
We applied the COBRA score to address a key limitation of current clinical evaluation of upper-body impairment in stroke patients.
arXiv Detail & Related papers (2023-11-21T18:45:52Z) - Bootstrapping Your Own Positive Sample: Contrastive Learning With
Electronic Health Record Data [62.29031007761901]
This paper proposes a novel contrastive regularized clinical classification model.
We introduce two unique positive sampling strategies specifically tailored for EHR data.
Our framework yields highly competitive experimental results in predicting the mortality risk on real-world COVID-19 EHR data.
arXiv Detail & Related papers (2021-04-07T06:02:04Z) - Deep Learning for fully automatic detection, segmentation, and Gleason
Grade estimation of prostate cancer in multiparametric Magnetic Resonance
Images [0.731365367571807]
This paper proposes a fully automatic system based on Deep Learning that takes a prostate mpMRI from a PCa-suspect patient.
It locates PCa lesions, segments them, and predicts their most likely Gleason grade group (GGG)
The code for the ProstateX-trained system has been made openly available at https://github.com/OscarPellicer/prostate_lesion_detection.
arXiv Detail & Related papers (2021-03-23T16:08:43Z) - UNITE: Uncertainty-based Health Risk Prediction Leveraging Multi-sourced
Data [81.00385374948125]
We present UNcertaInTy-based hEalth risk prediction (UNITE) model.
UNITE provides accurate disease risk prediction and uncertainty estimation leveraging multi-sourced health data.
We evaluate UNITE on real-world disease risk prediction tasks: nonalcoholic fatty liver disease (NASH) and Alzheimer's disease (AD)
UNITE achieves up to 0.841 in F1 score for AD detection, up to 0.609 in PR-AUC for NASH detection, and outperforms various state-of-the-art baselines by up to $19%$ over the best baseline.
arXiv Detail & Related papers (2020-10-22T02:28:11Z) - MIA-Prognosis: A Deep Learning Framework to Predict Therapy Response [58.0291320452122]
This paper aims at a unified deep learning approach to predict patient prognosis and therapy response.
We formalize the prognosis modeling as a multi-modal asynchronous time series classification task.
Our predictive model could further stratify low-risk and high-risk patients in terms of long-term survival.
arXiv Detail & Related papers (2020-10-08T15:30:17Z) - CovidDeep: SARS-CoV-2/COVID-19 Test Based on Wearable Medical Sensors
and Efficient Neural Networks [51.589769497681175]
The novel coronavirus (SARS-CoV-2) has led to a pandemic.
The current testing regime based on Reverse Transcription-Polymerase Chain Reaction for SARS-CoV-2 has been unable to keep up with testing demands.
We propose a framework called CovidDeep that combines efficient DNNs with commercially available WMSs for pervasive testing of the virus.
arXiv Detail & Related papers (2020-07-20T21:47:28Z) - Hemogram Data as a Tool for Decision-making in COVID-19 Management:
Applications to Resource Scarcity Scenarios [62.997667081978825]
COVID-19 pandemics has challenged emergency response systems worldwide, with widespread reports of essential services breakdown and collapse of health care structure.
This work describes a machine learning model derived from hemogram exam data performed in symptomatic patients.
Proposed models can predict COVID-19 qRT-PCR results in symptomatic individuals with high accuracy, sensitivity and specificity.
arXiv Detail & Related papers (2020-05-10T01:45:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.