From Whole-slide Image to Biomarker Prediction: A Protocol for
End-to-End Deep Learning in Computational Pathology
- URL: http://arxiv.org/abs/2312.10944v1
- Date: Mon, 18 Dec 2023 05:46:57 GMT
- Title: From Whole-slide Image to Biomarker Prediction: A Protocol for
End-to-End Deep Learning in Computational Pathology
- Authors: Omar S. M. El Nahhas, Marko van Treeck, Georg W\"olflein, Michaela
Unger, Marta Ligero, Tim Lenz, Sophia J. Wagner, Katherine J. Hewitt, Firas
Khader, Sebastian Foersch, Daniel Truhn, Jakob Nikolas Kather
- Abstract summary: This protocol describes a practical workflow for solid tumor associative modeling in pathology (STAMP)
The STAMP workflow is biomarker agnostic and allows for genetic- and clinicopathologic tabular data to be included as an additional input.
The protocol consists of five main stages which have been successfully applied to various research problems.
- Score: 0.725241982525598
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Hematoxylin- and eosin (H&E) stained whole-slide images (WSIs) are the
foundation of diagnosis of cancer. In recent years, development of deep
learning-based methods in computational pathology enabled the prediction of
biomarkers directly from WSIs. However, accurately linking tissue phenotype to
biomarkers at scale remains a crucial challenge for democratizing complex
biomarkers in precision oncology. This protocol describes a practical workflow
for solid tumor associative modeling in pathology (STAMP), enabling prediction
of biomarkers directly from WSIs using deep learning. The STAMP workflow is
biomarker agnostic and allows for genetic- and clinicopathologic tabular data
to be included as an additional input, together with histopathology images. The
protocol consists of five main stages which have been successfully applied to
various research problems: formal problem definition, data preprocessing,
modeling, evaluation and clinical translation. The STAMP workflow
differentiates itself through its focus on serving as a collaborative framework
that can be used by clinicians and engineers alike for setting up research
projects in the field of computational pathology. As an example task, we
applied STAMP to the prediction of microsatellite instability (MSI) status in
colorectal cancer, showing accurate performance for the identification of
MSI-high tumors. Moreover, we provide an open-source codebase which has been
deployed at several hospitals across the globe to set up computational
pathology workflows. The STAMP workflow requires one workday of hands-on
computational execution and basic command line knowledge.
Related papers
- Towards a Comprehensive Benchmark for Pathological Lymph Node Metastasis in Breast Cancer Sections [21.75452517154339]
We reprocessed 1,399 whole slide images (WSIs) and labels from the Camelyon-16 and Camelyon-17 datasets.
Based on the sizes of re-annotated tumor regions, we upgraded the binary cancer screening task to a four-class task.
arXiv Detail & Related papers (2024-11-16T09:19:24Z) - How quantum computing can enhance biomarker discovery [0.14043931310479377]
Quantum algorithms, particularly in machine learning, are mapped to key applications in biomarker discovery.
The opportunities and challenges associated with the algorithms and applications are discussed.
An outlook is provided concerning open research challenges.
arXiv Detail & Related papers (2024-11-15T16:50:05Z) - CryoFM: A Flow-based Foundation Model for Cryo-EM Densities [50.291974465864364]
We present CryoFM, a foundation model designed as a generative model, learning the distribution of high-quality density maps.
Built on flow matching, CryoFM is trained to accurately capture the prior distribution of biomolecular density maps.
arXiv Detail & Related papers (2024-10-11T08:53:58Z) - BioMNER: A Dataset for Biomedical Method Entity Recognition [25.403593761614424]
We propose a novel dataset for biomedical method entity recognition.
We employ an automated BioMethod entity recognition and information retrieval system to assist human annotation.
Our empirical findings reveal that the large parameter counts of language models surprisingly inhibit the effective assimilation of entity extraction patterns.
arXiv Detail & Related papers (2024-06-28T16:34:24Z) - WEEP: A method for spatial interpretation of weakly supervised CNN models in computational pathology [0.36096289461554343]
We propose a novel method, Wsi rEgion sElection aPproach (WEEP), for model interpretation.
We demonstrate WEEP on a binary classification task in the area of breast cancer computational pathology.
arXiv Detail & Related papers (2024-03-22T14:32:02Z) - HistGen: Histopathology Report Generation via Local-Global Feature Encoding and Cross-modal Context Interaction [16.060286162384536]
HistGen is a learning-empowered framework for histopathology report generation.
It aims to boost report generation by aligning whole slide images (WSIs) and diagnostic reports from local and global granularity.
Experimental results on WSI report generation show the proposed model outperforms state-of-the-art (SOTA) models by a large margin.
arXiv Detail & Related papers (2024-03-08T15:51:43Z) - BiomedGPT: A Generalist Vision-Language Foundation Model for Diverse Biomedical Tasks [68.39821375903591]
Generalist AI holds the potential to address limitations due to its versatility in interpreting different data types.
Here, we propose BiomedGPT, the first open-source and lightweight vision-language foundation model.
arXiv Detail & Related papers (2023-05-26T17:14:43Z) - SEMPAI: a Self-Enhancing Multi-Photon Artificial Intelligence for
prior-informed assessment of muscle function and pathology [48.54269377408277]
We introduce the Self-Enhancing Multi-Photon Artificial Intelligence (SEMPAI), that integrates hypothesis-driven priors in a data-driven Deep Learning approach.
SEMPAI performs joint learning of several tasks to enable prediction for small datasets.
SEMPAI outperforms state-of-the-art biomarkers in six of seven predictive tasks, including those with scarce data.
arXiv Detail & Related papers (2022-10-28T17:03:04Z) - Lung Cancer Lesion Detection in Histopathology Images Using Graph-Based
Sparse PCA Network [93.22587316229954]
We propose a graph-based sparse principal component analysis (GS-PCA) network, for automated detection of cancerous lesions on histological lung slides stained by hematoxylin and eosin (H&E)
We evaluate the performance of the proposed algorithm on H&E slides obtained from an SVM K-rasG12D lung cancer mouse model using precision/recall rates, F-score, Tanimoto coefficient, and area under the curve (AUC) of the receiver operator characteristic (ROC)
arXiv Detail & Related papers (2021-10-27T19:28:36Z) - MIMO: Mutual Integration of Patient Journey and Medical Ontology for
Healthcare Representation Learning [49.57261599776167]
We propose an end-to-end robust Transformer-based solution, Mutual Integration of patient journey and Medical Ontology (MIMO) for healthcare representation learning and predictive analytics.
arXiv Detail & Related papers (2021-07-20T07:04:52Z) - G-MIND: An End-to-End Multimodal Imaging-Genetics Framework for
Biomarker Identification and Disease Classification [49.53651166356737]
We propose a novel deep neural network architecture to integrate imaging and genetics data, as guided by diagnosis, that provides interpretable biomarkers.
We have evaluated our model on a population study of schizophrenia that includes two functional MRI (fMRI) paradigms and Single Nucleotide Polymorphism (SNP) data.
arXiv Detail & Related papers (2021-01-27T19:28:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.