From Whole-slide Image to Biomarker Prediction: A Protocol for
End-to-End Deep Learning in Computational Pathology
- URL: http://arxiv.org/abs/2312.10944v1
- Date: Mon, 18 Dec 2023 05:46:57 GMT
- Title: From Whole-slide Image to Biomarker Prediction: A Protocol for
End-to-End Deep Learning in Computational Pathology
- Authors: Omar S. M. El Nahhas, Marko van Treeck, Georg W\"olflein, Michaela
Unger, Marta Ligero, Tim Lenz, Sophia J. Wagner, Katherine J. Hewitt, Firas
Khader, Sebastian Foersch, Daniel Truhn, Jakob Nikolas Kather
- Abstract summary: This protocol describes a practical workflow for solid tumor associative modeling in pathology (STAMP)
The STAMP workflow is biomarker agnostic and allows for genetic- and clinicopathologic tabular data to be included as an additional input.
The protocol consists of five main stages which have been successfully applied to various research problems.
- Score: 0.725241982525598
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Hematoxylin- and eosin (H&E) stained whole-slide images (WSIs) are the
foundation of diagnosis of cancer. In recent years, development of deep
learning-based methods in computational pathology enabled the prediction of
biomarkers directly from WSIs. However, accurately linking tissue phenotype to
biomarkers at scale remains a crucial challenge for democratizing complex
biomarkers in precision oncology. This protocol describes a practical workflow
for solid tumor associative modeling in pathology (STAMP), enabling prediction
of biomarkers directly from WSIs using deep learning. The STAMP workflow is
biomarker agnostic and allows for genetic- and clinicopathologic tabular data
to be included as an additional input, together with histopathology images. The
protocol consists of five main stages which have been successfully applied to
various research problems: formal problem definition, data preprocessing,
modeling, evaluation and clinical translation. The STAMP workflow
differentiates itself through its focus on serving as a collaborative framework
that can be used by clinicians and engineers alike for setting up research
projects in the field of computational pathology. As an example task, we
applied STAMP to the prediction of microsatellite instability (MSI) status in
colorectal cancer, showing accurate performance for the identification of
MSI-high tumors. Moreover, we provide an open-source codebase which has been
deployed at several hospitals across the globe to set up computational
pathology workflows. The STAMP workflow requires one workday of hands-on
computational execution and basic command line knowledge.
Related papers
- AI Assisted Cervical Cancer Screening for Cytology Samples in Developing Countries [0.18472148461613155]
Cervical cancer remains a significant health challenge, with high incidence and mortality rates.
Conventional Liquid-Based Cytology(LBC) is a labor-intensive process, requires expert pathologists and is highly prone to errors.
This paper introduces an innovative approach that integrates low-cost biological microscopes with our simple and efficient AI algorithms for automated whole-slide analysis.
arXiv Detail & Related papers (2025-04-29T05:18:59Z) - BioX-CPath: Biologically-driven Explainable Diagnostics for Multistain IHC Computational Pathology [0.9603373981832565]
BioX-CPath is an explainable graph neural network architecture for whole slide image (WSI) classification.
At its core, BioX-CPath introduces a novel Stain-Aware Attention Pooling (SAAP) module that generates biologically meaningful, stain-aware patient embeddings.
arXiv Detail & Related papers (2025-03-26T18:00:22Z) - From Pixels to Histopathology: A Graph-Based Framework for Interpretable Whole Slide Image Analysis [81.19923502845441]
We develop a graph-based framework that constructs WSI graph representations.
We build tissue representations (nodes) that follow biological boundaries rather than arbitrary patches.
In our method's final step, we solve the diagnostic task through a graph attention network.
arXiv Detail & Related papers (2025-03-14T20:15:04Z) - Pathological Prior-Guided Multiple Instance Learning For Mitigating Catastrophic Forgetting in Breast Cancer Whole Slide Image Classification [50.899861205016265]
We propose a new framework PaGMIL to mitigate catastrophic forgetting in breast cancer WSI classification.
Our framework introduces two key components into the common MIL model architecture.
We evaluate the continual learning performance of PaGMIL across several public breast cancer datasets.
arXiv Detail & Related papers (2025-03-08T04:51:58Z) - Towards a Comprehensive Benchmark for Pathological Lymph Node Metastasis in Breast Cancer Sections [21.75452517154339]
We reprocessed 1,399 whole slide images (WSIs) and labels from the Camelyon-16 and Camelyon-17 datasets.
Based on the sizes of re-annotated tumor regions, we upgraded the binary cancer screening task to a four-class task.
arXiv Detail & Related papers (2024-11-16T09:19:24Z) - CryoFM: A Flow-based Foundation Model for Cryo-EM Densities [50.291974465864364]
We present CryoFM, a foundation model designed as a generative model, learning the distribution of high-quality density maps.
Built on flow matching, CryoFM is trained to accurately capture the prior distribution of biomolecular density maps.
arXiv Detail & Related papers (2024-10-11T08:53:58Z) - BioMNER: A Dataset for Biomedical Method Entity Recognition [25.403593761614424]
We propose a novel dataset for biomedical method entity recognition.
We employ an automated BioMethod entity recognition and information retrieval system to assist human annotation.
Our empirical findings reveal that the large parameter counts of language models surprisingly inhibit the effective assimilation of entity extraction patterns.
arXiv Detail & Related papers (2024-06-28T16:34:24Z) - WEEP: A method for spatial interpretation of weakly supervised CNN models in computational pathology [0.36096289461554343]
We propose a novel method, Wsi rEgion sElection aPproach (WEEP), for model interpretation.
We demonstrate WEEP on a binary classification task in the area of breast cancer computational pathology.
arXiv Detail & Related papers (2024-03-22T14:32:02Z) - HistGen: Histopathology Report Generation via Local-Global Feature Encoding and Cross-modal Context Interaction [16.060286162384536]
HistGen is a learning-empowered framework for histopathology report generation.
It aims to boost report generation by aligning whole slide images (WSIs) and diagnostic reports from local and global granularity.
Experimental results on WSI report generation show the proposed model outperforms state-of-the-art (SOTA) models by a large margin.
arXiv Detail & Related papers (2024-03-08T15:51:43Z) - Deep learning-based instance segmentation for the precise automated
quantification of digital breast cancer immunohistochemistry images [1.8434042562191815]
We demonstrate the feasibility of using a deep learning-based instance segmentation architecture for the automatic quantification of both nuclear and membrane biomarkers applied to IHC-stained slides.
We have collected annotations over samples of HE, ER and Ki-67 (nuclear biomarkers) and HER2 (membrane biomarker) IHC-stained images.
We have trained two models, so-called nuclei- and membrane-aware segmentation models, which, once successfully validated, have revealed to be a promising method to segment nuclei instances in IHC-stained images.
arXiv Detail & Related papers (2023-11-22T22:23:47Z) - BiomedGPT: A Generalist Vision-Language Foundation Model for Diverse Biomedical Tasks [68.39821375903591]
Generalist AI holds the potential to address limitations due to its versatility in interpreting different data types.
Here, we propose BiomedGPT, the first open-source and lightweight vision-language foundation model.
arXiv Detail & Related papers (2023-05-26T17:14:43Z) - SEMPAI: a Self-Enhancing Multi-Photon Artificial Intelligence for
prior-informed assessment of muscle function and pathology [48.54269377408277]
We introduce the Self-Enhancing Multi-Photon Artificial Intelligence (SEMPAI), that integrates hypothesis-driven priors in a data-driven Deep Learning approach.
SEMPAI performs joint learning of several tasks to enable prediction for small datasets.
SEMPAI outperforms state-of-the-art biomarkers in six of seven predictive tasks, including those with scarce data.
arXiv Detail & Related papers (2022-10-28T17:03:04Z) - Lung Cancer Lesion Detection in Histopathology Images Using Graph-Based
Sparse PCA Network [93.22587316229954]
We propose a graph-based sparse principal component analysis (GS-PCA) network, for automated detection of cancerous lesions on histological lung slides stained by hematoxylin and eosin (H&E)
We evaluate the performance of the proposed algorithm on H&E slides obtained from an SVM K-rasG12D lung cancer mouse model using precision/recall rates, F-score, Tanimoto coefficient, and area under the curve (AUC) of the receiver operator characteristic (ROC)
arXiv Detail & Related papers (2021-10-27T19:28:36Z) - MIMO: Mutual Integration of Patient Journey and Medical Ontology for
Healthcare Representation Learning [49.57261599776167]
We propose an end-to-end robust Transformer-based solution, Mutual Integration of patient journey and Medical Ontology (MIMO) for healthcare representation learning and predictive analytics.
arXiv Detail & Related papers (2021-07-20T07:04:52Z) - G-MIND: An End-to-End Multimodal Imaging-Genetics Framework for
Biomarker Identification and Disease Classification [49.53651166356737]
We propose a novel deep neural network architecture to integrate imaging and genetics data, as guided by diagnosis, that provides interpretable biomarkers.
We have evaluated our model on a population study of schizophrenia that includes two functional MRI (fMRI) paradigms and Single Nucleotide Polymorphism (SNP) data.
arXiv Detail & Related papers (2021-01-27T19:28:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.