Screen Them All: High-Throughput Pan-Cancer Genetic and Phenotypic Biomarker Screening from H&E Whole Slide Images
- URL: http://arxiv.org/abs/2408.09554v4
- Date: Mon, 14 Jul 2025 13:55:24 GMT
- Title: Screen Them All: High-Throughput Pan-Cancer Genetic and Phenotypic Biomarker Screening from H&E Whole Slide Images
- Authors: Yi Kan Wang, Ludmila Tydlitatova, Jeremy D. Kunz, Gerard Oakley, Bonnie Kar Bo Chow, Ran A. Godrich, Matthew C. H. Lee, Hamed Aghdam, Alican Bozkurt, Michal Zelechowski, Chad Vanderbilt, Christopher Kanan, Juan A. Retamero, Peter Hamilton, Razik Yousfi, Thomas J. Fuchs, David S. Klimstra, Siqi Liu,
- Abstract summary: OmniScreen is an AI-based system leveraging Virchow2 embeddings extracted from 60,529 cancer patients.<n>It employs a unified model to predict a broad range of clinically relevant biomarkers across cancers.<n>It reliably identifies therapeutic targets and shared phenotypic features across common and rare tumors.
- Score: 10.358246499005062
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Molecular assays are standard of care for detecting genomic alterations in cancer prognosis and therapy selection but are costly, tissue-destructive and time-consuming. Artificial intelligence (AI) applied to routine hematoxylin and eosin (H&E)-stained whole slide images (WSIs) offers a fast and economical alternative for screening molecular biomarkers. We introduce OmniScreen, a high-throughput AI-based system leveraging Virchow2 embeddings extracted from 60,529 cancer patients with paired 489-gene MSK-IMPACT targeted biomarker panel and WSIs. Unlike conventional approaches that train separate models for each biomarker, OmniScreen employs a unified model to predict a broad range of clinically relevant biomarkers across cancers, including low-prevalence targets impractical to model individually. OmniScreen reliably identifies therapeutic targets and shared phenotypic features across common and rare tumors. We investigate the biomarker prediction probabilities and accuracies of OmniScreen in relation to tumor area, cohort size, histologic subtype alignment, and pathway-level morphological patterns. These findings underscore the potential of OmniScreen for routine clinical screening.
Related papers
- A Multicentric Dataset for Training and Benchmarking Breast Cancer Segmentation in H&E Slides [1.2783652545738993]
We introduce BrEast cancEr hisTopathoLogy sEgmentation (BEETLE), a dataset for multiclass semantic segmentation of H&E-stained breast cancer WSIs.<n>It consists of 587 biopsies and resections from three collaborating clinical centers and two public datasets, digitized using seven scanners, and covers all molecular subtypes and histological grades.<n>The dataset's diversity and relevance to the rapidly growing field of automated biomarker quantification in breast cancer ensure its high potential for reuse.
arXiv Detail & Related papers (2025-10-02T14:09:21Z) - Multimodal AI-driven Biomarker for Early Detection of Cancer Cachexia [14.27396467108753]
Cancer cachexia is a multifactorial syndrome characterized by progressive muscle wasting, metabolic dysfunction, and systemic inflammation.
There is no single definitive biomarker for cachexia.
This study proposes a multimodal AI-based biomarker for early cancer cachexia detection.
arXiv Detail & Related papers (2025-03-09T22:32:37Z) - Computational Methods for Breast Cancer Molecular Profiling through Routine Histopathology: A Review [0.2671776059280352]
Recent advancements in artificial intelligence have enabled digital pathology to analyze histopathologic images for targeted molecular and broader omic biomarkers.
These technologies offer the capability to extract various biomarkers such as genomic, transcriptomic, proteomic, and metabolomic markers directly from the routine hematoxylin and eosin stained images.
arXiv Detail & Related papers (2024-12-01T08:13:49Z) - Biomarker based Cancer Classification using an Ensemble with Pre-trained Models [2.2436844508175224]
We propose a novel ensemble model combining pre-trained Hyperfast model, XGBoost, and LightGBM for multi-class classification tasks.
We leverage a meta-trained Hyperfast model for classifying cancer, accomplishing the highest AUC of 0.9929.
We also propose a novel ensemble model combining pre-trained Hyperfast model, XGBoost, and LightGBM for multi-class classification tasks, achieving an incremental increase in accuracy (0.9464)
arXiv Detail & Related papers (2024-06-14T14:43:59Z) - MMIL: A novel algorithm for disease associated cell type discovery [58.044870442206914]
Single-cell datasets often lack individual cell labels, making it challenging to identify cells associated with disease.
We introduce Mixture Modeling for Multiple Learning Instance (MMIL), an expectation method that enables the training and calibration of cell-level classifiers.
arXiv Detail & Related papers (2024-06-12T15:22:56Z) - Using Multiparametric MRI with Optimized Synthetic Correlated Diffusion Imaging to Enhance Breast Cancer Pathologic Complete Response Prediction [71.91773485443125]
Neoadjuvant chemotherapy has recently gained popularity as a promising treatment strategy for breast cancer.
The current process to recommend neoadjuvant chemotherapy relies on the subjective evaluation of medical experts.
This research investigates the application of optimized CDI$s$ to enhance breast cancer pathologic complete response prediction.
arXiv Detail & Related papers (2024-05-13T15:40:56Z) - Deep Learning-based Prediction of Breast Cancer Tumor and Immune Phenotypes from Histopathology [3.9270231212340354]
There is not a widely available method to reproducibly measure tumor and immune phenotypes for each patient's tumor.
We applied multiple instance learning (MIL) algorithms to assess activity of ten biologically relevant pathways from the hematoxylin and eosin slide of primary breast tumors.
Our trained models recognize biologically relevant spatial patterns of cell sub-populations from H&E.
arXiv Detail & Related papers (2024-04-25T08:15:37Z) - Improving Performance in Colorectal Cancer Histology Decomposition using Deep and Ensemble Machine Learning [0.7082642128219231]
Histologic samples stained with hematoxylin and eosin are commonly used in colorectal cancer management.
Recent research highlights the potential of convolutional neural networks (CNNs) in facilitating the extraction of clinically relevant biomarkers from readily available images.
CNN-based biomarkers can predict patient outcomes comparably to golden standards, with the added advantages of speed, automation, and minimal cost.
arXiv Detail & Related papers (2023-10-25T19:46:27Z) - Deep Learning Predicts Biomarker Status and Discovers Related
Histomorphology Characteristics for Low-Grade Glioma [21.281553456323998]
Biomarker detection is an indispensable part in the diagnosis and treatment of low-grade glioma (LGG)
We propose an interpretable deep learning pipeline to predict the status of five biomarkers in LGG using only hematoxylin and eosin-stained whole slide images and slide-level biomarker status labels.
Our pipeline not only provides a novel approach for biomarker prediction, enhancing the applicability of molecular treatments for LGG patients but also facilitates the discovery of new mechanisms in molecular functionality and LGG progression.
arXiv Detail & Related papers (2023-10-11T13:05:33Z) - A marker-less human motion analysis system for motion-based biomarker
discovery in knee disorders [60.99112047564336]
The NHS has been having increased difficulty seeing all low-risk patients, this includes but not limited to suspected osteoarthritis (OA) patients.
We propose a novel method of automated biomarker identification for diagnosis of knee disorders and the monitoring of treatment progression.
arXiv Detail & Related papers (2023-04-26T16:47:42Z) - Regression-based Deep-Learning predicts molecular biomarkers from
pathology slides [40.24757332810004]
We developed and evaluated a new self-supervised attention-based weakly supervised regression method that predicts continuous biomarkers directly from images.
Using regression significantly enhances the accuracy of biomarker prediction, while also improving the interpretability of the results over classification.
Our open-source regression approach offers a promising alternative for continuous biomarker analysis in computational pathology.
arXiv Detail & Related papers (2023-04-11T11:43:51Z) - Artificial-intelligence-based molecular classification of diffuse
gliomas using rapid, label-free optical imaging [59.79875531898648]
DeepGlioma is an artificial-intelligence-based diagnostic screening system.
DeepGlioma can predict the molecular alterations used by the World Health Organization to define the adult-type diffuse glioma taxonomy.
arXiv Detail & Related papers (2023-03-23T18:50:18Z) - Deep Learning-Based Prediction of Molecular Tumor Biomarkers from H&E: A
Practical Review [0.0]
Molecular and genomic properties are critical in selecting cancer treatments to target individual tumors.
Applying machine learning to H&E images can provide a more cost-effective screening method.
This article reviews the diverse applications across cancer types and the methodology to train and validate these models.
arXiv Detail & Related papers (2022-11-27T14:57:41Z) - A Pathologist-Informed Workflow for Classification of Prostate Glands in
Histopathology [62.997667081978825]
Pathologists diagnose and grade prostate cancer by examining tissue from needle biopsies on glass slides.
Cancer's severity and risk of metastasis are determined by the Gleason grade, a score based on the organization and morphology of prostate cancer glands.
This paper proposes an automated workflow that follows pathologists' textitmodus operandi, isolating and classifying multi-scale patches of individual glands.
arXiv Detail & Related papers (2022-09-27T14:08:19Z) - Lymphocyte Classification in Hyperspectral Images of Ovarian Cancer
Tissue Biopsy Samples [94.37521840642141]
We present a machine learning pipeline to segment white blood cell pixels in hyperspectral images of biopsy cores.
These cells are clinically important for diagnosis, but some prior work has struggled to incorporate them due to difficulty obtaining precise pixel labels.
arXiv Detail & Related papers (2022-03-23T00:58:27Z) - Multi-Scale Hybrid Vision Transformer for Learning Gastric Histology:
AI-Based Decision Support System for Gastric Cancer Treatment [50.89811515036067]
Gastric endoscopic screening is an effective way to decide appropriate gastric cancer (GC) treatment at an early stage, reducing GC-associated mortality rate.
We propose a practical AI system that enables five subclassifications of GC pathology, which can be directly matched to general GC treatment guidance.
arXiv Detail & Related papers (2022-02-17T08:33:52Z) - Malignancy Prediction and Lesion Identification from Clinical
Dermatological Images [65.1629311281062]
We consider machine-learning-based malignancy prediction and lesion identification from clinical dermatological images.
We first identify all lesions present in the image regardless of sub-type or likelihood of malignancy, then it estimates their likelihood of malignancy, and through aggregation, it also generates an image-level likelihood of malignancy.
arXiv Detail & Related papers (2021-04-02T20:52:05Z) - Exploring Genetic-histologic Relationships in Breast Cancer [28.91314299138311]
This work uses deep learning to predict genomic biomarkers from breast cancer histopathology images.
We outperform the existing works with a minimum improvement of 0.02 and a maximum of 0.13 AUROC scores across all tasks.
arXiv Detail & Related papers (2021-03-15T00:53:47Z) - Cancer Gene Profiling through Unsupervised Discovery [49.28556294619424]
We introduce a novel, automatic and unsupervised framework to discover low-dimensional gene biomarkers.
Our method is based on the LP-Stability algorithm, a high dimensional center-based unsupervised clustering algorithm.
Our signature reports promising results on distinguishing immune inflammatory and immune desert tumors.
arXiv Detail & Related papers (2021-02-11T09:04:45Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.