Machine learning-based multimodal prognostic models integrating pathology images and high-throughput omic data for overall survival prediction in cancer: a systematic review
- URL: http://arxiv.org/abs/2507.16876v2
- Date: Tue, 29 Jul 2025 11:19:34 GMT
- Title: Machine learning-based multimodal prognostic models integrating pathology images and high-throughput omic data for overall survival prediction in cancer: a systematic review
- Authors: Charlotte Jennings, Andrew Broad, Lucy Godson, Emily Clarke, David Westhead, Darren Treanor,
- Abstract summary: Multimodal machine learning integrating histopathology and molecular data shows promise for cancer prognostication.<n>We systematically reviewed studies combining whole slide images (WSIs) and high- throughput omics to predict overall survival.<n>All studies showed unclear/high bias, limited external validation, and little focus on clinical utility.
- Score: 0.59374762912328
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Multimodal machine learning integrating histopathology and molecular data shows promise for cancer prognostication. We systematically reviewed studies combining whole slide images (WSIs) and high-throughput omics to predict overall survival. Searches of EMBASE, PubMed, and Cochrane CENTRAL (12/08/2024), plus citation screening, identified eligible studies. Data extraction used CHARMS; bias was assessed with PROBAST+AI; synthesis followed SWiM and PRISMA 2020. Protocol: PROSPERO (CRD42024594745). Forty-eight studies (all since 2017) across 19 cancer types met criteria; all used The Cancer Genome Atlas. Approaches included regularised Cox regression (n=4), classical ML (n=13), and deep learning (n=31). Reported c-indices ranged 0.550-0.857; multimodal models typically outperformed unimodal ones. However, all studies showed unclear/high bias, limited external validation, and little focus on clinical utility. Multimodal WSI-omics survival prediction is a fast-growing field with promising results but needs improved methodological rigor, broader datasets, and clinical evaluation. Funded by NPIC, Leeds Teaching Hospitals NHS Trust, UK (Project 104687), supported by UKRI Industrial Strategy Challenge Fund.
Related papers
- Deep Survival Analysis in Multimodal Medical Data: A Parametric and Probabilistic Approach with Competing Risks [47.19194118883552]
We introduce a multimodal deep learning framework for survival analysis capable of modeling both single and competing risks scenarios.<n>We propose SAMVAE (Survival Analysis Multimodal Variational Autoencoder), a novel deep learning architecture designed for survival prediction.
arXiv Detail & Related papers (2025-07-10T14:29:48Z) - MIL vs. Aggregation: Evaluating Patient-Level Survival Prediction Strategies Using Graph-Based Learning [52.231128973251124]
We compare various strategies for predicting survival at the WSI and patient level.<n>The former treats each WSI as an independent sample, mimicking the strategy adopted in other works.<n>The latter comprises methods to either aggregate the predictions of the several WSIs or automatically identify the most relevant slide.
arXiv Detail & Related papers (2025-03-29T11:14:02Z) - Multi-modal Medical Image Fusion For Non-Small Cell Lung Cancer Classification [7.002657345547741]
Non-small cell lung cancer (NSCLC) is a predominant cause of cancer mortality worldwide.
In this paper, we introduce an innovative integration of multi-modal data, synthesizing fused medical imaging (CT and PET scans) with clinical health records and genomic data.
Our research surpasses existing approaches, as evidenced by a substantial enhancement in NSCLC detection and classification precision.
arXiv Detail & Related papers (2024-09-27T12:59:29Z) - Self-Normalizing Foundation Model for Enhanced Multi-Omics Data Analysis in Oncology [0.0]
SeNMo is a foundation model that has been trained on multi-omics data across 33 cancer types.
We trained SeNMo for the task of overall survival of patients using pan-cancer multi-omics data involving 33 cancer sites.
SeNMo was validated on two independent cohorts: Moffitt Cancer Center and CPTAC lung squamous cell carcinoma.
arXiv Detail & Related papers (2024-05-13T22:45:44Z) - Survival Prediction Across Diverse Cancer Types Using Neural Networks [40.392772795903795]
Gastric cancer and Colon adenocarcinoma represent widespread and challenging malignancies.
Medical community has embraced the 5-year survival rate as a vital metric for estimating patient outcomes.
This study introduces a pioneering approach to enhance survival prediction models for gastric and Colon adenocarcinoma patients.
arXiv Detail & Related papers (2024-04-11T21:47:13Z) - Applications of artificial intelligence in the analysis of histopathology images of gliomas: a review [0.33999813472511115]
This review examines 83 publicly available research studies that have proposed AI-based methods for whole-slide histopathology images of human gliomas.
The focus of current research is the assessment of hematoxylin and eosin-stained tissue sections of adult-type diffuse gliomas.
So far, AI-based methods have achieved promising results, but are not yet used in real clinical settings.
arXiv Detail & Related papers (2024-01-26T17:29:01Z) - Pathology-and-genomics Multimodal Transformer for Survival Outcome
Prediction [43.1748594898772]
We propose a multimodal transformer (PathOmics) integrating pathology and genomics insights into colon-related cancer survival prediction.
We emphasize the unsupervised pretraining to capture the intrinsic interaction between tissue microenvironments in gigapixel whole slide images.
We evaluate our approach on both TCGA colon and rectum cancer cohorts, showing that the proposed approach is competitive and outperforms state-of-the-art studies.
arXiv Detail & Related papers (2023-07-22T00:59:26Z) - Deep learning methods for drug response prediction in cancer:
predominant and emerging trends [50.281853616905416]
Exploiting computational predictive models to study and treat cancer holds great promise in improving drug development and personalized design of treatment plans.
A wave of recent papers demonstrates promising results in predicting cancer response to drug treatments while utilizing deep learning methods.
This review allows to better understand the current state of the field and identify major challenges and promising solution paths.
arXiv Detail & Related papers (2022-11-18T03:26:31Z) - EOCSA: Predicting Prognosis of Epithelial Ovarian Cancer with Whole
Slide Histopathological Images [22.227676868758195]
Ovarian cancer is one of the most serious cancers that threaten women around the world.
In this study, we designed a deep framework named EOCSA which analyzes the prognosis of EOC patients based on pathological whole slide images (WSIs)
The experimental results demonstrate that our proposed framework has achieved state-of-the-art performance of 0.980 C-index.
arXiv Detail & Related papers (2022-10-11T08:40:40Z) - Federated Learning Enables Big Data for Rare Cancer Boundary Detection [98.5549882883963]
We present findings from the largest Federated ML study to-date, involving data from 71 healthcare institutions across 6 continents.
We generate an automatic tumor boundary detector for the rare disease of glioblastoma.
We demonstrate a 33% improvement over a publicly trained model to delineate the surgically targetable tumor, and 23% improvement over the tumor's entire extent.
arXiv Detail & Related papers (2022-04-22T17:27:00Z) - Analysis of MRI Biomarkers for Brain Cancer Survival Prediction [0.3093890460224435]
Prediction of Overall Survival (OS) of brain cancer patients from multi-modal MRI is a challenging field of research.
Most of the existing literature on survival prediction is based on Radiomic features.
Age was found to be the most important biological predictor.
arXiv Detail & Related papers (2021-09-03T05:35:47Z) - CT-based COVID-19 Triage: Deep Multitask Learning Improves Joint
Identification and Severity Quantification [45.86448200141968]
We describe two basic setups: Identification of COVID-19 to prioritize studies of potentially infected patients to isolate them as early as possible; Severity quantification to highlight studies of severe patients and direct them to a hospital or provide emergency medical care.
We propose a multitask approach to consolidate both triage approaches and propose a convolutional neural network to combine all available labels within a single model.
We train our model on approximately 2000 publicly available CT studies and test it with a carefully designed set consisting of 32 COVID-19 studies, 30 cases with bacterial pneumonia, 31 healthy patients, and 30 patients with other lung pathologies to emulate a typical patient flow in
arXiv Detail & Related papers (2020-06-02T08:05:06Z) - A Systematic Approach to Featurization for Cancer Drug Sensitivity
Predictions with Deep Learning [49.86828302591469]
We train >35,000 neural network models, sweeping over common featurization techniques.
We found the RNA-seq to be highly redundant and informative even with subsets larger than 128 features.
arXiv Detail & Related papers (2020-04-30T20:42:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.