NeuroRAD-FM: A Foundation Model for Neuro-Oncology with Distributionally Robust Training
- URL: http://arxiv.org/abs/2509.15416v1
- Date: Thu, 18 Sep 2025 20:43:08 GMT
- Title: NeuroRAD-FM: A Foundation Model for Neuro-Oncology with Distributionally Robust Training
- Authors: Moinak Bhattacharya, Angelica P. Kurtz, Fabio M. Iwamoto, Prateek Prasanna, Gagandeep Singh,
- Abstract summary: Existing foundation models (FMs) perform poorly in predicting uncommon molecular markers.<n>We developed a neuro-oncology specific FM with a distributionally robust loss function, enabling accurate estimation of tumor phenotypes.<n>For survival, c-index improved at all sites: CUIMC 0.592 to 0.597, UPenn 0.647 to 0.672, UCSF 0.600 to 0.627.
- Score: 12.168374910250572
- License: http://creativecommons.org/publicdomain/zero/1.0/
- Abstract: Neuro-oncology poses unique challenges for machine learning due to heterogeneous data and tumor complexity, limiting the ability of foundation models (FMs) to generalize across cohorts. Existing FMs also perform poorly in predicting uncommon molecular markers, which are essential for treatment response and risk stratification. To address these gaps, we developed a neuro-oncology specific FM with a distributionally robust loss function, enabling accurate estimation of tumor phenotypes while maintaining cross-institution generalization. We pretrained self-supervised backbones (BYOL, DINO, MAE, MoCo) on multi-institutional brain tumor MRI and applied distributionally robust optimization (DRO) to mitigate site and class imbalance. Downstream tasks included molecular classification of common markers (MGMT, IDH1, 1p/19q, EGFR), uncommon alterations (ATRX, TP53, CDKN2A/2B, TERT), continuous markers (Ki-67, TP53), and overall survival prediction in IDH1 wild-type glioblastoma at UCSF, UPenn, and CUIMC. Our method improved molecular prediction and reduced site-specific embedding differences. At CUIMC, mean balanced accuracy rose from 0.744 to 0.785 and AUC from 0.656 to 0.676, with the largest gains for underrepresented endpoints (CDKN2A/2B accuracy 0.86 to 0.92, AUC 0.73 to 0.92; ATRX AUC 0.69 to 0.82; Ki-67 accuracy 0.60 to 0.69). For survival, c-index improved at all sites: CUIMC 0.592 to 0.597, UPenn 0.647 to 0.672, UCSF 0.600 to 0.627. Grad-CAM highlighted tumor and peri-tumoral regions, confirming interpretability. Overall, coupling FMs with DRO yields more site-invariant representations, improves prediction of common and uncommon markers, and enhances survival discrimination, underscoring the need for prospective validation and integration of longitudinal and interventional signals to advance precision neuro-oncology.
Related papers
- MeCaMIL: Causality-Aware Multiple Instance Learning for Fair and Interpretable Whole Slide Image Diagnosis [40.3028468133626]
Multiple instance learning (MIL) has emerged as the dominant paradigm for whole slide image (WSI) analysis in computational pathology.<n>textbfMeCaMIL, a causality-aware MIL framework, explicitly models demographic confounders through structured causal graphs.<n>MeCaMIL achieves superior fairness -- demographic disparity variance drops by over 65% relative reduction on average across attributes.
arXiv Detail & Related papers (2025-11-14T06:47:21Z) - Peptidomic-Based Prediction Model for Coronary Heart Disease Using a Multilayer Perceptron Neural Network [0.0]
Coronary heart disease (CHD) is a leading cause of death worldwide and contributes significantly to annual healthcare expenditures.<n>To develop a non-invasive diagnostic approach, we designed a model based on a multilayer perceptron (MLP) neural network.<n>The model achieved a precision, sensitivity, and specificity of 95.67 percent, with an F1-score of 0.9565.
arXiv Detail & Related papers (2025-09-04T04:54:02Z) - Predicting Length of Stay in Neurological ICU Patients Using Classical Machine Learning and Neural Network Models: A Benchmark Study on MIMIC-IV [49.1574468325115]
This study explores multiple ML approaches for predicting LOS in ICU specifically for the patients with neurological diseases based on the MIMIC-IV dataset.<n>The evaluated models include classic ML algorithms (K-Nearest Neighbors, Random Forest, XGBoost and CatBoost) and Neural Networks (LSTM, BERT and Temporal Fusion Transformer)
arXiv Detail & Related papers (2025-05-23T14:06:42Z) - Pixels to Prognosis: Harmonized Multi-Region CT-Radiomics and Foundation-Model Signatures Across Multicentre NSCLC Data [3.6651189622079223]
Harmonization and multi-region feature integration improve survival prediction in non-small cell lung cancer (NSCLC) patients.<n>We analyzed CT scans and clinical data from 876 NSCLC patients across five centers.
arXiv Detail & Related papers (2025-05-23T13:41:52Z) - AXIAL: Attention-based eXplainability for Interpretable Alzheimer's Localized Diagnosis using 2D CNNs on 3D MRI brain scans [43.06293430764841]
This study presents an innovative method for Alzheimer's disease diagnosis using 3D MRI designed to enhance the explainability of model decisions.
Our approach adopts a soft attention mechanism, enabling 2D CNNs to extract volumetric representations.
With voxel-level precision, our method identified which specific areas are being paid attention to, identifying these predominant brain regions.
arXiv Detail & Related papers (2024-07-02T16:44:00Z) - BioFusionNet: Deep Learning-Based Survival Risk Stratification in ER+ Breast Cancer Through Multifeature and Multimodal Data Fusion [16.83901927767791]
We present BioFusionNet, a deep learning framework that fuses image-derived features with genetic and clinical data to obtain a holistic profile.
Our model achieves a mean concordance index of 0.77 and a time-dependent area under the curve of 0.84, outperforming state-of-the-art methods.
arXiv Detail & Related papers (2024-02-16T14:19:33Z) - CIMIL-CRC: a clinically-informed multiple instance learning framework for patient-level colorectal cancer molecular subtypes classification from H\&E stained images [42.771819949806655]
We introduce CIMIL-CRC', a framework that solves the MSI/MSS MIL problem by efficiently combining a pre-trained feature extraction model with principal component analysis (PCA) to aggregate information from all patches.
We assessed our CIMIL-CRC method using the average area under the curve (AUC) from a 5-fold cross-validation experimental setup for model development on the TCGA-CRC-DX cohort.
arXiv Detail & Related papers (2024-01-29T12:56:11Z) - MRI-based classification of IDH mutation and 1p/19q codeletion status of
gliomas using a 2.5D hybrid multi-task convolutional neural network [0.18374319565577152]
Isocitrate dehydrogenase mutation and 1p/19q codeletion status are important prognostic markers for glioma.
Our goal was to develop artificial intelligence-based methods to non-invasively determine these molecular alterations from MRI.
A 2.5D hybrid convolutional neural network was proposed to simultaneously localize the tumor and classify its molecular status.
arXiv Detail & Related papers (2022-10-07T18:46:39Z) - Deep learning-based approach to reveal tumor mutational burden status
from whole slide images across multiple cancer types [41.61294299606317]
Tumor mutational burden (TMB) is a potential genomic biomarker of immunotherapy.
TMB detected through whole exome sequencing lacks clinical penetration in low-resource settings.
In this study, we proposed a multi-scale deep learning framework to address the detection of TMB status from routinely used whole slide images.
arXiv Detail & Related papers (2022-04-07T07:02:32Z) - Predicting Axillary Lymph Node Metastasis in Early Breast Cancer Using
Deep Learning on Primary Tumor Biopsy Slides [17.564585510792227]
We developed a deep learning (DL)-based primary tumor biopsy signature for predicting axillary lymph node (ALN) metastasis.
A total of 1,058 EBC patients with pathologically confirmed ALN status were enrolled from May 2010 to August 2020.
arXiv Detail & Related papers (2021-12-04T02:23:18Z) - Comparison of Machine Learning Classifiers to Predict Patient Survival
and Genetics of GBM: Towards a Standardized Model for Clinical Implementation [44.02622933605018]
Radiomic models have been shown to outperform clinical data for outcome prediction in glioblastoma (GBM)
We aimed to compare nine machine learning classifiers to predict overall survival (OS), isocitrate dehydrogenase (IDH) mutation, O-6-methylguanine-DNA-methyltransferase (MGMT) promoter methylation, epidermal growth factor receptor (EGFR) VII amplification and Ki-67 expression in GBM patients.
xGB obtained maximum accuracy for OS (74.5%), AB for IDH mutation (88%), MGMT methylation (71,7%), Ki-67 expression (86,6%), and EGFR amplification (81,
arXiv Detail & Related papers (2021-02-10T15:10:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.