Methodology for Comparing Machine Learning Algorithms for Survival Analysis
- URL: http://arxiv.org/abs/2510.24473v1
- Date: Tue, 28 Oct 2025 14:42:28 GMT
- Title: Methodology for Comparing Machine Learning Algorithms for Survival Analysis
- Authors: Lucas Buk Cardoso, Simone Aldrey Angelo, Yasmin Pacheco Gil Bonilha, Fernando Maia, Adeylson Guimarães Ribeiro, Maria Paula Curado, Gisele Aparecida Fernandes, Vanderlei Cunha Parro, Flávio Almeida de Magalhães Cipparrone, Alexandre Dias Porto Chiavegatto Filho, Tatiana Natasha Toporcov,
- Abstract summary: Six machine learning models for survival analysis were evaluated.<n>XGB-AFT achieved the best performance (C-Index = 0.7618; IPCW = 0.7532, followed by GBSA and RSF)
- Score: 55.65997641180011
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: This study presents a comparative methodological analysis of six machine learning models for survival analysis (MLSA). Using data from nearly 45,000 colorectal cancer patients in the Hospital-Based Cancer Registries of S\~ao Paulo, we evaluated Random Survival Forest (RSF), Gradient Boosting for Survival Analysis (GBSA), Survival SVM (SSVM), XGBoost-Cox (XGB-Cox), XGBoost-AFT (XGB-AFT), and LightGBM (LGBM), capable of predicting survival considering censored data. Hyperparameter optimization was performed with different samplers, and model performance was assessed using the Concordance Index (C-Index), C-Index IPCW, time-dependent AUC, and Integrated Brier Score (IBS). Survival curves produced by the models were compared with predictions from classification algorithms, and predictor interpretation was conducted using SHAP and permutation importance. XGB-AFT achieved the best performance (C-Index = 0.7618; IPCW = 0.7532), followed by GBSA and RSF. The results highlight the potential and applicability of MLSA to improve survival prediction and support decision making.
Related papers
- Investigating the Impact of Histopathological Foundation Models on Regressive Prediction of Homologous Recombination Deficiency [52.50039435394964]
We systematically evaluate foundation models for regression-based tasks.<n>We extract patch-level features from whole slide images (WSI) using five state-of-the-art foundation models.<n>Models are trained to predict continuous HRD scores based on these extracted features across breast, endometrial, and lung cancer cohorts.
arXiv Detail & Related papers (2026-01-29T14:06:50Z) - Deep Survival Analysis in Multimodal Medical Data: A Parametric and Probabilistic Approach with Competing Risks [47.19194118883552]
We introduce a multimodal deep learning framework for survival analysis capable of modeling both single and competing risks scenarios.<n>We propose SAMVAE (Survival Analysis Multimodal Variational Autoencoder), a novel deep learning architecture designed for survival prediction.
arXiv Detail & Related papers (2025-07-10T14:29:48Z) - Predicting Length of Stay in Neurological ICU Patients Using Classical Machine Learning and Neural Network Models: A Benchmark Study on MIMIC-IV [49.1574468325115]
This study explores multiple ML approaches for predicting LOS in ICU specifically for the patients with neurological diseases based on the MIMIC-IV dataset.<n>The evaluated models include classic ML algorithms (K-Nearest Neighbors, Random Forest, XGBoost and CatBoost) and Neural Networks (LSTM, BERT and Temporal Fusion Transformer)
arXiv Detail & Related papers (2025-05-23T14:06:42Z) - Predicting Survivability of Cancer Patients with Metastatic Patterns Using Explainable AI [2.6182192515316247]
This study leverages machine learning (ML) to predict the survivability of cancer patients with metastatic patterns.<n>MSK-MET dataset includes genomic and clinical data from 25,775 patients across 27 cancer types.<n>XGBoost emerged as the best performer with an area under the curve (AUC) of 0.82.
arXiv Detail & Related papers (2025-04-07T20:48:15Z) - MIL vs. Aggregation: Evaluating Patient-Level Survival Prediction Strategies Using Graph-Based Learning [52.231128973251124]
We compare various strategies for predicting survival at the WSI and patient level.<n>The former treats each WSI as an independent sample, mimicking the strategy adopted in other works.<n>The latter comprises methods to either aggregate the predictions of the several WSIs or automatically identify the most relevant slide.
arXiv Detail & Related papers (2025-03-29T11:14:02Z) - Machine Learning for ALSFRS-R Score Prediction: Making Sense of the Sensor Data [44.99833362998488]
Amyotrophic Lateral Sclerosis (ALS) is a rapidly progressive neurodegenerative disease that presents individuals with limited treatment options.
The present investigation, spearheaded by the iDPP@CLEF 2024 challenge, focuses on utilizing sensor-derived data obtained through an app.
arXiv Detail & Related papers (2024-07-10T19:17:23Z) - Survival modeling using deep learning, machine learning and statistical methods: A comparative analysis for predicting mortality after hospital admission [9.719996519981333]
We conducted a comparative study of several survival analysis methods, including Cox proportional hazards (CoxPH), stepwise CoxPH, elastic net penalized Cox model, and GBM learning.
As a case study, we performed a retrospective analysis of patients admitted through the emergency department of a tertiary hospital from 2017 to 2019.
The results of the C-index indicate that deep learning achieved comparable performance, with DeepSurv producing the best discrimination.
arXiv Detail & Related papers (2024-03-04T10:46:02Z) - Predicting Risk of Dementia with Survival Machine Learning and
Statistical Methods: Results on the English Longitudinal Study of Ageing
Cohort [0.0]
Machine learning models that aim to predict dementia onset usually follow the classification methodology ignoring the time until an event happens.
This study presents an alternative, using survival analysis within the context of machine learning techniques.
Two survival method extensions based on machine learning algorithms of Random Forest and Elastic Net are applied to train, optimise, and validate predictive models.
arXiv Detail & Related papers (2023-06-17T12:15:32Z) - Penalized Deep Partially Linear Cox Models with Application to CT Scans
of Lung Cancer Patients [42.09584755334577]
Lung cancer is a leading cause of cancer mortality globally, highlighting the importance of understanding its mortality risks to design effective therapies.
The National Lung Screening Trial (NLST) employed computed tomography texture analysis to quantify the mortality risks of lung cancer patients.
We propose a novel Penalized Deep Partially Linear Cox Model (Penalized DPLC), which incorporates the SCAD penalty to select important texture features and employs a deep neural network to estimate the nonparametric component of the model.
arXiv Detail & Related papers (2023-03-09T15:38:16Z) - Interpretability methods of machine learning algorithms with
applications in breast cancer diagnosis [1.1470070927586016]
We used interpretability techniques, such as the Global Surrogate (GS) method, the Individual Expectation (ICE) plots and the Conditional Shapley values (SV)
The best performance for breast cancer diagnosis was achieved by the proposed ENN (96.6% accuracy and 0.96 area under the ROC curve)
arXiv Detail & Related papers (2022-02-04T13:41:30Z) - Analysis of MRI Biomarkers for Brain Cancer Survival Prediction [0.3093890460224435]
Prediction of Overall Survival (OS) of brain cancer patients from multi-modal MRI is a challenging field of research.
Most of the existing literature on survival prediction is based on Radiomic features.
Age was found to be the most important biological predictor.
arXiv Detail & Related papers (2021-09-03T05:35:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.