Explainable Censored Learning: Finding Critical Features with Long Term
Prognostic Values for Survival Prediction
- URL: http://arxiv.org/abs/2209.15450v1
- Date: Fri, 30 Sep 2022 12:56:29 GMT
- Title: Explainable Censored Learning: Finding Critical Features with Long Term
Prognostic Values for Survival Prediction
- Authors: Xinxing Wu, Chong Peng, Richard Charnigo, Qiang Cheng
- Abstract summary: We introduce a novel, easily deployable approach, called EXplainable CEnsored Learning (EXCEL), to iteratively exploit critical variables.
We show that EXCEL can effectively identify critical features and achieve performance on par with or better than the original models.
- Score: 28.943631598055926
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Interpreting critical variables involved in complex biological processes
related to survival time can help understand prediction from survival models,
evaluate treatment efficacy, and develop new therapies for patients. Currently,
the predictive results of deep learning (DL)-based models are better than or as
good as standard survival methods, they are often disregarded because of their
lack of transparency and little interpretability, which is crucial to their
adoption in clinical applications. In this paper, we introduce a novel, easily
deployable approach, called EXplainable CEnsored Learning (EXCEL), to
iteratively exploit critical variables and simultaneously implement (DL) model
training based on these variables. First, on a toy dataset, we illustrate the
principle of EXCEL; then, we mathematically analyze our proposed method, and we
derive and prove tight generalization error bounds; next, on two semi-synthetic
datasets, we show that EXCEL has good anti-noise ability and stability;
finally, we apply EXCEL to a variety of real-world survival datasets including
clinical data and genetic data, demonstrating that EXCEL can effectively
identify critical features and achieve performance on par with or better than
the original models. It is worth pointing out that EXCEL is flexibly deployed
in existing or emerging models for explainable survival data in the presence of
right censoring.
Related papers
- Machine Learning for ALSFRS-R Score Prediction: Making Sense of the Sensor Data [44.99833362998488]
Amyotrophic Lateral Sclerosis (ALS) is a rapidly progressive neurodegenerative disease that presents individuals with limited treatment options.
The present investigation, spearheaded by the iDPP@CLEF 2024 challenge, focuses on utilizing sensor-derived data obtained through an app.
arXiv Detail & Related papers (2024-07-10T19:17:23Z) - CEL: A Continual Learning Model for Disease Outbreak Prediction by
Leveraging Domain Adaptation via Elastic Weight Consolidation [4.693707128262634]
This study introduces a novel CEL model for continual learning by leveraging domain adaptation via Elastic Weight Consolidation (EWC)
CEL's robustness and reliability are underscored by its minimal 65% forgetting rate and 18% higher memory stability compared to existing benchmark studies.
arXiv Detail & Related papers (2024-01-17T03:26:04Z) - Meta Transfer of Self-Supervised Knowledge: Foundation Model in Action
for Post-Traumatic Epilepsy Prediction [0.6291443816903801]
We introduce a novel training strategy for our foundation model.
We demonstrate that the proposed strategy significantly improves task performance on small-scale clinical datasets.
Results further demonstrated the enhanced generalizability of our foundation model.
arXiv Detail & Related papers (2023-12-21T07:42:49Z) - Learning Objective-Specific Active Learning Strategies with Attentive
Neural Processes [72.75421975804132]
Learning Active Learning (LAL) suggests to learn the active learning strategy itself, allowing it to adapt to the given setting.
We propose a novel LAL method for classification that exploits symmetry and independence properties of the active learning problem.
Our approach is based on learning from a myopic oracle, which gives our model the ability to adapt to non-standard objectives.
arXiv Detail & Related papers (2023-09-11T14:16:37Z) - Robust Learning with Progressive Data Expansion Against Spurious
Correlation [65.83104529677234]
We study the learning process of a two-layer nonlinear convolutional neural network in the presence of spurious features.
Our analysis suggests that imbalanced data groups and easily learnable spurious features can lead to the dominance of spurious features during the learning process.
We propose a new training algorithm called PDE that efficiently enhances the model's robustness for a better worst-group performance.
arXiv Detail & Related papers (2023-06-08T05:44:06Z) - A Statistical Learning Take on the Concordance Index for Survival
Analysis [0.29005223064604074]
We provide C-index Fisher-consistency results and excess risk bounds for several commonly used cost functions in survival analysis.
We also study the general case where no model assumption is made and present a new, off-the-shelf method that is shown to be consistent with the C-index.
arXiv Detail & Related papers (2023-02-23T14:33:54Z) - HyperImpute: Generalized Iterative Imputation with Automatic Model
Selection [77.86861638371926]
We propose a generalized iterative imputation framework for adaptively and automatically configuring column-wise models.
We provide a concrete implementation with out-of-the-box learners, simulators, and interfaces.
arXiv Detail & Related papers (2022-06-15T19:10:35Z) - LifeLonger: A Benchmark for Continual Disease Classification [59.13735398630546]
We introduce LifeLonger, a benchmark for continual disease classification on the MedMNIST collection.
Task and class incremental learning of diseases address the issue of classifying new samples without re-training the models from scratch.
Cross-domain incremental learning addresses the issue of dealing with datasets originating from different institutions while retaining the previously obtained knowledge.
arXiv Detail & Related papers (2022-04-12T12:25:05Z) - Bootstrapping Your Own Positive Sample: Contrastive Learning With
Electronic Health Record Data [62.29031007761901]
This paper proposes a novel contrastive regularized clinical classification model.
We introduce two unique positive sampling strategies specifically tailored for EHR data.
Our framework yields highly competitive experimental results in predicting the mortality risk on real-world COVID-19 EHR data.
arXiv Detail & Related papers (2021-04-07T06:02:04Z) - Uncovering the structure of clinical EEG signals with self-supervised
learning [64.4754948595556]
Supervised learning paradigms are often limited by the amount of labeled data that is available.
This phenomenon is particularly problematic in clinically-relevant data, such as electroencephalography (EEG)
By extracting information from unlabeled data, it might be possible to reach competitive performance with deep neural networks.
arXiv Detail & Related papers (2020-07-31T14:34:47Z) - Variational Learning of Individual Survival Distributions [21.40142425105635]
We introduce a variational time-to-event prediction model, named Variational Survival Inference (VSI), which builds upon recent advances in distribution learning techniques and deep neural networks.
To validate effectiveness of our approach, an extensive set of experiments on both synthetic and real-world datasets is carried out, showing improved performance relative to competing solutions.
arXiv Detail & Related papers (2020-03-09T22:09:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.