Clinical Risk Prediction with Temporal Probabilistic Asymmetric
Multi-Task Learning
- URL: http://arxiv.org/abs/2006.12777v4
- Date: Thu, 18 Feb 2021 15:11:26 GMT
- Title: Clinical Risk Prediction with Temporal Probabilistic Asymmetric
Multi-Task Learning
- Authors: A. Tuan Nguyen, Hyewon Jeong, Eunho Yang, Sung Ju Hwang
- Abstract summary: Multi-task learning methods should be used with caution for safety-critical applications, such as clinical risk prediction.
Existing asymmetric multi-task learning methods tackle this negative transfer problem by performing knowledge transfer from tasks with low loss to tasks with high loss.
We propose a novel temporal asymmetric multi-task learning model that performs knowledge transfer from certain tasks/timesteps to relevant uncertain tasks, based on feature-level uncertainty.
- Score: 80.66108902283388
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Although recent multi-task learning methods have shown to be effective in
improving the generalization of deep neural networks, they should be used with
caution for safety-critical applications, such as clinical risk prediction.
This is because even if they achieve improved task-average performance, they
may still yield degraded performance on individual tasks, which may be critical
(e.g., prediction of mortality risk). Existing asymmetric multi-task learning
methods tackle this negative transfer problem by performing knowledge transfer
from tasks with low loss to tasks with high loss. However, using loss as a
measure of reliability is risky since it could be a result of overfitting. In
the case of time-series prediction tasks, knowledge learned for one task (e.g.,
predicting the sepsis onset) at a specific timestep may be useful for learning
another task (e.g., prediction of mortality) at a later timestep, but lack of
loss at each timestep makes it difficult to measure the reliability at each
timestep. To capture such dynamically changing asymmetric relationships between
tasks in time-series data, we propose a novel temporal asymmetric multi-task
learning model that performs knowledge transfer from certain tasks/timesteps to
relevant uncertain tasks, based on feature-level uncertainty. We validate our
model on multiple clinical risk prediction tasks against various deep learning
models for time-series prediction, which our model significantly outperforms,
without any sign of negative transfer. Further qualitative analysis of learned
knowledge graphs by clinicians shows that they are helpful in analyzing the
predictions of the model. Our final code is available at
https://github.com/anhtuan5696/TPAMTL.
Related papers
- Towards Robust Continual Learning with Bayesian Adaptive Moment Regularization [51.34904967046097]
Continual learning seeks to overcome the challenge of catastrophic forgetting, where a model forgets previously learnt information.
We introduce a novel prior-based method that better constrains parameter growth, reducing catastrophic forgetting.
Results show that BAdam achieves state-of-the-art performance for prior-based methods on challenging single-headed class-incremental experiments.
arXiv Detail & Related papers (2023-09-15T17:10:51Z) - Time Associated Meta Learning for Clinical Prediction [78.99422473394029]
We propose a novel time associated meta learning (TAML) method to make effective predictions at multiple future time points.
To address the sparsity problem after task splitting, TAML employs a temporal information sharing strategy to augment the number of positive samples.
We demonstrate the effectiveness of TAML on multiple clinical datasets, where it consistently outperforms a range of strong baselines.
arXiv Detail & Related papers (2023-03-05T03:54:54Z) - An Evolutionary Approach to Dynamic Introduction of Tasks in Large-scale
Multitask Learning Systems [4.675744559395732]
Multitask learning assumes that models capable of learning from multiple tasks can achieve better quality and efficiency via knowledge transfer.
State of the art ML models rely on high customization for each task and leverage size and data scale rather than scaling the number of tasks.
We propose an evolutionary method that can generate a large scale multitask model and can support the dynamic and continuous addition of new tasks.
arXiv Detail & Related papers (2022-05-25T13:10:47Z) - Masked prediction tasks: a parameter identifiability view [49.533046139235466]
We focus on the widely used self-supervised learning method of predicting masked tokens.
We show that there is a rich landscape of possibilities, out of which some prediction tasks yield identifiability, while others do not.
arXiv Detail & Related papers (2022-02-18T17:09:32Z) - Uncertainty Modeling for Out-of-Distribution Generalization [56.957731893992495]
We argue that the feature statistics can be properly manipulated to improve the generalization ability of deep learning models.
Common methods often consider the feature statistics as deterministic values measured from the learned features.
We improve the network generalization ability by modeling the uncertainty of domain shifts with synthesized feature statistics during training.
arXiv Detail & Related papers (2022-02-08T16:09:12Z) - Learning Curves for Sequential Training of Neural Networks:
Self-Knowledge Transfer and Forgetting [9.734033555407406]
We consider neural networks in the neural tangent kernel regime that continually learn target functions from task to task.
We investigate a variant of continual learning where the model learns the same target function in multiple tasks.
Even for the same target, the trained model shows some transfer and forgetting depending on the sample size of each task.
arXiv Detail & Related papers (2021-12-03T00:25:01Z) - Patient Outcome and Zero-shot Diagnosis Prediction with
Hypernetwork-guided Multitask Learning [3.392432412743858]
Multitask deep learning has been applied to patient outcome prediction from text.
Diagnose prediction among the multiple tasks has the generalizability issue due to rare diseases or unseen diagnoses.
We propose a hypernetwork-based approach that generates task-conditioned parameters and coefficients of multitask prediction heads.
arXiv Detail & Related papers (2021-09-07T12:52:26Z) - Data-driven method for real-time prediction and uncertainty
quantification of fatigue failure under stochastic loading using artificial
neural networks and Gaussian process regression [0.0]
Methods for early failure prediction are essential for engineering, military, and civil applications.
Uncertainty (UQ) is of major importance for real-time decision-making purposes.
arXiv Detail & Related papers (2021-03-11T13:57:08Z) - Ambiguity in Sequential Data: Predicting Uncertain Futures with
Recurrent Models [110.82452096672182]
We propose an extension of the Multiple Hypothesis Prediction (MHP) model to handle ambiguous predictions with sequential data.
We also introduce a novel metric for ambiguous problems, which is better suited to account for uncertainties.
arXiv Detail & Related papers (2020-03-10T09:15:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.