Predicting cardiovascular risk from national administrative databases
using a combined survival analysis and deep learning approach
- URL: http://arxiv.org/abs/2011.14032v1
- Date: Sat, 28 Nov 2020 00:10:25 GMT
- Title: Predicting cardiovascular risk from national administrative databases
using a combined survival analysis and deep learning approach
- Authors: Sebastiano Barbieri, Suneela Mehta, Billy Wu, Chrianna Bharat, Katrina
Poppe, Louisa Jorm, Rod Jackson
- Abstract summary: This study compared the performance of deep learning extensions of survival analysis models with traditional Cox proportional hazards (CPH) models.
Deep learning models significantly outperformed CPH models on the basis of proportion of explained time-to-event occurrence.
Deep learning models can be applied to large health administrative databases to derive interpretable CVD risk prediction equations.
- Score: 0.3463527836552467
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: AIMS. This study compared the performance of deep learning extensions of
survival analysis models with traditional Cox proportional hazards (CPH) models
for deriving cardiovascular disease (CVD) risk prediction equations in national
health administrative datasets. METHODS. Using individual person linkage of
multiple administrative datasets, we constructed a cohort of all New Zealand
residents aged 30-74 years who interacted with publicly funded health services
during 2012, and identified hospitalisations and deaths from CVD over five
years of follow-up. After excluding people with prior CVD or heart failure,
sex-specific deep learning and CPH models were developed to estimate the risk
of fatal or non-fatal CVD events within five years. The proportion of explained
time-to-event occurrence, calibration, and discrimination were compared between
models across the whole study population and in specific risk groups. FINDINGS.
First CVD events occurred in 61,927 of 2,164,872 people. Among diagnoses and
procedures, the largest 'local' hazard ratios were associated by the deep
learning models with tobacco use in women (2.04, 95%CI: 1.99-2.10) and with
chronic obstructive pulmonary disease with acute lower respiratory infection in
men (1.56, 95%CI: 1.50-1.62). Other identified predictors (e.g. hypertension,
chest pain, diabetes) aligned with current knowledge about CVD risk predictors.
The deep learning models significantly outperformed the CPH models on the basis
of proportion of explained time-to-event occurrence (Royston and Sauerbrei's
R-squared: 0.468 vs. 0.425 in women and 0.383 vs. 0.348 in men), calibration,
and discrimination (all p<0.0001). INTERPRETATION. Deep learning extensions of
survival analysis models can be applied to large health administrative
databases to derive interpretable CVD risk prediction equations that are more
accurate than traditional CPH models.
Related papers
- Deep State-Space Generative Model For Correlated Time-to-Event Predictions [54.3637600983898]
We propose a deep latent state-space generative model to capture the interactions among different types of correlated clinical events.
Our method also uncovers meaningful insights about the latent correlations among mortality and different types of organ failures.
arXiv Detail & Related papers (2024-07-28T02:42:36Z) - Using Pre-training and Interaction Modeling for ancestry-specific disease prediction in UK Biobank [69.90493129893112]
Recent genome-wide association studies (GWAS) have uncovered the genetic basis of complex traits, but show an under-representation of non-European descent individuals.
Here, we assess whether we can improve disease prediction across diverse ancestries using multiomic data.
arXiv Detail & Related papers (2024-04-26T16:39:50Z) - Interpretable Survival Analysis for Heart Failure Risk Prediction [50.64739292687567]
We propose a novel survival analysis pipeline that is both interpretable and competitive with state-of-the-art survival models.
Our pipeline achieves state-of-the-art performance and provides interesting and novel insights about risk factors for heart failure.
arXiv Detail & Related papers (2023-10-24T02:56:05Z) - At-Admission Prediction of Mortality and Pulmonary Embolism in COVID-19
Patients Using Statistical and Machine Learning Methods: An International
Cohort Study [0.0]
It is highly important to develop predictive tools for pulmonary embolism in COVID-19 patients.
We propose a cost-sensitive gradient-boosted machine learning model that predicts occurrence of PE and death at admission.
arXiv Detail & Related papers (2023-05-18T14:55:27Z) - A hybrid CNN-RNN approach for survival analysis in a Lung Cancer
Screening study [17.26942882598847]
We present a hybrid CNN-RNN approach to investigate long-term survival of subjects in a lung cancer screening study.
The models were trained on subjects who underwent cardiovascular and respiratory deaths.
The Cox neural network can achieve an IPCW C-index of 0.75 on the internal dataset and 0.69 on an external dataset.
arXiv Detail & Related papers (2023-03-19T23:00:41Z) - Predicting adverse outcomes following catheter ablation treatment for
atrial fibrillation [2.202746751854349]
We developed prognostic survival models for predicting adverse outcomes after catheter ablation treatment for AF.
Traditional and deep survival models were trained to predict major bleeding events and a composite of heart failure, stroke, cardiac arrest, and death.
arXiv Detail & Related papers (2022-11-22T02:55:51Z) - SurvLatent ODE : A Neural ODE based time-to-event model with competing
risks for longitudinal data improves cancer-associated Deep Vein Thrombosis
(DVT) prediction [68.8204255655161]
We propose a generative time-to-event model, SurvLatent ODE, which parameterizes a latent representation under irregularly sampled data.
Our model then utilizes the latent representation to flexibly estimate survival times for multiple competing events without specifying shapes of event-specific hazard function.
SurvLatent ODE outperforms the current clinical standard Khorana Risk scores for stratifying DVT risk groups.
arXiv Detail & Related papers (2022-04-20T17:28:08Z) - Development of an accessible 10-year Digital CArdioVAscular (DiCAVA)
risk assessment: a UK Biobank study [0.46180371154032895]
The aim was to develop a new risk model (DiCAVA) using statistical and machine learning techniques.
A secondary goal was to identify new patient-centric variables that could be incorporated into CVD risk assessments.
arXiv Detail & Related papers (2021-04-20T16:01:50Z) - An explainable Transformer-based deep learning model for the prediction
of incident heart failure [22.513476932615845]
We developed a novel Transformer deep-learning model for prediction of incident heart failure involving 100,071 patients.
The model achieved 0.93 and 0.93 area under the receiver operator curve and 0.69 and 0.70 area under the precision-recall curve.
The importance of contextualised medical information was revealed in sensitivity analyses.
arXiv Detail & Related papers (2021-01-27T12:45:15Z) - UNITE: Uncertainty-based Health Risk Prediction Leveraging Multi-sourced
Data [81.00385374948125]
We present UNcertaInTy-based hEalth risk prediction (UNITE) model.
UNITE provides accurate disease risk prediction and uncertainty estimation leveraging multi-sourced health data.
We evaluate UNITE on real-world disease risk prediction tasks: nonalcoholic fatty liver disease (NASH) and Alzheimer's disease (AD)
UNITE achieves up to 0.841 in F1 score for AD detection, up to 0.609 in PR-AUC for NASH detection, and outperforms various state-of-the-art baselines by up to $19%$ over the best baseline.
arXiv Detail & Related papers (2020-10-22T02:28:11Z) - A General Framework for Survival Analysis and Multi-State Modelling [70.31153478610229]
We use neural ordinary differential equations as a flexible and general method for estimating multi-state survival models.
We show that our model exhibits state-of-the-art performance on popular survival data sets and demonstrate its efficacy in a multi-state setting.
arXiv Detail & Related papers (2020-06-08T19:24:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.