Learning Clinical Concepts for Predicting Risk of Progression to Severe
COVID-19
- URL: http://arxiv.org/abs/2208.13126v1
- Date: Sun, 28 Aug 2022 02:59:35 GMT
- Title: Learning Clinical Concepts for Predicting Risk of Progression to Severe
COVID-19
- Authors: Helen Zhou, Cheng Cheng, Kelly J. Shields, Gursimran Kochhar, Tariq
Cheema, Zachary C. Lipton, Jeremy C. Weiss
- Abstract summary: Using data from a major healthcare provider, we develop survival models predicting severe COVID-19 progression.
We develop two sets of high-performance risk scores: (i) an unconstrained model built from all available features; and (ii) a pipeline that learns a small set of clinical concepts before training a risk predictor.
- Score: 17.781861866125023
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: With COVID-19 now pervasive, identification of high-risk individuals is
crucial. Using data from a major healthcare provider in Southwestern
Pennsylvania, we develop survival models predicting severe COVID-19
progression. In this endeavor, we face a tradeoff between more accurate models
relying on many features and less accurate models relying on a few features
aligned with clinician intuition. Complicating matters, many EHR features tend
to be under-coded, degrading the accuracy of smaller models. In this study, we
develop two sets of high-performance risk scores: (i) an unconstrained model
built from all available features; and (ii) a pipeline that learns a small set
of clinical concepts before training a risk predictor. Learned concepts boost
performance over the corresponding features (C-index 0.858 vs. 0.844) and
demonstrate improvements over (i) when evaluated out-of-sample (subsequent time
periods). Our models outperform previous works (C-index 0.844-0.872 vs.
0.598-0.810).
Related papers
- Symptom-based Machine Learning Models for the Early Detection of
COVID-19: A Narrative Review [0.0]
Machine learning models can analyze large datasets, incorporating patient-reported symptoms, clinical data, and medical imaging.
In this paper, we provide an overview of the landscape of symptoms-only machine learning models for predicting COVID-19, including their performance and limitations.
The review will also examine the performance of symptom-based models when compared to image-based models.
arXiv Detail & Related papers (2023-12-08T01:41:42Z) - Performance or Trust? Why Not Both. Deep AUC Maximization with
Self-Supervised Learning for COVID-19 Chest X-ray Classifications [72.52228843498193]
In training deep learning models, a compromise often must be made between performance and trust.
In this work, we integrate a new surrogate loss with self-supervised learning for computer-aided screening of COVID-19 patients.
arXiv Detail & Related papers (2021-12-14T21:16:52Z) - On the explainability of hospitalization prediction on a large COVID-19
patient dataset [45.82374977939355]
We develop various AI models to predict hospitalization on a large (over 110$k$) cohort of COVID-19 positive-tested US patients.
Despite high data unbalance, the models reach average precision 0.96-0.98 (0.75-0.85), recall 0.96-0.98 (0.74-0.85), and $F_score 0.97-0.98 (0.79-0.83) on the non-hospitalized (or hospitalized) class.
arXiv Detail & Related papers (2021-10-28T10:23:38Z) - The Report on China-Spain Joint Clinical Testing for Rapid COVID-19 Risk
Screening by Eye-region Manifestations [59.48245489413308]
We developed and tested a COVID-19 rapid prescreening model using the eye-region images captured in China and Spain with cellphone cameras.
The performance was measured using area under receiver-operating-characteristic curve (AUC), sensitivity, specificity, accuracy, and F1.
arXiv Detail & Related papers (2021-09-18T02:28:01Z) - Bootstrapping Your Own Positive Sample: Contrastive Learning With
Electronic Health Record Data [62.29031007761901]
This paper proposes a novel contrastive regularized clinical classification model.
We introduce two unique positive sampling strategies specifically tailored for EHR data.
Our framework yields highly competitive experimental results in predicting the mortality risk on real-world COVID-19 EHR data.
arXiv Detail & Related papers (2021-04-07T06:02:04Z) - COVID-19 Prognosis via Self-Supervised Representation Learning and
Multi-Image Prediction [32.91440827855392]
We consider the task of predicting two types of patient deterioration based on chest X-rays.
Due to the relative scarcity of COVID-19 patient data, existing solutions leverage supervised pretraining on related non-COVID images.
In this paper, we use self-supervised learning based on the momentum contrast (MoCo) method in the pretraining phase to learn more general image representations to use for downstream tasks.
arXiv Detail & Related papers (2021-01-13T07:03:17Z) - End-2-End COVID-19 Detection from Breath & Cough Audio [68.41471917650571]
We demonstrate the first attempt to diagnose COVID-19 using end-to-end deep learning from a crowd-sourced dataset of audio samples.
We introduce a novel modelling strategy using a custom deep neural network to diagnose COVID-19 from a joint breath and cough representation.
arXiv Detail & Related papers (2021-01-07T01:13:00Z) - UNITE: Uncertainty-based Health Risk Prediction Leveraging Multi-sourced
Data [81.00385374948125]
We present UNcertaInTy-based hEalth risk prediction (UNITE) model.
UNITE provides accurate disease risk prediction and uncertainty estimation leveraging multi-sourced health data.
We evaluate UNITE on real-world disease risk prediction tasks: nonalcoholic fatty liver disease (NASH) and Alzheimer's disease (AD)
UNITE achieves up to 0.841 in F1 score for AD detection, up to 0.609 in PR-AUC for NASH detection, and outperforms various state-of-the-art baselines by up to $19%$ over the best baseline.
arXiv Detail & Related papers (2020-10-22T02:28:11Z) - Short-term forecasting COVID-19 cumulative confirmed cases: Perspectives
for Brazil [3.0711362702464675]
The new Coronavirus (COVID-19) is an emerging disease responsible for infecting millions of people since the first notification until nowadays.
In this paper, autoregressive integrated moving average (ARIMA), cubist (CUBIST), random forest (RF), ridge regression (RIDGE), and stacking-ensemble learning are evaluated.
The developed models can generate accurate forecasting, achieving errors in a range of 0.87% - 3.51%, 1.02% - 5.63%, and 0.95% - 6.90% in one, three, and six-days-ahead, respectively.
arXiv Detail & Related papers (2020-07-21T17:58:58Z) - A Physiology-Driven Computational Model for Post-Cardiac Arrest Outcome
Prediction [0.7930054475711718]
The aim of this study was to build computational models to predict post-CA outcome.
We hypothesized that model performance could be enhanced by integrating physiological time series (PTS) data and by training machine learning (ML) classifiers.
Results attest to the effectiveness of ML models for post-CA predictive modeling and suggest that PTS recorded in very early phase after resuscitation encode short-term outcome probabilities.
arXiv Detail & Related papers (2020-02-09T07:53:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.