Predicting sepsis in multi-site, multi-national intensive care cohorts
using deep learning
- URL: http://arxiv.org/abs/2107.05230v1
- Date: Mon, 12 Jul 2021 07:21:58 GMT
- Title: Predicting sepsis in multi-site, multi-national intensive care cohorts
using deep learning
- Authors: Michael Moor, Nicolas Bennet, Drago Plecko, Max Horn, Bastian Rieck,
Nicolai Meinshausen, Peter B\"uhlmann, Karsten Borgwardt
- Abstract summary: We developed and validated a machine learning (ML) system for the prediction of sepsis in the ICU.
Our analysis represents the largest multi-national, multi-centre in-ICU study for sepsis prediction using ML to date.
- Score: 12.63135352255575
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Despite decades of clinical research, sepsis remains a global public health
crisis with high mortality, and morbidity. Currently, when sepsis is detected
and the underlying pathogen is identified, organ damage may have already
progressed to irreversible stages. Effective sepsis management is therefore
highly time-sensitive. By systematically analysing trends in the plethora of
clinical data available in the intensive care unit (ICU), an early prediction
of sepsis could lead to earlier pathogen identification, resistance testing,
and effective antibiotic and supportive treatment, and thereby become a
life-saving measure. Here, we developed and validated a machine learning (ML)
system for the prediction of sepsis in the ICU. Our analysis represents the
largest multi-national, multi-centre in-ICU study for sepsis prediction using
ML to date. Our dataset contains $156,309$ unique ICU admissions, which
represent a refined and harmonised subset of five large ICU databases
originating from three countries. Using the international consensus definition
Sepsis-3, we derived hourly-resolved sepsis label annotations, amounting to
$26,734$ ($17.1\%$) septic stays. We compared our approach, a deep
self-attention model, to several clinical baselines as well as ML baselines and
performed an extensive internal and external validation within and across
databases. On average, our model was able to predict sepsis with an AUROC of
$0.847 \pm 0.050$ (internal out-of sample validation) and $0.761 \pm 0.052$
(external validation). For a harmonised prevalence of $17\%$, at $80\%$ recall
our model detects septic patients with $39\%$ precision 3.7 hours in advance.
Related papers
- SepsisLab: Early Sepsis Prediction with Uncertainty Quantification and Active Sensing [67.8991481023825]
Sepsis is the leading cause of in-hospital mortality in the USA.
Existing predictive models are usually trained on high-quality data with few missing information.
For the potential high-risk patients with low confidence due to limited observations, we propose a robust active sensing algorithm.
arXiv Detail & Related papers (2024-07-24T04:47:36Z) - Multimodal Pretraining of Medical Time Series and Notes [45.89025874396911]
Deep learning models show promise in extracting meaningful patterns, but they require extensive labeled data.
We propose a novel approach employing self-supervised pretraining, focusing on the alignment of clinical measurements and notes.
In downstream tasks, including in-hospital mortality prediction and phenotyping, our model outperforms baselines in settings where only a fraction of the data is labeled.
arXiv Detail & Related papers (2023-12-11T21:53:40Z) - Vital Sign Forecasting for Sepsis Patients in ICUs [5.543372375499915]
This paper uses state-of-the-art deep learning (DL) architectures to introduce a multi-step forecasting system.
We introduce a DL-based vital sign forecasting system that predicts up to 3 hours of future vital signs from 6 hours of past data.
We evaluate the performance of our models using mean squared error (MSE) and dynamic time warping (DTW) metrics.
arXiv Detail & Related papers (2023-11-08T15:47:58Z) - NPRL: Nightly Profile Representation Learning for Early Sepsis Onset
Prediction in ICU Trauma Patients [5.476582906474746]
Sepsis is a syndrome that develops in the body in response to the presence of an infection.
Current machine learning algorithms have demonstrated poor performance and are insufficient for anticipating sepsis onset early.
We propose a novel but realistic prediction framework that predicts sepsis onset each morning using the most recent data collected the previous night.
arXiv Detail & Related papers (2023-04-25T11:27:27Z) - Learning to diagnose cirrhosis from radiological and histological labels
with joint self and weakly-supervised pretraining strategies [62.840338941861134]
We propose to leverage transfer learning from large datasets annotated by radiologists, to predict the histological score available on a small annex dataset.
We compare different pretraining methods, namely weakly-supervised and self-supervised ones, to improve the prediction of the cirrhosis.
This method outperforms the baseline classification of the METAVIR score, reaching an AUC of 0.84 and a balanced accuracy of 0.75.
arXiv Detail & Related papers (2023-02-16T17:06:23Z) - ALRt: An Active Learning Framework for Irregularly Sampled Temporal Data [1.370633147306388]
Sepsis is a deadly condition affecting many patients in the hospital.
We propose the use of Active Learning Recurrent Neural Networks (ALRts) for short temporal horizons to improve the prediction of irregularly sampled temporal events such as sepsis.
We show that an active learning RNN model trained on limited data can form robust sepsis predictions comparable to models using the entire training dataset.
arXiv Detail & Related papers (2022-12-13T04:31:49Z) - Integrating Physiological Time Series and Clinical Notes with
Transformer for Early Prediction of Sepsis [10.791880225915255]
Sepsis is a leading cause of death in the Intensive Care Units (ICU)
We propose a multimodal Transformer model for early sepsis prediction.
We use the physiological time series data and clinical notes for each patient within $36$ hours of ICU admission.
arXiv Detail & Related papers (2022-03-28T03:19:03Z) - Generalizing electrocardiogram delineation: training convolutional
neural networks with synthetic data augmentation [63.51064808536065]
Existing databases for ECG delineation are small, being insufficient in size and in the array of pathological conditions they represent.
This article delves has two main contributions. First, a pseudo-synthetic data generation algorithm was developed, based in probabilistically composing ECG traces given "pools" of fundamental segments, as cropped from the original databases, and a set of rules for their arrangement into coherent synthetic traces.
Second, two novel segmentation-based loss functions have been developed, which attempt at enforcing the prediction of an exact number of independent structures and at producing closer segmentation boundaries by focusing on a reduced number of samples.
arXiv Detail & Related papers (2021-11-25T10:11:41Z) - Improving Early Sepsis Prediction with Multi Modal Learning [5.129463113166068]
Clinical text provides essential information to estimate the severity of sepsis.
We employ state-of-the-art NLP models such as BERT and a highly specialized NLP model in Amazon Comprehend Medical to represent the text.
Our methods significantly outperforms a clinical criteria suggested by experts, qSOFA, as well as the winning model of the PhysioNet Computing in Cardiology Challenge for predicting Sepsis.
arXiv Detail & Related papers (2021-07-23T09:25:31Z) - UNITE: Uncertainty-based Health Risk Prediction Leveraging Multi-sourced
Data [81.00385374948125]
We present UNcertaInTy-based hEalth risk prediction (UNITE) model.
UNITE provides accurate disease risk prediction and uncertainty estimation leveraging multi-sourced health data.
We evaluate UNITE on real-world disease risk prediction tasks: nonalcoholic fatty liver disease (NASH) and Alzheimer's disease (AD)
UNITE achieves up to 0.841 in F1 score for AD detection, up to 0.609 in PR-AUC for NASH detection, and outperforms various state-of-the-art baselines by up to $19%$ over the best baseline.
arXiv Detail & Related papers (2020-10-22T02:28:11Z) - Trajectories, bifurcations and pseudotime in large clinical datasets:
applications to myocardial infarction and diabetes data [94.37521840642141]
We suggest a semi-supervised methodology for the analysis of large clinical datasets, characterized by mixed data types and missing values.
The methodology is based on application of elastic principal graphs which can address simultaneously the tasks of dimensionality reduction, data visualization, clustering, feature selection and quantifying the geodesic distances (pseudotime) in partially ordered sequences of observations.
arXiv Detail & Related papers (2020-07-07T21:04:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.