LATTE: Label-efficient Incident Phenotyping from Longitudinal Electronic
Health Records
- URL: http://arxiv.org/abs/2305.11407v1
- Date: Fri, 19 May 2023 03:28:51 GMT
- Title: LATTE: Label-efficient Incident Phenotyping from Longitudinal Electronic
Health Records
- Authors: Jun Wen, Jue Hou, Clara-Lea Bonzel, Yihan Zhao, Victor M. Castro,
Vivian S. Gainer, Dana Weisenfeld, Tianrun Cai, Yuk-Lam Ho, Vidul A.
Panickan, Lauren Costa, Chuan Hong, J. Michael Gaziano, Katherine P. Liao,
Junwei Lu, Kelly Cho, Tianxi Cai
- Abstract summary: We propose a LAbel-efficienT incidenT phEnotyping algorithm to accurately annotate the timing of clinical events from longitudinal EHR data.
LATTE is evaluated on three analyses: the onset of type-2 diabetes, heart failure, and the onset and relapses of multiple sclerosis.
- Score: 11.408950540503112
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Electronic health record (EHR) data are increasingly used to support
real-world evidence (RWE) studies. Yet its ability to generate reliable RWE is
limited by the lack of readily available precise information on the timing of
clinical events such as the onset time of heart failure. We propose a
LAbel-efficienT incidenT phEnotyping (LATTE) algorithm to accurately annotate
the timing of clinical events from longitudinal EHR data. By leveraging the
pre-trained semantic embedding vectors from large-scale EHR data as prior
knowledge, LATTE selects predictive EHR features in a concept re-weighting
module by mining their relationship to the target event and compresses their
information into longitudinal visit embeddings through a visit attention
learning network. LATTE employs a recurrent neural network to capture the
sequential dependency between the target event and visit embeddings
before/after it. To improve label efficiency, LATTE constructs highly
informative longitudinal silver-standard labels from large-scale unlabeled
patients to perform unsupervised pre-training and semi-supervised joint
training. Finally, LATTE enhances cross-site portability via contrastive
representation learning. LATTE is evaluated on three analyses: the onset of
type-2 diabetes, heart failure, and the onset and relapses of multiple
sclerosis. We use various evaluation metrics present in the literature
including the $ABC_{gain}$, the proportion of reduction in the area between the
observed event indicator and the predicted cumulative incidences in reference
to the prediction per incident prevalence. LATTE consistently achieves
substantial improvement over benchmark methods such as SAMGEP and RETAIN in all
settings.
Related papers
- REST: Efficient and Accelerated EEG Seizure Analysis through Residual State Updates [54.96885726053036]
This paper introduces a novel graph-based residual state update mechanism (REST) for real-time EEG signal analysis.
By leveraging a combination of graph neural networks and recurrent structures, REST efficiently captures both non-Euclidean geometry and temporal dependencies within EEG data.
Our model demonstrates high accuracy in both seizure detection and classification tasks.
arXiv Detail & Related papers (2024-06-03T16:30:19Z) - TA-RNN: an Attention-based Time-aware Recurrent Neural Network Architecture for Electronic Health Records [0.0]
Deep learning methods such as Recurrent Neural Networks (RNN) have been utilized to analyze EHR to model disease progression and predict diagnosis.
In this study, we propose two interpretable DL architectures based on RNN, namely Time-Aware RNN (TA-RNN) and TA-RNN-Autoencoder (TA-RNN-AE)
To mitigate the impact of irregular time intervals, we propose incorporating time embedding of the elapsed times between visits.
arXiv Detail & Related papers (2024-01-26T07:34:53Z) - Evaluating the Fairness of the MIMIC-IV Dataset and a Baseline
Algorithm: Application to the ICU Length of Stay Prediction [65.268245109828]
This paper uses the MIMIC-IV dataset to examine the fairness and bias in an XGBoost binary classification model predicting the ICU length of stay.
The research reveals class imbalances in the dataset across demographic attributes and employs data preprocessing and feature extraction.
The paper concludes with recommendations for fairness-aware machine learning techniques for mitigating biases and the need for collaborative efforts among healthcare professionals and data scientists.
arXiv Detail & Related papers (2023-12-31T16:01:48Z) - Vital Sign Forecasting for Sepsis Patients in ICUs [5.543372375499915]
This paper uses state-of-the-art deep learning (DL) architectures to introduce a multi-step forecasting system.
We introduce a DL-based vital sign forecasting system that predicts up to 3 hours of future vital signs from 6 hours of past data.
We evaluate the performance of our models using mean squared error (MSE) and dynamic time warping (DTW) metrics.
arXiv Detail & Related papers (2023-11-08T15:47:58Z) - MELEP: A Novel Predictive Measure of Transferability in Multi-Label ECG Diagnosis [1.3654846342364306]
We introduce MELEP, a measure designed to estimate the effectiveness of knowledge transfer from a pre-trained model to a downstream ECG diagnosis task.
Our experiments show that MELEP can predict the performance of pre-trained convolutional and recurrent deep neural networks, on small and imbalanced ECG data.
arXiv Detail & Related papers (2023-10-27T14:57:10Z) - Cumulative Stay-time Representation for Electronic Health Records in
Medical Event Time Prediction [8.261597797345342]
We propose a novel data representation for EHR called cumulative stay-time representation (CTR)
CTR directly models such cumulative health conditions.
We derive a trainable construction of CTR based on neural networks that has the flexibility to fit the target data.
arXiv Detail & Related papers (2022-04-28T12:34:41Z) - CEHR-BERT: Incorporating temporal information from structured EHR data
to improve prediction tasks [0.0]
We develop a new BERT adaptation, CEHR-BERT, to incorporate temporal information using a hybrid approach.
CEHR-BERT was trained on a subset of Columbia University Irving Medical Center-York Presbyterian Hospital's clinical data.
arXiv Detail & Related papers (2021-11-10T16:53:32Z) - BiteNet: Bidirectional Temporal Encoder Network to Predict Medical
Outcomes [53.163089893876645]
We propose a novel self-attention mechanism that captures the contextual dependency and temporal relationships within a patient's healthcare journey.
An end-to-end bidirectional temporal encoder network (BiteNet) then learns representations of the patient's journeys.
We have evaluated the effectiveness of our methods on two supervised prediction and two unsupervised clustering tasks with a real-world EHR dataset.
arXiv Detail & Related papers (2020-09-24T00:42:36Z) - Prediction of the onset of cardiovascular diseases from electronic
health records using multi-task gated recurrent units [51.14334174570822]
We propose a multi-task recurrent neural network with attention mechanism for predicting cardiovascular events from electronic health records.
The proposed approach is compared to a standard clinical risk predictor (QRISK) and machine learning alternatives using 5-year data from a NHS Foundation Trust.
arXiv Detail & Related papers (2020-07-16T17:43:13Z) - AutoHR: A Strong End-to-end Baseline for Remote Heart Rate Measurement
with Neural Searching [76.4844593082362]
We investigate the reason why existing end-to-end networks perform poorly in challenging conditions and establish a strong baseline for remote HR measurement with architecture search (NAS)
Comprehensive experiments are performed on three benchmark datasets on both intra-temporal and cross-dataset testing.
arXiv Detail & Related papers (2020-04-26T05:43:21Z) - Short Term Blood Glucose Prediction based on Continuous Glucose
Monitoring Data [53.01543207478818]
This study explores the use of Continuous Glucose Monitoring (CGM) data as input for digital decision support tools.
We investigate how Recurrent Neural Networks (RNNs) can be used for Short Term Blood Glucose (STBG) prediction.
arXiv Detail & Related papers (2020-02-06T16:39:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.