Deep Representation Learning of Patient Data from Electronic Health
Records (EHR): A Systematic Review
- URL: http://arxiv.org/abs/2010.02809v2
- Date: Mon, 16 Nov 2020 16:24:43 GMT
- Title: Deep Representation Learning of Patient Data from Electronic Health
Records (EHR): A Systematic Review
- Authors: Yuqi Si, Jingcheng Du, Zhao Li, Xiaoqian Jiang, Timothy Miller, Fei
Wang, W. Jim Zheng, Kirk Roberts
- Abstract summary: Patient representation learning refers to learning a dense mathematical representation of a patient that encodes meaningful information from Electronic Health Records.
This study presents a systematic review of this field and provides both qualitative and quantitative analyses from a methodological perspective.
- Score: 20.621261286239967
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Patient representation learning refers to learning a dense mathematical
representation of a patient that encodes meaningful information from Electronic
Health Records (EHRs). This is generally performed using advanced deep learning
methods. This study presents a systematic review of this field and provides
both qualitative and quantitative analyses from a methodological perspective.
We identified studies developing patient representations from EHRs with deep
learning methods from MEDLINE, EMBASE, Scopus, the Association for Computing
Machinery (ACM) Digital Library, and Institute of Electrical and Electronics
Engineers (IEEE) Xplore Digital Library. After screening 363 articles, 49
papers were included for a comprehensive data collection. We noticed a typical
workflow starting with feeding raw data, applying deep learning models, and
ending with clinical outcome predictions as evaluations of the learned
representations. Specifically, learning representations from structured EHR
data was dominant (37 out of 49 studies). Recurrent Neural Networks were widely
applied as the deep learning architecture (LSTM: 13 studies, GRU: 11 studies).
Disease prediction was the most common application and evaluation (31 studies).
Benchmark datasets were mostly unavailable (28 studies) due to privacy concerns
of EHR data, and code availability was assured in 20 studies. We show the
importance and feasibility of learning comprehensive representations of patient
EHR data through a systematic review. Advances in patient representation
learning techniques will be essential for powering patient-level EHR analyses.
Future work will still be devoted to leveraging the richness and potential of
available EHR data. Knowledge distillation and advanced learning techniques
will be exploited to assist the capability of learning patient representation
further.
Related papers
- Recent Advances in Predictive Modeling with Electronic Health Records [71.19967863320647]
utilizing EHR data for predictive modeling presents several challenges due to its unique characteristics.
Deep learning has demonstrated its superiority in various applications, including healthcare.
arXiv Detail & Related papers (2024-02-02T00:31:01Z) - Evaluating the Fairness of the MIMIC-IV Dataset and a Baseline
Algorithm: Application to the ICU Length of Stay Prediction [65.268245109828]
This paper uses the MIMIC-IV dataset to examine the fairness and bias in an XGBoost binary classification model predicting the ICU length of stay.
The research reveals class imbalances in the dataset across demographic attributes and employs data preprocessing and feature extraction.
The paper concludes with recommendations for fairness-aware machine learning techniques for mitigating biases and the need for collaborative efforts among healthcare professionals and data scientists.
arXiv Detail & Related papers (2023-12-31T16:01:48Z) - Toward Cohort Intelligence: A Universal Cohort Representation Learning
Framework for Electronic Health Record Analysis [15.137213823470544]
We propose a universal COhort Representation lEarning (CORE) framework to augment EHR utilization by leveraging the fine-grained cohort information among patients.
CORE is readily applicable to diverse backbone models, serving as a universal plug-in framework to infuse cohort information into healthcare methods for boosted performance.
arXiv Detail & Related papers (2023-04-10T09:12:37Z) - On the Importance of Clinical Notes in Multi-modal Learning for EHR Data [0.0]
Previous research has shown that jointly using clinical notes with electronic health record data improved predictive performance for patient monitoring.
We first confirm that performance significantly improves over state-of-the-art EHR data models when combining EHR data and clinical notes.
We then provide an analysis showing improvements arise almost exclusively from a subset of notes containing broader context on patient state rather than clinician notes.
arXiv Detail & Related papers (2022-12-06T15:18:57Z) - Modeling electronic health record data using a knowledge-graph-embedded
topic model [6.170782354287972]
We present KG-ETM, an end-to-end knowledge graph-based multimodal embedded topic model.
KG-ETM distills latent disease topics from EHR data by learning the embedding from the medical knowledge graphs.
Our model is also able to discover interpretable and accurate patient representations for patient stratification and drug recommendations.
arXiv Detail & Related papers (2022-06-03T07:58:17Z) - When Accuracy Meets Privacy: Two-Stage Federated Transfer Learning
Framework in Classification of Medical Images on Limited Data: A COVID-19
Case Study [77.34726150561087]
COVID-19 pandemic has spread rapidly and caused a shortage of global medical resources.
CNN has been widely utilized and verified in analyzing medical images.
arXiv Detail & Related papers (2022-03-24T02:09:41Z) - Time Series Prediction using Deep Learning Methods in Healthcare [0.0]
Traditional machine learning methods face two main challenges in dealing with healthcare predictive analytics tasks.
The high-dimensional nature of healthcare data needs labor-intensive processes to select an appropriate set of features for each new task.
Recent deep learning methods have shown promising performance for various healthcare prediction tasks.
arXiv Detail & Related papers (2021-08-30T18:14:27Z) - Self-Supervised Graph Learning with Hyperbolic Embedding for Temporal
Health Event Prediction [13.24834156675212]
We propose a hyperbolic embedding method with information flow to pre-train medical code representations in a hierarchical structure.
We incorporate these pre-trained representations into a graph neural network to detect disease complications.
We present a new hierarchy-enhanced historical prediction proxy task in our self-supervised learning framework to fully utilize EHR data.
arXiv Detail & Related papers (2021-06-09T00:42:44Z) - BiteNet: Bidirectional Temporal Encoder Network to Predict Medical
Outcomes [53.163089893876645]
We propose a novel self-attention mechanism that captures the contextual dependency and temporal relationships within a patient's healthcare journey.
An end-to-end bidirectional temporal encoder network (BiteNet) then learns representations of the patient's journeys.
We have evaluated the effectiveness of our methods on two supervised prediction and two unsupervised clustering tasks with a real-world EHR dataset.
arXiv Detail & Related papers (2020-09-24T00:42:36Z) - Uncovering the structure of clinical EEG signals with self-supervised
learning [64.4754948595556]
Supervised learning paradigms are often limited by the amount of labeled data that is available.
This phenomenon is particularly problematic in clinically-relevant data, such as electroencephalography (EEG)
By extracting information from unlabeled data, it might be possible to reach competitive performance with deep neural networks.
arXiv Detail & Related papers (2020-07-31T14:34:47Z) - Opportunities and Challenges of Deep Learning Methods for
Electrocardiogram Data: A Systematic Review [62.490310870300746]
The electrocardiogram (ECG) is one of the most commonly used diagnostic tools in medicine and healthcare.
Deep learning methods have achieved promising results on predictive healthcare tasks using ECG signals.
This paper presents a systematic review of deep learning methods for ECG data from both modeling and application perspectives.
arXiv Detail & Related papers (2019-12-28T02:44:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.