Modeling electronic health record data using a knowledge-graph-embedded
topic model
- URL: http://arxiv.org/abs/2206.01436v1
- Date: Fri, 3 Jun 2022 07:58:17 GMT
- Title: Modeling electronic health record data using a knowledge-graph-embedded
topic model
- Authors: Yuesong Zou, Ahmad Pesaranghader, Aman Verma, David Buckeridge and Yue
Li
- Abstract summary: We present KG-ETM, an end-to-end knowledge graph-based multimodal embedded topic model.
KG-ETM distills latent disease topics from EHR data by learning the embedding from the medical knowledge graphs.
Our model is also able to discover interpretable and accurate patient representations for patient stratification and drug recommendations.
- Score: 6.170782354287972
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The rapid growth of electronic health record (EHR) datasets opens up
promising opportunities to understand human diseases in a systematic way.
However, effective extraction of clinical knowledge from the EHR data has been
hindered by its sparsity and noisy information. We present KG-ETM, an
end-to-end knowledge graph-based multimodal embedded topic model. KG-ETM
distills latent disease topics from EHR data by learning the embedding from the
medical knowledge graphs. We applied KG-ETM to a large-scale EHR dataset
consisting of over 1 million patients. We evaluated its performance based on
EHR reconstruction and drug imputation. KG-ETM demonstrated superior
performance over the alternative methods on both tasks. Moreover, our model
learned clinically meaningful graph-informed embedding of the EHR codes. In
additional, our model is also able to discover interpretable and accurate
patient representations for patient stratification and drug recommendations.
Related papers
- DualMAR: Medical-Augmented Representation from Dual-Expertise Perspectives [20.369746122143063]
We propose DualMAR, a framework that enhances prediction tasks through both individual observation data and public knowledge bases.
By retrieving and angular coordinates upon polar space, DualMAR enables accurate predictions based on rich hierarchical and semantic embeddings from KG.
arXiv Detail & Related papers (2024-10-25T20:25:22Z) - Synthesizing Multimodal Electronic Health Records via Predictive Diffusion Models [69.06149482021071]
We propose a novel EHR data generation model called EHRPD.
It is a diffusion-based model designed to predict the next visit based on the current one while also incorporating time interval estimation.
We conduct experiments on two public datasets and evaluate EHRPD from fidelity, privacy, and utility perspectives.
arXiv Detail & Related papers (2024-06-20T02:20:23Z) - Recent Advances in Predictive Modeling with Electronic Health Records [71.19967863320647]
utilizing EHR data for predictive modeling presents several challenges due to its unique characteristics.
Deep learning has demonstrated its superiority in various applications, including healthcare.
arXiv Detail & Related papers (2024-02-02T00:31:01Z) - MedDiffusion: Boosting Health Risk Prediction via Diffusion-based Data
Augmentation [58.93221876843639]
This paper introduces a novel, end-to-end diffusion-based risk prediction model, named MedDiffusion.
It enhances risk prediction performance by creating synthetic patient data during training to enlarge sample space.
It discerns hidden relationships between patient visits using a step-wise attention mechanism, enabling the model to automatically retain the most vital information for generating high-quality data.
arXiv Detail & Related papers (2023-10-04T01:36:30Z) - Knowledge Graph Embedding with Electronic Health Records Data via Latent
Graphical Block Model [13.398292423857756]
We propose to infer the conditional dependency structure among EHR features via a latent graphical block model (LGBM)
We establish the statistical rates of the proposed estimators and show the perfect recovery of the block structure.
arXiv Detail & Related papers (2023-05-31T16:18:46Z) - Dynamic Graph Enhanced Contrastive Learning for Chest X-ray Report
Generation [92.73584302508907]
We propose a knowledge graph with Dynamic structure and nodes to facilitate medical report generation with Contrastive Learning.
In detail, the fundamental structure of our graph is pre-constructed from general knowledge.
Each image feature is integrated with its very own updated graph before being fed into the decoder module for report generation.
arXiv Detail & Related papers (2023-03-18T03:53:43Z) - On the Importance of Clinical Notes in Multi-modal Learning for EHR Data [0.0]
Previous research has shown that jointly using clinical notes with electronic health record data improved predictive performance for patient monitoring.
We first confirm that performance significantly improves over state-of-the-art EHR data models when combining EHR data and clinical notes.
We then provide an analysis showing improvements arise almost exclusively from a subset of notes containing broader context on patient state rather than clinician notes.
arXiv Detail & Related papers (2022-12-06T15:18:57Z) - Integrated Convolutional and Recurrent Neural Networks for Health Risk
Prediction using Patient Journey Data with Many Missing Values [9.418011774179794]
This paper proposes a novel end-to-end approach to modeling EHR patient journey data with Integrated Convolutional and Recurrent Neural Networks.
Our model can capture both long- and short-term temporal patterns within each patient journey and effectively handle the high degree of missingness in EHR data without any imputation data generation.
arXiv Detail & Related papers (2022-11-11T07:36:18Z) - Predicting Patient Readmission Risk from Medical Text via Knowledge
Graph Enhanced Multiview Graph Convolution [67.72545656557858]
We propose a new method that uses medical text of Electronic Health Records for prediction.
We represent discharge summaries of patients with multiview graphs enhanced by an external knowledge graph.
Experimental results prove the effectiveness of our method, yielding state-of-the-art performance.
arXiv Detail & Related papers (2021-12-19T01:45:57Z) - Demographic Aware Probabilistic Medical Knowledge Graph Embeddings of
Electronic Medical Records [0.5524804393257919]
Medical knowledge graphs (KGs) constructed from Electronic Medical Records (EMR) contain abundant information about patients and medical entities.
DarLING is a demographic-aware medical KG embedding framework that explicitly incorporates demographics in the medical entities space by associating patient demographics with a corresponding hyperplane.
We evaluate DARLING through link prediction for treatments and medicines, on a medical KG constructed from EMR data, and illustrate its superior performance compared to existing KG embedding models.
arXiv Detail & Related papers (2021-03-22T15:45:05Z) - Variational Knowledge Distillation for Disease Classification in Chest
X-Rays [102.04931207504173]
We propose itvariational knowledge distillation (VKD), which is a new probabilistic inference framework for disease classification based on X-rays.
We demonstrate the effectiveness of our method on three public benchmark datasets with paired X-ray images and EHRs.
arXiv Detail & Related papers (2021-03-19T14:13:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.