A Graph-based Imputation Method for Sparse Medical Records
- URL: http://arxiv.org/abs/2111.09084v1
- Date: Wed, 17 Nov 2021 13:06:08 GMT
- Title: A Graph-based Imputation Method for Sparse Medical Records
- Authors: Ramon Vinas, Xu Zheng and Jer Hayes
- Abstract summary: We propose a graph-based imputation method that is robust to sparsity and to unreliable unmeasured events.
Results indicate that the model learns to embed different event types in a clinically meaningful way.
- Score: 3.136861161060886
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Electronic Medical Records (EHR) are extremely sparse. Only a small
proportion of events (symptoms, diagnoses, and treatments) are observed in the
lifetime of an individual. The high degree of missingness of EHR can be
attributed to a large number of factors, including device failure, privacy
concerns, or other unexpected reasons. Unfortunately, many traditional
imputation methods are not well suited for highly sparse data and scale poorly
to high dimensional datasets. In this paper, we propose a graph-based
imputation method that is both robust to sparsity and to unreliable unmeasured
events. Our approach compares favourably to several standard and
state-of-the-art imputation methods in terms of performance and runtime.
Moreover, results indicate that the model learns to embed different event types
in a clinically meaningful way. Our work can facilitate the diagnosis of novel
diseases based on the clinical history of past events, with the potential to
increase our understanding of the landscape of comorbidities.
Related papers
- Deep State-Space Generative Model For Correlated Time-to-Event Predictions [54.3637600983898]
We propose a deep latent state-space generative model to capture the interactions among different types of correlated clinical events.
Our method also uncovers meaningful insights about the latent correlations among mortality and different types of organ failures.
arXiv Detail & Related papers (2024-07-28T02:42:36Z) - Automatic diagnosis of knee osteoarthritis severity using Swin
transformer [55.01037422579516]
Knee osteoarthritis (KOA) is a widespread condition that can cause chronic pain and stiffness in the knee joint.
We propose an automated approach that employs the Swin Transformer to predict the severity of KOA.
arXiv Detail & Related papers (2023-07-10T09:49:30Z) - Temporal-spatial Correlation Attention Network for Clinical Data
Analysis in Intensive Care Unit [27.885961694582896]
We propose a temporal-saptial correlation attention network (TSCAN) to handle some clinical characteristic prediction problems.
Based on the design of the attention mechanism model, our approach can effectively remove irrelevant items in clinical data and irrelevant nodes in time.
Our method can also find key clinical indicators of important outcomes that can be used to improve treatment options.
arXiv Detail & Related papers (2023-06-03T00:38:40Z) - Practical Challenges in Differentially-Private Federated Survival
Analysis of Medical Data [57.19441629270029]
In this paper, we take advantage of the inherent properties of neural networks to federate the process of training of survival analysis models.
In the realistic setting of small medical datasets and only a few data centers, this noise makes it harder for the models to converge.
We propose DPFed-post which adds a post-processing stage to the private federated learning scheme.
arXiv Detail & Related papers (2022-02-08T10:03:24Z) - Predicting Patient Readmission Risk from Medical Text via Knowledge
Graph Enhanced Multiview Graph Convolution [67.72545656557858]
We propose a new method that uses medical text of Electronic Health Records for prediction.
We represent discharge summaries of patients with multiview graphs enhanced by an external knowledge graph.
Experimental results prove the effectiveness of our method, yielding state-of-the-art performance.
arXiv Detail & Related papers (2021-12-19T01:45:57Z) - Identifying Stroke Indicators Using Rough Sets [0.7340017786387767]
We propose a novel rough-set based technique for ranking the importance of the various EHR records in detecting stroke.
Age, average glucose level, heart disease, and hypertension were the most essential attributes for detecting stroke in patients.
arXiv Detail & Related papers (2021-10-19T06:04:48Z) - Self-Supervised Graph Learning with Hyperbolic Embedding for Temporal
Health Event Prediction [13.24834156675212]
We propose a hyperbolic embedding method with information flow to pre-train medical code representations in a hierarchical structure.
We incorporate these pre-trained representations into a graph neural network to detect disease complications.
We present a new hierarchy-enhanced historical prediction proxy task in our self-supervised learning framework to fully utilize EHR data.
arXiv Detail & Related papers (2021-06-09T00:42:44Z) - Bootstrapping Your Own Positive Sample: Contrastive Learning With
Electronic Health Record Data [62.29031007761901]
This paper proposes a novel contrastive regularized clinical classification model.
We introduce two unique positive sampling strategies specifically tailored for EHR data.
Our framework yields highly competitive experimental results in predicting the mortality risk on real-world COVID-19 EHR data.
arXiv Detail & Related papers (2021-04-07T06:02:04Z) - An efficient representation of chronological events in medical texts [9.118144540451514]
We proposed a systematic methodology for learning from chronological events available in clinical notes.
The proposed methodological it path signature framework creates a non-parametric hierarchical representation of sequential events of any type.
The methodology was developed and externally validated using the largest in the UK secondary care mental health EHR data.
arXiv Detail & Related papers (2020-10-16T14:54:29Z) - Trajectories, bifurcations and pseudotime in large clinical datasets:
applications to myocardial infarction and diabetes data [94.37521840642141]
We suggest a semi-supervised methodology for the analysis of large clinical datasets, characterized by mixed data types and missing values.
The methodology is based on application of elastic principal graphs which can address simultaneously the tasks of dimensionality reduction, data visualization, clustering, feature selection and quantifying the geodesic distances (pseudotime) in partially ordered sequences of observations.
arXiv Detail & Related papers (2020-07-07T21:04:55Z) - Deep Recurrent Model for Individualized Prediction of Alzheimer's
Disease Progression [4.034948808542701]
Alzheimer's disease (AD) is one of the major causes of dementia and is characterized by slow progression over several years.
We propose a novel computational framework that can predict the phenotypic measurements of MRI biomarkers and trajectories of clinical status.
arXiv Detail & Related papers (2020-05-06T08:08:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.