Heterogeneous electronic medical record representation for similarity
computing
- URL: http://arxiv.org/abs/2104.14229v1
- Date: Thu, 29 Apr 2021 09:38:14 GMT
- Title: Heterogeneous electronic medical record representation for similarity
computing
- Authors: Hoda Memarzadeh, Nasser Ghadiri, Maryam Lotfi Shahreza and Suresh
Pokharel
- Abstract summary: Patient similarity assessment is one of the secondary tasks in identifying patients who are similar to a given patient.
This article examines a new data representation method for Electronic Medical Records (EMRs)
We propose a method that captures the co-occurrence of different medical events, including signs, symptoms, and diseases extracted via unstructured data and structured data.
- Score: 3.039568795810294
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Due to the widespread use of tools and the development of text processing
techniques, the size and range of clinical data are not limited to structured
data. The rapid growth of recorded information has led to big data platforms in
healthcare that could be used to improve patients' primary care and serve
various secondary purposes. Patient similarity assessment is one of the
secondary tasks in identifying patients who are similar to a given patient, and
it helps derive insights from similar patients' records to provide better
treatment. This type of assessment is based on calculating the distance between
patients. Since representing and calculating the similarity of patients plays
an essential role in many secondary uses of electronic records, this article
examines a new data representation method for Electronic Medical Records (EMRs)
while taking into account the information in clinical narratives for similarity
computing. Some previous works are based on structured data types, while other
works only use unstructured data. However, a comprehensive representation of
the information contained in the EMR requires the effective aggregation of both
structured and unstructured data. To address the limitations of previous
methods, we propose a method that captures the co-occurrence of different
medical events, including signs, symptoms, and diseases extracted via
unstructured data and structured data. It integrates data as discriminative
features to construct a temporal tree, considering the difference between
events that have short-term and long-term impacts. Our results show that
considering signs, symptoms, and diseases in every time interval leads to less
MSE and more precision compared to baseline representations that do not
consider this information or consider them separately from structured data.
Related papers
- Sequential Inference of Hospitalization Electronic Health Records Using Probabilistic Models [3.2988476179015005]
In this work we design a probabilistic unsupervised model for multiple arbitrary-length sequences contained in hospitalization Electronic Health Record (EHR) data.
The model uses a latent variable structure and captures complex relationships between medications, diagnoses, laboratory tests, neurological assessments, and medications.
Inference algorithms are derived that use partial data to infer properties of the complete sequences, including their length and presence of specific values.
arXiv Detail & Related papers (2024-03-27T21:06:26Z) - Modelling Patient Trajectories Using Multimodal Information [0.0]
We propose a solution to model patient trajectories that combines different types of information and considers the temporal aspect of clinical data.
The developed solution was evaluated on two different clinical outcomes, unexpected patient readmission and disease progression.
arXiv Detail & Related papers (2022-09-09T10:20:54Z) - Unsupervised pre-training of graph transformers on patient population
graphs [48.02011627390706]
We propose a graph-transformer-based network to handle heterogeneous clinical data.
We show the benefit of our pre-training method in a self-supervised and a transfer learning setting.
arXiv Detail & Related papers (2022-07-21T16:59:09Z) - How to Leverage Multimodal EHR Data for Better Medical Predictions? [13.401754962583771]
The complexity of electronic health records ( EHR) data is a challenge for the application of deep learning.
In this paper, we first extract the accompanying clinical notes from EHR and propose a method to integrate these data.
The results on two medical prediction tasks show that our fused model with different data outperforms the state-of-the-art method.
arXiv Detail & Related papers (2021-10-29T13:26:05Z) - Sequential Diagnosis Prediction with Transformer and Ontological
Representation [35.88195694025553]
We propose an end-to-end robust transformer-based model called SETOR to handle irregular intervals between a patient's visits with admitted timestamps and length of stay in each visit.
Experiments conducted on two real-world healthcare datasets show that, our sequential diagnoses prediction model SETOR achieves better predictive results than previous state-of-the-art approaches.
arXiv Detail & Related papers (2021-09-07T13:09:55Z) - MIMO: Mutual Integration of Patient Journey and Medical Ontology for
Healthcare Representation Learning [49.57261599776167]
We propose an end-to-end robust Transformer-based solution, Mutual Integration of patient journey and Medical Ontology (MIMO) for healthcare representation learning and predictive analytics.
arXiv Detail & Related papers (2021-07-20T07:04:52Z) - BiteNet: Bidirectional Temporal Encoder Network to Predict Medical
Outcomes [53.163089893876645]
We propose a novel self-attention mechanism that captures the contextual dependency and temporal relationships within a patient's healthcare journey.
An end-to-end bidirectional temporal encoder network (BiteNet) then learns representations of the patient's journeys.
We have evaluated the effectiveness of our methods on two supervised prediction and two unsupervised clustering tasks with a real-world EHR dataset.
arXiv Detail & Related papers (2020-09-24T00:42:36Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z) - Trajectories, bifurcations and pseudotime in large clinical datasets:
applications to myocardial infarction and diabetes data [94.37521840642141]
We suggest a semi-supervised methodology for the analysis of large clinical datasets, characterized by mixed data types and missing values.
The methodology is based on application of elastic principal graphs which can address simultaneously the tasks of dimensionality reduction, data visualization, clustering, feature selection and quantifying the geodesic distances (pseudotime) in partially ordered sequences of observations.
arXiv Detail & Related papers (2020-07-07T21:04:55Z) - Comparative Visual Analytics for Assessing Medical Records with Sequence
Embedding [17.752580010414228]
We develop a visual analytics system to support comparative studies of patient records.
We demonstrate the effectiveness of our design and system with case studies using a real-world dataset from the neonatal intensive care unit of UC Davis.
arXiv Detail & Related papers (2020-02-18T19:29:30Z) - Learning Dynamic and Personalized Comorbidity Networks from Event Data
using Deep Diffusion Processes [102.02672176520382]
Comorbid diseases co-occur and progress via complex temporal patterns that vary among individuals.
In electronic health records we can observe the different diseases a patient has, but can only infer the temporal relationship between each co-morbid condition.
We develop deep diffusion processes to model "dynamic comorbidity networks"
arXiv Detail & Related papers (2020-01-08T15:47:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.