Sequential Diagnosis Prediction with Transformer and Ontological
Representation
- URL: http://arxiv.org/abs/2109.03069v1
- Date: Tue, 7 Sep 2021 13:09:55 GMT
- Title: Sequential Diagnosis Prediction with Transformer and Ontological
Representation
- Authors: Xueping Peng, Guodong Long, Tao Shen, Sen Wang, Jing Jiang
- Abstract summary: We propose an end-to-end robust transformer-based model called SETOR to handle irregular intervals between a patient's visits with admitted timestamps and length of stay in each visit.
Experiments conducted on two real-world healthcare datasets show that, our sequential diagnoses prediction model SETOR achieves better predictive results than previous state-of-the-art approaches.
- Score: 35.88195694025553
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Sequential diagnosis prediction on the Electronic Health Record (EHR) has
been proven crucial for predictive analytics in the medical domain. EHR data,
sequential records of a patient's interactions with healthcare systems, has
numerous inherent characteristics of temporality, irregularity and data
insufficiency. Some recent works train healthcare predictive models by making
use of sequential information in EHR data, but they are vulnerable to
irregular, temporal EHR data with the states of admission/discharge from
hospital, and insufficient data. To mitigate this, we propose an end-to-end
robust transformer-based model called SETOR, which exploits neural ordinary
differential equation to handle both irregular intervals between a patient's
visits with admitted timestamps and length of stay in each visit, to alleviate
the limitation of insufficient data by integrating medical ontology, and to
capture the dependencies between the patient's visits by employing multi-layer
transformer blocks. Experiments conducted on two real-world healthcare datasets
show that, our sequential diagnoses prediction model SETOR not only achieves
better predictive results than previous state-of-the-art approaches,
irrespective of sufficient or insufficient training data, but also derives more
interpretable embeddings of medical codes. The experimental codes are available
at the GitHub repository (https://github.com/Xueping/SETOR).
Related papers
- Synthesizing Multimodal Electronic Health Records via Predictive Diffusion Models [69.06149482021071]
We propose a novel EHR data generation model called EHRPD.
It is a diffusion-based model designed to predict the next visit based on the current one while also incorporating time interval estimation.
We conduct experiments on two public datasets and evaluate EHRPD from fidelity, privacy, and utility perspectives.
arXiv Detail & Related papers (2024-06-20T02:20:23Z) - Sequential Inference of Hospitalization Electronic Health Records Using Probabilistic Models [3.2988476179015005]
In this work we design a probabilistic unsupervised model for multiple arbitrary-length sequences contained in hospitalization Electronic Health Record (EHR) data.
The model uses a latent variable structure and captures complex relationships between medications, diagnoses, laboratory tests, neurological assessments, and medications.
Inference algorithms are derived that use partial data to infer properties of the complete sequences, including their length and presence of specific values.
arXiv Detail & Related papers (2024-03-27T21:06:26Z) - MedDiffusion: Boosting Health Risk Prediction via Diffusion-based Data
Augmentation [58.93221876843639]
This paper introduces a novel, end-to-end diffusion-based risk prediction model, named MedDiffusion.
It enhances risk prediction performance by creating synthetic patient data during training to enlarge sample space.
It discerns hidden relationships between patient visits using a step-wise attention mechanism, enabling the model to automatically retain the most vital information for generating high-quality data.
arXiv Detail & Related papers (2023-10-04T01:36:30Z) - Contrastive Learning-based Imputation-Prediction Networks for
In-hospital Mortality Risk Modeling using EHRs [9.578930989075035]
This paper presents a contrastive learning-based imputation-prediction network for predicting in-hospital mortality risks using EHR data.
Our approach introduces graph analysis-based patient stratification modeling in the imputation process to group similar patients.
Experiments on two real-world EHR datasets show that our approach outperforms the state-of-the-art approaches in both imputation and prediction tasks.
arXiv Detail & Related papers (2023-08-19T03:24:34Z) - MIMO: Mutual Integration of Patient Journey and Medical Ontology for
Healthcare Representation Learning [49.57261599776167]
We propose an end-to-end robust Transformer-based solution, Mutual Integration of patient journey and Medical Ontology (MIMO) for healthcare representation learning and predictive analytics.
arXiv Detail & Related papers (2021-07-20T07:04:52Z) - MedGPT: Medical Concept Prediction from Clinical Narratives [0.23488056916440858]
Temporal modelling of a patient's medical history can be used to predict future events.
We present MedGPT, a novel transformer-based pipeline that uses Named Entity Recognition and Linking tools.
We show that our model captures medical knowledge by testing it on an experimental medical multiple choice question answering task.
arXiv Detail & Related papers (2021-07-07T10:36:28Z) - Medical data wrangling with sequential variational autoencoders [5.9207487081080705]
This paper proposes to model medical data records with heterogeneous data types and bursty missing data using sequential variational autoencoders (VAEs)
We show that Shi-VAE achieves the best performance in terms of using both metrics, with lower computational complexity than the GP-VAE model.
arXiv Detail & Related papers (2021-03-12T10:59:26Z) - DeepRite: Deep Recurrent Inverse TreatmEnt Weighting for Adjusting
Time-varying Confounding in Modern Longitudinal Observational Data [68.29870617697532]
We propose Deep Recurrent Inverse TreatmEnt weighting (DeepRite) for time-varying confounding in longitudinal data.
DeepRite is shown to recover the ground truth from synthetic data, and estimate unbiased treatment effects from real data.
arXiv Detail & Related papers (2020-10-28T15:05:08Z) - BiteNet: Bidirectional Temporal Encoder Network to Predict Medical
Outcomes [53.163089893876645]
We propose a novel self-attention mechanism that captures the contextual dependency and temporal relationships within a patient's healthcare journey.
An end-to-end bidirectional temporal encoder network (BiteNet) then learns representations of the patient's journeys.
We have evaluated the effectiveness of our methods on two supervised prediction and two unsupervised clustering tasks with a real-world EHR dataset.
arXiv Detail & Related papers (2020-09-24T00:42:36Z) - DeepEnroll: Patient-Trial Matching with Deep Embedding and Entailment
Prediction [67.91606509226132]
Clinical trials are essential for drug development but often suffer from expensive, inaccurate and insufficient patient recruitment.
DeepEnroll is a cross-modal inference learning model to jointly encode enrollment criteria (tabular data) into a shared latent space for matching inference.
arXiv Detail & Related papers (2020-01-22T17:51:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.