Predicting clinical outcomes from patient care pathways represented with temporal knowledge graphs
- URL: http://arxiv.org/abs/2502.21138v2
- Date: Wed, 30 Apr 2025 15:52:56 GMT
- Title: Predicting clinical outcomes from patient care pathways represented with temporal knowledge graphs
- Authors: Jong Ho Jhee, Alberto Megina, PacĂ´me Constant Dit Beaufils, Matilde Karakachoff, Richard Redon, Alban Gaignard, Adrien Coulet,
- Abstract summary: It is unclear how knowledge graph data representations and their embedding, which are competitive in some settings, could be of interest in biomedical predictive modeling.<n>We simulated synthetic but realistic data of patients with intracranial aneurysm and experimented on the task of predicting their clinical outcome.<n>Our study illustrates that in our case, a graph representation and Graph Convolutional Network (GCN) embeddings reach the best performance for a predictive task from observational data.
- Score: 0.6596280437011043
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Background: With the increasing availability of healthcare data, predictive modeling finds many applications in the biomedical domain, such as the evaluation of the level of risk for various conditions, which in turn can guide clinical decision making. However, it is unclear how knowledge graph data representations and their embedding, which are competitive in some settings, could be of interest in biomedical predictive modeling. Method: We simulated synthetic but realistic data of patients with intracranial aneurysm and experimented on the task of predicting their clinical outcome. We compared the performance of various classification approaches on tabular data versus a graph-based representation of the same data. Next, we investigated how the adopted schema for representing first individual data and second temporal data impacts predictive performances. Results: Our study illustrates that in our case, a graph representation and Graph Convolutional Network (GCN) embeddings reach the best performance for a predictive task from observational data. We emphasize the importance of the adopted schema and of the consideration of literal values in the representation of individual data. Our study also moderates the relative impact of various time encoding on GCN performance.
Related papers
- Evaluating the Predictive Features of Person-Centric Knowledge Graph Embeddings: Unfolding Ablation Studies [0.757843972001219]
We propose a systematic approach to examine the results of GNN models trained with structured and unstructured information from the MIMIC-III dataset.
We show the robustness of this approach in identifying predictive features in PKGs for the task of readmission prediction.
arXiv Detail & Related papers (2024-08-27T09:48:25Z) - AdaMedGraph: Adaboosting Graph Neural Networks for Personalized Medicine [31.424781716926848]
We propose a novel algorithm named ours, which can automatically select important features to construct multiple patient similarity graphs.
ours is evaluated on two real-world medical scenarios and shows superiors performance.
arXiv Detail & Related papers (2023-11-24T06:27:25Z) - Knowledge Graph Representations to enhance Intensive Care Time-Series
Predictions [4.660203987415476]
Our proposed methodology integrates medical knowledge with ICU data, improving clinical decision modeling.
It combines graph representations with vital signs and clinical reports, enhancing performance.
Our model includes an interpretability component to understand how knowledge graph nodes affect predictions.
arXiv Detail & Related papers (2023-11-13T09:11:55Z) - MedDiffusion: Boosting Health Risk Prediction via Diffusion-based Data
Augmentation [58.93221876843639]
This paper introduces a novel, end-to-end diffusion-based risk prediction model, named MedDiffusion.
It enhances risk prediction performance by creating synthetic patient data during training to enlarge sample space.
It discerns hidden relationships between patient visits using a step-wise attention mechanism, enabling the model to automatically retain the most vital information for generating high-quality data.
arXiv Detail & Related papers (2023-10-04T01:36:30Z) - Unsupervised pre-training of graph transformers on patient population
graphs [48.02011627390706]
We propose a graph-transformer-based network to handle heterogeneous clinical data.
We show the benefit of our pre-training method in a self-supervised and a transfer learning setting.
arXiv Detail & Related papers (2022-07-21T16:59:09Z) - Deep Co-Attention Network for Multi-View Subspace Learning [73.3450258002607]
We propose a deep co-attention network for multi-view subspace learning.
It aims to extract both the common information and the complementary information in an adversarial setting.
In particular, it uses a novel cross reconstruction loss and leverages the label information to guide the construction of the latent representation.
arXiv Detail & Related papers (2021-02-15T18:46:44Z) - Deep Learning with Heterogeneous Graph Embeddings for Mortality
Prediction from Electronic Health Records [2.2859570135269625]
We train a Heterogeneous Graph Model (HGM) on Electronic Health Record data and use the resulting embedding vector as additional information added to a Conal Neural Network (CNN) model for predicting in-hospital mortality.
We find that adding HGM to a CNN model increases the mortality prediction accuracy up to 4%.
arXiv Detail & Related papers (2020-12-28T02:27:09Z) - BiteNet: Bidirectional Temporal Encoder Network to Predict Medical
Outcomes [53.163089893876645]
We propose a novel self-attention mechanism that captures the contextual dependency and temporal relationships within a patient's healthcare journey.
An end-to-end bidirectional temporal encoder network (BiteNet) then learns representations of the patient's journeys.
We have evaluated the effectiveness of our methods on two supervised prediction and two unsupervised clustering tasks with a real-world EHR dataset.
arXiv Detail & Related papers (2020-09-24T00:42:36Z) - Trajectories, bifurcations and pseudotime in large clinical datasets:
applications to myocardial infarction and diabetes data [94.37521840642141]
We suggest a semi-supervised methodology for the analysis of large clinical datasets, characterized by mixed data types and missing values.
The methodology is based on application of elastic principal graphs which can address simultaneously the tasks of dimensionality reduction, data visualization, clustering, feature selection and quantifying the geodesic distances (pseudotime) in partially ordered sequences of observations.
arXiv Detail & Related papers (2020-07-07T21:04:55Z) - Temporal Phenotyping using Deep Predictive Clustering of Disease
Progression [97.88605060346455]
We develop a deep learning approach for clustering time-series data, where each cluster comprises patients who share similar future outcomes of interest.
Experiments on two real-world datasets show that our model achieves superior clustering performance over state-of-the-art benchmarks.
arXiv Detail & Related papers (2020-06-15T20:48:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.