MedML: Fusing Medical Knowledge and Machine Learning Models for Early
Pediatric COVID-19 Hospitalization and Severity Prediction
- URL: http://arxiv.org/abs/2207.12283v1
- Date: Mon, 25 Jul 2022 15:56:14 GMT
- Title: MedML: Fusing Medical Knowledge and Machine Learning Models for Early
Pediatric COVID-19 Hospitalization and Severity Prediction
- Authors: Junyi Gao, Chaoqi Yang, George Heintz, Scott Barrows, Elise Albers,
Mary Stapel, Sara Warfield, Adam Cross, Jimeng Sun, the N3C consortium
- Abstract summary: We respond to the national Pediatric COVID-19 data challenge with a novel machine learning model, MedML.
MedML extracts the most predictive features based on medical knowledge and propensity scores from over 6 million medical concepts.
We evaluate MedML across 143,605 patients for the hospitalization prediction task and 11,465 patients for the severity prediction task.
- Score: 27.352097332678213
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The COVID-19 pandemic has caused devastating economic and social disruption,
straining the resources of healthcare institutions worldwide. This has led to a
nationwide call for models to predict hospitalization and severe illness in
patients with COVID-19 to inform distribution of limited healthcare resources.
We respond to one of these calls specific to the pediatric population. To
address this challenge, we study two prediction tasks for the pediatric
population using electronic health records: 1) predicting which children are
more likely to be hospitalized, and 2) among hospitalized children, which
individuals are more likely to develop severe symptoms.
We respond to the national Pediatric COVID-19 data challenge with a novel
machine learning model, MedML. MedML extracts the most predictive features
based on medical knowledge and propensity scores from over 6 million medical
concepts and incorporates the inter-feature relationships between heterogeneous
medical features via graph neural networks (GNN). We evaluate MedML across
143,605 patients for the hospitalization prediction task and 11,465 patients
for the severity prediction task using data from the National Cohort
Collaborative (N3C) dataset. We also report detailed group-level and
individual-level feature importance analyses to evaluate the model
interpretability.
MedML achieves up to a 7% higher AUROC score and up to a 14% higher AUPRC
score compared to the best baseline machine learning models and performs well
across all nine national geographic regions and over all three-month spans
since the start of the pandemic. Our cross-disciplinary research team has
developed a method of incorporating clinical domain knowledge as the framework
for a new type of machine learning model that is more predictive and
explainable than current state-of-the-art data-driven feature selection
methods.
Related papers
- Reasoning-Enhanced Healthcare Predictions with Knowledge Graph Community Retrieval [61.70489848327436]
KARE is a novel framework that integrates knowledge graph (KG) community-level retrieval with large language models (LLMs) reasoning.
Extensive experiments demonstrate that KARE outperforms leading models by up to 10.8-15.0% on MIMIC-III and 12.6-12.7% on MIMIC-IV for mortality and readmission predictions.
arXiv Detail & Related papers (2024-10-06T18:46:28Z) - Foresight -- Deep Generative Modelling of Patient Timelines using
Electronic Health Records [46.024501445093755]
Temporal modelling of medical history can be used to forecast and simulate future events, estimate risk, suggest alternative diagnoses or forecast complications.
We present Foresight, a novel GPT3-based pipeline that uses NER+L tools (i.e. MedCAT) to convert document text into structured, coded concepts.
arXiv Detail & Related papers (2022-12-13T19:06:00Z) - Feature-context driven Federated Meta-Learning for Rare Disease
Prediction [5.841823822793997]
We propose a novel approach for rare disease prediction based on federated meta-learning.
We show that our approach out-performs the original federated meta-learning algorithm in accuracy and speed.
arXiv Detail & Related papers (2021-12-29T02:18:43Z) - Developing and validating multi-modal models for mortality prediction in
COVID-19 patients: a multi-center retrospective study [1.5308395762165423]
We develop and validate multi-modal models for COVID-19 mortality prediction using multi-center patient data.
Our goal is to create a toolkit that would assist investigators and organizations in building multi-modal models for prediction, classification and/or optimization.
arXiv Detail & Related papers (2021-09-01T04:46:27Z) - MIMO: Mutual Integration of Patient Journey and Medical Ontology for
Healthcare Representation Learning [49.57261599776167]
We propose an end-to-end robust Transformer-based solution, Mutual Integration of patient journey and Medical Ontology (MIMO) for healthcare representation learning and predictive analytics.
arXiv Detail & Related papers (2021-07-20T07:04:52Z) - UNITE: Uncertainty-based Health Risk Prediction Leveraging Multi-sourced
Data [81.00385374948125]
We present UNcertaInTy-based hEalth risk prediction (UNITE) model.
UNITE provides accurate disease risk prediction and uncertainty estimation leveraging multi-sourced health data.
We evaluate UNITE on real-world disease risk prediction tasks: nonalcoholic fatty liver disease (NASH) and Alzheimer's disease (AD)
UNITE achieves up to 0.841 in F1 score for AD detection, up to 0.609 in PR-AUC for NASH detection, and outperforms various state-of-the-art baselines by up to $19%$ over the best baseline.
arXiv Detail & Related papers (2020-10-22T02:28:11Z) - BiteNet: Bidirectional Temporal Encoder Network to Predict Medical
Outcomes [53.163089893876645]
We propose a novel self-attention mechanism that captures the contextual dependency and temporal relationships within a patient's healthcare journey.
An end-to-end bidirectional temporal encoder network (BiteNet) then learns representations of the patient's journeys.
We have evaluated the effectiveness of our methods on two supervised prediction and two unsupervised clustering tasks with a real-world EHR dataset.
arXiv Detail & Related papers (2020-09-24T00:42:36Z) - Select-ProtoNet: Learning to Select for Few-Shot Disease Subtype
Prediction [55.94378672172967]
We focus on few-shot disease subtype prediction problem, identifying subgroups of similar patients.
We introduce meta learning techniques to develop a new model, which can extract the common experience or knowledge from interrelated clinical tasks.
Our new model is built upon a carefully designed meta-learner, called Prototypical Network, that is a simple yet effective meta learning machine for few-shot image classification.
arXiv Detail & Related papers (2020-09-02T02:50:30Z) - Individualized Prediction of COVID-19 Adverse outcomes with MLHO [9.197411456718708]
We developed an end-to-end Machine Learning framework that leverages iterative feature and algorithm selection to predict Health outcomes.
We modeled the four adverse outcomes utilizing about 600 features representing patients' pre-COVID health records and demographics.
Our results demonstrated that while demographic variables are important predictors of adverse outcomes after a COVID-19 infection, the incorporation of the past clinical records are vital for a reliable prediction model.
arXiv Detail & Related papers (2020-08-10T02:44:52Z) - From predictions to prescriptions: A data-driven response to COVID-19 [42.57407485467993]
We propose a comprehensive data-driven approach to understand the clinical characteristics of COVID-19.
We build personalized calculators to predict the risk of infection and mortality.
We propose an optimization model to re-allocate ventilators and alleviate shortages.
arXiv Detail & Related papers (2020-06-30T03:34:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.