Developing a Robust Computable Phenotype Definition Workflow to Describe
Health and Disease in Observational Health Research
- URL: http://arxiv.org/abs/2304.06504v1
- Date: Thu, 30 Mar 2023 15:29:54 GMT
- Title: Developing a Robust Computable Phenotype Definition Workflow to Describe
Health and Disease in Observational Health Research
- Authors: Jacob S. Zelko, Sarah Gasman, Shenita R. Freeman, Dong Yun Lee, Jaan
Altosaar, Azza Shoaibi, Gowtham Rao
- Abstract summary: Health informatics is built upon patient health data.
standardization is required to compute population statistics that are common metrics used in fields such as epidemiology.
While standards exist to structure and analyze patient data, analogous best practices for rigorously defining patient populations do not exist.
- Score: 0.6465251961564604
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Health informatics can inform decisions that practitioners, patients,
policymakers, and researchers need to make about health and disease. Health
informatics is built upon patient health data leading to the need to codify
patient health information. Such standardization is required to compute
population statistics (such as prevalence, incidence, etc.) that are common
metrics used in fields such as epidemiology. Reliable decision-making about
health and disease rests on our ability to organize, analyze, and assess data
repositories that contain patient health data.
While standards exist to structure and analyze patient data across patient
data sources such as health information exchanges, clinical data repositories,
and health data marketplaces, analogous best practices for rigorously defining
patient populations in health informatics contexts do not exist. Codifying best
practices for developing disease definitions could support the effective
development of clinical guidelines, inform algorithms used in clinical decision
support systems, and additional patient guidelines.
In this paper, we present a workflow for the development of phenotype
definitions. This workflow presents a series of recommendations for defining
health and disease. Various examples within this paper are presented to
demonstrate this workflow in health informatics contexts.
Related papers
- Addressing Data Heterogeneity in Federated Learning of Cox Proportional Hazards Models [8.798959872821962]
This paper outlines an approach in the domain of federated survival analysis, specifically the Cox Proportional Hazards (CoxPH) model.
We present an FL approach that employs feature-based clustering to enhance model accuracy across synthetic datasets and real-world applications.
arXiv Detail & Related papers (2024-07-20T18:34:20Z) - Patient-Centric Knowledge Graphs: A Survey of Current Methods,
Challenges, and Applications [2.913761513290171]
Patient-Centric Knowledge Graphs (PCKGs) represent an important shift in healthcare that focuses on individualized patient care.
PCKGs integrate various types of health data to provide healthcare professionals with a comprehensive understanding of a patient's health.
This literature review explores the methodologies, challenges, and opportunities associated with PCKGs.
arXiv Detail & Related papers (2024-02-20T00:07:55Z) - Data-Centric Foundation Models in Computational Healthcare: A Survey [21.53211505568379]
Foundation models (FMs) as an emerging suite of AI techniques have struck a wave of opportunities in computational healthcare.
We discuss key perspectives in AI security, assessment, and alignment with human values.
We offer a promising outlook of FM-based analytics to enhance the performance of patient outcome and clinical workflow.
arXiv Detail & Related papers (2024-01-04T08:00:32Z) - MedDiffusion: Boosting Health Risk Prediction via Diffusion-based Data
Augmentation [58.93221876843639]
This paper introduces a novel, end-to-end diffusion-based risk prediction model, named MedDiffusion.
It enhances risk prediction performance by creating synthetic patient data during training to enlarge sample space.
It discerns hidden relationships between patient visits using a step-wise attention mechanism, enabling the model to automatically retain the most vital information for generating high-quality data.
arXiv Detail & Related papers (2023-10-04T01:36:30Z) - Informing clinical assessment by contextualizing post-hoc explanations
of risk prediction models in type-2 diabetes [50.8044927215346]
We consider a comorbidity risk prediction scenario and focus on contexts regarding the patients clinical state.
We employ several state-of-the-art LLMs to present contexts around risk prediction model inferences and evaluate their acceptability.
Our paper is one of the first end-to-end analyses identifying the feasibility and benefits of contextual explanations in a real-world clinical use case.
arXiv Detail & Related papers (2023-02-11T18:07:11Z) - The Medkit-Learn(ing) Environment: Medical Decision Modelling through
Simulation [81.72197368690031]
We present a new benchmarking suite designed specifically for medical sequential decision making.
The Medkit-Learn(ing) Environment is a publicly available Python package providing simple and easy access to high-fidelity synthetic medical data.
arXiv Detail & Related papers (2021-06-08T10:38:09Z) - Clinical Outcome Prediction from Admission Notes using Self-Supervised
Knowledge Integration [55.88616573143478]
Outcome prediction from clinical text can prevent doctors from overlooking possible risks.
Diagnoses at discharge, procedures performed, in-hospital mortality and length-of-stay prediction are four common outcome prediction targets.
We propose clinical outcome pre-training to integrate knowledge about patient outcomes from multiple public sources.
arXiv Detail & Related papers (2021-02-08T10:26:44Z) - UNITE: Uncertainty-based Health Risk Prediction Leveraging Multi-sourced
Data [81.00385374948125]
We present UNcertaInTy-based hEalth risk prediction (UNITE) model.
UNITE provides accurate disease risk prediction and uncertainty estimation leveraging multi-sourced health data.
We evaluate UNITE on real-world disease risk prediction tasks: nonalcoholic fatty liver disease (NASH) and Alzheimer's disease (AD)
UNITE achieves up to 0.841 in F1 score for AD detection, up to 0.609 in PR-AUC for NASH detection, and outperforms various state-of-the-art baselines by up to $19%$ over the best baseline.
arXiv Detail & Related papers (2020-10-22T02:28:11Z) - Interpretable Disease Prediction based on Reinforcement Path Reasoning
over Knowledge Graphs [15.339137501579087]
We formulated the disease prediction task as a random walk along a knowledge graph (KG)
We build a KG to record relationships between diseases and risk factors according to validated medical knowledge.
The trajectory generated by the object represents an interpretable disease progression path of the given patient.
arXiv Detail & Related papers (2020-10-16T10:46:28Z) - Hemogram Data as a Tool for Decision-making in COVID-19 Management:
Applications to Resource Scarcity Scenarios [62.997667081978825]
COVID-19 pandemics has challenged emergency response systems worldwide, with widespread reports of essential services breakdown and collapse of health care structure.
This work describes a machine learning model derived from hemogram exam data performed in symptomatic patients.
Proposed models can predict COVID-19 qRT-PCR results in symptomatic individuals with high accuracy, sensitivity and specificity.
arXiv Detail & Related papers (2020-05-10T01:45:03Z) - Personal Health Knowledge Graphs for Patients [18.71820749477523]
Existing patient data analytics platforms fail to incorporate information that has context, is personal, and topical to patients.
For a recommendation system to give a suitable response to a query, it should consider personal information about the patient's health history.
arXiv Detail & Related papers (2020-03-31T19:35:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.