Toward Cohort Intelligence: A Universal Cohort Representation Learning
Framework for Electronic Health Record Analysis
- URL: http://arxiv.org/abs/2304.04468v3
- Date: Wed, 12 Apr 2023 07:12:42 GMT
- Title: Toward Cohort Intelligence: A Universal Cohort Representation Learning
Framework for Electronic Health Record Analysis
- Authors: Changshuo Liu, Wenqiao Zhang, Beng Chin Ooi, James Wei Luen Yip,
Lingze Zeng, Kaiping Zheng
- Abstract summary: We propose a universal COhort Representation lEarning (CORE) framework to augment EHR utilization by leveraging the fine-grained cohort information among patients.
CORE is readily applicable to diverse backbone models, serving as a universal plug-in framework to infuse cohort information into healthcare methods for boosted performance.
- Score: 15.137213823470544
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Electronic Health Records (EHR) are generated from clinical routine care
recording valuable information of broad patient populations, which provide
plentiful opportunities for improving patient management and intervention
strategies in clinical practice. To exploit the enormous potential of EHR data,
a popular EHR data analysis paradigm in machine learning is EHR representation
learning, which first leverages the individual patient's EHR data to learn
informative representations by a backbone, and supports diverse health-care
downstream tasks grounded on the representations. Unfortunately, such a
paradigm fails to access the in-depth analysis of patients' relevance, which is
generally known as cohort studies in clinical practice. Specifically, patients
in the same cohort tend to share similar characteristics, implying their
resemblance in medical conditions such as symptoms or diseases. In this paper,
we propose a universal COhort Representation lEarning (CORE) framework to
augment EHR utilization by leveraging the fine-grained cohort information among
patients. In particular, CORE first develops an explicit patient modeling task
based on the prior knowledge of patients' diagnosis codes, which measures the
latent relevance among patients to adaptively divide the cohorts for each
patient. Based on the constructed cohorts, CORE recodes the pre-extracted EHR
data representation from intra- and inter-cohort perspectives, yielding
augmented EHR data representation learning. CORE is readily applicable to
diverse backbone models, serving as a universal plug-in framework to infuse
cohort information into healthcare methods for boosted performance. We conduct
an extensive experimental evaluation on two real-world datasets, and the
experimental results demonstrate the effectiveness and generalizability of
CORE.
Related papers
- CohortNet: Empowering Cohort Discovery for Interpretable Healthcare Analytics [23.284528154162977]
We propose a cohort auto-discovery model, CohortNet, for interpretable healthcare analysis.
CohortNet learns fine-grained patient representations by separately processing each feature.
It classifies each feature into distinct states and employs a cohort exploration strategy.
arXiv Detail & Related papers (2024-06-20T06:12:23Z) - TREEMENT: Interpretable Patient-Trial Matching via Personalized Dynamic
Tree-Based Memory Network [54.332862955411656]
Clinical trials are critical for drug development but often suffer from expensive and inefficient patient recruitment.
In recent years, machine learning models have been proposed for speeding up patient recruitment via automatically matching patients with clinical trials.
We introduce a dynamic tree-based memory network model named TREEMENT to provide accurate and interpretable patient trial matching.
arXiv Detail & Related papers (2023-07-19T12:35:09Z) - Large Language Models for Healthcare Data Augmentation: An Example on
Patient-Trial Matching [49.78442796596806]
We propose an innovative privacy-aware data augmentation approach for patient-trial matching (LLM-PTM)
Our experiments demonstrate a 7.32% average improvement in performance using the proposed LLM-PTM method, and the generalizability to new data is improved by 12.12%.
arXiv Detail & Related papers (2023-03-24T03:14:00Z) - On the Importance of Clinical Notes in Multi-modal Learning for EHR Data [0.0]
Previous research has shown that jointly using clinical notes with electronic health record data improved predictive performance for patient monitoring.
We first confirm that performance significantly improves over state-of-the-art EHR data models when combining EHR data and clinical notes.
We then provide an analysis showing improvements arise almost exclusively from a subset of notes containing broader context on patient state rather than clinician notes.
arXiv Detail & Related papers (2022-12-06T15:18:57Z) - Towards Trustworthy Cross-patient Model Development [3.109478324371548]
We study differences in model performance and explainability when trained for all patients and one patient at a time.
The results show that patients' demographics has a large impact on the performance and explainability and thus trustworthiness.
arXiv Detail & Related papers (2021-12-20T10:51:04Z) - SANSformers: Self-Supervised Forecasting in Electronic Health Records
with Attention-Free Models [48.07469930813923]
This work aims to forecast the demand for healthcare services, by predicting the number of patient visits to healthcare facilities.
We introduce SANSformer, an attention-free sequential model designed with specific inductive biases to cater for the unique characteristics of EHR data.
Our results illuminate the promising potential of tailored attention-free models and self-supervised pretraining in refining healthcare utilization predictions across various patient demographics.
arXiv Detail & Related papers (2021-08-31T08:23:56Z) - Bootstrapping Your Own Positive Sample: Contrastive Learning With
Electronic Health Record Data [62.29031007761901]
This paper proposes a novel contrastive regularized clinical classification model.
We introduce two unique positive sampling strategies specifically tailored for EHR data.
Our framework yields highly competitive experimental results in predicting the mortality risk on real-world COVID-19 EHR data.
arXiv Detail & Related papers (2021-04-07T06:02:04Z) - Adversarial Sample Enhanced Domain Adaptation: A Case Study on
Predictive Modeling with Electronic Health Records [57.75125067744978]
We propose a data augmentation method to facilitate domain adaptation.
adversarially generated samples are used during domain adaptation.
Results confirm the effectiveness of our method and the generality on different tasks.
arXiv Detail & Related papers (2021-01-13T03:20:20Z) - Deep Representation Learning of Patient Data from Electronic Health
Records (EHR): A Systematic Review [20.621261286239967]
Patient representation learning refers to learning a dense mathematical representation of a patient that encodes meaningful information from Electronic Health Records.
This study presents a systematic review of this field and provides both qualitative and quantitative analyses from a methodological perspective.
arXiv Detail & Related papers (2020-10-06T15:18:02Z) - BiteNet: Bidirectional Temporal Encoder Network to Predict Medical
Outcomes [53.163089893876645]
We propose a novel self-attention mechanism that captures the contextual dependency and temporal relationships within a patient's healthcare journey.
An end-to-end bidirectional temporal encoder network (BiteNet) then learns representations of the patient's journeys.
We have evaluated the effectiveness of our methods on two supervised prediction and two unsupervised clustering tasks with a real-world EHR dataset.
arXiv Detail & Related papers (2020-09-24T00:42:36Z) - Deep Representation Learning of Electronic Health Records to Unlock
Patient Stratification at Scale [0.5498849973527224]
We present an unsupervised framework based on deep learning to process heterogeneous EHRs.
We derive patient representations that can efficiently and effectively enable patient stratification at scale.
arXiv Detail & Related papers (2020-03-14T00:04:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.