DeepEnroll: Patient-Trial Matching with Deep Embedding and Entailment
Prediction
- URL: http://arxiv.org/abs/2001.08179v2
- Date: Thu, 23 Jan 2020 02:39:47 GMT
- Title: DeepEnroll: Patient-Trial Matching with Deep Embedding and Entailment
Prediction
- Authors: Xingyao Zhang, Cao Xiao, Lucas M. Glass, Jimeng Sun
- Abstract summary: Clinical trials are essential for drug development but often suffer from expensive, inaccurate and insufficient patient recruitment.
DeepEnroll is a cross-modal inference learning model to jointly encode enrollment criteria (tabular data) into a shared latent space for matching inference.
- Score: 67.91606509226132
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Clinical trials are essential for drug development but often suffer from
expensive, inaccurate and insufficient patient recruitment. The core problem of
patient-trial matching is to find qualified patients for a trial, where patient
information is stored in electronic health records (EHR) while trial
eligibility criteria (EC) are described in text documents available on the web.
How to represent longitudinal patient EHR? How to extract complex logical rules
from EC? Most existing works rely on manual rule-based extraction, which is
time consuming and inflexible for complex inference. To address these
challenges, we proposed DeepEnroll, a cross-modal inference learning model to
jointly encode enrollment criteria (text) and patients records (tabular data)
into a shared latent space for matching inference. DeepEnroll applies a
pre-trained Bidirectional Encoder Representations from Transformers(BERT) model
to encode clinical trial information into sentence embedding. And uses a
hierarchical embedding model to represent patient longitudinal EHR. In
addition, DeepEnroll is augmented by a numerical information embedding and
entailment module to reason over numerical information in both EC and EHR.
These encoders are trained jointly to optimize patient-trial matching score. We
evaluated DeepEnroll on the trial-patient matching task with demonstrated on
real world datasets. DeepEnroll outperformed the best baseline by up to 12.4%
in average F1.
Related papers
- Towards Efficient Patient Recruitment for Clinical Trials: Application of a Prompt-Based Learning Model [0.7373617024876725]
Clinical trials are essential for advancing pharmaceutical interventions, but they face a bottleneck in selecting eligible participants.
The complex nature of unstructured medical texts presents challenges in efficiently identifying participants.
In this study, we aimed to evaluate the performance of a prompt-based large language model for the cohort selection task.
arXiv Detail & Related papers (2024-04-24T20:42:28Z) - SHAPE: A Sample-adaptive Hierarchical Prediction Network for Medication
Recommendation [22.899946140205962]
We propose a novel Sample-adaptive Hierarchical medicAtion Prediction nEtwork, termed SHAPE, to tackle the challenges in the medication recommendation task.
Specifically, we design a compact intra-visit set encoder to encode the relationship in the medical event for obtaining visit-level representation.
To endow the model with the capability of modeling the variable visit length, we introduce a soft curriculum learning method to assign the difficulty of each sample automatically by the visit length.
arXiv Detail & Related papers (2023-09-09T08:28:04Z) - TREEMENT: Interpretable Patient-Trial Matching via Personalized Dynamic
Tree-Based Memory Network [54.332862955411656]
Clinical trials are critical for drug development but often suffer from expensive and inefficient patient recruitment.
In recent years, machine learning models have been proposed for speeding up patient recruitment via automatically matching patients with clinical trials.
We introduce a dynamic tree-based memory network model named TREEMENT to provide accurate and interpretable patient trial matching.
arXiv Detail & Related papers (2023-07-19T12:35:09Z) - Sequential Diagnosis Prediction with Transformer and Ontological
Representation [35.88195694025553]
We propose an end-to-end robust transformer-based model called SETOR to handle irregular intervals between a patient's visits with admitted timestamps and length of stay in each visit.
Experiments conducted on two real-world healthcare datasets show that, our sequential diagnoses prediction model SETOR achieves better predictive results than previous state-of-the-art approaches.
arXiv Detail & Related papers (2021-09-07T13:09:55Z) - Bootstrapping Your Own Positive Sample: Contrastive Learning With
Electronic Health Record Data [62.29031007761901]
This paper proposes a novel contrastive regularized clinical classification model.
We introduce two unique positive sampling strategies specifically tailored for EHR data.
Our framework yields highly competitive experimental results in predicting the mortality risk on real-world COVID-19 EHR data.
arXiv Detail & Related papers (2021-04-07T06:02:04Z) - DeepRite: Deep Recurrent Inverse TreatmEnt Weighting for Adjusting
Time-varying Confounding in Modern Longitudinal Observational Data [68.29870617697532]
We propose Deep Recurrent Inverse TreatmEnt weighting (DeepRite) for time-varying confounding in longitudinal data.
DeepRite is shown to recover the ground truth from synthetic data, and estimate unbiased treatment effects from real data.
arXiv Detail & Related papers (2020-10-28T15:05:08Z) - BiteNet: Bidirectional Temporal Encoder Network to Predict Medical
Outcomes [53.163089893876645]
We propose a novel self-attention mechanism that captures the contextual dependency and temporal relationships within a patient's healthcare journey.
An end-to-end bidirectional temporal encoder network (BiteNet) then learns representations of the patient's journeys.
We have evaluated the effectiveness of our methods on two supervised prediction and two unsupervised clustering tasks with a real-world EHR dataset.
arXiv Detail & Related papers (2020-09-24T00:42:36Z) - COMPOSE: Cross-Modal Pseudo-Siamese Network for Patient Trial Matching [70.08786840301435]
We propose CrOss-Modal PseudO-SiamEse network (COMPOSE) to address these challenges for patient-trial matching.
Experiment results show COMPOSE can reach 98.0% AUC on patient-criteria matching and 83.7% accuracy on patient-trial matching.
arXiv Detail & Related papers (2020-06-15T21:01:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.