COMPOSE: Cross-Modal Pseudo-Siamese Network for Patient Trial Matching
- URL: http://arxiv.org/abs/2006.08765v1
- Date: Mon, 15 Jun 2020 21:01:33 GMT
- Title: COMPOSE: Cross-Modal Pseudo-Siamese Network for Patient Trial Matching
- Authors: Junyi Gao, Cao Xiao, Lucas M. Glass, Jimeng Sun
- Abstract summary: We propose CrOss-Modal PseudO-SiamEse network (COMPOSE) to address these challenges for patient-trial matching.
Experiment results show COMPOSE can reach 98.0% AUC on patient-criteria matching and 83.7% accuracy on patient-trial matching.
- Score: 70.08786840301435
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Clinical trials play important roles in drug development but often suffer
from expensive, inaccurate and insufficient patient recruitment. The
availability of massive electronic health records (EHR) data and trial
eligibility criteria (EC) bring a new opportunity to data driven patient
recruitment. One key task named patient-trial matching is to find qualified
patients for clinical trials given structured EHR and unstructured EC text
(both inclusion and exclusion criteria). How to match complex EC text with
longitudinal patient EHRs? How to embed many-to-many relationships between
patients and trials? How to explicitly handle the difference between inclusion
and exclusion criteria? In this paper, we proposed CrOss-Modal PseudO-SiamEse
network (COMPOSE) to address these challenges for patient-trial matching. One
path of the network encodes EC using convolutional highway network. The other
path processes EHR with multi-granularity memory network that encodes
structured patient records into multiple levels based on medical ontology.
Using the EC embedding as query, COMPOSE performs attentional record alignment
and thus enables dynamic patient-trial matching. COMPOSE also introduces a
composite loss term to maximize the similarity between patient records and
inclusion criteria while minimize the similarity to the exclusion criteria.
Experiment results show COMPOSE can reach 98.0% AUC on patient-criteria
matching and 83.7% accuracy on patient-trial matching, which leads 24.3%
improvement over the best baseline on real-world patient-trial matching tasks.
Related papers
- TrialBench: Multi-Modal Artificial Intelligence-Ready Clinical Trial Datasets [57.067409211231244]
This paper presents meticulously curated AIready datasets covering multi-modal data (e.g., drug molecule, disease code, text, categorical/numerical features) and 8 crucial prediction challenges in clinical trial design.
We provide basic validation methods for each task to ensure the datasets' usability and reliability.
We anticipate that the availability of such open-access datasets will catalyze the development of advanced AI approaches for clinical trial design.
arXiv Detail & Related papers (2024-06-30T09:13:10Z) - Zero-Shot Clinical Trial Patient Matching with LLMs [40.31971412825736]
Large language models (LLMs) offer a promising solution to automated screening.
We design an LLM-based system which, given a patient's medical history as unstructured clinical text, evaluates whether that patient meets a set of inclusion criteria.
Our system achieves state-of-the-art scores on the n2c2 2018 cohort selection benchmark.
arXiv Detail & Related papers (2024-02-05T00:06:08Z) - TREEMENT: Interpretable Patient-Trial Matching via Personalized Dynamic
Tree-Based Memory Network [54.332862955411656]
Clinical trials are critical for drug development but often suffer from expensive and inefficient patient recruitment.
In recent years, machine learning models have been proposed for speeding up patient recruitment via automatically matching patients with clinical trials.
We introduce a dynamic tree-based memory network model named TREEMENT to provide accurate and interpretable patient trial matching.
arXiv Detail & Related papers (2023-07-19T12:35:09Z) - Towards Fair Patient-Trial Matching via Patient-Criterion Level Fairness
Constraint [50.35075018041199]
This work proposes a fair patient-trial matching framework by generating a patient-criterion level fairness constraint.
The experimental results on real-world patient-trial and patient-criterion matching tasks demonstrate that the proposed framework can successfully alleviate the predictions that tend to be biased.
arXiv Detail & Related papers (2023-03-24T03:59:19Z) - Large Language Models for Healthcare Data Augmentation: An Example on
Patient-Trial Matching [49.78442796596806]
We propose an innovative privacy-aware data augmentation approach for patient-trial matching (LLM-PTM)
Our experiments demonstrate a 7.32% average improvement in performance using the proposed LLM-PTM method, and the generalizability to new data is improved by 12.12%.
arXiv Detail & Related papers (2023-03-24T03:14:00Z) - Deep Representation Learning of Electronic Health Records to Unlock
Patient Stratification at Scale [0.5498849973527224]
We present an unsupervised framework based on deep learning to process heterogeneous EHRs.
We derive patient representations that can efficiently and effectively enable patient stratification at scale.
arXiv Detail & Related papers (2020-03-14T00:04:20Z) - DeepEnroll: Patient-Trial Matching with Deep Embedding and Entailment
Prediction [67.91606509226132]
Clinical trials are essential for drug development but often suffer from expensive, inaccurate and insufficient patient recruitment.
DeepEnroll is a cross-modal inference learning model to jointly encode enrollment criteria (tabular data) into a shared latent space for matching inference.
arXiv Detail & Related papers (2020-01-22T17:51:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.