Related papers: CLaDMoP: Learning Transferrable Models from Successful Clinical Trials via LLMs

CLaDMoP: Learning Transferrable Models from Successful Clinical Trials via LLMs

URL: http://arxiv.org/abs/2505.18527v1
Date: Sat, 24 May 2025 05:45:32 GMT
Title: CLaDMoP: Learning Transferrable Models from Successful Clinical Trials via LLMs
Authors: Yiqing Zhang, Xiaozhong Liu, Fabricio Murai,
Abstract summary: We introduce CLaDMoP, a new pre-training approach for clinical trial outcome prediction.<n>CLaDMoP encodes trials' eligibility criteria-linked to a lightweight Drug-Molecule branch through a novel multi-level fusion technique.<n>CLaDMoP achieves up to 10.5% improvement in PR-AUC and 3.6% in ROC-AUC, while attaining comparable F1 score to MEXA-CTP.
Score: 14.116060944536011
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Many existing models for clinical trial outcome prediction are optimized using task-specific loss functions on trial phase-specific data. While this scheme may boost prediction for common diseases and drugs, it can hinder learning of generalizable representations, leading to more false positives/negatives. To address this limitation, we introduce CLaDMoP, a new pre-training approach for clinical trial outcome prediction, alongside the Successful Clinical Trials dataset(SCT), specifically designed for this task. CLaDMoP leverages a Large Language Model-to encode trials' eligibility criteria-linked to a lightweight Drug-Molecule branch through a novel multi-level fusion technique. To efficiently fuse long embeddings across levels, we incorporate a grouping block, drastically reducing computational overhead. CLaDMoP avoids reliance on task-specific objectives by pre-training on a "pair matching" proxy task. Compared to established zero-shot and few-shot baselines, our method significantly improves both PR-AUC and ROC-AUC, especially for phase I and phase II trials. We further evaluate and perform ablation on CLaDMoP after Parameter-Efficient Fine-Tuning, comparing it to state-of-the-art supervised baselines, including MEXA-CTP, on the Trial Outcome Prediction(TOP) benchmark. CLaDMoP achieves up to 10.5% improvement in PR-AUC and 3.6% in ROC-AUC, while attaining comparable F1 score to MEXA-CTP, highlighting its potential for clinical trial outcome prediction. Code and SCT dataset can be downloaded from https://github.com/murai-lab/CLaDMoP.

Related papers

AUTOCT: Automating Interpretable Clinical Trial Prediction with LLM Agents [47.640779069547534]
AutoCT is a novel framework that combines the reasoning capabilities of large language models with the explainability of classical machine learning.<n>We show that AutoCT performs on par with or better than SOTA methods on clinical trial prediction tasks within only a limited number of self-refinement iterations.
arXiv Detail & Related papers (2025-06-04T11:50:55Z)
MEXA-CTP: Mode Experts Cross-Attention for Clinical Trial Outcome Prediction [14.116060944536011]
We propose a light-weight attention-based model, MEXA-CTP, to integrate readily-available multi-modal data and generate effective representations.<n>Our experiments on the Trial Outcome Prediction benchmark demonstrate that MEXA-CTP improves upon existing approaches by up to 11.3% in F1 score, 12.2% in PR-AUC, and 2.5% in ROC-AUC.
arXiv Detail & Related papers (2025-01-12T14:35:31Z)
Scaling Laws for Predicting Downstream Performance in LLMs [75.28559015477137]
This work focuses on the pre-training loss as a more computation-efficient metric for performance estimation.<n>We present FLP-M, a fundamental approach for performance prediction that addresses the practical need to integrate datasets from multiple sources during pre-training.
arXiv Detail & Related papers (2024-10-11T04:57:48Z)
Early Prediction of Causes (not Effects) in Healthcare by Long-Term Clinical Time Series Forecasting [11.96384267146423]
We propose to directly predict the causes via time series forecasting (TSF) of clinical variables. Because model training does not rely on a particular label anymore, the forecasted data can be used to predict any consensus-based label.
arXiv Detail & Related papers (2024-08-07T14:52:06Z)
An Efficient and Flexible Deep Learning Method for Signal Delineation via Keypoints Estimation [4.2807943283312095]
We present Keypoint Estimation for Electrocardiogram Delineation (KEED), a novel DL model designed for keypoint estimation. KEED organically offers an output aligned with clinical expectations. Remarkably, KEED significantly outperforms despite being optimized with an extremely limited annotated data.
arXiv Detail & Related papers (2024-07-24T08:13:10Z)
Hyperparameters in Continual Learning: A Reality Check [53.30082523545212]
Continual learning (CL) aims to train a model on a sequence of tasks while balancing the trade-off between plasticity (learning new tasks) and stability (retaining prior knowledge)
arXiv Detail & Related papers (2024-03-14T03:13:01Z)
Improving Clinical Decision Support through Interpretable Machine Learning and Error Handling in Electronic Health Records [6.594072648536156]
Trust-MAPS translates clinical domain knowledge into high-dimensional, mixed-integer programming models.<n>Trust-scores emerge as clinically meaningful features that not only boost predictive performance for clinical decision support tasks, but also lend interpretability to ML models.
arXiv Detail & Related papers (2023-08-21T15:14:49Z)
SECRETS: Subject-Efficient Clinical Randomized Controlled Trials using Synthetic Intervention [0.0]
Cross-over trials can reduce sample size requirements by measuring the treatment effect per individual. We propose a novel framework, SECRETS, which estimates the individual treatment effect (ITE) per patient in the RCT study without using any external data. We show that SECRETS can improve the power of an RCT while maintaining comparable significance levels.
arXiv Detail & Related papers (2023-05-08T22:37:16Z)
SPOT: Sequential Predictive Modeling of Clinical Trial Outcome with Meta-Learning [67.8195828626489]
Clinical trials are essential to drug development but time-consuming, costly, and prone to failure. We propose Sequential Predictive mOdeling of clinical Trial outcome (SPOT) that first identifies trial topics to cluster the multi-sourced trial data into relevant trial topics. With the consideration of each trial sequence as a task, it uses a meta-learning strategy to achieve a point where the model can rapidly adapt to new tasks with minimal updates.
arXiv Detail & Related papers (2023-04-07T23:04:27Z)
Robust and Agnostic Learning of Conditional Distributional Treatment Effects [62.44901952244514]
The conditional average treatment effect (CATE) is the best point prediction of individual causal effects. In aggregate analyses, this is usually addressed by measuring distributional treatment effect (DTE) We provide a new robust and model-agnostic methodology for learning the conditional DTE (CDTE) for a wide class of problems.
arXiv Detail & Related papers (2022-05-23T17:40:31Z)
Bayesian prognostic covariate adjustment [59.75318183140857]
Historical data about disease outcomes can be integrated into the analysis of clinical trials in many ways. We build on existing literature that uses prognostic scores from a predictive model to increase the efficiency of treatment effect estimates.
arXiv Detail & Related papers (2020-12-24T05:19:03Z)
Increasing the efficiency of randomized trial estimates via linear adjustment for a prognostic score [59.75318183140857]
Estimating causal effects from randomized experiments is central to clinical research. Most methods for historical borrowing achieve reductions in variance by sacrificing strict type-I error rate control.
arXiv Detail & Related papers (2020-12-17T21:10:10Z)
Deep Learning for Virtual Screening: Five Reasons to Use ROC Cost Functions [80.12620331438052]
deep learning has become an important tool for rapid screening of billions of molecules in silico for potential hits containing desired chemical features. Despite its importance, substantial challenges persist in training these models, such as severe class imbalance, high decision thresholds, and lack of ground truth labels in some datasets. We argue in favor of directly optimizing the receiver operating characteristic (ROC) in such cases, due to its robustness to class imbalance.
arXiv Detail & Related papers (2020-06-25T08:46:37Z)

This list is automatically generated from the titles and abstracts of the papers in this site.