Identifying and Characterising Response in Clinical Trials: Development and Validation of a Machine Learning Approach in Colorectal Cancer
- URL: http://arxiv.org/abs/2603.00757v1
- Date: Sat, 28 Feb 2026 18:00:26 GMT
- Title: Identifying and Characterising Response in Clinical Trials: Development and Validation of a Machine Learning Approach in Colorectal Cancer
- Authors: Adam Marcus, Paul Agapow,
- Abstract summary: Precision medicine promises to transform health care by offering individualised treatments that dramatically improve clinical outcomes.<n>Current approaches are limited to static measures of treatment success, neglecting the repeated measures found in most clinical trials.<n>Our approach combines the concept of partly conditional modelling with treatment effect estimation based on the Virtual Twins method.<n>Performance was evaluated using synthetic data and applied to clinical trials examining the effectiveness of panitumumab to treat metastatic colorectal cancer.
- Score: 0.45835414225547183
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Precision medicine promises to transform health care by offering individualised treatments that dramatically improve clinical outcomes. A necessary prerequisite is to identify subgroups of patients who respond differently to different therapies. Current approaches are limited to static measures of treatment success, neglecting the repeated measures found in most clinical trials. Our approach combines the concept of partly conditional modelling with treatment effect estimation based on the Virtual Twins method. The resulting time-specific responses to treatment are characterised using survLIME, an extension of Local Interpretable Model-agnostic Explanations (LIME) to survival data. Performance was evaluated using synthetic data and applied to clinical trials examining the effectiveness of panitumumab to treat metastatic colorectal cancer. An area under the receiver operating characteristic curve (AUC) of 0.77 for identifying fixed responders was achieved in a 1000 patient simulation. When considering dynamic responders, partly conditional modelling increased the AUC from 0.597 to 0.685. Applying the approach to colorectal cancer trials found genetic mutations, sites of metastasis, and ethnicity as important factors for response to treatment. Our approach can accommodate a dynamic response to treatment while potentially providing better performance than existing methods in instances of a fixed response to treatment. When applied to clinical data we attain results consistent with the literature.
Related papers
- A Causal Machine Learning Framework for Treatment Personalization in Clinical Trials: Application to Ulcerative Colitis [0.7799711162530713]
We present a modular causal machine learning framework that evaluates each question separately.<n>We apply this framework to patient-level data from the UNIFI maintenance trial of ustekinumab in ulcerative colitis.
arXiv Detail & Related papers (2026-02-09T00:26:30Z) - Developing Predictive and Robust Radiomics Models for Chemotherapy Response in High-Grade Serous Ovarian Carcinoma [0.9165796164519936]
Neoadjuvant chemotherapy (NACT) is often used to reduce tumor burden before surgery, but about 40% of patients show limited response.<n>Radiomics, combined with machine learning (ML), offers a promising non-invasive method for predicting NACT response.<n>This study aimed to improve response prediction in HGSOC patients undergoing NACT by integration different feature selection methods.
arXiv Detail & Related papers (2026-01-13T11:29:02Z) - Dimension-reduced outcome-weighted learning for estimating individualized treatment regimes in observational studies [1.338174941551702]
Individualized treatment regimes (ITRs) aim to improve clinical outcomes by assigning treatment based on patient-specific characteristics.<n>We propose a novel sufficient dimension reduction approach that targets the contrast between potential outcomes and identifies a low-dimensional subspace.<n>We show that the proposed method achieves universal consistency, i.e., its risk converges to the Bayes risk, under mild regularity conditions.
arXiv Detail & Related papers (2026-01-11T05:38:08Z) - ClinDEF: A Dynamic Evaluation Framework for Large Language Models in Clinical Reasoning [58.01333341218153]
We propose ClinDEF, a dynamic framework for assessing clinical reasoning in LLMs through simulated diagnostic dialogues.<n>Our method generates patient cases and facilitates multi-turn interactions between an LLM-based doctor and an automated patient agent.<n>Experiments show that ClinDEF effectively exposes critical clinical reasoning gaps in state-of-the-art LLMs.
arXiv Detail & Related papers (2025-12-29T12:58:58Z) - Adaptive Identification and Modeling of Clinical Pathways with Process Mining [4.810514867998534]
Clinical pathways are specialized healthcare plans that model patient treatment procedures.<n>We propose a two-phase modeling method using process mining.<n>We demonstrate our approach using Synthea, a benchmark dataset simulating patient treatments for SARS-CoV-2 infections.
arXiv Detail & Related papers (2025-12-03T13:37:37Z) - Prior-informed optimization of treatment recommendation via bandit algorithms trained on large language model-processed historical records [0.6875312133832079]
Current medical practice depends on standardized treatment frameworks and empirical methodologies that neglect individual patient variations.<n>We develop a comprehensive system integrating Large Language Models (LLMs), Conditional Tabular Generative Adversarial Networks (CTGAN), T-learner counterfactual models, and contextual bandit approaches.
arXiv Detail & Related papers (2025-10-21T18:57:00Z) - TREEMENT: Interpretable Patient-Trial Matching via Personalized Dynamic
Tree-Based Memory Network [54.332862955411656]
Clinical trials are critical for drug development but often suffer from expensive and inefficient patient recruitment.
In recent years, machine learning models have been proposed for speeding up patient recruitment via automatically matching patients with clinical trials.
We introduce a dynamic tree-based memory network model named TREEMENT to provide accurate and interpretable patient trial matching.
arXiv Detail & Related papers (2023-07-19T12:35:09Z) - Exploiting segmentation labels and representation learning to forecast
therapy response of PDAC patients [60.78505216352878]
We propose a hybrid deep neural network pipeline to predict tumour response to initial chemotherapy.
We leverage a combination of representation transfer from segmentation to classification, as well as localisation and representation learning.
Our approach yields a remarkably data-efficient method able to predict treatment response with a ROC-AUC of 63.7% using only 477 datasets in total.
arXiv Detail & Related papers (2022-11-08T11:50:31Z) - Benchmarking Heterogeneous Treatment Effect Models through the Lens of
Interpretability [82.29775890542967]
Estimating personalized effects of treatments is a complex, yet pervasive problem.
Recent developments in the machine learning literature on heterogeneous treatment effect estimation gave rise to many sophisticated, but opaque, tools.
We use post-hoc feature importance methods to identify features that influence the model's predictions.
arXiv Detail & Related papers (2022-06-16T17:59:05Z) - Clinical Outcome Prediction from Admission Notes using Self-Supervised
Knowledge Integration [55.88616573143478]
Outcome prediction from clinical text can prevent doctors from overlooking possible risks.
Diagnoses at discharge, procedures performed, in-hospital mortality and length-of-stay prediction are four common outcome prediction targets.
We propose clinical outcome pre-training to integrate knowledge about patient outcomes from multiple public sources.
arXiv Detail & Related papers (2021-02-08T10:26:44Z) - MIA-Prognosis: A Deep Learning Framework to Predict Therapy Response [58.0291320452122]
This paper aims at a unified deep learning approach to predict patient prognosis and therapy response.
We formalize the prognosis modeling as a multi-modal asynchronous time series classification task.
Our predictive model could further stratify low-risk and high-risk patients in terms of long-term survival.
arXiv Detail & Related papers (2020-10-08T15:30:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.