Efficient and robust transfer learning of optimal individualized
treatment regimes with right-censored survival data
- URL: http://arxiv.org/abs/2301.05491v1
- Date: Fri, 13 Jan 2023 11:47:10 GMT
- Title: Efficient and robust transfer learning of optimal individualized
treatment regimes with right-censored survival data
- Authors: Pan Zhao, Julie Josse, Shu Yang
- Abstract summary: An individualized treatment regime (ITR) is a decision rule that assigns treatments based on patients' characteristics.
We propose a doubly robust estimator of the value function, and the optimal ITR is learned by maximizing the value function within a pre-specified class of ITRs.
We evaluate the empirical performance of the proposed method by simulation studies and a real data application of sodium bicarbonate therapy for patients with severe metabolic acidaemia.
- Score: 7.308241944759317
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: An individualized treatment regime (ITR) is a decision rule that assigns
treatments based on patients' characteristics. The value function of an ITR is
the expected outcome in a counterfactual world had this ITR been implemented.
Recently, there has been increasing interest in combining heterogeneous data
sources, such as leveraging the complementary features of randomized controlled
trial (RCT) data and a large observational study (OS). Usually, a covariate
shift exists between the source and target population, rendering the
source-optimal ITR unnecessarily optimal for the target population. We present
an efficient and robust transfer learning framework for estimating the optimal
ITR with right-censored survival data that generalizes well to the target
population. The value function accommodates a broad class of functionals of
survival distributions, including survival probabilities and restrictive mean
survival times (RMSTs). We propose a doubly robust estimator of the value
function, and the optimal ITR is learned by maximizing the value function
within a pre-specified class of ITRs. We establish the $N^{-1/3}$ rate of
convergence for the estimated parameter indexing the optimal ITR, and show that
the proposed optimal value estimator is consistent and asymptotically normal
even with flexible machine learning methods for nuisance parameter estimation.
We evaluate the empirical performance of the proposed method by simulation
studies and a real data application of sodium bicarbonate therapy for patients
with severe metabolic acidaemia in the intensive care unit (ICU), combining a
RCT and an observational study with heterogeneity.
Related papers
- Adaptive-TMLE for the Average Treatment Effect based on Randomized Controlled Trial Augmented with Real-World Data [0.0]
We consider the problem of estimating the average treatment effect (ATE) when both randomized control trial (RCT) data and real-world data (RWD) are available.
We introduce an adaptive targeted minimum loss-based estimation framework to estimate them.
arXiv Detail & Related papers (2024-05-12T07:10:26Z) - Robust Learning for Optimal Dynamic Treatment Regimes with Observational Data [0.0]
We study the statistical learning of optimal dynamic treatment regimes (DTRs) that guide the optimal treatment assignment for each individual at each stage based on the individual's evolving history.
arXiv Detail & Related papers (2024-03-30T02:33:39Z) - Fusing Individualized Treatment Rules Using Secondary Outcomes [7.657053163626398]
We learn an ITR that not only maximizes the value function for the primary outcome, but also approximates the optimal rule for the secondary outcomes.
Two algorithms are proposed to estimate the ITR using surrogate loss functions.
We prove that the agreement rate between the estimated ITR of the primary outcome and the optimal ITRs of the secondary outcomes converges to the true agreement rate faster than if the secondary outcomes are not taken into consideration.
arXiv Detail & Related papers (2024-02-13T22:16:36Z) - B-Learner: Quasi-Oracle Bounds on Heterogeneous Causal Effects Under
Hidden Confounding [51.74479522965712]
We propose a meta-learner called the B-Learner, which can efficiently learn sharp bounds on the CATE function under limits on hidden confounding.
We prove its estimates are valid, sharp, efficient, and have a quasi-oracle property with respect to the constituent estimators under more general conditions than existing methods.
arXiv Detail & Related papers (2023-04-20T18:07:19Z) - Auto-FedRL: Federated Hyperparameter Optimization for
Multi-institutional Medical Image Segmentation [48.821062916381685]
Federated learning (FL) is a distributed machine learning technique that enables collaborative model training while avoiding explicit data sharing.
In this work, we propose an efficient reinforcement learning(RL)-based federated hyperparameter optimization algorithm, termed Auto-FedRL.
The effectiveness of the proposed method is validated on a heterogeneous data split of the CIFAR-10 dataset and two real-world medical image segmentation datasets.
arXiv Detail & Related papers (2022-03-12T04:11:42Z) - Targeted Optimal Treatment Regime Learning Using Summary Statistics [12.767669486030352]
We consider an ITR estimation problem where the source and target populations may be heterogeneous.
We develop a weighting framework that tailors an ITR for a given target population by leveraging the available summary statistics.
Specifically, we propose a calibrated augmented inverse probability weighted estimator of the value function for the target population and estimate an optimal ITR.
arXiv Detail & Related papers (2022-01-17T06:11:31Z) - Assessment of Treatment Effect Estimators for Heavy-Tailed Data [70.72363097550483]
A central obstacle in the objective assessment of treatment effect (TE) estimators in randomized control trials (RCTs) is the lack of ground truth (or validation set) to test their performance.
We provide a novel cross-validation-like methodology to address this challenge.
We evaluate our methodology across 709 RCTs implemented in the Amazon supply chain.
arXiv Detail & Related papers (2021-12-14T17:53:01Z) - Stochastic Optimization of Areas Under Precision-Recall Curves with
Provable Convergence [66.83161885378192]
Area under ROC (AUROC) and precision-recall curves (AUPRC) are common metrics for evaluating classification performance for imbalanced problems.
We propose a technical method to optimize AUPRC for deep learning.
arXiv Detail & Related papers (2021-04-18T06:22:21Z) - Bootstrapping Your Own Positive Sample: Contrastive Learning With
Electronic Health Record Data [62.29031007761901]
This paper proposes a novel contrastive regularized clinical classification model.
We introduce two unique positive sampling strategies specifically tailored for EHR data.
Our framework yields highly competitive experimental results in predicting the mortality risk on real-world COVID-19 EHR data.
arXiv Detail & Related papers (2021-04-07T06:02:04Z) - Estimating heterogeneous survival treatment effect in observational data
using machine learning [9.951103976634407]
Methods for estimating heterogeneous treatment effect in observational data have largely focused on continuous or binary outcomes.
Using flexible machine learning methods in the counterfactual framework is a promising approach to address challenges due to complex individual characteristics.
arXiv Detail & Related papers (2020-08-17T01:02:14Z) - DTR Bandit: Learning to Make Response-Adaptive Decisions With Low Regret [59.81290762273153]
Dynamic treatment regimes (DTRs) are personalized, adaptive, multi-stage treatment plans that adapt treatment decisions to an individual's initial features and to intermediate outcomes and features at each subsequent stage.
We propose a novel algorithm that, by carefully balancing exploration and exploitation, is guaranteed to achieve rate-optimal regret when the transition and reward models are linear.
arXiv Detail & Related papers (2020-05-06T13:03:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.