Stage-Aware Learning for Dynamic Treatments
- URL: http://arxiv.org/abs/2310.19300v1
- Date: Mon, 30 Oct 2023 06:35:31 GMT
- Title: Stage-Aware Learning for Dynamic Treatments
- Authors: Hanwen Ye, Wenzhuo Zhou, Ruoqing Zhu, Annie Qu
- Abstract summary: We propose a novel individualized learning method for dynamic treatment regimes.
We focus on prioritizing alignment between the observed treatment trajectory and the one obtained by the optimal regime across decision stages.
By relaxing the restriction that the observed trajectory must be fully aligned with the optimal treatments, our approach substantially improves the sample efficiency and stability of inverse probability weighted methods.
- Score: 4.033641609534417
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recent advances in dynamic treatment regimes (DTRs) provide powerful optimal
treatment searching algorithms, which are tailored to individuals' specific
needs and able to maximize their expected clinical benefits. However, existing
algorithms could suffer from insufficient sample size under optimal treatments,
especially for chronic diseases involving long stages of decision-making. To
address these challenges, we propose a novel individualized learning method
which estimates the DTR with a focus on prioritizing alignment between the
observed treatment trajectory and the one obtained by the optimal regime across
decision stages. By relaxing the restriction that the observed trajectory must
be fully aligned with the optimal treatments, our approach substantially
improves the sample efficiency and stability of inverse probability weighting
(IPW) based methods. In particular, the proposed learning scheme builds a more
general framework that includes the popular outcome weighted learning
framework as a special case. Moreover, we introduce the notion of stage
importance scores along with an attention mechanism to explicitly account for
heterogeneity among decision stages. We establish the theoretical properties of
the proposed approach, including the Fisher consistency and finite-sample
performance bound. Empirically, we evaluate the proposed method in extensive
simulated environments and a real case study of the COVID-19 pandemic.
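As a rough illustration of the inverse probability weighting idea that outcome weighted learning builds on (function and variable names here are our own, not from the paper), the value of a candidate decision rule can be estimated from observational data as:

```python
import numpy as np

def ipw_value_estimate(y, a, d, propensity):
    """IPW estimate of the value of a decision rule.

    y          : observed outcomes (larger is better)
    a          : observed treatments
    d          : treatments the candidate rule would assign
    propensity : P(A = a | X), probability of the observed treatment
    """
    # Indicator that the observed treatment matches the rule's recommendation
    match = (a == d).astype(float)
    # Reweight matched outcomes by the inverse propensity
    return np.mean(y * match / propensity)
```

Outcome weighted learning maximizes this quantity over a class of rules; the stage-aware scheme described above relaxes the strict `a == d` alignment requirement so that trajectories only partially aligned with the optimal regime still contribute, which is the source of the sample-efficiency gain.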
Related papers
- Likelihood Ratio Confidence Sets for Sequential Decision Making [51.66638486226482]
We revisit the likelihood-based inference principle and propose to use likelihood ratios to construct valid confidence sequences.
Our method is especially suitable for problems with well-specified likelihoods.
We show how to provably choose the best sequence of estimators and shed light on connections to online convex optimization.
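A common way to turn likelihood ratios into an anytime-valid confidence sequence is the "running MLE" ratio: candidates whose likelihood ratio against a prequential plug-in estimate stays below 1/alpha are retained. The sketch below, for a Bernoulli mean, is a standard construction shown for background; its details are illustrative and not taken from the paper:

```python
import numpy as np

def bernoulli_lr_confidence_sequence(xs, alpha=0.05, grid=None):
    """Anytime-valid confidence sequence for a Bernoulli mean via a
    running-MLE likelihood-ratio process (illustrative sketch)."""
    if grid is None:
        grid = np.linspace(0.001, 0.999, 999)
    log_ratio = np.zeros_like(grid)
    n, s = 0, 0
    sets = []
    for x in xs:
        # Prequential plug-in estimate from data seen so far,
        # lightly smoothed to avoid log(0)
        p_hat = (s + 0.5) / (n + 1.0)
        # Update the log likelihood ratio against every candidate p
        log_ratio += (np.log(p_hat if x else 1 - p_hat)
                      - np.log(np.where(x, grid, 1 - grid)))
        n, s = n + 1, s + x
        # Candidates whose ratio stays below 1/alpha remain in the set
        sets.append(grid[log_ratio < np.log(1 / alpha)])
    return sets
```

The set shrinks as evidence accumulates while remaining valid at every stopping time, which is what makes this style of construction suitable for sequential decision making.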
arXiv Detail & Related papers (2023-11-08T00:10:21Z)
- A Flexible Framework for Incorporating Patient Preferences Into Q-Learning [1.2891210250935146]
In real-world healthcare problems, there are often multiple competing outcomes of interest, such as treatment efficacy and side effect severity.
Statistical methods for estimating dynamic treatment regimes (DTRs) usually assume a single outcome of interest.
Existing methods also have notable limitations, including restrictions to a single time point and two outcomes, the inability to incorporate self-reported patient preferences, and limited theoretical guarantees.
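For context, standard single-outcome Q-learning for a two-stage DTR proceeds by backward induction: fit the stage-2 Q-function, compute the value of acting optimally at stage 2, then fit the stage-1 Q-function to that pseudo-outcome. A minimal least-squares sketch (our own illustration, assuming linear Q-functions, scalar states, and binary treatments):

```python
import numpy as np

def design(x, a):
    # Linear Q-function features: intercept, state, treatment, interaction
    return np.column_stack([np.ones_like(x), x, a, a * x])

def two_stage_q_learning(x1, a1, x2, a2, y):
    """Backward-induction Q-learning for a two-stage DTR (sketch)."""
    # Stage 2: fit Q2(x2, a2) ~ E[Y | x2, a2] by least squares
    beta2, *_ = np.linalg.lstsq(design(x2, a2), y, rcond=None)
    # Pseudo-outcome: value of acting optimally at stage 2
    v2 = np.maximum(design(x2, np.zeros_like(a2)) @ beta2,
                    design(x2, np.ones_like(a2)) @ beta2)
    # Stage 1: fit Q1(x1, a1) ~ E[V2 | x1, a1]
    beta1, *_ = np.linalg.lstsq(design(x1, a1), v2, rcond=None)
    return beta1, beta2
```

The estimated rule at each stage picks the treatment with the larger fitted Q-value; incorporating multiple outcomes or patient preferences, as the paper above proposes, requires modifying the pseudo-outcome step.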
arXiv Detail & Related papers (2023-07-22T08:58:07Z)
- Provably Efficient UCB-type Algorithms For Learning Predictive State Representations [55.00359893021461]
The sequential decision-making problem is statistically learnable if it admits a low-rank structure modeled by predictive state representations (PSRs).
This paper proposes the first known UCB-type approach for PSRs, featuring a novel bonus term that upper bounds the total variation distance between the estimated and true models.
In contrast to existing approaches for PSRs, our UCB-type algorithms enjoy computational tractability, a last-iterate guarantee of a near-optimal policy, and guaranteed model accuracy.
arXiv Detail & Related papers (2023-07-01T18:35:21Z)
- Pruning the Way to Reliable Policies: A Multi-Objective Deep Q-Learning Approach to Critical Care [68.8204255655161]
We introduce a deep Q-learning approach able to obtain more reliable critical care policies.
We achieve this by first pruning the action set based on all available rewards, and second training a final model based on the sparse main reward but with a restricted action set.
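The pruning step of this two-phase idea can be sketched as follows; the maximin ranking rule below is one plausible reading of "pruning based on all available rewards", not the paper's exact criterion:

```python
import numpy as np

def prune_actions(q_values_per_reward, keep_frac=0.5):
    """Keep actions that rank well across all reward channels.

    q_values_per_reward : array (n_rewards, n_actions) of estimated Q-values
    Returns the indices of actions retained for final training on the
    sparse main reward.
    """
    # Rank actions within each reward channel (rank 0 = worst)
    ranks = q_values_per_reward.argsort(axis=1).argsort(axis=1)
    # Conservative score: an action's worst rank across channels
    worst_rank = ranks.min(axis=0)
    k = max(1, int(keep_frac * q_values_per_reward.shape[1]))
    # Retain the k actions with the best worst-case rank (maximin)
    return np.argsort(worst_rank)[::-1][:k]
```

The final model is then trained as usual on the main reward, but with its action space restricted to the returned indices, which is what makes the resulting policy more conservative and reliable.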
arXiv Detail & Related papers (2023-06-13T18:02:57Z)
- Quasi-optimal Reinforcement Learning with Continuous Actions [8.17049210746654]
We develop a novel quasi-optimal learning algorithm, which can be easily optimized in off-policy settings.
We evaluate our algorithm with comprehensive simulated experiments and a real dose-suggestion application to the Ohio Type 1 diabetes dataset.
arXiv Detail & Related papers (2023-01-21T11:30:13Z)
- TCFimt: Temporal Counterfactual Forecasting from Individual Multiple Treatment Perspective [50.675845725806724]
We propose a comprehensive framework of temporal counterfactual forecasting from an individual multiple treatment perspective (TCFimt).
TCFimt constructs adversarial tasks in a seq2seq framework to alleviate selection and time-varying bias and designs a contrastive learning-based block to decouple a mixed treatment effect into separated main treatment effects and causal interactions.
The proposed method outperforms state-of-the-art methods in predicting future outcomes under specific treatments and in choosing optimal treatment type and timing.
arXiv Detail & Related papers (2022-12-17T15:01:05Z)
- SurvITE: Learning Heterogeneous Treatment Effects from Time-to-Event Data [83.50281440043241]
We study the problem of inferring heterogeneous treatment effects from time-to-event data.
We propose a novel deep learning method for treatment-specific hazard estimation based on balancing representations.
arXiv Detail & Related papers (2021-10-26T20:13:17Z)
- Estimating Optimal Infinite Horizon Dynamic Treatment Regimes via pT-Learning [2.0625936401496237]
Recent advances in mobile health (mHealth) technology provide an effective way to monitor individuals' health statuses and deliver just-in-time personalized interventions.
The practical use of mHealth technology raises unique challenges to existing methodologies on learning an optimal dynamic treatment regime.
We propose a proximal temporal learning (pT-Learning) framework to estimate an optimal regime that adaptively adjusts between deterministic and sparse policy models.
arXiv Detail & Related papers (2021-10-20T18:38:22Z)
- Stochastic Intervention for Causal Inference via Reinforcement Learning [7.015556609676951]
Central to causal inference is the treatment effect estimation of intervention strategies.
Existing methods are mostly restricted to deterministic treatments and compare outcomes under different treatments.
We propose a new effective framework to estimate the treatment effect of stochastic intervention strategies.
arXiv Detail & Related papers (2021-05-28T00:11:22Z)
- Resource Planning for Hospitals Under Special Consideration of the COVID-19 Pandemic: Optimization and Sensitivity Analysis [87.31348761201716]
Crises like the COVID-19 pandemic pose a serious challenge to health-care institutions.
BaBSim.Hospital is a tool for capacity planning based on discrete event simulation.
We aim to investigate and optimize the simulation parameters to improve BaBSim.Hospital.
arXiv Detail & Related papers (2021-05-16T12:38:35Z)
- Multicategory Angle-based Learning for Estimating Optimal Dynamic Treatment Regimes with Censored Data [12.499787110182632]
An optimal dynamic treatment regime (DTR) consists of a sequence of decision rules that maximize long-term benefits.
In this paper, we develop a novel angle-based approach to target the optimal DTR under a multicategory treatment framework.
Our numerical studies show that the proposed method outperforms competing methods in terms of maximizing the conditional survival function.
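Angle-based multicategory learning typically encodes k treatment arms as the vertices of a regular simplex in R^(k-1), so that a single (k-1)-dimensional decision function can discriminate all arms by angle. The sketch below shows the standard simplex construction from angle-based classification as general background; the paper's censored-data machinery is omitted:

```python
import numpy as np

def simplex_vertices(k):
    """Vertices of a regular simplex in R^(k-1) encoding k treatment arms.

    The vertices have unit norm and equal pairwise inner product
    -1/(k-1), so every pair of arms is separated by the same angle.
    """
    w = np.zeros((k, k - 1))
    w[0] = (k - 1) ** -0.5 * np.ones(k - 1)
    for j in range(1, k):
        w[j] = -(1 + np.sqrt(k)) / ((k - 1) ** 1.5) * np.ones(k - 1)
        w[j, j - 1] += np.sqrt(k / (k - 1))
    return w
```

A fitted decision function f(x) in R^(k-1) then assigns the arm whose vertex has the largest inner product with f(x), avoiding the redundancy of one-vs-rest encodings.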
arXiv Detail & Related papers (2020-01-14T05:19:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of the listed information and is not responsible for any consequences of its use.