Reliable Off-Policy Learning for Dosage Combinations
- URL: http://arxiv.org/abs/2305.19742v2
- Date: Fri, 27 Oct 2023 14:48:59 GMT
- Title: Reliable Off-Policy Learning for Dosage Combinations
- Authors: Jonas Schweisthal, Dennis Frauen, Valentyn Melnychuk, Stefan
Feuerriegel
- Abstract summary: Decision-making in personalized medicine must often make choices for dosage combinations.
We propose a novel method for reliable off-policy learning for dosage combinations.
- Score: 27.385663284378854
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Decision-making in personalized medicine such as cancer therapy or critical
care must often make choices for dosage combinations, i.e., multiple continuous
treatments. Existing work for this task has modeled the effect of multiple
treatments independently, while estimating the joint effect has received little
attention but comes with non-trivial challenges. In this paper, we propose a
novel method for reliable off-policy learning for dosage combinations. Our
method proceeds along three steps: (1) We develop a tailored neural network
that estimates the individualized dose-response function while accounting for
the joint effect of multiple dependent dosages. (2) We estimate the generalized
propensity score using conditional normalizing flows in order to detect regions
with limited overlap in the shared covariate-treatment space. (3) We present a
gradient-based learning algorithm to find the optimal, individualized dosage
combinations. Here, we ensure reliable estimation of the policy value by
avoiding regions with limited overlap. We finally perform an extensive
evaluation of our method to show its effectiveness. To the best of our
knowledge, ours is the first work to provide a method for reliable off-policy
learning for optimal dosage combinations.
Related papers
- Individualized Multi-Treatment Response Curves Estimation using RBF-net with Shared Neurons [1.1119247609126184]
Our non-parametric modeling of the response curves relies on radial basis function (RBF)-nets with shared hidden neurons.
Applying our proposed method to MIMIC data, we obtain several interesting findings related to the impact of different treatment strategies on the length of ICU stay and 12-hour SOFA score for sepsis patients who are home-discharged.
arXiv Detail & Related papers (2024-01-29T21:13:01Z) - The Blessings of Multiple Treatments and Outcomes in Treatment Effect
Estimation [53.81860494566915]
Existing studies leveraged proxy variables or multiple treatments to adjust for confounding bias.
In many real-world scenarios, there is greater interest in studying the effects on multiple outcomes.
We show that parallel studies of multiple outcomes involved in this setting can assist each other in causal identification.
arXiv Detail & Related papers (2023-09-29T14:33:48Z) - A Flexible Framework for Incorporating Patient Preferences Into
Q-Learning [1.2891210250935146]
In real-world healthcare problems, there are often multiple competing outcomes of interest, such as treatment efficacy and side effect severity.
statistical methods for estimating dynamic treatment regimes (DTRs) usually assume a single outcome of interest.
This includes restrictions to a single time point and two outcomes, the inability to incorporate self-reported patient preferences and limited theoretical guarantees.
arXiv Detail & Related papers (2023-07-22T08:58:07Z) - TCFimt: Temporal Counterfactual Forecasting from Individual Multiple
Treatment Perspective [50.675845725806724]
We propose a comprehensive framework of temporal counterfactual forecasting from an individual multiple treatment perspective (TCFimt)
TCFimt constructs adversarial tasks in a seq2seq framework to alleviate selection and time-varying bias and designs a contrastive learning-based block to decouple a mixed treatment effect into separated main treatment effects and causal interactions.
The proposed method shows satisfactory performance in predicting future outcomes with specific treatments and in choosing optimal treatment type and timing than state-of-the-art methods.
arXiv Detail & Related papers (2022-12-17T15:01:05Z) - Policy Learning for Optimal Individualized Dose Intervals [3.9801611649762263]
We propose a new method to estimate such a policy.
We prove that our estimated policy is consistent, and its risk converges to that of the best-in-class policy at a root-nn rate.
arXiv Detail & Related papers (2022-02-24T17:59:20Z) - SurvITE: Learning Heterogeneous Treatment Effects from Time-to-Event
Data [83.50281440043241]
We study the problem of inferring heterogeneous treatment effects from time-to-event data.
We propose a novel deep learning method for treatment-specific hazard estimation based on balancing representations.
arXiv Detail & Related papers (2021-10-26T20:13:17Z) - On Inductive Biases for Heterogeneous Treatment Effect Estimation [91.3755431537592]
We investigate how to exploit structural similarities of an individual's potential outcomes (POs) under different treatments.
We compare three end-to-end learning strategies to overcome this problem.
arXiv Detail & Related papers (2021-06-07T16:30:46Z) - Kernel Assisted Learning for Personalized Dose Finding [20.52632915107782]
An individualized dose rule recommends a dose level within a continuous safe dose range based on patient level information.
In this article, we propose a kernel assisted learning method for estimating the optimal individualized dose rule.
arXiv Detail & Related papers (2020-07-19T23:03:26Z) - Learning for Dose Allocation in Adaptive Clinical Trials with Safety
Constraints [84.09488581365484]
Phase I dose-finding trials are increasingly challenging as the relationship between efficacy and toxicity of new compounds becomes more complex.
Most commonly used methods in practice focus on identifying a Maximum Tolerated Dose (MTD) by learning only from toxicity events.
We present a novel adaptive clinical trial methodology that aims at maximizing the cumulative efficacies while satisfying the toxicity safety constraint with high probability.
arXiv Detail & Related papers (2020-06-09T03:06:45Z) - Treatment Policy Learning in Multiobjective Settings with Fully Observed
Outcomes [6.944742823560999]
We present, compare, and evaluate three approaches for learning individualized treatment policies.
We show that all approaches learn policies that achieve strictly better performance on all outcomes than clinicians.
arXiv Detail & Related papers (2020-06-01T13:22:51Z) - Almost-Matching-Exactly for Treatment Effect Estimation under Network
Interference [73.23326654892963]
We propose a matching method that recovers direct treatment effects from randomized experiments where units are connected in an observed network.
Our method matches units almost exactly on counts of unique subgraphs within their neighborhood graphs.
arXiv Detail & Related papers (2020-03-02T15:21:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.