Treatment Policy Learning in Multiobjective Settings with Fully Observed
Outcomes
- URL: http://arxiv.org/abs/2006.00927v2
- Date: Thu, 13 Aug 2020 02:06:50 GMT
- Title: Treatment Policy Learning in Multiobjective Settings with Fully Observed
Outcomes
- Authors: Soorajnath Boominathan, Michael Oberst, Helen Zhou, Sanjat Kanjilal,
David Sontag
- Abstract summary: We present, compare, and evaluate three approaches for learning individualized treatment policies.
We show that all approaches learn policies that achieve strictly better performance on all outcomes than clinicians.
- Score: 6.944742823560999
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In several medical decision-making problems, such as antibiotic prescription,
laboratory testing can provide precise indications for how a patient will
respond to different treatment options. This enables us to "fully observe" all
potential treatment outcomes, but while present in historical data, these
results are infeasible to produce in real-time at the point of the initial
treatment decision. Moreover, treatment policies in these settings often need
to trade off between multiple competing objectives, such as effectiveness of
treatment and harmful side effects. We present, compare, and evaluate three
approaches for learning individualized treatment policies in this setting:
First, we consider two indirect approaches, which use predictive models of
treatment response to construct policies optimal for different trade-offs
between objectives. Second, we consider a direct approach that constructs such
a set of policies without intermediate models of outcomes. Using a medical
dataset of Urinary Tract Infection (UTI) patients, we show that all approaches
learn policies that achieve strictly better performance on all outcomes than
clinicians, while also trading off between different objectives. We demonstrate
additional benefits of the direct approach, including flexibly incorporating
other goals such as deferral to physicians on simple cases.
Related papers
- Pruning the Path to Optimal Care: Identifying Systematically Suboptimal Medical Decision-Making with Inverse Reinforcement Learning [14.688842697886484]
We present a novel application of Inverse Reinforcement Learning that identifies suboptimal clinician actions based on the actions of their peers.
This approach centers two stages of IRL with an intermediate step to prune trajectories displaying behavior that deviates significantly from the consensus.
arXiv Detail & Related papers (2024-11-07T23:16:59Z) - The Blessings of Multiple Treatments and Outcomes in Treatment Effect
Estimation [53.81860494566915]
Existing studies leveraged proxy variables or multiple treatments to adjust for confounding bias.
In many real-world scenarios, there is greater interest in studying the effects on multiple outcomes.
We show that parallel studies of multiple outcomes involved in this setting can assist each other in causal identification.
arXiv Detail & Related papers (2023-09-29T14:33:48Z) - A Flexible Framework for Incorporating Patient Preferences Into
Q-Learning [1.2891210250935146]
In real-world healthcare problems, there are often multiple competing outcomes of interest, such as treatment efficacy and side effect severity.
statistical methods for estimating dynamic treatment regimes (DTRs) usually assume a single outcome of interest.
This includes restrictions to a single time point and two outcomes, the inability to incorporate self-reported patient preferences and limited theoretical guarantees.
arXiv Detail & Related papers (2023-07-22T08:58:07Z) - TCFimt: Temporal Counterfactual Forecasting from Individual Multiple
Treatment Perspective [50.675845725806724]
We propose a comprehensive framework of temporal counterfactual forecasting from an individual multiple treatment perspective (TCFimt)
TCFimt constructs adversarial tasks in a seq2seq framework to alleviate selection and time-varying bias and designs a contrastive learning-based block to decouple a mixed treatment effect into separated main treatment effects and causal interactions.
The proposed method shows satisfactory performance in predicting future outcomes with specific treatments and in choosing optimal treatment type and timing than state-of-the-art methods.
arXiv Detail & Related papers (2022-12-17T15:01:05Z) - Causal Modeling of Policy Interventions From Sequences of Treatments and
Outcomes [5.107614397012659]
Data-driven decision-making requires the ability to predict what happens if a policy is changed.
Existing methods that predict how the outcome evolves assume that the tentative sequences of future treatments are fixed in advance.
In practice, the treatments are determinedally by a policy and may depend on the efficiency of previous treatments.
arXiv Detail & Related papers (2022-09-09T06:50:37Z) - Benchmarking Heterogeneous Treatment Effect Models through the Lens of
Interpretability [82.29775890542967]
Estimating personalized effects of treatments is a complex, yet pervasive problem.
Recent developments in the machine learning literature on heterogeneous treatment effect estimation gave rise to many sophisticated, but opaque, tools.
We use post-hoc feature importance methods to identify features that influence the model's predictions.
arXiv Detail & Related papers (2022-06-16T17:59:05Z) - Disentangled Counterfactual Recurrent Networks for Treatment Effect
Inference over Time [71.30985926640659]
We introduce the Disentangled Counterfactual Recurrent Network (DCRN), a sequence-to-sequence architecture that estimates treatment outcomes over time.
With an architecture that is completely inspired by the causal structure of treatment influence over time, we advance forecast accuracy and disease understanding.
We demonstrate that DCRN outperforms current state-of-the-art methods in forecasting treatment responses, on both real and simulated data.
arXiv Detail & Related papers (2021-12-07T16:40:28Z) - Semi-Supervised Variational Reasoning for Medical Dialogue Generation [70.838542865384]
Two key characteristics are relevant for medical dialogue generation: patient states and physician actions.
We propose an end-to-end variational reasoning approach to medical dialogue generation.
A physician policy network composed of an action-classifier and two reasoning detectors is proposed for augmented reasoning ability.
arXiv Detail & Related papers (2021-05-13T04:14:35Z) - Learning Individualized Treatment Rules with Estimated Translated
Inverse Propensity Score [29.606141542532356]
In this paper, we focus on learning individualized treatment rules (ITRs) to derive a treatment policy.
In our framework, we cast ITRs learning as a contextual bandit problem and minimize the expected risk of the treatment policy.
As a long-term goal, our derived policy might eventually lead to better clinical guidelines for the administration of IV and VP.
arXiv Detail & Related papers (2020-07-02T13:13:56Z) - Optimizing Medical Treatment for Sepsis in Intensive Care: from
Reinforcement Learning to Pre-Trial Evaluation [2.908482270923597]
Our aim is to establish a framework where reinforcement learning (RL) of optimizing interventions retrospectively allows us a regulatory compliant pathway to prospective clinical testing of the learned policies.
We focus on infections in intensive care units which are one of the major causes of death and difficult to treat because of the complex and opaque patient dynamics.
arXiv Detail & Related papers (2020-03-13T20:31:47Z) - Generalization Bounds and Representation Learning for Estimation of
Potential Outcomes and Causal Effects [61.03579766573421]
We study estimation of individual-level causal effects, such as a single patient's response to alternative medication.
We devise representation learning algorithms that minimize our bound, by regularizing the representation's induced treatment group distance.
We extend these algorithms to simultaneously learn a weighted representation to further reduce treatment group distances.
arXiv Detail & Related papers (2020-01-21T10:16:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.