Related papers: Treatment Policy Learning in Multiobjective Settings with Fully Observed Outcomes

Treatment Policy Learning in Multiobjective Settings with Fully Observed Outcomes

URL: http://arxiv.org/abs/2006.00927v2
Date: Thu, 13 Aug 2020 02:06:50 GMT
Title: Treatment Policy Learning in Multiobjective Settings with Fully Observed Outcomes
Authors: Soorajnath Boominathan, Michael Oberst, Helen Zhou, Sanjat Kanjilal, David Sontag
Abstract summary: We present, compare, and evaluate three approaches for learning individualized treatment policies. We show that all approaches learn policies that achieve strictly better performance on all outcomes than clinicians.
Score: 6.944742823560999
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: In several medical decision-making problems, such as antibiotic prescription, laboratory testing can provide precise indications for how a patient will respond to different treatment options. This enables us to "fully observe" all potential treatment outcomes, but while present in historical data, these results are infeasible to produce in real-time at the point of the initial treatment decision. Moreover, treatment policies in these settings often need to trade off between multiple competing objectives, such as effectiveness of treatment and harmful side effects. We present, compare, and evaluate three approaches for learning individualized treatment policies in this setting: First, we consider two indirect approaches, which use predictive models of treatment response to construct policies optimal for different trade-offs between objectives. Second, we consider a direct approach that constructs such a set of policies without intermediate models of outcomes. Using a medical dataset of Urinary Tract Infection (UTI) patients, we show that all approaches learn policies that achieve strictly better performance on all outcomes than clinicians, while also trading off between different objectives. We demonstrate additional benefits of the direct approach, including flexibly incorporating other goals such as deferral to physicians on simple cases.

Related papers

Towards Regulatory-Confirmed Adaptive Clinical Trials: Machine Learning Opportunities and Solutions [59.28853595868749]
We introduce two new objectives for future clinical trials that integrate regulatory constraints and treatment policy value for both the entire population and under-served populations. We formulate Randomize First Augment Next (RFAN), a new framework for designing Phase III clinical trials. Our framework consists of a standard randomized component followed by an adaptive one, jointly meant to efficiently and safely acquire and assign patients into treatment arms during the trial.
arXiv Detail & Related papers (2025-03-12T10:17:54Z)
Pruning the Path to Optimal Care: Identifying Systematically Suboptimal Medical Decision-Making with Inverse Reinforcement Learning [14.688842697886484]
We present a novel application of Inverse Reinforcement Learning that identifies suboptimal clinician actions based on the actions of their peers. This approach centers two stages of IRL with an intermediate step to prune trajectories displaying behavior that deviates significantly from the consensus.
arXiv Detail & Related papers (2024-11-07T23:16:59Z)
The Blessings of Multiple Treatments and Outcomes in Treatment Effect Estimation [53.81860494566915]
Existing studies leveraged proxy variables or multiple treatments to adjust for confounding bias. In many real-world scenarios, there is greater interest in studying the effects on multiple outcomes. We show that parallel studies of multiple outcomes involved in this setting can assist each other in causal identification.
arXiv Detail & Related papers (2023-09-29T14:33:48Z)
A Flexible Framework for Incorporating Patient Preferences Into Q-Learning [1.2891210250935146]
In real-world healthcare problems, there are often multiple competing outcomes of interest, such as treatment efficacy and side effect severity. statistical methods for estimating dynamic treatment regimes (DTRs) usually assume a single outcome of interest. This includes restrictions to a single time point and two outcomes, the inability to incorporate self-reported patient preferences and limited theoretical guarantees.
arXiv Detail & Related papers (2023-07-22T08:58:07Z)
TCFimt: Temporal Counterfactual Forecasting from Individual Multiple Treatment Perspective [50.675845725806724]
We propose a comprehensive framework of temporal counterfactual forecasting from an individual multiple treatment perspective (TCFimt) TCFimt constructs adversarial tasks in a seq2seq framework to alleviate selection and time-varying bias and designs a contrastive learning-based block to decouple a mixed treatment effect into separated main treatment effects and causal interactions. The proposed method shows satisfactory performance in predicting future outcomes with specific treatments and in choosing optimal treatment type and timing than state-of-the-art methods.
arXiv Detail & Related papers (2022-12-17T15:01:05Z)
Causal Modeling of Policy Interventions From Sequences of Treatments and Outcomes [5.107614397012659]
Data-driven decision-making requires the ability to predict what happens if a policy is changed. Existing methods that predict how the outcome evolves assume that the tentative sequences of future treatments are fixed in advance. In practice, the treatments are determinedally by a policy and may depend on the efficiency of previous treatments.
arXiv Detail & Related papers (2022-09-09T06:50:37Z)
Benchmarking Heterogeneous Treatment Effect Models through the Lens of Interpretability [82.29775890542967]
Estimating personalized effects of treatments is a complex, yet pervasive problem. Recent developments in the machine learning literature on heterogeneous treatment effect estimation gave rise to many sophisticated, but opaque, tools. We use post-hoc feature importance methods to identify features that influence the model's predictions.
arXiv Detail & Related papers (2022-06-16T17:59:05Z)
Disentangled Counterfactual Recurrent Networks for Treatment Effect Inference over Time [71.30985926640659]
We introduce the Disentangled Counterfactual Recurrent Network (DCRN), a sequence-to-sequence architecture that estimates treatment outcomes over time. With an architecture that is completely inspired by the causal structure of treatment influence over time, we advance forecast accuracy and disease understanding. We demonstrate that DCRN outperforms current state-of-the-art methods in forecasting treatment responses, on both real and simulated data.
arXiv Detail & Related papers (2021-12-07T16:40:28Z)
Semi-Supervised Variational Reasoning for Medical Dialogue Generation [70.838542865384]
Two key characteristics are relevant for medical dialogue generation: patient states and physician actions. We propose an end-to-end variational reasoning approach to medical dialogue generation. A physician policy network composed of an action-classifier and two reasoning detectors is proposed for augmented reasoning ability.
arXiv Detail & Related papers (2021-05-13T04:14:35Z)
Learning Individualized Treatment Rules with Estimated Translated Inverse Propensity Score [29.606141542532356]
In this paper, we focus on learning individualized treatment rules (ITRs) to derive a treatment policy. In our framework, we cast ITRs learning as a contextual bandit problem and minimize the expected risk of the treatment policy. As a long-term goal, our derived policy might eventually lead to better clinical guidelines for the administration of IV and VP.
arXiv Detail & Related papers (2020-07-02T13:13:56Z)
Optimizing Medical Treatment for Sepsis in Intensive Care: from Reinforcement Learning to Pre-Trial Evaluation [2.908482270923597]
Our aim is to establish a framework where reinforcement learning (RL) of optimizing interventions retrospectively allows us a regulatory compliant pathway to prospective clinical testing of the learned policies. We focus on infections in intensive care units which are one of the major causes of death and difficult to treat because of the complex and opaque patient dynamics.
arXiv Detail & Related papers (2020-03-13T20:31:47Z)
Generalization Bounds and Representation Learning for Estimation of Potential Outcomes and Causal Effects [61.03579766573421]
We study estimation of individual-level causal effects, such as a single patient's response to alternative medication. We devise representation learning algorithms that minimize our bound, by regularizing the representation's induced treatment group distance. We extend these algorithms to simultaneously learn a weighted representation to further reduce treatment group distances.
arXiv Detail & Related papers (2020-01-21T10:16:33Z)

This list is automatically generated from the titles and abstracts of the papers in this site.