Median Optimal Treatment Regimes
- URL: http://arxiv.org/abs/2103.01802v1
- Date: Tue, 2 Mar 2021 15:26:20 GMT
- Title: Median Optimal Treatment Regimes
- Authors: Liu Leqi, Edward H. Kennedy
- Abstract summary: We propose a new median optimal treatment regime that treats individuals whose conditional median is higher under treatment.
This ensures that optimal decisions for individuals from the same group are not overly influenced by a small fraction of the group.
We introduce a new measure of value, the Average Conditional Median Effect (ACME), which summarizes across-group median treatment outcomes of a policy.
- Score: 7.241149193573696
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Optimal treatment regimes are personalized policies for making a treatment
decision based on subject characteristics, with the policy chosen to maximize
some value. It is common to aim to maximize the mean outcome in the population,
via a regime assigning treatment only to those whose mean outcome is higher
under treatment versus control. However, the mean can be an unstable measure of
centrality, resulting in imprecise statistical procedures, as well as unfair
decisions that can be overly influenced by a small fraction of subjects. In
this work, we propose a new median optimal treatment regime that instead treats
individuals whose conditional median is higher under treatment. This ensures
that optimal decisions for individuals from the same group are not overly
influenced either by (i) a small fraction of the group (unlike the mean
criterion), or (ii) unrelated subjects from different groups (unlike marginal
median/quantile criteria). We introduce a new measure of value, the Average
Conditional Median Effect (ACME), which summarizes across-group median
treatment outcomes of a policy, and which the optimal median treatment regime
maximizes. After developing key motivating examples that distinguish median
optimal treatment regimes from mean and marginal median optimal treatment
regimes, we give a nonparametric efficiency bound for estimating the ACME of a
policy, and propose a new doubly robust-style estimator that achieves the
efficiency bound under weak conditions. Finite-sample properties of the
estimator are explored via numerical simulations and the proposed algorithm is
illustrated using data from a randomized clinical trial in patients with HIV.
Related papers
- Are causal effect estimations enough for optimal recommendations under multitreatment scenarios? [2.4578723416255754]
It is essential to include a causal effect estimation analysis to compare potential outcomes under different treatments or controls.
We propose a comprehensive methodology for multitreatment selection.
arXiv Detail & Related papers (2024-10-07T16:37:35Z) - Learning Robust Treatment Rules for Censored Data [14.95510487866686]
We propose two criteria for estimating optimal treatment rules.
We show improved performance compared to existing methods.
We also demonstrate the proposed method using AIDS clinical data.
arXiv Detail & Related papers (2024-08-17T09:58:58Z) - Optimal Baseline Corrections for Off-Policy Contextual Bandits [61.740094604552475]
We aim to learn decision policies that optimize an unbiased offline estimate of an online reward metric.
We propose a single framework built on their equivalence in learning scenarios.
Our framework enables us to characterize the variance-optimal unbiased estimator and provide a closed-form solution for it.
arXiv Detail & Related papers (2024-05-09T12:52:22Z) - Policy Gradient with Active Importance Sampling [55.112959067035916]
Policy gradient (PG) methods significantly benefit from IS, enabling the effective reuse of previously collected samples.
However, IS is employed in RL as a passive tool for re-weighting historical samples.
We look for the best behavioral policy from which to collect samples to reduce the policy gradient variance.
arXiv Detail & Related papers (2024-05-09T09:08:09Z) - Robust and Agnostic Learning of Conditional Distributional Treatment
Effects [62.44901952244514]
The conditional average treatment effect (CATE) is the best point prediction of individual causal effects.
In aggregate analyses, this is usually addressed by measuring distributional treatment effect (DTE)
We provide a new robust and model-agnostic methodology for learning the conditional DTE (CDTE) for a wide class of problems.
arXiv Detail & Related papers (2022-05-23T17:40:31Z) - Adaptive Semi-Supervised Inference for Optimal Treatment Decisions with
Electronic Medical Record Data [18.77246683875067]
The optimal treatment regime that yields the greatest overall expected clinical outcome of interest has attracted a lot of attention.
We consider estimation of the optimal treatment regime with electronic medical record data under a semi-supervised setting.
arXiv Detail & Related papers (2022-03-04T13:54:35Z) - Off-Policy Evaluation with Policy-Dependent Optimization Response [90.28758112893054]
We develop a new framework for off-policy evaluation with a textitpolicy-dependent linear optimization response.
We construct unbiased estimators for the policy-dependent estimand by a perturbation method.
We provide a general algorithm for optimizing causal interventions.
arXiv Detail & Related papers (2022-02-25T20:25:37Z) - Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium
Learning from Offline Datasets [101.5329678997916]
We study episodic two-player zero-sum Markov games (MGs) in the offline setting.
The goal is to find an approximate Nash equilibrium (NE) policy pair based on a dataset collected a priori.
arXiv Detail & Related papers (2022-02-15T15:39:30Z) - Reinforcement Learning with Heterogeneous Data: Estimation and Inference [84.72174994749305]
We introduce the K-Heterogeneous Markov Decision Process (K-Hetero MDP) to address sequential decision problems with population heterogeneity.
We propose the Auto-Clustered Policy Evaluation (ACPE) for estimating the value of a given policy, and the Auto-Clustered Policy Iteration (ACPI) for estimating the optimal policy in a given policy class.
We present simulations to support our theoretical findings, and we conduct an empirical study on the standard MIMIC-III dataset.
arXiv Detail & Related papers (2022-01-31T20:58:47Z) - Evaluating Treatment Prioritization Rules via Rank-Weighted Average
Treatment Effects [24.258855352542096]
We propose rank-weighted average treatment effect metrics as a simple and general family of metrics for comparing and testing the quality of treatment prioritization rules.
RATE metrics are agnostic to how the prioritization rules were derived, and only assess how well they identify individuals that benefit the most from treatment.
We showcase RATE in the context of a number of applications, including optimal targeting of aspirin to stroke patients.
arXiv Detail & Related papers (2021-11-15T18:22:35Z) - CAPITAL: Optimal Subgroup Identification via Constrained Policy Tree
Search [10.961093227672398]
A clinically meaningful subgroup learning approach should identify the maximum number of patients who can benefit from the better treatment.
We present an optimal subgroup selection rule (SSR) that maximizes the number of selected patients.
We propose a ConstrAined PolIcy Tree seArch aLgorithm to find the optimal SSR within the interpretable decision tree class.
arXiv Detail & Related papers (2021-10-11T22:41:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.