Robust Recursive Partitioning for Heterogeneous Treatment Effects with
Uncertainty Quantification
- URL: http://arxiv.org/abs/2006.07917v2
- Date: Sun, 18 Oct 2020 03:00:18 GMT
- Title: Robust Recursive Partitioning for Heterogeneous Treatment Effects with
Uncertainty Quantification
- Authors: Hyun-Suk Lee, Yao Zhang, William Zame, Cong Shen, Jang-Won Lee,
Mihaela van der Schaar
- Abstract summary: Subgroup analysis of treatment effects plays an important role in applications from medicine to public policy to recommender systems.
Most of the current methods of subgroup analysis begin with a particular algorithm for estimating individualized treatment effects (ITE)
This paper develops a new method for subgroup analysis, R2P, that addresses all these weaknesses.
- Score: 84.53697297858146
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Subgroup analysis of treatment effects plays an important role in
applications from medicine to public policy to recommender systems. It allows
physicians (for example) to identify groups of patients for whom a given drug
or treatment is likely to be effective and groups of patients for which it is
not. Most of the current methods of subgroup analysis begin with a particular
algorithm for estimating individualized treatment effects (ITE) and identify
subgroups by maximizing the difference across subgroups of the average
treatment effect in each subgroup. These approaches have several weaknesses:
they rely on a particular algorithm for estimating ITE, they ignore
(in)homogeneity within identified subgroups, and they do not produce good
confidence estimates. This paper develops a new method for subgroup analysis,
R2P, that addresses all these weaknesses. R2P uses an arbitrary, exogenously
prescribed algorithm for estimating ITE and quantifies the uncertainty of the
ITE estimation, using a construction that is more robust than other methods.
Experiments using synthetic and semi-synthetic datasets (based on real data)
demonstrate that R2P constructs partitions that are simultaneously more
homogeneous within groups and more heterogeneous across groups than the
partitions produced by other methods. Moreover, because R2P can employ any ITE
estimator, it also produces much narrower confidence intervals with a
prescribed coverage guarantee than other methods.
Related papers
- A structured regression approach for evaluating model performance across intersectional subgroups [53.91682617836498]
Disaggregated evaluation is a central task in AI fairness assessment, where the goal is to measure an AI system's performance across different subgroups.
We introduce a structured regression approach to disaggregated evaluation that we demonstrate can yield reliable system performance estimates even for very small subgroups.
arXiv Detail & Related papers (2024-01-26T14:21:45Z) - Causal Machine Learning for Moderation Effects [0.0]
We propose a new parameter, the balanced group average treatment effect (BGATE)
By taking the difference of two BGATEs, we can analyse heterogeneity more meaningfully than by comparing two GATEs.
Adding additional identifying assumptions allows specific balanced differences in treatment effects between groups to be interpreted causally.
arXiv Detail & Related papers (2024-01-16T11:34:59Z) - Statistical Performance Guarantee for Subgroup Identification with
Generic Machine Learning [1.0878040851638]
We develop uniform confidence bands for estimation of the group average treatment effect sorted by generic ML algorithm (GATES)
We analyze a clinical trial of late-stage prostate cancer and find a relatively large proportion of exceptional responders.
arXiv Detail & Related papers (2023-10-12T01:41:47Z) - A One-shot Framework for Distributed Clustered Learning in Heterogeneous
Environments [54.172993875654015]
The paper proposes a family of communication efficient methods for distributed learning in heterogeneous environments.
One-shot approach, based on local computations at the users and a clustering based aggregation step at the server is shown to provide strong learning guarantees.
For strongly convex problems it is shown that, as long as the number of data points per user is above a threshold, the proposed approach achieves order-optimal mean-squared error rates in terms of the sample size.
arXiv Detail & Related papers (2022-09-22T09:04:10Z) - Statistical Inference for Heterogeneous Treatment Effects Discovered by Generic Machine Learning in Randomized Experiments [0.9208007322096533]
We develop a general approach to statistical inference for heterogeneous treatment effects discovered by a generic ML algorithm.
We show how to estimate the average treatment effect within each of these groups, and construct a valid confidence interval.
arXiv Detail & Related papers (2022-03-28T05:43:46Z) - On Inductive Biases for Heterogeneous Treatment Effect Estimation [91.3755431537592]
We investigate how to exploit structural similarities of an individual's potential outcomes (POs) under different treatments.
We compare three end-to-end learning strategies to overcome this problem.
arXiv Detail & Related papers (2021-06-07T16:30:46Z) - Finding Subgroups with Significant Treatment Effects [20.457122933924463]
We propose a machine-learning method that is specifically optimized for finding such subgroups in noisy data.
Unlike available methods for personalized treatment assignment, our tool is designed to take significance testing into account.
It produces a subgroup that is chosen to maximize the probability of obtaining a statistically significant positive treatment effect.
arXiv Detail & Related papers (2021-03-12T03:36:03Z) - Structured Sparsity Inducing Adaptive Optimizers for Deep Learning [94.23102887731417]
In this paper, we derive the weighted proximal operator, which is a necessary component of proximal gradient methods.
We show that this adaptive method, together with the weighted proximal operators derived here, is indeed capable of finding solutions with structure in their sparsity patterns.
arXiv Detail & Related papers (2021-02-07T18:06:23Z) - Deep Semi-Supervised Embedded Clustering (DSEC) for Stratification of
Heart Failure Patients [50.48904066814385]
In this work we apply deep semi-supervised embedded clustering to determine data-driven patient subgroups of heart failure.
We find clinically relevant clusters from an embedded space derived from heterogeneous data.
The proposed algorithm can potentially find new undiagnosed subgroups of patients that have different outcomes.
arXiv Detail & Related papers (2020-12-24T12:56:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.