Robust Recursive Partitioning for Heterogeneous Treatment Effects with
Uncertainty Quantification
- URL: http://arxiv.org/abs/2006.07917v2
- Date: Sun, 18 Oct 2020 03:00:18 GMT
- Title: Robust Recursive Partitioning for Heterogeneous Treatment Effects with
Uncertainty Quantification
- Authors: Hyun-Suk Lee, Yao Zhang, William Zame, Cong Shen, Jang-Won Lee,
Mihaela van der Schaar
- Abstract summary: Subgroup analysis of treatment effects plays an important role in applications from medicine to public policy to recommender systems.
Most of the current methods of subgroup analysis begin with a particular algorithm for estimating individualized treatment effects (ITE)
This paper develops a new method for subgroup analysis, R2P, that addresses all these weaknesses.
- Score: 84.53697297858146
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Subgroup analysis of treatment effects plays an important role in
applications from medicine to public policy to recommender systems. It allows
physicians (for example) to identify groups of patients for whom a given drug
or treatment is likely to be effective and groups of patients for which it is
not. Most of the current methods of subgroup analysis begin with a particular
algorithm for estimating individualized treatment effects (ITE) and identify
subgroups by maximizing the difference across subgroups of the average
treatment effect in each subgroup. These approaches have several weaknesses:
they rely on a particular algorithm for estimating ITE, they ignore
(in)homogeneity within identified subgroups, and they do not produce good
confidence estimates. This paper develops a new method for subgroup analysis,
R2P, that addresses all these weaknesses. R2P uses an arbitrary, exogenously
prescribed algorithm for estimating ITE and quantifies the uncertainty of the
ITE estimation, using a construction that is more robust than other methods.
Experiments using synthetic and semi-synthetic datasets (based on real data)
demonstrate that R2P constructs partitions that are simultaneously more
homogeneous within groups and more heterogeneous across groups than the
partitions produced by other methods. Moreover, because R2P can employ any ITE
estimator, it also produces much narrower confidence intervals with a
prescribed coverage guarantee than other methods.
Related papers
- Almost Asymptotically Optimal Active Clustering Through Pairwise Observations [59.20614082241528]
We propose a new analysis framework for clustering $M$ items into an unknown number of $K$ distinct groups using noisy and actively collected responses.<n>We establish a fundamental lower bound on the expected number of queries needed to achieve a desired confidence in the accuracy of the clustering.<n>We develop a computationally feasible variant of the Generalized Likelihood Ratio statistic and show that its performance gap to the lower bound can be accurately empirically estimated.
arXiv Detail & Related papers (2026-02-05T14:16:47Z) - Efficient Subgroup Analysis via Optimal Trees with Global Parameter Fusion [4.874780144224057]
Subgroup analysis allows practitioners to pinpoint populations for whom a treatment is especially beneficial or protective.<n>We propose a fused optimal causal tree method that leverages mixed integer optimization (MIO) to facilitate precise subgroup identification.<n>We provide theoretical guarantees by rigorously establishing out of sample risk bounds and comparing them with those of classical tree based methods.
arXiv Detail & Related papers (2026-02-03T23:26:19Z) - Chiseling: Powerful and Valid Subgroup Selection via Interactive Machine Learning [7.170797040538138]
In regression and causal inference, controlled subgroup selection aims to identify a subgroup on which the average response or treatment effect is above a given threshold.<n>We propose a novel framework called chiseling that allows the analyst to interactively refine and test a candidate subgroup by iteratively shrinking it.
arXiv Detail & Related papers (2025-09-23T18:52:05Z) - Causal Clustering for Conditional Average Treatment Effects Estimation and Subgroup Discovery [5.669361767058639]
Estimating heterogeneous treatment effects is critical in domains such as personalized medicine, resource allocation, and policy evaluation.<n>We propose a novel framework that clusters individuals based on estimated treatment effects using a learned kernel derived from causal forests.
arXiv Detail & Related papers (2025-09-06T17:01:23Z) - Project-Probe-Aggregate: Efficient Fine-Tuning for Group Robustness [53.96714099151378]
We propose a three-step approach for parameter-efficient fine-tuning of image-text foundation models.
Our method improves its two key components: minority samples identification and the robust training algorithm.
Our theoretical analysis shows that our PPA enhances minority group identification and is Bayes optimal for minimizing the balanced group error.
arXiv Detail & Related papers (2025-03-12T15:46:12Z) - A structured regression approach for evaluating model performance across intersectional subgroups [53.91682617836498]
Disaggregated evaluation is a central task in AI fairness assessment, where the goal is to measure an AI system's performance across different subgroups.
We introduce a structured regression approach to disaggregated evaluation that we demonstrate can yield reliable system performance estimates even for very small subgroups.
arXiv Detail & Related papers (2024-01-26T14:21:45Z) - Subgroup analysis methods for time-to-event outcomes in heterogeneous
randomized controlled trials [7.940293148084845]
Non-significant randomized control trials can hide subgroups of good responders to experimental drugs.
We provide an open source Python package, available on Github, containing our generation process and our comprehensive benchmark framework.
arXiv Detail & Related papers (2024-01-22T11:00:49Z) - Causal Machine Learning for Moderation Effects [0.0]
We propose a new parameter, the balanced group average treatment effect (BGATE)
By taking the difference of two BGATEs, we can analyse heterogeneity more meaningfully than by comparing two GATEs.
Adding additional identifying assumptions allows specific balanced differences in treatment effects between groups to be interpreted causally.
arXiv Detail & Related papers (2024-01-16T11:34:59Z) - Statistical Performance Guarantee for Subgroup Identification with
Generic Machine Learning [1.0878040851638]
We develop uniform confidence bands for estimation of the group average treatment effect sorted by generic ML algorithm (GATES)
We analyze a clinical trial of late-stage prostate cancer and find a relatively large proportion of exceptional responders.
arXiv Detail & Related papers (2023-10-12T01:41:47Z) - A One-shot Framework for Distributed Clustered Learning in Heterogeneous
Environments [54.172993875654015]
The paper proposes a family of communication efficient methods for distributed learning in heterogeneous environments.
One-shot approach, based on local computations at the users and a clustering based aggregation step at the server is shown to provide strong learning guarantees.
For strongly convex problems it is shown that, as long as the number of data points per user is above a threshold, the proposed approach achieves order-optimal mean-squared error rates in terms of the sample size.
arXiv Detail & Related papers (2022-09-22T09:04:10Z) - Statistical Inference for Heterogeneous Treatment Effects Discovered by Generic Machine Learning in Randomized Experiments [0.9208007322096533]
We develop a general approach to statistical inference for heterogeneous treatment effects discovered by a generic ML algorithm.
We show how to estimate the average treatment effect within each of these groups, and construct a valid confidence interval.
arXiv Detail & Related papers (2022-03-28T05:43:46Z) - On Inductive Biases for Heterogeneous Treatment Effect Estimation [91.3755431537592]
We investigate how to exploit structural similarities of an individual's potential outcomes (POs) under different treatments.
We compare three end-to-end learning strategies to overcome this problem.
arXiv Detail & Related papers (2021-06-07T16:30:46Z) - Finding Subgroups with Significant Treatment Effects [20.457122933924463]
We propose a machine-learning method that is specifically optimized for finding such subgroups in noisy data.
Unlike available methods for personalized treatment assignment, our tool is designed to take significance testing into account.
It produces a subgroup that is chosen to maximize the probability of obtaining a statistically significant positive treatment effect.
arXiv Detail & Related papers (2021-03-12T03:36:03Z) - Structured Sparsity Inducing Adaptive Optimizers for Deep Learning [94.23102887731417]
In this paper, we derive the weighted proximal operator, which is a necessary component of proximal gradient methods.
We show that this adaptive method, together with the weighted proximal operators derived here, is indeed capable of finding solutions with structure in their sparsity patterns.
arXiv Detail & Related papers (2021-02-07T18:06:23Z) - Deep Semi-Supervised Embedded Clustering (DSEC) for Stratification of
Heart Failure Patients [50.48904066814385]
In this work we apply deep semi-supervised embedded clustering to determine data-driven patient subgroups of heart failure.
We find clinically relevant clusters from an embedded space derived from heterogeneous data.
The proposed algorithm can potentially find new undiagnosed subgroups of patients that have different outcomes.
arXiv Detail & Related papers (2020-12-24T12:56:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.