Finding Subgroups with Significant Treatment Effects
- URL: http://arxiv.org/abs/2103.07066v2
- Date: Wed, 20 Dec 2023 09:19:41 GMT
- Title: Finding Subgroups with Significant Treatment Effects
- Authors: Jann Spiess and Vasilis Syrgkanis and Victor Yaneng Wang
- Abstract summary: We propose a machine-learning method that is specifically optimized for finding such subgroups in noisy data.
Unlike available methods for personalized treatment assignment, our tool is designed to take significance testing into account.
It produces a subgroup that is chosen to maximize the probability of obtaining a statistically significant positive treatment effect.
- Score: 20.457122933924463
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Researchers often run resource-intensive randomized controlled trials (RCTs)
to estimate the causal effects of interventions on outcomes of interest. Yet
these outcomes are often noisy, and estimated overall effects can be small or
imprecise. Nevertheless, we may still be able to produce reliable evidence of
the efficacy of an intervention by finding subgroups with significant effects.
In this paper, we propose a machine-learning method that is specifically
optimized for finding such subgroups in noisy data. Unlike available methods
for personalized treatment assignment, our tool is fundamentally designed to
take significance testing into account: it produces a subgroup that is chosen
to maximize the probability of obtaining a statistically significant positive
treatment effect. We provide a computationally efficient implementation using
decision trees and demonstrate its gain over selecting subgroups based on
positive (estimated) treatment effects. Compared to standard tree-based
regression and classification tools, this approach tends to yield higher power
in detecting subgroups affected by the treatment.
Related papers
- A structured regression approach for evaluating model performance across intersectional subgroups [53.91682617836498]
Disaggregated evaluation is a central task in AI fairness assessment, where the goal is to measure an AI system's performance across different subgroups.
We introduce a structured regression approach to disaggregated evaluation that we demonstrate can yield reliable system performance estimates even for very small subgroups.
arXiv Detail & Related papers (2024-01-26T14:21:45Z) - SubgroupTE: Advancing Treatment Effect Estimation with Subgroup
Identification [7.598921240673158]
We propose a novel treatment effect estimation model, named SubgroupTE, which incorporates subgroup identification in TEE.
SubgroupTE identifies heterogeneous subgroups with different treatment responses and more precisely estimates treatment effects by considering subgroup-specific causal effects.
Experiments on the synthetic and semi-synthetic datasets exhibit the outstanding performance of SubgroupTE compared with the state-of-the-art models on treatment effect estimation.
arXiv Detail & Related papers (2024-01-22T21:41:26Z) - Statistical Performance Guarantee for Subgroup Identification with
Generic Machine Learning [1.0878040851638]
We develop uniform confidence bands for estimation of the group average treatment effect sorted by generic ML algorithm (GATES)
We analyze a clinical trial of late-stage prostate cancer and find a relatively large proportion of exceptional responders.
arXiv Detail & Related papers (2023-10-12T01:41:47Z) - Benchmarking Bayesian Causal Discovery Methods for Downstream Treatment
Effect Estimation [137.3520153445413]
A notable gap exists in the evaluation of causal discovery methods, where insufficient emphasis is placed on downstream inference.
We evaluate seven established baseline causal discovery methods including a newly proposed method based on GFlowNets.
The results of our study demonstrate that some of the algorithms studied are able to effectively capture a wide range of useful and diverse ATE modes.
arXiv Detail & Related papers (2023-07-11T02:58:10Z) - Doubly Robust Estimation of Direct and Indirect Quantile Treatment
Effects with Machine Learning [0.0]
We suggest a machine learning estimator of direct and indirect quantile treatment effects under a selection-on-observables assumption.
The proposed method is based on the efficient score functions of the cumulative distribution functions of potential outcomes.
We also propose a multiplier bootstrap for statistical inference and show the validity of the multiplier.
arXiv Detail & Related papers (2023-07-03T14:27:15Z) - B-Learner: Quasi-Oracle Bounds on Heterogeneous Causal Effects Under
Hidden Confounding [51.74479522965712]
We propose a meta-learner called the B-Learner, which can efficiently learn sharp bounds on the CATE function under limits on hidden confounding.
We prove its estimates are valid, sharp, efficient, and have a quasi-oracle property with respect to the constituent estimators under more general conditions than existing methods.
arXiv Detail & Related papers (2023-04-20T18:07:19Z) - Sample Constrained Treatment Effect Estimation [28.156207324508706]
We focus on designing efficient randomized controlled trials, to accurately estimate the effect of some treatment on a population of $n$ individuals.
In particular, we study sample-constrained treatment effect estimation, where we must select a subset of $s ll n$ individuals from the population to experiment on.
arXiv Detail & Related papers (2022-10-12T21:13:47Z) - Assessment of Treatment Effect Estimators for Heavy-Tailed Data [70.72363097550483]
A central obstacle in the objective assessment of treatment effect (TE) estimators in randomized control trials (RCTs) is the lack of ground truth (or validation set) to test their performance.
We provide a novel cross-validation-like methodology to address this challenge.
We evaluate our methodology across 709 RCTs implemented in the Amazon supply chain.
arXiv Detail & Related papers (2021-12-14T17:53:01Z) - Robust Recursive Partitioning for Heterogeneous Treatment Effects with
Uncertainty Quantification [84.53697297858146]
Subgroup analysis of treatment effects plays an important role in applications from medicine to public policy to recommender systems.
Most of the current methods of subgroup analysis begin with a particular algorithm for estimating individualized treatment effects (ITE)
This paper develops a new method for subgroup analysis, R2P, that addresses all these weaknesses.
arXiv Detail & Related papers (2020-06-14T14:50:02Z) - Almost-Matching-Exactly for Treatment Effect Estimation under Network
Interference [73.23326654892963]
We propose a matching method that recovers direct treatment effects from randomized experiments where units are connected in an observed network.
Our method matches units almost exactly on counts of unique subgraphs within their neighborhood graphs.
arXiv Detail & Related papers (2020-03-02T15:21:20Z) - Fisher-Schultz Lecture: Generic Machine Learning Inference on
Heterogenous Treatment Effects in Randomized Experiments, with an Application
to Immunization in India [3.3449509626538543]
We propose strategies to estimate and make inference on key features of heterogeneous effects in randomized experiments.
Key features include best linear predictors of the effects using machine learning proxies, average effects sorted by impact groups, and average characteristics of most and least impacted units.
arXiv Detail & Related papers (2017-12-13T14:47:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.