High-Dimensional Feature Selection for Sample Efficient Treatment Effect
Estimation
- URL: http://arxiv.org/abs/2011.01979v1
- Date: Tue, 3 Nov 2020 19:54:16 GMT
- Title: High-Dimensional Feature Selection for Sample Efficient Treatment Effect
Estimation
- Authors: Kristjan Greenewald, Dmitriy Katz-Rogozhnikov, Karthik Shanmugam
- Abstract summary: The estimation of causal treatment effects from observational data is a fundamental problem in causal inference.
We propose a common objective function involving outcomes across treatment cohorts.
We validate our approach with experiments on treatment effect estimation.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The estimation of causal treatment effects from observational data is a
fundamental problem in causal inference. To avoid bias, the effect estimator
must control for all confounders. Hence practitioners often collect data for as
many covariates as possible to raise the chances of including the relevant
confounders. While this addresses the bias, this has the side effect of
significantly increasing the number of data samples required to accurately
estimate the effect due to the increased dimensionality. In this work, we
consider the setting where out of a large number of covariates $X$ that satisfy
strong ignorability, an unknown sparse subset $S$ is sufficient to include to
achieve zero bias, i.e. $c$-equivalent to $X$. We propose a common objective
function involving outcomes across treatment cohorts with nonconvex joint
sparsity regularization that is guaranteed to recover $S$ with high probability
under a linear outcome model for $Y$ and subgaussian covariates for each of the
treatment cohort. This improves the effect estimation sample complexity so that
it scales with the cardinality of the sparse subset $S$ and $\log |X|$, as
opposed to the cardinality of the full set $X$. We validate our approach with
experiments on treatment effect estimation.
Related papers
- Semiparametric conformal prediction [79.6147286161434]
Risk-sensitive applications require well-calibrated prediction sets over multiple, potentially correlated target variables.
We treat the scores as random vectors and aim to construct the prediction set accounting for their joint correlation structure.
We report desired coverage and competitive efficiency on a range of real-world regression problems.
arXiv Detail & Related papers (2024-11-04T14:29:02Z) - Continuous Treatment Effects with Surrogate Outcomes [12.548638259932915]
We study the role of surrogates in estimating continuous treatment effects.
We propose a doubly robust method to efficiently incorporate surrogates in the analysis.
arXiv Detail & Related papers (2024-01-31T20:50:18Z) - Sample Constrained Treatment Effect Estimation [28.156207324508706]
We focus on designing efficient randomized controlled trials, to accurately estimate the effect of some treatment on a population of $n$ individuals.
In particular, we study sample-constrained treatment effect estimation, where we must select a subset of $s ll n$ individuals from the population to experiment on.
arXiv Detail & Related papers (2022-10-12T21:13:47Z) - DESCN: Deep Entire Space Cross Networks for Individual Treatment Effect
Estimation [7.060064266376701]
Causal Inference has wide applications in various areas such as E-commerce and precision medicine.
This paper proposes Deep Entire Space Cross Networks (DESCN) to model treatment effects from an end-to-end perspective.
arXiv Detail & Related papers (2022-07-19T01:25:31Z) - Robust and Agnostic Learning of Conditional Distributional Treatment
Effects [62.44901952244514]
The conditional average treatment effect (CATE) is the best point prediction of individual causal effects.
In aggregate analyses, this is usually addressed by measuring distributional treatment effect (DTE)
We provide a new robust and model-agnostic methodology for learning the conditional DTE (CDTE) for a wide class of problems.
arXiv Detail & Related papers (2022-05-23T17:40:31Z) - Treatment Effect Risk: Bounds and Inference [58.442274475425144]
Since the average treatment effect measures the change in social welfare, even if positive, there is a risk of negative effect on, say, some 10% of the population.
In this paper we consider how to nonetheless assess this important risk measure, formalized as the conditional value at risk (CVaR) of the ITE distribution.
Some bounds can also be interpreted as summarizing a complex CATE function into a single metric and are of interest independently of being a bound.
arXiv Detail & Related papers (2022-01-15T17:21:26Z) - A General Framework for Treatment Effect Estimation in Semi-Supervised and High Dimensional Settings [0.0]
We develop a family of SS estimators which are more robust and (2) more efficient than their supervised counterparts.
We further establish root-n consistency and normality of our SS estimators whenever the propensity score in the model is correctly specified.
Our estimators are shown to be semi-parametrically efficient as long as all the nuisance functions are correctly specified.
arXiv Detail & Related papers (2022-01-03T04:12:44Z) - Assessment of Treatment Effect Estimators for Heavy-Tailed Data [70.72363097550483]
A central obstacle in the objective assessment of treatment effect (TE) estimators in randomized control trials (RCTs) is the lack of ground truth (or validation set) to test their performance.
We provide a novel cross-validation-like methodology to address this challenge.
We evaluate our methodology across 709 RCTs implemented in the Amazon supply chain.
arXiv Detail & Related papers (2021-12-14T17:53:01Z) - Heavy-tailed Streaming Statistical Estimation [58.70341336199497]
We consider the task of heavy-tailed statistical estimation given streaming $p$ samples.
We design a clipped gradient descent and provide an improved analysis under a more nuanced condition on the noise of gradients.
arXiv Detail & Related papers (2021-08-25T21:30:27Z) - What can the millions of random treatments in nonexperimental data
reveal about causes? [0.0]
The article introduces one such model and a Bayesian approach to combine the $O(n2)$ pairwise observations typically available in nonexperimnetal data.
We demonstrate that the proposed approach recovers causal effects in common NSW samples, as well as in arbitrary subpopulations and an order-of-magnitude larger supersample.
arXiv Detail & Related papers (2021-05-03T20:13:34Z) - Deconfounding Scores: Feature Representations for Causal Effect
Estimation with Weak Overlap [140.98628848491146]
We introduce deconfounding scores, which induce better overlap without biasing the target of estimation.
We show that deconfounding scores satisfy a zero-covariance condition that is identifiable in observed data.
In particular, we show that this technique could be an attractive alternative to standard regularizations.
arXiv Detail & Related papers (2021-04-12T18:50:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.