Causal Effect Estimation from Observational and Interventional Data
Through Matrix Weighted Linear Estimators
- URL: http://arxiv.org/abs/2306.06002v1
- Date: Fri, 9 Jun 2023 16:16:53 GMT
- Title: Causal Effect Estimation from Observational and Interventional Data
Through Matrix Weighted Linear Estimators
- Authors: Klaus-Rudolf Kladny, Julius von K\"ugelgen, Bernhard Sch\"olkopf,
Michael Muehlebach
- Abstract summary: We study causal effect estimation from a mixture of observational and interventional data.
We show that the statistical efficiency in terms of expected squared error can be improved by combining estimators.
- Score: 11.384045395629123
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We study causal effect estimation from a mixture of observational and
interventional data in a confounded linear regression model with multivariate
treatments. We show that the statistical efficiency in terms of expected
squared error can be improved by combining estimators arising from both the
observational and interventional setting. To this end, we derive methods based
on matrix weighted linear estimators and prove that our methods are
asymptotically unbiased in the infinite sample limit. This is an important
improvement compared to the pooled estimator using the union of interventional
and observational data, for which the bias only vanishes if the ratio of
observational to interventional data tends to zero. Studies on synthetic data
confirm our theoretical findings. In settings where confounding is substantial
and the ratio of observational to interventional data is large, our estimators
outperform a Stein-type estimator and various other baselines.
Related papers
- On semi-supervised estimation using exponential tilt mixture models [12.347498345854715]
Consider a semi-supervised setting with a labeled dataset of binary responses and predictors and an unlabeled dataset with only predictors.
For semi-supervised estimation, we develop further analysis and understanding of a statistical approach using exponential tilt mixture (ETM) models.
arXiv Detail & Related papers (2023-11-14T19:53:26Z) - Approximating Counterfactual Bounds while Fusing Observational, Biased
and Randomised Data Sources [64.96984404868411]
We address the problem of integrating data from multiple, possibly biased, observational and interventional studies.
We show that the likelihood of the available data has no local maxima.
We then show how the same approach can address the general case of multiple datasets.
arXiv Detail & Related papers (2023-07-31T11:28:24Z) - Covariate-Balancing-Aware Interpretable Deep Learning models for
Treatment Effect Estimation [15.465045049754336]
We propose an upper bound for the bias of average treatment estimation under the strong ignorability assumption.
We implement this upper bound as an objective function being minimized by leveraging a novel additive neural network architecture.
The proposed method is illustrated by re-examining the benchmark datasets for causal inference, and it outperforms the state-of-art.
arXiv Detail & Related papers (2022-03-07T07:42:40Z) - Statistical Inference for High-Dimensional Linear Regression with
Blockwise Missing Data [13.48481978963297]
Blockwise missing data occurs when we integrate multisource or multimodality data where different sources or modalities contain complementary information.
We propose a computationally efficient estimator for the regression coefficient vector based on carefully constructed unbiased estimating equations.
Numerical studies and application analysis of the Alzheimer's Disease Neuroimaging Initiative data show that the proposed method performs better and benefits more from unsupervised samples than existing methods.
arXiv Detail & Related papers (2021-06-07T05:12:42Z) - Efficient Causal Inference from Combined Observational and
Interventional Data through Causal Reductions [68.6505592770171]
Unobserved confounding is one of the main challenges when estimating causal effects.
We propose a novel causal reduction method that replaces an arbitrary number of possibly high-dimensional latent confounders.
We propose a learning algorithm to estimate the parameterized reduced model jointly from observational and interventional data.
arXiv Detail & Related papers (2021-03-08T14:29:07Z) - On Disentangled Representations Learned From Correlated Data [59.41587388303554]
We bridge the gap to real-world scenarios by analyzing the behavior of the most prominent disentanglement approaches on correlated data.
We show that systematically induced correlations in the dataset are being learned and reflected in the latent representations.
We also demonstrate how to resolve these latent correlations, either using weak supervision during training or by post-hoc correcting a pre-trained model with a small number of labels.
arXiv Detail & Related papers (2020-06-14T12:47:34Z) - Enabling Counterfactual Survival Analysis with Balanced Representations [64.17342727357618]
Survival data are frequently encountered across diverse medical applications, i.e., drug development, risk profiling, and clinical trials.
We propose a theoretically grounded unified framework for counterfactual inference applicable to survival outcomes.
arXiv Detail & Related papers (2020-06-14T01:15:00Z) - Performance metrics for intervention-triggering prediction models do not
reflect an expected reduction in outcomes from using the model [71.9860741092209]
Clinical researchers often select among and evaluate risk prediction models.
Standard metrics calculated from retrospective data are only related to model utility under certain assumptions.
When predictions are delivered repeatedly throughout time, the relationship between standard metrics and utility is further complicated.
arXiv Detail & Related papers (2020-06-02T16:26:49Z) - Machine learning for causal inference: on the use of cross-fit
estimators [77.34726150561087]
Doubly-robust cross-fit estimators have been proposed to yield better statistical properties.
We conducted a simulation study to assess the performance of several estimators for the average causal effect (ACE)
When used with machine learning, the doubly-robust cross-fit estimators substantially outperformed all of the other estimators in terms of bias, variance, and confidence interval coverage.
arXiv Detail & Related papers (2020-04-21T23:09:55Z) - Estimating Treatment Effects with Observed Confounders and Mediators [25.338901482522648]
Given a causal graph, the do-calculus can express treatment effects as functionals of the observational joint distribution that can be estimated empirically.
Sometimes the do-calculus identifies multiple valid formulae, prompting us to compare the statistical properties of the corresponding estimators.
In this paper, we investigate the over-identified scenario where both confounders and mediators are observed, rendering both estimators valid.
arXiv Detail & Related papers (2020-03-26T15:50:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.