Approximating Counterfactual Bounds while Fusing Observational, Biased
and Randomised Data Sources
- URL: http://arxiv.org/abs/2307.16577v1
- Date: Mon, 31 Jul 2023 11:28:24 GMT
- Title: Approximating Counterfactual Bounds while Fusing Observational, Biased
and Randomised Data Sources
- Authors: Marco Zaffalon and Alessandro Antonucci and Rafael Caba\~nas and David
Huber
- Abstract summary: We address the problem of integrating data from multiple, possibly biased, observational and interventional studies.
We show that the likelihood of the available data has no local maxima.
We then show how the same approach can address the general case of multiple datasets.
- Score: 64.96984404868411
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We address the problem of integrating data from multiple, possibly biased,
observational and interventional studies, to eventually compute counterfactuals
in structural causal models. We start from the case of a single observational
dataset affected by a selection bias. We show that the likelihood of the
available data has no local maxima. This enables us to use the causal
expectation-maximisation scheme to approximate the bounds for partially
identifiable counterfactual queries, which are the focus of this paper. We then
show how the same approach can address the general case of multiple datasets,
no matter whether interventional or observational, biased or unbiased, by
remapping it into the former one via graphical transformations. Systematic
numerical experiments and a case study on palliative care show the
effectiveness of our approach, while hinting at the benefits of fusing
heterogeneous data sources to get informative outcomes in case of partial
identifiability.
Related papers
- Learning to Bound Counterfactual Inference in Structural Causal Models
from Observational and Randomised Data [64.96984404868411]
We derive a likelihood characterisation for the overall data that leads us to extend a previous EM-based algorithm.
The new algorithm learns to approximate the (unidentifiability) region of model parameters from such mixed data sources.
It delivers interval approximations to counterfactual results, which collapse to points in the identifiable case.
arXiv Detail & Related papers (2022-12-06T12:42:11Z) - Generalizing experimental findings: identification beyond adjustments [2.5889737226898437]
We aim to generalize the results of a randomized controlled trial (RCT) to a target population with the help of some observational data.
We consider examples where the experimental findings cannot be generalized by an adjustment.
We show that the generalization may still be possible by other identification strategies that can be derived by applying do-calculus.
arXiv Detail & Related papers (2022-06-14T09:00:17Z) - Detecting hidden confounding in observational data using multiple
environments [0.81585306387285]
We present a theory for testable conditional independencies that are only absent when there is hidden confounding.
In most cases, the proposed procedure correctly predicts the presence of hidden confounding.
arXiv Detail & Related papers (2022-05-27T12:20:09Z) - The interventional Bayesian Gaussian equivalent score for Bayesian
causal inference with unknown soft interventions [0.0]
In certain settings, such as genomics, we may have data from heterogeneous study conditions, with soft (partial) interventions only pertaining to a subset of the study variables.
We define the interventional BGe score for a mixture of observational and interventional data, where the targets and effects of intervention may be unknown.
arXiv Detail & Related papers (2022-05-05T12:32:08Z) - Combining Observational and Randomized Data for Estimating Heterogeneous
Treatment Effects [82.20189909620899]
Estimating heterogeneous treatment effects is an important problem across many domains.
Currently, most existing works rely exclusively on observational data.
We propose to estimate heterogeneous treatment effects by combining large amounts of observational data and small amounts of randomized data.
arXiv Detail & Related papers (2022-02-25T18:59:54Z) - Adaptive Data Analysis with Correlated Observations [21.969356766737622]
We show that, in some cases, differential privacy guarantees even when there are dependencies within the sample.
We show that the connection between transcript-compression and adaptive data analysis can be extended to the non-iid setting.
arXiv Detail & Related papers (2022-01-21T14:00:30Z) - Multi-Source Causal Inference Using Control Variates [81.57072928775509]
We propose a general algorithm to estimate causal effects from emphmultiple data sources.
We show theoretically that this reduces the variance of the ATE estimate.
We apply this framework to inference from observational data under an outcome selection bias.
arXiv Detail & Related papers (2021-03-30T21:20:51Z) - Bayesian Sparse Factor Analysis with Kernelized Observations [67.60224656603823]
Multi-view problems can be faced with latent variable models.
High-dimensionality and non-linear issues are traditionally handled by kernel methods.
We propose merging both approaches into single model.
arXiv Detail & Related papers (2020-06-01T14:25:38Z) - Learning Overlapping Representations for the Estimation of
Individualized Treatment Effects [97.42686600929211]
Estimating the likely outcome of alternatives from observational data is a challenging problem.
We show that algorithms that learn domain-invariant representations of inputs are often inappropriate.
We develop a deep kernel regression algorithm and posterior regularization framework that substantially outperforms the state-of-the-art on a variety of benchmarks data sets.
arXiv Detail & Related papers (2020-01-14T12:56:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.