Distributional Instrumental Variable Method
- URL: http://arxiv.org/abs/2502.07641v2
- Date: Tue, 18 Feb 2025 13:02:39 GMT
- Title: Distributional Instrumental Variable Method
- Authors: Anastasiia Holovchak, Sorawit Saengkyongam, Nicolai Meinshausen, Xinwei Shen,
- Abstract summary: The aim of this work is to estimate the entire interventional distribution.
We propose a method called Distributional Instrumental Variable (DIV), which uses generative modelling in a nonlinear IV setting.
- Score: 4.34680331569334
- License:
- Abstract: The instrumental variable (IV) approach is commonly used to infer causal effects in the presence of unmeasured confounding. Existing methods typically aim to estimate the mean causal effects, whereas a few other methods focus on quantile treatment effects. The aim of this work is to estimate the entire interventional distribution. We propose a method called Distributional Instrumental Variable (DIV), which uses generative modelling in a nonlinear IV setting. We establish identifiability of the interventional distribution under general assumptions and demonstrate an 'under-identified' case, where DIV can identify the causal effects while two-step least squares fails to. Our empirical results show that the DIV method performs well for a broad range of simulated data, exhibiting advantages over existing IV approaches in terms of the identifiability and estimation error of the mean or quantile treatment effects. Furthermore, we apply DIV to an economic data set to examine the causal relation between institutional quality and economic development and our results align well with the original study. We also apply DIV to a single-cell data set, where we study the generalizability and stability in predicting gene expression under unseen interventions. The software implementations of DIV are available in R and Python.
Related papers
- Targeted Data Fusion for Causal Survival Analysis Under Distribution Shift [46.84912148188679]
Causal inference has the potential to improve the generalizability, transportability, and replicability of scientific findings.
Existing data fusion methods focus on binary or continuous outcomes.
We propose two novel approaches for multi-source causal survival analysis.
arXiv Detail & Related papers (2025-01-30T23:21:25Z) - Estimating Individual Dose-Response Curves under Unobserved Confounders from Observational Data [6.166869525631879]
We present ContiVAE, a novel framework for estimating causal effects of continuous treatments, measured by individual dose-response curves.
We show that ContiVAE outperforms existing methods by up to 62%, demonstrating its robustness and flexibility.
arXiv Detail & Related papers (2024-10-21T07:24:26Z) - Targeted Cause Discovery with Data-Driven Learning [66.86881771339145]
We propose a novel machine learning approach for inferring causal variables of a target variable from observations.
We employ a neural network trained to identify causality through supervised learning on simulated data.
Empirical results demonstrate the effectiveness of our method in identifying causal relationships within large-scale gene regulatory networks.
arXiv Detail & Related papers (2024-08-29T02:21:11Z) - Meta-Learners for Partially-Identified Treatment Effects Across Multiple Environments [67.80453452949303]
Estimating the conditional average treatment effect (CATE) from observational data is relevant for many applications such as personalized medicine.
Here, we focus on the widespread setting where the observational data come from multiple environments.
We propose different model-agnostic learners (so-called meta-learners) to estimate the bounds that can be used in combination with arbitrary machine learning models.
arXiv Detail & Related papers (2024-06-04T16:31:43Z) - Interpretable Causal Inference for Analyzing Wearable, Sensor, and Distributional Data [62.56890808004615]
We develop an interpretable method for distributional data analysis that ensures trustworthy and robust decision-making.
We demonstrate ADD MALTS' utility by studying the effectiveness of continuous glucose monitors in mitigating diabetes risks.
arXiv Detail & Related papers (2023-12-17T00:42:42Z) - Nonparametric Identifiability of Causal Representations from Unknown
Interventions [63.1354734978244]
We study causal representation learning, the task of inferring latent causal variables and their causal relations from mixtures of the variables.
Our goal is to identify both the ground truth latents and their causal graph up to a set of ambiguities which we show to be irresolvable from interventional data.
arXiv Detail & Related papers (2023-06-01T10:51:58Z) - Instrumental Variables in Causal Inference and Machine Learning: A
Survey [26.678154268037595]
Causal inference is a process of using assumptions to draw conclusions about the causal relationships between variables based on data.
A growing literature in both causal inference and machine learning proposes to use Instrumental Variables (IV)
This paper serves as the first effort to systematically and comprehensively introduce and discuss the IV methods and their applications in both causal inference and machine learning.
arXiv Detail & Related papers (2022-12-12T08:59:04Z) - Causal Inference with Conditional Instruments using Deep Generative
Models [21.771832598942677]
A standard IV is expected to be related to the treatment variable and independent of all other variables in the system.
conditional IV (CIV) method has been proposed to allow a variable to be an instrument conditioning on a set of variables.
We propose to learn the representations of a CIV and its conditioning set from data with latent confounders for average causal effect estimation.
arXiv Detail & Related papers (2022-11-29T14:31:54Z) - Discovering Ancestral Instrumental Variables for Causal Inference from
Observational Data [0.0]
Instrumental variable (IV) is a powerful approach to inferring the causal effect of a treatment on an outcome of interest from observational data.
Existing IV methods require that an IV is selected and justified with domain knowledge.
In this paper, we study and design a data-driven algorithm to discover valid IVs from data under mild assumptions.
arXiv Detail & Related papers (2022-06-04T07:48:13Z) - The interventional Bayesian Gaussian equivalent score for Bayesian
causal inference with unknown soft interventions [0.0]
In certain settings, such as genomics, we may have data from heterogeneous study conditions, with soft (partial) interventions only pertaining to a subset of the study variables.
We define the interventional BGe score for a mixture of observational and interventional data, where the targets and effects of intervention may be unknown.
arXiv Detail & Related papers (2022-05-05T12:32:08Z) - Efficient Causal Inference from Combined Observational and
Interventional Data through Causal Reductions [68.6505592770171]
Unobserved confounding is one of the main challenges when estimating causal effects.
We propose a novel causal reduction method that replaces an arbitrary number of possibly high-dimensional latent confounders.
We propose a learning algorithm to estimate the parameterized reduced model jointly from observational and interventional data.
arXiv Detail & Related papers (2021-03-08T14:29:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.