Related papers: Instrumental Variables in Causal Inference and Machine Learning: A Survey

Instrumental Variables in Causal Inference and Machine Learning: A Survey

URL: http://arxiv.org/abs/2212.05778v1
Date: Mon, 12 Dec 2022 08:59:04 GMT
Title: Instrumental Variables in Causal Inference and Machine Learning: A Survey
Authors: Anpeng Wu, Kun Kuang, Ruoxuan Xiong, Fei Wu
Abstract summary: Causal inference is a process of using assumptions to draw conclusions about the causal relationships between variables based on data. A growing literature in both causal inference and machine learning proposes to use Instrumental Variables (IV) This paper serves as the first effort to systematically and comprehensively introduce and discuss the IV methods and their applications in both causal inference and machine learning.
Score: 26.678154268037595
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Causal inference is the process of using assumptions, study designs, and estimation strategies to draw conclusions about the causal relationships between variables based on data. This allows researchers to better understand the underlying mechanisms at work in complex systems and make more informed decisions. In many settings, we may not fully observe all the confounders that affect both the treatment and outcome variables, complicating the estimation of causal effects. To address this problem, a growing literature in both causal inference and machine learning proposes to use Instrumental Variables (IV). This paper serves as the first effort to systematically and comprehensively introduce and discuss the IV methods and their applications in both causal inference and machine learning. First, we provide the formal definition of IVs and discuss the identification problem of IV regression methods under different assumptions. Second, we categorize the existing work on IV methods into three streams according to the focus on the proposed methods, including two-stage least squares with IVs, control function with IVs, and evaluation of IVs. For each stream, we present both the classical causal inference methods, and recent developments in the machine learning literature. Then, we introduce a variety of applications of IV methods in real-world scenarios and provide a summary of the available datasets and algorithms. Finally, we summarize the literature, discuss the open problems and suggest promising future research directions for IV methods and their applications. We also develop a toolkit of IVs methods reviewed in this survey at https://github.com/causal-machine-learning-lab/mliv.

Related papers

Flow IV: Counterfactual Inference In Nonseparable Outcome Models Using Instrumental Variables [2.3213238782019316]
We show that under standard IV assumptions, along with the assumptions that latent noises in treatment and outcome are strictly monotonic and jointly Gaussian, the treatment-outcome relationship becomes uniquely identifiable from observed data.<n>This enables counterfactual inference even in nonseparable models.<n>We implement our approach by training a normalizing flow to maximize the likelihood of the observed data, demonstrating accurate recovery of the underlying outcome function.
arXiv Detail & Related papers (2025-08-02T11:24:03Z)
Distributional Instrumental Variable Method [4.34680331569334]
The aim of this work is to estimate the entire interventional distribution. We propose a method called Distributional Instrumental Variable (DIV), which uses generative modelling in a nonlinear IV setting.
arXiv Detail & Related papers (2025-02-11T15:33:06Z)
Disentangled Representation Learning for Causal Inference with Instruments [31.67220687652054]
Existing IV based estimators need a known IV or other strong assumptions, such as the existence of two or more IVs in the system. In this paper, we consider a relaxed requirement, which assumes there is an IV proxy in the system without knowing which variable is the proxy. We propose a Variational AutoEncoder (VAE) based disentangled representation learning method to learn an IV representation from a dataset with latent confounders.
arXiv Detail & Related papers (2024-12-05T22:18:48Z)
Mining Causality: AI-Assisted Search for Instrumental Variables [0.0]
We propose using large language models to search for new IVs through narratives and counterfactual reasoning. We argue that multi-step and role-playing prompting strategies are effective for the endogenous decision-making processes of economic agents.
arXiv Detail & Related papers (2024-09-21T17:19:29Z)
Regularized DeepIV with Model Selection [72.17508967124081]
Regularized DeepIV (RDIV) regression can converge to the least-norm IV solution. Our method matches the current state-of-the-art convergence rate.
arXiv Detail & Related papers (2024-03-07T05:38:56Z)
SoK: Privacy-Preserving Data Synthesis [72.92263073534899]
This paper focuses on privacy-preserving data synthesis (PPDS) by providing a comprehensive overview, analysis, and discussion of the field. We put forth a master recipe that unifies two prominent strands of research in PPDS: statistical methods and deep learning (DL)-based methods.
arXiv Detail & Related papers (2023-07-05T08:29:31Z)
Instrumental Variable Learning for Chest X-ray Classification [52.68170685918908]
We propose an interpretable instrumental variable (IV) learning framework to eliminate the spurious association and obtain accurate causal representation. Our approach's performance is demonstrated using the MIMIC-CXR, NIH ChestX-ray 14, and CheXpert datasets.
arXiv Detail & Related papers (2023-05-20T03:12:23Z)
Causal Inference with Conditional Instruments using Deep Generative Models [21.771832598942677]
A standard IV is expected to be related to the treatment variable and independent of all other variables in the system. conditional IV (CIV) method has been proposed to allow a variable to be an instrument conditioning on a set of variables. We propose to learn the representations of a CIV and its conditioning set from data with latent confounders for average causal effect estimation.
arXiv Detail & Related papers (2022-11-29T14:31:54Z)
Discovering Ancestral Instrumental Variables for Causal Inference from Observational Data [0.0]
Instrumental variable (IV) is a powerful approach to inferring the causal effect of a treatment on an outcome of interest from observational data. Existing IV methods require that an IV is selected and justified with domain knowledge. In this paper, we study and design a data-driven algorithm to discover valid IVs from data under mild assumptions.
arXiv Detail & Related papers (2022-06-04T07:48:13Z)
Ancestral Instrument Method for Causal Inference without Complete Knowledge [0.0]
Unobserved confounding is the main obstacle to causal effect estimation from observational data. Conditional IVs have been proposed to relax the requirement of standard IVs by conditioning on a set of observed variables. We develop an algorithm for unbiased causal effect estimation with a given ancestral IV and observational data.
arXiv Detail & Related papers (2022-01-11T07:02:16Z)
Auto IV: Counterfactual Prediction via Automatic Instrumental Variable Decomposition [21.90157954233519]
Instrumental variables (IVs) play an important role in causal inference with unobserved confounders. Existing IV-based counterfactual prediction methods need well-predefined IVs. We propose a novel algorithm to automatically generate representations serving the role of IVs from observed variables.
arXiv Detail & Related papers (2021-07-13T07:30:21Z)
Individual Explanations in Machine Learning Models: A Case Study on Poverty Estimation [63.18666008322476]
Machine learning methods are being increasingly applied in sensitive societal contexts. The present case study has two main objectives. First, to expose these challenges and how they affect the use of relevant and novel explanations methods. And second, to present a set of strategies that mitigate such challenges, as faced when implementing explanation methods in a relevant application domain.
arXiv Detail & Related papers (2021-04-09T01:54:58Z)
Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning [107.70165026669308]
In offline reinforcement learning (RL) an optimal policy is learned solely from a priori collected observational data. We study a confounded Markov decision process where the transition dynamics admit an additive nonlinear functional form. We propose a provably efficient IV-aided Value Iteration (IVVI) algorithm based on a primal-dual reformulation of the conditional moment restriction.
arXiv Detail & Related papers (2021-02-19T13:01:40Z)
A Survey on Causal Inference [64.45536158710014]
Causal inference is a critical research topic across many domains, such as statistics, computer science, education, public policy and economics. Various causal effect estimation methods for observational data have sprung up.
arXiv Detail & Related papers (2020-02-05T21:35:29Z)

This list is automatically generated from the titles and abstracts of the papers in this site.