Instrumental Variables in Causal Inference and Machine Learning: A
Survey
- URL: http://arxiv.org/abs/2212.05778v1
- Date: Mon, 12 Dec 2022 08:59:04 GMT
- Title: Instrumental Variables in Causal Inference and Machine Learning: A
Survey
- Authors: Anpeng Wu, Kun Kuang, Ruoxuan Xiong, Fei Wu
- Abstract summary: Causal inference is a process of using assumptions to draw conclusions about the causal relationships between variables based on data.
A growing literature in both causal inference and machine learning proposes to use Instrumental Variables (IV)
This paper serves as the first effort to systematically and comprehensively introduce and discuss the IV methods and their applications in both causal inference and machine learning.
- Score: 26.678154268037595
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Causal inference is the process of using assumptions, study designs, and
estimation strategies to draw conclusions about the causal relationships
between variables based on data. This allows researchers to better understand
the underlying mechanisms at work in complex systems and make more informed
decisions. In many settings, we may not fully observe all the confounders that
affect both the treatment and outcome variables, complicating the estimation of
causal effects. To address this problem, a growing literature in both causal
inference and machine learning proposes to use Instrumental Variables (IV).
This paper serves as the first effort to systematically and comprehensively
introduce and discuss the IV methods and their applications in both causal
inference and machine learning. First, we provide the formal definition of IVs
and discuss the identification problem of IV regression methods under different
assumptions. Second, we categorize the existing work on IV methods into three
streams according to the focus on the proposed methods, including two-stage
least squares with IVs, control function with IVs, and evaluation of IVs. For
each stream, we present both the classical causal inference methods, and recent
developments in the machine learning literature. Then, we introduce a variety
of applications of IV methods in real-world scenarios and provide a summary of
the available datasets and algorithms. Finally, we summarize the literature,
discuss the open problems and suggest promising future research directions for
IV methods and their applications. We also develop a toolkit of IVs methods
reviewed in this survey at https://github.com/causal-machine-learning-lab/mliv.
Related papers
- Mining Causality: AI-Assisted Search for Instrumental Variables [0.0]
We propose using large language models to search for new IVs through narratives and counterfactual reasoning.
We demonstrate how to construct prompts to search for potentially valid IVs.
We apply our method to three well-known examples in economics: returns to schooling, production functions, and peer effects.
arXiv Detail & Related papers (2024-09-21T17:19:29Z) - Regularized DeepIV with Model Selection [72.17508967124081]
Regularized DeepIV (RDIV) regression can converge to the least-norm IV solution.
Our method matches the current state-of-the-art convergence rate.
arXiv Detail & Related papers (2024-03-07T05:38:56Z) - SoK: Privacy-Preserving Data Synthesis [72.92263073534899]
This paper focuses on privacy-preserving data synthesis (PPDS) by providing a comprehensive overview, analysis, and discussion of the field.
We put forth a master recipe that unifies two prominent strands of research in PPDS: statistical methods and deep learning (DL)-based methods.
arXiv Detail & Related papers (2023-07-05T08:29:31Z) - Instrumental Variable Learning for Chest X-ray Classification [52.68170685918908]
We propose an interpretable instrumental variable (IV) learning framework to eliminate the spurious association and obtain accurate causal representation.
Our approach's performance is demonstrated using the MIMIC-CXR, NIH ChestX-ray 14, and CheXpert datasets.
arXiv Detail & Related papers (2023-05-20T03:12:23Z) - Causal Inference with Conditional Instruments using Deep Generative
Models [21.771832598942677]
A standard IV is expected to be related to the treatment variable and independent of all other variables in the system.
conditional IV (CIV) method has been proposed to allow a variable to be an instrument conditioning on a set of variables.
We propose to learn the representations of a CIV and its conditioning set from data with latent confounders for average causal effect estimation.
arXiv Detail & Related papers (2022-11-29T14:31:54Z) - Discovering Ancestral Instrumental Variables for Causal Inference from
Observational Data [0.0]
Instrumental variable (IV) is a powerful approach to inferring the causal effect of a treatment on an outcome of interest from observational data.
Existing IV methods require that an IV is selected and justified with domain knowledge.
In this paper, we study and design a data-driven algorithm to discover valid IVs from data under mild assumptions.
arXiv Detail & Related papers (2022-06-04T07:48:13Z) - Ancestral Instrument Method for Causal Inference without Complete
Knowledge [0.0]
Unobserved confounding is the main obstacle to causal effect estimation from observational data.
Conditional IVs have been proposed to relax the requirement of standard IVs by conditioning on a set of observed variables.
We develop an algorithm for unbiased causal effect estimation with a given ancestral IV and observational data.
arXiv Detail & Related papers (2022-01-11T07:02:16Z) - Auto IV: Counterfactual Prediction via Automatic Instrumental Variable
Decomposition [21.90157954233519]
Instrumental variables (IVs) play an important role in causal inference with unobserved confounders.
Existing IV-based counterfactual prediction methods need well-predefined IVs.
We propose a novel algorithm to automatically generate representations serving the role of IVs from observed variables.
arXiv Detail & Related papers (2021-07-13T07:30:21Z) - Individual Explanations in Machine Learning Models: A Case Study on
Poverty Estimation [63.18666008322476]
Machine learning methods are being increasingly applied in sensitive societal contexts.
The present case study has two main objectives. First, to expose these challenges and how they affect the use of relevant and novel explanations methods.
And second, to present a set of strategies that mitigate such challenges, as faced when implementing explanation methods in a relevant application domain.
arXiv Detail & Related papers (2021-04-09T01:54:58Z) - Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning [107.70165026669308]
In offline reinforcement learning (RL) an optimal policy is learned solely from a priori collected observational data.
We study a confounded Markov decision process where the transition dynamics admit an additive nonlinear functional form.
We propose a provably efficient IV-aided Value Iteration (IVVI) algorithm based on a primal-dual reformulation of the conditional moment restriction.
arXiv Detail & Related papers (2021-02-19T13:01:40Z) - A Survey on Causal Inference [64.45536158710014]
Causal inference is a critical research topic across many domains, such as statistics, computer science, education, public policy and economics.
Various causal effect estimation methods for observational data have sprung up.
arXiv Detail & Related papers (2020-02-05T21:35:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.