Causal Reasoning in the Presence of Latent Confounders via Neural ADMG
Learning
- URL: http://arxiv.org/abs/2303.12703v1
- Date: Wed, 22 Mar 2023 16:45:54 GMT
- Title: Causal Reasoning in the Presence of Latent Confounders via Neural ADMG
Learning
- Authors: Matthew Ashman, Chao Ma, Agrin Hilmkil, Joel Jennings, Cheng Zhang
- Abstract summary: Latent confounding has been a long-standing obstacle for causal reasoning from observational data.
We propose a novel neural causal model based on autoregressive flows for ADMG learning.
- Score: 8.649109147825985
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Latent confounding has been a long-standing obstacle for causal reasoning
from observational data. One popular approach is to model the data using
acyclic directed mixed graphs (ADMGs), which describe ancestral relations
between variables using directed and bidirected edges. However, existing
methods using ADMGs are based on either linear functional assumptions or a
discrete search that is complicated to use and lacks computational tractability
for large datasets. In this work, we further extend the existing body of work
and develop a novel gradient-based approach to learning an ADMG with non-linear
functional relations from observational data. We first show that the presence
of latent confounding is identifiable under the assumptions of bow-free ADMGs
with non-linear additive noise models. With this insight, we propose a novel
neural causal model based on autoregressive flows for ADMG learning. This not
only enables us to determine complex causal structural relationships behind the
data in the presence of latent confounding, but also estimate their functional
relationships (hence treatment effects) simultaneously. We further validate our
approach via experiments on both synthetic and real-world datasets, and
demonstrate the competitive performance against relevant baselines.
Related papers
- Graph-based Unsupervised Disentangled Representation Learning via Multimodal Large Language Models [42.17166746027585]
We introduce a bidirectional weighted graph-based framework to learn factorized attributes and their interrelations within complex data.
Specifically, we propose a $beta$-VAE based module to extract factors as the initial nodes of the graph.
By integrating these complementary modules, our model successfully achieves fine-grained, practical and unsupervised disentanglement.
arXiv Detail & Related papers (2024-07-26T15:32:21Z) - PerturbScore: Connecting Discrete and Continuous Perturbations in NLP [64.28423650146877]
In this paper, we aim to connect discrete perturbations with continuous perturbations.
We design a regression task as a PerturbScore to learn the correlation automatically.
We find that we can build a connection between discrete and continuous perturbations and use the proposed PerturbScore to learn such correlation.
arXiv Detail & Related papers (2023-10-13T06:50:15Z) - Spurious Correlations and Where to Find Them [17.1264393170134]
Spurious correlations occur when a model learns unreliable features from the data.
We collect some of the commonly studied hypotheses behind the occurrence of spurious correlations.
We investigate their influence on standard ERM baselines using synthetic datasets generated from causal graphs.
arXiv Detail & Related papers (2023-08-21T21:06:36Z) - Less is More: Mitigate Spurious Correlations for Open-Domain Dialogue
Response Generation Models by Causal Discovery [52.95935278819512]
We conduct the first study on spurious correlations for open-domain response generation models based on a corpus CGDIALOG curated in our work.
Inspired by causal discovery algorithms, we propose a novel model-agnostic method for training and inference of response generation model.
arXiv Detail & Related papers (2023-03-02T06:33:48Z) - Causal Inference via Nonlinear Variable Decorrelation for Healthcare
Applications [60.26261850082012]
We introduce a novel method with a variable decorrelation regularizer to handle both linear and nonlinear confounding.
We employ association rules as new representations using association rule mining based on the original features to increase model interpretability.
arXiv Detail & Related papers (2022-09-29T17:44:14Z) - Spatio-temporally separable non-linear latent factor learning: an
application to somatomotor cortex fMRI data [0.0]
Models of fMRI data that can perform whole-brain discovery of latent factors are understudied.
New methods for efficient spatial weight-sharing are critical to deal with the high dimensionality of the data and the presence of noise.
Our approach is evaluated on data with multiple motor sub-tasks to assess whether the model captures disentangled latent factors that correspond to each sub-task.
arXiv Detail & Related papers (2022-05-26T21:30:22Z) - Diffusion Causal Models for Counterfactual Estimation [18.438307666925425]
We consider the task of counterfactual estimation from observational imaging data given a known causal structure.
We propose Diff-SCM, a deep structural causal model that builds on recent advances of generative energy-based models.
We find that Diff-SCM produces more realistic and minimal counterfactuals than baselines on MNIST data and can also be applied to ImageNet data.
arXiv Detail & Related papers (2022-02-21T12:23:01Z) - Learning Neural Causal Models with Active Interventions [83.44636110899742]
We introduce an active intervention-targeting mechanism which enables a quick identification of the underlying causal structure of the data-generating process.
Our method significantly reduces the required number of interactions compared with random intervention targeting.
We demonstrate superior performance on multiple benchmarks from simulated to real-world data.
arXiv Detail & Related papers (2021-09-06T13:10:37Z) - Disentangling Observed Causal Effects from Latent Confounders using
Method of Moments [67.27068846108047]
We provide guarantees on identifiability and learnability under mild assumptions.
We develop efficient algorithms based on coupled tensor decomposition with linear constraints to obtain scalable and guaranteed solutions.
arXiv Detail & Related papers (2021-01-17T07:48:45Z) - Relation-Guided Representation Learning [53.60351496449232]
We propose a new representation learning method that explicitly models and leverages sample relations.
Our framework well preserves the relations between samples.
By seeking to embed samples into subspace, we show that our method can address the large-scale and out-of-sample problem.
arXiv Detail & Related papers (2020-07-11T10:57:45Z) - Causal Modeling with Stochastic Confounders [11.881081802491183]
This work extends causal inference with confounders.
We propose a new approach to variational estimation for causal inference based on a representer theorem with a random input space.
arXiv Detail & Related papers (2020-04-24T00:34:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.