Human-in-the-Loop Causal Discovery under Latent Confounding using Ancestral GFlowNets
- URL: http://arxiv.org/abs/2309.12032v2
- Date: Fri, 01 Nov 2024 16:46:49 GMT
- Title: Human-in-the-Loop Causal Discovery under Latent Confounding using Ancestral GFlowNets
- Authors: Tiago da Silva, Eliezer Silva, António Góis, Dominik Heider, Samuel Kaski, Diego Mesquita, Adèle Ribeiro
- Abstract summary: Most causal discovery algorithms do not provide uncertainty estimates, making it hard for users to interpret results and improve the inference process.
We propose to sample (causal) ancestral graphs proportionally to a belief distribution based on a score function, such as the Bayesian information criterion (BIC).
We then introduce an optimal experimental design to iteratively probe the expert about the relations among variables, effectively reducing the uncertainty of our belief over ancestral graphs.
- Score: 15.95243318673688
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Structure learning is the crux of causal inference. Notably, causal discovery (CD) algorithms are brittle when data is scarce, possibly inferring imprecise causal relations that contradict expert knowledge -- especially when considering latent confounders. To aggravate the issue, most CD methods do not provide uncertainty estimates, making it hard for users to interpret results and improve the inference process. Surprisingly, while CD is a human-centered affair, no works have focused on building methods that both 1) output uncertainty estimates that can be verified by experts and 2) interact with those experts to iteratively refine CD. To solve these issues, we start by proposing to sample (causal) ancestral graphs proportionally to a belief distribution based on a score function, such as the Bayesian information criterion (BIC), using generative flow networks. Then, we leverage the diversity in candidate graphs and introduce an optimal experimental design to iteratively probe the expert about the relations among variables, effectively reducing the uncertainty of our belief over ancestral graphs. Finally, we update our samples to incorporate human feedback via importance sampling. Importantly, our method does not require causal sufficiency (i.e., unobserved confounders may exist). Experiments with synthetic observational data show that our method can accurately sample from distributions over ancestral graphs and that we can greatly improve inference quality with human aid.
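
The importance-sampling update mentioned in the abstract can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration rather than the paper's implementation: graphs are represented as sets of directed pairs, the expert is modelled as answering correctly with a fixed probability `reliability`, and the query is chosen by a simple maximum-entropy heuristic that stands in for the paper's optimal experimental design. The names `edge_marginal`, `most_uncertain_relation`, and `reweight` are hypothetical.

```python
import math

def edge_marginal(samples, weights, relation):
    """Weighted fraction of sampled graphs that contain `relation`."""
    return sum(w for g, w in zip(samples, weights) if relation in g)

def binary_entropy(p):
    """Entropy (in bits) of a Bernoulli(p) variable; 0 at p = 0 or p = 1."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log2(p) + (1.0 - p) * math.log2(1.0 - p))

def most_uncertain_relation(samples, weights, candidates):
    """Heuristic query choice (assumption): pick the relation whose
    marginal under the current weighted samples has the highest entropy."""
    return max(
        candidates,
        key=lambda r: binary_entropy(edge_marginal(samples, weights, r)),
    )

def reweight(samples, weights, relation, expert_says_present, reliability=0.9):
    """Importance-sampling update: multiply each weight by the likelihood
    of the expert's answer under that graph, then renormalize.
    The noisy-expert likelihood is an assumed feedback model."""
    updated = []
    for graph, w in zip(samples, weights):
        agrees = (relation in graph) == expert_says_present
        updated.append(w * (reliability if agrees else 1.0 - reliability))
    total = sum(updated)
    return [w / total for w in updated]

# Toy samples standing in for graphs drawn from the belief distribution.
AB, BC = ("A", "B"), ("B", "C")
samples = [frozenset({AB}), frozenset({AB, BC}), frozenset({AB}), frozenset({BC})]
weights = [0.25, 0.25, 0.25, 0.25]

query = most_uncertain_relation(samples, weights, [AB, BC])
new_weights = reweight(samples, weights, query, expert_says_present=True)
```

In this toy setup the relation with marginal closest to 0.5 is queried first, and graphs that disagree with the expert are down-weighted rather than discarded, so an occasionally wrong answer does not eliminate them outright.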
 
      
Related papers
- Learning to Defer for Causal Discovery with Imperfect Experts [59.071731337922664]
 We propose L2D-CD, a method for gauging the correctness of expert recommendations and optimally combining them with data-driven causal discovery results.
We evaluate L2D-CD on the canonical Tübingen pairs dataset and demonstrate its superior performance compared to both the causal discovery method and the expert used in isolation.
 arXiv  Detail & Related papers  (2025-02-18T18:55:53Z)
- Challenges and Considerations in the Evaluation of Bayesian Causal Discovery [49.0053848090947]
 Representing uncertainty in causal discovery is a crucial component for experimental design, and more broadly, for safe and reliable causal decision making.
Unlike non-Bayesian causal discovery, which relies on a single estimated causal graph and model parameters for assessment, Bayesian causal discovery presents evaluation challenges due to the nature of the quantity it infers.
There is no consensus on the most suitable metric for evaluation.
 arXiv  Detail & Related papers  (2024-06-05T12:45:23Z)
- D-BIAS: A Causality-Based Human-in-the-Loop System for Tackling Algorithmic Bias [57.87117733071416]
 We propose D-BIAS, a visual interactive tool that embodies a human-in-the-loop AI approach for auditing and mitigating social biases.
A user can detect the presence of bias against a group by identifying unfair causal relationships in the causal network.
For each interaction, say weakening/deleting a biased causal edge, the system uses a novel method to simulate a new (debiased) dataset.
 arXiv  Detail & Related papers  (2022-08-10T03:41:48Z)
- Active Bayesian Causal Inference [72.70593653185078]
 We propose Active Bayesian Causal Inference (ABCI), a fully-Bayesian active learning framework for integrated causal discovery and reasoning.
ABCI jointly infers a posterior over causal models and queries of interest.
We show that our approach is more data-efficient than several baselines that only focus on learning the full causal graph.
 arXiv  Detail & Related papers  (2022-06-04T22:38:57Z)
- Do Deep Neural Networks Always Perform Better When Eating More Data? [82.6459747000664]
 We design experiments under independent and identically distributed (IID) and out-of-distribution (OOD) conditions.
Under the IID condition, the amount of information determines the effectiveness of each sample, while the contribution of samples and the difference between classes determine the amount of class information.
Under the OOD condition, the cross-domain degree of samples determines their contributions, and the bias-fitting caused by irrelevant elements is a significant cross-domain factor.
 arXiv  Detail & Related papers  (2022-05-30T15:40:33Z)
- Principled Knowledge Extrapolation with GANs [92.62635018136476]
 We study counterfactual synthesis from a new perspective of knowledge extrapolation.
We show that an adversarial game with a closed-form discriminator can be used to address the knowledge extrapolation problem.
Our method enjoys both elegant theoretical guarantees and superior performance in many scenarios.
 arXiv  Detail & Related papers  (2022-05-21T08:39:42Z)
- BayesIMP: Uncertainty Quantification for Causal Data Fusion [52.184885680729224]
 We study the causal data fusion problem, where datasets pertaining to multiple causal graphs are combined to estimate the average treatment effect of a target variable.
We introduce a framework which combines ideas from probabilistic integration and kernel mean embeddings to represent interventional distributions in the reproducing kernel Hilbert space.
 arXiv  Detail & Related papers  (2021-06-07T10:14:18Z)
- Bayesian Model Averaging for Data Driven Decision Making when Causality is Partially Known [0.0]
 We use ensemble methods like Bayesian Model Averaging (BMA) to infer a set of causal graphs.
We provide decisions by computing the expected value and risk of potential interventions explicitly.
 arXiv  Detail & Related papers  (2021-05-12T01:55:45Z)
- FRITL: A Hybrid Method for Causal Discovery in the Presence of Latent Confounders [46.31784571870808]
 We show that under some mild assumptions, the model is uniquely identified by a hybrid method.
Our method leverages the advantages of constraint-based methods and independent noise-based methods to handle both confounded and unconfounded situations.
 arXiv  Detail & Related papers  (2021-03-26T03:12:14Z)
- MissDeepCausal: Causal Inference from Incomplete Data Using Deep Latent Variable Models [14.173184309520453]
 State-of-the-art methods for causal inference don't consider missing values.
Missing data require an adapted unconfoundedness hypothesis.
Latent confounders whose distribution is learned through variational autoencoders adapted to missing values are considered.
 arXiv  Detail & Related papers  (2020-02-25T12:58:07Z)
This list is automatically generated from the titles and abstracts of the papers on this site.
       
     