CCSL: A Causal Structure Learning Method from Multiple Unknown
Environments
- URL: http://arxiv.org/abs/2111.09666v1
- Date: Thu, 18 Nov 2021 12:50:53 GMT
- Title: CCSL: A Causal Structure Learning Method from Multiple Unknown
Environments
- Authors: Wei Chen, Yunjin Wu, Ruichu Cai, Yueguo Chen, Zhifeng Hao
- Abstract summary: We propose a unified Causal Cluster Structures Learning (named CCSL) method for causal discovery from non-i.i.d. data.
This method simultaneously integrates the following two tasks: 1) clustering subjects with the same causal mechanism; 2) learning causal structures from the samples of subjects.
- Score: 32.61349047509467
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Most existing causal structure learning methods require data to be
independent and identically distributed (i.i.d.), which often cannot be
guaranteed when the data come from different environments. Some previous
efforts try to tackle this problem in two independent stages, i.e., first
discovering i.i.d. clusters from non-i.i.d. samples, then learning the causal
structures from different groups. This straightforward solution ignores the
intrinsic connections between the two stages, that is both the clustering stage
and the learning stage should be guided by the same causal mechanism. Towards
this end, we propose a unified Causal Cluster Structures Learning (named CCSL)
method for causal discovery from non-i.i.d. data. This method simultaneously
integrates the following two tasks: 1) clustering subjects with the same causal
mechanism; 2) learning causal structures from the samples of subjects.
Specifically, for the former, we provide a Causality-related Chinese Restaurant
Process to cluster samples based on the similarity of the causal structure; for
the latter, we introduce a variational-inference-based approach to learn the
causal structures. Theoretical results provide identification of the causal
model and the clustering model under the linear non-Gaussian assumption.
Experimental results on both simulated and real-world data further validate the
correctness and effectiveness of the proposed method.
Related papers
- Towards Causal Foundation Model: on Duality between Causal Inference and Attention [18.046388712804042]
We take a first step towards building causally-aware foundation models for treatment effect estimations.
We propose a novel, theoretically justified method called Causal Inference with Attention (CInA)
arXiv Detail & Related papers (2023-10-01T22:28:34Z) - Inducing Causal Structure for Abstractive Text Summarization [76.1000380429553]
We introduce a Structural Causal Model (SCM) to induce the underlying causal structure of the summarization data.
We propose a Causality Inspired Sequence-to-Sequence model (CI-Seq2Seq) to learn the causal representations that can mimic the causal factors.
Experimental results on two widely used text summarization datasets demonstrate the advantages of our approach.
arXiv Detail & Related papers (2023-08-24T16:06:36Z) - Learning Latent Structural Causal Models [31.686049664958457]
In machine learning tasks, one often operates on low-level data like image pixels or high-dimensional vectors.
We present a tractable approximate inference method which performs joint inference over the causal variables, structure and parameters of the latent Structural Causal Model.
arXiv Detail & Related papers (2022-10-24T20:09:44Z) - Amortized Inference for Causal Structure Learning [72.84105256353801]
Learning causal structure poses a search problem that typically involves evaluating structures using a score or independence test.
We train a variational inference model to predict the causal structure from observational/interventional data.
Our models exhibit robust generalization capabilities under substantial distribution shift.
arXiv Detail & Related papers (2022-05-25T17:37:08Z) - Causal Discovery in Linear Structural Causal Models with Deterministic
Relations [27.06618125828978]
We focus on the task of causal discovery form observational data.
We derive a set of necessary and sufficient conditions for unique identifiability of the causal structure.
arXiv Detail & Related papers (2021-10-30T21:32:42Z) - Systematic Evaluation of Causal Discovery in Visual Model Based
Reinforcement Learning [76.00395335702572]
A central goal for AI and causality is the joint discovery of abstract representations and causal structure.
Existing environments for studying causal induction are poorly suited for this objective because they have complicated task-specific causal graphs.
In this work, our goal is to facilitate research in learning representations of high-level variables as well as causal structures among them.
arXiv Detail & Related papers (2021-07-02T05:44:56Z) - The Causal Neural Connection: Expressiveness, Learnability, and
Inference [125.57815987218756]
An object called structural causal model (SCM) represents a collection of mechanisms and sources of random variation of the system under investigation.
In this paper, we show that the causal hierarchy theorem (Thm. 1, Bareinboim et al., 2020) still holds for neural models.
We introduce a special type of SCM called a neural causal model (NCM), and formalize a new type of inductive bias to encode structural constraints necessary for performing causal inferences.
arXiv Detail & Related papers (2021-07-02T01:55:18Z) - A Distance Covariance-based Kernel for Nonlinear Causal Clustering in
Heterogeneous Populations [1.2763567932588586]
We introduce a distance covariance-based kernel designed specifically to measure the similarity between the underlying nonlinear causal structures of different samples.
This kernel enables us to perform clustering to identify the homogeneous subpopulations.
We demonstrate using our kernel for causal clustering with an application in genetics, allowing us to reason about the latent transcription factor networks regulating measured gene expression levels.
arXiv Detail & Related papers (2021-06-07T10:16:34Z) - Few-shot Visual Reasoning with Meta-analogical Contrastive Learning [141.2562447971]
We propose to solve a few-shot (or low-shot) visual reasoning problem, by resorting to analogical reasoning.
We extract structural relationships between elements in both domains, and enforce them to be as similar as possible with analogical learning.
We validate our method on RAVEN dataset, on which it outperforms state-of-the-art method, with larger gains when the training data is scarce.
arXiv Detail & Related papers (2020-07-23T14:00:34Z) - A Critical View of the Structural Causal Model [89.43277111586258]
We show that one can identify the cause and the effect without considering their interaction at all.
We propose a new adversarial training method that mimics the disentangled structure of the causal model.
Our multidimensional method outperforms the literature methods on both synthetic and real world datasets.
arXiv Detail & Related papers (2020-02-23T22:52:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.