Causal Component Analysis
- URL: http://arxiv.org/abs/2305.17225v3
- Date: Wed, 17 Jan 2024 14:22:19 GMT
- Title: Causal Component Analysis
- Authors: Liang Wendong, Armin Keki\'c, Julius von K\"ugelgen, Simon Buchholz,
Michel Besserve, Luigi Gresele, Bernhard Sch\"olkopf
- Abstract summary: We introduce an intermediate problem termed Causal Component Analysis (CauCA)
CauCA can be viewed as a generalization of Independent Component Analysis (ICA)
We demonstrate its effectiveness through extensive synthetic experiments in the CauCA and ICA setting.
- Score: 20.570470860880125
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Independent Component Analysis (ICA) aims to recover independent latent
variables from observed mixtures thereof. Causal Representation Learning (CRL)
aims instead to infer causally related (thus often statistically dependent)
latent variables, together with the unknown graph encoding their causal
relationships. We introduce an intermediate problem termed Causal Component
Analysis (CauCA). CauCA can be viewed as a generalization of ICA, modelling the
causal dependence among the latent components, and as a special case of CRL. In
contrast to CRL, it presupposes knowledge of the causal graph, focusing solely
on learning the unmixing function and the causal mechanisms. Any impossibility
results regarding the recovery of the ground truth in CauCA also apply for CRL,
while possibility results may serve as a stepping stone for extensions to CRL.
We characterize CauCA identifiability from multiple datasets generated through
different types of interventions on the latent causal variables. As a
corollary, this interventional perspective also leads to new identifiability
results for nonlinear ICA -- a special case of CauCA with an empty graph --
requiring strictly fewer datasets than previous results. We introduce a
likelihood-based approach using normalizing flows to estimate both the unmixing
function and the causal mechanisms, and demonstrate its effectiveness through
extensive synthetic experiments in the CauCA and ICA setting.
Related papers
- DeCaFlow: A Deconfounding Causal Generative Model [58.411886466157185]
Causal generative models (CGMs) have recently emerged as capable approaches to simulate the causal mechanisms generating our observations.
We introduce DeCaFlow, a CGM that accounts for hidden confounders in a single amortized training process using only observational data and the causal graph.
For the first time to our knowledge, we show that a confounded counterfactual query is identifiable, and thus solvable by DeCaFlow.
arXiv Detail & Related papers (2025-03-19T11:14:16Z) - On the Identifiability of Causal Abstractions [15.785002371773139]
Causal representation learning enhances machine learning models' robustness and generalizability.
We focus on a family of CRL methods that uses contrastive data pairs in the observable space to identify the latent causal model.
We introduce a theoretical framework that calculates the degree to which we can identify a causal model, given a set of possible interventions.
arXiv Detail & Related papers (2025-03-13T19:34:05Z) - Nonparametric Identifiability of Causal Representations from Unknown
Interventions [63.1354734978244]
We study causal representation learning, the task of inferring latent causal variables and their causal relations from mixtures of the variables.
Our goal is to identify both the ground truth latents and their causal graph up to a set of ambiguities which we show to be irresolvable from interventional data.
arXiv Detail & Related papers (2023-06-01T10:51:58Z) - Active Bayesian Causal Inference [72.70593653185078]
We propose Active Bayesian Causal Inference (ABCI), a fully-Bayesian active learning framework for integrated causal discovery and reasoning.
ABCI jointly infers a posterior over causal models and queries of interest.
We show that our approach is more data-efficient than several baselines that only focus on learning the full causal graph.
arXiv Detail & Related papers (2022-06-04T22:38:57Z) - Effect Identification in Cluster Causal Diagrams [51.42809552422494]
We introduce a new type of graphical model called cluster causal diagrams (for short, C-DAGs)
C-DAGs allow for the partial specification of relationships among variables based on limited prior knowledge.
We develop the foundations and machinery for valid causal inferences over C-DAGs.
arXiv Detail & Related papers (2022-02-22T21:27:31Z) - Multi-modality fusion using canonical correlation analysis methods:
Application in breast cancer survival prediction from histology and genomics [16.537929113715432]
We study the use of canonical correlation analysis (CCA) and penalized variants of CCA for the fusion of two modalities.
We analytically show that, with known model parameters, posterior mean estimators that jointly use both modalities outperform arbitrary linear mixing of single modality posterior estimators in latent variable prediction.
arXiv Detail & Related papers (2021-11-27T21:18:01Z) - Causal Discovery in Linear Structural Causal Models with Deterministic
Relations [27.06618125828978]
We focus on the task of causal discovery form observational data.
We derive a set of necessary and sufficient conditions for unique identifiability of the causal structure.
arXiv Detail & Related papers (2021-10-30T21:32:42Z) - Discovering Latent Causal Variables via Mechanism Sparsity: A New
Principle for Nonlinear ICA [81.4991350761909]
Independent component analysis (ICA) refers to an ensemble of methods which formalize this goal and provide estimation procedure for practical application.
We show that the latent variables can be recovered up to a permutation if one regularizes the latent mechanisms to be sparse.
arXiv Detail & Related papers (2021-07-21T14:22:14Z) - Disentangling Observed Causal Effects from Latent Confounders using
Method of Moments [67.27068846108047]
We provide guarantees on identifiability and learnability under mild assumptions.
We develop efficient algorithms based on coupled tensor decomposition with linear constraints to obtain scalable and guaranteed solutions.
arXiv Detail & Related papers (2021-01-17T07:48:45Z) - On Disentangled Representations Learned From Correlated Data [59.41587388303554]
We bridge the gap to real-world scenarios by analyzing the behavior of the most prominent disentanglement approaches on correlated data.
We show that systematically induced correlations in the dataset are being learned and reflected in the latent representations.
We also demonstrate how to resolve these latent correlations, either using weak supervision during training or by post-hoc correcting a pre-trained model with a small number of labels.
arXiv Detail & Related papers (2020-06-14T12:47:34Z) - Probabilistic Canonical Correlation Analysis for Sparse Count Data [3.1753001245931323]
Canonical correlation analysis is an important technique for exploring the relationship between two sets of continuous variables.
We propose a model-based probabilistic approach for correlation and canonical correlation estimation for two sparse count data sets.
arXiv Detail & Related papers (2020-05-11T02:19:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.