Causal Order Identification to Address Confounding: Binary Variables
- URL: http://arxiv.org/abs/2108.04947v1
- Date: Tue, 10 Aug 2021 22:09:43 GMT
- Title: Causal Order Identification to Address Confounding: Binary Variables
- Authors: Joe Suzuki and Yusuke Inaoka
- Abstract summary: This paper considers an extension of the linear non-Gaussian acyclic model (LiNGAM).
LiNGAM determines the causal order among variables from a dataset when the variables are expressed by a set of linear equations, including noise.
- Score: 4.56877715768796
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper considers an extension of the linear non-Gaussian acyclic model
(LiNGAM) that determines the causal order among variables from a dataset when
the variables are expressed by a set of linear equations, including noise. In
particular, we assume that the variables are binary. The existing LiNGAM
assumes that no confounding is present, which is restrictive in practice. Based
on the concept of independent component analysis (ICA), this paper proposes an
extended framework in which the mutual information among the noises is
minimized. Another significant contribution is to reduce the task to a
shortest-path problem: the distance between each pair of nodes expresses an
associated mutual information value, and the path with the minimum sum (KL
divergence) is sought. Although $p!$ mutual information values would need to be
compared, this paper dramatically reduces the computation when no confounding
is present. The proposed algorithm finds the globally optimal solution, whereas
existing methods greedily seek the order in a local manner based on hypothesis
testing. For mutual information estimation, we use the best estimator in the
Bayes/MDL sense, which correctly detects independence. Experiments using artificial
and actual data show that the proposed version of LiNGAM achieves significantly
better performance, particularly when confounding is present.
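The reduction described in the abstract, where each candidate causal order is scored by a sum of mutual information (KL divergence) terms and the minimum-sum path is sought, can be illustrated with a dynamic program over variable subsets. The sketch below is a minimal illustration under assumptions not taken from the paper: the names `plugin_mi`, `step_cost`, and `best_causal_order` are hypothetical, a plug-in mutual information estimator and a pairwise per-step cost stand in for the Bayes/MDL estimator applied to the model's noises, and the subset-based search is one generic way to avoid enumerating all $p!$ orders rather than the paper's exact procedure.

```python
# Sketch: shortest-path / dynamic-programming search for a causal order,
# with mutual-information-based edge weights. All costs here are simplified
# stand-ins for the paper's Bayes/MDL noise-based criterion.
import numpy as np

def plugin_mi(x, y):
    """Plug-in estimate of mutual information (in nats) between two binary vectors."""
    mi = 0.0
    for a in (0, 1):
        for b in (0, 1):
            p_ab = np.mean((x == a) & (y == b))
            p_a, p_b = np.mean(x == a), np.mean(y == b)
            if p_ab > 0:
                mi += p_ab * np.log(p_ab / (p_a * p_b))
    return mi

def step_cost(prefix, j, X):
    """Hypothetical edge weight: total MI between candidate variable j and the
    variables already placed in `prefix` (a stand-in for the paper's cost)."""
    return sum(plugin_mi(X[:, i], X[:, j]) for i in prefix)

def best_causal_order(X, cost=step_cost):
    """Shortest-path search over orderings via dynamic programming on subsets.

    The state is the set of variables already placed, so O(p * 2^p) edges are
    examined instead of all p! orders; each edge is weighted by `cost`.
    """
    p = X.shape[1]
    best = {frozenset(): (0.0, [])}  # placed subset -> (total cost, order so far)
    for size in range(p):
        frontier = {}
        for S, (c, order) in best.items():
            if len(S) != size:
                continue
            for j in range(p):
                if j in S:
                    continue
                S2, c2 = frozenset(S | {j}), c + cost(order, j, X)
                if S2 not in frontier or c2 < frontier[S2][0]:
                    frontier[S2] = (c2, order + [j])
        best.update(frontier)
    return best[frozenset(range(p))]

# Usage: X is an (n, p) array of 0/1 values; returns (total cost, estimated order).
# total, order = best_causal_order(np.random.binomial(1, 0.5, size=(200, 4)))
```

The subset-based dynamic program is only meant to convey why the search need not compare all $p!$ orders explicitly; the paper's actual saving is stated for the unconfounded case and uses its own cost function.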
Related papers
- Pareto Set Identification With Posterior Sampling [14.121842087273167]
This paper studies PSI in the transductive linear setting with potentially correlated objectives.
We propose the PSIPS algorithm that deals simultaneously with structure and correlation without paying the computational cost of existing oracle-based algorithms.
arXiv Detail & Related papers (2024-11-07T18:15:38Z)
- Learning causal graphs using variable grouping according to ancestral relationship [7.126300090990439]
When the sample size is small relative to the number of variables, the accuracy of estimating causal graphs using existing methods decreases.
Some methods are not feasible when the sample size is smaller than the number of variables.
To circumvent these problems, some researchers proposed causal structure learning algorithms using divide-and-conquer approaches.
arXiv Detail & Related papers (2024-03-21T04:42:04Z)
- Optimal Multi-Distribution Learning [88.3008613028333]
Multi-distribution learning seeks to learn a shared model that minimizes the worst-case risk across $k$ distinct data distributions.
We propose a novel algorithm that yields an $\varepsilon$-optimal randomized hypothesis with a sample complexity on the order of $(d+k)/\varepsilon^2$.
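As a point of reference, the worst-case risk mentioned above can be written as the following minimax objective (a generic formalization rather than the paper's exact statement; $\mathcal{H}$ denotes the hypothesis class, $\mathcal{D}_1,\dots,\mathcal{D}_k$ the $k$ data distributions, and $\ell$ a loss function):
\[
  \min_{h \in \mathcal{H}} \; \max_{1 \le i \le k} \; \mathbb{E}_{z \sim \mathcal{D}_i}\big[\ell(h, z)\big].
\]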
arXiv Detail & Related papers (2023-12-08T16:06:29Z)
- Distributionally Robust Skeleton Learning of Discrete Bayesian Networks [9.46389554092506]
We consider the problem of learning the exact skeleton of general discrete Bayesian networks from potentially corrupted data.
We propose to optimize the most adverse risk over a family of distributions within bounded Wasserstein distance or KL divergence to the empirical distribution.
We present efficient algorithms and show the proposed methods are closely related to the standard regularized regression approach.
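The "most adverse risk" above corresponds to the generic distributionally robust objective below (a sketch in standard DRO notation, not necessarily the paper's exact formulation; $\hat{P}_n$ is the empirical distribution, $D$ a Wasserstein distance or KL divergence, $\rho$ the ambiguity radius, and $\ell$ the loss used for skeleton estimation):
\[
  \min_{\theta} \; \sup_{Q \,:\, D(Q, \hat{P}_n) \le \rho} \; \mathbb{E}_{Z \sim Q}\big[\ell(\theta, Z)\big].
\]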
arXiv Detail & Related papers (2023-11-10T15:33:19Z)
- Local Discovery by Partitioning: Polynomial-Time Causal Discovery Around Exposure-Outcome Pairs [18.31538168213386]
We propose local discovery by partitioning (LDP) for causal inference tasks.
LDP is a constraint-based procedure that returns a valid adjustment set (VAS) for an exposure-outcome pair under latent confounding.
Adjustment sets from LDP yield less biased and more precise average treatment effect estimates than baseline discovery algorithms.
arXiv Detail & Related papers (2023-10-25T14:53:10Z)
- Revisiting Rotation Averaging: Uncertainties and Robust Losses [51.64986160468128]
We argue that the main problem of current methods is the minimized cost function, which is only weakly connected with the input data via the estimated epipolar geometries.
We propose to better model the underlying noise distributions by directly propagating the uncertainty from the point correspondences into the rotation averaging.
arXiv Detail & Related papers (2023-03-09T11:51:20Z)
- Learning versus Refutation in Noninteractive Local Differential Privacy [133.80204506727526]
We study two basic statistical tasks in non-interactive local differential privacy (LDP): learning and refutation.
Our main result is a complete characterization of the sample complexity of PAC learning for non-interactive LDP protocols.
arXiv Detail & Related papers (2022-10-26T03:19:24Z)
- Meta-Learning for Relative Density-Ratio Estimation [59.75321498170363]
Existing methods for (relative) density-ratio estimation (DRE) require many instances from both densities.
We propose a meta-learning method for relative DRE, which estimates the relative density-ratio from a few instances by using knowledge in related datasets.
We empirically demonstrate the effectiveness of the proposed method by using three problems: relative DRE, dataset comparison, and outlier detection.
arXiv Detail & Related papers (2021-07-02T02:13:45Z)
- Examining and Combating Spurious Features under Distribution Shift [94.31956965507085]
We define and analyze robust and spurious representations using the information-theoretic concept of minimal sufficient statistics.
We prove that even when there is only bias of the input distribution, models can still pick up spurious features from their training data.
Inspired by our analysis, we demonstrate that group DRO can fail when groups do not directly account for various spurious correlations.
arXiv Detail & Related papers (2021-06-14T05:39:09Z)
- Sparse PCA via $l_{2,p}$-Norm Regularization for Unsupervised Feature Selection [138.97647716793333]
We propose a simple and efficient unsupervised feature selection method, by combining reconstruction error with $l_{2,p}$-norm regularization.
We present an efficient optimization algorithm to solve the proposed unsupervised model, and analyse the convergence and computational complexity of the algorithm theoretically.
arXiv Detail & Related papers (2020-12-29T04:08:38Z)
- Binary Classification of Gaussian Mixtures: Abundance of Support Vectors, Benign Overfitting and Regularization [39.35822033674126]
We study binary linear classification under a generative Gaussian mixture model.
We derive novel non-asymptotic bounds on the classification error of the latter.
Our results extend to a noisy model with constant probability noise flips.
arXiv Detail & Related papers (2020-11-18T07:59:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.