Causal Inference Despite Limited Global Confounding via Mixture Models
- URL: http://arxiv.org/abs/2112.11602v5
- Date: Wed, 31 May 2023 08:12:37 GMT
- Title: Causal Inference Despite Limited Global Confounding via Mixture Models
- Authors: Spencer L. Gordon, Bijan Mazaheri, Yuval Rabani, Leonard J. Schulman
- Abstract summary: A finite $k$-mixture of such models is graphically represented by a larger graph.
We give the first algorithm to learn mixtures of non-empty DAGs.
- Score: 4.721845865189578
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: A Bayesian Network is a directed acyclic graph (DAG) on a set of $n$ random
variables (the vertices); a Bayesian Network Distribution (BND) is a
probability distribution on the random variables that is Markovian on the
graph. A finite $k$-mixture of such models is graphically represented by a
larger graph which has an additional ``hidden'' (or ``latent'') random variable
$U$, ranging in $\{1,\ldots,k\}$, and a directed edge from $U$ to every other
vertex. Models of this type are fundamental to causal inference, where $U$
models an unobserved confounding effect of multiple populations, obscuring the
causal relationships in the observable DAG. By solving the mixture problem and
recovering the joint probability distribution with $U$, traditionally
unidentifiable causal relationships become identifiable. Using a reduction to
the more well-studied ``product'' case on empty graphs, we give the first
algorithm to learn mixtures of non-empty DAGs.
Related papers
- Generation is better than Modification: Combating High Class Homophily Variance in Graph Anomaly Detection [51.11833609431406]
Homophily distribution differences between different classes are significantly greater than those in homophilic and heterophilic graphs.
We introduce a new metric called Class Homophily Variance, which quantitatively describes this phenomenon.
To mitigate its impact, we propose a novel GNN model named Homophily Edge Generation Graph Neural Network (HedGe)
arXiv Detail & Related papers (2024-03-15T14:26:53Z) - Information-Theoretic Thresholds for Planted Dense Cycles [52.076657911275525]
We study a random graph model for small-world networks which are ubiquitous in social and biological sciences.
For both detection and recovery of the planted dense cycle, we characterize the information-theoretic thresholds in terms of $n$, $tau$, and an edge-wise signal-to-noise ratio $lambda$.
arXiv Detail & Related papers (2024-02-01T03:39:01Z) - Understanding Heterophily for Graph Neural Networks [42.640057865981156]
We present theoretical understandings of the impacts of different heterophily patterns for Graph Neural Networks (GNNs)
We show that the separability gains are determined by the normalized distance of the $l$-powered neighborhood distributions.
Experiments on both synthetic and real-world data verify the effectiveness of our theory.
arXiv Detail & Related papers (2024-01-17T11:01:28Z) - Inferences on Mixing Probabilities and Ranking in Mixed-Membership
Models [5.992878098797828]
Network data is prevalent in numerous big data applications including economics and health networks.
In this paper, we model the network using the Degree-Corrected Mixed Membership (DCMM) model.
We derive novel finite-sample expansion for the $boldsymbolpi_i(k)$s which allows us to obtain distributions and confidence interval of the membership mixing probabilities and other related population quantities.
arXiv Detail & Related papers (2023-08-29T02:35:45Z) - Causal Bandits for Linear Structural Equation Models [58.2875460517691]
This paper studies the problem of designing an optimal sequence of interventions in a causal graphical model.
It is assumed that the graph's structure is known and has $N$ nodes.
Two algorithms are proposed for the frequentist (UCB-based) and Bayesian settings.
arXiv Detail & Related papers (2022-08-26T16:21:31Z) - Diffusion models as plug-and-play priors [98.16404662526101]
We consider the problem of inferring high-dimensional data $mathbfx$ in a model that consists of a prior $p(mathbfx)$ and an auxiliary constraint $c(mathbfx,mathbfy)$.
The structure of diffusion models allows us to perform approximate inference by iterating differentiation through the fixed denoising network enriched with different amounts of noise.
arXiv Detail & Related papers (2022-06-17T21:11:36Z) - Combinatorial Causal Bandits [25.012065471684025]
In causal bandits, the learning agent chooses at most $K$ variables in each round to intervene, with the goal of minimizing expected regret on the target variable $Y$.
We study under the context of binary generalized linear models (BGLMs) with a succinct parametric representation of the causal models.
We present the algorithm BGLM-OFU for Markovian BGLMs based on the maximum likelihood estimation method, and show that it achieves $O(sqrtTlog T)$ regret, where $T$ is the time horizon.
arXiv Detail & Related papers (2022-06-04T14:14:58Z) - On the Generative Utility of Cyclic Conditionals [103.1624347008042]
We study whether and how can we model a joint distribution $p(x,z)$ using two conditional models $p(x|z)$ that form a cycle.
We propose the CyGen framework for cyclic-conditional generative modeling, including methods to enforce compatibility and use the determined distribution to fit and generate data.
arXiv Detail & Related papers (2021-06-30T10:23:45Z) - Generalized Independent Noise Condition for Estimating Latent Variable
Causal Graphs [39.24319581164022]
We propose a Generalized Independent Noise (GIN) condition to estimate latent variable graphs.
We show that GIN helps locate latent variables and identify their causal structure, including causal directions.
arXiv Detail & Related papers (2020-10-10T06:11:06Z) - Fairness constraints can help exact inference in structured prediction [37.76221231305701]
We study a generative model with an undirected connected graph $G$ and true vector of binary labels.
We find that, in contrast to the known trade-offs between fairness and model performance, the addition of the fairness constraint improves the probability of exact recovery.
arXiv Detail & Related papers (2020-07-01T04:11:29Z) - Block-Approximated Exponential Random Graphs [77.4792558024487]
An important challenge in the field of exponential random graphs (ERGs) is the fitting of non-trivial ERGs on large graphs.
We propose an approximative framework to such non-trivial ERGs that result in dyadic independence (i.e., edge independent) distributions.
Our methods are scalable to sparse graphs consisting of millions of nodes.
arXiv Detail & Related papers (2020-02-14T11:42:16Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.