Related papers: Identifiability of a statistical model with two latent vectors: Importance of the dimensionality relation and application to graph embedding

Identifiability of a statistical model with two latent vectors: Importance of the dimensionality relation and application to graph embedding

URL: http://arxiv.org/abs/2405.19760v1
Date: Thu, 30 May 2024 07:11:20 GMT
Title: Identifiability of a statistical model with two latent vectors: Importance of the dimensionality relation and application to graph embedding
Authors: Hiroaki Sasaki,
Abstract summary: Identifiability of statistical models is a key notion in unsupervised representation learning. This paper proposes a statistical model of two latent vectors with single auxiliary data generalizing nonlinear ICA. Surprisingly, we prove that the indeterminacies of the proposed model has the same as emphlinear ICA under certain conditions.
Score: 2.6651200086513107
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Identifiability of statistical models is a key notion in unsupervised representation learning. Recent work of nonlinear independent component analysis (ICA) employs auxiliary data and has established identifiable conditions. This paper proposes a statistical model of two latent vectors with single auxiliary data generalizing nonlinear ICA, and establishes various identifiability conditions. Unlike previous work, the two latent vectors in the proposed model can have arbitrary dimensions, and this property enables us to reveal an insightful dimensionality relation among two latent vectors and auxiliary data in identifiability conditions. Furthermore, surprisingly, we prove that the indeterminacies of the proposed model has the same as \emph{linear} ICA under certain conditions: The elements in the latent vector can be recovered up to their permutation and scales. Next, we apply the identifiability theory to a statistical model for graph data. As a result, one of the identifiability conditions includes an appealing implication: Identifiability of the statistical model could depend on the maximum value of link weights in graph data. Then, we propose a practical method for identifiable graph embedding. Finally, we numerically demonstrate that the proposed method well-recovers the latent vectors and model identifiability clearly depends on the maximum value of link weights, which supports the implication of our theoretical results

Related papers

Trek-Based Parameter Identification for Linear Causal Models With Arbitrarily Structured Latent Variables [1.4425878137951234]
We develop a criterion to certify whether causal effects are identifiable in linear structural equation models with latent variables.<n>Our novel latent-subgraph criterion is a purely graphical condition that is sufficient for identifiability of causal effects.
arXiv Detail & Related papers (2025-07-24T08:10:44Z)
Nonparametric learning of heterogeneous graphical model on network-linked data [19.215806260939473]
This paper proposes a nonparametric graphical model that accommodates heterogeneous graph structures without imposing any distributional assumptions.<n>It transforms the graph learning task into solving a finite-dimensional linear equation system by leveraging the properties of vector-valued kernel Hilbert space.<n>Its effectiveness is also demonstrated through a variety of simulated examples and a real application to the statistician coauthorship dataset.
arXiv Detail & Related papers (2025-07-02T08:37:15Z)
Unfolding Tensors to Identify the Graph in Discrete Latent Bipartite Graphical Models [1.7132914341329848]
We use a tensor unfolding technique to prove a new identifiability result for discrete bipartite graphical models. Our result has useful implications for these models' trustworthy applications in scientific disciplines and interpretable machine learning.
arXiv Detail & Related papers (2025-01-18T23:08:25Z)
Graph-Dictionary Signal Model for Sparse Representations of Multivariate Data [49.77103348208835]
We define a novel Graph-Dictionary signal model, where a finite set of graphs characterizes relationships in data distribution through a weighted sum of their Laplacians. We propose a framework to infer the graph dictionary representation from observed data, along with a bilinear generalization of the primal-dual splitting algorithm to solve the learning problem. We exploit graph-dictionary representations in a motor imagery decoding task on brain activity data, where we classify imagined motion better than standard methods.
arXiv Detail & Related papers (2024-11-08T17:40:43Z)
Estimating Causal Effects from Learned Causal Networks [56.14597641617531]
We propose an alternative paradigm for answering causal-effect queries over discrete observable variables. We learn the causal Bayesian network and its confounding latent variables directly from the observational data. We show that this emphmodel completion learning approach can be more effective than estimand approaches.
arXiv Detail & Related papers (2024-08-26T08:39:09Z)
Cyclic Directed Probabilistic Graphical Model: A Proposal Based on Structured Outcomes [0.0]
We describe a probabilistic graphical model - probabilistic relation network - that allows the direct capture of directional cyclic dependencies. This model does not violate the probability axioms, and it supports learning from observed data. Notably, it supports probabilistic inference, making it a prospective tool in data analysis and in expert and design-making applications.
arXiv Detail & Related papers (2023-10-25T10:19:03Z)
Goodness-of-Fit of Attributed Probabilistic Graph Generative Models [11.58149447373971]
We define goodness of fit in terms of the mean square contingency coefficient for random binary networks. We apply these criteria to verify the representation capability of a probabilistic generative model for various popular types of graph models.
arXiv Detail & Related papers (2023-07-28T18:48:09Z)
Sufficient Identification Conditions and Semiparametric Estimation under Missing Not at Random Mechanisms [4.211128681972148]
Conducting valid statistical analyses is challenging in the presence of missing-not-at-random (MNAR) data. We consider a MNAR model that generalizes several prior popular MNAR models in two ways. We propose methods for testing the independence restrictions encoded in such models using odds ratio as our parameter of interest.
arXiv Detail & Related papers (2023-06-10T13:46:16Z)
Identifying Weight-Variant Latent Causal Models [82.14087963690561]
We find that transitivity acts as a key role in impeding the identifiability of latent causal representations. Under some mild assumptions, we can show that the latent causal representations can be identified up to trivial permutation and scaling. We propose a novel method, termed Structural caUsAl Variational autoEncoder, which directly learns latent causal representations and causal relationships among them.
arXiv Detail & Related papers (2022-08-30T11:12:59Z)
Staged trees and asymmetry-labeled DAGs [2.66269503676104]
We introduce a minimal Bayesian network representation of the staged tree, which can be used to read conditional independences in an intuitive way. We also define a new labeled graph, termed asymmetry-labeled directed acyclic graph, whose edges are labeled to denote the type of dependence existing between any two random variables.
arXiv Detail & Related papers (2021-08-04T12:20:47Z)
Typing assumptions improve identification in causal discovery [123.06886784834471]
Causal discovery from observational data is a challenging task to which an exact solution cannot always be identified. We propose a new set of assumptions that constrain possible causal relationships based on the nature of the variables.
arXiv Detail & Related papers (2021-07-22T14:23:08Z)
PSD Representations for Effective Probability Models [117.35298398434628]
We show that a recently proposed class of positive semi-definite (PSD) models for non-negative functions is particularly suited to this end. We characterize both approximation and generalization capabilities of PSD models, showing that they enjoy strong theoretical guarantees. Our results open the way to applications of PSD models to density estimation, decision theory and inference.
arXiv Detail & Related papers (2021-06-30T15:13:39Z)
On Linear Identifiability of Learned Representations [26.311880922890843]
We study identifiability in the context of representation learning. We show that a large family of discriminative models are identifiable in function space, up to a linear indeterminacy. We derive sufficient conditions for linear identifiability and provide empirical support for the result on both simulated and real-world data.
arXiv Detail & Related papers (2020-07-01T23:33:37Z)
Asymptotic Analysis of an Ensemble of Randomly Projected Linear Discriminants [94.46276668068327]
In [1], an ensemble of randomly projected linear discriminants is used to classify datasets. We develop a consistent estimator of the misclassification probability as an alternative to the computationally-costly cross-validation estimator. We also demonstrate the use of our estimator for tuning the projection dimension on both real and synthetic data.
arXiv Detail & Related papers (2020-04-17T12:47:04Z)

This list is automatically generated from the titles and abstracts of the papers in this site.