Graph Structure Inference with BAM: Introducing the Bilinear Attention Mechanism
- URL: http://arxiv.org/abs/2402.07735v2
- Date: Tue, 13 Feb 2024 09:48:47 GMT
- Title: Graph Structure Inference with BAM: Introducing the Bilinear Attention Mechanism
- Authors: Philipp Froehlich and Heinz Koeppl
- Abstract summary: We propose a novel neural network model for supervised graph structure learning.
The model is trained with variably shaped and coupled input data.
Our method demonstrates robust generalizability across both linear and various types of non-linear dependencies.
- Score: 31.99564199048314
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In statistics and machine learning, detecting dependencies in datasets is a
central challenge. We propose a novel neural network model for supervised graph
structure learning, i.e., the process of learning a mapping between
observational data and their underlying dependence structure. The model is
trained with variably shaped and coupled simulated input data and requires only
a single forward pass through the trained network for inference. By leveraging
structural equation models and employing randomly generated multivariate
Chebyshev polynomials for the simulation of training data, our method
demonstrates robust generalizability across both linear and various types of
non-linear dependencies. We introduce a novel bilinear attention mechanism
(BAM) for explicit processing of dependency information, which operates on the
level of covariance matrices of transformed data and respects the geometry of
the manifold of symmetric positive definite matrices. Empirical evaluation
demonstrates the robustness of our method in detecting a wide range of
dependencies, excelling in undirected graph estimation and proving competitive
in completed partially directed acyclic graph estimation through a novel
two-step approach.
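To make the abstract's core idea concrete, here is a minimal, illustrative sketch of scoring variable pairs with a bilinear form on the covariance of feature-transformed data, trained-data-style inputs included. It is not the authors' architecture: the feature map, the bilinear parameterisation, and the toy Chebyshev data generator are all assumptions, and the paper's SPD-manifold-aware handling of the covariance is omitted entirely.

```python
# Illustrative sketch only (not the paper's BAM layer): score each variable
# pair with a bilinear form on a block of the sample covariance of
# feature-transformed data. SPD-manifold-aware operations are omitted.
import numpy as np
from numpy.polynomial import chebyshev as C

rng = np.random.default_rng(0)

def transform(X, W, b):
    """Shared per-variable feature map phi: R -> R^h (assumed: one tanh layer)."""
    return np.tanh(X[..., None] * W + b)          # (n, d) -> (n, d, h)

def bilinear_attention_scores(X, W, b, q, k):
    """Edge scores s_ij = q^T Sigma_ij k, where Sigma_ij is the (h x h)
    cross-covariance block between transformed variables i and j."""
    n, d = X.shape
    H = transform(X, W, b)                        # (n, d, h)
    H = H - H.mean(axis=0, keepdims=True)         # centre over samples
    h = H.shape[-1]
    Sigma = H.reshape(n, d * h).T @ H.reshape(n, d * h) / (n - 1)
    blocks = Sigma.reshape(d, h, d, h).transpose(0, 2, 1, 3)   # (d, d, h, h)
    scores = np.einsum('a,ijab,b->ij', q, blocks, k)           # bilinear form
    scores = 0.5 * (scores + scores.T)            # symmetrise for undirected graphs
    return 1.0 / (1.0 + np.exp(-scores))          # pairwise edge probabilities

# Toy data in the spirit of the abstract: x2 is a random Chebyshev-polynomial
# function of x0 and x1 plus noise (univariate polynomials here for brevity;
# the paper uses randomly generated multivariate ones).
n, d, h = 1000, 3, 4
X = rng.normal(size=(n, d))
X[:, 2] = (C.chebval(np.tanh(X[:, 0]), rng.normal(size=4))
           + C.chebval(np.tanh(X[:, 1]), rng.normal(size=4))
           + 0.1 * rng.normal(size=n))

W, b = rng.normal(size=(1, h)), rng.normal(size=(1, h))
q, k = rng.normal(size=h), rng.normal(size=h)
print(bilinear_attention_scores(X, W, b, q, k).round(2))       # (3, 3) score matrix
```

In a trained model the parameters W, b, q, k would be learned and the scores supervised against the known graph of the simulated data; here they are random, so the output only illustrates shapes and data flow.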
Related papers
- Induced Covariance for Causal Discovery in Linear Sparse Structures [55.2480439325792]
Causal models seek to unravel the cause-effect relationships among variables from observed data.
This paper introduces a novel causal discovery algorithm designed for settings in which variables exhibit linearly sparse relationships.
arXiv Detail & Related papers (2024-10-02T04:01:38Z)
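For readers unfamiliar with the term, the "induced covariance" of a linear sparse structure is a standard identity for linear structural equation models, stated here for context; the entry above gives no further detail, so this is not a claim about that paper's specific estimator.

```latex
% Linear SEM with sparse coefficient matrix B (B_{ij} \ne 0 iff x_j -> x_i)
% and independent noise with covariance \Omega:
x = B x + \varepsilon, \qquad \operatorname{Cov}(\varepsilon) = \Omega .
% The covariance induced on the observed variables is then
\Sigma = \operatorname{Cov}(x) = (I - B)^{-1} \, \Omega \, (I - B)^{-\top},
% so the sparsity pattern of B leaves a structured footprint in \Sigma
% that a causal discovery procedure can try to invert.
```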
- Cyclic Directed Probabilistic Graphical Model: A Proposal Based on Structured Outcomes [0.0]
We describe a probabilistic graphical model - probabilistic relation network - that allows the direct capture of directional cyclic dependencies.
This model does not violate the probability axioms, and it supports learning from observed data.
Notably, it supports probabilistic inference, making it a prospective tool in data analysis and in expert and decision-making applications.
arXiv Detail & Related papers (2023-10-25T10:19:03Z)
- Representation Transfer Learning via Multiple Pre-trained models for Linear Regression [3.5788754401889014]
We consider the problem of learning a linear regression model on a data domain of interest (target) given few samples.
To aid learning, we are provided with a set of pre-trained regression models that are trained on potentially different data domains.
We propose a representation transfer based learning method for constructing the target model.
arXiv Detail & Related papers (2023-05-25T19:35:24Z)
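A minimal sketch of the general recipe described in the entry above, assuming the pre-trained models expose reusable representations; the function names and the ridge-regression head are illustrative placeholders, not the paper's algorithm.

```python
# Illustrative sketch only: reuse pre-trained source representations as a
# feature map for the target domain, then fit a small ridge-regularised
# linear head on the few target samples. Everything here is a placeholder.
import numpy as np

def fit_target_model(source_reps, X_tgt, y_tgt, lam=1e-1):
    """source_reps: list of callables mapping (n, p) inputs to (n, k) features."""
    # Concatenate the source representations as the target feature map.
    Phi = np.hstack([rep(X_tgt) for rep in source_reps])       # (n, K)
    # Ridge regression closed form: w = (Phi^T Phi + lam I)^{-1} Phi^T y
    K = Phi.shape[1]
    w = np.linalg.solve(Phi.T @ Phi + lam * np.eye(K), Phi.T @ y_tgt)
    return lambda X_new: np.hstack([rep(X_new) for rep in source_reps]) @ w

# Toy usage with two "pre-trained" linear representations.
rng = np.random.default_rng(1)
A1, A2 = rng.normal(size=(5, 3)), rng.normal(size=(5, 3))
reps = [lambda X, A=A1: X @ A, lambda X, A=A2: X @ A]
X_tgt, y_tgt = rng.normal(size=(10, 5)), rng.normal(size=10)
model = fit_target_model(reps, X_tgt, y_tgt)
print(model(rng.normal(size=(4, 5))).shape)                    # (4,)
```

With only a handful of target samples, fitting just the low-dimensional head rather than a full model is what makes the transfer setting tractable.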
- Towards a mathematical understanding of learning from few examples with nonlinear feature maps [68.8204255655161]
We consider the problem of data classification where the training set consists of just a few data points.
We reveal key relationships between the geometry of an AI model's feature space, the structure of the underlying data distributions, and the model's generalisation capabilities.
arXiv Detail & Related papers (2022-11-07T14:52:58Z)
- Learning Graphical Factor Models with Riemannian Optimization [70.13748170371889]
This paper proposes a flexible algorithmic framework for graph learning under low-rank structural constraints.
The problem is expressed as penalized maximum likelihood estimation of an elliptical distribution.
We leverage geometries of positive definite matrices and positive semi-definite matrices of fixed rank that are well suited to elliptical models.
arXiv Detail & Related papers (2022-10-21T13:19:45Z)
- Amortised Inference in Structured Generative Models with Explaining Away [16.92791301062903]
We extend the output of amortised variational inference to incorporate structured factors over multiple variables.
We show that appropriately parameterised factors can be combined efficiently with variational message passing in elaborate graphical structures.
We then fit the structured model to high-dimensional neural spiking time-series from the hippocampus of freely moving rodents.
arXiv Detail & Related papers (2022-09-12T12:52:15Z)
- Score-based Generative Modeling of Graphs via the System of Stochastic Differential Equations [57.15855198512551]
We propose a novel score-based generative model for graphs with a continuous-time framework.
We show that our method is able to generate molecules that lie close to the training distribution yet do not violate the chemical valency rule.
arXiv Detail & Related papers (2022-02-05T08:21:04Z)
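For context, the continuous-time framework such graph generators build on is standard score-based generative modelling; the generic forward and reverse SDEs are shown below. The paper's system of SDEs couples node features and adjacency, which this simplified single-process form does not capture.

```latex
% Generic score-based generative modelling: a forward noising SDE
\mathrm{d}\mathbf{G}_t = \mathbf{f}_t(\mathbf{G}_t)\,\mathrm{d}t + g_t\,\mathrm{d}\mathbf{w}_t ,
% and sampling via the reverse-time SDE, with the unknown score
% \nabla_{\mathbf{G}} \log p_t replaced by a learned estimate s_\theta:
\mathrm{d}\mathbf{G}_t = \bigl[\mathbf{f}_t(\mathbf{G}_t) - g_t^{2}\, s_\theta(\mathbf{G}_t, t)\bigr]\mathrm{d}t + g_t\,\mathrm{d}\bar{\mathbf{w}}_t .
```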
- Row-clustering of a Point Process-valued Matrix [2.0391237204597363]
We study a matrix whose entries are marked log-Gaussian Cox processes and cluster rows of such a matrix.
An efficient semi-parametric Expectation-Solution (ES) algorithm combined with functional principal component analysis (FPCA) of point processes is proposed for model estimation.
The effectiveness of the proposed framework is demonstrated through simulation studies and a real data analysis.
arXiv Detail & Related papers (2021-10-04T06:27:26Z)
- Regularization of Mixture Models for Robust Principal Graph Learning [0.0]
A regularized version of Mixture Models is proposed to learn a principal graph from a distribution of $D$-dimensional data points.
Parameters of the model are iteratively estimated through an Expectation-Maximization procedure.
arXiv Detail & Related papers (2021-06-16T18:00:02Z)
- Joint Network Topology Inference via Structured Fusion Regularization [70.30364652829164]
Joint network topology inference represents a canonical problem of learning multiple graph Laplacian matrices from heterogeneous graph signals.
We propose a general graph estimator based on a novel structured fusion regularization.
We show that the proposed graph estimator enjoys both high computational efficiency and rigorous theoretical guarantee.
arXiv Detail & Related papers (2021-03-05T04:42:32Z)
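One illustrative way to write such a joint estimator, a common formulation in the graph-learning literature and an assumption here rather than the exact objective of that paper: per-graph Gaussian log-likelihood terms plus a fusion penalty that ties the K Laplacians together.

```latex
% Illustrative joint-topology objective over K graph Laplacians L_1,...,L_K,
% with per-graph data fit (S_k = sample covariance of the k-th signal set)
% and a structured fusion penalty coupling related graphs:
\min_{L_1,\dots,L_K \in \mathcal{L}} \;\; \sum_{k=1}^{K}
  \Bigl[\operatorname{tr}(S_k L_k) - \log\det(L_k + J)\Bigr]
  \; + \; \lambda \sum_{k < l} \lVert L_k - L_l \rVert_{1}
% \mathcal{L}: set of valid combinatorial Laplacians;
% J = \tfrac{1}{N}\mathbf{1}\mathbf{1}^{\top} handles the Laplacian's zero
% eigenvalue so that the log-determinant term is finite.
```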
- Learned Factor Graphs for Inference from Stationary Time Sequences [107.63351413549992]
We propose a framework that combines model-based algorithms and data-driven ML tools for stationary time sequences.
Neural networks are developed to separately learn specific components of a factor graph describing the distribution of the time sequence.
We present an inference algorithm based on learned stationary factor graphs, which learns to implement the sum-product scheme from labeled data.
arXiv Detail & Related papers (2020-06-05T07:06:19Z)