Riemannian Langevin Monte Carlo schemes for sampling PSD matrices with
fixed rank
- URL: http://arxiv.org/abs/2309.04072v1
- Date: Fri, 8 Sep 2023 02:09:40 GMT
- Title: Riemannian Langevin Monte Carlo schemes for sampling PSD matrices with
fixed rank
- Authors: Tianmin Yu and Shixin Zheng and Jianfeng Lu and Govind Menon and
Xiangxiong Zhang
- Abstract summary: We present two explicit schemes to sample matrices from Gibbs distributions on $mathcal Sn,p_+$.
We also provide examples of energy functions with explicit Gibbs distributions that allow numerical validation of these schemes.
- Score: 5.0397419406319095
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper introduces two explicit schemes to sample matrices from Gibbs
distributions on $\mathcal S^{n,p}_+$, the manifold of real positive
semi-definite (PSD) matrices of size $n\times n$ and rank $p$. Given an energy
function $\mathcal E:\mathcal S^{n,p}_+\to \mathbb{R}$ and certain Riemannian
metrics $g$ on $\mathcal S^{n,p}_+$, these schemes rely on an Euler-Maruyama
discretization of the Riemannian Langevin equation (RLE) with Brownian motion
on the manifold. We present numerical schemes for RLE under two fundamental
metrics on $\mathcal S^{n,p}_+$: (a) the metric obtained from the embedding of
$\mathcal S^{n,p}_+ \subset \mathbb{R}^{n\times n} $; and (b) the
Bures-Wasserstein metric corresponding to quotient geometry. We also provide
examples of energy functions with explicit Gibbs distributions that allow
numerical validation of these schemes.
Related papers
- Scaling of symmetry-restricted quantum circuits [42.803917477133346]
In this work, we investigate the properties of $mathcalMSU(2N)$, $mathcalM$-invariant subspaces of the special unitary Lie group $SU(2N)$.
arXiv Detail & Related papers (2024-06-14T12:12:15Z) - Efficient Sampling on Riemannian Manifolds via Langevin MCMC [51.825900634131486]
We study the task efficiently sampling from a Gibbs distribution $d pi* = eh d vol_g$ over aian manifold $M$ via (geometric) Langevin MCMC.
Our results apply to general settings where $pi*$ can be non exponential and $Mh$ can have negative Ricci curvature.
arXiv Detail & Related papers (2024-02-15T22:59:14Z) - Provably learning a multi-head attention layer [55.2904547651831]
Multi-head attention layer is one of the key components of the transformer architecture that sets it apart from traditional feed-forward models.
In this work, we initiate the study of provably learning a multi-head attention layer from random examples.
We prove computational lower bounds showing that in the worst case, exponential dependence on $m$ is unavoidable.
arXiv Detail & Related papers (2024-02-06T15:39:09Z) - A Unified Framework for Uniform Signal Recovery in Nonlinear Generative
Compressed Sensing [68.80803866919123]
Under nonlinear measurements, most prior results are non-uniform, i.e., they hold with high probability for a fixed $mathbfx*$ rather than for all $mathbfx*$ simultaneously.
Our framework accommodates GCS with 1-bit/uniformly quantized observations and single index models as canonical examples.
We also develop a concentration inequality that produces tighter bounds for product processes whose index sets have low metric entropy.
arXiv Detail & Related papers (2023-09-25T17:54:19Z) - Learning a Single Neuron with Adversarial Label Noise via Gradient
Descent [50.659479930171585]
We study a function of the form $mathbfxmapstosigma(mathbfwcdotmathbfx)$ for monotone activations.
The goal of the learner is to output a hypothesis vector $mathbfw$ that $F(mathbbw)=C, epsilon$ with high probability.
arXiv Detail & Related papers (2022-06-17T17:55:43Z) - Markovian Repeated Interaction Quantum Systems [0.0]
We study a class of dynamical semigroups $(mathbbLn)_ninmathbbN$ that emerge, by a Feynman--Kac type formalism, from a random quantum dynamical system.
As a physical application, we consider the case where the $mathcalL_omega$'s are the reduced dynamical maps describing the repeated interactions of a system with thermal probes.
arXiv Detail & Related papers (2022-02-10T20:52:40Z) - Metric Hypertransformers are Universal Adapted Maps [4.83420384410068]
metric hypertransformers (MHTs) are capable of approxing any adapted map $F:mathscrXmathbbZrightarrow mathscrYmathbbZ$ with approximable complexity.
Our results provide the first (quantimat) universal approximation theorem compatible with any such $mathscrX$ and $mathscrY$.
arXiv Detail & Related papers (2022-01-31T10:03:46Z) - Spectral properties of sample covariance matrices arising from random
matrices with independent non identically distributed columns [50.053491972003656]
It was previously shown that the functionals $texttr(AR(z))$, for $R(z) = (frac1nXXT- zI_p)-1$ and $Ain mathcal M_p$ deterministic, have a standard deviation of order $O(|A|_* / sqrt n)$.
Here, we show that $|mathbb E[R(z)] - tilde R(z)|_F
arXiv Detail & Related papers (2021-09-06T14:21:43Z) - Global Convergence of Gradient Descent for Asymmetric Low-Rank Matrix
Factorization [49.090785356633695]
We study the asymmetric low-rank factorization problem: [mathbfU in mathbbRm min d, mathbfU$ and mathV$.
arXiv Detail & Related papers (2021-06-27T17:25:24Z) - Riemannian Geometry with differentiable ambient space and metric
operator [0.0]
We provide an embedded and submersed ambient structure for the double tangent bundle $mathcalTmathcalTmathcalM$ and the tangent of the horizontal bundle $mathcalHmathcalM$.
We obtain a formula for horizontal lifts of Jacobi fields, and a new closed-form formula for Jacobi fields of naturally reductive homogeneous spaces.
arXiv Detail & Related papers (2021-05-04T15:47:45Z) - Algebraic and geometric structures inside the Birkhoff polytope [0.0]
Birkhoff polytope $mathcalB_d$ consists of all bistochastic matrices of order $d$.
We prove that $mathcalL_d$ and $mathcalF_d$ are star-shaped with respect to the flat matrix.
arXiv Detail & Related papers (2021-01-27T09:51:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.