The Dynamics of Riemannian Robbins-Monro Algorithms
- URL: http://arxiv.org/abs/2206.06795v2
- Date: Thu, 16 Jun 2022 12:07:47 GMT
- Title: The Dynamics of Riemannian Robbins-Monro Algorithms
- Authors: Mohammad Reza Karimi, Ya-Ping Hsieh, Panayotis Mertikopoulos, Andreas
Krause
- Abstract summary: We propose a family of Riemannian algorithms generalizing and extending the seminal approximation framework of Robbins and Monro.
Compared to their Euclidean counterparts, Riemannian algorithms are much less understood due to lack of a global linear structure on the manifold.
We provide a general template of almost sure convergence results that mirrors and extends the existing theory for Euclidean Robbins-Monro schemes.
- Score: 101.29301565229265
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Many important learning algorithms, such as stochastic gradient methods, are
often deployed to solve nonlinear problems on Riemannian manifolds. Motivated
by these applications, we propose a family of Riemannian algorithms
generalizing and extending the seminal stochastic approximation framework of
Robbins and Monro. Compared to their Euclidean counterparts, Riemannian
iterative algorithms are much less understood due to the lack of a global
linear structure on the manifold. We overcome this difficulty by introducing an
extended Fermi coordinate frame which allows us to map the asymptotic behavior
of the proposed Riemannian Robbins-Monro (RRM) class of algorithms to that of
an associated deterministic dynamical system under very mild assumptions on the
underlying manifold. In so doing, we provide a general template of almost sure
convergence results that mirrors and extends the existing theory for Euclidean
Robbins-Monro schemes, albeit with a significantly more involved analysis that
requires a number of new geometric ingredients. We showcase the flexibility of
the proposed RRM framework by using it to establish the convergence of a
retraction-based analogue of the popular optimistic / extra-gradient methods
for solving minimization problems and games, and we provide a unified treatment
for their convergence.
Related papers
- RMLR: Extending Multinomial Logistic Regression into General Geometries [64.16104856124029]
Our framework only requires minimal geometric properties, thus exhibiting broad applicability.
We develop five families of SPD MLRs under five types of power-deformed metrics.
On rotation matrices we propose Lie MLR based on the popular bi-invariant metric.
arXiv Detail & Related papers (2024-09-28T18:38:21Z) - Convergence and Complexity Guarantee for Inexact First-order Riemannian Optimization Algorithms [18.425648833592312]
We show that tBMM converges to an $ilon$-stationary point within $O(epsilon-2)$.
Under a mild iteration, the results still hold when the subproblem is solved in iterations in each provided the total optimality gap is bounded.
arXiv Detail & Related papers (2024-05-05T22:53:14Z) - Convergence and complexity of block majorization-minimization for constrained block-Riemannian optimization [18.425648833592312]
Blockization-minimization (BMM) is a simple iterative gradient for non-exhaustive subspace estimation.
Our analysis makes explicit use of Euclidian constraints.
arXiv Detail & Related papers (2023-12-16T05:40:19Z) - Riemannian stochastic optimization methods avoid strict saddle points [68.80251170757647]
We show that policies under study avoid strict saddle points / submanifolds with probability 1.
This result provides an important sanity check as it shows that, almost always, the limit state of an algorithm can only be a local minimizer.
arXiv Detail & Related papers (2023-11-04T11:12:24Z) - Curvature-Independent Last-Iterate Convergence for Games on Riemannian
Manifolds [77.4346324549323]
We show that a step size agnostic to the curvature of the manifold achieves a curvature-independent and linear last-iterate convergence rate.
To the best of our knowledge, the possibility of curvature-independent rates and/or last-iterate convergence has not been considered before.
arXiv Detail & Related papers (2023-06-29T01:20:44Z) - On Riemannian Approach for Constrained Optimization Model in Extreme
Classification Problems [2.7436792484073638]
A constrained optimization problem is formulated as an optimization problem on matrix manifold.
The proposed approach is tested on several real world large scale multi-label datasets.
arXiv Detail & Related papers (2021-09-30T11:28:35Z) - Nonlinear matrix recovery using optimization on the Grassmann manifold [18.655422834567577]
We investigate the problem of recovering a partially observed high-rank clustering matrix whose columns obey a nonlinear structure such as a union of subspaces.
We show that the alternating limit converges to a unique point using the Kurdyka-Lojasi property.
arXiv Detail & Related papers (2021-09-13T16:13:13Z) - Optimization on manifolds: A symplectic approach [127.54402681305629]
We propose a dissipative extension of Dirac's theory of constrained Hamiltonian systems as a general framework for solving optimization problems.
Our class of (accelerated) algorithms are not only simple and efficient but also applicable to a broad range of contexts.
arXiv Detail & Related papers (2021-07-23T13:43:34Z) - Bayesian Quadrature on Riemannian Data Manifolds [79.71142807798284]
A principled way to model nonlinear geometric structure inherent in data is provided.
However, these operations are typically computationally demanding.
In particular, we focus on Bayesian quadrature (BQ) to numerically compute integrals over normal laws.
We show that by leveraging both prior knowledge and an active exploration scheme, BQ significantly reduces the number of required evaluations.
arXiv Detail & Related papers (2021-02-12T17:38:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.