Low-Rank Sinkhorn Factorization
- URL: http://arxiv.org/abs/2103.04737v1
- Date: Mon, 8 Mar 2021 13:18:45 GMT
- Title: Low-Rank Sinkhorn Factorization
- Authors: Meyer Scetbon, Marco Cuturi, Gabriel Peyr\'e
- Abstract summary: We introduce an explicit factorization of low rank couplings as a product of textitsub-coupling factors linked by a common marginal.
We prove the non-asymptotic stationary convergence of this algorithm and illustrate its efficiency on benchmark experiments.
- Score: 45.87981728307819
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Several recent applications of optimal transport (OT) theory to machine
learning have relied on regularization, notably entropy and the Sinkhorn
algorithm. Because matrix-vector products are pervasive in the Sinkhorn
algorithm, several works have proposed to \textit{approximate} kernel matrices
appearing in its iterations using low-rank factors. Another route lies instead
in imposing low-rank constraints on the feasible set of couplings considered in
OT problems, with no approximations on cost nor kernel matrices. This route was
first explored by Forrow et al., 2018, who proposed an algorithm tailored for
the squared Euclidean ground cost, using a proxy objective that can be solved
through the machinery of regularized 2-Wasserstein barycenters. Building on
this, we introduce in this work a generic approach that aims at solving, in
full generality, the OT problem under low-rank constraints with arbitrary
costs. Our algorithm relies on an explicit factorization of low rank couplings
as a product of \textit{sub-coupling} factors linked by a common marginal;
similar to an NMF approach, we alternatively updates these factors. We prove
the non-asymptotic stationary convergence of this algorithm and illustrate its
efficiency on benchmark experiments.
Related papers
- A Sample Efficient Alternating Minimization-based Algorithm For Robust Phase Retrieval [56.67706781191521]
In this work, we present a robust phase retrieval problem where the task is to recover an unknown signal.
Our proposed oracle avoids the need for computationally spectral descent, using a simple gradient step and outliers.
arXiv Detail & Related papers (2024-09-07T06:37:23Z) - Stochastic Optimization for Non-convex Problem with Inexact Hessian
Matrix, Gradient, and Function [99.31457740916815]
Trust-region (TR) and adaptive regularization using cubics have proven to have some very appealing theoretical properties.
We show that TR and ARC methods can simultaneously provide inexact computations of the Hessian, gradient, and function values.
arXiv Detail & Related papers (2023-10-18T10:29:58Z) - Infeasible Deterministic, Stochastic, and Variance-Reduction Algorithms for Optimization under Orthogonality Constraints [9.301728976515255]
This article provides new practical and theoretical developments for the landing algorithm.
First, the method is extended to the Stiefel manifold.
We also consider variance reduction algorithms when the cost function is an average of many functions.
arXiv Detail & Related papers (2023-03-29T07:36:54Z) - Fast Computation of Optimal Transport via Entropy-Regularized Extragradient Methods [75.34939761152587]
Efficient computation of the optimal transport distance between two distributions serves as an algorithm that empowers various applications.
This paper develops a scalable first-order optimization-based method that computes optimal transport to within $varepsilon$ additive accuracy.
arXiv Detail & Related papers (2023-01-30T15:46:39Z) - Faster Projection-Free Augmented Lagrangian Methods via Weak Proximal
Oracle [16.290192687098383]
This paper considers a convex composite optimization problem with affine constraints.
Motivated by high-dimensional applications in which exact projection/proximal computations are not tractable, we propose a textitprojection-free augmented Lagrangian-based method.
arXiv Detail & Related papers (2022-10-25T12:51:43Z) - Rethinking Initialization of the Sinkhorn Algorithm [36.72281550968301]
We show that data-dependent initializers result in dramatic speed-ups, with no effect on differentiability as long as implicit differentiation is used.
Ours rely on closed-forms for exact or approximate OT solutions known in the 1D, Gaussian or GMM settings.
arXiv Detail & Related papers (2022-06-15T16:23:03Z) - Lattice-Based Methods Surpass Sum-of-Squares in Clustering [98.46302040220395]
Clustering is a fundamental primitive in unsupervised learning.
Recent work has established lower bounds against the class of low-degree methods.
We show that, perhaps surprisingly, this particular clustering model textitdoes not exhibit a statistical-to-computational gap.
arXiv Detail & Related papers (2021-12-07T18:50:17Z) - Nonlinear matrix recovery using optimization on the Grassmann manifold [18.655422834567577]
We investigate the problem of recovering a partially observed high-rank clustering matrix whose columns obey a nonlinear structure such as a union of subspaces.
We show that the alternating limit converges to a unique point using the Kurdyka-Lojasi property.
arXiv Detail & Related papers (2021-09-13T16:13:13Z) - Hybrid Trilinear and Bilinear Programming for Aligning Partially
Overlapping Point Sets [85.71360365315128]
In many applications, we need algorithms which can align partially overlapping point sets are invariant to the corresponding corresponding RPM algorithm.
We first show that the objective is a cubic bound function. We then utilize the convex envelopes of trilinear and bilinear monomial transformations to derive its lower bound.
We next develop a branch-and-bound (BnB) algorithm which only branches over the transformation variables and runs efficiently.
arXiv Detail & Related papers (2021-01-19T04:24:23Z) - Low-Rank Factorization for Rank Minimization with Nonconvex Regularizers [0.0]
Minimizing the convex relaxation to the nuclear norm is of interest in recommender systems and robust principal component analysis.
We develop efficient algorithms for the rank factorization of nuclear programs put forth by Buriro and nucleariro.
arXiv Detail & Related papers (2020-06-13T19:04:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.