Unbalanced Optimal Transport meets Sliced-Wasserstein
- URL: http://arxiv.org/abs/2306.07176v1
- Date: Mon, 12 Jun 2023 15:15:00 GMT
- Title: Unbalanced Optimal Transport meets Sliced-Wasserstein
- Authors: Thibault S\'ejourn\'e, Cl\'ement Bonet, Kilian Fatras, Kimia Nadjahi,
Nicolas Courty
- Abstract summary: We propose two new loss functions based on the idea of slicing unbalanced OT, and study their induced topology and statistical properties.
We show that the resulting methodology is modular as it encompasses and extends prior related work.
- Score: 11.44982599214965
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Optimal transport (OT) has emerged as a powerful framework to compare
probability measures, a fundamental task in many statistical and machine
learning problems. Substantial advances have been made over the last decade in
designing OT variants which are either computationally and statistically more
efficient, or more robust to the measures and datasets to compare. Among them,
sliced OT distances have been extensively used to mitigate optimal transport's
cubic algorithmic complexity and curse of dimensionality. In parallel,
unbalanced OT was designed to allow comparisons of more general positive
measures, while being more robust to outliers. In this paper, we propose to
combine these two concepts, namely slicing and unbalanced OT, to develop a
general framework for efficiently comparing positive measures. We propose two
new loss functions based on the idea of slicing unbalanced OT, and study their
induced topology and statistical properties. We then develop a fast
Frank-Wolfe-type algorithm to compute these loss functions, and show that the
resulting methodology is modular as it encompasses and extends prior related
work. We finally conduct an empirical analysis of our loss functions and
methodology on both synthetic and real datasets, to illustrate their relevance
and applicability.
Related papers
- Progressive Entropic Optimal Transport Solvers [33.821924561619895]
We propose a new class of EOT solvers (ProgOT) that can estimate both plans and transport maps.
We provide experimental evidence demonstrating that ProgOT is a faster and more robust alternative to standard solvers.
We also prove statistical consistency of our approach for estimating optimal transport maps.
arXiv Detail & Related papers (2024-06-07T16:33:08Z) - Fast Computation of Optimal Transport via Entropy-Regularized Extragradient Methods [75.34939761152587]
Efficient computation of the optimal transport distance between two distributions serves as an algorithm that empowers various applications.
This paper develops a scalable first-order optimization-based method that computes optimal transport to within $varepsilon$ additive accuracy.
arXiv Detail & Related papers (2023-01-30T15:46:39Z) - Unbalanced Optimal Transport, from Theory to Numerics [0.0]
We argue that unbalanced OT, entropic regularization and Gromov-Wasserstein (GW) can work hand-in-hand to turn OT into efficient geometric loss functions for data sciences.
The main motivation for this review is to explain how unbalanced OT, entropic regularization and GW can work hand-in-hand to turn OT into efficient geometric loss functions for data sciences.
arXiv Detail & Related papers (2022-11-16T09:02:52Z) - Simple Stochastic and Online Gradient DescentAlgorithms for Pairwise
Learning [65.54757265434465]
Pairwise learning refers to learning tasks where the loss function depends on a pair instances.
Online descent (OGD) is a popular approach to handle streaming data in pairwise learning.
In this paper, we propose simple and online descent to methods for pairwise learning.
arXiv Detail & Related papers (2021-11-23T18:10:48Z) - Communication-efficient Byzantine-robust distributed learning with
statistical guarantee [2.8407238889624877]
Communication efficiency and robustness are two major issues in modern distributed learning framework.
This paper develops two communication-efficient and robust distributed learning algorithms for convex problems.
arXiv Detail & Related papers (2021-02-28T01:38:37Z) - Improving Approximate Optimal Transport Distances using Quantization [23.319746583489763]
Optimal transport is a popular tool in machine learning to compare probability measures geometrically.
Linear programming algorithms for computing OT scale cubically in the size of the input, making OT impractical in the large-sample regime.
We introduce a practical algorithm, which relies on a quantization step, to estimate OT distances between measures given cheap sample access.
arXiv Detail & Related papers (2021-02-25T08:45:06Z) - Efficient Robust Optimal Transport with Application to Multi-Label
Classification [12.521494095948068]
We model the feature-feature relationship via a symmetric positive semi-definite Mahalanobis metric in the OT cost function.
We view the resulting optimization problem as a non-linear OT problem, which we solve using the Frank-Wolfe algorithm.
Empirical results on the discriminative learning setting, such as tag prediction and multi-class classification, illustrate the good performance of our approach.
arXiv Detail & Related papers (2020-10-22T16:43:52Z) - Bilevel Optimization: Convergence Analysis and Enhanced Design [63.64636047748605]
Bilevel optimization is a tool for many machine learning problems.
We propose a novel stoc-efficientgradient estimator named stoc-BiO.
arXiv Detail & Related papers (2020-10-15T18:09:48Z) - Robust Optimal Transport with Applications in Generative Modeling and
Domain Adaptation [120.69747175899421]
Optimal Transport (OT) distances such as Wasserstein have been used in several areas such as GANs and domain adaptation.
We propose a computationally-efficient dual form of the robust OT optimization that is amenable to modern deep learning applications.
Our approach can train state-of-the-art GAN models on noisy datasets corrupted with outlier distributions.
arXiv Detail & Related papers (2020-10-12T17:13:40Z) - Efficient Learning of Generative Models via Finite-Difference Score
Matching [111.55998083406134]
We present a generic strategy to efficiently approximate any-order directional derivative with finite difference.
Our approximation only involves function evaluations, which can be executed in parallel, and no gradient computations.
arXiv Detail & Related papers (2020-07-07T10:05:01Z) - Projection Robust Wasserstein Distance and Riemannian Optimization [107.93250306339694]
We show that projection robustly solidstein (PRW) is a robust variant of Wasserstein projection (WPP)
This paper provides a first step into the computation of the PRW distance and provides the links between their theory and experiments on and real data.
arXiv Detail & Related papers (2020-06-12T20:40:22Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.