Inference via robust optimal transportation: theory and methods
- URL: http://arxiv.org/abs/2301.06297v4
- Date: Thu, 29 Feb 2024 07:20:56 GMT
- Title: Inference via robust optimal transportation: theory and methods
- Authors: Yiming Ma, Hang Liu, Davide La Vecchia, Metthieu Lerasle
- Abstract summary: Optimal transportation theory and the related $p$-Wasserstein distance are widely-applied in statistics and machine learning.
In spite of their popularity, inference based on these tools has some issues.
- Score: 7.690743442192021
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Optimal transportation theory and the related $p$-Wasserstein distance
($W_p$, $p\geq 1$) are widely-applied in statistics and machine learning. In
spite of their popularity, inference based on these tools has some issues. For
instance, it is sensitive to outliers and it may not be even defined when the
underlying model has infinite moments. To cope with these problems, first we
consider a robust version of the primal transportation problem and show that it
defines the {robust Wasserstein distance}, $W^{(\lambda)}$, depending on a
tuning parameter $\lambda > 0$. Second, we illustrate the link between $W_1$
and $W^{(\lambda)}$ and study its key measure theoretic aspects. Third, we
derive some concentration inequalities for $W^{(\lambda)}$. Fourth, we use
$W^{(\lambda)}$ to define minimum distance estimators, we provide their
statistical guarantees and we illustrate how to apply the derived concentration
inequalities for a data driven selection of $\lambda$. Fifth, we provide the
{dual} form of the robust optimal transportation problem and we apply it to
machine learning problems (generative adversarial networks and domain
adaptation). Numerical exercises provide evidence of the benefits yielded by
our novel methods.
Related papers
- Inverse Entropic Optimal Transport Solves Semi-supervised Learning via Data Likelihood Maximization [65.8915778873691]
conditional distributions is a central problem in machine learning.
We propose a new learning paradigm that integrates both paired and unpaired data.
Our approach also connects intriguingly with inverse entropic optimal transport (OT)
arXiv Detail & Related papers (2024-10-03T16:12:59Z) - Relative-Translation Invariant Wasserstein Distance [82.6068808353647]
We introduce a new family of distances, relative-translation invariant Wasserstein distances ($RW_p$)
We show that $RW_p distances are also real distance metrics defined on the quotient set $mathcalP_p(mathbbRn)/sim$ invariant to distribution translations.
arXiv Detail & Related papers (2024-09-04T03:41:44Z) - All You Need is Resistance: On the Equivalence of Effective Resistance and Certain Optimal Transport Problems on Graphs [48.84819106277247]
We claim that effective resistance and optimal transport on graphs should be understood as one and the same, up to a choice of $p$.
We show explicit connections to optimal stopping times and random walks on graphs, graph Sobolev spaces, and a Benamou-Brenier type formula for $2$-Beckmann distance.
We propose further study of the usage of these metrics where Wasserstein distance may produce computational bottlenecks.
arXiv Detail & Related papers (2024-04-23T17:50:52Z) - Approximate Algorithms For $k$-Sparse Wasserstein Barycenter With Outliers [10.259254824702555]
We study the $k$-sparse Wasserstein Barycenter problem in the presence of outliers.
Existing WB algorithms cannot be directly extended to handle the case with outliers.
We propose a clustering based LP method that yields constant approximation factor for the $k$-sparse WB with outliers problem.
arXiv Detail & Related papers (2024-04-20T15:01:35Z) - Online non-parametric likelihood-ratio estimation by Pearson-divergence
functional minimization [55.98760097296213]
We introduce a new framework for online non-parametric LRE (OLRE) for the setting where pairs of iid observations $(x_t sim p, x'_t sim q)$ are observed over time.
We provide theoretical guarantees for the performance of the OLRE method along with empirical validation in synthetic experiments.
arXiv Detail & Related papers (2023-11-03T13:20:11Z) - Approximate Function Evaluation via Multi-Armed Bandits [51.146684847667125]
We study the problem of estimating the value of a known smooth function $f$ at an unknown point $boldsymbolmu in mathbbRn$, where each component $mu_i$ can be sampled via a noisy oracle.
We design an instance-adaptive algorithm that learns to sample according to the importance of each coordinate, and with probability at least $1-delta$ returns an $epsilon$ accurate estimate of $f(boldsymbolmu)$.
arXiv Detail & Related papers (2022-03-18T18:50:52Z) - TURF: A Two-factor, Universal, Robust, Fast Distribution Learning
Algorithm [64.13217062232874]
One of its most powerful and successful modalities approximates every distribution to an $ell$ distance essentially at most a constant times larger than its closest $t$-piece degree-$d_$.
We provide a method that estimates this number near-optimally, hence helps approach the best possible approximation.
arXiv Detail & Related papers (2022-02-15T03:49:28Z) - On Unbalanced Optimal Transport: Gradient Methods, Sparsity and
Approximation Error [18.19398247972205]
We study the Unbalanced Optimal Transport (UOT) between two measures of possibly different masses with at most $n$ components.
We propose a novel algorithm based on Gradient Extrapolation Method (GEM-UOT) to find an $varepsilon$-approximate solution to the UOT problem.
arXiv Detail & Related papers (2022-02-08T03:22:39Z) - Localization in 1D non-parametric latent space models from pairwise
affinities [6.982738885923206]
We consider the problem of estimating latent positions in a one-dimensional torus from pairwise affinities.
We introduce an estimation procedure that provably localizes all the latent positions with a maximum error of the order of $sqrtlog(n)/n$, with high-probability.
arXiv Detail & Related papers (2021-08-06T13:05:30Z) - Optimal rates for independence testing via $U$-statistic permutation
tests [7.090165638014331]
We study the problem of independence testing given independent and identically distributed pairs taking values in a $sigma$-finite, separable measure space.
We first show that there is no valid test of independence that is uniformly consistent against alternatives of the form $f: D(f) geq rho2 $.
arXiv Detail & Related papers (2020-01-15T19:04:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.