Rates of Estimation of Optimal Transport Maps using Plug-in Estimators
via Barycentric Projections
- URL: http://arxiv.org/abs/2107.01718v1
- Date: Sun, 4 Jul 2021 19:50:20 GMT
- Title: Rates of Estimation of Optimal Transport Maps using Plug-in Estimators
via Barycentric Projections
- Authors: Nabarun Deb, Promit Ghosal, and Bodhisattva Sen
- Abstract summary: We provide a comprehensive analysis of the rates of convergences for general plug-in estimators defined via barycentric projections.
Our main contribution is a new stability estimate for barycentric projections which proceeds under minimal smoothness assumptions.
We illustrate the usefulness of this stability estimate by first providing rates of convergence for the natural discrete-discrete and semi-discrete estimators of optimal transport maps.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Optimal transport maps between two probability distributions $\mu$ and $\nu$
on $\mathbb{R}^d$ have found extensive applications in both machine learning
and statistics. In practice, these maps need to be estimated from data sampled
according to $\mu$ and $\nu$. Plug-in estimators are perhaps most popular in
estimating transport maps in the field of computational optimal transport. In
this paper, we provide a comprehensive analysis of the rates of convergences
for general plug-in estimators defined via barycentric projections. Our main
contribution is a new stability estimate for barycentric projections which
proceeds under minimal smoothness assumptions and can be used to analyze
general plug-in estimators. We illustrate the usefulness of this stability
estimate by first providing rates of convergence for the natural
discrete-discrete and semi-discrete estimators of optimal transport maps. We
then use the same stability estimate to show that, under additional smoothness
assumptions of Besov type or Sobolev type, wavelet based or kernel smoothed
plug-in estimators respectively speed up the rates of convergence and
significantly mitigate the curse of dimensionality suffered by the natural
discrete-discrete/semi-discrete estimators. As a by-product of our analysis, we
also obtain faster rates of convergence for plug-in estimators of
$W_2(\mu,\nu)$, the Wasserstein distance between $\mu$ and $\nu$, under the
aforementioned smoothness assumptions, thereby complementing recent results in
Chizat et al. (2020). Finally, we illustrate the applicability of our results
in obtaining rates of convergence for Wasserstein barycenters between two
probability distributions and obtaining asymptotic detection thresholds for
some recent optimal-transport based tests of independence.
Related papers
- Conditional simulation via entropic optimal transport: Toward non-parametric estimation of conditional Brenier maps [13.355769319031184]
Conditional simulation is a fundamental task in statistical modeling.
One promising approach is to construct conditional Brenier maps, where the components of the map pushforward a reference distribution to conditionals of the target.
We propose a non-parametric estimator for conditional Brenier maps based on the computational scalability of emphentropic optimal transport.
arXiv Detail & Related papers (2024-11-11T17:32:47Z) - A Specialized Semismooth Newton Method for Kernel-Based Optimal
Transport [92.96250725599958]
Kernel-based optimal transport (OT) estimators offer an alternative, functional estimation procedure to address OT problems from samples.
We show that our SSN method achieves a global convergence rate of $O (1/sqrtk)$, and a local quadratic convergence rate under standard regularity conditions.
arXiv Detail & Related papers (2023-10-21T18:48:45Z) - Minimax estimation of discontinuous optimal transport maps: The
semi-discrete case [14.333765302506658]
We consider the problem of estimating the optimal transport map between two probability distributions, $P$ and $Q$ in $mathbb Rd$.
We show that an estimator based on entropic optimal transport converges at the minimax-optimal rate $n-1/2$, independent of dimension.
arXiv Detail & Related papers (2023-01-26T18:41:38Z) - Statistical Efficiency of Score Matching: The View from Isoperimetry [96.65637602827942]
We show a tight connection between statistical efficiency of score matching and the isoperimetric properties of the distribution being estimated.
We formalize these results both in the sample regime and in the finite regime.
arXiv Detail & Related papers (2022-10-03T06:09:01Z) - Near-optimal estimation of smooth transport maps with kernel
sums-of-squares [81.02564078640275]
Under smoothness conditions, the squared Wasserstein distance between two distributions could be efficiently computed with appealing statistical error upper bounds.
The object of interest for applications such as generative modeling is the underlying optimal transport map.
We propose the first tractable algorithm for which the statistical $L2$ error on the maps nearly matches the existing minimax lower-bounds for smooth map estimation.
arXiv Detail & Related papers (2021-12-03T13:45:36Z) - Entropic estimation of optimal transport maps [15.685006881635209]
We develop a method for estimating the optimal map between two distributions over $mathbbRd$ with rigorous finite-sample guarantees.
We show that our estimator is easy to compute using Sinkhorn's algorithm.
arXiv Detail & Related papers (2021-09-24T14:57:26Z) - Heavy-tailed Streaming Statistical Estimation [58.70341336199497]
We consider the task of heavy-tailed statistical estimation given streaming $p$ samples.
We design a clipped gradient descent and provide an improved analysis under a more nuanced condition on the noise of gradients.
arXiv Detail & Related papers (2021-08-25T21:30:27Z) - Plugin Estimation of Smooth Optimal Transport Maps [25.23336043463205]
We show that a number of natural estimators for the optimal transport map between two distributions are minimax optimal.
Our work provides new bounds on the risk of corresponding plugin estimators for the quadratic Wasserstein distance.
arXiv Detail & Related papers (2021-07-26T17:58:48Z) - On Projection Robust Optimal Transport: Sample Complexity and Model
Misspecification [101.0377583883137]
Projection robust (PR) OT seeks to maximize the OT cost between two measures by choosing a $k$-dimensional subspace onto which they can be projected.
Our first contribution is to establish several fundamental statistical properties of PR Wasserstein distances.
Next, we propose the integral PR Wasserstein (IPRW) distance as an alternative to the PRW distance, by averaging rather than optimizing on subspaces.
arXiv Detail & Related papers (2020-06-22T14:35:33Z) - $\gamma$-ABC: Outlier-Robust Approximate Bayesian Computation Based on a
Robust Divergence Estimator [95.71091446753414]
We propose to use a nearest-neighbor-based $gamma$-divergence estimator as a data discrepancy measure.
Our method achieves significantly higher robustness than existing discrepancy measures.
arXiv Detail & Related papers (2020-06-13T06:09:27Z) - Statistical Optimal Transport posed as Learning Kernel Embedding [0.0]
This work takes the novel approach of posing statistical Optimal Transport (OT) as that of learning the transport plan's kernel mean embedding from sample based estimates of marginal embeddings.
A key result is that, under very mild conditions, $epsilon$-optimal recovery of the transport plan as well as the Barycentric-projection based transport map is possible with a sample complexity that is completely dimension-free.
arXiv Detail & Related papers (2020-02-08T14:58:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.