Sobolev Transport: A Scalable Metric for Probability Measures with Graph
Metrics
- URL: http://arxiv.org/abs/2202.10723v1
- Date: Tue, 22 Feb 2022 08:27:58 GMT
- Title: Sobolev Transport: A Scalable Metric for Probability Measures with Graph
Metrics
- Authors: Tam Le and Truyen Nguyen and Dinh Phung and Viet Anh Nguyen
- Abstract summary: We consider probability measures supported on a graph metric space and propose a novel Sobolev transport metric.
We show that the Sobolev transport metric yields a closed-form formula for fast computation and it is negative definite.
We show that the space of probability measures endowed with this transport distance is isometric to a bounded convex set in a Euclidean space with a weighted $ell_p$ distance.
- Score: 25.591913014859184
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Optimal transport (OT) is a popular measure to compare probability
distributions. However, OT suffers a few drawbacks such as (i) a high
complexity for computation, (ii) indefiniteness which limits its applicability
to kernel machines. In this work, we consider probability measures supported on
a graph metric space and propose a novel Sobolev transport metric. We show that
the Sobolev transport metric yields a closed-form formula for fast computation
and it is negative definite. We show that the space of probability measures
endowed with this transport distance is isometric to a bounded convex set in a
Euclidean space with a weighted $\ell_p$ distance. We further exploit the
negative definiteness of the Sobolev transport to design positive-definite
kernels, and evaluate their performances against other baselines in document
classification with word embeddings and in topological data analysis.
Related papers
- Scalable Unbalanced Sobolev Transport for Measures on a Graph [23.99177001129992]
Optimal transport (OT) is a powerful tool for comparing probability measures.
OT suffers a few drawbacks: (i) input measures required to have the same mass, (ii) a high computational complexity, and (iii) indefiniteness.
Le et al. (2022) recently proposed Sobolev transport for measures on a graph having the same total mass by leveraging the graph structure over supports.
We show that the proposed unbalanced Sobolev transport admits a closed-form formula for fast computation, and it is also negative definite.
arXiv Detail & Related papers (2023-02-24T07:35:38Z) - Minimax estimation of discontinuous optimal transport maps: The
semi-discrete case [14.333765302506658]
We consider the problem of estimating the optimal transport map between two probability distributions, $P$ and $Q$ in $mathbb Rd$.
We show that an estimator based on entropic optimal transport converges at the minimax-optimal rate $n-1/2$, independent of dimension.
arXiv Detail & Related papers (2023-01-26T18:41:38Z) - Kernel-based off-policy estimation without overlap: Instance optimality
beyond semiparametric efficiency [53.90687548731265]
We study optimal procedures for estimating a linear functional based on observational data.
For any convex and symmetric function class $mathcalF$, we derive a non-asymptotic local minimax bound on the mean-squared error.
arXiv Detail & Related papers (2023-01-16T02:57:37Z) - Robust computation of optimal transport by $\beta$-potential
regularization [79.24513412588745]
Optimal transport (OT) has become a widely used tool in the machine learning field to measure the discrepancy between probability distributions.
We propose regularizing OT with the beta-potential term associated with the so-called $beta$-divergence.
We experimentally demonstrate that the transport matrix computed with our algorithm helps estimate a probability distribution robustly even in the presence of outliers.
arXiv Detail & Related papers (2022-12-26T18:37:28Z) - Statistical Efficiency of Score Matching: The View from Isoperimetry [96.65637602827942]
We show a tight connection between statistical efficiency of score matching and the isoperimetric properties of the distribution being estimated.
We formalize these results both in the sample regime and in the finite regime.
arXiv Detail & Related papers (2022-10-03T06:09:01Z) - Entropic Optimal Transport in Random Graphs [8.7314407902481]
In graph analysis, a classic task consists in computing similarity measures between (groups of) nodes.
We show that it is possible to consistently estimate distances between groups of nodes in the latent space.
arXiv Detail & Related papers (2022-01-11T13:52:34Z) - Near-optimal estimation of smooth transport maps with kernel
sums-of-squares [81.02564078640275]
Under smoothness conditions, the squared Wasserstein distance between two distributions could be efficiently computed with appealing statistical error upper bounds.
The object of interest for applications such as generative modeling is the underlying optimal transport map.
We propose the first tractable algorithm for which the statistical $L2$ error on the maps nearly matches the existing minimax lower-bounds for smooth map estimation.
arXiv Detail & Related papers (2021-12-03T13:45:36Z) - A Note on Optimizing Distributions using Kernel Mean Embeddings [94.96262888797257]
Kernel mean embeddings represent probability measures by their infinite-dimensional mean embeddings in a reproducing kernel Hilbert space.
We show that when the kernel is characteristic, distributions with a kernel sum-of-squares density are dense.
We provide algorithms to optimize such distributions in the finite-sample setting.
arXiv Detail & Related papers (2021-06-18T08:33:45Z) - The Unbalanced Gromov Wasserstein Distance: Conic Formulation and
Relaxation [0.0]
Comparing metric measure spaces (i.e. a metric space endowed with aprobability distribution) is at the heart of many machine learning problems.
The most popular distance between such metric spaces is the metric measureGro-Wasserstein (GW) distance of which is a quadratic.
The GW formulation alleviates the comparison of metric spaces equipped with arbitrary positive measures up to isometries.
arXiv Detail & Related papers (2020-09-09T12:38:14Z) - Linear Optimal Transport Embedding: Provable Wasserstein classification
for certain rigid transformations and perturbations [79.23797234241471]
Discriminating between distributions is an important problem in a number of scientific fields.
The Linear Optimal Transportation (LOT) embeds the space of distributions into an $L2$-space.
We demonstrate the benefits of LOT on a number of distribution classification problems.
arXiv Detail & Related papers (2020-08-20T19:09:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.