Spectral convergence of diffusion maps: improved error bounds and an
alternative normalisation
- URL: http://arxiv.org/abs/2006.02037v3
- Date: Wed, 7 Apr 2021 22:44:45 GMT
- Title: Spectral convergence of diffusion maps: improved error bounds and an
alternative normalisation
- Authors: Caroline L. Wormell and Sebastian Reich
- Abstract summary: This paper uses new approaches to improve the error bounds in the model case where the distribution is supported on a hypertorus.
We match long-standing pointwise error bounds for both the spectral data and the norm convergence of the operator discretisation.
We also introduce an alternative normalisation for diffusion maps based on Sinkhorn weights.
- Score: 0.6091702876917281
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Diffusion maps is a manifold learning algorithm widely used for
dimensionality reduction. Using a sample from a distribution, it approximates
the eigenvalues and eigenfunctions of associated Laplace-Beltrami operators.
Theoretical bounds on the approximation error are however generally much weaker
than the rates that are seen in practice. This paper uses new approaches to
improve the error bounds in the model case where the distribution is supported
on a hypertorus. For the data sampling (variance) component of the error we
make spatially localised compact embedding estimates on certain Hardy spaces;
we study the deterministic (bias) component as a perturbation of the
Laplace-Beltrami operator's associated PDE, and apply relevant spectral
stability results. Using these approaches, we match long-standing pointwise
error bounds for both the spectral data and the norm convergence of the
operator discretisation.
We also introduce an alternative normalisation for diffusion maps based on
Sinkhorn weights. This normalisation approximates a Langevin diffusion on the
sample and yields a symmetric operator approximation. We prove that it has
better convergence compared with the standard normalisation on flat domains,
and present a highly efficient algorithm to compute the Sinkhorn weights.
Related papers
- Distributed Markov Chain Monte Carlo Sampling based on the Alternating
Direction Method of Multipliers [143.6249073384419]
In this paper, we propose a distributed sampling scheme based on the alternating direction method of multipliers.
We provide both theoretical guarantees of our algorithm's convergence and experimental evidence of its superiority to the state-of-the-art.
In simulation, we deploy our algorithm on linear and logistic regression tasks and illustrate its fast convergence compared to existing gradient-based methods.
arXiv Detail & Related papers (2024-01-29T02:08:40Z) - Integral Operator Approaches for Scattered Data Fitting on Spheres [16.389581549801253]
We study the approximation performance of a class of weighted spectral filter algorithms.
We derive optimal Sobolev-type error estimates of weighted spectral filter algorithms.
arXiv Detail & Related papers (2024-01-27T04:42:50Z) - Sampling and estimation on manifolds using the Langevin diffusion [45.57801520690309]
Two estimators of linear functionals of $mu_phi $ based on the discretized Markov process are considered.
Error bounds are derived for sampling and estimation using a discretization of an intrinsically defined Langevin diffusion.
arXiv Detail & Related papers (2023-12-22T18:01:11Z) - Sharp error estimates for target measure diffusion maps with
applications to the committor problem [0.0]
We obtain sharp error estimates for the consistency error of the Target Measure Diffusion map (TMDmap) (Banisch et al. 2020)
The resulting convergence rates are consistent with the approximation theory of graph Laplacians.
We use these results to study an important application of TMDmap in the analysis of rare events in systems governed by overdamped Langevin dynamics.
arXiv Detail & Related papers (2023-12-22T03:52:17Z) - Noise-Free Sampling Algorithms via Regularized Wasserstein Proximals [3.4240632942024685]
We consider the problem of sampling from a distribution governed by a potential function.
This work proposes an explicit score based MCMC method that is deterministic, resulting in a deterministic evolution for particles.
arXiv Detail & Related papers (2023-08-28T23:51:33Z) - Robust Inference of Manifold Density and Geometry by Doubly Stochastic
Scaling [8.271859911016719]
We develop tools for robust inference under high-dimensional noise.
We show that our approach is robust to variability in technical noise levels across cell types.
arXiv Detail & Related papers (2022-09-16T15:39:11Z) - Efficient CDF Approximations for Normalizing Flows [64.60846767084877]
We build upon the diffeomorphic properties of normalizing flows to estimate the cumulative distribution function (CDF) over a closed region.
Our experiments on popular flow architectures and UCI datasets show a marked improvement in sample efficiency as compared to traditional estimators.
arXiv Detail & Related papers (2022-02-23T06:11:49Z) - On the Double Descent of Random Features Models Trained with SGD [78.0918823643911]
We study properties of random features (RF) regression in high dimensions optimized by gradient descent (SGD)
We derive precise non-asymptotic error bounds of RF regression under both constant and adaptive step-size SGD setting.
We observe the double descent phenomenon both theoretically and empirically.
arXiv Detail & Related papers (2021-10-13T17:47:39Z) - Optimal policy evaluation using kernel-based temporal difference methods [78.83926562536791]
We use kernel Hilbert spaces for estimating the value function of an infinite-horizon discounted Markov reward process.
We derive a non-asymptotic upper bound on the error with explicit dependence on the eigenvalues of the associated kernel operator.
We prove minimax lower bounds over sub-classes of MRPs.
arXiv Detail & Related papers (2021-09-24T14:48:20Z) - Spectral clustering under degree heterogeneity: a case for the random
walk Laplacian [83.79286663107845]
This paper shows that graph spectral embedding using the random walk Laplacian produces vector representations which are completely corrected for node degree.
In the special case of a degree-corrected block model, the embedding concentrates about K distinct points, representing communities.
arXiv Detail & Related papers (2021-05-03T16:36:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.