How do kernel-based sensor fusion algorithms behave under high
dimensional noise?
- URL: http://arxiv.org/abs/2111.10940v1
- Date: Mon, 22 Nov 2021 01:42:28 GMT
- Title: How do kernel-based sensor fusion algorithms behave under high
dimensional noise?
- Authors: Xiucai Ding and Hau-Tieng Wu
- Abstract summary: We study the behavior of two kernel based sensor fusion algorithms, NCCA and alternating diffusion (AD)
We show that if NCCA and AD are directly applied to the noisy point clouds without any sanity check, it may generate artificial information that misleads scientists' interpretation.
On the other hand, we prove that if the bandwidths are selected adequately, both NCCA and AD can be made robust to high dimensional noise when the SNRs are relatively large.
- Score: 9.13755431537592
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We study the behavior of two kernel based sensor fusion algorithms,
nonparametric canonical correlation analysis (NCCA) and alternating diffusion
(AD), under the nonnull setting that the clean datasets collected from two
sensors are modeled by a common low dimensional manifold embedded in a high
dimensional Euclidean space and the datasets are corrupted by high dimensional
noise. We establish the asymptotic limits and convergence rates for the
eigenvalues of the associated kernel matrices assuming that the sample
dimension and sample size are comparably large, where NCCA and AD are conducted
using the Gaussian kernel. It turns out that both the asymptotic limits and
convergence rates depend on the signal-to-noise ratio (SNR) of each sensor and
selected bandwidths. On one hand, we show that if NCCA and AD are directly
applied to the noisy point clouds without any sanity check, it may generate
artificial information that misleads scientists' interpretation. On the other
hand, we prove that if the bandwidths are selected adequately, both NCCA and AD
can be made robust to high dimensional noise when the SNRs are relatively
large.
Related papers
- Hyperspectral Band Selection based on Generalized 3DTV and Tensor CUR Decomposition [8.812294191190896]
Hyperspectral Imaging serves as an important technique in remote sensing.
High dimensionality and data volume pose significant computational challenges.
We propose a novel hyperspectral band selection model by decomposing the data into a low-rank and smooth component and a sparse one.
arXiv Detail & Related papers (2024-05-02T02:23:38Z) - Score-based Diffusion Models in Function Space [140.792362459734]
Diffusion models have recently emerged as a powerful framework for generative modeling.
We introduce a mathematically rigorous framework called Denoising Diffusion Operators (DDOs) for training diffusion models in function space.
We show that the corresponding discretized algorithm generates accurate samples at a fixed cost independent of the data resolution.
arXiv Detail & Related papers (2023-02-14T23:50:53Z) - Robust Inference of Manifold Density and Geometry by Doubly Stochastic
Scaling [8.271859911016719]
We develop tools for robust inference under high-dimensional noise.
We show that our approach is robust to variability in technical noise levels across cell types.
arXiv Detail & Related papers (2022-09-16T15:39:11Z) - Multiple Kernel Clustering with Dual Noise Minimization [56.009011016367744]
Multiple kernel clustering (MKC) aims to group data by integrating complementary information from base kernels.
In this paper, we rigorously define dual noise and propose a novel parameter-free MKC algorithm by minimizing them.
We observe that dual noise will pollute the block diagonal structures and incur the degeneration of clustering performance, and C-noise exhibits stronger destruction than N-noise.
arXiv Detail & Related papers (2022-07-13T08:37:42Z) - Learning Low-Dimensional Nonlinear Structures from High-Dimensional
Noisy Data: An Integral Operator Approach [5.975670441166475]
We propose a kernel-spectral embedding algorithm for learning low-dimensional nonlinear structures from high-dimensional and noisy observations.
The algorithm employs an adaptive bandwidth selection procedure which does not rely on prior knowledge of the underlying manifold.
The obtained low-dimensional embeddings can be further utilized for downstream purposes such as data visualization, clustering and prediction.
arXiv Detail & Related papers (2022-02-28T22:46:34Z) - Kernel Two-Sample Tests in High Dimension: Interplay Between Moment Discrepancy and Dimension-and-Sample Orders [1.9303929635966661]
We study the behavior of kernel two-sample tests when the dimension and sample sizes both diverge to infinity.
We establish the central limit theorem (CLT) under both the null hypothesis and the local and fixed alternatives.
The new non-null CLT results allow us to perform exact power analysis, which reveals a delicate interplay between the moment discrepancy that can be detected.
arXiv Detail & Related papers (2021-12-31T23:12:44Z) - Lattice partition recovery with dyadic CART [79.96359947166592]
We study piece-wise constant signals corrupted by additive Gaussian noise over a $d$-dimensional lattice.
Data of this form naturally arise in a host of applications, and the tasks of signal detection or testing, de-noising and estimation have been studied extensively in the statistical and signal processing literature.
In this paper we consider instead the problem of partition recovery, i.e.of estimating the partition of the lattice induced by the constancy regions of the unknown signal.
We prove that a DCART-based procedure consistently estimates the underlying partition at a rate of order $sigma2 k*
arXiv Detail & Related papers (2021-05-27T23:41:01Z) - Random Features for the Neural Tangent Kernel [57.132634274795066]
We propose an efficient feature map construction of the Neural Tangent Kernel (NTK) of fully-connected ReLU network.
We show that dimension of the resulting features is much smaller than other baseline feature map constructions to achieve comparable error bounds both in theory and practice.
arXiv Detail & Related papers (2021-04-03T09:08:12Z) - Impact of signal-to-noise ratio and bandwidth on graph Laplacian
spectrum from high-dimensional noisy point cloud [8.680676599607125]
We study the spectrum of kernel-based graph Laplacian constructed from high-dimensional and noisy random point cloud.
We quantify how the signal and noise interact over different regimes of signal-to-noise ratio (SNR)
arXiv Detail & Related papers (2020-11-21T05:19:04Z) - Deep Networks for Direction-of-Arrival Estimation in Low SNR [89.45026632977456]
We introduce a Convolutional Neural Network (CNN) that is trained from mutli-channel data of the true array manifold matrix.
We train a CNN in the low-SNR regime to predict DoAs across all SNRs.
Our robust solution can be applied in several fields, ranging from wireless array sensors to acoustic microphones or sonars.
arXiv Detail & Related papers (2020-11-17T12:52:18Z) - Isolation Distributional Kernel: A New Tool for Point & Group Anomaly
Detection [76.1522587605852]
Isolation Distributional Kernel (IDK) is a new way to measure the similarity between two distributions.
We demonstrate IDK's efficacy and efficiency as a new tool for kernel based anomaly detection for both point and group anomalies.
arXiv Detail & Related papers (2020-09-24T12:25:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.