Non-parametric Hypothesis Tests for Distributional Group Symmetry
- URL: http://arxiv.org/abs/2307.15834v1
- Date: Fri, 28 Jul 2023 22:51:28 GMT
- Title: Non-parametric Hypothesis Tests for Distributional Group Symmetry
- Authors: Kenny Chiu, Benjamin Bloem-Reddy
- Abstract summary: This work formulates non-parametric hypothesis tests for the presence or absence of general group symmetry.
We provide a general formulation of tests for symmetry that apply to two broad settings.
We apply them to testing for symmetry in geomagnetic satellite data and in two problems from high-energy particle physics.
- Score: 2.1320960069210484
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Symmetry plays a central role in the sciences, machine learning, and
statistics. For situations in which data are known to obey a symmetry, a
multitude of methods that exploit symmetry have been developed. Statistical
tests for the presence or absence of general group symmetry, however, are
largely non-existent. This work formulates non-parametric hypothesis tests,
based on a single independent and identically distributed sample, for
distributional symmetry under a specified group. We provide a general
formulation of tests for symmetry that apply to two broad settings. The first
setting tests for the invariance of a marginal or joint distribution under the
action of a compact group. Here, an asymptotically unbiased test only requires
a computable metric on the space of probability distributions and the ability
to sample uniformly random group elements. Building on this, we propose an
easy-to-implement conditional Monte Carlo test and prove that it achieves exact
$p$-values with finitely many observations and Monte Carlo samples. The second
setting tests for the invariance or equivariance of a conditional distribution
under the action of a locally compact group. We show that the test for
conditional invariance or equivariance can be formulated as particular tests of
conditional independence. We implement these tests from both settings using
kernel methods and study them empirically on synthetic data. Finally, we apply
them to testing for symmetry in geomagnetic satellite data and in two problems
from high-energy particle physics.
Related papers
- Theory on Score-Mismatched Diffusion Models and Zero-Shot Conditional Samplers [49.97755400231656]
We present the first performance guarantee with explicit dimensional general score-mismatched diffusion samplers.
We show that score mismatches result in an distributional bias between the target and sampling distributions, proportional to the accumulated mismatch between the target and training distributions.
This result can be directly applied to zero-shot conditional samplers for any conditional model, irrespective of measurement noise.
arXiv Detail & Related papers (2024-10-17T16:42:12Z) - Equivariant score-based generative models provably learn distributions with symmetries efficiently [7.90752151686317]
Empirical studies have demonstrated that incorporating symmetries into generative models can provide better generalization and sampling efficiency.
We provide the first theoretical analysis and guarantees of score-based generative models (SGMs) for learning distributions that are invariant with respect to some group symmetry.
arXiv Detail & Related papers (2024-10-02T05:14:28Z) - Self-Supervised Detection of Perfect and Partial Input-Dependent Symmetries [11.54837584979607]
Group equivariance can overly constrain models if the symmetries in the group differ from those observed in data.
We propose a method able to detect the level of symmetry of each input without the need for labels.
Our framework is general enough to accommodate different families of both continuous and discrete symmetry distributions.
arXiv Detail & Related papers (2023-12-19T15:11:46Z) - Sequential Predictive Two-Sample and Independence Testing [114.4130718687858]
We study the problems of sequential nonparametric two-sample and independence testing.
We build upon the principle of (nonparametric) testing by betting.
arXiv Detail & Related papers (2023-04-29T01:30:33Z) - Bootstrapped Edge Count Tests for Nonparametric Two-Sample Inference
Under Heterogeneity [5.8010446129208155]
We develop a new nonparametric testing procedure that accurately detects differences between the two samples.
A comprehensive simulation study and an application to detecting user behaviors in online games demonstrates the excellent non-asymptotic performance of the proposed test.
arXiv Detail & Related papers (2023-04-26T22:25:44Z) - Targeted Separation and Convergence with Kernel Discrepancies [61.973643031360254]
kernel-based discrepancy measures are required to (i) separate a target P from other probability measures or (ii) control weak convergence to P.
In this article we derive new sufficient and necessary conditions to ensure (i) and (ii)
For MMDs on separable metric spaces, we characterize those kernels that separate Bochner embeddable measures and introduce simple conditions for separating all measures with unbounded kernels.
arXiv Detail & Related papers (2022-09-26T16:41:16Z) - Statistical Properties of the Entropy from Ordinal Patterns [55.551675080361335]
Knowing the joint distribution of the pair Entropy-Statistical Complexity for a large class of time series models would allow statistical tests that are unavailable to date.
We characterize the distribution of the empirical Shannon's Entropy for any model under which the true normalized Entropy is neither zero nor one.
We present a bilateral test that verifies if there is enough evidence to reject the hypothesis that two signals produce ordinal patterns with the same Shannon's Entropy.
arXiv Detail & Related papers (2022-09-15T23:55:58Z) - Nonparametric Conditional Local Independence Testing [69.31200003384122]
Conditional local independence is an independence relation among continuous time processes.
No nonparametric test of conditional local independence has been available.
We propose such a nonparametric test based on double machine learning.
arXiv Detail & Related papers (2022-03-25T10:31:02Z) - Generalized Sliced Distances for Probability Distributions [47.543990188697734]
We introduce a broad family of probability metrics, coined as Generalized Sliced Probability Metrics (GSPMs)
GSPMs are rooted in the generalized Radon transform and come with a unique geometric interpretation.
We consider GSPM-based gradient flows for generative modeling applications and show that under mild assumptions, the gradient flow converges to the global optimum.
arXiv Detail & Related papers (2020-02-28T04:18:00Z) - The Chi-Square Test of Distance Correlation [7.748852202364896]
chi-square test is non-parametric, extremely fast, and applicable to bias-corrected distance correlation using any strong negative type metric or characteristic kernel.
We show that the underlying chi-square distribution well approximates and dominates the limiting null distribution in upper tail, prove the chi-square test can be valid and consistent for testing independence.
arXiv Detail & Related papers (2019-12-27T15:16:40Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.