Sequential Predictive Two-Sample and Independence Testing
- URL: http://arxiv.org/abs/2305.00143v2
- Date: Wed, 19 Jul 2023 18:06:46 GMT
- Title: Sequential Predictive Two-Sample and Independence Testing
- Authors: Aleksandr Podkopaev, Aaditya Ramdas
- Abstract summary: We study the problems of sequential nonparametric two-sample and independence testing.
We build upon the principle of (nonparametric) testing by betting.
- Score: 114.4130718687858
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We study the problems of sequential nonparametric two-sample and independence
testing. Sequential tests process data online and allow using observed data to
decide whether to stop and reject the null hypothesis or to collect more data,
while maintaining type I error control. We build upon the principle of
(nonparametric) testing by betting, where a gambler places bets on future
observations and their wealth measures evidence against the null hypothesis.
While recently developed kernel-based betting strategies often work well on
simple distributions, selecting a suitable kernel for high-dimensional or
structured data, such as images, is often nontrivial. To address this drawback,
we design prediction-based betting strategies that rely on the following fact:
if a sequentially updated predictor starts to consistently determine (a) which
distribution an instance is drawn from, or (b) whether an instance is drawn
from the joint distribution or the product of the marginal distributions (the
latter produced by external randomization), it provides evidence against the
two-sample or independence nulls respectively. We empirically demonstrate the
superiority of our tests over kernel-based approaches under structured
settings. Our tests can be applied beyond the case of independent and
identically distributed data, remaining valid and powerful even when the data
distribution drifts over time.
Related papers
- Theory on Score-Mismatched Diffusion Models and Zero-Shot Conditional Samplers [49.97755400231656]
We present the first performance guarantee with explicit dimensional general score-mismatched diffusion samplers.
We show that score mismatches result in an distributional bias between the target and sampling distributions, proportional to the accumulated mismatch between the target and training distributions.
This result can be directly applied to zero-shot conditional samplers for any conditional model, irrespective of measurement noise.
arXiv Detail & Related papers (2024-10-17T16:42:12Z) - Sequential Kernelized Independence Testing [101.22966794822084]
We design sequential kernelized independence tests inspired by kernelized dependence measures.
We demonstrate the power of our approaches on both simulated and real data.
arXiv Detail & Related papers (2022-12-14T18:08:42Z) - Testing Independence of Exchangeable Random Variables [19.973896010415977]
Given well-shuffled data, can we determine whether the data items are statistically (in)dependent?
We will show that this is possible and develop tests that can confidently reject the null hypothesis that data is independent and identically distributed.
One potential application is in Deep Learning, where data is often scraped from the whole internet, with duplications abound.
arXiv Detail & Related papers (2022-10-22T08:55:48Z) - Model-Free Sequential Testing for Conditional Independence via Testing
by Betting [8.293345261434943]
The proposed test allows researchers to analyze an incoming i.i.d. data stream with any arbitrary dependency structure.
We allow the processing of data points online as soon as they arrive and stop data acquisition once significant results are detected.
arXiv Detail & Related papers (2022-10-01T20:05:33Z) - Robust Continual Test-time Adaptation: Instance-aware BN and
Prediction-balanced Memory [58.72445309519892]
We present a new test-time adaptation scheme that is robust against non-i.i.d. test data streams.
Our novelty is mainly two-fold: (a) Instance-Aware Batch Normalization (IABN) that corrects normalization for out-of-distribution samples, and (b) Prediction-balanced Reservoir Sampling (PBRS) that simulates i.i.d. data stream from non-i.i.d. stream in a class-balanced manner.
arXiv Detail & Related papers (2022-08-10T03:05:46Z) - Nonparametric Conditional Local Independence Testing [69.31200003384122]
Conditional local independence is an independence relation among continuous time processes.
No nonparametric test of conditional local independence has been available.
We propose such a nonparametric test based on double machine learning.
arXiv Detail & Related papers (2022-03-25T10:31:02Z) - Context-Aware Drift Detection [0.0]
Two-sample tests of homogeneity form the foundation upon which existing approaches to drift detection build.
We develop a more general drift detection framework built upon a foundation of two-sample tests for conditional distributional treatment effects.
arXiv Detail & Related papers (2022-03-16T14:23:02Z) - Conformal prediction for the design problem [72.14982816083297]
In many real-world deployments of machine learning, we use a prediction algorithm to choose what data to test next.
In such settings, there is a distinct type of distribution shift between the training and test data.
We introduce a method to quantify predictive uncertainty in such settings.
arXiv Detail & Related papers (2022-02-08T02:59:12Z) - Comparing Sequential Forecasters [35.38264087676121]
Consider two forecasters, each making a single prediction for a sequence of events over time.
How might we compare these forecasters, either online or post-hoc, while avoiding unverifiable assumptions on how the forecasts and outcomes were generated?
We present novel sequential inference procedures for estimating the time-varying difference in forecast scores.
We empirically validate our approaches by comparing real-world baseball and weather forecasters.
arXiv Detail & Related papers (2021-09-30T22:54:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.