Near-Optimal Non-Parametric Sequential Tests and Confidence Sequences
with Possibly Dependent Observations
- URL: http://arxiv.org/abs/2212.14411v5
- Date: Mon, 11 Mar 2024 17:33:13 GMT
- Title: Near-Optimal Non-Parametric Sequential Tests and Confidence Sequences
with Possibly Dependent Observations
- Authors: Aurelien Bibaut, Nathan Kallus, Michael Lindon
- Abstract summary: We provide the first type-I-error and expected-rejection-time guarantees under general non-data generating processes.
We show how to apply our results to inference on parameters defined by estimating equations, such as average treatment effects.
- Score: 44.71254888821376
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Sequential tests and their implied confidence sequences, which are valid at
arbitrary stopping times, promise flexible statistical inference and on-the-fly
decision making. However, strong guarantees are limited to parametric
sequential tests that under-cover in practice or concentration-bound-based
sequences that over-cover and have suboptimal rejection times. In this work, we
consider classic delayed-start normal-mixture sequential probability ratio
tests, and we provide the first asymptotic type-I-error and
expected-rejection-time guarantees under general non-parametric data generating
processes, where the asymptotics are indexed by the test's burn-in time. The
type-I-error results primarily leverage a martingale strong invariance
principle and establish that these tests (and their implied confidence
sequences) have type-I error rates asymptotically equivalent to the desired
(possibly varying) $\alpha$-level. The expected-rejection-time results
primarily leverage an identity inspired by It\^o's lemma and imply that, in
certain asymptotic regimes, the expected rejection time is asymptotically
equivalent to the minimum possible among $\alpha$-level tests. We show how to
apply our results to sequential inference on parameters defined by estimating
equations, such as average treatment effects. Together, our results establish
these (ostensibly parametric) tests as general-purpose, non-parametric, and
near-optimal. We illustrate this via numerical simulations and a real-data
application to A/B testing at Netflix.
Related papers
- Conformal Generative Modeling with Improved Sample Efficiency through Sequential Greedy Filtering [55.15192437680943]
Generative models lack rigorous statistical guarantees for their outputs.
We propose a sequential conformal prediction method producing prediction sets that satisfy a rigorous statistical guarantee.
This guarantee states that with high probability, the prediction sets contain at least one admissible (or valid) example.
arXiv Detail & Related papers (2024-10-02T15:26:52Z) - Mitigating LLM Hallucinations via Conformal Abstention [70.83870602967625]
We develop a principled procedure for determining when a large language model should abstain from responding in a general domain.
We leverage conformal prediction techniques to develop an abstention procedure that benefits from rigorous theoretical guarantees on the hallucination rate (error rate)
Experimentally, our resulting conformal abstention method reliably bounds the hallucination rate on various closed-book, open-domain generative question answering datasets.
arXiv Detail & Related papers (2024-04-04T11:32:03Z) - Precise Error Rates for Computationally Efficient Testing [75.63895690909241]
We revisit the question of simple-versus-simple hypothesis testing with an eye towards computational complexity.
An existing test based on linear spectral statistics achieves the best possible tradeoff curve between type I and type II error rates.
arXiv Detail & Related papers (2023-11-01T04:41:16Z) - Sequential Permutation Testing of Random Forest Variable Importance
Measures [68.8204255655161]
It is proposed here to use sequential permutation tests and sequential p-value estimation to reduce the high computational costs associated with conventional permutation tests.
The results of simulation studies confirm that the theoretical properties of the sequential tests apply.
The numerical stability of the methods is investigated in two additional application studies.
arXiv Detail & Related papers (2022-06-02T20:16:50Z) - Nonparametric Conditional Local Independence Testing [69.31200003384122]
Conditional local independence is an independence relation among continuous time processes.
No nonparametric test of conditional local independence has been available.
We propose such a nonparametric test based on double machine learning.
arXiv Detail & Related papers (2022-03-25T10:31:02Z) - Comparing Sequential Forecasters [35.38264087676121]
Consider two forecasters, each making a single prediction for a sequence of events over time.
How might we compare these forecasters, either online or post-hoc, while avoiding unverifiable assumptions on how the forecasts and outcomes were generated?
We present novel sequential inference procedures for estimating the time-varying difference in forecast scores.
We empirically validate our approaches by comparing real-world baseball and weather forecasters.
arXiv Detail & Related papers (2021-09-30T22:54:46Z) - Time-uniform central limit theory and asymptotic confidence sequences [34.00292366598841]
Confidence sequences (CS) provide valid inference at arbitrary stopping times and incur no penalties for "peeking" at the data.
CSs are nonasymptotic, enjoying finite-sample guarantees but not the aforementioned broad applicability of confidence intervals.
Asymptotic CSs forgo nonasymptotic validity for CLT-like versatility and (asymptotic) time-uniform guarantees.
arXiv Detail & Related papers (2021-03-11T05:45:35Z) - Asymptotic Validity and Finite-Sample Properties of Approximate Randomization Tests [2.28438857884398]
Our key theoretical contribution is a non-asymptotic bound on the discrepancy between the size of an approximate randomization test and the size of the original randomization test using noiseless data.
We illustrate our theory through several examples, including tests of significance in linear regression.
arXiv Detail & Related papers (2019-08-12T16:09:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.