Estimating means of bounded random variables by betting
- URL: http://arxiv.org/abs/2010.09686v7
- Date: Thu, 25 Aug 2022 18:26:51 GMT
- Title: Estimating means of bounded random variables by betting
- Authors: Ian Waudby-Smith and Aaditya Ramdas
- Abstract summary: This paper derives confidence intervals (CI) and time-uniform confidence sequences (CS) for the classical problem of estimating an unknown mean from bounded observations.
We present a general approach for deriving concentration bounds, that can be seen as a generalization and improvement of the celebrated Chernoff method.
We show how to extend these ideas to sampling without replacement, another heavily studied problem.
- Score: 39.98103047898974
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper derives confidence intervals (CI) and time-uniform confidence
sequences (CS) for the classical problem of estimating an unknown mean from
bounded observations. We present a general approach for deriving concentration
bounds, that can be seen as a generalization and improvement of the celebrated
Chernoff method. At its heart, it is based on a class of composite nonnegative
martingales, with strong connections to testing by betting and the method of
mixtures. We show how to extend these ideas to sampling without replacement,
another heavily studied problem. In all cases, our bounds are adaptive to the
unknown variance, and empirically vastly outperform existing approaches based
on Hoeffding or empirical Bernstein inequalities and their recent
supermartingale generalizations. In short, we establish a new state-of-the-art
for four fundamental problems: CSs and CIs for bounded means, when sampling
with and without replacement.
Related papers
- A Unified Theory of Stochastic Proximal Point Methods without Smoothness [52.30944052987393]
Proximal point methods have attracted considerable interest owing to their numerical stability and robustness against imperfect tuning.
This paper presents a comprehensive analysis of a broad range of variations of the proximal point method (SPPM)
arXiv Detail & Related papers (2024-05-24T21:09:19Z) - Deep Anti-Regularized Ensembles provide reliable out-of-distribution
uncertainty quantification [4.750521042508541]
Deep ensemble often return overconfident estimates outside the training domain.
We show that an ensemble of networks with large weights fitting the training data are likely to meet these two objectives.
We derive a theoretical framework for this approach and show that the proposed optimization can be seen as a "water-filling" problem.
arXiv Detail & Related papers (2023-04-08T15:25:12Z) - Model-Based Uncertainty in Value Functions [89.31922008981735]
We focus on characterizing the variance over values induced by a distribution over MDPs.
Previous work upper bounds the posterior variance over values by solving a so-called uncertainty Bellman equation.
We propose a new uncertainty Bellman equation whose solution converges to the true posterior variance over values.
arXiv Detail & Related papers (2023-02-24T09:18:27Z) - On the Importance of Gradient Norm in PAC-Bayesian Bounds [92.82627080794491]
We propose a new generalization bound that exploits the contractivity of the log-Sobolev inequalities.
We empirically analyze the effect of this new loss-gradient norm term on different neural architectures.
arXiv Detail & Related papers (2022-10-12T12:49:20Z) - Clipped Stochastic Methods for Variational Inequalities with
Heavy-Tailed Noise [64.85879194013407]
We prove the first high-probability results with logarithmic dependence on the confidence level for methods for solving monotone and structured non-monotone VIPs.
Our results match the best-known ones in the light-tails case and are novel for structured non-monotone problems.
In addition, we numerically validate that the gradient noise of many practical formulations is heavy-tailed and show that clipping improves the performance of SEG/SGDA.
arXiv Detail & Related papers (2022-06-02T15:21:55Z) - From Optimality to Robustness: Dirichlet Sampling Strategies in
Stochastic Bandits [0.0]
We study a generic Dirichlet Sampling (DS) algorithm, based on pairwise comparisons of empirical indices computed with re-sampling of the arms' observations.
We show that different variants of this strategy achieve provably optimal regret guarantees when the distributions are bounded and logarithmic regret for semi-bounded distributions with a mild quantile condition.
arXiv Detail & Related papers (2021-11-18T14:34:21Z) - Tight Concentrations and Confidence Sequences from the Regret of
Universal Portfolio [30.750408480772027]
Jun and Orabona [COLT'19] have shown how to easily convert the regret guarantee of an online betting algorithm into a time-uniform concentration inequality.
We show that we can go even further: We show that the regret of a minimax betting algorithm gives rise to a new implicit empirical time-uniform concentration.
arXiv Detail & Related papers (2021-10-27T00:44:32Z) - Stochastic Gradient Descent-Ascent and Consensus Optimization for Smooth
Games: Convergence Analysis under Expected Co-coercivity [49.66890309455787]
We introduce the expected co-coercivity condition, explain its benefits, and provide the first last-iterate convergence guarantees of SGDA and SCO.
We prove linear convergence of both methods to a neighborhood of the solution when they use constant step-size.
Our convergence guarantees hold under the arbitrary sampling paradigm, and we give insights into the complexity of minibatching.
arXiv Detail & Related papers (2021-06-30T18:32:46Z) - Asymptotics of the Empirical Bootstrap Method Beyond Asymptotic
Normality [25.402400996745058]
We show that the limiting distribution of the empirical bootstrap estimator is consistent under stability conditions.
We propose three alternative ways to use the bootstrap method to build confidence intervals with coverage guarantees.
arXiv Detail & Related papers (2020-11-23T07:14:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.