Overlapping Batch Confidence Intervals on Statistical Functionals
Constructed from Time Series: Application to Quantiles, Optimization, and
Estimation
- URL: http://arxiv.org/abs/2307.08609v1
- Date: Mon, 17 Jul 2023 16:21:48 GMT
- Title: Overlapping Batch Confidence Intervals on Statistical Functionals
Constructed from Time Series: Application to Quantiles, Optimization, and
Estimation
- Authors: Ziwei Su, Raghu Pasupathy, Yingchieh Yeh, Peter W. Glynn
- Abstract summary: We propose a confidence interval procedure for statistical functionals constructed using data from a stationary time series.
The OBx limits, certain functionals of the Wiener process parameterized by the size of the batches and the extent of their overlap, form the essential machinery for characterizing dependence.
- Score: 5.068678962285631
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We propose a general purpose confidence interval procedure (CIP) for
statistical functionals constructed using data from a stationary time series.
The procedures we propose are based on derived distribution-free analogues of
the $\chi^2$ and Student's $t$ random variables for the statistical functional
context, and hence apply in a wide variety of settings including quantile
estimation, gradient estimation, M-estimation, CVAR-estimation, and arrival
process rate estimation, apart from more traditional statistical settings. Like
the method of subsampling, we use overlapping batches of time series data to
estimate the underlying variance parameter; unlike subsampling and the
bootstrap, however, we assume that the implied point estimator of the
statistical functional obeys a central limit theorem (CLT) to help identify the
weak asymptotics (called OB-x limits, x=I,II,III) of batched Studentized
statistics. The OB-x limits, certain functionals of the Wiener process
parameterized by the size of the batches and the extent of their overlap, form
the essential machinery for characterizing dependence, and consequently the
correctness of the proposed CIPs. The message from extensive numerical
experimentation is that in settings where a functional CLT on the point
estimator is in effect, using \emph{large overlapping batches} alongside OB-x
critical values yields confidence intervals that are often of significantly
higher quality than those obtained from more generic methods like subsampling
or the bootstrap. We illustrate using examples from CVaR estimation, ARMA
parameter estimation, and NHPP rate estimation; R and MATLAB code for OB-x
critical values is available at~\texttt{web.ics.purdue.edu/~pasupath/}.
Related papers
- Semiparametric conformal prediction [79.6147286161434]
Risk-sensitive applications require well-calibrated prediction sets over multiple, potentially correlated target variables.
We treat the scores as random vectors and aim to construct the prediction set accounting for their joint correlation structure.
We report desired coverage and competitive efficiency on a range of real-world regression problems.
arXiv Detail & Related papers (2024-11-04T14:29:02Z) - Statistical Inference for Temporal Difference Learning with Linear Function Approximation [62.69448336714418]
Temporal Difference (TD) learning, arguably the most widely used for policy evaluation, serves as a natural framework for this purpose.
In this paper, we study the consistency properties of TD learning with Polyak-Ruppert averaging and linear function approximation, and obtain three significant improvements over existing results.
arXiv Detail & Related papers (2024-10-21T15:34:44Z) - An Upper Confidence Bound Approach to Estimating the Maximum Mean [0.0]
We study estimation of the maximum mean using an upper confidence bound (UCB) approach.
We establish statistical guarantees, including strong consistency, mean squared errors, and central limit theorems (CLTs) for both estimators.
arXiv Detail & Related papers (2024-08-08T02:53:09Z) - Relaxed Quantile Regression: Prediction Intervals for Asymmetric Noise [51.87307904567702]
Quantile regression is a leading approach for obtaining such intervals via the empirical estimation of quantiles in the distribution of outputs.
We propose Relaxed Quantile Regression (RQR), a direct alternative to quantile regression based interval construction that removes this arbitrary constraint.
We demonstrate that this added flexibility results in intervals with an improvement in desirable qualities.
arXiv Detail & Related papers (2024-06-05T13:36:38Z) - Fundamental bounds for parameter estimation with few measurements [0.0]
We discuss different linear (Barankin-like) conditions that can be imposed on estimators and analyze when these conditions admit an optimal estimator with finite variance.
We show that, if the number of imposed conditions is larger than the number of measurement outcomes, there generally does not exist a corresponding estimator with finite variance.
We derive an extended Cram'er-Rao bound that is compatible with a finite variance in situations where the Barankin bound is undefined.
arXiv Detail & Related papers (2024-02-22T12:40:08Z) - Statistical Inference with Stochastic Gradient Methods under
$\phi$-mixing Data [9.77185962310918]
We propose a mini-batch SGD estimator for statistical inference when the data is $phi$-mixing.
The confidence intervals are constructed using an associated mini-batch SGD procedure.
The proposed method is memory-efficient and easy to implement in practice.
arXiv Detail & Related papers (2023-02-24T16:16:43Z) - Policy evaluation from a single path: Multi-step methods, mixing and
mis-specification [45.88067550131531]
We study non-parametric estimation of the value function of an infinite-horizon $gamma$-discounted Markov reward process.
We provide non-asymptotic guarantees for a general family of kernel-based multi-step temporal difference estimates.
arXiv Detail & Related papers (2022-11-07T23:15:25Z) - Automatically Differentiable Random Coefficient Logistic Demand
Estimation [0.0]
We show how the random coefficient logistic demand (BLP) model can be phrased as an automatically differentiable moment function.
This allows gradient-based frequentist and quasi-Bayesian estimation using the Continuously Updating Estimator (CUE)
Preliminary findings indicate that the CUE estimated using LTE and frequentist optimization has a lower bias but higher MAE compared to the traditional 2-Stage GMM (2S-GMM) approach.
arXiv Detail & Related papers (2021-06-08T18:50:11Z) - Sparse Feature Selection Makes Batch Reinforcement Learning More Sample
Efficient [62.24615324523435]
This paper provides a statistical analysis of high-dimensional batch Reinforcement Learning (RL) using sparse linear function approximation.
When there is a large number of candidate features, our result sheds light on the fact that sparsity-aware methods can make batch RL more sample efficient.
arXiv Detail & Related papers (2020-11-08T16:48:02Z) - Instability, Computational Efficiency and Statistical Accuracy [101.32305022521024]
We develop a framework that yields statistical accuracy based on interplay between the deterministic convergence rate of the algorithm at the population level, and its degree of (instability) when applied to an empirical object based on $n$ samples.
We provide applications of our general results to several concrete classes of models, including Gaussian mixture estimation, non-linear regression models, and informative non-response models.
arXiv Detail & Related papers (2020-05-22T22:30:52Z) - SUMO: Unbiased Estimation of Log Marginal Probability for Latent
Variable Models [80.22609163316459]
We introduce an unbiased estimator of the log marginal likelihood and its gradients for latent variable models based on randomized truncation of infinite series.
We show that models trained using our estimator give better test-set likelihoods than a standard importance-sampling based approach for the same average computational cost.
arXiv Detail & Related papers (2020-04-01T11:49:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.