Private Statistical Estimation of Many Quantiles
- URL: http://arxiv.org/abs/2302.06943v3
- Date: Tue, 26 Dec 2023 08:48:54 GMT
- Title: Private Statistical Estimation of Many Quantiles
- Authors: Cl\'ement Lalanne (ENS de Lyon, DANTE, OCKHAM), Aur\'elien Garivier
(UMPA-ENSL, MC2), R\'emi Gribonval (DANTE, OCKHAM)
- Abstract summary: Given a distribution and access to i.i.d. samples, we study the estimation of the inverse of its cumulative distribution function (the quantile function) at specific points.
This work studies the estimation of many statistical quantiles under differential privacy.
- Score: 0.41232474244672235
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This work studies the estimation of many statistical quantiles under
differential privacy. More precisely, given a distribution and access to i.i.d.
samples from it, we study the estimation of the inverse of its cumulative
distribution function (the quantile function) at specific points. For instance,
this task is of key importance in private data generation. We present two
different approaches. The first one consists in privately estimating the
empirical quantiles of the samples and using this result as an estimator of the
quantiles of the distribution. In particular, we study the statistical
properties of the recently published algorithm introduced by Kaplan et al. 2022
that privately estimates the quantiles recursively. The second approach is to
use techniques of density estimation in order to uniformly estimate the
quantile function on an interval. In particular, we show that there is a
tradeoff between the two methods. When we want to estimate many quantiles, it
is better to estimate the density rather than estimating the quantile function
at specific points.
Related papers
- Semiparametric conformal prediction [79.6147286161434]
Risk-sensitive applications require well-calibrated prediction sets over multiple, potentially correlated target variables.
We treat the scores as random vectors and aim to construct the prediction set accounting for their joint correlation structure.
We report desired coverage and competitive efficiency on a range of real-world regression problems.
arXiv Detail & Related papers (2024-11-04T14:29:02Z) - Relaxed Quantile Regression: Prediction Intervals for Asymmetric Noise [51.87307904567702]
Quantile regression is a leading approach for obtaining such intervals via the empirical estimation of quantiles in the distribution of outputs.
We propose Relaxed Quantile Regression (RQR), a direct alternative to quantile regression based interval construction that removes this arbitrary constraint.
We demonstrate that this added flexibility results in intervals with an improvement in desirable qualities.
arXiv Detail & Related papers (2024-06-05T13:36:38Z) - Bayesian Quantile Regression with Subset Selection: A Posterior Summarization Perspective [0.0]
Quantile regression is a powerful tool in epidemiological studies where interest lies in inferring how different exposures affect specific percentiles of the distribution of a health or life outcome.
Existing methods either estimate conditional quantiles separately for each quantile of interest or estimate the entire conditional distribution using semi- or non-parametric models.
We pose the fundamental problems of linear quantile estimation, uncertainty quantification, and subset selection from a Bayesian decision analysis perspective.
Our approach introduces a quantile-focused squared error loss, which enables efficient, closed-form computing and maintains a close relationship with Wasserstein-based density estimation.
arXiv Detail & Related papers (2023-11-03T17:19:31Z) - Estimation of mutual information via quantum kernel method [0.0]
Estimating mutual information (MI) plays a critical role to investigate the relationship among multiple random variables with a nonlinear correlation.
We propose a method for estimating mutual information using the quantum kernel.
arXiv Detail & Related papers (2023-10-19T00:53:16Z) - Auxiliary Quantile Forecasting with Linear Networks [6.155158115218501]
We propose a novel multi-task method for quantile forecasting with shared Linear layers.
Our method is based on the Implicit quantile learning approach.
We show learning auxiliary quantile tasks leads to state-of-the-art performance on deterministic forecasting benchmarks.
arXiv Detail & Related papers (2022-12-05T20:09:32Z) - Understanding the Under-Coverage Bias in Uncertainty Estimation [58.03725169462616]
quantile regression tends to emphunder-cover than the desired coverage level in reality.
We prove that quantile regression suffers from an inherent under-coverage bias.
Our theory reveals that this under-coverage bias stems from a certain high-dimensional parameter estimation error.
arXiv Detail & Related papers (2021-06-10T06:11:55Z) - Flexible Model Aggregation for Quantile Regression [92.63075261170302]
Quantile regression is a fundamental problem in statistical learning motivated by a need to quantify uncertainty in predictions.
We investigate methods for aggregating any number of conditional quantile models.
All of the models we consider in this paper can be fit using modern deep learning toolkits.
arXiv Detail & Related papers (2021-02-26T23:21:16Z) - Regularization Strategies for Quantile Regression [8.232258589877942]
We show that minimizing an expected pinball loss over a continuous distribution of quantiles is a good regularizer even when only predicting a specific quantile.
We show that lattice models enable regularizing the predicted distribution to a location-scale family.
arXiv Detail & Related papers (2021-02-09T21:10:35Z) - Neural Methods for Point-wise Dependency Estimation [129.93860669802046]
We focus on estimating point-wise dependency (PD), which quantitatively measures how likely two outcomes co-occur.
We demonstrate the effectiveness of our approaches in 1) MI estimation, 2) self-supervised representation learning, and 3) cross-modal retrieval task.
arXiv Detail & Related papers (2020-06-09T23:26:15Z) - Instability, Computational Efficiency and Statistical Accuracy [101.32305022521024]
We develop a framework that yields statistical accuracy based on interplay between the deterministic convergence rate of the algorithm at the population level, and its degree of (instability) when applied to an empirical object based on $n$ samples.
We provide applications of our general results to several concrete classes of models, including Gaussian mixture estimation, non-linear regression models, and informative non-response models.
arXiv Detail & Related papers (2020-05-22T22:30:52Z) - Posterior Ratio Estimation of Latent Variables [14.619879849533662]
In some applications, we want to compare distributions of random variables that are emphinferred from observations.
We study the problem of estimating the ratio between two posterior probability density functions of a latent variable.
arXiv Detail & Related papers (2020-02-15T16:46:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.