Distribution-Free Finite-Sample Guarantees and Split Conformal
Prediction
- URL: http://arxiv.org/abs/2210.14735v1
- Date: Wed, 26 Oct 2022 14:12:24 GMT
- Title: Distribution-Free Finite-Sample Guarantees and Split Conformal
Prediction
- Authors: Roel Hulsman
- Abstract summary: split conformal prediction represents a promising avenue to obtain finite-sample guarantees under minimal distribution-free assumptions.
We highlight the connection between split conformal prediction and classical tolerance predictors developed in the 1940s.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Modern black-box predictive models are often accompanied by weak performance
guarantees that only hold asymptotically in the size of the dataset or require
strong parametric assumptions. In response to this, split conformal prediction
represents a promising avenue to obtain finite-sample guarantees under minimal
distribution-free assumptions. Although prediction set validity most often
concerns marginal coverage, we explore the related but different guarantee of
tolerance regions, reformulating known results in the language of nested
prediction sets and extending on the duality between marginal coverage and
tolerance regions. Furthermore, we highlight the connection between split
conformal prediction and classical tolerance predictors developed in the 1940s,
as well as recent developments in distribution-free risk control. One result
that transfers from classical tolerance predictors is that the coverage of a
prediction set based on order statistics, conditional on the calibration set,
is a random variable stochastically dominating the Beta distribution. We
demonstrate the empirical effectiveness of our findings on synthetic and real
datasets using a popular split conformal prediction procedure called
conformalized quantile regression (CQR).
Related papers
- Bin-Conditional Conformal Prediction of Fatalities from Armed Conflict [0.5312303275762104]
We introduce a novel extension to conformal prediction algorithm which we call bin-conditional conformal prediction.
This method allows users to obtain individual-level prediction intervals for any arbitrary prediction model.
We apply the bin-conditional conformal prediction algorithm to forecast fatalities from armed conflict.
arXiv Detail & Related papers (2024-10-18T14:41:42Z) - Provably Reliable Conformal Prediction Sets in the Presence of Data Poisoning [53.42244686183879]
Conformal prediction provides model-agnostic and distribution-free uncertainty quantification.
Yet, conformal prediction is not reliable under poisoning attacks where adversaries manipulate both training and calibration data.
We propose reliable prediction sets (RPS): the first efficient method for constructing conformal prediction sets with provable reliability guarantees under poisoning.
arXiv Detail & Related papers (2024-10-13T15:37:11Z) - Online scalable Gaussian processes with conformal prediction for guaranteed coverage [32.21093722162573]
The consistency of the resulting uncertainty values hinges on the premise that the learning function conforms to the properties specified by the GP model.
We propose to wed the GP with the prevailing conformal prediction (CP), a distribution-free post-processing framework that produces it prediction sets with a provably valid coverage.
arXiv Detail & Related papers (2024-10-07T19:22:15Z) - Conformal Generative Modeling with Improved Sample Efficiency through Sequential Greedy Filtering [55.15192437680943]
Generative models lack rigorous statistical guarantees for their outputs.
We propose a sequential conformal prediction method producing prediction sets that satisfy a rigorous statistical guarantee.
This guarantee states that with high probability, the prediction sets contain at least one admissible (or valid) example.
arXiv Detail & Related papers (2024-10-02T15:26:52Z) - Calibrated Probabilistic Forecasts for Arbitrary Sequences [58.54729945445505]
Real-world data streams can change unpredictably due to distribution shifts, feedback loops and adversarial actors.
We present a forecasting framework ensuring valid uncertainty estimates regardless of how data evolves.
arXiv Detail & Related papers (2024-09-27T21:46:42Z) - Probabilistic Conformal Prediction with Approximate Conditional Validity [81.30551968980143]
We develop a new method for generating prediction sets that combines the flexibility of conformal methods with an estimate of the conditional distribution.
Our method consistently outperforms existing approaches in terms of conditional coverage.
arXiv Detail & Related papers (2024-07-01T20:44:48Z) - Conditional validity of heteroskedastic conformal regression [12.905195278168506]
Conformal prediction and split conformal prediction offer a distribution-free approach to estimating prediction intervals with statistical guarantees.
Recent work has shown that split conformal prediction can produce state-of-the-art prediction intervals when focusing on marginal coverage.
This paper tries to shed new light on how prediction intervals can be constructed, using methods such as normalized and Mondrian conformal prediction.
arXiv Detail & Related papers (2023-09-15T11:10:46Z) - Conformal Prediction with Missing Values [19.18178194789968]
We first show that the marginal coverage guarantee of conformal prediction holds on imputed data for any missingness distribution.
We then show that a universally consistent quantile regression algorithm trained on the imputed data is Bayes optimal for the pinball risk.
arXiv Detail & Related papers (2023-06-05T09:28:03Z) - CovarianceNet: Conditional Generative Model for Correct Covariance
Prediction in Human Motion Prediction [71.31516599226606]
We present a new method to correctly predict the uncertainty associated with the predicted distribution of future trajectories.
Our approach, CovariaceNet, is based on a Conditional Generative Model with Gaussian latent variables.
arXiv Detail & Related papers (2021-09-07T09:38:24Z) - Private Prediction Sets [72.75711776601973]
Machine learning systems need reliable uncertainty quantification and protection of individuals' privacy.
We present a framework that treats these two desiderata jointly.
We evaluate the method on large-scale computer vision datasets.
arXiv Detail & Related papers (2021-02-11T18:59:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.