Stopping Criterion Design for Recursive Bayesian Classification:
Analysis and Decision Geometry
- URL: http://arxiv.org/abs/2007.15568v2
- Date: Sun, 25 Apr 2021 23:23:22 GMT
- Title: Stopping Criterion Design for Recursive Bayesian Classification:
Analysis and Decision Geometry
- Authors: Aziz Kocanaogullari, Murat Akcakaya and Deniz Erdogmus
- Abstract summary: We propose a geometric interpretation over the state posterior progression.
We show that confidence thresholds defined over maximum of the state posteriors suffer from stiffness.
We then propose a new stopping/termination criterion with a geometrical insight to overcome the limitations.
- Score: 11.399206131178104
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Systems that are based on recursive Bayesian updates for classification limit
the cost of evidence collection through certain stopping/termination criteria
and accordingly enforce decision making. Conventionally, two termination
criteria based on pre-defined thresholds over (i) the maximum of the state
posterior distribution; and (ii) the state posterior uncertainty are commonly
used. In this paper, we propose a geometric interpretation over the state
posterior progression and accordingly we provide a point-by-point analysis over
the disadvantages of using such conventional termination criteria. For example,
through the proposed geometric interpretation we show that confidence
thresholds defined over maximum of the state posteriors suffer from stiffness
that results in unnecessary evidence collection whereas uncertainty based
thresholding methods are fragile to number of categories and terminate
prematurely if some state candidates are already discovered to be unfavorable.
Moreover, both types of termination methods neglect the evolution of posterior
updates. We then propose a new stopping/termination criterion with a
geometrical insight to overcome the limitations of these conventional methods
and provide a comparison in terms of decision accuracy and speed. We validate
our claims using simulations and using real experimental data obtained through
a brain computer interfaced typing system.
Related papers
- Inference with non-differentiable surrogate loss in a general high-dimensional classification framework [4.792322531593389]
We propose a kernel-smoothed decorrelated score to construct hypothesis testing and interval estimations.
Specifically, we adopt kernel approximations to smooth the discontinuous gradient near discontinuity points.
We establish the limiting distribution of the kernel-smoothed decorrelated score and its cross-fitted version in a high-dimensional setup.
arXiv Detail & Related papers (2024-05-20T01:50:35Z) - Likelihood Ratio Confidence Sets for Sequential Decision Making [51.66638486226482]
We revisit the likelihood-based inference principle and propose to use likelihood ratios to construct valid confidence sequences.
Our method is especially suitable for problems with well-specified likelihoods.
We show how to provably choose the best sequence of estimators and shed light on connections to online convex optimization.
arXiv Detail & Related papers (2023-11-08T00:10:21Z) - Calibrating Neural Simulation-Based Inference with Differentiable
Coverage Probability [50.44439018155837]
We propose to include a calibration term directly into the training objective of the neural model.
By introducing a relaxation of the classical formulation of calibration error we enable end-to-end backpropagation.
It is directly applicable to existing computational pipelines allowing reliable black-box posterior inference.
arXiv Detail & Related papers (2023-10-20T10:20:45Z) - Kernel Conditional Moment Constraints for Confounding Robust Inference [22.816690686310714]
We study policy evaluation of offline contextual bandits subject to unobserved confounders.
We propose a general estimator that provides a sharp lower bound of the policy value.
arXiv Detail & Related papers (2023-02-26T16:44:13Z) - Minimax Instrumental Variable Regression and $L_2$ Convergence
Guarantees without Identification or Closedness [71.42652863687117]
We study nonparametric estimation of instrumental variable (IV) regressions.
We propose a new penalized minimax estimator that can converge to a fixed IV solution.
We derive a strong $L$ error rate for our estimator under lax conditions.
arXiv Detail & Related papers (2023-02-10T18:08:49Z) - Optimal Treatment Regimes for Proximal Causal Learning [7.672587258250301]
We propose a novel optimal individualized treatment regime based on outcome and treatment confounding bridges.
We show that the value function of this new optimal treatment regime is superior to that of existing ones in the literature.
arXiv Detail & Related papers (2022-12-19T14:29:25Z) - The Implicit Delta Method [61.36121543728134]
In this paper, we propose an alternative, the implicit delta method, which works by infinitesimally regularizing the training loss of uncertainty.
We show that the change in the evaluation due to regularization is consistent for the variance of the evaluation estimator, even when the infinitesimal change is approximated by a finite difference.
arXiv Detail & Related papers (2022-11-11T19:34:17Z) - Estimation and Applications of Quantiles in Deep Binary Classification [0.0]
Quantile regression, based on check loss, is a widely used inferential paradigm in Statistics.
We consider the analogue of check loss in the binary classification setting.
We develop individualized confidence scores that can be used to decide whether a prediction is reliable.
arXiv Detail & Related papers (2021-02-09T07:07:42Z) - Doubly Robust Off-Policy Value and Gradient Estimation for Deterministic
Policies [80.42316902296832]
We study the estimation of policy value and gradient of a deterministic policy from off-policy data when actions are continuous.
In this setting, standard importance sampling and doubly robust estimators for policy value and gradient fail because the density ratio does not exist.
We propose several new doubly robust estimators based on different kernelization approaches.
arXiv Detail & Related papers (2020-06-06T15:52:05Z) - Confounding-Robust Policy Evaluation in Infinite-Horizon Reinforcement
Learning [70.01650994156797]
Off- evaluation of sequential decision policies from observational data is necessary in batch reinforcement learning such as education healthcare.
We develop an approach that estimates the bounds of a given policy.
We prove convergence to the sharp bounds as we collect more confounded data.
arXiv Detail & Related papers (2020-02-11T16:18:14Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.