Self-Bounding Majority Vote Learning Algorithms by the Direct
  Minimization of a Tight PAC-Bayesian C-Bound
        - URL: http://arxiv.org/abs/2104.13626v1
- Date: Wed, 28 Apr 2021 08:23:18 GMT
- Title: Self-Bounding Majority Vote Learning Algorithms by the Direct
  Minimization of a Tight PAC-Bayesian C-Bound
- Authors: Paul Viallard (LHC), Pascal Germain (ULaval), Amaury Habrard (LHC),
  Emilie Morvant (LHC)
- Abstract summary: We derive self-bounding majority vote learning algorithms based on PAC-Bayesian guarantees on the C-Bound.
Our algorithms are scalable and lead to accurate predictors paired with non-vacuous guarantees.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract:   In the PAC-Bayesian literature, the C-Bound refers to an insightful relation
between the risk of a majority vote classifier (under the zero-one loss) and
the first two moments of its margin (i.e., the expected margin and the voters'
diversity). Until now, learning algorithms developed in this framework minimize
the empirical version of the C-Bound, instead of explicit PAC-Bayesian
generalization bounds. In this paper, by directly optimizing PAC-Bayesian
guarantees on the C-Bound, we derive self-bounding majority vote learning
algorithms. Moreover, our algorithms based on gradient descent are scalable and
lead to accurate predictors paired with non-vacuous guarantees.
 
      
        Related papers
        - Towards Efficient Contrastive PAC Learning [6.209600119671225]
 We study contrastive learning under the PAC learning framework.
In this paper, we consider contrastive learning of the fundamental concept of linear representations.
We establish guarantees based on Rademacher complexity, and connect it to PAC guarantees under certain contrastive large-margin conditions.
 arXiv  Detail & Related papers  (2025-02-21T21:51:01Z)
- Multi-View Majority Vote Learning Algorithms: Direct Minimization of   PAC-Bayesian Bounds [0.8039067099377079]
 We extend PAC-Bayesian theory to introduce novel PAC-Bayesian bounds based on R'enyi divergence.
These bounds improve upon traditional Kullback-Leibler divergence and offer more refined complexity measures.
We also propose first and second-order oracle PAC-Bayesian bounds, along with an extension of the C-bound for multi-view learning.
 arXiv  Detail & Related papers  (2024-11-09T20:25:47Z)
- Provably Efficient Exploration in Constrained Reinforcement
  Learning:Posterior Sampling Is All You Need [15.113053885573171]
 We present a new algorithm based on posterior sampling for learning in constrained Markov decision processes (CMDP) in the infinite-horizon undiscounted setting.
The algorithm achieves near-optimal regret bounds while being advantageous empirically compared to the existing algorithms.
 arXiv  Detail & Related papers  (2023-09-27T15:48:36Z)
- Provably Efficient UCB-type Algorithms For Learning Predictive State
  Representations [55.00359893021461]
 The sequential decision-making problem is statistically learnable if it admits a low-rank structure modeled by predictive state representations (PSRs)
This paper proposes the first known UCB-type approach for PSRs, featuring a novel bonus term that upper bounds the total variation distance between the estimated and true models.
In contrast to existing approaches for PSRs, our UCB-type algorithms enjoy computational tractability, last-iterate guaranteed near-optimal policy, and guaranteed model accuracy.
 arXiv  Detail & Related papers  (2023-07-01T18:35:21Z)
- PAC-Bayesian Learning of Optimization Algorithms [6.624726878647541]
 We apply the PAC-Bayes theory to the setting of learning-to-optimize.
We learn optimization algorithms with provable generalization guarantees (PAC-bounds) and explicit trade-off between a high probability of convergence and a high convergence speed.
Our results rely on PAC-Bayes bounds for general, unbounded loss-functions based on exponential families.
 arXiv  Detail & Related papers  (2022-10-20T09:16:36Z)
- Differentially Private Stochastic Gradient Descent with Low-Noise [49.981789906200035]
 Modern machine learning algorithms aim to extract fine-grained information from data to provide accurate predictions, which often conflicts with the goal of privacy protection.
This paper addresses the practical and theoretical importance of developing privacy-preserving machine learning algorithms that ensure good performance while preserving privacy.
 arXiv  Detail & Related papers  (2022-09-09T08:54:13Z)
- On Leave-One-Out Conditional Mutual Information For Generalization [122.2734338600665]
 We derive information theoretic generalization bounds for supervised learning algorithms based on a new measure of leave-one-out conditional mutual information (loo-CMI)
Contrary to other CMI bounds, our loo-CMI bounds can be computed easily and can be interpreted in connection to other notions such as classical leave-one-out cross-validation.
We empirically validate the quality of the bound by evaluating its predicted generalization gap in scenarios for deep learning.
 arXiv  Detail & Related papers  (2022-07-01T17:58:29Z)
- Learning Stochastic Majority Votes by Minimizing a PAC-Bayes
  Generalization Bound [15.557653926558638]
 We investigate a counterpart of majority votes over finite ensembles of classifiers, and study its generalization properties.
We instantiate it with Dirichlet distributions: this allows for a closed-form and differentiable expression for the expected risk.
The resulting majority vote learning algorithm achieves state-of-the-art accuracy and benefits from (non-vacuous) tight bounds.
 arXiv  Detail & Related papers  (2021-06-23T16:57:23Z)
- Uniform-PAC Bounds for Reinforcement Learning with Linear Function
  Approximation [92.3161051419884]
 We study reinforcement learning with linear function approximation.
Existing algorithms only have high-probability regret and/or Probably Approximately Correct (PAC) sample complexity guarantees.
We propose a new algorithm called FLUTE, which enjoys uniform-PAC convergence to the optimal policy with high probability.
 arXiv  Detail & Related papers  (2021-06-22T08:48:56Z)
- Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality [131.45028999325797]
 We develop a doubly robust off-policy AC (DR-Off-PAC) for discounted MDP.
DR-Off-PAC adopts a single timescale structure, in which both actor and critics are updated simultaneously with constant stepsize.
We study the finite-time convergence rate and characterize the sample complexity for DR-Off-PAC to attain an $epsilon$-accurate optimal policy.
 arXiv  Detail & Related papers  (2021-02-23T18:56:13Z)
- A General Framework for the Practical Disintegration of PAC-Bayesian
  Bounds [2.516393111664279]
 We introduce new PAC-Bayesian generalization bounds that have the originality to provide disintegrated bounds.
Our bounds are easily optimizable and can be used to design learning algorithms.
 arXiv  Detail & Related papers  (2021-02-17T09:36:46Z)
- Adaptive Sampling for Best Policy Identification in Markov Decision
  Processes [79.4957965474334]
 We investigate the problem of best-policy identification in discounted Markov Decision (MDPs) when the learner has access to a generative model.
The advantages of state-of-the-art algorithms are discussed and illustrated.
 arXiv  Detail & Related papers  (2020-09-28T15:22:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.