CARMS: Categorical-Antithetic-REINFORCE Multi-Sample Gradient Estimator
- URL: http://arxiv.org/abs/2110.14002v1
- Date: Tue, 26 Oct 2021 20:14:30 GMT
- Title: CARMS: Categorical-Antithetic-REINFORCE Multi-Sample Gradient Estimator
- Authors: Alek Dimitriev and Mingyuan Zhou
- Abstract summary: We propose an unbiased estimator for categorical random variables based on multiple mutually negatively correlated (jointly antithetic) samples.
CARMS combines REINFORCE with copula based sampling to avoid duplicate samples and reduce its variance, while keeping the estimator unbiased using importance sampling.
We evaluate CARMS on several benchmark datasets on a generative modeling task, as well as a structured output prediction task, and find it to outperform competing methods including a strong self-control baseline.
- Score: 60.799183326613395
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Accurately backpropagating the gradient through categorical variables is a
challenging task that arises in various domains, such as training discrete
latent variable models. To this end, we propose CARMS, an unbiased estimator
for categorical random variables based on multiple mutually negatively
correlated (jointly antithetic) samples. CARMS combines REINFORCE with copula
based sampling to avoid duplicate samples and reduce its variance, while
keeping the estimator unbiased using importance sampling. It generalizes both
the ARMS antithetic estimator for binary variables, which is CARMS for two
categories, as well as LOORF/VarGrad, the leave-one-out REINFORCE estimator,
which is CARMS with independent samples. We evaluate CARMS on several benchmark
datasets on a generative modeling task, as well as a structured output
prediction task, and find it to outperform competing methods including a strong
self-control baseline. The code is publicly available.
Related papers
- Semiparametric conformal prediction [79.6147286161434]
Risk-sensitive applications require well-calibrated prediction sets over multiple, potentially correlated target variables.
We treat the scores as random vectors and aim to construct the prediction set accounting for their joint correlation structure.
We report desired coverage and competitive efficiency on a range of real-world regression problems.
arXiv Detail & Related papers (2024-11-04T14:29:02Z) - PASA: Attack Agnostic Unsupervised Adversarial Detection using Prediction & Attribution Sensitivity Analysis [2.5347892611213614]
Deep neural networks for classification are vulnerable to adversarial attacks, where small perturbations to input samples lead to incorrect predictions.
We develop a practical method for this characteristic of model prediction and feature attribution to detect adversarial samples.
Our approach demonstrates competitive performance even when an adversary is aware of the defense mechanism.
arXiv Detail & Related papers (2024-04-12T21:22:21Z) - Double Control Variates for Gradient Estimation in Discrete Latent
Variable Models [32.33171301923846]
We introduce a variance reduction technique for score function estimators.
We show that our estimator can have lower variance compared to other state-of-the-art estimators.
arXiv Detail & Related papers (2021-11-09T18:02:42Z) - Variance function estimation in regression model via aggregation
procedures [0.0]
We focus on two particular aggregation setting: Model Selection aggregation (MS) and Convex aggregation (C)
The construction of the estimator relies on a two-step procedure and requires two independent samples.
We show the consistency of the proposed method with respect to the L 2error both for MS and C aggregations.
arXiv Detail & Related papers (2021-10-06T13:03:52Z) - ARMS: Antithetic-REINFORCE-Multi-Sample Gradient for Binary Variables [60.799183326613395]
Antithetic REINFORCE-based Multi-Sample gradient estimator.
ARMS uses a copula to generate any number of mutually antithetic samples.
We evaluate ARMS on several datasets for training generative models, and our experimental results show that it outperforms competing methods.
arXiv Detail & Related papers (2021-05-28T23:19:54Z) - Optimal Off-Policy Evaluation from Multiple Logging Policies [77.62012545592233]
We study off-policy evaluation from multiple logging policies, each generating a dataset of fixed size, i.e., stratified sampling.
We find the OPE estimator for multiple loggers with minimum variance for any instance, i.e., the efficient one.
arXiv Detail & Related papers (2020-10-21T13:43:48Z) - Robust Finite Mixture Regression for Heterogeneous Targets [70.19798470463378]
We propose an FMR model that finds sample clusters and jointly models multiple incomplete mixed-type targets simultaneously.
We provide non-asymptotic oracle performance bounds for our model under a high-dimensional learning framework.
The results show that our model can achieve state-of-the-art performance.
arXiv Detail & Related papers (2020-10-12T03:27:07Z) - Learning Ising models from one or multiple samples [26.00403702328348]
We provide guarantees for one-sample estimation, quantifying the estimation error in terms of the metric entropy of a family of interaction matrices.
Our technical approach benefits from sparsifying a model's interaction network, conditioning on subsets of variables that make the dependencies in the resulting conditional distribution sufficiently weak.
arXiv Detail & Related papers (2020-04-20T15:17:05Z) - Estimating Gradients for Discrete Random Variables by Sampling without
Replacement [93.09326095997336]
We derive an unbiased estimator for expectations over discrete random variables based on sampling without replacement.
We show that our estimator can be derived as the Rao-Blackwellization of three different estimators.
arXiv Detail & Related papers (2020-02-14T14:15:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.