Fugu-MT 論文翻訳(概要): Beyond ReinMax: Low-Variance Gradient Estimators for Discrete Latent Variables

論文の概要: Beyond ReinMax: Low-Variance Gradient Estimators for Discrete Latent Variables

arxiv url: http://arxiv.org/abs/2603.08257v1
Date: Mon, 09 Mar 2026 11:27:20 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-10 15:13:15.837278
Title: Beyond ReinMax: Low-Variance Gradient Estimators for Discrete Latent Variables
Title（参考訳）: ReinMaxを超えて - 離散遅延変数のための低変数勾配推定器
Authors: Daniel Wang, Thang D. Bui,
Abstract要約: 離散潜在変数を含む機械学習モデルは、バックプロパゲーションを容易にするために勾配推定器を必要とする。本稿では,Hunの手法による近似を取り入れたReinMax推定器を提案する。我々の推定器は、離散潜在空間を持つ変分オートエンコーダの訓練において優れた性能を示す。
参考スコア（独自算出の注目度）: 2.0371811934548587
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Machine learning models involving discrete latent variables require gradient estimators to facilitate backpropagation in a computationally efficient manner. The most recent addition to the Straight-Through family of estimators, ReinMax, can be viewed from a numerical ODE perspective as incorporating an approximation via Heun's method to reduce bias, but at the cost of high variance. In this work, we introduce the ReinMax-Rao and ReinMax-CV estimators which incorporate Rao-Blackwellisation and control variate techniques into ReinMax to reduce its variance. Our estimators demonstrate superior performance on training variational autoencoders with discrete latent spaces. Furthermore, we investigate the possibility of leveraging alternative numerical methods for constructing more accurate gradient approximations and present an alternative view of ReinMax from a simpler numerical integration perspective.
Abstract（参考訳）: 離散潜在変数を含む機械学習モデルは、計算的に効率的なバックプロパゲーションを促進するために勾配推定器を必要とする。直近のStraight-Through族に追加されたReinMaxは、数値的なODEの観点から、偏りを減らすためにHeunの手法による近似を取り入れているが、高分散のコストがかかると見なすことができる。本稿では,Rao-Blackwellisation と ReinMax-CV を組み込んだ ReinMax-Rao および ReinMax-CV 推定器を紹介する。我々の推定器は、離散潜在空間を持つ変分オートエンコーダの訓練において優れた性能を示す。さらに、より正確な勾配近似を構築するための代替数値手法を活用する可能性について検討し、より単純な数値積分の観点からReinMaxの代替的な視点を示す。

論文の概要: Beyond ReinMax: Low-Variance Gradient Estimators for Discrete Latent Variables

関連論文リスト