Fugu-MT Paper Translation (Abstract): Accelerated Gradient Methods with Biased Gradient Estimates: Risk Sensitivity, High-Probability Guarantees, and Large Deviation Bounds

Paper overview: Accelerated Gradient Methods with Biased Gradient Estimates: Risk Sensitivity, High-Probability Guarantees, and Large Deviation Bounds

arxiv url: http://arxiv.org/abs/2509.13628v1
Date: Wed, 17 Sep 2025 01:56:31 GMT
Status: Translation complete
Last updated in system: 2025-09-18 18:41:50.688828
Title: Accelerated Gradient Methods with Biased Gradient Estimates: Risk Sensitivity, High-Probability Guarantees, and Large Deviation Bounds
Title (reference translation): Accelerated Gradient Methods with Biased Gradient Estimates: Risk Sensitivity, High-Probability Guarantees, and Large Deviation Bounds
Authors: Mert Gürbüzbalaban, Yasa Syed, Necdet Serhat Aybat
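Abstract summary: We study trade-offs between convergence rate and robustness to gradient errors in first-order methods. We quantify robustness via the risk-sensitive index (RSI) from robust control theory. We also observe an analogous trade-off between RSI and convergence-rate bounds for smooth strongly convex functions.
Reference score (independently computed attention score): 12.025550076793396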
License: http://creativecommons.org/licenses/by/4.0/
Abstract: We study trade-offs between convergence rate and robustness to gradient errors in first-order methods. Our focus is on generalized momentum methods (GMMs), a class that includes Nesterov's accelerated gradient, heavy-ball, and gradient descent. We allow stochastic gradient errors that may be adversarial and biased, and quantify robustness via the risk-sensitive index (RSI) from robust control theory. For quadratic objectives with i.i.d. Gaussian noise, we give closed-form expressions for RSI using 2x2 Riccati equations, revealing a Pareto frontier between RSI and convergence rate over stepsize and momentum choices. We prove a large-deviation principle for time-averaged suboptimality and show that the rate function is, up to scaling, the convex conjugate of the RSI. We further connect RSI to the $H_{\infty}$-norm, showing that stronger worst-case robustness (smaller $H_{\infty}$ norm) yields sharper decay of tail probabilities. Beyond quadratics, under biased sub-Gaussian gradient errors, we derive non-asymptotic bounds on a finite-time analogue of the RSI, giving finite-time high-probability guarantees and large-deviation bounds. We also observe an analogous trade-off between RSI and convergence-rate bounds for smooth strongly convex functions. To our knowledge, these are the first non-asymptotic guarantees and risk-sensitive analysis of GMMs with biased gradients. Numerical experiments on robust regression illustrate the results.
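Abstract (reference translation): We study the trade-offs between convergence rate and robustness to gradient errors in first-order methods. We focus on generalized momentum methods (GMMs), a class that includes Nesterov's accelerated gradient, heavy-ball, and gradient descent. We allow stochastic gradient errors that may be adversarial and biased, and we quantify robustness through the risk-sensitive index (RSI) from robust control theory. For quadratic objectives with i.i.d. Gaussian noise, we give closed-form expressions for the RSI using 2x2 Riccati equations, revealing a Pareto frontier between RSI and convergence rate over the choice of stepsize and momentum. We prove a large-deviation principle for the time-averaged suboptimality and show that its rate function is, up to scaling, the convex conjugate of the RSI. We further connect the RSI to the $H_{\infty}$-norm, showing that stronger worst-case robustness (a smaller $H_{\infty}$ norm) yields sharper decay of tail probabilities. Beyond quadratics, under biased sub-Gaussian gradient errors, we derive non-asymptotic bounds on a finite-time analogue of the RSI, giving finite-time high-probability guarantees and large-deviation bounds. We also observe an analogous trade-off between RSI and convergence-rate bounds for smooth strongly convex functions. To our knowledge, these are the first non-asymptotic guarantees and the first risk-sensitive analysis of GMMs with biased gradients. Numerical experiments on robust regression illustrate the results.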
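
The abstract names the GMM class but does not state its update rule. The sketch below assumes the common three-parameter form (stepsize `alpha`, momentum `beta`, extrapolation `gamma`), under which `beta = gamma = 0` gives gradient descent, `gamma = 0` gives heavy-ball, and `gamma = beta` gives Nesterov's accelerated gradient. The function name `gmm`, the error model (a constant `bias` plus Gaussian noise of scale `sigma`), and the test quadratic are all hypothetical choices made for illustration, not taken from the paper.

```python
import numpy as np

def gmm(grad, x0, alpha, beta, gamma, n_steps, bias=0.0, sigma=0.0, seed=0):
    """Generalized momentum method with an inexact gradient oracle.

    Assumed update (not stated in the abstract):
        y_k     = x_k + gamma * (x_k - x_{k-1})
        x_{k+1} = x_k + beta * (x_k - x_{k-1}) - alpha * (grad(y_k) + e_k)
    with a hypothetical error model e_k = bias + sigma * N(0, I).
    """
    rng = np.random.default_rng(seed)
    x_prev = np.array(x0, dtype=float)
    x = x_prev.copy()
    iterates = [x.copy()]
    for _ in range(n_steps):
        y = x + gamma * (x - x_prev)                       # extrapolated point
        err = bias + sigma * rng.standard_normal(x.shape)  # biased noisy error
        x_next = x + beta * (x - x_prev) - alpha * (grad(y) + err)
        x_prev, x = x, x_next
        iterates.append(x.copy())
    return np.array(iterates)

# Illustrative strongly convex quadratic f(x) = 0.5 * x^T Q x, minimized at 0.
Q = np.diag([1.0, 10.0])
grad = lambda x: Q @ x
f = lambda x: 0.5 * x @ Q @ x
mu, L = 1.0, 10.0
rho = (np.sqrt(L) - np.sqrt(mu)) / (np.sqrt(L) + np.sqrt(mu))

# Textbook parameter choices for each special case (alpha, beta, gamma).
params = {
    "gradient descent": (1.0 / L, 0.0, 0.0),
    "heavy-ball": (4.0 / (np.sqrt(L) + np.sqrt(mu)) ** 2, rho ** 2, 0.0),
    "Nesterov": (1.0 / L, rho, rho),
}
for name, (alpha, beta, gamma) in params.items():
    traj = gmm(grad, [1.0, 1.0], alpha, beta, gamma,
               n_steps=500, bias=0.1, sigma=0.1)
    avg_subopt = np.mean([f(x) for x in traj])  # time-averaged suboptimality
    print(f"{name}: time-averaged suboptimality = {avg_subopt:.4f}")
```

Comparing the printed time averages across the three parameter settings gives a rough empirical feel for the rate-versus-robustness trade-off the abstract describes. It does not compute the RSI itself, whose closed form is given only in the paper.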

Related papers