Fugu-MT paper translation (abstract): High-Probability Bounds for SGD under the Polyak-Lojasiewicz Condition with Markovian Noise

Paper abstract: High-Probability Bounds for SGD under the Polyak-Lojasiewicz Condition with Markovian Noise

arxiv url: http://arxiv.org/abs/2603.14514v1
Date: Sun, 15 Mar 2026 17:50:03 GMT
Status: translation complete
Last updated in system: 2026-03-17 16:19:35.861458
Title: High-Probability Bounds for SGD under the Polyak-Lojasiewicz Condition with Markovian Noise
Authors: Avik Kar, Siddharth Chandak, Rahul Singh, Eric Moulines, Shalabh Bhatnagar, Nicholas Bambos
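Abstract summary: We present the first uniform-in-time high-probability bound for SGD under the PL condition, where the gradient noise contains both Markovian and martingale difference components. Because the PL condition arises in many machine learning and deep learning models, this significantly broadens the scope of finite-time guarantees.
Reference score (independently computed attention score): 27.3629260943211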
License: http://creativecommons.org/licenses/by/4.0/
Abstract: We present the first uniform-in-time high-probability bound for SGD under the PL condition, where the gradient noise contains both Markovian and martingale difference components. This significantly broadens the scope of finite-time guarantees, as the PL condition arises in many machine learning and deep learning models while Markovian noise naturally arises in decentralized optimization and online system identification problems. We further allow the magnitude of noise to grow with the function value, enabling the analysis of many practical sampling strategies. In addition to the high-probability guarantee, we establish a matching $1/k$ decay rate for the expected suboptimality. Our proof technique relies on the Poisson equation to handle the Markovian noise and a probabilistic induction argument to address the lack of almost-sure bounds on the objective. Finally, we demonstrate the applicability of our framework by analyzing three practical optimization problems: token-based decentralized linear regression, supervised learning with subsampling for privacy amplification, and online system identification.
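The token-based decentralized linear regression setting named in the abstract can be illustrated with a small simulation. The following is a minimal sketch under stated assumptions, not the authors' algorithm or experimental setup: the ring topology, the lazy random walk of the token, the synthetic Gaussian data, the step-size schedule `c / (k + k0)`, and every name in the code (`token_sgd`, `n_nodes`, `c`, `k0`, and so on) are hypothetical choices made for illustration. A token moves between nodes as a Markov chain, and at each step SGD uses only the least-squares gradient of the node currently holding the token, so the gradient noise is Markovian rather than i.i.d.

```python
import numpy as np


def token_sgd(n_nodes=8, dim=5, samples_per_node=20, n_steps=5000,
              c=2.0, k0=20.0, seed=0):
    """Run token-based SGD on a ring and return the suboptimality gaps.

    All parameters and the data model are illustrative assumptions.
    """
    rng = np.random.default_rng(seed)

    # Synthetic local datasets: node i holds (A[i], b[i]).
    A = rng.normal(size=(n_nodes, samples_per_node, dim))
    x_true = rng.normal(size=dim)
    b = A @ x_true + 0.1 * rng.normal(size=(n_nodes, samples_per_node))

    # Global least-squares objective (average of the local losses).
    A_full = A.reshape(-1, dim)
    b_full = b.reshape(-1)

    def f(x):
        return 0.5 * np.mean((A_full @ x - b_full) ** 2)

    # Reference minimizer, used only to measure suboptimality.
    x_star = np.linalg.lstsq(A_full, b_full, rcond=None)[0]
    f_star = f(x_star)

    x = np.zeros(dim)
    node = 0
    gaps = [f(x) - f_star]

    for k in range(n_steps):
        # Gradient of the local loss at the node holding the token.
        residual = A[node] @ x - b[node]
        grad = A[node].T @ residual / samples_per_node

        # Decaying step size (assumed schedule).
        x = x - (c / (k + k0)) * grad
        gaps.append(f(x) - f_star)

        # Lazy random walk on the ring: stay w.p. 1/2, else move to a
        # neighbor. Its stationary distribution is uniform over nodes,
        # so the stationary average of the local gradients is the
        # gradient of the global objective.
        u = rng.random()
        if u < 0.25:
            node = (node - 1) % n_nodes
        elif u < 0.5:
            node = (node + 1) % n_nodes

    return gaps


if __name__ == "__main__":
    gaps = token_sgd()
    print(f"initial gap: {gaps[0]:.3e}, final gap: {gaps[-1]:.3e}")
```

Plotting `gaps` against the iteration index on a log-log scale gives a rough visual check of the decay of the suboptimality; a single run of this kind is only an illustration and does not verify the high-probability bound itself.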

Related papers