Fugu-MT 論文翻訳(概要): Ada2MS: A Hybrid Optimization Algorithm Based on Exponential Mixing of Elementwise and Global Second-Moment Estimates

論文の概要: Ada2MS: A Hybrid Optimization Algorithm Based on Exponential Mixing of Elementwise and Global Second-Moment Estimates

arxiv url: http://arxiv.org/abs/2605.20533v1
Date: Tue, 19 May 2026 22:11:04 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-21 19:19:56.395811
Title: Ada2MS: A Hybrid Optimization Algorithm Based on Exponential Mixing of Elementwise and Global Second-Moment Estimates
Title（参考訳）: Ada2MS: 要素とグローバルな第2モーメント推定の指数混合に基づくハイブリッド最適化アルゴリズム
Authors: Meng Zhu, Quan Xiao, Weidong Min,
Abstract要約: 本稿では,AdamW-like挙動と運動量-SGD-like挙動のスムーズな遷移を実現する最適化アルゴリズムであるAda2MSを提案する。本研究で評価された視覚的タスクについて,Ada2MSは統合比較プロトコルの下で競合する結果を得る。
参考スコア（独自算出の注目度）: 13.40796672049436
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Optimization algorithms are core methods by which machine learning models iteratively minimize loss functions, update parameters, learn from data, and improve performance. Momentum SGD and AdamW represent two important optimization paradigms. AdamW produces stable updates and usually has strong robustness across training scenarios, but its generalization performance is sometimes weaker than that of momentum methods. Momentum SGD can often obtain better generalization after careful tuning, but it is more sensitive to gradient-scale variation and hyperparameter settings. To balance the strengths and weaknesses of the two paradigms, this paper proposes Ada2MS, an optimization algorithm that achieves a smooth transition between AdamW-like behavior and momentum-SGD-like behavior through continuous exponential interpolation between elementwise second-moment estimates and global second-moment estimates. On the visual tasks evaluated in this study, Ada2MS obtains competitive results under a unified optimizer-comparison protocol. The code will be released at https://github.com/mengzhu0308/Ada2MS
Abstract（参考訳）: 最適化アルゴリズムは、機械学習モデルが損失関数を反復的に最小化し、パラメータを更新し、データから学習し、パフォーマンスを改善するためのコア手法である。モメンタムSGDとAdamWは2つの重要な最適化パラダイムを表している。 AdamWは安定した更新を生成し、トレーニングシナリオ全体で強い堅牢性を持つが、その一般化性能は運動量法よりも弱いことがある。モメンタムSGDは、注意深いチューニング後により良い一般化を得ることができるが、勾配スケールの変動やハイパーパラメータの設定に敏感である。本稿では,AdaW-likeの挙動と運動量-SGD-likeの挙動のスムーズな遷移を実現する最適化アルゴリズムであるAda2MSを提案する。本研究で評価された視覚的タスクについて,Ada2MSは統一されたオプティマイザ比較プロトコルの下で競合する結果を得る。コードはhttps://github.com/mengzhu0308/Ada2MSでリリースされる。

論文の概要: Ada2MS: A Hybrid Optimization Algorithm Based on Exponential Mixing of Elementwise and Global Second-Moment Estimates

関連論文リスト