Fugu-MT 論文翻訳(概要): Controlled disagreement improves generalization in decentralized training

論文の概要: Controlled disagreement improves generalization in decentralized training

arxiv url: http://arxiv.org/abs/2602.02899v1
Date: Mon, 02 Feb 2026 23:14:37 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-04 18:37:15.129202
Title: Controlled disagreement improves generalization in decentralized training
Title（参考訳）: 制御された不一致は分散訓練における一般化を改善する
Authors: Zesen Wang, Mikael Johansson,
Abstract要約: 集中型トレーニングは、コンセンサスエラーが収束と一般化を損なうため、集中型トレーニングよりも劣ると見なされることが多い。本研究は,Adaptive Consensus (DSGD-AC) を用いた分散SGDの導入により,この視点に挑戦する。これらの誤差はランダムノイズではなく、支配的なヘッセン部分空間と体系的に一致し、フラットなミニマに向けて最適化を導く構造的摂動として機能することを証明する。
参考スコア（独自算出の注目度）: 10.764160559530845
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Decentralized training is often regarded as inferior to centralized training because the consensus errors between workers are thought to undermine convergence and generalization, even with homogeneous data distributions. This work challenges this view by introducing decentralized SGD with Adaptive Consensus (DSGD-AC), which intentionally preserves non-vanishing consensus errors through a time-dependent scaling mechanism. We prove that these errors are not random noise but systematically align with the dominant Hessian subspace, acting as structured perturbations that guide optimization toward flatter minima. Across image classification and machine translation benchmarks, DSGD-AC consistently surpasses both standard DSGD and centralized SGD in test accuracy and solution flatness. Together, these results establish consensus errors as a useful implicit regularizer and open a new perspective on the design of decentralized learning algorithms.
Abstract（参考訳）: 分散トレーニングは、労働者間のコンセンサスエラーが、均質なデータ分布であっても収束と一般化を損なうと考えられるため、集中トレーニングよりも劣ると見なされることが多い。本研究は,DSGD-ACによる分散SGDの導入により,時間依存のスケーリング機構を通じて,意図的に非消滅的コンセンサスエラーを保存することにより,この視点に挑戦する。これらの誤差はランダムノイズではなく、支配的なヘッセン部分空間と体系的に一致し、フラットなミニマに向けて最適化を導く構造的摂動として機能することを証明する。画像分類と機械翻訳のベンチマークを通じて、DSGD-ACは、テスト精度とソリューション平坦性において、標準DSGDと集中型SGDの両方を一貫して上回っている。これらの結果とともに、コンセンサスエラーを有用な暗黙正則化器として確立し、分散学習アルゴリズムの設計に関する新たな視点を開く。

論文の概要: Controlled disagreement improves generalization in decentralized training

関連論文リスト