Fugu-MT 論文翻訳(概要): Optimization Trade-offs in Asynchronous Federated Learning: A Stochastic Networks Approach

論文の概要: Optimization Trade-offs in Asynchronous Federated Learning: A Stochastic Networks Approach

arxiv url: http://arxiv.org/abs/2603.26231v1
Date: Fri, 27 Mar 2026 09:53:53 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-30 21:49:48.430279
Title: Optimization Trade-offs in Asynchronous Federated Learning: A Stochastic Networks Approach
Title（参考訳）: 非同期フェデレーション学習における最適化トレードオフ-確率的ネットワークアプローチ
Authors: Abdelkrim Alahyane, Céline Comte, Matthieu Jonckheere,
Abstract要約: 非同期アルゴリズムは、到着時に更新を処理することで、更新スループットを向上する。収束を低下させる勾配の安定化と、不均一なデータの下でより高速なクライアントへのバイアス。我々は,クライアントとサーバのランダムな計算時間を共同でモデル化する汎用AsyncSGDのキューネットワークフレームワークを開発した。
参考スコア（独自算出の注目度）: 2.0200069355804264
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Synchronous federated learning scales poorly due to the straggler effect. Asynchronous algorithms increase the update throughput by processing updates upon arrival, but they introduce two fundamental challenges: gradient staleness, which degrades convergence, and bias toward faster clients under heterogeneous data distributions. Although algorithms such as AsyncSGD and Generalized AsyncSGD mitigate this bias via client-side task queues, most existing analyses neglect the underlying queueing dynamics and lack closed-form characterizations of the update throughput and gradient staleness. To close this gap, we develop a stochastic queueing-network framework for Generalized AsyncSGD that jointly models random computation times at the clients and the central server, as well as random uplink and downlink communication delays. Leveraging product-form network theory, we derive a closed-form expression for the update throughput, alongside closed-form upper bounds for both the communication round complexity and the expected wall-clock time required to reach an $ε$-stationary point. These results formally characterize the trade-off between gradient staleness and wall-clock convergence speed. We further extend the framework to quantify energy consumption under stochastic timing, revealing an additional trade-off between convergence speed and energy efficiency. Building on these analytical results, we propose gradient-based optimization strategies to jointly optimize routing and concurrency. Experiments on EMNIST demonstrate reductions of 29%--46% in convergence time and 36%--49% in energy consumption compared to AsyncSGD.
Abstract（参考訳）: 同期的フェデレーション学習は、ストラグラー効果により、不十分にスケールする。非同期アルゴリズムは到着時の更新処理によって更新スループットを向上するが,収束性を低下させる勾配の安定化と,異種データ分散下での高速クライアントへのバイアスという,2つの基本的な課題を導入している。 AsyncSGDやGeneralized AsyncSGDのようなアルゴリズムはこのバイアスをクライアントサイドのタスクキューで緩和するが、既存の分析のほとんどは、基礎となるキューのダイナミクスを無視し、更新スループットと勾配の安定性のクローズドフォームの特徴を欠いている。このギャップを埋めるために、我々は、クライアントと中央サーバのランダムな計算時間と、ランダムなアップリンクとダウンリンク通信遅延を共同でモデル化するGeneralized AsyncSGDのための確率的キュー・ネットワークフレームワークを開発する。積形式ネットワーク理論を応用して、更新スループットのクローズドフォーム表現と、通信ラウンドの複雑さと、ε$静止点に達するのに必要なウォールクロック時間の両方のクローズドフォーム上限を導出する。これらの結果は、勾配安定度と壁面収束速度のトレードオフを正式に特徴づける。さらに、確率的タイミングでエネルギー消費を定量化するための枠組みを拡張し、収束速度とエネルギー効率の間にさらなるトレードオフを明らかにする。これらの解析結果に基づいて、ルーティングと並列性を協調的に最適化するための勾配に基づく最適化手法を提案する。 EMNIST実験では, AsyncSGDと比較して収束時間の29%～46%, エネルギー消費の36%～49%が減少した。

論文の概要: Optimization Trade-offs in Asynchronous Federated Learning: A Stochastic Networks Approach

関連論文リスト