Fugu-MT 論文翻訳(概要): Fully Stochastic Primal-dual Gradient Algorithm for Non-convex Optimization on Random Graphs

論文の概要: Fully Stochastic Primal-dual Gradient Algorithm for Non-convex Optimization on Random Graphs

arxiv url: http://arxiv.org/abs/2410.18774v1
Date: Thu, 24 Oct 2024 14:26:58 GMT
ステータス: 翻訳完了
システム内更新日: 2024-11-28 17:07:39.942028
Title: Fully Stochastic Primal-dual Gradient Algorithm for Non-convex Optimization on Random Graphs
Title（参考訳）: ランダムグラフの非凸最適化のための確率的原始2次勾配アルゴリズム
Authors: Chung-Yiu Yau, Haoming Liu, Hoi-To Wai,
Abstract要約: 分散最適化アルゴリズムは、同期オーバーヘッドや断続的な通信といった問題に悩まされることが多い。 FSPDAは、非アンダーライン設定の下で正確に収束する最初のアルゴリズムである。 FSPDAの利点を示すために, 数値実験を行った。
参考スコア（独自算出の注目度）: 21.66341372216097
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Stochastic decentralized optimization algorithms often suffer from issues such as synchronization overhead and intermittent communication. This paper proposes a $\underline{\rm F}$ully $\underline{\rm S}$tochastic $\underline{\rm P}$rimal $\underline{\rm D}$ual gradient $\underline{\rm A}$lgorithm (FSPDA) that suggests an asynchronous decentralized procedure with (i) sparsified non-blocking communication on random undirected graphs and (ii) local stochastic gradient updates. FSPDA allows multiple local gradient steps to accelerate convergence to stationarity while finding a consensual solution with stochastic primal-dual updates. For problems with smooth (possibly non-convex) objective function, we show that FSPDA converges to an $\mathrm{\mathcal{O}( {\it \sigma /\sqrt{nT}} )}$-stationary solution after $\mathrm{\it T}$ iterations without assuming data heterogeneity. The performance of FSPDA is on par with state-of-the-art algorithms whose convergence depend on static graph and synchronous updates. To our best knowledge, FSPDA is the first asynchronous algorithm that converges exactly under the non-convex setting. Numerical experiments are presented to show the benefits of FSPDA.
Abstract（参考訳）: 確率的分散最適化アルゴリズムは、しばしば同期オーバーヘッドや断続的通信といった問題に悩まされる。本稿では,$\underline{\rm F}$ully $\underline{\rm S}$tochastic $\underline{\rm P}$rimal $\underline{\rm D}$ual gradient $\underline{\rm A}$lgorithm (FSPDA)を提案する。 (i)ランダムな非方向グラフ上のスパーシフィケート非ブロッキング通信と (ii)局所確率勾配更新。 FSPDAは、確率的原始双対更新を伴う合意解を見つけながら、複数の局所勾配ステップが定常性への収束を加速することを可能にする。滑らかな(おそらくは非凸な)目的関数を持つ問題に対して、FSPDA はデータ不均一性を仮定せずに $\mathrm{\mathcal{O}( {\displaystyle \sigma /\sqrt{nT}} )} を $\mathrm{\it T} の後に$-定常解に収束することを示す。 FSPDAの性能は、静的グラフと同期更新に依存する最先端のアルゴリズムと同等である。我々の知る限り、FSPDAは非凸条件下で正確に収束する最初の非同期アルゴリズムである。 FSPDAの利点を示すために, 数値実験を行った。

関連論文リスト

Nonconvex Stochastic Optimization under Heavy-Tailed Noises: Optimal Convergence without Gradient Clipping [21.865728815935665]
重み付き雑音下での最初の収束を提供するが、切断はしない。また、テールインデックス$mathfrakp$が事前に不明な場合には、最初の$mathcalO(Tfrac1-mathfrakp3mathfrakp-2)$収束率も設定する。
論文参考訳（メタデータ） (2024-12-27T08:46:46Z)
Two-Timescale Gradient Descent Ascent Algorithms for Nonconvex Minimax Optimization [77.3396841985172]
我々は、構造化された非極小最適化問題の解法として、2時間勾配上昇(TTGDA)を統一的に解析する。我々の貢献はTTGDAアルゴリズムを設計することであり、設定を超えて効果的です。
論文参考訳（メタデータ） (2024-08-21T20:14:54Z)
High Probability Convergence of Stochastic Gradient Methods [15.829413808059124]
最適解への初期距離に依存する有界収束を示す。 AdaGrad-Normのハイバウンドが得られることを示す。
論文参考訳（メタデータ） (2023-02-28T18:42:11Z)
ReSQueing Parallel and Private Stochastic Convex Optimization [59.53297063174519]
本稿では,BFG凸最適化(SCO: Reweighted Query (ReSQue) 推定ツールを提案する。我々はSCOの並列およびプライベート設定における最先端の複雑さを実現するアルゴリズムを開発した。
論文参考訳（メタデータ） (2023-01-01T18:51:29Z)
An Optimal Stochastic Algorithm for Decentralized Nonconvex Finite-sum Optimization [25.21457349137344]
私たちは、DEARESTが少なくとも$mathcal O(+sqrtmnLvarepsilon-2)$ 1次オラクル(IFO)コールと$mathcal O(Lvarepsilon-2/sqrt1-lambda_W)$通信ラウンドを必要とすることを示す証拠を示します。
論文参考訳（メタデータ） (2022-10-25T11:37:11Z)
Optimal Extragradient-Based Bilinearly-Coupled Saddle-Point Optimization [116.89941263390769]
滑らかな凸凹凸結合型サドル点問題, $min_mathbfxmax_mathbfyF(mathbfx) + H(mathbfx,mathbfy)$ を考える。漸進的勾配指数(AG-EG)降下指数アルゴリズムについて述べる。
論文参考訳（メタデータ） (2022-06-17T06:10:20Z)
Sharper Convergence Guarantees for Asynchronous SGD for Distributed and Federated Learning [77.22019100456595]
通信周波数の異なる分散計算作業者のトレーニングアルゴリズムを示す。本研究では,より厳密な収束率を$mathcalO!!(sigma2-2_avg!)とする。また,不均一性の項は,作業者の平均遅延によっても影響されることを示した。
論文参考訳（メタデータ） (2022-06-16T17:10:57Z)
Asynchronous Stochastic Optimization Robust to Arbitrary Delays [54.61797739710608]
遅延勾配の最適化を考えると、ステップt$毎に、アルゴリズムは古い計算を使って更新する - d_t$ for arbitrary delay $d_t gradient。本実験は,遅延分布が歪んだり重くなったりした場合のアルゴリズムの有効性とロバスト性を示す。
論文参考訳（メタデータ） (2021-06-22T15:50:45Z)
Accelerated Gradient Tracking over Time-varying Graphs for Decentralized Optimization [59.65871549878937]
実用的な単一ループ加速勾配追跡には$O(fracgamma1-sigma_gamma)2sqrtfracLepsilon)$が必要であることを証明している。我々の収束率は$O(frac1epsilon5/7)$と$O(fracLmu)5/7frac1(1-sigma)1.5logfrac1epsilon)$よりも大幅に改善した。
論文参考訳（メタデータ） (2021-04-06T15:34:14Z)
Convergence Analysis of Nonconvex Distributed Stochastic Zeroth-order Coordinate Method [3.860616339202303]
本稿では,$ZOn$局所コスト関数の合計により形成されるグローバルコスト関数を最小化する分散非最適化問題について検討する。エージェントは問題を解くためにzo座標法を近似する。
論文参考訳（メタデータ） (2021-03-24T03:07:46Z)
S-ADDOPT: Decentralized stochastic first-order optimization over directed graphs [16.96562173221624]
有向ネットワークノード上に分散する関数のスムーズかつ高コストな関数の和を最小化するために,分散凸最適化を提案する。特に,各ノードに1次オラクルを仮定するtextbftextttS-ADDOPTアルゴリズムを提案する。崩壊するステップサイズ$mathcalO (1/k)$に対して、textbfttS-ADDOPT が$mathcalO (1/k)$ で正解に達し、その収束はネットワーク非依存であることを示す。
論文参考訳（メタデータ） (2020-05-15T21:14:22Z)

関連論文リストは本サイト内にある論文のタイトル・アブストラクトから自動的に作成しています。

指定された論文の情報です。
本サイトの運営者は本サイト（すべての情報・翻訳含む）の品質を保証せず、本サイト（すべての情報・翻訳含む）を使用して発生したあらゆる結果について一切の責任を負いません。