Fugu-MT Paper Translation (Abstract): Analysis of Schedule-Free Nonconvex Optimization

Paper overview: Analysis of Schedule-Free Nonconvex Optimization

arxiv url: http://arxiv.org/abs/2508.06743v1
Date: Fri, 08 Aug 2025 22:54:35 GMT
Status: Translation complete
Last updated in system: 2025-08-12 21:23:28.529091
Title: Analysis of Schedule-Free Nonconvex Optimization
Authors: Connor Brown
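Abstract summary: First-order methods underpin large-scale learning algorithms, but their classical convergence guarantees hinge on carefully scheduled step-sizes that depend on the horizon $T$. The paper derives horizon-agnostic bounds for the Schedule-Free (SF) method in the smooth nonconvex setting and suggests that its $O(1/\log T)$ bound for the original SF algorithm may tighten to $O(1/T)$. The work extends SF's horizon-free guarantees and charts future directions for optimal nonconvex rates.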
Reference score (independently computed attention score): 0.0
License: http://creativecommons.org/publicdomain/zero/1.0/
Abstract: First-order methods underpin most large-scale learning algorithms, yet their classical convergence guarantees hinge on carefully scheduled step-sizes that depend on the total horizon $T$, which is rarely known in advance. The Schedule-Free (SF) method promises optimal performance with hyperparameters that are independent of $T$ by interpolating between Polyak--Ruppert averaging and momentum, but nonconvex analysis of SF has been limited or reliant on strong global assumptions. We introduce a robust Lyapunov framework that, under only $L$-smoothness and lower-boundedness, reduces SF analysis to a single-step descent inequality. This yields horizon-agnostic bounds in the nonconvex setting: $O(1/\log T)$ for constant step + PR averaging, $O(\log T/T)$ for a linearly growing step-size, and a continuum of $O(T^{-(1-\alpha)})$ rates for polynomial averaging. We complement these proofs with Performance Estimation Problem (PEP) experiments that numerically validate our rates and suggest that our $O(1/\log T)$ bound on the original nonconvex SF algorithm may tighten to $O(1/T)$. Our work extends SF's horizon-free guarantees to smooth nonconvex optimization and charts future directions for optimal nonconvex rates.
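The interpolation between Polyak--Ruppert averaging and momentum mentioned in the abstract can be illustrated with a minimal sketch. It assumes the commonly stated Schedule-Free SGD update (a base sequence z, a uniformly averaged sequence x, and gradients evaluated at y = (1 - beta) * z + beta * x); the abstract itself does not spell out the update rule. The function name, the parameters `step_size` and `beta`, and the test objective f(x) = log(1 + x^2) (smooth, lower-bounded, nonconvex) are hypothetical choices for illustration, not taken from the paper.

```python
def grad_log1p_sq(x):
    """Gradient of f(x) = log(1 + x^2), an illustrative smooth nonconvex objective."""
    return 2.0 * x / (1.0 + x * x)


def schedule_free_sgd(grad_fn, x0, step_size=0.2, beta=0.9, num_steps=200):
    """Sketch of Schedule-Free SGD with a constant step and uniform averaging.

    Assumed update (not stated in the abstract):
        y_t     = (1 - beta) * z_t + beta * x_t
        z_{t+1} = z_t - step_size * grad(y_t)
        x_{t+1} = (1 - c) * x_t + c * z_{t+1},  with c = 1 / (t + 1)
    beta = 0 reduces to gradient descent with Polyak-Ruppert averaging.
    """
    x = z = x0
    min_grad_sq = float("inf")  # smallest squared gradient seen at any y_t
    for t in range(1, num_steps + 1):
        y = (1.0 - beta) * z + beta * x   # interpolation point
        g = grad_fn(y)
        min_grad_sq = min(min_grad_sq, g * g)
        z = z - step_size * g             # base step, no schedule depending on T
        c = 1.0 / (t + 1)                 # uniform averaging weight
        x = (1.0 - c) * x + c * z         # running average of the z iterates
    return x, min_grad_sq


if __name__ == "__main__":
    x_avg, best = schedule_free_sgd(grad_log1p_sq, x0=3.0)
    print("averaged iterate:", x_avg, "min squared gradient:", best)
```

Neither `step_size` nor the averaging weight depends on the total number of steps, which is the horizon-free property the abstract refers to. The squared gradient at y is tracked only as a simple diagnostic; which iterate the paper's bounds measure is not stated in this abstract.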


Related papers