Fugu-MT 論文翻訳(概要): Lion Secretly Solves Constrained Optimization: As Lyapunov Predicts

論文の概要: Lion Secretly Solves Constrained Optimization: As Lyapunov Predicts

arxiv url: http://arxiv.org/abs/2310.05898v1
Date: Mon, 9 Oct 2023 17:41:29 GMT
ステータス: 翻訳完了
システム内更新日: 2023-10-10 22:12:15.193267
Title: Lion Secretly Solves Constrained Optimization: As Lyapunov Predicts
Title（参考訳）: Lyapunovの予測通り、ライオンは秘密裏に最適化する
Authors: Lizhang Chen, Bo Liu, Kaizhao Liang, Qiang Liu
Abstract要約: Lion(Evolved Sign Momentum)は、大規模なAIモデルのトレーニングにおいて有望な結果を示している。これはAdamWと同等か好意的に機能するが、メモリ効率は向上する。我々の分析は,ライオン更新のための新しいリャプノフ関数の開発によって可能となった。
参考スコア（独自算出の注目度）: 9.169190618233895
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Lion (Evolved Sign Momentum), a new optimizer discovered through program search, has shown promising results in training large AI models. It performs comparably or favorably to AdamW but with greater memory efficiency. As we can expect from the results of a random search program, Lion incorporates elements from several existing algorithms, including signed momentum, decoupled weight decay, Polak, and Nesterov momentum, but does not fit into any existing category of theoretically grounded optimizers. Thus, even though Lion appears to perform well as a general-purpose optimizer for a wide range of tasks, its theoretical basis remains uncertain. This lack of theoretical clarity limits opportunities to further enhance and expand Lion's efficacy. This work aims to demystify Lion. Based on both continuous-time and discrete-time analysis, we demonstrate that Lion is a theoretically novel and principled approach for minimizing a general loss function $f(x)$ while enforcing a bound constraint $\|x\|_\infty \leq 1/\lambda$. Lion achieves this through the incorporation of decoupled weight decay, where $\lambda$ represents the weight decay coefficient. Our analysis is made possible by the development of a new Lyapunov function for the Lion updates. It applies to a broader family of Lion-$\kappa$ algorithms, where the $\text{sign}(\cdot)$ operator in Lion is replaced by the subgradient of a convex function $\kappa$, leading to the solution of a general composite optimization problem of $\min_x f(x) + \kappa^*(x)$. Our findings provide valuable insights into the dynamics of Lion and pave the way for further improvements and extensions of Lion-related algorithms.
Abstract（参考訳）: プログラム検索を通じて発見された新しいオプティマイザであるLion(Evolved Sign Momentum)は、大規模なAIモデルのトレーニングにおいて有望な結果を示している。 AdamWと同等か好意的に動作するが、メモリ効率は高い。ランダム探索プログラムの結果から想像できるように、lionは、符号付き運動量、デカップリングされた重みの減衰、polak、ネステロフ運動量を含む、いくつかの既存のアルゴリズムの要素を組み込んでいるが、理論上既定のオプティマイザのどのカテゴリにも当てはまらない。したがって、ライオンは幅広いタスクの汎用最適化器として機能するように見えるが、理論的根拠は定かではない。この理論的明快さの欠如は、ライオンの有効性をさらに強化し拡大する機会を制限している。この作品はライオンを軽蔑することを目的としている。連続時間解析と離散時間解析の両方に基づき、Lion は一般損失関数 $f(x)$ を最小化し、有界制約 $\|x\|_\infty \leq 1/\lambda$ を強制する理論的および原理的アプローチであることを示した。ライオンはこれをデカップリングウェイト崩壊の包含によって達成し、$\lambda$はウェイト崩壊係数を表す。我々の分析はライオン更新のための新しいリアプノフ関数の開発によって可能である。これは、Lion-$\kappa$アルゴリズムのより広範なファミリーに適用され、Lionの$\text{sign}(\cdot)$演算子は凸関数 $\kappa$ の次数に置き換えられ、一般的な合成最適化問題である $\min_x f(x) + \kappa^*(x)$ の解となる。我々の発見はライオンのダイナミクスに関する貴重な洞察を与え、ライオン関連アルゴリズムのさらなる改良と拡張の道を開く。

論文の概要: Lion Secretly Solves Constrained Optimization: As Lyapunov Predicts

関連論文リスト