Fugu-MT 論文翻訳(概要): BiAdam: Fast Adaptive Bilevel Optimization Methods

論文の概要: BiAdam: Fast Adaptive Bilevel Optimization Methods

arxiv url: http://arxiv.org/abs/2106.11396v1
Date: Mon, 21 Jun 2021 20:16:40 GMT
ステータス: 翻訳完了
システム内更新日: 2021-06-23 15:03:34.300335
Title: BiAdam: Fast Adaptive Bilevel Optimization Methods
Title（参考訳）: biadam: 高速適応二レベル最適化手法
Authors: Feihu Huang and Heng Huang
Abstract要約: バイレベル最適化は多くの応用のために機械学習への関心が高まっている。制約付き最適化と制約なし最適化の両方に有用な分析フレームワークを提供する。
参考スコア（独自算出の注目度）: 104.96004056928474
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Bilevel optimization recently has attracted increased interest in machine learning due to its many applications such as hyper-parameter optimization and policy optimization. Although some methods recently have been proposed to solve the bilevel problems, these methods do not consider using adaptive learning rates. To fill this gap, in the paper, we propose a class of fast and effective adaptive methods for solving bilevel optimization problems that the outer problem is possibly nonconvex and the inner problem is strongly-convex. Specifically, we propose a fast single-loop BiAdam algorithm based on the basic momentum technique, which achieves a sample complexity of $\tilde{O}(\epsilon^{-4})$ for finding an $\epsilon$-stationary point. At the same time, we propose an accelerated version of BiAdam algorithm (VR-BiAdam) by using variance reduced technique, which reaches the best known sample complexity of $\tilde{O}(\epsilon^{-3})$. To further reduce computation in estimating derivatives, we propose a fast single-loop stochastic approximated BiAdam algorithm (saBiAdam) by avoiding the Hessian inverse, which still achieves a sample complexity of $\tilde{O}(\epsilon^{-4})$ without large batches. We further present an accelerated version of saBiAdam algorithm (VR-saBiAdam), which also reaches the best known sample complexity of $\tilde{O}(\epsilon^{-3})$. We apply the unified adaptive matrices to our methods as the SUPER-ADAM \citep{huang2021super}, which including many types of adaptive learning rates. Moreover, our framework can flexibly use the momentum and variance reduced techniques. In particular, we provide a useful convergence analysis framework for both the constrained and unconstrained bilevel optimization. To the best of our knowledge, we first study the adaptive bilevel optimization methods with adaptive learning rates.
Abstract（参考訳）: 双レベル最適化は最近、ハイパーパラメータ最適化やポリシー最適化といった多くの応用のために機械学習への関心が高まっている。近年,二段階問題を解くための手法が提案されているが,適応学習率は考慮されていない。このギャップを埋めるため,本論文では,外問題が非凸で内的問題が強凸であるような2レベル最適化問題を解くための高速かつ効果的な適応手法を提案する。具体的には、基本運動量法に基づく高速単ループbiadamアルゴリズムを提案する。これは$\epsilon$-stationary pointを求めるために$\tilde{o}(\epsilon^{-4})$のサンプル複雑性を達成する。同時に,分散還元手法を用いてビアダムアルゴリズムの高速化版 (VR-BiAdam) を提案し,この手法は$\tilde{O}(\epsilon^{-3})$の最もよく知られたサンプル複雑性に到達した。導関数を推定する際の計算をさらに削減するため、ヘッセン逆数を避けることで高速な単ループ確率近似ビアダムアルゴリズム(saBiAdam)を提案し、大きなバッチを伴わずに$\tilde{O}(\epsilon^{-4})$のサンプル複雑性を実現する。さらに、SaBiAdamアルゴリズムの高速化版(VR-saBiAdam)を提示し、このアルゴリズムは最もよく知られたサンプルの複雑さを$\tilde{O}(\epsilon^{-3})$とする。適応行列の統一化をsuper-adam \citep{huang2021super} として手法に適用し,様々な適応学習率について検討した。さらに,本フレームワークでは,モーメントと分散低減手法を柔軟に利用することができる。特に,制約付きおよび制約なしの2レベル最適化のための有用な収束解析フレームワークを提供する。まず,適応学習率を用いた適応的二段階最適化手法について検討する。

論文の概要: BiAdam: Fast Adaptive Bilevel Optimization Methods

関連論文リスト