Fugu-MT 論文翻訳(概要): Adaptivity and Non-stationarity: Problem-dependent Dynamic Regret for Online Convex Optimization

論文の概要: Adaptivity and Non-stationarity: Problem-dependent Dynamic Regret for Online Convex Optimization

arxiv url: http://arxiv.org/abs/2112.14368v2
Date: Mon, 1 May 2023 16:12:43 GMT
ステータス: 翻訳完了
システム内更新日: 2023-05-02 22:02:22.502959
Title: Adaptivity and Non-stationarity: Problem-dependent Dynamic Regret for Online Convex Optimization
Title（参考訳）: 適応性と非定常性:オンライン凸最適化における問題依存動的後悔
Authors: Peng Zhao, Yu-Jie Zhang, Lijun Zhang, Zhi-Hua Zhou
Abstract要約: 本稿では,スムーズさを生かし,問題依存量による動的後悔のT$への依存を補う新しいオンラインアルゴリズムを提案する。この結果が本質的な難易度に適応するのは, 既往の結果よりも厳密であり, 最悪の場合, 同一の値が保証されるためである。
参考スコア（独自算出の注目度）: 93.71361250701075
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We investigate online convex optimization in non-stationary environments and choose the dynamic regret as the performance measure, defined as the difference between cumulative loss incurred by the online algorithm and that of any feasible comparator sequence. Let $T$ be the time horizon and $P_T$ be the path length that essentially reflects the non-stationarity of environments, the state-of-the-art dynamic regret is $\mathcal{O}(\sqrt{T(1+P_T)})$. Although this bound is proved to be minimax optimal for convex functions, in this paper, we demonstrate that it is possible to further enhance the guarantee for some easy problem instances, particularly when online functions are smooth. Specifically, we introduce novel online algorithms that can exploit smoothness and replace the dependence on $T$ in dynamic regret with problem-dependent quantities: the variation in gradients of loss functions, the cumulative loss of the comparator sequence, and the minimum of these two terms. These quantities are at most $\mathcal{O}(T)$ while could be much smaller in benign environments. Therefore, our results are adaptive to the intrinsic difficulty of the problem, since the bounds are tighter than existing results for easy problems and meanwhile guarantee the same rate in the worst case. Notably, our proposed algorithms can achieve favorable dynamic regret with only one gradient per iteration, sharing the same gradient query complexity as the static regret minimization methods. To accomplish this, we introduce the framework of collaborative online ensemble. The proposed framework employs a two-layer online ensemble to handle non-stationarity, and uses optimistic online learning and further introduces crucial correction terms to facilitate effective collaboration within the meta-base two layers, thereby attaining adaptivity. We believe that the framework can be useful for broader problems.
Abstract（参考訳）: 非定常環境におけるオンライン凸最適化について検討し、オンラインアルゴリズムが生み出す累積損失と実行可能なコンパレータシーケンスとの差として定義される性能指標として動的後悔を選択する。 t$ を時間軸とし、$p_t$ を環境の非定常性を反映した経路長とし、最先端の動的後悔は$\mathcal{o}(\sqrt{t(1+p_t)})$である。この境界は凸関数に対してミニマックス最適であることが証明されているが,本稿では,簡単な問題,特にオンライン関数が滑らかである場合の保証をさらに強化できることを実証する。具体的には,損失関数の勾配の変動,コンパレータ列の累積損失,およびこれら2項の最小化など,スムーズさを生かし,動的後悔のT$への依存を問題依存量に置き換える新しいオンラインアルゴリズムを提案する。これらの量は少なくとも$\mathcal{O}(T)$であるが、良質な環境ではずっと小さい。したがって,本研究の結果は,既往の結果よりも厳密であり,かつ最悪の場合において同じ確率を保証できるため,本問題の本質的な難易度に適応する。特に,提案アルゴリズムは1イテレーションに1つの勾配しか持たず,静的な後悔最小化法と同じ勾配クエリの複雑さを共有できる。そこで本稿では,協調型オンラインアンサンブルの枠組みを紹介する。提案手法では,非定常性を扱うために2層オンラインアンサンブルを用い,楽観的なオンライン学習を行い,さらに重要な修正用語を導入して,メタベース2層間の効果的なコラボレーションを促進し,適応性を実現する。このフレームワークは幅広い問題に有効であると考えています。

論文の概要: Adaptivity and Non-stationarity: Problem-dependent Dynamic Regret for Online Convex Optimization

関連論文リスト