Fugu-MT 論文翻訳(概要): Optimal Anytime Algorithms for Online Convex Optimization with Adversarial Constraints

論文の概要: Optimal Anytime Algorithms for Online Convex Optimization with Adversarial Constraints

arxiv url: http://arxiv.org/abs/2510.22579v1
Date: Sun, 26 Oct 2025 08:35:37 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-28 19:54:32.539764
Title: Optimal Anytime Algorithms for Online Convex Optimization with Adversarial Constraints
Title（参考訳）: 逆制約を考慮したオンライン凸最適化のための最適随時アルゴリズム
Authors: Dhruv Sarkar, Abhishek Sinha,
Abstract要約: 本稿では,対向凸コスト関数の列を学習する問題に対して,任意のオンラインアルゴリズムを提案する。提案アルゴリズムは,標準的な倍数化手法を使わずに最適な性能バウンダリを実現する。我々のアルゴリズムは、$O(sqrtt)$ regret と $tildeO(sqrtt)$ cumulative constraint violation bounds for any $tgeq 1$。
参考スコア（独自算出の注目度）: 7.798233121583888
License: http://creativecommons.org/licenses/by/4.0/
Abstract: We propose an anytime online algorithm for the problem of learning a sequence of adversarial convex cost functions while approximately satisfying another sequence of adversarial online convex constraints. A sequential algorithm is called \emph{anytime} if it provides a non-trivial performance guarantee for any intermediate timestep $t$ without requiring prior knowledge of the length of the entire time horizon $T$. Our proposed algorithm achieves optimal performance bounds without resorting to the standard doubling trick, which has poor practical performance due to multiple restarts. Our core technical contribution is the use of time-varying Lyapunov functions to keep track of constraint violations. This must be contrasted with prior works that used a fixed Lyapunov function tuned to the known horizon length $T$. The use of time-varying Lyapunov function poses unique analytical challenges as properties, such as \emph{monotonicity}, on which the prior proofs rest, no longer hold. By introducing a new analytical technique, we show that our algorithm achieves $O(\sqrt{t})$ regret and $\tilde{O}(\sqrt{t})$ cumulative constraint violation bounds for any $t\geq 1$. We extend our results to the dynamic regret setting, achieving bounds that adapt to the path length of the comparator sequence without prior knowledge of its total length. We also present an adaptive algorithm in the optimistic setting, whose performance gracefully scales with the cumulative prediction error. We demonstrate the practical utility of our algorithm through numerical experiments involving the online shortest path problem.
Abstract（参考訳）: 本稿では, 対向凸コスト関数の列を学習する上で, 対向凸制約の列をほぼ満たしながら, 対向凸コスト関数の列を学習する問題に対して, 任意のオンラインアルゴリズムを提案する。シーケンシャルアルゴリズムは、任意の中間時間ステップ$t$に対して、時間軸全体の長さに関する事前の知識を必要とせずに、非自明な性能保証を提供する場合、 \emph{anytime} と呼ばれる。提案アルゴリズムは,複数再起動による実用性能の悪い標準的な倍数化手法を使わずに,最適な性能境界を実現する。我々の技術的な貢献は、制約違反の追跡に時間変化のLyapunov関数を使うことです。これは、既知の地平線長$T$に調整された固定されたリャプノフ函数を使った以前の作品と対比しなければならない。時変リプノフ函数の使用は、前述した証明がもはや保たない 'emph{monotonicity} のような性質として、ユニークな解析的問題を引き起こす。新しい解析手法を導入することにより、我々のアルゴリズムは、$O(\sqrt{t})$ regret と $\tilde{O}(\sqrt{t})$ cumulative constraint violation bounds for any $t\geq 1$。本研究は,コンパレータ列の経路長に適応する境界を,その全長さを事前に知ることなく,動的後悔設定に拡張する。また,楽観的な設定において,累積予測誤差を優雅にスケールする適応アルゴリズムを提案する。オンライン最短経路問題を含む数値実験により,本アルゴリズムの実用性を実証する。

論文の概要: Optimal Anytime Algorithms for Online Convex Optimization with Adversarial Constraints

関連論文リスト