Fugu-MT 論文翻訳(概要): A Perturbation Approach to Unconstrained Linear Bandits

論文の概要: A Perturbation Approach to Unconstrained Linear Bandits

arxiv url: http://arxiv.org/abs/2603.28201v1
Date: Mon, 30 Mar 2026 09:17:46 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-31 23:18:45.318391
Title: A Perturbation Approach to Unconstrained Linear Bandits
Title（参考訳）: 制約のない線形帯域に対する摂動的アプローチ
Authors: Andrew Jacobsen, Dorian Baudry, Shinji Ito, Nicolò Cesa-Bianchi,
Abstract要約: 我々は、制約のない帯域線形最適化(uBLO)の文脈で、Abernethy et al. (2008) の標準摂動に基づくアプローチを再考する。制約のない環境では、バンド線形最適化(BLO)を標準オンライン線形最適化(OLO)問題に効果的に還元することを示す。
参考スコア（独自算出の注目度）: 48.45987210959519
License: http://creativecommons.org/licenses/by/4.0/
Abstract: We revisit the standard perturbation-based approach of Abernethy et al. (2008) in the context of unconstrained Bandit Linear Optimization (uBLO). We show the surprising result that in the unconstrained setting, this approach effectively reduces Bandit Linear Optimization (BLO) to a standard Online Linear Optimization (OLO) problem. Our framework improves on prior work in several ways. First, we derive expected-regret guarantees when our perturbation scheme is combined with comparator-adaptive OLO algorithms, leading to new insights about the impact of different adversarial models on the resulting comparator-adaptive rates. We also extend our analysis to dynamic regret, obtaining the optimal $\sqrt{P_T}$ path-length dependencies without prior knowledge of $P_T$. We then develop the first high-probability guarantees for both static and dynamic regret in uBLO. Finally, we discuss lower bounds on the static regret, and prove the folklore $Ω(\sqrt{dT})$ rate for adversarial linear bandits on the unit Euclidean ball, which is of independent interest.
Abstract（参考訳）: 我々は、制約のない帯域線形最適化(uBLO)の文脈で、Abernethy et al (2008) の標準摂動に基づくアプローチを再検討する。制約のない環境では、バンド線形最適化(BLO)を標準オンライン線形最適化(OLO)問題に効果的に還元することを示す。私たちのフレームワークは、いくつかの方法で先行作業を改善します。まず、摂動スキームとコンパレータ適応型OLOアルゴリズムを組み合わせることで、異なる逆モデルが結果のコンパレータ適応率に与える影響について新たな知見を得る。また、解析を動的後悔にまで拡張し、$P_T$の事前知識なしで最適な$\sqrt{P_T}$パス長の依存関係を得る。次に、uBLOにおける静的および動的後悔の両方に対する最初の高確率保証を開発する。最後に、静的な後悔の下位境界について論じ、独立な興味を持つユークリッド球面上の逆線型包帯に対して、フォークロア$Ω(\sqrt{dT})$レートを証明した。

論文の概要: A Perturbation Approach to Unconstrained Linear Bandits

関連論文リスト