Fugu-MT Paper Translation (Abstract): Revisiting Follow-the-Perturbed-Leader with Unbounded Perturbations in Bandit Problems

Paper summary: Revisiting Follow-the-Perturbed-Leader with Unbounded Perturbations in Bandit Problems

arxiv url: http://arxiv.org/abs/2508.18604v1
Date: Tue, 26 Aug 2025 02:12:18 GMT
Status: Translation complete
Last updated in system: 2025-08-27 17:42:38.639148
Title: Revisiting Follow-the-Perturbed-Leader with Unbounded Perturbations in Bandit Problems
Title (reference translation): Revisiting Follow-the-Perturbed-Leader with Unbounded Perturbations in Bandit Problems
Authors: Jongyeong Lee, Junya Honda, Shinji Ito, Min-hwan Oh
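Abstract summary: Follow-the-Regularized-Leader (FTRL) policies have achieved Best-of-Both-Worlds (BOBW) results. This work revisits the classical FTRL-FTPL duality for unbounded perturbations and establishes BOBW results for Follow-the-Perturbed-Leader (FTPL) under a broad family of asymmetric unbounded Fréchet-type perturbations.
Reference score (independently computed attention score): 60.58442311545223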
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Follow-the-Regularized-Leader (FTRL) policies have achieved Best-of-Both-Worlds (BOBW) results in various settings through hybrid regularizers, whereas analogous results for Follow-the-Perturbed-Leader (FTPL) remain limited due to inherent analytical challenges. To advance the analytical foundations of FTPL, we revisit classical FTRL-FTPL duality for unbounded perturbations and establish BOBW results for FTPL under a broad family of asymmetric unbounded Fr\'echet-type perturbations, including hybrid perturbations combining Gumbel-type and Fr\'echet-type tails. These results not only extend the BOBW results of FTPL but also offer new insights into designing alternative FTPL policies competitive with hybrid regularization approaches. Motivated by earlier observations in two-armed bandits, we further investigate the connection between the $1/2$-Tsallis entropy and a Fr\'echet-type perturbation. Our numerical observations suggest that it corresponds to a symmetric Fr\'echet-type perturbation, and based on this, we establish the first BOBW guarantee for symmetric unbounded perturbations in the two-armed setting. In contrast, in general multi-armed bandits, we find an instance in which symmetric Fr\'echet-type perturbations violate the key condition for standard BOBW analysis, which is a problem not observed with asymmetric or nonnegative Fr\'echet-type perturbations. Although this example does not rule out alternative analyses achieving BOBW results, it suggests the limitations of directly applying the relationship observed in two-armed cases to the general case and thus emphasizes the need for further investigation to fully understand the behavior of FTPL in broader settings.
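Abstract (reference translation): Follow-the-Regularized-Leader (FTRL) policies have achieved Best-of-Both-Worlds (BOBW) results in various settings through hybrid regularizers, whereas analogous results for Follow-the-Perturbed-Leader (FTPL) remain limited because of inherent analytical challenges. To strengthen the analytical foundations of FTPL, we revisit the classical FTRL-FTPL duality for unbounded perturbations and establish BOBW results for FTPL under a broad family of asymmetric unbounded Fréchet-type perturbations, including hybrid perturbations that combine Gumbel-type and Fréchet-type tails. These results not only extend the BOBW results for FTPL but also offer new insights into designing FTPL policies that compete with hybrid regularization approaches. Motivated by earlier observations in two-armed bandits, we further investigate the connection between the $1/2$-Tsallis entropy and a Fréchet-type perturbation. Numerical observations suggest that it corresponds to a symmetric Fréchet-type perturbation, and on this basis we establish the first BOBW guarantee for symmetric unbounded perturbations in the two-armed setting. In contrast, in general multi-armed bandits, we find an instance in which symmetric Fréchet-type perturbations violate the key condition for the standard BOBW analysis, a problem not observed with asymmetric or nonnegative Fréchet-type perturbations. This example does not rule out alternative analyses that achieve BOBW results, but it suggests the limits of directly applying the relationship observed in the two-armed case to the general case, and it underscores the need for further investigation to fully understand the behavior of FTPL in broader settings.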

Related papers