Fugu-MT Paper Translation (Abstract): Adaptive Learning Rates with Surrogate Probability for Follow-the-Perturbed-Leader

Paper summary: Adaptive Learning Rates with Surrogate Probability for Follow-the-Perturbed-Leader

arxiv url: http://arxiv.org/abs/2606.06043v1
Date: Thu, 04 Jun 2026 11:36:08 GMT
Status: Translation complete
Last updated in system: 2026-06-05 22:39:44.760446
Title: Adaptive Learning Rates with Surrogate Probability for Follow-the-Perturbed-Leader
Title (reference translation): Adaptive learning rates with surrogate probability for follow-the-perturbed-leader
Authors: Jongyeong Lee, Junya Honda, Shinji Ito, Chansoo Kim
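Abstract summary: The follow-the-regularized-leader framework has shown effectiveness and flexibility in online learning problems. This paper proposes an adaptive learning rate for FTPL by introducing surrogate probability functions. It also shows BOBW guarantees for FTPL with adaptive learning rates in the bandit problem with expert advice.
Reference score (independently computed attention score): 34.785711821917424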
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The follow-the-regularized-leader framework has shown effectiveness and flexibility in online learning problems, where the choice of learning rates is known to be crucial. Recently, adaptive learning rates defined in terms of the arm-selection probabilities, which are obtained by solving a convex optimization problem, have achieved improved best-of-both-worlds (BOBW) guarantees in various bandit problems. In contrast, BOBW guarantees for its computationally efficient alternative, follow-the-perturbed-leader (FTPL), remain relatively limited, since its optimization-free nature ironically makes the design of adaptive, probability-dependent learning rates non-trivial. To address this challenge, we propose an adaptive learning rate for FTPL by introducing surrogate probability functions that can be computed from the available quantities alone, without requiring the exact probabilities. Based on these learning rates with surrogate functions, we provide a BOBW guarantee for FTPL with Pareto perturbations for any shape parameter $α>1$, generalizing prior results restricted to the specific choice $α=2$. We further show BOBW guarantees for FTPL with adaptive learning rates in the bandit problem with expert advice. Our approach preserves the computational simplicity of FTPL while enabling probability-dependent adaptivity, and the surrogate-based methodology may be of independent interest in other algorithmic frameworks beyond FTPL and in learning rate design.
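Abstract (reference translation): The follow-the-regularized-leader framework has shown effectiveness and flexibility in online learning, where the choice of learning rates is known to be important. In recent years, adaptive learning rates defined through arm-selection probabilities obtained by convex optimization have achieved improved best-of-both-worlds (BOBW) guarantees in various bandit problems. In contrast, BOBW guarantees for follow-the-perturbed-leader (FTPL), its computationally efficient alternative, remain relatively limited, because its optimization-free nature makes the design of adaptive, probability-dependent learning rates non-trivial. To address this challenge, we propose an adaptive learning rate for FTPL by introducing surrogate probability functions that can be computed from the available quantities alone, without requiring the exact probabilities. Based on these learning rates with surrogate functions, we give a BOBW guarantee for FTPL with Pareto perturbations for any shape parameter $α>1$, generalizing prior results that were restricted to the specific choice $α=2$. We further show BOBW guarantees for FTPL with adaptive learning rates in the bandit problem with expert advice. The proposed approach preserves the computational simplicity of FTPL while enabling probability-dependent adaptivity, and the surrogate-based methodology may be of independent interest in algorithmic frameworks beyond FTPL and in learning rate design.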
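
For readers unfamiliar with FTPL, the following is a minimal sketch of the basic arm-selection step with Pareto perturbations, under one common formulation in which the learner picks the arm minimizing the cumulative loss estimate minus a perturbation scaled by the inverse learning rate. It is not the method proposed in the paper: the learning rate here is a fixed constant, whereas the paper's adaptive learning rates and surrogate probability functions are not reproduced. The function name `ftpl_select_arm` and the example loss estimates are hypothetical.

```python
import random


def ftpl_select_arm(cum_loss_estimates, learning_rate, alpha, rng):
    """Return the index of the arm with the smallest perturbed loss estimate.

    Assumed formulation (illustrative only): arm = argmin_i (L_i - r_i / eta),
    with r_i drawn independently from a Pareto distribution with shape alpha.
    """
    perturbed = [
        # random.paretovariate(alpha) samples from a Pareto law on [1, inf).
        loss - rng.paretovariate(alpha) / learning_rate
        for loss in cum_loss_estimates
    ]
    return min(range(len(perturbed)), key=perturbed.__getitem__)


# Hypothetical cumulative loss estimates for three arms.
estimates = [5.0, 1.0, 3.0]
rng = random.Random(0)

# Count how often each arm is selected with a fixed (non-adaptive) learning rate.
counts = [0, 0, 0]
for _ in range(2000):
    arm = ftpl_select_arm(estimates, learning_rate=1.0, alpha=2.0, rng=rng)
    counts[arm] += 1
print(counts)
```

Because selection only requires sampling and an argmin, no optimization problem is solved per round. The arm-selection probabilities are therefore not available in closed form, which is the difficulty the abstract describes for probability-dependent learning rates.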

Related papers