Fugu-MT 論文翻訳(概要): Asymptotically Optimal Linear Best Feasible Arm Identification with Fixed Budget

論文の概要: Asymptotically Optimal Linear Best Feasible Arm Identification with Fixed Budget

arxiv url: http://arxiv.org/abs/2506.02386v1
Date: Tue, 03 Jun 2025 02:56:26 GMT
ステータス: 翻訳完了
システム内更新日: 2025-06-04 21:47:35.203937
Title: Asymptotically Optimal Linear Best Feasible Arm Identification with Fixed Budget
Title（参考訳）: 固定予算を用いた漸近的最適リニアベストフェーブルアーム同定
Authors: Jie Bian, Vincent Y. F. Tan,
Abstract要約: 本稿では,誤差確率の指数的減衰を保証し,最適な腕識別のための新しいアルゴリズムを提案する。我々は,複雑性のレベルが異なる様々な問題インスタンスに対する包括的経験的評価を通じて,アルゴリズムの有効性を検証する。
参考スコア（独自算出の注目度）: 55.938644481736446
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The challenge of identifying the best feasible arm within a fixed budget has attracted considerable interest in recent years. However, a notable gap remains in the literature: the exact exponential rate at which the error probability approaches zero has yet to be established, even in the relatively simple setting of $K$-armed bandits with Gaussian noise. In this paper, we address this gap by examining the problem within the context of linear bandits. We introduce a novel algorithm for best feasible arm identification that guarantees an exponential decay in the error probability. Remarkably, the decay rate -- characterized by the exponent -- matches the theoretical lower bound derived using information-theoretic principles. Our approach leverages a posterior sampling framework embedded within a game-based sampling rule involving a min-learner and a max-learner. This strategy shares its foundations with Thompson sampling, but is specifically tailored to optimize the identification process under fixed-budget constraints. Furthermore, we validate the effectiveness of our algorithm through comprehensive empirical evaluations across various problem instances with different levels of complexity. The results corroborate our theoretical findings and demonstrate that our method outperforms several benchmark algorithms in terms of both accuracy and efficiency.
Abstract（参考訳）: 固定予算内で最高の実現可能な腕を特定するという課題は、近年、かなりの関心を集めている。誤差確率が 0 に近づく正確な指数速度はまだ確立されていないが、ガウス雑音を伴う$K$武装のバンディットは比較的単純な設定である。本稿では,線形バンディットの文脈における問題を調べることによって,このギャップに対処する。本稿では,誤差確率の指数的減衰を保証し,最適な腕識別のための新しいアルゴリズムを提案する。顕著なことに、指数によって特徴づけられる減衰速度は、情報理論の原理を用いて導かれる理論的な下界と一致する。本手法では,min-learnerとmax-learnerを含むゲームベースサンプリングルール内に埋め込まれた後続サンプリングフレームワークを活用する。この戦略は基礎をトンプソンサンプリングと共有しているが、固定予算制約の下での識別プロセスを最適化するために特別に調整されている。さらに,複雑性のレベルが異なる様々な問題インスタンスを対象とした包括的経験的評価により,アルゴリズムの有効性を検証した。その結果,提案手法は精度と効率の両面で,いくつかのベンチマークアルゴリズムより優れていることが示された。

論文の概要: Asymptotically Optimal Linear Best Feasible Arm Identification with Fixed Budget

関連論文リスト