Fugu-MT 論文翻訳(概要): Regret-Oracle Complexity Tradeoffs in Agnostic Online Learning

論文の概要: Regret-Oracle Complexity Tradeoffs in Agnostic Online Learning

arxiv url: http://arxiv.org/abs/2605.07155v1
Date: Fri, 08 May 2026 02:41:23 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-11 19:43:38.756475
Title: Regret-Oracle Complexity Tradeoffs in Agnostic Online Learning
Title（参考訳）: アグノスティックオンライン学習におけるレグレト・オラクルの複雑さのトレードオフ
Authors: Idan Attias, Steve Hanneke, Arvind Ramaswami,
Abstract要約: 従来のオンライン学習は、LittlestoneのStandard Optimal Algorithm(SOA)をベースラーナーとして利用して、実現可能な設定に還元することで、古典的に解決される。私たちはSOAを、オフラインの実証的なリスク最小化のオラクルを通じてのみ概念クラスにアクセスする、実現可能なベースラーナーに置き換えます。提案アルゴリズムは,クエリの総複雑性を$O(Td_mathrmVC+1)$に減らし,ほぼ最適の後悔を完全保存することを示した。
参考スコア（独自算出の注目度）: 37.15283418677639
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Agnostic online learning is classically solved via a reduction to the realizable setting, utilizing Littlestone's Standard Optimal Algorithm (SOA) as a base learner. However, the SOA is computationally intractable to execute even for a single round. To overcome this barrier, recent work in oracle-efficient online learning replaces the SOA with a realizable base learner that accesses the concept class exclusively through an offline empirical risk minimization (ERM) oracle. While such agnostic learners achieve near-optimal expected regret, they suffer from a doubly-exponential oracle complexity of $O\big(T^{2^{O(d_\mathrm{LD})}}\big)$, where $d_\mathrm{LD}$ is the Littlestone dimension and $T$ is the number of rounds. In this work, we significantly improve this oracle complexity while relying on an even weaker primitive: a weak-consistency oracle, which merely decides whether a given labeled dataset is realizable. At the core of our approach is an adaptive and dynamic agnostic-to-realizable reduction that actively prunes non-realizable label sequences on the fly. By using the VC dimension ($d_\mathrm{VC}$) to bound the number of dynamically maintained active paths, our algorithm reduces the total query complexity down to $O(T^{d_\mathrm{VC}+1})$ while perfectly preserving near-optimal expected regret. Crucially, this dynamic pruning also yields a memory reduction over the standard reduction. Furthermore, we formally quantify the regret--oracle complexity tradeoff, providing upper bounds that smoothly interpolate between restricted query budgets and attainable expected regret. We complement these with lower bounds proving that any learner restricted to $Q = o(\sqrt{T})$ queries must suffer an expected regret of $Ω(T/Q)$.
Abstract（参考訳）: 従来のオンライン学習は、LittlestoneのStandard Optimal Algorithm(SOA)をベースラーナーとして利用して、実現可能な設定に還元することで、古典的に解決される。しかし、SOAは1ラウンドでも実行でき、計算的に難解です。この障壁を克服するために、近年のオラクル効率のよいオンライン学習における作業は、オフラインの経験的リスク最小化(ERM)オラクルを通じてのみコンセプトクラスにアクセスする、実現可能なベースラーナーによってSOAを置き換える。そのような非依存的な学習者は、ほぼ最適に期待された後悔を達成できるが、それらは$O\big(T^{2^{O(d_\mathrm{LD})}}\big)$の二重排他的オラクル複雑性に悩まされる。この研究では、より弱いプリミティブ、すなわち、ラベル付きデータセットが実現可能であるかどうかを単に決定する弱い一貫性のオラクルに依存しながら、このオラクルの複雑さを著しく改善する。このアプローチのコアとなるのは、適応的で動的に不可知-実現可能な還元であり、これは、実現不可能なラベルシーケンスをオンザフライで積極的に引き起こす。 VC次元(d_\mathrm{VC}$)を動的に維持されるアクティブパスの数に限定することにより、我々のアルゴリズムはクエリの総複雑性を$O(T^{d_\mathrm{VC}+1})$に減らし、ほぼ最適に予測される後悔を完全に保存する。重要なことに、このダイナミックプルーニングは、標準のリダクションよりもメモリの削減をもたらす。さらに、我々は、制限されたクエリ予算と予測可能な後悔を円滑に補間する上限を提供するため、後悔とおかしな複雑さのトレードオフを正式に定量化します。我々はこれらを下限で補完し、学習者が$Q = o(\sqrt{T})$クエリに制限された場合、$Ω(T/Q)$を期待して後悔しなければならないことを示す。

論文の概要: Regret-Oracle Complexity Tradeoffs in Agnostic Online Learning

関連論文リスト