Fugu-MT 論文翻訳(概要): Cover meets Robbins while Betting on Bounded Data: $\ln n$ Regret and Almost Sure $\ln\ln n$ Regret

論文の概要: Cover meets Robbins while Betting on Bounded Data: $\ln n$ Regret and Almost Sure $\ln\ln n$ Regret

arxiv url: http://arxiv.org/abs/2604.20172v1
Date: Wed, 22 Apr 2026 04:27:54 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-23 15:36:10.96418
Title: Cover meets Robbins while Betting on Bounded Data: $\ln n$ Regret and Almost Sure $\ln\ln n$ Regret
Title（参考訳）: Coverがバウンドデータに賭けながらRobinsと出会う:$\lnn$ Regretとほぼ確実に$\ln\lnn$ Regret
Authors: Shubhada Agrawal, Aaditya Ramdas,
Abstract要約: 本稿では,RobinsとCoverの知見を組み合わせた新たな混合ベッティング戦略を提案する。われわれの論文は、2つの非常に異なる戦略でデータへの最高の適応性を実現することの価値を最初に指摘したものと思われる。
参考スコア（独自算出の注目度）: 39.04174642330437
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Consider betting against a sequence of data in $[0,1]$, where one is allowed to make any bet that is fair if the data have a conditional mean $m_0 \in (0,1)$. Cover's universal portfolio algorithm delivers a worst-case regret of $O(\ln n)$ compared to the best constant bet in hindsight, and this bound is unimprovable against adversarially generated data. In this work, we present a novel mixture betting strategy that combines insights from Robbins and Cover, and exhibits a different behavior: it eventually produces a regret of $O(\ln \ln n)$ on \emph{almost} all paths (a measure-one set of paths if each conditional mean equals $m_0$ and intrinsic variance increases to $\infty$), but has an $O(\log n)$ regret on the complement (a measure zero set of paths). Our paper appears to be the first to point out the value in hedging two very different strategies to achieve a best-of-both-worlds adaptivity to stochastic data and protection against adversarial data. We contrast our results to those in~\cite{agrawal2025regret} for a sub-Gaussian mixture on unbounded data: their worst-case regret has to be unbounded, but a similar hedging delivers both an optimal betting growth-rate and an almost sure $\ln\ln n$ regret on stochastic data. Finally, our strategy witnesses a sharp game-theoretic upper law of the iterated logarithm, analogous to~\cite{shafer2005probability}.
Abstract（参考訳）: もしデータが条件平均$m_0 \in (0,1)$を持つなら、公正な賭けをすることができる。 Coverのユニバーサルポートフォリオアルゴリズムは、後見の最良の定数ベットと比較すると、$O(\ln n)$の最悪の後悔をもたらす。最終的に、$O(\ln \ln n)$ on \emph{almost} all paths(各条件平均が$m_0$と内在的分散が$\infty$に増加する場合の経路の測度対1の集合)の後悔を生じさせるが、補集合(経路の測度ゼロの集合)に対して$O(\log n)$ regretを持つ。我々の論文は、確率的データに対するベスト・オブ・ワールド・アダプティビティを実現し、敵対的データに対する保護を実現するために、2つの非常に異なる戦略をヘッジする価値を最初に指摘したものと思われる。この結果と, 非有界データ上のガウス系混合物の〜\cite{agrawal2025regret}では, 最悪ケースの後悔は非有界データでなければならないが, 同様のヘッジは, 最適なベッティング成長速度とほぼ確実な$\ln\lnn$後悔の両方をもたらす。最終的に、我々の戦略は、反復対数の鋭いゲーム理論上の法則を、~\cite{shafer 2005probability}に類似している。

論文の概要: Cover meets Robbins while Betting on Bounded Data: $\ln n$ Regret and Almost Sure $\ln\ln n$ Regret

関連論文リスト