Fugu-MT 論文翻訳(概要): Deterministic Apple Tasting

論文の概要: Deterministic Apple Tasting

arxiv url: http://arxiv.org/abs/2410.10404v1
Date: Mon, 14 Oct 2024 11:54:46 GMT
ステータス: 翻訳完了
システム内更新日: 2024-10-29 21:44:49.452509
Title: Deterministic Apple Tasting
Title（参考訳）: 決定論的Appleのテイスティング
Authors: Zachary Chase, Idan Mehalel,
Abstract要約: 我々は、初めて広く適用可能な決定論的リンゴテイスティング学習者を提供する。すべてのクラス $mathcalH$ は簡単、困難、あるいは学習不能でなければならない、という三分法を証明します。我々の上限は、リンゴの味付けフィードバックに関する専門家のアドバイスから学ぶための決定論的アルゴリズムに基づいている。
参考スコア（独自算出の注目度）: 2.4554686192257424
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: In binary ($0/1$) online classification with apple tasting feedback, the learner receives feedback only when predicting $1$. Besides some degenerate learning tasks, all previously known learning algorithms for this model are randomized. Consequently, prior to this work it was unknown whether deterministic apple tasting is generally feasible. In this work, we provide the first widely-applicable deterministic apple tasting learner, and show that in the realizable case, a hypothesis class is learnable if and only if it is deterministically learnable, confirming a conjecture of [Raman, Subedi, Raman, Tewari-24]. Quantitatively, we show that every class $\mathcal{H}$ is learnable with mistake bound $O \left(\sqrt{\mathtt{L}(\mathcal{H}) T \log T} \right)$ (where $\mathtt{L}(\mathcal{H})$ is the Littlestone dimension of $\mathcal{H}$), and that this is tight for some classes. We further study the agnostic case, in which the best hypothesis makes at most $k$ many mistakes, and prove a trichotomy stating that every class $\mathcal{H}$ must be either easy, hard, or unlearnable. Easy classes have (both randomized and deterministic) mistake bound $\Theta_{\mathcal{H}}(k)$. Hard classes have randomized mistake bound $\tilde{\Theta}_{\mathcal{H}} \left(k + \sqrt{T} \right)$, and deterministic mistake bound $\tilde{\Theta}_{\mathcal{H}} \left(\sqrt{k \cdot T} \right)$, where $T$ is the time horizon. Unlearnable classes have (both randomized and deterministic) mistake bound $\Theta(T)$. Our upper bound is based on a deterministic algorithm for learning from expert advice with apple tasting feedback, a problem interesting in its own right. For this problem, we show that the optimal deterministic mistake bound is $\Theta \left(\sqrt{T (k + \log n)} \right)$ for all $k$ and $T \leq n \leq 2^T$, where $n$ is the number of experts.
Abstract（参考訳）: リンゴの味付けフィードバックによるオンライン分類のバイナリ(0/1ドル)では、学習者は1ドルを予測したときにのみフィードバックを受け取る。いくつかの退化学習タスクに加えて、以前知られていたこのモデルの学習アルゴリズムは全てランダム化されている。そのため、この研究に先立ち、決定論的リンゴの味付けが一般的に可能であるかどうかは不明であった。本研究は,初めて広く適用可能な決定論的リンゴテイスティング学習者を提供し,実現可能な場合には,それが決定論的に学習可能である場合に限り,仮説クラスが学習可能であることを証明し,[Raman, Subedi, Raman, Tewari-24] の予想を確認する。定量的に言えば、すべてのクラス $\mathcal{H}$ は、間違った有界な$O \left(\sqrt{\matht{L}(\mathcal{H}) T \log T} \right)$ (ここで $\mathtt{L}(\mathcal{H})$ は $\mathcal{H}$ のリトルストーン次元であり、これはいくつかのクラスにとって厳密であることを示す。さらに、最良の仮説が少なくとも$k$の誤りを犯し、すべてのクラス$\mathcal{H}$は簡単、困難、あるいは学習不能である、という三分法を証明する。簡単なクラスは、(ランダム化と決定論的の両方の)誤りが$\Theta_{\mathcal{H}}(k)$に結びついている。ハードクラスはランダムな誤りを$\tilde{\Theta}_{\mathcal{H}} \left(k + \sqrt{T} \right)$, and deterministic mis bound $\tilde{\Theta}_{\mathcal{H}} \left(\sqrt{k \cdot T} \right)$, ここで$T$は時間地平線である。未学習クラスは(ランダム化および決定論的の両方の)誤りを$\Theta(T)$で有界に持つ。私たちの上限は、リンゴの味付けフィードバックに関する専門家のアドバイスから学ぶための決定論的アルゴリズムに基づいています。この問題に対して、最適決定論的誤り境界は、すべての$k$と$T \leq n \leq 2^T$に対して$\Theta \left(\sqrt{T (k + \log n)} \right)$である。

論文の概要: Deterministic Apple Tasting

関連論文リスト