Fugu-MT Paper Translation (Abstract): Online Set Learning from Precision and Recall Feedback

Paper abstract: Online Set Learning from Precision and Recall Feedback

arxiv url: http://arxiv.org/abs/2605.09565v1
Date: Sun, 10 May 2026 14:28:05 GMT
Status: Translation complete
System update date: 2026-05-12 23:28:50.312694
Title: Online Set Learning from Precision and Recall Feedback
Title (reference translation): Online Set Learning from Precision and Recall Feedback
Authors: Lee Cohen, Yishay Mansour, Shay Moran, Han Shao
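Abstract summary: We consider the problem of learning an unknown subset $N_\text{target}$ of a domain in an online setting. This simple online set learning problem abstracts a variety of learning scenarios with precision- and recall-type feedback. We show that a hypothesis class is learnable in this setting if and only if it has finite Vapnik-Chervonenkis dimension.
Reference score (attention score computed independently by this site): 60.00180898830079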
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We consider the problem of learning an unknown subset $N_\text{target}$ of a domain in an online setting. In each round $t$, the learner predicts a set of items ${N}_t$ and receives one of two types of feedback, each with equal probability: precision feedback, in which a randomly chosen item from the predicted set $N_t$ is revealed and the learner is told whether it belongs to $N_\text{target}$ (incurring a reward if it does), or recall feedback, in which a randomly chosen item from the target set $N_\text{target}$ is revealed and the learner is told whether it belongs to $N_t$ (incurring a reward if it does). The goal is to maximize the cumulative reward over time. This simple online set learning problem abstracts a variety of learning scenarios with precision- and recall-type feedback. We show that a hypothesis class (a family of subsets of the domain) is learnable in this setting if and only if it has finite Vapnik-Chervonenkis (VC) dimension, mirroring the classical PAC characterization. However, the resulting algorithmic structure is markedly more intricate: in contrast to standard Probably Approximately Correct (PAC) learning -- where the algorithmic landscape is governed by the simple principle of Empirical Risk Minimization (ERM) -- our partial feedback model can invalidate ERM and even all proper learning rules. We develop algorithms to address the dependencies induced by the feedback, obtaining regret guarantees in both the realizable and agnostic settings. Our results provide a qualitative characterization of learnability in this model, addressing its most basic question, while pointing to a range of natural and intriguing open questions, including the determination of optimal regret rates.
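Abstract (reference translation): We consider the problem of learning an unknown subset $N_\text{target}$ of a domain in an online setting. In each round $t$, the learner predicts a set of items $N_t$ and receives one of two types of feedback, each with equal probability: precision feedback, in which an item chosen at random from the predicted set $N_t$ is revealed and the learner is told whether it belongs to $N_\text{target}$ (earning a reward if it does), or recall feedback, in which an item chosen at random from the target set $N_\text{target}$ is revealed and the learner is told whether it belongs to $N_t$ (earning a reward if it does). The goal is to maximize the cumulative reward over time. This simple online set learning problem abstracts a variety of learning scenarios with precision- and recall-type feedback. We show that a hypothesis class (a family of subsets of the domain) is learnable in this setting if and only if it has finite Vapnik-Chervonenkis (VC) dimension, mirroring the classical PAC characterization. However, the resulting algorithmic structure is markedly more intricate: in contrast to standard Probably Approximately Correct (PAC) learning, where the algorithmic landscape is governed by the simple principle of Empirical Risk Minimization (ERM), the partial feedback model can invalidate ERM and even all proper learning rules. We develop algorithms that address the dependencies induced by the feedback and obtain regret guarantees in both the realizable and agnostic settings. Our results provide a qualitative characterization of learnability in this model, addressing its most basic question, while pointing to a range of natural and intriguing open questions, including the determination of optimal regret rates.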
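The feedback protocol described in the abstract can be illustrated with a small simulation. This is only a sketch of the interaction model, not the authors' learning algorithm. It assumes that "randomly chosen" means uniformly at random and that both sets are finite and non-empty, neither of which the abstract states. The names `feedback_round` and `expected_reward` are hypothetical and chosen for illustration. Under these assumptions the expected per-round reward is the average of precision and recall, $\frac{1}{2}\frac{|N_t \cap N_\text{target}|}{|N_t|} + \frac{1}{2}\frac{|N_t \cap N_\text{target}|}{|N_\text{target}|}$.

```python
import random


def feedback_round(predicted, target, rng):
    """Simulate one round of feedback; returns (kind, item, reward).

    Assumes both sets are non-empty and that items are drawn uniformly
    (an assumption; the abstract only says "randomly chosen").
    """
    if rng.random() < 0.5:
        # Precision feedback: reveal an item of the predicted set and
        # report whether it belongs to the target set.
        item = rng.choice(sorted(predicted))
        return "precision", item, int(item in target)
    # Recall feedback: reveal an item of the target set and
    # report whether it belongs to the predicted set.
    item = rng.choice(sorted(target))
    return "recall", item, int(item in predicted)


def expected_reward(predicted, target):
    """Expected per-round reward: the average of precision and recall."""
    overlap = len(predicted & target)
    return 0.5 * overlap / len(predicted) + 0.5 * overlap / len(target)


rng = random.Random(0)      # fixed seed so the run is reproducible
target = {1, 2, 3, 4}       # hypothetical target set
predicted = {3, 4, 5}       # hypothetical prediction, held fixed here
rounds = 20000

total = sum(feedback_round(predicted, target, rng)[2] for _ in range(rounds))
empirical = total / rounds

print("expected reward per round:", expected_reward(predicted, target))
print("empirical average reward: ", empirical)
```

For this example the precision is 2/3 and the recall is 1/2, so `expected_reward` evaluates to 7/12, and the empirical average should land close to that value. Note that the learner only ever observes a single item per round, which is the partial-feedback structure the abstract identifies as the source of difficulty.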
