Fugu-MT 論文翻訳(概要): PromptNCE: Pointwise Mutual Information Predictions Using Only LLMs and Contrastive Estimation Prompts

論文の概要: PromptNCE: Pointwise Mutual Information Predictions Using Only LLMs and Contrastive Estimation Prompts

arxiv url: http://arxiv.org/abs/2605.21776v1
Date: Wed, 20 May 2026 22:10:54 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-22 20:14:18.496983
Title: PromptNCE: Pointwise Mutual Information Predictions Using Only LLMs and Contrastive Estimation Prompts
Title（参考訳）: PromptNCE: LLMとContrastive Estimation Promptのみを用いたポイントワイズ相互情報予測
Authors: Juliette Woodrow, Chris Piech,
Abstract要約: テキストから相互情報を推定するには、通常、タスク固有の批評家を訓練する必要がある。大規模言語モデルでは、プロンプトと帰納確率のみを用いて、ポイントワイドな相互情報ゼロショットを推定できるかどうかを問う。我々は,3つの公開データセットに対して,人間由来の地中PMIを用いたベンチマークを導入し,情報理論に基づく5つのプロンプトベース推定手法を評価する。
参考スコア（独自算出の注目度）: 3.015315464409211
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Estimating mutual information from text usually requires training a task-specific critic, which limits its use in low-data settings. We ask whether large language models can instead estimate pointwise mutual information zero-shot, using only prompts and elicited probabilities. We introduce a benchmark with human-derived ground-truth PMI across three publicly available datasets, and evaluate five information-theoretic prompting-based estimators. Our main method, PromptNCE, frames conditional probability estimation as a contrastive task and augments the candidate set with an explicit OTHER category. We show theoretically that adding OTHER recovers the true conditional P(y | x) rather than just a ranking over listed candidates, turning a contrastive prompt into a general-purpose zero-shot probability estimator. PromptNCE is the best zero-shot method on all three datasets, reaching Spearman correlation up to 0.82 with human-derived PMI. We also present a case study in computer science education showing how these estimators can be used to score student knowledge summaries in a low-data setting.
Abstract（参考訳）: テキストから相互情報を推定するには、通常、タスク固有の批評家を訓練する必要がある。大規模言語モデルでは、プロンプトと帰納確率のみを用いて、ポイントワイドな相互情報ゼロショットを推定できるかどうかを問う。我々は,3つの公開データセットに対して,人間由来の地中PMIを用いたベンチマークを導入し,情報理論に基づく5つのプロンプトベース推定手法を評価する。提案手法であるPromptNCEは、条件付き確率推定をコントラスト的タスクとしてフレーム化し、明示的なOTHERカテゴリで候補集合を拡大する。理論的には、OTHER の追加は、リストされた候補に対するランキングではなく、真の条件 P(y | x) を復元し、対照的なプロンプトを汎用的なゼロショット確率推定器に変換する。 PromptNCEは3つのデータセットで最高のゼロショット法であり、人間由来のPMIとのSpearman相関は最大0.82に達する。また、コンピュータサイエンス教育におけるケーススタディとして、これらの推定器を用いて、低データ環境で学生の知識の要約を評価する方法を示す。

論文の概要: PromptNCE: Pointwise Mutual Information Predictions Using Only LLMs and Contrastive Estimation Prompts

関連論文リスト