Fugu-MT 論文翻訳(概要): PassiveQA: A Three-Action Framework for Epistemically Calibrated Question Answering via Supervised Finetuning

論文の概要: PassiveQA: A Three-Action Framework for Epistemically Calibrated Question Answering via Supervised Finetuning

arxiv url: http://arxiv.org/abs/2604.04565v1
Date: Mon, 06 Apr 2026 09:54:16 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-07 15:49:19.163774
Title: PassiveQA: A Three-Action Framework for Epistemically Calibrated Question Answering via Supervised Finetuning
Title（参考訳）: PassiveQA: 監督されたファインタニングによるエピステマティックな質問回答のための3要素フレームワーク
Authors: Madhav S Baidya,
Abstract要約: 不完全情報の下では、モデルが解答、解答、Ask、Abstainを判断しなければならない。本稿では,教師付き微調整によりモデル行動と情報満足度を整合させる3つのアクション・フレームワークであるPassiveQAを提案する。提案手法は,構造化情報状態表現,知識グラフ基底コンテキスト,および不足変数と決定推論を明示的にモデル化した微調整プランナーを統合する。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large Language Models (LLMs) have achieved strong performance in question answering and retrieval-augmented generation (RAG), yet they implicitly assume that user queries are fully specified and answerable. In real-world settings, queries are often incomplete, ambiguous, or missing critical variables, leading models to produce overconfident or hallucinated responses. In this work, we study decision-aware query resolution under incomplete information, where a model must determine whether to Answer, Ask for clarification, or Abstain. We show that standard and enhanced RAG systems do not reliably exhibit such epistemic awareness, defaulting to answer generation even when information is insufficient. To address this, we propose PassiveQA, a three-action framework that aligns model behaviour with information sufficiency through supervised finetuning. Our approach integrates structured information-state representations, knowledge graph-grounded context, and a finetuned planner that explicitly models missing variables and decision reasoning. Experiments across multiple QA datasets show that the finetuned planner achieves significant improvements in macro F1 and abstention recall while reducing hallucination rates, under a compute-constrained training regime. These results provide strong empirical evidence that epistemic decision-making must be learned during training rather than imposed at inference time.
Abstract（参考訳）: 大規模言語モデル (LLM) は質問応答および検索拡張生成 (RAG) において高い性能を達成しているが、ユーザクエリが完全に指定され、応答可能であることを暗黙的に仮定している。現実の環境では、クエリはしばしば不完全、曖昧、あるいは重要な変数が欠落し、モデルが過度に自信または幻覚的な応答を生成する。本研究では,不完全情報に基づく問合せの解法について検討し,解答・解答・解答・解答・解答について検討する。標準的なRAGシステムは,情報が不十分な場合でも,その認識を確実に示さないことを示し,応答生成をデフォルトとする。そこで本研究では,モデル動作を教師付き微調整による情報充実度に整合させる3つのアクション・フレームワークであるPassiveQAを提案する。提案手法は,構造化情報状態表現,知識グラフ基底コンテキスト,および不足変数と決定推論を明示的にモデル化した微調整プランナーを統合する。複数のQAデータセットにまたがる実験により、微調整プランナーは、計算制約付きトレーニング体制の下で、幻覚率を低減しつつ、マクロF1と無視リコールの大幅な改善を達成している。これらの結果は、推論時に課せられるのではなく、トレーニング中に疫学的な意思決定を学ばなければならないという強い実証的証拠を提供する。

論文の概要: PassiveQA: A Three-Action Framework for Epistemically Calibrated Question Answering via Supervised Finetuning

関連論文リスト