Fugu-MT 論文翻訳(概要): INTRYGUE: Induction-Aware Entropy Gating for Reliable RAG Uncertainty Estimation

論文の概要: INTRYGUE: Induction-Aware Entropy Gating for Reliable RAG Uncertainty Estimation

arxiv url: http://arxiv.org/abs/2603.21607v1
Date: Mon, 23 Mar 2026 06:02:16 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-24 19:11:39.513149
Title: INTRYGUE: Induction-Aware Entropy Gating for Reliable RAG Uncertainty Estimation
Title（参考訳）: INRYGUE:信頼性RAG不確実性推定のための誘導型エントロピーゲーティング
Authors: Alexandra Bazarova, Andrei Volodichev, Daria Kotova, Alexey Zaytsev,
Abstract要約: 本稿では,標準的なエントロピーに基づく不確実性定量法が,機械的パラドックスによりRAG設定でしばしば失敗することを示す。文脈利用に固有の内的「綱引き」が出現し、誘導頭部は正しい答えをコピーすることで接地応答を促進するが、それらは以前に確立された「エントロピーニューロン」を左右的に引き起こす本稿では,誘導ヘッドの活性化パターンに基づいて,予測エントロピーをゲートする機械的基盤手法であるINRYGUEを提案する。
参考スコア（独自算出の注目度）: 40.276670786703896
License: http://creativecommons.org/licenses/by/4.0/
Abstract: While retrieval-augmented generation (RAG) significantly improves the factual reliability of LLMs, it does not eliminate hallucinations, so robust uncertainty quantification (UQ) remains essential. In this paper, we reveal that standard entropy-based UQ methods often fail in RAG settings due to a mechanistic paradox. An internal "tug-of-war" inherent to context utilization appears: while induction heads promote grounded responses by copying the correct answer, they collaterally trigger the previously established "entropy neurons". This interaction inflates predictive entropy, causing the model to signal false uncertainty on accurate outputs. To address this, we propose INTRYGUE (Induction-Aware Entropy Gating for Uncertainty Estimation), a mechanistically grounded method that gates predictive entropy based on the activation patterns of induction heads. Evaluated across four RAG benchmarks and six open-source LLMs (4B to 13B parameters), INTRYGUE consistently matches or outperforms a wide range of UQ baselines. Our findings demonstrate that hallucination detection in RAG benefits from combining predictive uncertainty with interpretable, internal signals of context utilization.
Abstract（参考訳）: 検索増強生成(RAG)はLLMの信頼性を著しく向上させるが、幻覚を排除しないため、堅牢な不確実性定量化(UQ)が不可欠である。本稿では,標準的なエントロピーに基づくUQ手法が,機械的パラドックスによりRAG設定でしばしば失敗することを明らかにする。内部の「綱引き」は文脈利用に固有のもので、誘導ヘッドは正しい答えをコピーすることで接地された応答を促進するが、それらは以前に確立された「エントロピーニューロン」を左右に引き起こす。この相互作用は予測エントロピーを膨らませ、モデルが正確な出力に対して誤った不確実性を示す。そこで本研究では,誘導ヘッドのアクティベーションパターンに基づいて予測エントロピーをゲートする手法であるINTRYGUE(Induction-Aware Entropy Gating for Uncertainty Estimation)を提案する。 4つのRAGベンチマークと6つのオープンソースLCM(4Bから13Bパラメータ)で評価され、INRYGUEは、広範囲のUQベースラインと一貫して一致または性能を向上する。以上の結果から,RAGにおける幻覚検出は,予測の不確かさと解釈可能な内部信号との併用による効果が示唆された。

論文の概要: INTRYGUE: Induction-Aware Entropy Gating for Reliable RAG Uncertainty Estimation

関連論文リスト