Fugu-MT 論文翻訳(概要): From Confidence to Collapse in LLM Factual Robustness

論文の概要: From Confidence to Collapse in LLM Factual Robustness

arxiv url: http://arxiv.org/abs/2508.16267v2
Date: Tue, 26 Aug 2025 11:54:16 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-27 13:17:04.042642
Title: From Confidence to Collapse in LLM Factual Robustness
Title（参考訳）: LLMファクチュアルロバストネスの信頼性から崩壊へ
Authors: Alina Fastowski, Bardh Prenkaj, Gjergji Kasneci,
Abstract要約: 生成プロセスの観点から,現実的ロバスト性を測定するための原則的アプローチを導入する。 FRS(Factual Robustness Score)は、デコード条件における摂動に対する事実の安定性を定量化する新しい計量である。
参考スコア（独自算出の注目度）: 21.27503954808115
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Ensuring the robustness of factual knowledge in LLMs is critical for reliable applications in tasks such as question answering and reasoning. However, existing evaluation methods predominantly focus on performance-based metrics, often investigating from the perspective of prompt perturbations, which captures only the externally triggered side of knowledge robustness. To bridge this gap, we introduce a principled approach to measure factual robustness from the perspective of the generation process by analyzing token distribution entropy in combination with temperature scaling sensitivity. These two factors build the Factual Robustness Score (FRS), a novel metric which quantifies the stability of a fact against perturbations in decoding conditions, given its initial uncertainty. To validate our approach, we conduct extensive experiments on 5 LLMs across 3 closed-book QA datasets (SQuAD, TriviaQA, and HotpotQA). We show that factual robustness varies significantly -- smaller models report an FRS of $0.76$, larger ones $0.93$ -- with accuracy degrading by ~$60\%$ under increased uncertainty. These insights demonstrate how entropy and temperature scaling impact factual accuracy, and lay a foundation for developing more robust knowledge retention and retrieval in future models.
Abstract（参考訳）: LLMにおける事実知識の堅牢性を保証することは、質問応答や推論といったタスクにおける信頼性の高い応用にとって重要である。しかし、既存の評価手法は主にパフォーマンスベースのメトリクスに焦点をあて、しばしば急激な摂動の観点から調査し、外部から引き起こされる知識の堅牢性のみを捉えている。このギャップを埋めるために,トークン分布エントロピーを温度スケーリング感度と組み合わせて解析することにより,生成過程の観点から実効性を測定するための原則的アプローチを導入する。これらの2つの要因がFRS(Factual Robustness Score)を構築する。これは、最初の不確実性を考えると、デコード条件における摂動に対する事実の安定性を定量化する新しい計量である。提案手法を検証するため,3つのクローズドブックQAデータセット(SQuAD,TriviaQA,HotpotQA)にまたがる5つのLLM実験を行った。より小さなモデルでは FRS が 0.76$ で、より大きいモデルは0.93$ で、精度は ~60\% に低下する。これらの知見はエントロピーと温度のスケーリングが事実の精度にどのように影響するかを示し、将来のモデルにおけるより堅牢な知識保持と検索の基盤を築き上げている。

論文の概要: From Confidence to Collapse in LLM Factual Robustness

関連論文リスト