Fugu-MT 論文翻訳(概要): Semantic Energy: Detecting LLM Hallucination Beyond Entropy

論文の概要: Semantic Energy: Detecting LLM Hallucination Beyond Entropy

arxiv url: http://arxiv.org/abs/2508.14496v1
Date: Wed, 20 Aug 2025 07:33:50 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-21 16:52:41.378036
Title: Semantic Energy: Detecting LLM Hallucination Beyond Entropy
Title（参考訳）: セマンティックエネルギー:エントロピーを超えたLLM幻覚を検出する
Authors: Huan Ma, Jiadong Pan, Jing Liu, Yan Chen, Joey Tianyi Zhou, Guangyu Wang, Qinghua Hu, Hua Wu, Changqing Zhang, Haifeng Wang,
Abstract要約: 大規模言語モデル(LLM)は、現実のアプリケーションにますますデプロイされているが、幻覚の影響を受けやすいままである。不確実性推定は、そのような幻覚を検出するための実現可能なアプローチである。本稿では,新しい不確実性推定フレームワークであるセマンティック・エナジーを紹介する。
参考スコア（独自算出の注目度）: 87.19110227560867
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large Language Models (LLMs) are being increasingly deployed in real-world applications, but they remain susceptible to hallucinations, which produce fluent yet incorrect responses and lead to erroneous decision-making. Uncertainty estimation is a feasible approach to detect such hallucinations. For example, semantic entropy estimates uncertainty by considering the semantic diversity across multiple sampled responses, thus identifying hallucinations. However, semantic entropy relies on post-softmax probabilities and fails to capture the model's inherent uncertainty, causing it to be ineffective in certain scenarios. To address this issue, we introduce Semantic Energy, a novel uncertainty estimation framework that leverages the inherent confidence of LLMs by operating directly on logits of penultimate layer. By combining semantic clustering with a Boltzmann-inspired energy distribution, our method better captures uncertainty in cases where semantic entropy fails. Experiments across multiple benchmarks show that Semantic Energy significantly improves hallucination detection and uncertainty estimation, offering more reliable signals for downstream applications such as hallucination detection.
Abstract（参考訳）: 大規模言語モデル(LLM)は、現実のアプリケーションにますます多くデプロイされているが、幻覚の影響を受けやすいままであり、流動的で誤った反応をもたらし、誤った意思決定につながる。不確実性推定は、そのような幻覚を検出するための実現可能なアプローチである。例えば、セマンティックエントロピーは複数のサンプル応答のセマンティック多様性を考慮して不確実性を推定し、幻覚を識別する。しかし、セマンティックエントロピーは、ソフトマックス後の確率に依存しており、モデルの固有の不確実性を捉えることができず、特定のシナリオでは効果がない。この問題に対処するため,本研究では,LLMの固有信頼度を利用する新たな不確実性評価フレームワークであるセマンティック・エナジーを紹介する。セマンティッククラスタリングとボルツマンにインスパイアされたエネルギー分布を組み合わせることで、セマンティックエントロピーが失敗する場合の不確実性をよりよく捉えることができる。複数のベンチマーク実験により、セマンティック・エナジーは幻覚検出や不確実性推定を大幅に改善し、幻覚検出などの下流アプリケーションに信頼性の高い信号を提供することが示された。

論文の概要: Semantic Energy: Detecting LLM Hallucination Beyond Entropy

関連論文リスト