Fugu-MT 論文翻訳(概要): Detecting Hallucinations in Authentic LLM-Human Interactions

論文の概要: Detecting Hallucinations in Authentic LLM-Human Interactions

arxiv url: http://arxiv.org/abs/2510.10539v1
Date: Sun, 12 Oct 2025 10:46:24 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-14 18:06:29.998031
Title: Detecting Hallucinations in Authentic LLM-Human Interactions
Title（参考訳）: LLM-Human相互作用における幻覚の検出
Authors: Yujie Ren, Niklas Gruhlke, Anne Lauscher,
Abstract要約: AuthenHalluは、完全なLLM-ヒューマンインタラクションで構築された最初の幻覚検出ベンチマークである。統計的分析により,我々のベンチマークでは,クエリ応答対の31.4%に幻覚が生じることが示された。
参考スコア（独自算出の注目度）: 21.9643190742574
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: As large language models (LLMs) are increasingly applied in sensitive domains such as medicine and law, hallucination detection has become a critical task. Although numerous benchmarks have been proposed to advance research in this area, most of them are artificially constructed--either through deliberate hallucination induction or simulated interactions--rather than derived from genuine LLM-human dialogues. Consequently, these benchmarks fail to fully capture the characteristics of hallucinations that occur in real-world usage. To address this limitation, we introduce AuthenHallu, the first hallucination detection benchmark built entirely from authentic LLM-human interactions. For AuthenHallu, we select and annotate samples from genuine LLM-human dialogues, thereby providing a faithful reflection of how LLMs hallucinate in everyday user interactions. Statistical analysis shows that hallucinations occur in 31.4% of the query-response pairs in our benchmark, and this proportion increases dramatically to 60.0% in challenging domains such as Math & Number Problems. Furthermore, we explore the potential of using vanilla LLMs themselves as hallucination detectors and find that, despite some promise, their current performance remains insufficient in real-world scenarios.
Abstract（参考訳）: 大きな言語モデル (LLMs) が医学や法学などのセンシティブな分野にますます適用されているため、幻覚検出は重要な課題となっている。この分野の研究を進めるために多くのベンチマークが提案されているが、そのほとんどは、故意の幻覚誘導やシミュレートされた相互作用を通じて人工的に構築されている。その結果、これらのベンチマークは現実世界で発生する幻覚の特徴を完全に把握することができない。この制限に対処するために、認証されたLLM-ヒューマンインタラクションから構築された最初の幻覚検出ベンチマークであるAuthenHalluを紹介する。 AuthenHallu の場合、実際の LLM と人間との対話からサンプルを選択・注釈し、LLM が日常のユーザインタラクションにどう幻覚するかを忠実に反映する。統計的分析により,本ベンチマークのクエリ応答対の31.4%で幻覚が発生し,この割合はMath & Number Problemsなどの課題領域で60.0%へと劇的に上昇した。さらに,バニラLSM自体を幻覚検出器として用いる可能性について検討し,いくつかの約束があるものの,現実のシナリオでは現在の性能が不十分であることが確認された。

論文の概要: Detecting Hallucinations in Authentic LLM-Human Interactions

関連論文リスト