Fugu-MT 論文翻訳(概要): HAD: HAllucination Detection Language Models Based on a Comprehensive Hallucination Taxonomy

論文の概要: HAD: HAllucination Detection Language Models Based on a Comprehensive Hallucination Taxonomy

arxiv url: http://arxiv.org/abs/2510.19318v1
Date: Wed, 22 Oct 2025 07:28:37 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-25 03:08:15.244007
Title: HAD: HAllucination Detection Language Models Based on a Comprehensive Hallucination Taxonomy
Title（参考訳）: HAD:包括的幻覚分類に基づく幻覚検出言語モデル
Authors: Fan Xu, Xinyu Hu, Zhenghan Yu, Li Lin, Xu Zhang, Yang Zhang, Wei Zhou, Jinjie Gu, Xiaojun Wan,
Abstract要約: NLGタスクにまたがる11のカテゴリからなる総合的な幻覚分類を導入する。本稿では,幻覚検出,スパンレベル同定,修正を単一の推論プロセスに統合するHAllucination Detection (HAD)モデルを提案する。
参考スコア（独自算出の注目度）: 48.68088917291552
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The increasing reliance on natural language generation (NLG) models, particularly large language models, has raised concerns about the reliability and accuracy of their outputs. A key challenge is hallucination, where models produce plausible but incorrect information. As a result, hallucination detection has become a critical task. In this work, we introduce a comprehensive hallucination taxonomy with 11 categories across various NLG tasks and propose the HAllucination Detection (HAD) models https://github.com/pku0xff/HAD, which integrate hallucination detection, span-level identification, and correction into a single inference process. Trained on an elaborate synthetic dataset of about 90K samples, our HAD models are versatile and can be applied to various NLG tasks. We also carefully annotate a test set for hallucination detection, called HADTest, which contains 2,248 samples. Evaluations on in-domain and out-of-domain test sets show that our HAD models generally outperform the existing baselines, achieving state-of-the-art results on HaluEval, FactCHD, and FaithBench, confirming their robustness and versatility.
Abstract（参考訳）: 自然言語生成(NLG)モデル、特に大規模言語モデルへの依存が高まり、出力の信頼性と正確性に対する懸念が高まっている。重要な課題は幻覚であり、モデルが可視だが誤った情報を生成する。その結果,幻覚検出が重要な課題となった。本研究では,11のカテゴリーにまたがる総合的な幻覚分類を導入し,Hallucination Detection (HAD) モデル https://github.com/pku0xff/HAD を提案する。約90Kのサンプルからなる精巧な合成データセットに基づいて、我々のHADモデルは汎用的で、様々なNLGタスクに適用できる。また、2,248個のサンプルを含むHADTestと呼ばれる幻覚検出用テストセットを慎重に注釈付けする。ドメイン内およびドメイン外テストセットの評価は、私たちのHADモデルは、一般的に既存のベースラインを上回り、HaluEval、FactCHD、FaithBenchの最先端の結果を達成し、その堅牢性と汎用性を確認していることを示している。

論文の概要: HAD: HAllucination Detection Language Models Based on a Comprehensive Hallucination Taxonomy

関連論文リスト