Fugu-MT 論文翻訳(概要): Building Reliable Long-Form Generation via Hallucination Rejection Sampling

論文の概要: Building Reliable Long-Form Generation via Hallucination Rejection Sampling

arxiv url: http://arxiv.org/abs/2606.03628v1
Date: Tue, 02 Jun 2026 13:26:17 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-03 22:00:05.025045
Title: Building Reliable Long-Form Generation via Hallucination Rejection Sampling
Title（参考訳）: 幻覚除去サンプリングによる信頼性ロングフォームジェネレーションの構築
Authors: Lin Li, Georgia Channing, Suhaas M Bhat, Gabriel Davis Jones, Yarin Gal,
Abstract要約: 大規模言語モデル(LLM)は、オープンエンドテキスト生成において顕著な進歩を遂げてきたが、誤ったあるいはサポートされていないコンテンツを幻覚させる傾向にある。我々は、Segment-wise HAllucination Rejection Smpling (SHARS) という、新しい推論時幻覚緩和フレームワークを提案する。 SHARSは任意の幻覚検出器を使用して、生成中の幻覚セグメントを識別および拒絶し、忠実な内容が生成されるまで再サンプリングする。
参考スコア（独自算出の注目度）: 26.68294442366694
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Large language models (LLMs) have achieved remarkable progress in open-ended text generation, yet they remain prone to hallucinating incorrect or unsupported content, which undermines their reliability. This issue is exacerbated in long-form generation due to hallucination snowballing, a phenomenon where early errors propagate and compound into subsequent outputs. To address this challenge, we propose a novel inference-time hallucination mitigation framework, named Segment-wise HAllucination Rejection Sampling (SHARS), which uses an arbitrary hallucination detector to identify and reject hallucinated segments during generation and resample until faithful content is produced. By retaining only confident information and building subsequent generations upon it, the framework mitigates hallucination accumulation and enhances factual consistency. To instantiate this framework, we adopt semantic uncertainty as the detector and introduce several vital modifications to address its limitations and better adapt it to long-form text. Our method enables models to self-correct hallucinations without requiring external resources such as web search or knowledge bases, while remaining compatible with them for future extensions. Empirical evaluations on standardized hallucination benchmarks demonstrate that our method substantially reduces hallucinations in long-form generation while preserving or even improving the informativeness of generation. Code is available at: https://github.com/TreeLLi/hallucination-rejection-sampling.
Abstract（参考訳）: 大規模言語モデル(LLM)は、オープンエンドテキスト生成において顕著な進歩を遂げているが、その信頼性を損なう不正またはサポートされたコンテンツを幻覚させる傾向にある。この問題は、初期のエラーが伝播し、その後の出力に複合する現象である幻覚スノーボールにより、長期的な世代で悪化する。この課題に対処するために、任意の幻覚検出器を用いて、生成中の幻覚を識別・除去し、忠実なコンテンツが生成されるまで再サンプリングする、Segment-wise HAllucination Rejection Sampling (SHARS) という新しい推論時幻覚緩和フレームワークを提案する。自信のある情報のみを保持し、その後の世代を構築することにより、幻覚の蓄積を緩和し、事実整合性を高める。このフレームワークをインスタンス化するために、意味的不確実性を検出器として採用し、その制限に対処し、長文に適応するためにいくつかの重要な修正を導入する。提案手法は,Web検索や知識ベースなどの外部リソースを必要とせずに,将来的な拡張のためにモデルと互換性を保ちながら,幻覚を自己修正することを可能にする。規格化幻覚ベンチマークの実証評価により,提案手法は長文生成における幻覚を著しく低減し,生成情報の保存や改善を図っている。コードは、https://github.com/TreeLLi/hallucination-rejection-sampling.comで入手できる。

論文の概要: Building Reliable Long-Form Generation via Hallucination Rejection Sampling

関連論文リスト