Fugu-MT 論文翻訳(概要): IUQ: Interrogative Uncertainty Quantification for Long-Form Large Language Model Generation

論文の概要: IUQ: Interrogative Uncertainty Quantification for Long-Form Large Language Model Generation

arxiv url: http://arxiv.org/abs/2604.15109v2
Date: Sun, 19 Apr 2026 16:30:03 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-21 13:51:31.190851
Title: IUQ: Interrogative Uncertainty Quantification for Long-Form Large Language Model Generation
Title（参考訳）: IUQ: 長期大規模言語モデル生成のための相互不確実性定量化
Authors: Haozhi Fan, Jinhao Duan, Kaidi Xu,
Abstract要約: 本稿では,不確実性を定量化するために,サンプル間の一貫性とサンプル内忠実性を活用する新しいフレームワークであるInterrogative Uncertainity Quantification(IUQ)を紹介する。モデルファミリとモデルサイズにまたがる実験結果は、広く使用されている2つの長文生成データセットよりも、IUQの優れた性能を示す。
参考スコア（独自算出の注目度）: 25.78840651769687
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Despite the rapid advancement of Large Language Models (LLMs), uncertainty quantification in LLM generation is a persistent challenge. Although recent approaches have achieved strong performance by restricting LLMs to produce short or constrained answer sets, many real-world applications require long-form and free-form text generation. A key difficulty in this setting is that LLMs often produce responses that are semantically coherent yet factually inaccurate, while the underlying semantics are multifaceted and the linguistic structure is complex. To tackle this challenge, this paper introduces Interrogative Uncertainty Quantification (IUQ), a novel framework that leverages inter-sample consistency and intra-sample faithfulness to quantify the uncertainty in long-form LLM outputs. By utilizing an interrogate-then-respond paradigm, our method provides reliable measures of claim-level uncertainty and the model's faithfulness. Experimental results across diverse model families and model sizes demonstrate the superior performance of IUQ over two widely used long-form generation datasets. The code is available at https://github.com/louisfanhz/IUQ.
Abstract（参考訳）: LLM(Large Language Models)の急速な進歩にもかかわらず、LLM生成の不確実性定量化は永続的な課題である。近年のアプローチは、LLMを制限して短い、あるいは制約された応答集合を生成することで、高いパフォーマンスを実現しているが、多くの現実世界のアプリケーションは、長文および自由形テキスト生成を必要とする。この設定における重要な困難は、LLMが意味的に一貫性があるが事実的に不正確な応答をしばしば生成するのに対して、基礎となるセマンティクスは多面的であり、言語構造は複雑である。この課題に対処するために、長いLLM出力の不確かさを定量化するために、サンプル間の一貫性とサンプル内忠実性を活用する新しいフレームワークであるInterrogative Uncertainity Quantification (IUQ)を導入する。問合せ対応パラダイムを利用して,クレームレベルの不確実性とモデルの忠実度を信頼度として評価する。モデルファミリとモデルサイズにまたがる実験結果は、広く使用されている2つの長文生成データセットよりも、IUQの優れた性能を示す。コードはhttps://github.com/louisfanhz/IUQ.comで入手できる。

論文の概要: IUQ: Interrogative Uncertainty Quantification for Long-Form Large Language Model Generation

関連論文リスト