Fugu-MT 論文翻訳(概要): Knowledge-Level Consistency Reinforcement Learning: Dual-Fact Alignment for Long-Form Factuality

論文の概要: Knowledge-Level Consistency Reinforcement Learning: Dual-Fact Alignment for Long-Form Factuality

arxiv url: http://arxiv.org/abs/2509.23765v1
Date: Sun, 28 Sep 2025 09:23:06 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-30 22:32:19.431919
Title: Knowledge-Level Consistency Reinforcement Learning: Dual-Fact Alignment for Long-Form Factuality
Title（参考訳）: 知識レベル整合性強化学習:長期的ファクティリティのためのデュアルファクトアライメント
Authors: Junliang Li, Yucheng Wang, Yan Chen, Yu Ran, Ruiqing Zhang, Jing Liu, Hua Wu, Haifeng Wang,
Abstract要約: 幻覚と事実性の欠如は、大きな言語モデルの信頼性にとって重要な障害である。本稿では,政策モデルの表現された知識と基本モデルのパラメトリック知識との知識整合性に着目した新しい枠組みを提案する。
参考スコア（独自算出の注目度）: 27.687276551678583
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Hallucination and factuality deficits remain key obstacles to the reliability of large language models (LLMs) in long-form generation. Existing reinforcement learning from human feedback (RLHF) frameworks primarily rely on preference rewards, yet they often overlook the model's internal knowledge boundaries, exacerbating the so-called "hallucination tax". To address this challenge, we propose Knowledge-Level Consistency Reinforcement Learning Framework (KLCF), a novel framework that focuses on the knowledge consistency between the policy model's expressed knowledge and the base model's parametric knowledge, and introduces a Dual-Fact Alignment mechanism to jointly optimize factual recall and precision. Specifically, KLCF leverages pretrained knowledge boundaries to construct fact checklist, guiding online reinforcement learning to improve factual coverage and recall; simultaneously, it trains a self-assessment module based on the base model's internal knowledge to enhance factual precision during generation. Unlike prior methods that rely on external retrieval or heavy verification, our reward design is fully external-knowledge-free and lightweight, making KLCF efficient and easily scalable to large-scale training. Experimental results demonstrate that KLCF substantially improves factuality metrics across multiple long-form benchmarks and effectively alleviates model hallucinations.
Abstract（参考訳）: 幻覚と事実性障害は、長文生成における大規模言語モデル(LLM)の信頼性にとって重要な障害である。既存の人間フィードバック(RLHF)フレームワークからの強化学習は、主に好みの報酬に頼っているが、モデルの内部知識の境界を見落とし、いわゆる「ハロシン化税」が悪化する。この課題に対処するために,政策モデルの表現された知識と基本モデルのパラメトリック知識との知識整合性に着目した新しいフレームワークであるKLCF(Knowledge-Level Consistency Reinforcement Learning Framework)を提案する。具体的には、事前訓練された知識境界を活用して事実チェックリストを構築し、オンライン強化学習を指導し、事実カバレッジとリコールを改善すると同時に、ベースモデルの内部知識に基づいて自己評価モジュールを訓練し、生成時の事実精度を高める。外部検索や重度検証に依存する従来の手法とは異なり、報酬設計は完全に外部知識のない軽量で、KLCFを効率よく、大規模トレーニングにスケーラブルにします。実験結果から、KLCFは複数の長期ベンチマークにおける実測値を大幅に改善し、モデル幻覚を効果的に緩和することが示された。

論文の概要: Knowledge-Level Consistency Reinforcement Learning: Dual-Fact Alignment for Long-Form Factuality

関連論文リスト