Fugu-MT 論文翻訳(概要): Guaranteeing Knowledge Integration with Joint Decoding for Retrieval-Augmented Generation

論文の概要: Guaranteeing Knowledge Integration with Joint Decoding for Retrieval-Augmented Generation

arxiv url: http://arxiv.org/abs/2604.08046v1
Date: Thu, 09 Apr 2026 09:52:21 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-10 18:34:05.852385
Title: Guaranteeing Knowledge Integration with Joint Decoding for Retrieval-Augmented Generation
Title（参考訳）: 検索拡張生成のための共同復号化による知識統合の保証
Authors: Zhengyi Zhao, Shubo Zhang, Zezhong Wang, Yuxi Zhang, Huimin Wang, Yutian Zhao, Yefeng Zheng, Binyang Li, Kam-Fai Wong, Xian Wu,
Abstract要約: GuarantRAGは、証拠統合から推論を明示的に分離するフレームワークである。 5つのQAベンチマークの実験では、GurantRAGは最大で12.1%精度が向上している。
参考スコア（独自算出の注目度）: 45.13215113490545
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Retrieval-Augmented Generation (RAG) significantly enhances Large Language Models (LLMs) by providing access to external knowledge. However, current research primarily focuses on retrieval quality, often overlooking the critical ''integration bottleneck'': even when relevant documents are retrieved, LLMs frequently fail to utilize them effectively due to conflicts with their internal parametric knowledge. In this paper, we argue that implicitly resolving this conflict in a single generation pass is suboptimal. We introduce GuarantRAG, a framework that explicitly decouples reasoning from evidence integration. First, we generate an ''Inner-Answer'' based solely on parametric knowledge to capture the model's reasoning flow. Second, to guarantee faithful evidence extraction, we generate a ''Refer-Answer'' using a novel Contrastive DPO objective. This objective treats the parametric Inner-Answer as a negative constraint and the retrieved documents as positive ground truth, forcing the model to suppress internal hallucinations in favor of external evidence during this phase. Finally, rather than naive concatenation or using the DPO trained model directly, we propose a joint decoding mechanism that dynamically fuses the logical coherence of the Inner-Answer with the factual precision of the Refer-Answer at the token level. Experiments on five QA benchmarks demonstrate that GuarantRAG improves accuracy by up to 12.1% and reduces hallucinations by 16.3% compared to standard and dynamic RAG baselines.
Abstract（参考訳）: Retrieval-Augmented Generation (RAG)は、外部知識へのアクセスを提供することで、LLM(Large Language Models)を大幅に強化する。しかし、近年の研究では、重要な「統合ボトルネック」を見越して検索品質に重点を置いており、関連する文書を検索しても、内部のパラメトリック知識と矛盾するため、LLMは効果的に利用できないことが多い。本稿では、この衝突を1世代で暗黙的に解決することは、最適以下であると論じる。本稿では,証拠統合から推論を明示的に分離するフレームワークであるGurantRAGを紹介する。まず,モデルの推論フローを捉えるためのパラメトリック知識のみに基づく'インナー・アンサー'を生成する。第二に、忠実な証拠抽出を保証するために、新しいコントラストDPO目標を用いて'Refer-Answer'を生成する。この目的は、パラメトリックなインナーアンサーを負の制約として扱い、検索された文書を正の根拠の真理として扱い、このフェーズにおいて、モデルが外部の証拠を優先して内部幻覚を抑制することを強制する。最後に,DPO学習モデルを直接使用するのではなく,トークンレベルでのRefer-Answerの実際の精度と内部解答者の論理的コヒーレンスを動的に融合させる共同復号機構を提案する。 5つのQAベンチマークの実験では、GurantRAGは標準および動的RAGベースラインと比較して、精度を最大12.1%向上し、幻覚を16.3%低減している。

論文の概要: Guaranteeing Knowledge Integration with Joint Decoding for Retrieval-Augmented Generation

関連論文リスト