Fugu-MT 論文翻訳(概要): Improving End-to-End Training of Retrieval-Augmented Generation Models via Joint Stochastic Approximation

論文の概要: Improving End-to-End Training of Retrieval-Augmented Generation Models via Joint Stochastic Approximation

arxiv url: http://arxiv.org/abs/2508.18168v1
Date: Mon, 25 Aug 2025 16:17:16 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-26 18:43:45.859533
Title: Improving End-to-End Training of Retrieval-Augmented Generation Models via Joint Stochastic Approximation
Title（参考訳）: 共同確率近似による検索拡張生成モデルのエンドツーエンド学習の改善
Authors: Hongyu Cao, Yuxuan Wu, Yucheng Cai, Xianyu Zhao, Zhijian Ou,
Abstract要約: Retrieval-augmented Generation (RAG)は、パラメトリックメモリと非パラメトリックメモリを組み合わせたパラダイムとして広く認知されている。 RAGモデルのエンドツーエンド最適化における大きな課題は、関連するパスに対する限界化が必要であることである。本稿では,共同近似(JSA)に基づくRAGのエンドツーエンドトレーニングを提案する。 JSAアルゴリズムはEM(expectation-maximization)アルゴリズムの拡張であり、潜在変数モデルの推定に特に強力である。
参考スコア（独自算出の注目度）: 9.493788719707835
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Retrieval-augmented generation (RAG) has become a widely recognized paradigm to combine parametric memory with non-parametric memories. An RAG model consists of two serial connecting components (retriever and generator). A major challenge in end-to-end optimization of the RAG model is that marginalization over relevant passages (modeled as discrete latent variables) from a knowledge base is required. Traditional top-K marginalization and variational RAG (VRAG) suffer from biased or high-variance gradient estimates. In this paper, we propose and develop joint stochastic approximation (JSA) based end-to-end training of RAG, which is referred to as JSA-RAG. The JSA algorithm is a stochastic extension of the EM (expectation-maximization) algorithm and is particularly powerful in estimating discrete latent variable models. Extensive experiments are conducted on five datasets for two tasks (open-domain question answering, knowledge-grounded dialogs) and show that JSA-RAG significantly outperforms both vanilla RAG and VRAG. Further analysis shows the efficacy of JSA-RAG from the perspectives of generation, retrieval, and low-variance gradient estimate.
Abstract（参考訳）: Retrieval-augmented Generation (RAG) は、パラメトリックメモリと非パラメトリックメモリを組み合わせたパラダイムとして広く認知されている。 RAGモデルは、2つのシリアル接続コンポーネント(レトリバーとジェネレータ)から構成される。 RAGモデルのエンドツーエンド最適化における大きな課題は、知識ベースからの関連するパス(離散潜在変数としてモデル化される)に対する限界化が必要であることである。従来のトップK偏差化と変分RAG(VRAG)は偏差あるいは高分散勾配推定に悩まされている。本稿では,共同確率近似(JSA)に基づくRAGのエンドツーエンドトレーニングを提案し,開発する。 JSAアルゴリズムは、EM(Expectation-maximization)アルゴリズムの確率的拡張であり、特に離散潜在変数モデルの推定に強力である。 JSA-RAG がバニラRAG と VRAG の両方を著しく上回っていることを示す。さらに, 生成, 検索, 低分散勾配推定の観点から, JSA-RAGの有効性が示唆された。

論文の概要: Improving End-to-End Training of Retrieval-Augmented Generation Models via Joint Stochastic Approximation

関連論文リスト