Fugu-MT 論文翻訳(概要): A Comparison of Independent and Joint Fine-tuning Strategies for Retrieval-Augmented Generation

論文の概要: A Comparison of Independent and Joint Fine-tuning Strategies for Retrieval-Augmented Generation

arxiv url: http://arxiv.org/abs/2510.01600v1
Date: Thu, 02 Oct 2025 02:30:28 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-03 16:59:20.952866
Title: A Comparison of Independent and Joint Fine-tuning Strategies for Retrieval-Augmented Generation
Title（参考訳）: 検索・拡張生成のための独立型・結合型微調整法の比較
Authors: Neal Gregory Lawton, Alfy Samuel, Anoop Kumar, Daben Liu,
Abstract要約: 我々は、微調整型検索拡張生成(RAG)パイプラインの戦略を評価し、比較する。これらの戦略はすべて、EMとF1の世代品質の同等の改善を達成していることを観察する。
参考スコア（独自算出の注目度）: 4.199577388005438
License: http://creativecommons.org/licenses/by/4.0/
Abstract: A Comparison of Independent and Joint Fine-tuning Strategies for Retrieval-Augmented Generation Download PDF Neal Gregory Lawton, Alfy Samuel, Anoop Kumar, Daben Liu Published: 20 Aug 2025, Last Modified: 17 Sept 2025EMNLP 2025 FindingsConference, Publication Chairs, AuthorsRevisionsBibTeXCC BY 4.0 Keywords: Retrieval-Augmented Generation (RAG), Large Language Models (LLMs), Fine-tuning, Question Answering, Joint fine-tuning TL;DR: We evaluate and compare strategies for fine-tuning Retrieval Augmented Generation (RAG) pipelines, including independent fine-tuning, joint fine-tuning, and two-phase fine-tuning. Abstract: Retrieval augmented generation (RAG) is a popular framework for question answering that is powered by two large language models (LLMs): an embedding model that retrieves context documents from a database that are relevant to a given question, and a generator model that uses the retrieved context to generate an answer to the question. Both the embedding and generator models can be fine-tuned to increase performance of a RAG pipeline on a new task, but multiple fine-tuning strategies exist with different costs and benefits. In this paper, we evaluate and compare several RAG fine-tuning strategies, including independent, joint, and two-phase fine-tuning. In our experiments, we observe that all of these strategies achieve about equal improvement in EM and F1 generation quality metrics, although they have significantly different computational costs. We conclude the optimal fine-tuning strategy to use depends on whether the training dataset includes context labels and whether a grid search over the learning rates for the embedding and generator models is required.
Abstract（参考訳）: A Comparison of Independent and Joint Fine-tuning Strategies for Retrieval-Augmented Generation download PDF Neal Gregory Lawton, Alfy Samuel, Anoop Kumar, Daben Liu Published: 20 Aug 2025, Last Modified: 17 Sept 2025EMNLP 2025 FindingsConference, Publishation Chairs, Authors RevisionsBibTeXCC BY 4.0 Keywords: Retrieval-Augmented Generation (RAG), Large Language Models (LLMs), Fine-tuning, Question Answering, Joint Fine-tuning TL;DR:我々は、独立した微調整、微調整を含む微調整(RAG)パイプラインの評価と比較を行った。要約: Retrieval augmented generation (RAG)は2つの大きな言語モデル(LLM)をベースとした質問応答のための一般的なフレームワークであり、ある質問に関連するデータベースからコンテキスト文書を抽出する埋め込みモデルと、検索したコンテキストを使用して質問に対する回答を生成するジェネレータモデルである。埋め込みモデルとジェネレータモデルはどちらも、新しいタスクにおけるRAGパイプラインのパフォーマンスを向上させるために微調整できるが、異なるコストと利点で複数の微調整戦略が存在する。本稿では,独立性,関節性,二相微調整を含む複数のRAG微調整戦略を評価し,比較する。実験では,計算コストは著しく異なるものの,これらすべての戦略がEMとF1生成の品質指標の同等の改善について達成されていることを観察した。トレーニングデータセットにコンテキストラベルが含まれているか,組込みおよびジェネレータモデルに対する学習率をグリッドで検索する必要があるかによって,最適な微調整戦略が求められる。

論文の概要: A Comparison of Independent and Joint Fine-tuning Strategies for Retrieval-Augmented Generation

関連論文リスト