Fugu-MT 論文翻訳(概要): Efficient Test-Time Retrieval Augmented Generation

論文の概要: Efficient Test-Time Retrieval Augmented Generation

arxiv url: http://arxiv.org/abs/2511.01059v1
Date: Sun, 02 Nov 2025 19:32:39 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-05 16:37:27.047292
Title: Efficient Test-Time Retrieval Augmented Generation
Title（参考訳）: 効率的なテスト時間検索拡張ジェネレータ
Authors: Hailong Yin, Bin Zhu, Jingjing Chen, Chong-Wah Ngo,
Abstract要約: 本稿では, ET2RAG というテスト時間検索拡張フレームワークを提案し, 大規模言語モデルの性能向上を図る。具体的には、ET2RAGはトレーニング不要の手法であり、まず最も関連性の高い文書を検索し、LSMを増強し、多様な候補の応答を効率的に生成する。
参考スコア（独自算出の注目度）: 40.84120171611779
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Although Large Language Models (LLMs) demonstrate significant capabilities, their reliance on parametric knowledge often leads to inaccuracies. Retrieval Augmented Generation (RAG) mitigates this by incorporating external knowledge, but these methods may introduce irrelevant retrieved documents, leading to inaccurate responses. While the integration methods filter out incorrect answers from multiple responses, but lack external knowledge like RAG methods, and their high costs require balancing overhead with performance gains. To address these issues, we propose an Efficient Test-Time Retrieval-Augmented Generation Framework named ET2RAG to improve the performance of LLMs while maintaining efficiency. Specifically, ET2RAG is a training-free method, that first retrieves the most relevant documents and augments the LLMs to efficiently generate diverse candidate responses by managing response length. Then we compute the similarity of candidate responses and employ a majority voting mechanism to select the most suitable response as the final output. In particular, we discover that partial generation is sufficient to capture the key information necessary for consensus calculation, allowing us to effectively perform majority voting without the need for fully generated responses. Thus, we can reach a balance between computational cost and performance by managing the response length for the number of retrieved documents for majority voting. Experimental results demonstrate that ET2RAG significantly enhances performance across three tasks, including open-domain question answering, recipe generation and image captioning.
Abstract（参考訳）: LLM(Large Language Models)は重要な能力を示すが、パラメトリックな知識への依存はしばしば不正確な結果をもたらす。 Retrieval Augmented Generation (RAG)は、外部知識を取り入れることでこれを緩和するが、これらの手法は無関係に回収された文書を導入し、不正確な応答をもたらす可能性がある。統合メソッドは、複数のレスポンスから誤った回答をフィルタリングするが、RAGメソッドのような外部知識が欠如しており、その高いコストは、パフォーマンスの向上とオーバーヘッドのバランスを必要とする。これらの課題に対処するため, ET2RAG というテスト時間検索拡張フレームワークを提案する。具体的には、ET2RAGは、まず最も関連性の高い文書を検索し、LSMを増強し、応答長を管理することで、多様な候補応答を効率的に生成する、トレーニング不要の手法である。次に、候補応答の類似性を計算し、最終出力として最も適した応答を選択するために多数決機構を用いる。特に、部分生成はコンセンサス計算に必要な重要な情報を取得するのに十分であり、完全生成された応答を必要とせずに、多数決を効果的に行うことができる。これにより,検索した文書数に対する応答長を多数決で管理することで,計算コストと性能のバランスをとることができる。実験の結果,ET2RAGは,オープンドメイン質問応答,レシピ生成,画像キャプションなど,3つのタスクにおけるパフォーマンスを著しく向上させることがわかった。

論文の概要: Efficient Test-Time Retrieval Augmented Generation

関連論文リスト