Fugu-MT 論文翻訳(概要): LLM-Specific Utility: A New Perspective for Retrieval-Augmented Generation

論文の概要: LLM-Specific Utility: A New Perspective for Retrieval-Augmented Generation

arxiv url: http://arxiv.org/abs/2510.11358v1
Date: Mon, 13 Oct 2025 12:57:45 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-14 18:06:30.368955
Title: LLM-Specific Utility: A New Perspective for Retrieval-Augmented Generation
Title（参考訳）: LLM-Specific Utility:Retrieval-Augmented Generationの新しい視点
Authors: Hengran Zhang, Keping Bi, Jiafeng Guo, Jiaming Zhang, Shuaiqiang Wang, Dawei Yin, Xueqi Cheng,
Abstract要約: Retrieval-augmented Generation (RAG)は、外部知識を取り入れた大規模言語モデル(LLM)を強化する。既存の研究はしばしばユーティリティをジェネリック属性として扱い、異なるLLMが同じ通路から異なる利益をもたらすという事実を無視している。
参考スコア（独自算出の注目度）: 110.610512800947
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Retrieval-augmented generation (RAG) enhances large language models (LLMs) by incorporating external knowledge. While traditional retrieval focuses on relevance, RAG's effectiveness depends on the utility of retrieved passages, i.e., the usefulness in facilitating the generation of an accurate and comprehensive answer. Existing studies often treat utility as a generic attribute, ignoring the fact that different LLMs may benefit differently from the same passage due to variations in internal knowledge and comprehension ability. In this work, we introduce and systematically investigate the notion of LLM-specific utility. Through large-scale experiments across multiple datasets and LLMs, we demonstrate that human-annotated passages are not optimal for LLMs and that ground-truth utilitarian passages are not transferable across different LLMs. These findings highlight the necessity of adopting the LLM-specific utility in RAG research. Our findings indicate that some human-annotated passages are not ground-truth utilitarian passages for specific LLMs, partially due to the varying readability of queries and passages for LLMs, a tendency for which perplexity is a key metric. Based on these findings, we propose a benchmarking procedure for LLM-specific utility judgments. We evaluate existing utility judgment methods on six datasets and find that while verbalized methods using pseudo-answers perform robustly, LLMs struggle to assess utility effectively-failing to reject all passages for known queries and to select truly useful ones for unknown queries.
Abstract（参考訳）: Retrieval-augmented Generation (RAG)は、外部知識を取り入れた大規模言語モデル(LLM)を強化する。従来の検索は関連性に重点を置いているが、RAGの有効性は、検索されたパスの有用性、すなわち、正確で包括的な回答の生成を促進するための有用性に依存する。既存の研究はしばしばユーティリティを一般的な属性として扱うが、内部知識のバリエーションや理解能力の違いにより、異なるLLMが同一のパスから異なる利益を得る可能性があるという事実を無視している。本研究では,LLM固有のユーティリティの概念を導入し,体系的に検討する。複数のデータセットやLSMをまたいだ大規模な実験を通して、人間による注釈付き通路はLLMに最適ではなく、また、実効性のある通路は異なるLSM間で転送できないことを示した。これらの知見は,RAG研究にLLM固有のユーティリティを採用する必要性を浮き彫りにした。以上の結果から, 人為的注釈付き通路は, 特定のLCMに対して, クエリの可読性やLPMに対する通路の可読性の違いが原因であり, パープレキシティが重要な指標となる傾向が示唆された。そこで本研究では,LLM固有の実用性判断のためのベンチマーク手法を提案する。提案手法は,6つのデータセットに対して既存の効用判定手法を評価し,擬似回答を用いた動詞化手法が頑健に機能するのに対して,LLMは,既知のクエリの全てのパスを拒否し,未知のクエリに対して真に有用なものを選択するのに有効な効用判定法を評価するのに苦慮している。

論文の概要: LLM-Specific Utility: A New Perspective for Retrieval-Augmented Generation

関連論文リスト