Fugu-MT 論文翻訳(概要): Who Benefits from RAG? The Role of Exposure, Utility and Attribution Bias

論文の概要: Who Benefits from RAG? The Role of Exposure, Utility and Attribution Bias

arxiv url: http://arxiv.org/abs/2603.24218v1
Date: Wed, 25 Mar 2026 11:45:52 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-26 21:06:11.270266
Title: Who Benefits from RAG? The Role of Exposure, Utility and Attribution Bias
Title（参考訳）: RAGから恩恵を受けるのは誰か? 露光, 実用, 帰属バイアスの役割
Authors: Mahdi Dehghan, Graham McDonald,
Abstract要約: Retrieval-Augmented Generation (RAG)により強化されたLarge Language Models (LLM) は精度を大幅に向上した。 RAGにおける問合せ群公平度に及ぼす3つの要因の影響について検討した。
参考スコア（独自算出の注目度）: 3.874524858753285
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large Language Models (LLMs) enhanced with Retrieval-Augmented Generation (RAG) have achieved substantial improvements in accuracy by grounding their responses in external documents that are relevant to the user's query. However, relatively little work has investigated the impact of RAG in terms of fairness. Particularly, it is not yet known if queries that are associated with certain groups within a fairness category systematically receive higher accuracy, or accuracy improvements in RAG systems compared to LLM-only, a phenomenon we refer to as query group fairness. In this work, we conduct extensive experiments to investigate the impact of three key factors on query group fairness in RAG, namely: Group exposure, i.e., the proportion of documents from each group appearing in the retrieved set, determined by the retriever; Group utility, i.e., the degree to which documents from each group contribute to improving answer accuracy, capturing retriever-generator interactions; and Group attribution, i.e., the extent to which the generator relies on documents from each group when producing responses. We examine group-level average accuracy and accuracy improvements disparities across four fairness categories using three datasets derived from the TREC 2022 Fair Ranking Track for two tasks: article generation and title generation. Our findings show that RAG systems suffer from the query group fairness problem and amplify disparities in terms of average accuracy across queries from different groups, compared to an LLM-only setting. Moreover, group utility, exposure, and attribution can exhibit strong positive or negative correlations with average accuracy or accuracy improvements of queries from that group, highlighting their important role in fair RAG. Our data and code are publicly available from Github.
Abstract（参考訳）: Retrieval-Augmented Generation (RAG)により強化されたLarge Language Models (LLM) は、ユーザのクエリに関連する外部ドキュメントに応答を基盤として、精度を大幅に向上した。しかし, 公正性の観点からRAGの影響を調査する研究は比較的少ない。特に, フェアネスカテゴリ内の特定のグループに関連付けられたクエリが, LLMのみと比較して, RAGシステムの精度や精度の向上を体系的に受けているかどうかは不明である。本研究は,RAGにおける問合せ群フェアネスに対する3つの重要な要因,すなわち,検索者によって決定された検索集合に現れる各グループからの文書の割合,グループユーティリティ,各グループからの文書が回答精度の向上に寄与する度合い,検索者-生成者間相互作用のキャプチャ,グループ属性,応答の生成時に生成元が各グループからの文書に依存する程度などについて,広範囲にわたる実験を行った。 TREC 2022 Fair Ranking Trackから抽出した3つのデータセットを用いて,4つのフェアネスカテゴリーにおけるグループレベルの平均精度と精度改善の相違について検討した。以上の結果から,RAGシステムはクエリグループフェアネスの問題に悩まされており,LLMのみの設定に比べて,異なるグループからのクエリ間の平均精度の差が増大していることがわかった。さらに、グループユーティリティ、露出、属性は、そのグループからのクエリの平均的正確性や精度の改善と強い正あるいは負の相関を示し、彼らの公正なRAGにおける重要な役割を強調します。私たちのデータとコードはGithubから公開されています。

論文の概要: Who Benefits from RAG? The Role of Exposure, Utility and Attribution Bias

関連論文リスト