Fugu-MT 論文翻訳(概要): Evaluating the Robustness of Retrieval-Augmented Generation to Adversarial Evidence in the Health Domain

論文の概要: Evaluating the Robustness of Retrieval-Augmented Generation to Adversarial Evidence in the Health Domain

arxiv url: http://arxiv.org/abs/2509.03787v1
Date: Thu, 04 Sep 2025 00:45:58 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-05 20:21:10.001557
Title: Evaluating the Robustness of Retrieval-Augmented Generation to Adversarial Evidence in the Health Domain
Title（参考訳）: 健康領域の敵対的エビデンスに対する検索型生成のロバスト性の評価
Authors: Shakiba Amirshahi, Amin Bigdeli, Charles L. A. Clarke, Amira Ghenai,
Abstract要約: Retrieval augmented generation (RAG) システムは、検索された証拠や文脈を支援として提供することにより、Large Language Model (LLM) の応答を現実的に根拠付ける方法を提供する。 LLMは、回収された証拠に存在する誤報を吸収し、再生する。この問題は、抽出された証拠が、誤報を公布することを明示的に意図した敵対的な材料を含むと拡大される。
参考スコア（独自算出の注目度）: 8.094811345546118
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Retrieval augmented generation (RAG) systems provide a method for factually grounding the responses of a Large Language Model (LLM) by providing retrieved evidence, or context, as support. Guided by this context, RAG systems can reduce hallucinations and expand the ability of LLMs to accurately answer questions outside the scope of their training data. Unfortunately, this design introduces a critical vulnerability: LLMs may absorb and reproduce misinformation present in retrieved evidence. This problem is magnified if retrieved evidence contains adversarial material explicitly intended to promulgate misinformation. This paper presents a systematic evaluation of RAG robustness in the health domain and examines alignment between model outputs and ground-truth answers. We focus on the health domain due to the potential for harm caused by incorrect responses, as well as the availability of evidence-based ground truth for many common health-related questions. We conduct controlled experiments using common health questions, varying both the type and composition of the retrieved documents (helpful, harmful, and adversarial) as well as the framing of the question by the user (consistent, neutral, and inconsistent). Our findings reveal that adversarial documents substantially degrade alignment, but robustness can be preserved when helpful evidence is also present in the retrieval pool. These findings offer actionable insights for designing safer RAG systems in high-stakes domains by highlighting the need for retrieval safeguards. To enable reproducibility and facilitate future research, all experimental results are publicly available in our github repository. https://github.com/shakibaam/RAG_ROBUSTNESS_EVAL
Abstract（参考訳）: Retrieval augmented generation (RAG) システムは、検索された証拠や文脈を支援として提供することにより、Large Language Model (LLM) の応答を現実的に根拠付ける方法を提供する。この文脈によって、RAGシステムは幻覚を減らし、LLMがトレーニングデータの範囲外の質問に正確に答える能力を拡張することができる。残念ながら、この設計は致命的な脆弱性をもたらしている: LLMは、回収された証拠に存在する誤報を吸収し、再生する。この問題は、抽出された証拠が、誤報を公布することを明示的に意図した敵対的な材料を含むと拡大される。本稿では,健康領域におけるRAGのロバスト性を体系的に評価し,モデル出力とグラウンドトルース回答の整合性を検討する。我々は,不適切な反応による害の可能性を考慮し,健康領域に焦点をあてるとともに,多くの一般的な健康問題に対する証拠に基づく根拠的真理が利用可能である。我々は,検索した文書の種類や構成(有害,有害,敵対的)とユーザによる質問のフレーミング(一貫性,中立,矛盾)を両立させる,共通の健康質問を用いた制御実験を行った。以上の結果から, 逆行性文書の整合性は著しく低下するが, 有効な証拠が検索プールにも存在している場合, 頑健性は保たれることが明らかとなった。これらの知見は, より安全なRAGシステムを高信頼領域に設計する上で, より安全な安全ガードの必要性を強調することによって, 実用的な知見を提供するものである。再現性を実現し,今後の研究を容易にするため,実験結果はgithubリポジトリで公開されています。 https://github.com/shakibaam/RAG_ROBUSTNESS_EVAL

論文の概要: Evaluating the Robustness of Retrieval-Augmented Generation to Adversarial Evidence in the Health Domain

関連論文リスト