Fugu-MT Paper Translation (Abstract): Towards Secure Retrieval-Augmented Generation: A Comprehensive Review of Threats, Defenses and Benchmarks

Paper overview: Towards Secure Retrieval-Augmented Generation: A Comprehensive Review of Threats, Defenses and Benchmarks

arxiv url: http://arxiv.org/abs/2603.21654v1
Date: Mon, 23 Mar 2026 07:32:59 GMT
Status: Translation complete
System update date: 2026-03-24 19:11:39.542438
Title: Towards Secure Retrieval-Augmented Generation: A Comprehensive Review of Threats, Defenses and Benchmarks
Authors: Yanming Mu, Hao Hu, Feiyang Li, Qiao Yuan, Jiang Wu, Zichuan Liu, Pengcheng Liu, Mei Wang, Hongwei Zhou, Yuling Liu
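Abstract summary: Retrieval-Augmented Generation (RAG) significantly mitigates hallucinations and domain knowledge deficiency in large language models by incorporating external knowledge bases. This paper analyzes the underlying vulnerability mechanisms and systematically categorizes core threat vectors such as data poisoning, adversarial attacks, and membership inference attacks. By enabling deep insights into potential risks, this work seeks to foster the development of highly robust and trustworthy next-generation RAG systems.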
Reference score (independently computed attention score): 21.880338678029503
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Retrieval-Augmented Generation (RAG) significantly mitigates the hallucinations and domain knowledge deficiency in large language models by incorporating external knowledge bases. However, the multi-module architecture of RAG introduces complex system-level security vulnerabilities. Guided by the RAG workflow, this paper analyzes the underlying vulnerability mechanisms and systematically categorizes core threat vectors such as data poisoning, adversarial attacks, and membership inference attacks. Based on this threat assessment, we construct a taxonomy of RAG defense technologies from a dual perspective encompassing both input and output stages. The input-side analysis reviews data protection mechanisms including dynamic access control, homomorphic encryption retrieval, and adversarial pre-filtering. The output-side examination summarizes advanced leakage prevention techniques such as federated learning isolation, differential privacy perturbation, and lightweight data sanitization. To establish a unified benchmark for future experimental design, we consolidate authoritative test datasets, security standards, and evaluation frameworks. To the best of our knowledge, this paper presents the first end-to-end survey dedicated to the security of RAG systems. Distinct from existing literature that isolates specific vulnerabilities, we systematically map the entire pipeline, providing a unified analysis of threat models, defense mechanisms, and evaluation benchmarks. By enabling deep insights into potential risks, this work seeks to foster the development of highly robust and trustworthy next-generation RAG systems.
