Fugu-MT Paper Translation (Abstract): VortexPIA: Indirect Prompt Injection Attack against LLMs for Efficient Extraction of User Privacy

Paper overview: VortexPIA: Indirect Prompt Injection Attack against LLMs for Efficient Extraction of User Privacy

arxiv url: http://arxiv.org/abs/2510.04261v1
Date: Sun, 05 Oct 2025 15:58:55 GMT
Status: Translation complete
Last updated in system: 2025-10-07 16:52:59.546081
Title: VortexPIA: Indirect Prompt Injection Attack against LLMs for Efficient Extraction of User Privacy
Title (reference translation): VortexPIA: An Indirect Prompt Injection Attack against LLMs for Efficient Extraction of User Privacy
Authors: Yu Cui, Sicheng Pan, Yifei Liu, Haibin Zhang, Cong Zuo
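Abstract summary: Large language models (LLMs) are widely deployed in conversational AIs (CAIs). Recent research shows that LLM-based CAIs can be manipulated to extract private information from human users, posing serious security threats. The paper proposes VortexPIA, a novel indirect prompt injection attack that induces privacy extraction under black-box settings.
Reference score (independently computed attention measure): 22.037235521470468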
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large language models (LLMs) have been widely deployed in Conversational AIs (CAIs), while exposing privacy and security threats. Recent research shows that LLM-based CAIs can be manipulated to extract private information from human users, posing serious security threats. However, the methods proposed in that study rely on a white-box setting that adversaries can directly modify the system prompt. This condition is unlikely to hold in real-world deployments. The limitation raises a critical question: can unprivileged attackers still induce such privacy risks in practical LLM-integrated applications? To address this question, we propose VortexPIA, a novel indirect prompt injection attack that induces privacy extraction in LLM-integrated applications under black-box settings. By injecting token-efficient data containing false memories, VortexPIA misleads LLMs to actively request private information in batches. Unlike prior methods, VortexPIA allows attackers to flexibly define multiple categories of sensitive data. We evaluate VortexPIA on six LLMs, covering both traditional and reasoning LLMs, across four benchmark datasets. The results show that VortexPIA significantly outperforms baselines and achieves state-of-the-art (SOTA) performance. It also demonstrates efficient privacy requests, reduced token consumption, and enhanced robustness against defense mechanisms. We further validate VortexPIA on multiple realistic open-source LLM-integrated applications, demonstrating its practical effectiveness.
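Abstract (reference translation): Large language models (LLMs) are widely deployed in conversational AIs (CAIs) while exposing privacy and security threats. Recent research shows that LLM-based CAIs can be manipulated to extract private information from human users, posing serious security threats. However, the methods proposed in that study rely on a white-box setting in which the adversary can directly modify the system prompt, a condition unlikely to hold in real-world deployments. This raises the question of whether unprivileged attackers can still induce such privacy risks in practical LLM-integrated applications. To address it, the paper proposes VortexPIA, a novel indirect prompt injection attack that induces privacy extraction in LLM-integrated applications under black-box settings. By injecting token-efficient data containing false memories, VortexPIA misleads LLMs into actively requesting private information in batches. Unlike prior methods, VortexPIA lets attackers flexibly define multiple categories of sensitive data. VortexPIA is evaluated on six LLMs, covering both traditional and reasoning LLMs, across four benchmark datasets. The results show that it significantly outperforms baselines and achieves state-of-the-art (SOTA) performance, with efficient privacy requests, reduced token consumption, and improved robustness against defense mechanisms. VortexPIA is further validated on multiple realistic open-source LLM-integrated applications, demonstrating its practical effectiveness.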


Related papers