Fugu-MT 論文翻訳(概要): Prompt-Unknown Promotion Attacks against LLM-based Sequential Recommender Systems

論文の概要: Prompt-Unknown Promotion Attacks against LLM-based Sequential Recommender Systems

arxiv url: http://arxiv.org/abs/2604.23640v1
Date: Sun, 26 Apr 2026 10:09:26 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-28 17:12:07.47586
Title: Prompt-Unknown Promotion Attacks against LLM-based Sequential Recommender Systems
Title（参考訳）: LLMに基づくシークエンシャルレコメンダシステムに対するプロンプト未知のプロモートアタック
Authors: Yuchuan Zhao, Tong Chen, Junliang Yu, Zongwei Wang, Lizhen Cui, Hongzhi Yin,
Abstract要約: 大規模言語モデルを用いたシーケンシャルレコメンデータシステム(LLM-SRS)は,最近顕著な性能を示した。本稿では, LLM-SRSにおけるアイテムプロモーション攻撃について, 攻撃者に対してシステムプロンプトと被害者モデルの両方が未知な状況下で, より現実的な状況下で検討する。
参考スコア（独自算出の注目度）: 51.504307822017985
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large language model-powered sequential recommender systems (LLM-SRSs) have recently demonstrated remarkable performance, enabling recommendations through prompt-driven inference over user interaction sequences. However, this paradigm also introduces new security vulnerabilities, particularly text-level manipulations, rendering them appealing targets for promotion attacks that purposely boost the ranking of specific target items. Although such security risks have been receiving increasing attention, existing studies typically rely on an unrealistic assumption of access to either the victim model or prompt to unveil attack mechanisms. In this work, we investigate the item promotion attack in LLM-SRSs under a more realistic setting where both the system prompt and victim model are unknown to the attacker, and propose a Prompt-Unknown Dual-poisoning Attack (PUDA) framework. To simulate attacks under this full black-box setting, we introduce an LLM-based evolutionary refinement strategy that infers discrete system prompts, enabling the training of an effective surrogate model that mimics the behaviors of the victim model. Leveraging the distilled prompt and surrogate model, we devise a promotion attack that adversarially revises target item texts under semantic constraints, which is further complemented by the highly plausible, surrogate-generated poisoning sequences to enable cost-effective target item promotion. Extensive experiments on real-world datasets demonstrate that PUDA consistently outperforms state-of-the-art competitors in boosting the exposure of unpopular target items. Our findings reveal critical security risks in modern LLM-SRSs even when both prompts and models are protected, and highlight the need for more robust defensive means.
Abstract（参考訳）: 大規模言語モデルを用いたシーケンシャルレコメンデータシステム(LLM-SRS)は近年,ユーザインタラクションシーケンスに対するプロンプト駆動推論によるレコメンデーションを実現している。しかし、このパラダイムは新たなセキュリティ脆弱性、特にテキストレベルの操作を導入し、特定のターゲットアイテムのランキングを意図的に向上させるプロモーション攻撃のターゲットをアピールする。このようなセキュリティリスクはますます注目されているが、既存の研究は通常、被害者モデルへのアクセスや攻撃メカニズムの公表を非現実的な仮定に依存している。本研究では, LLM-SRSにおけるアイテムプロモーション攻撃について, 攻撃者に対してシステムプロンプトと被害者モデルの両方が未知な, より現実的な状況下で検討し, Prompt-Unknown Dual-poisoning Attack (PUDA) フレームワークを提案する。このフルブラックボックス設定下での攻撃をシミュレートするために,個別のシステムプロンプトを推論するLLMベースの進化的改善戦略を導入し,被害者モデルの振る舞いを模倣する効果的な代理モデルのトレーニングを可能にする。蒸留処理したプロンプトとサロゲートモデルを利用して,セマンティック制約下でターゲット項目のテキストを逆向きに修正するプロンプト攻撃を考案し,さらに高い確率で生成したサロゲート中毒シーケンスを補完することで,コスト効率の良いターゲット項目のプロモーションを可能にする。現実世界のデータセットに対する大規模な実験は、PUDAが不人気なターゲットアイテムの露出を増大させることで、最先端の競合他社を一貫して上回っていることを示している。本研究は, プロンプトとモデルの両方が保護されている場合でも, 現代のLSM-SRSにおいて重要なセキュリティリスクを明らかにし, より堅牢な防御手段の必要性を強調した。

論文の概要: Prompt-Unknown Promotion Attacks against LLM-based Sequential Recommender Systems

関連論文リスト