Fugu-MT 論文翻訳(概要): Visual Inception: Compromising Long-term Planning in Agentic Recommenders via Multimodal Memory Poisoning

論文の概要: Visual Inception: Compromising Long-term Planning in Agentic Recommenders via Multimodal Memory Poisoning

arxiv url: http://arxiv.org/abs/2604.16966v1
Date: Sat, 18 Apr 2026 11:15:37 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-21 21:52:52.264002
Title: Visual Inception: Compromising Long-term Planning in Agentic Recommenders via Multimodal Memory Poisoning
Title（参考訳）: ビジュアルインセプション:マルチモーダルメモリによるエージェントレコメンダの長期計画の妥協
Authors: Jiachen Qian,
Abstract要約: 私たちは「ビジュアルインセプション」と呼ばれる脅威を見つけます Visual Inceptionは、ユーザのアップロードしたイメージにトリガーを注入し、システムのメモリ内で“スリーパーエージェント”として機能する。将来の計画中に回収されると、これらの記憶はエージェントの推論チェーンをハイジャックし、即時注射なしで敵が定義した目標に向けて制御する。人間の認知に触発された二重プロセス防衛フレームワークであるCognitiveGuardを提案する。
参考スコア（独自算出の注目度）: 1.0998907972211756
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The evolution from static ranking models to Agentic Recommender Systems (Agentic RecSys) empowers AI agents to maintain long-term user profiles and autonomously plan service tasks. While this paradigm shift enhances personalization, it introduces a vulnerability: reliance on Long-term Memory (LTM). In this paper, we uncover a threat termed "Visual Inception." Unlike traditional adversarial attacks that seek immediate misclassification, Visual Inception injects triggers into user-uploaded images (e.g., lifestyle photos) that act as "sleeper agents" within the system's memory. When retrieved during future planning, these poisoned memories hijack the agent's reasoning chain, steering it toward adversary-defined goals (e.g., promoting high-margin products) without prompt injection. To mitigate this, we propose CognitiveGuard, a dual-process defense framework inspired by human cognition. It consists of a System 1 Perceptual Sanitizer (diffusion-based purification) to cleanse sensory inputs and a System 2 Reasoning Verifier (counterfactual consistency checks) to detect anomalies in memory-driven planning. Extensive experiments on a mock e-commerce agent environment demonstrate that Visual Inception achieves about 85% Goal-Hit Rate (GHR), while CognitiveGuard reduces this risk to around 10% with configurable latency trade-offs (about 1.5s in lite mode to about 6.5s for full sequential verification), without quality degradation under our setup.
Abstract（参考訳）: 静的ランキングモデルからエージェントレコメンダシステム(Agentic Recommender Systems, Agentic RecSys)への進化により、AIエージェントは長期的なユーザプロファイルを維持し、サービスタスクを自律的に計画することが可能になる。このパラダイムシフトはパーソナライゼーションを強化する一方で、LTM(Long-term Memory)への依存という脆弱性を導入している。本稿では,「ビジュアル・インセプション」と呼ばれる脅威を明らかにする。直近の誤分類を求める従来の敵攻撃とは異なり、Visual Inceptionはユーザーのアップロードした画像(例えばライフスタイルの写真)にトリガーを注入し、システムのメモリ内で「スリーパーエージェント」として機能する。将来の計画中に回復すると、これらの中毒した記憶はエージェントの推論チェーンをハイジャックし、敵が定義した目標(例えば、高マージン製品を推進)に向けて、即時注射なしで操る。これを緩和するために,人間の認知に触発された二重プロセス防衛フレームワークであるCognitiveGuardを提案する。 System 1 Perceptual Sanitizer (diffusion-based purification) と System 2 Reasoning Verifier (counterfactual consistency checks) で構成され、メモリ駆動計画における異常を検出する。モックなEコマースエージェント環境に関する大規模な実験は、Visual Inceptionが約85%のゴール-ハイトレート(GHR)を達成したことを実証している。

論文の概要: Visual Inception: Compromising Long-term Planning in Agentic Recommenders via Multimodal Memory Poisoning

関連論文リスト