Fugu-MT 論文翻訳(概要): Customization under Fire: Plugin Poisoning in Text-to-Image Ecosystem

論文の概要: Customization under Fire: Plugin Poisoning in Text-to-Image Ecosystem

arxiv url: http://arxiv.org/abs/2606.09151v1
Date: Mon, 08 Jun 2026 07:45:17 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-09 14:42:06.811346
Title: Customization under Fire: Plugin Poisoning in Text-to-Image Ecosystem
Title（参考訳）: 火災時のカスタマイズ: テキスト・ツー・イメージ・エコシステムにおけるプラグイン・ポゾン
Authors: Jiahao Chen, Xing He, Yong Yang, Xinfeng Li, Chunyi Zhou, Junhao Li, Zhe Ma, Tianyu Du, Shouling Ji,
Abstract要約: 本稿では,T2I エコシステムにおける信頼と特性を活用する LoRA プラグインサプライチェーンリスクに関する最初の体系的研究である PoisonLoRA を紹介する。プラットフォームによって検出されることなく,4つのシナリオにわたる6つのデータセット上で,CivitaiとLiblibの攻撃成功率(ASR)を約100%達成する。
参考スコア（独自算出の注目度）: 51.46397124066055
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The prosperity of text-to-image (T2I) models has fostered a vibrant share-and-play ecosystem centered on Low-Rank Adaptation (LoRA) plugins, which allow users to customize and share model capabilities with ease. This democratization, however, comes with a hidden but severe security risk. Malicious users could share and distribute seemingly benign LoRA plugins that contain hidden functionalities to poison the model-sharing market, like Civitai or Liblib, severely undermining the user trust that underpins this collaborative ecosystem and threatening the safety of countless downstream applications. Despite these risks, plugin poisoning in the real-world T2I ecosystem remains underexplored. This paper introduces PoisonLoRA, the first systematic study of LoRA plugin supply-chain risks that exploits the trust and characteristics within the T2I ecosystem. We identify two primary attack instances: (1) Concept Hijacking, where a hijacked LoRA could generate images to influence public opinion and spread propaganda, and (2) Task Injection, where a LoRA is injected to produce harmful content (e.g., NSFW images) only activated by a secret key. Critically, the malicious payload persists with virus-like propagation. Such propagations weaponize the very act of creative collaboration (e.g., LoRA merging) to spread its contagion, turning every remix into a new carrier. Extensive experiments validate that PoisonLoRA is both effective and stealthy. Specifically, we achieve approximately 100% attack success rates (ASR) on both Civitai and Liblib on 6 datasets across 4 scenarios, without being detected by the platforms. The poisoned LoRA demonstrates extreme robustness, with nearly 100% ASR even transferred to different base models and remixed more than 5 times.
Abstract（参考訳）: T2I(text-to-image)モデルの繁栄は、ローランド適応(LoRA)プラグインを中心とした活気ある共有とプレイのエコシステムを育み、モデル機能のカスタマイズと共有を容易にする。しかし、この民主化には秘密のセキュリティリスクが伴う。悪意のあるユーザは、CivitaiやLiblibのような、隠された機能を含む一見良質なLoRAプラグインを共有し、配布することで、このコラボレーティブなエコシステムを支え、無数の下流アプリケーションの安全性を脅かすユーザー信頼を著しく損なうことができる。これらのリスクにもかかわらず、現実世界のT2Iエコシステムにおけるプラグイン中毒は未発見のままである。本稿では,T2I エコシステムにおける信頼と特性を活用する LoRA プラグインサプライチェーンリスクに関する最初の体系的研究である PoisonLoRA を紹介する。 1) ハイジャックされたLoRAがイメージを生成して世論に影響を与えプロパガンダを広めるコンセプトハイジャック,(2) 有害なコンテンツ(例えばNSFW画像)を秘密鍵でのみ生成するためにLoRAを注入するタスクインジェクション,である。致命的なことに、悪意のあるペイロードはウイルスのような伝播で持続する。このような伝播は、創造的なコラボレーション(例えばLoRAのマージ)の行為を武器にして、その感染を拡大し、すべてのリミックスを新しいキャリアに変える。大規模な実験により、PoisonLoRAは効果とステルス性の両方があることが示された。具体的には,4つのシナリオにまたがる6つのデータセット上で,CivitaiとLiblibの攻撃成功率(ASR)を約100%達成する。毒性のLoRAは極端に堅牢性を示し、100%近いASRは異なるベースモデルに移行し、5回以上リミックスされた。

論文の概要: Customization under Fire: Plugin Poisoning in Text-to-Image Ecosystem

関連論文リスト