Fugu-MT 論文翻訳(概要): ME: Trigger Element Combination Backdoor Attack on Copyright Infringement

論文の概要: ME: Trigger Element Combination Backdoor Attack on Copyright Infringement

arxiv url: http://arxiv.org/abs/2506.10776v1
Date: Thu, 12 Jun 2025 14:51:27 GMT
ステータス: 翻訳完了
システム内更新日: 2025-06-13 15:37:22.797337
Title: ME: Trigger Element Combination Backdoor Attack on Copyright Infringement
Title（参考訳）: Me: 著作権侵害のバックドア攻撃にTrigger要素が組み合わさる
Authors: Feiyu Yang, Siyuan Liang, Aishan Liu, Dacheng Tao,
Abstract要約: SilentBadDiffusion(SBD)は、テキスト・ツー・イメージタスクにおけるSD攻撃において優れた性能を示す手法である。本稿では,SBDのような攻撃の研究に利用できる新しいデータセットを作成し,SBDに基づくマルチ要素攻撃法を提案する。著作権侵害率 (CIR) / First Attack Epoch (FAE) の2つの新しいデータセットは16.78% / 39.50と51.20% / 23.60で、それぞれPokemonとMijourneyのベンチマークに近いか、あるいは劣っている。
参考スコア（独自算出の注目度）: 61.06621533874629
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The capability of generative diffusion models (DMs) like Stable Diffusion (SD) in replicating training data could be taken advantage of by attackers to launch the Copyright Infringement Attack, with duplicated poisoned image-text pairs. SilentBadDiffusion (SBD) is a method proposed recently, which shew outstanding performance in attacking SD in text-to-image tasks. However, the feasible data resources in this area are still limited, some of them are even constrained or prohibited due to the issues like copyright ownership or inappropriate contents; And not all of the images in current datasets are suitable for the proposed attacking methods; Besides, the state-of-the-art (SoTA) performance of SBD is far from ideal when few generated poisoning samples could be adopted for attacks. In this paper, we raised new datasets accessible for researching in attacks like SBD, and proposed Multi-Element (ME) attack method based on SBD by increasing the number of poisonous visual-text elements per poisoned sample to enhance the ability of attacking, while importing Discrete Cosine Transform (DCT) for the poisoned samples to maintain the stealthiness. The Copyright Infringement Rate (CIR) / First Attack Epoch (FAE) we got on the two new datasets were 16.78% / 39.50 and 51.20% / 23.60, respectively close to or even outperformed benchmark Pokemon and Mijourney datasets. In condition of low subsampling ratio (5%, 6 poisoned samples), MESI and DCT earned CIR / FAE of 0.23% / 84.00 and 12.73% / 65.50, both better than original SBD, which failed to attack at all.
Abstract（参考訳）: トレーニングデータの複製における安定拡散(SD)のような生成拡散モデル(DM)の能力は、攻撃者が有毒な画像とテキストのペアを複製した著作権侵害攻撃を起動するために利用することができる。 SilentBadDiffusion(SBD)は、テキスト・ツー・イメージタスクにおけるSD攻撃において優れた性能を示す手法である。しかし、この領域で実現可能なデータリソースはまだ限られており、著作権の所有や不適切な内容といった問題によって制限や禁止されているものもある。そして、現在のデータセットのすべての画像が攻撃方法に適合しているわけではない。本稿では,SBD などの攻撃研究に利用できる新しいデータセットを作成し,毒サンプル当たりの有毒な視覚テキスト要素数を増やして攻撃力を高めるとともに,毒サンプルに対する離散コサイン変換 (DCT) を輸入してステルス性を維持することで,SBD に基づく多要素攻撃手法を提案する。著作権侵害率 (CIR) / First Attack Epoch (FAE) の2つの新しいデータセットは16.78% / 39.50と51.20% / 23.60で、それぞれPokemonとMijourneyのベンチマークに近いか、あるいは劣っている。 MESIとDCTは低いサブサンプリング比(5%、6種類の有毒試料)で、CIR / FAEは0.23% / 84.00と12.73% / 65.50で、どちらも元のSBDより優れているが、攻撃に失敗した。

論文の概要: ME: Trigger Element Combination Backdoor Attack on Copyright Infringement

関連論文リスト