Fugu-MT 論文翻訳(概要): Adv-TGD: Adversarial Text-Guided Diffusion for Face Recognition Impersonation Attacks

論文の概要: Adv-TGD: Adversarial Text-Guided Diffusion for Face Recognition Impersonation Attacks

arxiv url: http://arxiv.org/abs/2606.11615v2
Date: Tue, 16 Jun 2026 23:04:52 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-18 13:57:35.015672
Title: Adv-TGD: Adversarial Text-Guided Diffusion for Face Recognition Impersonation Attacks
Title（参考訳）: Adv-TGD: 顔認証におけるテキスト誘導拡散
Authors: Omid Ahmadieh, Nima Karimian,
Abstract要約: 本稿では,ジェネレーティブ・アタック・フレームワークであるAdv-TGDを提案する。ターゲットのアイデンティティを偽装し、顔認識システムを欺くことができる光現実的な顔を合成する。 Adv-TGDは、IR152、IRSE50、MobileFace、FaceNetで平均攻撃成功率(ASR)が85.90%に達する。
参考スコア（独自算出の注目度）: 2.1485350418225244
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The widespread adoption of face recognition (FR) technologies raises serious privacy concerns, as facial data can be exploited without consent. To address this challenge, we propose Adv-TGD, a generative adversarial attack framework that synthesizes photorealistic faces capable of impersonating target identities and deceiving face recognition systems. Built upon Stable Diffusion v2.1, Adv-TGD performs per-sample LoRA fine-tuning conditioned on concise textual prompts to generate natural yet adversarially manipulated identities. Unlike conventional identity attack approaches, our method optimizes lightweight cross-attention adapters for each source-target pair within a fixed-timestep denoising process. Latent blending is constrained by a face-local heatmap mask to ensure spatially precise identity manipulation while preserving non-sensitive regions. We introduce a composite objective that integrates masked epsilon-MSE reconstruction, thresholded identity divergence in FR embedding space, directional feature alignment, and source-similarity suppression to balance adversarial attack and visual realism. Optionally, LLaVA-generated attribute prompts enhance fine-grained semantic details without reintroducing identity cues. Under the black-box evaluation protocol, Adv-TGD attains an average attack success rate (ASR) of 85.90% across IR152, IRSE50, MobileFace, and FaceNet, surpassing the semantic SOTA baseline Adv-CPG by 6.25 points, the diffusion-based makeup method DiffAIM by 3 points, and the noise-based P3-Mask by 16 points. Despite its strong attack efficacy, Adv-TGD preserves high visual fidelity (PSNR = 28.18 dB, SSIM = 0.981). Furthermore, we demonstrate the flexibility of our framework by successfully extending it to in-the-wild datasets (LADN), general object classification (ImageNet), and transformer-based diffusion models (FLUX.1).
Abstract（参考訳）: 顔認識(FR)技術の普及により、顔認識データが同意なく活用されるため、深刻なプライバシー上の懸念が生じる。この課題に対処するために、ターゲットのアイデンティティを偽装し、顔認識システムを欺くことができるフォトリアリスティック顔を合成する生成的対向攻撃フレームワークAdv-TGDを提案する。安定拡散 v2.1 に基づいて、Adv-TGD は簡潔なテキストプロンプトに基づくサンプルごとのLORA微調整を実行し、自然に反抗的に操作されたIDを生成する。従来のアイデンティティアタック手法とは異なり,本手法では,固定時間段階のデノナイズプロセスにおいて,各ソースとターゲットのペアに対して,軽量なクロスアテンションアダプタを最適化する。潜水ブレンディングは、非感性領域を保持しながら空間的に正確なアイデンティティ操作を確保するために、面局所ヒートマップマスクによって拘束される。本研究では,マスク付きエプシロン-MSE再構成,FR埋め込み空間におけるしきい値の偏差,指向性特徴アライメント,ソース類似性抑制を統合して,対向攻撃と視覚リアリズムのバランスをとる複合目標を提案する。オプションとして、LLaVA生成属性は、アイデンティティーキューを再導入することなく、きめ細かなセマンティック詳細を促進させる。ブラックボックス評価プロトコルの下で、Adv-TGDはIR152、IRSE50、MobileFace、FaceNetで平均攻撃成功率85.90%に達し、意味的なSOTAベースラインAdv-CPGを6.25ポイント、拡散ベースのメイクメソッドDiffAIMを3ポイント、ノイズベースのP3-Maskを16ポイント超えた。強力な攻撃効果にもかかわらず、Adv-TGDは高い視力を維持する(PSNR = 28.18 dB、SSIM = 0.981)。さらに,このフレームワークの柔軟性を,組込みデータセット(LADN),汎用オブジェクト分類(ImageNet),トランスフォーマーベース拡散モデル(FLUX.1)に拡張することで実証する。

論文の概要: Adv-TGD: Adversarial Text-Guided Diffusion for Face Recognition Impersonation Attacks

関連論文リスト