Fugu-MT Paper Translation (Abstract): Backdoor Attacks and Defenses in Computer Vision Domain: A Survey

Paper overview: Backdoor Attacks and Defenses in Computer Vision Domain: A Survey

arxiv url: http://arxiv.org/abs/2509.07504v1
Date: Tue, 09 Sep 2025 08:38:05 GMT
Status: Translation complete
Last updated in system: 2025-09-10 14:38:27.231356
Title: Backdoor Attacks and Defenses in Computer Vision Domain: A Survey
Authors: Bilal Hussain Abbasi, Yanjun Zhang, Leo Zhang, Shang Gao
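Abstract summary: Backdoor attacks embed hidden, controllable behaviors into machine-learning models. This survey reviews the rapidly growing literature on backdoor attacks and defenses in the computer-vision domain.
Reference score (independently computed attention score): 12.384887829851834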
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Backdoor (trojan) attacks embed hidden, controllable behaviors into machine-learning models so that models behave normally on benign inputs but produce attacker-chosen outputs when a trigger is present. This survey reviews the rapidly growing literature on backdoor attacks and defenses in the computer-vision domain. We introduce a multi-dimensional taxonomy that organizes attacks and defenses by injection stage (dataset poisoning, model/parameter modification, inference-time injection), trigger type (patch, blended/frequency, semantic, transformation), labeling strategy (dirty-label vs. clean-label / feature-collision), representation stage (instance-specific, manifold/class-level, neuron/parameter hijacking, distributed encodings), and target task (classification, detection, segmentation, video, multimodal). For each axis we summarize representative methods, highlight evaluation practices, and discuss where defenses succeed or fail. For example, many classical sanitization and reverse-engineering tools are effective against reusable patch attacks but struggle with input-aware, sample-specific, or parameter-space backdoors and with transfer via compromised pre-trained encoders or hardware bit-flips. We synthesize trends, identify persistent gaps (supply-chain and hardware threats, certifiable defenses, cross-task benchmarks), and propose practical guidelines for threat-aware evaluation and layered defenses. This survey aims to orient researchers and practitioners to the current threat landscape and pressing research directions in secure computer vision.
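
As a rough illustration of one cell of the taxonomy described in the abstract (dataset poisoning, patch trigger, dirty-label strategy), the following Python sketch stamps a fixed patch onto a fraction of training images and relabels them to an attacker-chosen class. It is not taken from the survey: the function names `stamp_patch` and `poison_dataset`, the bottom-right patch position, the patch size, and the poisoning rate are all assumptions chosen for illustration.

```python
import numpy as np

def stamp_patch(images, patch_size=3, value=1.0):
    """Return a copy of `images` (shape N x H x W) with a square patch
    of constant `value` written into the bottom-right corner."""
    out = images.copy()
    out[:, -patch_size:, -patch_size:] = value
    return out

def poison_dataset(images, labels, target_label, rate=0.1, seed=0):
    """Dirty-label poisoning sketch: stamp the patch on a random subset
    of samples and set their labels to `target_label`.

    Returns the poisoned images, the poisoned labels, and the indices
    of the modified samples. The inputs are left unchanged.
    """
    rng = np.random.default_rng(seed)
    n = len(images)
    n_poison = int(round(rate * n))
    idx = rng.choice(n, size=n_poison, replace=False)
    x = images.copy()
    y = labels.copy()
    x[idx] = stamp_patch(images[idx])
    y[idx] = target_label
    return x, y, idx

# Toy data (hypothetical): 20 blank 8x8 "images" spread over 4 classes.
images = np.zeros((20, 8, 8), dtype=np.float32)
labels = np.arange(20) % 4
x_p, y_p, idx = poison_dataset(images, labels, target_label=0, rate=0.25)
```

A model trained on `x_p`, `y_p` would be expected to behave normally on unpatched inputs while associating the patch with class 0. A sketch like this covers only the reusable-patch case, which the abstract notes is the setting where classical sanitization and reverse-engineering defenses tend to work best.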
