Fugu-MT 論文翻訳(概要): Backdooring Self-Supervised Contrastive Learning by Noisy Alignment

論文の概要: Backdooring Self-Supervised Contrastive Learning by Noisy Alignment

arxiv url: http://arxiv.org/abs/2508.14015v1
Date: Tue, 19 Aug 2025 17:25:43 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-20 15:36:32.029365
Title: Backdooring Self-Supervised Contrastive Learning by Noisy Alignment
Title（参考訳）: ノイズアライメントによる自己監督型コントラスト学習のバックドア化
Authors: Tuo Chen, Jie Gui, Minjing Dong, Ju Jia, Lanting Fang, Jian Liu,
Abstract要約: 自己教師付きコントラスト学習(CL)は、画像や画像とテキストのペアを含むラベルのないデータから転送可能な表現を効果的に学習する。敵は、有毒な画像を事前訓練されたデータセットに注入することができ、妥協したCLエンコーダが下流のタスクで誤った振る舞いを示す。有毒画像の雑音成分を明示的に抑制するDPCL法であるノイズアライメント(NA)を提案する。
参考スコア（独自算出の注目度）: 19.58754341352538
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Self-supervised contrastive learning (CL) effectively learns transferable representations from unlabeled data containing images or image-text pairs but suffers vulnerability to data poisoning backdoor attacks (DPCLs). An adversary can inject poisoned images into pretraining datasets, causing compromised CL encoders to exhibit targeted misbehavior in downstream tasks. Existing DPCLs, however, achieve limited efficacy due to their dependence on fragile implicit co-occurrence between backdoor and target object and inadequate suppression of discriminative features in backdoored images. We propose Noisy Alignment (NA), a DPCL method that explicitly suppresses noise components in poisoned images. Inspired by powerful training-controllable CL attacks, we identify and extract the critical objective of noisy alignment, adapting it effectively into data-poisoning scenarios. Our method implements noisy alignment by strategically manipulating contrastive learning's random cropping mechanism, formulating this process as an image layout optimization problem with theoretically derived optimal parameters. The resulting method is simple yet effective, achieving state-of-the-art performance compared to existing DPCLs, while maintaining clean-data accuracy. Furthermore, Noisy Alignment demonstrates robustness against common backdoor defenses. Codes can be found at https://github.com/jsrdcht/Noisy-Alignment.
Abstract（参考訳）: 自己教師付きコントラスト学習(CL)は、画像や画像とテキストのペアを含むラベルのないデータから、転送可能な表現を効果的に学習するが、バックドア攻撃(DPCL)に弱い。敵は、有毒な画像を事前訓練されたデータセットに注入することができ、妥協したCLエンコーダが下流のタスクで標的の誤動作を示す。しかし、既存のDPCLは、バックドアとターゲットオブジェクト間の脆弱な暗黙的共起への依存と、バックドア画像における差別的特徴の不十分な抑制により、限られた有効性が得られる。有毒画像の雑音成分を明示的に抑制するDPCL法であるノイズアライメント(NA)を提案する。強力な訓練制御可能なCL攻撃に触発されて、ノイズアライメントの重要目標を特定し、抽出し、データ汚染シナリオに効果的に適用する。提案手法は, 理論的に導出された最適パラメータを用いた画像レイアウト最適化問題として, コントラスト学習のランダムトリミング機構を戦略的に操作することにより, ノイズアライメントを実現する。得られた手法は単純だが有効であり、クリーンなデータの精度を維持しつつ、既存のDPCLと比較して最先端の性能を達成する。さらに、ノイズアライメントは一般的なバックドア防御に対する堅牢性を示す。コードはhttps://github.com/jsrdcht/Noisy-Alignmentで見ることができる。

論文の概要: Backdooring Self-Supervised Contrastive Learning by Noisy Alignment

関連論文リスト