Fugu-MT Paper Translation (Abstract): Beyond Corner Patches: Semantics-Aware Backdoor Attack in Federated Learning

Paper overview: Beyond Corner Patches: Semantics-Aware Backdoor Attack in Federated Learning

arxiv url: http://arxiv.org/abs/2603.29328v3
Date: Tue, 07 Apr 2026 04:38:05 GMT
Status: Translation complete
System update date: 2026-04-08 15:04:55.435781
Title: Beyond Corner Patches: Semantics-Aware Backdoor Attack in Federated Learning
Title (reference translation): Beyond Corner Patches: Semantics-Aware Backdoor Attacks in Federated Learning
Authors: Kavindu Herath, Joshua Zhao, Saurabh Bagchi
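Abstract summary: Backdoor attacks on federated learning (FL) are most often evaluated with synthetic corner patches or out-of-distribution patterns. The authors propose SABLE, a Semantics-Aware Backdoor for LEarning in federated settings. Its semantics-driven triggers achieve high targeted attack success rates while preserving benign test accuracy.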
Reference score (independently computed attention score): 6.76324539337304
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Backdoor attacks on federated learning (FL) are most often evaluated with synthetic corner patches or out-of-distribution (OOD) patterns that are unlikely to arise in practice. In this paper, we revisit the backdoor threat to standard FL (a single global model) under a more realistic setting where triggers must be semantically meaningful, in-distribution, and visually plausible. We propose SABLE, a Semantics-Aware Backdoor for LEarning in federated settings, which constructs natural, content-consistent triggers (e.g., semantic attribute changes such as sunglasses) and optimizes an aggregation-aware malicious objective with feature separation and parameter regularization to keep attacker updates close to benign ones. We instantiate SABLE on CelebA hair-color classification and the German Traffic Sign Recognition Benchmark (GTSRB), poisoning only a small, interpretable subset of each malicious client's local data while otherwise following the standard FL protocol. Across heterogeneous client partitions and multiple aggregation rules (FedAvg, Trimmed Mean, MultiKrum, and FLAME), our semantics-driven triggers achieve high targeted attack success rates while preserving benign test accuracy. These results show that semantics-aligned backdoors remain a potent and practical threat in federated learning, and that robustness claims based solely on synthetic patch triggers can be overly optimistic.
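Abstract (reference translation): Backdoor attacks on federated learning (FL) are most often evaluated with synthetic corner patches or out-of-distribution (OOD) patterns that are unlikely to arise in practice. This paper revisits the backdoor threat to standard FL (a single global model) under a more realistic setting in which triggers must be semantically meaningful, in-distribution, and visually plausible. The authors propose SABLE, a Semantics-Aware Backdoor for LEarning in federated settings, which constructs natural, content-consistent triggers (for example, semantic attribute changes such as sunglasses) and optimizes an aggregation-aware malicious objective with feature separation and parameter regularization to keep attacker updates close to benign ones. SABLE is instantiated on CelebA hair-color classification and the German Traffic Sign Recognition Benchmark (GTSRB), poisoning only a small, interpretable subset of each malicious client's local data while otherwise following the standard FL protocol. Across heterogeneous client partitions and multiple aggregation rules (FedAvg, Trimmed Mean, MultiKrum, and FLAME), the semantics-driven triggers achieve high targeted attack success rates while preserving benign test accuracy. These results suggest that semantics-aligned backdoors remain a potent and practical threat in federated learning, and that robustness claims based solely on synthetic patch triggers can be overly optimistic.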
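
Two of the aggregation rules named in the abstract, FedAvg and Trimmed Mean, can be illustrated with a minimal sketch. This is not the paper's code: the function names, the flat-list representation of a client update, and the toy values are all assumptions made for illustration. It shows only the general idea that a coordinate-wise trimmed mean discards extreme values, which is why an update that stays close to benign ones is harder for such a rule to filter.

```python
from typing import List, Sequence


def fedavg(updates: Sequence[Sequence[float]], weights: Sequence[float]) -> List[float]:
    """Weighted average of client updates (weights are, e.g., local sample counts)."""
    total = float(sum(weights))
    dim = len(updates[0])
    return [
        sum(w * u[i] for u, w in zip(updates, weights)) / total
        for i in range(dim)
    ]


def trimmed_mean(updates: Sequence[Sequence[float]], trim: int) -> List[float]:
    """Coordinate-wise trimmed mean: drop the `trim` smallest and largest values per coordinate."""
    n = len(updates)
    if 2 * trim >= n:
        raise ValueError("trim must leave at least one value per coordinate")
    dim = len(updates[0])
    result = []
    for i in range(dim):
        column = sorted(u[i] for u in updates)
        kept = column[trim:n - trim]
        result.append(sum(kept) / len(kept))
    return result


# Hypothetical toy updates: four benign clients and one obvious outlier.
benign = [[0.10, -0.20], [0.12, -0.18], [0.09, -0.22], [0.11, -0.19]]
outlier = [5.00, 4.00]
clients = benign + [outlier]

avg = fedavg(clients, weights=[1, 1, 1, 1, 1])   # pulled toward the outlier
robust = trimmed_mean(clients, trim=1)           # outlier values are trimmed away
```

In this toy case the plain average is dragged toward the outlier, while the trimmed mean stays near the benign values. MultiKrum and FLAME use different selection and filtering logic and are not sketched here.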


Related papers