Fugu-MT 論文翻訳(概要): Deep Partition Aggregation: Provable Defense against General Poisoning Attacks

論文の概要: Deep Partition Aggregation: Provable Defense against General Poisoning Attacks

arxiv url: http://arxiv.org/abs/2006.14768v2
Date: Thu, 18 Mar 2021 05:50:12 GMT
ステータス: 翻訳完了
システム内更新日: 2022-11-16 20:55:15.809359
Title: Deep Partition Aggregation: Provable Defense against General Poisoning Attacks
Title（参考訳）: ディープパーティショニングアグリゲーション:一般的な中毒攻撃に対する防御性
Authors: Alexander Levine, Soheil Feizi
Abstract要約: アドリアリン中毒は、分類器の試験時間挙動を損なうために訓練データを歪ませる。毒殺攻撃に対する2つの新たな防御策を提案する。 DPAは一般的な中毒脅威モデルに対する認証された防御である。 SS-DPAはラベルフリップ攻撃に対する認証された防御である。
参考スコア（独自算出の注目度）: 136.79415677706612
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Adversarial poisoning attacks distort training data in order to corrupt the test-time behavior of a classifier. A provable defense provides a certificate for each test sample, which is a lower bound on the magnitude of any adversarial distortion of the training set that can corrupt the test sample's classification. We propose two novel provable defenses against poisoning attacks: (i) Deep Partition Aggregation (DPA), a certified defense against a general poisoning threat model, defined as the insertion or deletion of a bounded number of samples to the training set -- by implication, this threat model also includes arbitrary distortions to a bounded number of images and/or labels; and (ii) Semi-Supervised DPA (SS-DPA), a certified defense against label-flipping poisoning attacks. DPA is an ensemble method where base models are trained on partitions of the training set determined by a hash function. DPA is related to both subset aggregation, a well-studied ensemble method in classical machine learning, as well as to randomized smoothing, a popular provable defense against evasion attacks. Our defense against label-flipping attacks, SS-DPA, uses a semi-supervised learning algorithm as its base classifier model: each base classifier is trained using the entire unlabeled training set in addition to the labels for a partition. SS-DPA significantly outperforms the existing certified defense for label-flipping attacks on both MNIST and CIFAR-10: provably tolerating, for at least half of test images, over 600 label flips (vs. < 200 label flips) on MNIST and over 300 label flips (vs. 175 label flips) on CIFAR-10. Against general poisoning attacks, where no prior certified defenses exists, DPA can certify >= 50% of test images against over 500 poison image insertions on MNIST, and nine insertions on CIFAR-10. These results establish new state-of-the-art provable defenses against poisoning attacks.
Abstract（参考訳）: 逆毒は、分類器の試験時間挙動を損なうために歪んだ訓練データを攻撃する。証明可能な防御は、各テストサンプルの証明書を提供するが、これは、テストサンプルの分類を損なう可能性のあるトレーニングセットの敵対的歪みの大きさに対する下限である。我々は2つの新たな防犯策を提案する。 (i) 一般中毒脅威モデルに対する認定防御である深層分割集約(dpa)は、トレーニングセットへの有界なサンプル数の挿入または削除として定義されており、この脅威モデルは、有界な画像及び/又はラベルに対する任意の歪みを含む。 (ii)半監督dpa(ss-dpa)は、ラベルを貼る毒殺攻撃に対する認定防御である。 dpaは、ハッシュ関数によって決定されるトレーニングセットのパーティションに基づいてベースモデルをトレーニングするアンサンブル手法である。 dpaは、古典的な機械学習でよく研究されたアンサンブルであるサブセットアグリゲーションと、回避攻撃に対する一般的な証明可能な防御であるランダム化スムージングの両方に関連している。 SS-DPAは半教師付き学習アルゴリズムをベース分類器モデルとして使用しており、各基本分類器は分割のためのラベルに加えてラベル付きトレーニングセット全体を用いて訓練される。 SS-DPAは、MNISTとCIFAR-10の双方に対するラベルフリップ攻撃に対する既存の認証された防御よりも、MNIST上の600以上のラベルフリップ(vs. < 200のラベルフリップ)と、CIFAR-10上の300以上のラベルフリップ(vs. 175のラベルフリップ)において、確実に許容できる。事前に認証された防御が存在しない一般的な中毒攻撃に対して、dpaはテスト画像の50%以上をmnistに500以上の毒画像が挿入され、cifar-10に9回挿入されたことを証明できる。これらの結果は、毒殺攻撃に対する新しい最先端証明可能な防御を確立する。

論文の概要: Deep Partition Aggregation: Provable Defense against General Poisoning Attacks

関連論文リスト