Fugu-MT 論文翻訳(概要): Exact Certification of Neural Networks and Partition Aggregation Ensembles against Label Poisoning

論文の概要: Exact Certification of Neural Networks and Partition Aggregation Ensembles against Label Poisoning

arxiv url: http://arxiv.org/abs/2604.11416v1
Date: Mon, 13 Apr 2026 13:01:14 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-14 20:13:16.545938
Title: Exact Certification of Neural Networks and Partition Aggregation Ensembles against Label Poisoning
Title（参考訳）: ニューラルネットワークの厳密な認証とラベル中毒対策のための分割集約
Authors: Ajinkya Mohgaonkar, Lukas Gosch, Mahalakshmi Sabanayagam, Debarghya Ghoshdastidar, Stephan Günnemann,
Abstract要約: ラベルフライング攻撃は教師付き学習モデルにとって大きな脅威である。既存の認証フレームワークはスムーズ化やパーティション集約といったアンサンブル技術に依存している。パーティション・アグリゲーション・アンサンブルの最初の認証フレームワークであるEnsembleCertを紹介します。
参考スコア（独自算出の注目度）: 48.86044009211734
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Label-flipping attacks, which corrupt training labels to induce misclassifications at inference, remain a major threat to supervised learning models. This drives the need for robustness certificates that provide formal guarantees about a model's robustness under adversarially corrupted labels. Existing certification frameworks rely on ensemble techniques such as smoothing or partition-aggregation, but treat the corresponding base classifiers as black boxes, yielding overly conservative guarantees. We introduce EnsembleCert, the first certification framework for partition-aggregation ensembles that utilizes white-box knowledge of the base classifiers. Concretely, EnsembleCert yields tighter guarantees than black-box approaches by aggregating per-partition white-box certificates to compute ensemble-level guarantees in polynomial time. To extract white-box knowledge from the base classifiers efficiently, we develop ScaLabelCert, a method that leverages the equivalence between sufficiently wide neural networks and kernel methods using the neural tangent kernel. ScaLabelCert yields the first exact, polynomial-time calculable certificate for neural networks against label-flipping attacks. EnsembleCert is either on par, or significantly outperforms the existing partition-based black box certificates. Exemplary, on CIFAR-10, our method can certify upto +26.5% more label flips in median over the test set compared to the existing black-box approach while requiring 100 times fewer partitions, thus, challenging the prevailing notion that heavy partitioning is a necessity for strong certified robustness.
Abstract（参考訳）: ラベルフリッピング攻撃は、推論時に誤分類を誘発するためにトレーニングラベルを破損させるが、教師付き学習モデルにとって大きな脅威である。これにより、逆向きに破損したラベルの下で、モデルの堅牢性に関する公式な保証を提供する堅牢性証明書の必要性が促進される。既存の認証フレームワークはスムーズ化や分割集約といったアンサンブル技術に依存しているが、対応する基本分類器をブラックボックスとして扱い、過度に保守的な保証をもたらす。本稿では,基本分類器のホワイトボックス知識を利用した分割集約アンサンブルの最初の認証フレームワークであるEnsembleCertを紹介する。具体的には、EnsembleCertは、多項式時間でアンサンブルレベルの保証を計算するために、パーティションごとのホワイトボックス証明書を集約することで、ブラックボックスアプローチよりも厳密な保証を得る。ベース分類器からホワイトボックス知識を効率的に抽出するために,十分に広いニューラルネットワークとニューラルネットワークを用いたカーネル手法の等価性を活用する手法であるScaLabelCertを開発した。 ScaLabelCertは、ラベルフライング攻撃に対するニューラルネットワークのための、最初の正確な多項式時間計算可能な証明書を提供する。 EnsembleCertは同等か、既存のパーティションベースのブラックボックス証明書よりもはるかに優れている。例えば、CIFAR-10では、既存のブラックボックスアプローチと比較してテストセットの中央値に対して、最大で26.5%以上のラベルフリップを証明できるが、100倍のパーティショニングを必要とするため、高いパーティショニングが強力な信頼性のロバスト性に欠かせないという一般的な考え方に挑戦する。

論文の概要: Exact Certification of Neural Networks and Partition Aggregation Ensembles against Label Poisoning

関連論文リスト