Fugu-MT 論文翻訳(概要): Certifiers Make Neural Networks Vulnerable to Availability Attacks

論文の概要: Certifiers Make Neural Networks Vulnerable to Availability Attacks

arxiv url: http://arxiv.org/abs/2108.11299v5
Date: Tue, 3 Oct 2023 13:08:50 GMT
ステータス: 翻訳完了
システム内更新日: 2023-10-05 11:19:10.654686
Title: Certifiers Make Neural Networks Vulnerable to Availability Attacks
Title（参考訳）: ニューラルネットワークのアベイラビリティアタックに対する脆弱性を認定する
Authors: Tobias Lorenz, Marta Kwiatkowska, Mario Fritz
Abstract要約: 私たちは初めて、逆転戦略が敵によって意図的に引き起こされる可能性があることを示します。いくつかの入力や摂動のために自然に発生する障害に加えて、敵は故意にフォールバックを誘発するために訓練時間攻撃を使用することができる。我々は2つの新しいアベイラビリティーアタックを設計し、これらの脅威の実用的妥当性を示す。
参考スコア（独自算出の注目度）: 70.69104148250614
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: To achieve reliable, robust, and safe AI systems, it is vital to implement fallback strategies when AI predictions cannot be trusted. Certifiers for neural networks are a reliable way to check the robustness of these predictions. They guarantee for some predictions that a certain class of manipulations or attacks could not have changed the outcome. For the remaining predictions without guarantees, the method abstains from making a prediction, and a fallback strategy needs to be invoked, which typically incurs additional costs, can require a human operator, or even fail to provide any prediction. While this is a key concept towards safe and secure AI, we show for the first time that this approach comes with its own security risks, as such fallback strategies can be deliberately triggered by an adversary. In addition to naturally occurring abstains for some inputs and perturbations, the adversary can use training-time attacks to deliberately trigger the fallback with high probability. This transfers the main system load onto the fallback, reducing the overall system's integrity and/or availability. We design two novel availability attacks, which show the practical relevance of these threats. For example, adding 1% poisoned data during training is sufficient to trigger the fallback and hence make the model unavailable for up to 100% of all inputs by inserting the trigger. Our extensive experiments across multiple datasets, model architectures, and certifiers demonstrate the broad applicability of these attacks. An initial investigation into potential defenses shows that current approaches are insufficient to mitigate the issue, highlighting the need for new, specific solutions.
Abstract（参考訳）: 信頼性があり堅牢で安全なAIシステムを実現するためには、AI予測が信頼できない場合にフォールバック戦略を実装することが不可欠である。ニューラルネットワークの証明器は、これらの予測の堅牢性をチェックする信頼できる方法である。彼らは特定の種類の操作や攻撃が結果を変えられないという予測を保証した。保証のない残りの予測では、この方法は予測を妨げ、通常追加のコストを発生させるフォールバック戦略を呼び出す必要がある。これは安全でセキュアなAIに対する重要な概念ですが、このアプローチが独自のセキュリティリスクを伴っていることを初めて示します。いくつかの入力や摂動に対して自然に発生する障害に加えて、敵は訓練時攻撃を使用して、高い確率でフォールバックを意図的に引き起こすことができる。これにより、メインシステムの負荷をフォールバックに転送し、システム全体の整合性と可用性を低下させる。我々は2つの新しいアベイラビリティーアタックを設計し、これらの脅威の実用的妥当性を示す。例えば、トレーニング中に1%の有毒データを追加することは、フォールバックを引き起こすのに十分であり、モデルがトリガーを挿入することで、入力の最大100%を利用できなくなる。複数のデータセット、モデルアーキテクチャ、証明器をまたいだ広範な実験は、これらの攻撃の幅広い適用性を示しています。潜在的な防衛に関する最初の調査は、現在のアプローチが問題を緩和するには不十分であることを示し、新しい特定のソリューションの必要性を強調している。

論文の概要: Certifiers Make Neural Networks Vulnerable to Availability Attacks

関連論文リスト