Fugu-MT 論文翻訳(概要): Measuring Equality in Machine Learning Security Defenses

論文の概要: Measuring Equality in Machine Learning Security Defenses

arxiv url: http://arxiv.org/abs/2302.08973v5
Date: Thu, 1 Jun 2023 03:17:29 GMT
ステータス: 翻訳完了
システム内更新日: 2023-06-03 00:26:09.611928
Title: Measuring Equality in Machine Learning Security Defenses
Title（参考訳）: マシンラーニングのセキュリティ防衛における品質測定
Authors: Luke E. Richards, Edward Raff, Cynthia Matuszek
Abstract要約: 機械学習のセキュリティコミュニティは、過去10年にわたって、回避攻撃のための無数の防御を開発してきた。このコミュニティの根底にある疑問は: この防衛は、誰のために防御されるのか? 機械学習のセキュリティ手法の公正性に関する実証的な結果を通じて、この問題に答えることのできる単純なパリティ指標と分析のためのフレームワークについて概説する。
参考スコア（独自算出の注目度）: 39.96812311550351
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The machine learning security community has developed myriad defenses for evasion attacks over the past decade. An understudied question in that community is: for whom do these defenses defend? In this work, we consider some common approaches to defending learned systems and whether those approaches may offer unexpected performance inequities when used by different sub-populations. We outline simple parity metrics and a framework for analysis that can begin to answer this question through empirical results of the fairness implications of machine learning security methods. Many methods have been proposed that can cause direct harm, which we describe as biased vulnerability and biased rejection. Our framework and metric can be applied to robustly trained models, preprocessing-based methods, and rejection methods to capture behavior over security budgets. We identify a realistic dataset with a reasonable computational cost suitable for measuring the equality of defenses. Through a case study in speech command recognition, we show how such defenses do not offer equal protection for social subgroups and how to perform such analyses for robustness training, and we present a comparison of fairness between two rejection-based defenses: randomized smoothing and neural rejection. We offer further analysis of factors that correlate to equitable defenses to stimulate the future investigation of how to assist in building such defenses. To the best of our knowledge, this is the first work that examines the fairness disparity in the accuracy-robustness trade-off in speech data and addresses fairness evaluation for rejection-based defenses.
Abstract（参考訳）: 機械学習セキュリティコミュニティは、過去10年間にわたって、回避攻撃に対する無数の防御を開発してきた。このコミュニティの未熟な疑問は: この防御策は誰を擁護するのか? 本研究では,学習システムを守るための一般的なアプローチと,それらのアプローチが異なるサブ集団で使用する場合,予期せぬ性能の低下をもたらすかどうかを検討する。機械学習のセキュリティ手法の公正性に関する実証的な結果を通じて、この問題に答えることのできる単純なパリティ指標と分析のためのフレームワークについて概説する。直接的害を引き起こす可能性のある多くの手法が提案されているが、これは偏りの脆弱性と偏りの拒絶である。当社のフレームワークとメトリックは,堅牢にトレーニングされたモデル,前処理ベースのメソッド,セキュリティ予算を越える行動をキャプチャする拒否メソッドにも適用可能です。我々は、防衛の平等性を測定するのに適した合理的な計算コストを持つ現実的なデータセットを特定する。音声認識におけるケーススタディを通じて、このような防衛が社会的サブグループに対して平等な保護を提供していないこと、そして、ロバストネストレーニングのためにそのような分析を行う方法を示し、ランダム化スムーシングとニューラルリジェクションの2つの拒否に基づく防御の公平性の比較を示す。我々は, 公平防衛と相関する要因のさらなる分析を行い, 今後, 防衛体制の整備を支援する方法についての調査を促進させる。我々の知る限りでは、音声データの正確さと損益性とのトレードオフの公平性を検証し、拒絶に基づく防御に対する公平性評価に対処する最初の研究である。

論文の概要: Measuring Equality in Machine Learning Security Defenses

関連論文リスト