Fugu-MT 論文翻訳(概要): Addressing Key Challenges of Adversarial Attacks and Defenses in the Tabular Domain: A Methodological Framework for Coherence and Consistency

論文の概要: Addressing Key Challenges of Adversarial Attacks and Defenses in the Tabular Domain: A Methodological Framework for Coherence and Consistency

arxiv url: http://arxiv.org/abs/2412.07326v2
Date: Tue, 03 Jun 2025 21:06:19 GMT
ステータス: 翻訳完了
システム内更新日: 2025-06-05 16:24:48.787813
Title: Addressing Key Challenges of Adversarial Attacks and Defenses in the Tabular Domain: A Methodological Framework for Coherence and Consistency
Title（参考訳）: タブラルドメインにおける敵攻撃と防衛の鍵的課題--一貫性と一貫性のための方法論的枠組み
Authors: Yael Itzhakev, Amit Giloni, Yuval Elovici, Asaf Shabtai,
Abstract要約: CSAD(Class-Specific Anomaly Detection)は,新しい異常検出手法である。 CSADは, 広い良性分布ではなく, 予測されたクラス分布に対して, 対数サンプルを評価する。本評価では, 異常検出率とSHAPに基づく評価を併用し, 対向検体品質のより包括的測定を行う。
参考スコア（独自算出の注目度）: 26.645723217188323
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Machine learning models trained on tabular data are vulnerable to adversarial attacks, even in realistic scenarios where attackers only have access to the model's outputs. Since tabular data contains complex interdependencies among features, it presents a unique challenge for adversarial samples which must maintain coherence and respect these interdependencies to remain indistinguishable from benign data. Moreover, existing attack evaluation metrics-such as the success rate, perturbation magnitude, and query count-fail to account for this challenge. To address those gaps, we propose a technique for perturbing dependent features while preserving sample coherence. In addition, we introduce Class-Specific Anomaly Detection (CSAD), an effective novel anomaly detection approach, along with concrete metrics for assessing the quality of tabular adversarial attacks. CSAD evaluates adversarial samples relative to their predicted class distribution, rather than a broad benign distribution. It ensures that subtle adversarial perturbations, which may appear coherent in other classes, are correctly identified as anomalies. We integrate SHAP explainability techniques to detect inconsistencies in model decision-making, extending CSAD for SHAP-based anomaly detection. Our evaluation incorporates both anomaly detection rates with SHAP-based assessments to provide a more comprehensive measure of adversarial sample quality. We evaluate various attack strategies, examining black-box query-based and transferability-based gradient attacks across four target models. Experiments on benchmark tabular datasets reveal key differences in the attacker's risk and effort and attack quality, offering insights into the strengths, limitations, and trade-offs faced by attackers and defenders. Our findings lay the groundwork for future research on adversarial attacks and defense development in the tabular domain.
Abstract（参考訳）: 表形式のデータに基づいてトレーニングされた機械学習モデルは、攻撃者がモデルの出力にのみアクセス可能な現実的なシナリオであっても、敵攻撃に対して脆弱である。表形式のデータには特徴間の複雑な相互依存性が含まれているため、一貫性を維持し、それらの相互依存性を尊重し、良質なデータと区別できないままにしておく必要がある、敵対的なサンプルに対してユニークな課題が提示される。さらに、既存の攻撃評価指標(成功率、摂動の大きさ、クエリ数障害など)も、この課題を考慮に入れている。これらのギャップに対処するために,サンプルコヒーレンスを保ちながら依存する特徴を摂動する手法を提案する。さらに,新たな異常検出手法であるCSAD(Class-Specific Anomaly Detection)を導入するとともに,表層攻撃の質を評価するための具体的な指標も紹介する。 CSADは, 広い良性分布ではなく, 予測されたクラス分布に対して, 対数サンプルを評価する。これは、他のクラスでコヒーレントに見える微妙な敵対的摂動が、異常として正しく識別されることを保証する。我々は、モデル決定の不整合を検出するためのSHAP説明可能性技術を統合し、SHAPに基づく異常検出のためのCSADを拡張した。本評価では, 異常検出率とSHAPに基づく評価を併用し, 対向検体品質のより包括的測定を行う。ブラックボックス・クエリー・ベースとトランスファービリティー・ベース・グラデーション・アタックを4つのターゲットモデルで検討し,様々な攻撃戦略を評価する。ベンチマーク表データセットの実験は、攻撃者のリスクと労力と攻撃品質に重要な違いを示し、攻撃者やディフェンダーが直面する強み、制限、トレードオフに関する洞察を提供する。本研究は,テーブル状領域における敵攻撃と防衛開発に関する今後の研究の基盤となるものである。

論文の概要: Addressing Key Challenges of Adversarial Attacks and Defenses in the Tabular Domain: A Methodological Framework for Coherence and Consistency

関連論文リスト