Fugu-MT 論文翻訳(概要): Toward Trustworthy AI: Multi-Target Adversarial Attacks and Robust Defenses for Continuous Data Summarization

論文の概要: Toward Trustworthy AI: Multi-Target Adversarial Attacks and Robust Defenses for Continuous Data Summarization

arxiv url: http://arxiv.org/abs/2606.11804v1
Date: Wed, 10 Jun 2026 08:38:27 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-11 16:42:38.365955
Title: Toward Trustworthy AI: Multi-Target Adversarial Attacks and Robust Defenses for Continuous Data Summarization
Title（参考訳）: 信頼できるAIに向けて - 継続的データ要約のためのマルチターゲット敵攻撃とロバストディフェンス
Authors: Yuefang Lian, Longkun Guo, Zhongrui Zhao, Zhigang Lu, Yanan Cai, Shuchao Pang, Dachuan Xu, Jason Xue,
Abstract要約: 要約プロセスに対する敵対的な摂動は、上流で信頼できるAIを妥協する可能性がある。本稿では,DR-サブモジュラー最適化による類似度レベルの摂動下での連続データ要約に対する逆攻撃について検討する。非負のサブモジュラー集合関数の多重線型拡張として、多重解像度画像要約対象のクラスが定式化可能であることを示す。次に、類似構造の許容摂動を最適化し、複数のターゲットの要約モデルを分解する、min-max問題としてマルチターゲット攻撃生成を定式化する。
参考スコア（独自算出の注目度）: 8.957420305238177
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Trustworthy AI requires reliable data-processing pipelines, not only robust downstream predictive models. As an upstream component, data summarization determines which information is retained and passed to subsequent learning or decision modules. Therefore, adversarial perturbations to the summarization process can compromise trustworthy AI in an upstream manner: they may alter the selected summary, reduce its representativeness, and further degrade the utility of subsequent learning tasks. In this paper, we study adversarial attacks on continuous data summarization under similarity-level perturbations through DR-submodular optimization. We show that a class of multi-resolution image summarization objectives can be formulated as multilinear extensions of non-negative submodular set functions and satisfy DR-submodularity with $m$-weak monotonicity. We then formulate multi-target attack generation as a min-max problem, where one admissible perturbation of the similarity structure is optimized to degrade multiple target summarization models. To mitigate such perturbations, we formulate robust defense against mixed attack types as a regularized max-min problem. For both problems, we develop approximation algorithms with theoretical guarantees. Experiments on real-data and controlled clustered benchmarks show that the proposed attack is effective in representative low-to-moderate budget regimes and can induce downstream task-performance loss. The proposed defense improves the robustness--mitigation trade-off in structured settings, while also revealing the parameter sensitivity of robust protection on real data.
Abstract（参考訳）: 信頼できるAIには、堅牢な下流予測モデルだけでなく、信頼できるデータ処理パイプラインが必要です。上流コンポーネントとして、データの要約はどの情報が保持され、その後の学習や決定モジュールに渡されるかを決定する。したがって、要約プロセスに対する敵対的摂動は、選択された要約を変更し、その代表性を減らし、さらにその後の学習タスクの有用性を低下させるという、信頼できるAIを上流的に妥協することができる。本稿では,DR-サブモジュラー最適化による類似度レベルの摂動下での連続データ要約に対する逆攻撃について検討する。非負のサブモジュラー集合関数の多重線型拡張として定式化することができ、DR-サブモジュラリティを$m$-weak monotonicityで満足することを示す。次に、類似構造の許容摂動を最適化し、複数のターゲットの要約モデルを分解する、min-max問題としてマルチターゲット攻撃生成を定式化する。このような摂動を緩和するために、我々は正則化された最大ミン問題として混合攻撃タイプに対する堅牢な防御を定式化する。両問題に対して,理論的保証付き近似アルゴリズムを開発した。実データおよびクラスタ化ベンチマークによる実験により,提案手法は低調な予算体系に有効であり,下流のタスクパフォーマンス損失を誘発できることが示された。提案したディフェンスは、構造化された設定におけるロバスト性-緩和トレードオフを改善するとともに、実データに対するロバスト保護のパラメータ感度を明らかにする。

論文の概要: Toward Trustworthy AI: Multi-Target Adversarial Attacks and Robust Defenses for Continuous Data Summarization

関連論文リスト