Fugu-MT Paper Translation (Abstract): Focus on Hiders: Exploring Hidden Threats for Enhancing Adversarial Training

Paper overview: Focus on Hiders: Exploring Hidden Threats for Enhancing Adversarial Training

arxiv url: http://arxiv.org/abs/2312.07067v1
Date: Tue, 12 Dec 2023 08:41:18 GMT
Status: Translation complete
Last updated in system: 2023-12-13 16:46:56.229261
Title: Focus on Hiders: Exploring Hidden Threats for Enhancing Adversarial Training
Title (reference translation): Focusing on hiders: exploring hidden threats to strengthen adversarial training
Authors: Qian Li, Yuxiao Hu, Yinpeng Dong, Dongxiao Zhang, Yuntian Chen
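Abstract summary: We propose a generalized adversarial training algorithm called Hider-Focused Adversarial Training (HFAT). HFAT combines the optimization directions of standard adversarial training and hider prevention. The effectiveness of the proposed method is verified experimentally.
Reference score (site-computed attention score): 20.1991376813843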
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Adversarial training is often formulated as a min-max problem; however, concentrating only on the worst adversarial examples causes alternating repetitive confusion of the model, i.e., previously defended or correctly classified samples are not defensible or accurately classifiable in subsequent adversarial training. We characterize such non-ignorable samples as "hiders", which reveal the hidden high-risk regions within the secure area obtained through adversarial training and prevent the model from finding the real worst cases. We require the model to prevent hiders when defending against adversarial examples, improving accuracy and robustness simultaneously. By rethinking and redefining the min-max optimization problem for adversarial training, we propose a generalized adversarial training algorithm called Hider-Focused Adversarial Training (HFAT). HFAT introduces an iterative evolution optimization strategy to simplify the optimization problem and employs an auxiliary model to reveal hiders, effectively combining the optimization directions of standard adversarial training and hider prevention. Furthermore, we introduce an adaptive weighting mechanism that lets the model adaptively adjust its focus between adversarial examples and hiders during different training periods. We demonstrate the effectiveness of our method through extensive experiments and confirm that HFAT can provide higher robustness and accuracy.
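For orientation, the min-max problem mentioned in the abstract is commonly written in the form below. This is the standard textbook formulation of adversarial training, not the redefined objective that HFAT proposes (the abstract does not spell that out). The symbols are assumptions introduced here: $\theta$ are the model parameters, $\mathcal{D}$ the data distribution, $\delta$ a perturbation bounded by $\epsilon$ in an $\ell_p$ norm, and $\mathcal{L}$ the classification loss.

```latex
\min_{\theta} \; \mathbb{E}_{(x, y) \sim \mathcal{D}}
\Big[ \max_{\|\delta\|_{p} \le \epsilon}
\mathcal{L}\big(f_{\theta}(x + \delta),\, y\big) \Big]
```

The inner maximization searches for the worst-case perturbation of each input, and the outer minimization trains the model against it.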
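Abstract (reference translation): Adversarial training is often formulated as a min-max problem, but concentrating only on the worst-case adversarial examples causes alternating, repetitive confusion in the model. We characterize such non-ignorable samples as "hiders", which reveal hidden high-risk regions within the secure area obtained through adversarial training and prevent the model from finding the true worst cases. To improve accuracy and robustness simultaneously, we require the model to prevent hiders while defending against adversarial examples. By rethinking and redefining the min-max optimization problem for adversarial training, we propose a generalized adversarial training algorithm called Hider-Focused Adversarial Training (HFAT). HFAT introduces an iterative evolution optimization strategy to simplify the optimization problem and employs an auxiliary model to reveal hiders, effectively combining the optimization directions of standard adversarial training and hider prevention. In addition, we introduce an adaptive weighting mechanism that adaptively adjusts the focus between adversarial examples and hiders across different training periods. We verify the effectiveness of the proposed method experimentally and confirm that HFAT can provide higher robustness and accuracy.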
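The abstract describes hiders as samples that were previously defended or correctly classified but are no longer so later in training. A minimal bookkeeping sketch of that idea follows. It only compares per-sample correctness between two checkpoints; it is not HFAT's auxiliary-model mechanism, and the function name `find_hiders` and the example masks are hypothetical.

```python
from typing import List, Sequence


def find_hiders(prev_correct: Sequence[bool],
                curr_correct: Sequence[bool]) -> List[int]:
    """Return indices of samples handled correctly at the previous
    checkpoint but misclassified at the current one."""
    if len(prev_correct) != len(curr_correct):
        raise ValueError("both masks must cover the same samples")
    return [
        i
        for i, (before, after) in enumerate(zip(prev_correct, curr_correct))
        if before and not after
    ]


# Hypothetical per-sample correctness at two consecutive training epochs.
epoch_3 = [True, True, False, True, False]
epoch_4 = [True, False, False, True, True]

hiders = find_hiders(epoch_3, epoch_4)
print(hiders)  # [1]
```

Only sample 1 flips from correct to incorrect, so it is the only one flagged; sample 4 flips the other way and is ignored.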

Related papers

Dynamic Label Adversarial Training for Deep Learning Robustness Against Adversarial Attacks [11.389689242531327]
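Adversarial training is one of the most effective ways to improve model robustness. Conventional approaches mainly use static ground truth for adversarial training, which often causes robust overfitting. This paper proposes the dynamic label adversarial training (DYNAT) algorithm.
Paper reference translation (metadata) (2024-08-23T14:25:12Z)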
Improved Adversarial Training Through Adaptive Instance-wise Loss Smoothing [5.1024659285813785]
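Adversarial training has been the most successful defense against adversarial attacks. This paper proposes a new adversarial training method. The method achieves state-of-the-art robustness against $\ell_\infty$-norm constrained attacks.
Paper reference translation (metadata) (2023-03-24T15:41:40Z)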
Vanilla Feature Distillation for Improving the Accuracy-Robustness Trade-Off in Adversarial Training [37.5115141623558]
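This paper proposes Vanilla Feature Distillation Adversarial Training (VFD-Adv). A key advantage of the method is that it can be universally adapted to, and can boost, existing works.
Paper reference translation (metadata) (2022-06-05T11:57:10Z)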
Enhancing Adversarial Training with Feature Separability [52.39305978984573]
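This paper proposes a novel adversarial training graph (ATG), with which adversarial training with feature separability (ATFS) can increase intra-class feature similarity and inter-class feature variance. Comprehensive experiments show that the proposed ATFS framework significantly improves both clean and robust performance.
Paper reference translation (metadata) (2022-05-02T04:04:23Z)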
On the Convergence and Robustness of Adversarial Training [134.25999006326916]
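Adversarial training with Projected Gradient Descent (PGD) is the most effective. We propose a dynamic training strategy to improve the convergence quality of the generated adversarial examples. The results indicate the effectiveness of the proposed method.
Paper reference translation (metadata) (2021-12-15T17:54:08Z)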
Model-Agnostic Meta-Attack: Towards Reliable Evaluation of Adversarial Robustness [53.094682754683255]
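The Model-Agnostic Meta-Attack (MAMA) approach automatically discovers stronger attack algorithms. The method learns an adversarial attack parameterized by a recurrent neural network. We develop a model-agnostic training algorithm to improve its learning ability when attacking unseen defenses.
Paper reference translation (metadata) (2021-10-13T13:54:24Z)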
Improving White-box Robustness of Pre-processing Defenses via Joint Adversarial Training [106.34722726264522]
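Various adversarial defense techniques have been proposed to mitigate the interference of adversarial noise. Pre-processing methods may suffer from a robustness degradation effect. A potential cause of this negative effect is that the adversarial training examples are static and independent of the pre-processing model. This paper proposes the Joint Adversarial Training based Pre-processing (JATP) defense.
Paper reference translation (metadata) (2021-06-10T01:45:32Z)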
Self-Progressing Robust Training [146.8337017922058]
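Current robust training methods such as adversarial training explicitly use an "attack" to generate adversarial examples. We propose a new framework for self-progressing robust training called SPROUT. The results shed new light on scalable, effective, and attack-independent robust training methods.
Paper reference translation (metadata) (2020-12-22T00:45:24Z)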
Adversarial Distributional Training for Robust Deep Learning [53.300984501078126]
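Adversarial training (AT) is one of the most effective techniques for improving model robustness by augmenting the training data with adversarial examples. Most existing AT methods adopt a specific attack to craft adversarial examples, leading to unreliable robustness against other unseen attacks. This paper introduces ADT, a new framework for learning robust models.
Paper reference translation (metadata) (2020-02-14T12:36:59Z)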

The related papers list is generated automatically from the titles and abstracts of papers on this site.
