Fugu-MT 論文翻訳(概要): ANCHOR: Integrating Adversarial Training with Hard-mined Supervised Contrastive Learning for Robust Representation Learning

論文の概要: ANCHOR: Integrating Adversarial Training with Hard-mined Supervised Contrastive Learning for Robust Representation Learning

arxiv url: http://arxiv.org/abs/2510.27599v1
Date: Fri, 31 Oct 2025 16:24:28 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-03 17:52:16.164027
Title: ANCHOR: Integrating Adversarial Training with Hard-mined Supervised Contrastive Learning for Robust Representation Learning
Title（参考訳）: ANCHOR:ロバスト表現学習のためのハードマイニング・スーパービジョン・コントラスト学習による対人訓練の統合
Authors: Samarup Bhattacharya, Anubhab Bhattacharya, Abir Chakraborty,
Abstract要約: モデルは勾配に従って学習し、データの中で最も識別しやすいパターンを特定するまで、パラメータを段階的に調整する。モデル学習を支援する勾配は、モデルが決定を完全に変更する原因となる、小さく、受け入れがたい微調整を生成するためにも使用することができる。これらの攻撃は、この脆弱性を悪用し、画像に小さな、知覚不能な変更を加えることで、人間の目と同じものを残しながら、モデルが間違った予測をする原因となる。
参考スコア（独自算出の注目度）: 1.9116784879310027
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Neural networks have changed the way machines interpret the world. At their core, they learn by following gradients, adjusting their parameters step by step until they identify the most discriminant patterns in the data. This process gives them their strength, yet it also opens the door to a hidden flaw. The very gradients that help a model learn can also be used to produce small, imperceptible tweaks that cause the model to completely alter its decision. Such tweaks are called adversarial attacks. These attacks exploit this vulnerability by adding tiny, imperceptible changes to images that, while leaving them identical to the human eye, cause the model to make wrong predictions. In this work, we propose Adversarially-trained Contrastive Hard-mining for Optimized Robustness (ANCHOR), a framework that leverages the power of supervised contrastive learning with explicit hard positive mining to enable the model to learn representations for images such that the embeddings for the images, their augmentations, and their perturbed versions cluster together in the embedding space along with those for other images of the same class while being separated from images of other classes. This alignment helps the model focus on stable, meaningful patterns rather than fragile gradient cues. On CIFAR-10, our approach achieves impressive results for both clean and robust accuracy under PGD-20 (epsilon = 0.031), outperforming standard adversarial training methods. Our results indicate that combining adversarial guidance with hard-mined contrastive supervision helps models learn more structured and robust representations, narrowing the gap between accuracy and robustness.
Abstract（参考訳）: ニューラルネットワークは、機械が世界を解釈する方法を変えた。中心となるのは勾配に従って学習し、データの中で最も識別しやすいパターンを特定するまで、パラメータを段階的に調整する。このプロセスは彼らに強みを与えるが、秘密の欠陥への扉を開く。モデル学習を支援する勾配は、モデルが決定を完全に変更する原因となる、小さく、受け入れがたい微調整を生成するためにも使用することができる。このような微調整を敵攻撃と呼ぶ。これらの攻撃は、この脆弱性を悪用し、画像に小さな、知覚不能な変更を加えることで、人間の目と同じものを残しながら、モデルが間違った予測をする原因となる。そこで本研究では, 教師付きコントラスト学習のパワーと明示的な強靭なマイニングを両立させるフレームワークであるAdversarially-trained Contrastive Hard-mining for Optimized Robustness (ANCHOR)を提案する。このアライメントは、モデルを脆弱な勾配キューではなく、安定的で意味のあるパターンにフォーカスするのに役立ちます。 CIFAR-10では, PGD-20 (epsilon = 0.031) の精度と精度の両立が図られ, 標準的な対向訓練法よりも優れていた。この結果から, 逆方向誘導と強み付きコントラスト監視を組み合わせることで, モデルがより構造化され, 頑健な表現を学習し, 精度と頑健さのギャップを狭めることが示唆された。

論文の概要: ANCHOR: Integrating Adversarial Training with Hard-mined Supervised Contrastive Learning for Robust Representation Learning

関連論文リスト