Fugu-MT 論文翻訳(概要): Model-agnostic Adversarial Attack and Defense for Vision-Language-Action Models

論文の概要: Model-agnostic Adversarial Attack and Defense for Vision-Language-Action Models

arxiv url: http://arxiv.org/abs/2510.13237v1
Date: Wed, 15 Oct 2025 07:42:44 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-16 20:13:28.548646
Title: Model-agnostic Adversarial Attack and Defense for Vision-Language-Action Models
Title（参考訳）: ビジョン・ランゲージ・アクションモデルに対するモデル非依存的敵攻撃と防御
Authors: Haochuan Xu, Yun Sing Koh, Shuhuai Huang, Zirun Zhou, Di Wang, Jun Sakuma, Jingfeng Zhang,
Abstract要約: VLA(Vision-Language-Action)モデルは、ロボット学習において革命的な進歩を遂げている。この進歩にもかかわらず、その敵意の強固さは未解明のままである。本稿では,VLAモデルに対する敵パッチ攻撃と対応する防御戦略の両方を提案する。
参考スコア（独自算出の注目度）: 25.45513133247862
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Vision-Language-Action (VLA) models have achieved revolutionary progress in robot learning, enabling robots to execute complex physical robot tasks from natural language instructions. Despite this progress, their adversarial robustness remains underexplored. In this work, we propose both adversarial patch attack and corresponding defense strategies for VLA models. We first introduce the Embedding Disruption Patch Attack (EDPA), a model-agnostic adversarial attack that generates patches directly placeable within the camera's view. In comparison to prior methods, EDPA can be readily applied to different VLA models without requiring prior knowledge of the model architecture, or the controlled robotic manipulator. EDPA constructs these patches by (i) disrupting the semantic alignment between visual and textual latent representations, and (ii) maximizing the discrepancy of latent representations between adversarial and corresponding clean visual inputs. Through the optimization of these objectives, EDPA distorts the VLA's interpretation of visual information, causing the model to repeatedly generate incorrect actions and ultimately result in failure to complete the given robotic task. To counter this, we propose an adversarial fine-tuning scheme for the visual encoder, in which the encoder is optimized to produce similar latent representations for both clean and adversarially perturbed visual inputs. Extensive evaluations on the widely recognized LIBERO robotic simulation benchmark demonstrate that EDPA substantially increases the task failure rate of cutting-edge VLA models, while our proposed defense effectively mitigates this degradation. The codebase is accessible via the homepage at https://edpa-attack.github.io/.
Abstract（参考訳）: VLA(Vision-Language-Action)モデルは、ロボット学習の革命的な進歩を達成し、ロボットが自然言語による複雑な物理ロボットタスクを実行できるようになった。この進歩にもかかわらず、その敵意の強固さは未解明のままである。本研究では,VLAモデルに対する逆パッチ攻撃と対応する防御戦略の両方を提案する。我々はまず,カメラの視界に直接配置可能なパッチを生成するモデルに依存しない敵攻撃である Embedding Disruption Patch Attack (EDPA) を紹介する。従来の手法と比較して、EDPAはモデルアーキテクチャや制御ロボットマニピュレータの事前知識を必要とせずに、異なるVLAモデルに容易に適用することができる。 EDPAはこれらのパッチを構築する (i)視覚的・テキスト的潜在表現のセマンティックアライメントを乱し、 (2) 敵と対応するクリーンな視覚入力の潜在表現の差を最大化する。これらの目的の最適化を通じて、EDPAはVLAの視覚情報解釈を歪め、モデルが繰り返し誤ったアクションを発生させ、最終的に与えられたロボットタスクを完了させることができない。これに対応するために,視覚エンコーダの逆方向の微調整方式を提案する。広く認識されているLIBEROロボットシミュレーションベンチマークにおいて、EDPAは最先端VLAモデルのタスク故障率を大幅に向上する一方、この劣化を効果的に軽減する。コードベースは、https://edpa- attack.github.io/のホームページからアクセスできる。

論文の概要: Model-agnostic Adversarial Attack and Defense for Vision-Language-Action Models

関連論文リスト