Fugu-MT 論文翻訳(概要): VLA-Hijack: A Transferable Patch Attack against Vision-Language-Action Models via Visual Proprioception Hijacking

論文の概要: VLA-Hijack: A Transferable Patch Attack against Vision-Language-Action Models via Visual Proprioception Hijacking

arxiv url: http://arxiv.org/abs/2605.28083v1
Date: Wed, 27 May 2026 07:38:16 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-28 17:38:55.850503
Title: VLA-Hijack: A Transferable Patch Attack against Vision-Language-Action Models via Visual Proprioception Hijacking
Title（参考訳）: VLA-Hijack:ビジュアル・プロプライオセプション・ヒジャックによるビジョン・ランゲージ・アクションモデルに対するトランスファー可能なパッチアタック
Authors: Jiyuan Fu, Kaixun Jiang, Jingkai Jia, Zhaoyu Chen, Xueyao Chen, Lingyi Hong, Shuyong Gao, Chenzhi Tan, Dingkang Yang, Wenqiang Zhang,
Abstract要約: VLA-HijackはVision-Language-Actionモデルのための統合された対向フレームワークである。共有された視覚的自己ローカライゼーションプロセスを活用することで、根本的な脆弱性を悪用する。ホワイトボックス設定において優れた最適化効率を実現し、クロスアーキテクチャとクロスドメインブラックボックス転送性のための新しいSOTAを設定する。
参考スコア（独自算出の注目度）: 49.51251398232003
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: While Vision-Language-Action (VLA) models have emerged as powerful generalist policies, their severe vulnerability to adversarial patches significantly hinders their deployment in safety-critical domains. Moreover, existing patch attacks primarily focus on white-box settings, heavily overfitting to the specific action output space of the target model, which results in poor cross-architecture transferability. To overcome this limitation, we propose VLA-Hijack, a unified adversarial framework that breaks the transferability bottleneck by exploiting a fundamental vulnerability identified in this work: before planning any motion, a VLA model must first use visual information to locate its own robotic arm within the environment. Targeting this shared visual self-localization process, our approach concurrently optimizes Attention-Guided Proprioceptive Suppression to inhibit the real robotic arm's features, and Multimodal Proprioceptive Injection to establish the patch as a surrogate "phantom embodiment". By alternating between semantic concept anchoring and visual prototype projection, VLA-Hijack effectively severs the semantic relationship between the agent's true embodiment and its control policy. Extensive experiments across diverse architectures (OpenVLA, UniVLA, and CronusVLA) demonstrate that VLA-Hijack achieves superior optimization efficiency in white-box settings and sets a new SOTA for cross-architecture and cross-domain black-box transferability.
Abstract（参考訳）: VLA(Vision-Language-Action)モデルは強力なジェネラリストポリシとして登場したが、敵のパッチに対する深刻な脆弱性は、安全クリティカルなドメインへの展開を著しく妨げている。さらに、既存のパッチ攻撃は主にホワイトボックスの設定に重点を置いており、ターゲットモデルの特定のアクション出力空間に非常に過度に適合しており、結果としてアーキテクチャ間の転送性が低下する。この制限を克服するために、我々はVLA-Hijack(VLA-Hijack)を提案する。これは、この作業で特定される基本的な脆弱性を利用することによって、転送可能性のボトルネックを突破する統合された対向フレームワークである。この共有視覚的自己ローカライゼーションプロセスを対象として,本手法では,実際のロボットアームの特徴を抑えるために注意誘導原初受容抑制を同時に最適化し,マルチモーダル原初受容注入により,パッチを「幻の具体化」として確立する。意味的概念のアンカーと視覚的プロトタイププロジェクションの交互に、VLA-Hijackはエージェントの真の姿とコントロールポリシーの間の意味的関係を効果的に分離する。多様なアーキテクチャ(OpenVLA、UniVLA、CronusVLA)にわたる広範な実験により、VLA-Hijackはホワイトボックス設定において優れた最適化効率を実現し、クロスアーキテクチャとクロスドメインのブラックボックス転送性のための新しいSOTAを設定できることを示した。

論文の概要: VLA-Hijack: A Transferable Patch Attack against Vision-Language-Action Models via Visual Proprioception Hijacking

関連論文リスト