Fugu-MT 論文翻訳(概要): FreezeVLA: Action-Freezing Attacks against Vision-Language-Action Models

論文の概要: FreezeVLA: Action-Freezing Attacks against Vision-Language-Action Models

arxiv url: http://arxiv.org/abs/2509.19870v1
Date: Wed, 24 Sep 2025 08:15:28 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-25 20:53:19.732465
Title: FreezeVLA: Action-Freezing Attacks against Vision-Language-Action Models
Title（参考訳）: FreezeVLA:ビジョンランゲージ・アクションモデルに対するアクションフリーズ攻撃
Authors: Xin Wang, Jie Li, Zejia Weng, Yixu Wang, Yifeng Gao, Tianyu Pang, Chao Du, Yan Teng, Yingchun Wang, Zuxuan Wu, Xingjun Ma, Yu-Gang Jiang,
Abstract要約: Vision-Language-Action(VLA)モデルはロボティクスの急速な進歩を加速している。敵画像はVLAモデルを「凍結」し、その後の命令を無視する。 FreezeVLAは、min-maxバイレベル最適化を通じて、アクション凍結攻撃を生成し、評価する。
参考スコア（独自算出の注目度）: 124.02734355214325
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Vision-Language-Action (VLA) models are driving rapid progress in robotics by enabling agents to interpret multimodal inputs and execute complex, long-horizon tasks. However, their safety and robustness against adversarial attacks remain largely underexplored. In this work, we identify and formalize a critical adversarial vulnerability in which adversarial images can "freeze" VLA models and cause them to ignore subsequent instructions. This threat effectively disconnects the robot's digital mind from its physical actions, potentially inducing inaction during critical interventions. To systematically study this vulnerability, we propose FreezeVLA, a novel attack framework that generates and evaluates action-freezing attacks via min-max bi-level optimization. Experiments on three state-of-the-art VLA models and four robotic benchmarks show that FreezeVLA attains an average attack success rate of 76.2%, significantly outperforming existing methods. Moreover, adversarial images generated by FreezeVLA exhibit strong transferability, with a single image reliably inducing paralysis across diverse language prompts. Our findings expose a critical safety risk in VLA models and highlight the urgent need for robust defense mechanisms.
Abstract（参考訳）: VLA(Vision-Language-Action)モデルは、エージェントがマルチモーダル入力を解釈し、複雑な長距離タスクを実行することを可能にすることで、ロボット工学の急速な進歩を加速している。しかし、敵の攻撃に対する安全性と堅牢性はいまだに未発見のままである。本研究では,VLAモデルを「凍結」し,その後の命令を無視する致命的な敵対的脆弱性を特定し,形式化する。この脅威は、ロボットのデジタルマインドを物理的行動から効果的に切り離し、臨界介入中の不作用を引き起こす可能性がある。この脆弱性を体系的に研究するために,min-maxバイレベル最適化による行動凍結攻撃を発生・評価する新しい攻撃フレームワークであるFreezeVLAを提案する。最先端の3つのVLAモデルと4つのロボットベンチマークの実験により、FreezeVLAは76.2%の平均的な攻撃成功率に達し、既存の手法よりも大幅に優れていた。さらに、FreezeVLAが生成した対向画像は強い伝達性を示し、単一の画像は多言語プロンプトの麻痺を確実に誘導する。本研究は,VLAモデルにおいて重要な安全性リスクを明らかにし,堅牢な防御機構の必要性を強調した。

論文の概要: FreezeVLA: Action-Freezing Attacks against Vision-Language-Action Models

関連論文リスト