Fugu-MT 論文翻訳(概要): Membership Inference Attacks on Vision-Language-Action Models

論文の概要: Membership Inference Attacks on Vision-Language-Action Models

arxiv url: http://arxiv.org/abs/2605.07088v1
Date: Fri, 08 May 2026 01:16:00 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-11 19:43:38.709386
Title: Membership Inference Attacks on Vision-Language-Action Models
Title（参考訳）: ビジョン・ランゲージ・アクションモデルによるメンバーシップ推論攻撃
Authors: Yuefeng Peng, Mingzhe Li, Kejing Xia, Renhao Zhang, Amir Houmansadr,
Abstract要約: 本稿では,視覚言語行動モデル(VLA)に対するメンバーシップ推論攻撃に関する最初の体系的研究について述べる。我々の攻撃は、トークンの確率のような古典的なMIA信号と、観測可能な動作誤差や時間的動きパターンのようなVLA固有の信号の両方を利用する。我々の研究結果によると、ロボットと具体化されたAIのプライバシーリスクがこれまで過小評価され、VLAモデルの専用のプライバシー評価と防衛の必要性が浮き彫りになっている。
参考スコア（独自算出の注目度）: 18.964278149350747
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Membership inference attacks (MIAs) have been extensively studied in large language models (LLMs) and vision-language models (VLMs), yet their implications for vision-language-action (VLA) models remain largely unexplored. VLA models differ from standard LLMs and VLMs in several important ways: they are often fine-tuned for many epochs on relatively small embodied datasets, operate over constrained and structured action spaces, and expose action outputs that can be observed as executable behaviors and temporally correlated trajectories. These characteristics suggest a distinct and potentially more informative attack surface for membership inference. In this work, we present the first systematic study of MIAs against VLA systems. We formalize two membership inference settings for VLA models: sample-level inference over individual transition samples and trajectory-level inference over complete embodied demonstrations. We further develop a suite of attack methods under multiple access regimes, including strict black-box access. Our attacks exploit both classic MIA signals, such as token likelihood, and VLA-specific signals, such as observable action errors and temporal motion patterns. Across multiple VLA benchmarks and representative VLA models, these attacks achieve strong inference performance, showing that VLA models are highly vulnerable to membership inference. Notably, black-box attacks based only on generated actions achieve strong performance, highlighting a practical privacy risk for deployed embodied AI systems. Our findings reveal a previously underexplored privacy risk in robotic and embodied AI, and underscore the need for dedicated privacy evaluation and defenses for VLA models.
Abstract（参考訳）: メンバーシップ推論攻撃 (MIA) は大規模言語モデル (LLM) や視覚言語モデル (VLM) で広く研究されてきたが、視覚言語モデル (VLA) に対するその影響は未解明のままである。 VLAモデルは、通常LLMやVLMとはいくつかの重要な方法で異なる: 比較的小さな埋め込みデータセット上で多くのエポックに対して微調整され、制約された、構造化されたアクション空間上で動作し、実行可能な振る舞いや時間的に相関した軌道として観測できるアクション出力を公開する。これらの特徴は、メンバーシップ推論において、識別され、より有益な攻撃面であることを示している。本稿では、VLAシステムに対するMIAの最初の系統的研究について述べる。 VLAモデルに対する2つのメンバシップ推論設定を定式化した: 個々の遷移サンプルに対するサンプルレベル推論と、完全に具体化されたデモに対する軌道レベル推論である。我々はさらに、厳格なブラックボックスアクセスを含む複数のアクセス体制の下で攻撃方法のスイートを開発する。我々の攻撃は、トークンの確率のような古典的なMIA信号と、観測可能な動作誤差や時間的動きパターンのようなVLA固有の信号の両方を利用する。複数のVLAベンチマークと代表的なVLAモデルを通して、これらの攻撃は強い推論性能を達成し、VLAモデルがメンバーシップ推論に対して非常に脆弱であることを示す。特に、生成されたアクションのみに基づくブラックボックス攻撃は、強力なパフォーマンスを実現し、デプロイされた組み込みAIシステムの実用的なプライバシリスクを強調している。我々の研究結果によると、ロボットと具体化されたAIのプライバシーリスクがこれまで過小評価され、VLAモデルの専用のプライバシー評価と防衛の必要性が浮き彫りになっている。

論文の概要: Membership Inference Attacks on Vision-Language-Action Models

関連論文リスト