Fugu-MT 論文翻訳(概要): From Prompt to Physical Action: Structured Backdoor Attacks on LLM-Mediated Robotic Control Systems

論文の概要: From Prompt to Physical Action: Structured Backdoor Attacks on LLM-Mediated Robotic Control Systems

arxiv url: http://arxiv.org/abs/2604.03890v1
Date: Sat, 04 Apr 2026 23:06:16 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-07 15:49:18.817937
Title: From Prompt to Physical Action: Structured Backdoor Attacks on LLM-Mediated Robotic Control Systems
Title（参考訳）: プロンプトから物理行動へ:LLMを利用したロボット制御システムにおける構造的バックドア攻撃
Authors: Mingyang Xie, Jin Wei-Kocsis,
Abstract要約: 大規模言語モデル(LLM)は、ユーザプロンプトを実行可能なコマンドに変換するために、ロボット制御パイプラインに統合される。このデジタル・物理的インターフェースは、重要で未発見の脆弱性、すなわち微調整中に埋め込まれた構造化バックドア攻撃を導入している。 LLMによるROS2ロボット制御システムにおけるLoRAベースのサプライチェーンバックドアを実験的に検討し、ロボットの物理的実行への影響を評価する。
参考スコア（独自算出の注目度）: 3.6185608572339962
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The integration of large language models (LLMs) into robotic control pipelines enables natural language interfaces that translate user prompts into executable commands. However, this digital-to-physical interface introduces a critical and underexplored vulnerability: structured backdoor attacks embedded during fine-tuning. In this work, we experimentally investigate LoRA-based supply-chain backdoors in LLM-mediated ROS2 robotic control systems and evaluate their impact on physical robot execution. We construct two poisoned fine-tuning strategies targeting different stages of the command generation pipeline and reveal a key systems-level insight: back-doors embedded at the natural-language reasoning stage do not reliably propagate to executable control outputs, whereas backdoors aligned directly with structured JSON command formats successfully survive translation and trigger physical actions. In both simulation and real-world experiments, backdoored models achieve an average Attack Success Rate of 83% while maintaining over 93% Clean Performance Accuracy (CPA) and sub-second latency, demonstrating both reliability and stealth. We further implement an agentic verification defense using a secondary LLM for semantic consistency checking. Although this reduces the Attack Success Rate (ASR) to 20%, it increases end-to-end latency to 8-9 seconds, exposing a significant security-responsiveness trade-off in real-time robotic systems. These results highlight structural vulnerabilities in LLM-mediated robotic control architectures and underscore the need for robotics-aware defenses for embodied AI systems.
Abstract（参考訳）: 大規模言語モデル(LLM)をロボット制御パイプラインに統合することで、ユーザプロンプトを実行可能なコマンドに変換する自然言語インターフェースが可能になる。しかし、このデジタル・物理的インターフェースは、重要かつ未発見の脆弱性、すなわち微調整中に埋め込まれた構造化バックドア攻撃をもたらす。本研究では,LLMによるROS2ロボット制御システムにおいて,ロラをベースとしたサプライチェーンバックドアを実験的に検討し,ロボットの物理的実行への影響を評価する。自然言語推論段階に埋め込まれたバックドアは、実行可能制御出力に確実に伝播せず、一方、構造化されたJSONコマンドフォーマットと直接一致したバックドアは、翻訳を生き残り、物理的なアクションをトリガーする。シミュレーションと実世界の両方の実験において、バックドアモデルの平均攻撃成功率は83%であり、93%以上のクリーンパフォーマンス精度(CPA)とサブ秒レイテンシを維持し、信頼性とステルスの両方を実証している。さらに,セマンティック一貫性チェックのための二次LLMを用いたエージェント検証ディフェンスを実装した。これによりアタック成功率(ASR)は20%に低下するが、エンドツーエンドのレイテンシは8～9秒に向上する。これらの結果は、LLMを介するロボット制御アーキテクチャの構造上の脆弱性を強調し、組み込みAIシステムに対するロボティクス対応の防御の必要性を浮き彫りにしている。

論文の概要: From Prompt to Physical Action: Structured Backdoor Attacks on LLM-Mediated Robotic Control Systems

関連論文リスト