Fugu-MT 論文翻訳(概要): Pedestrian-Aware LLM-Driven Behavioral Planning for Autonomous Vehicles

論文の概要: Pedestrian-Aware LLM-Driven Behavioral Planning for Autonomous Vehicles

arxiv url: http://arxiv.org/abs/2605.16858v1
Date: Sat, 16 May 2026 07:41:42 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-19 17:57:47.203543
Title: Pedestrian-Aware LLM-Driven Behavioral Planning for Autonomous Vehicles
Title（参考訳）: 歩行者対応LLM駆動型自動運転車の行動計画
Authors: Aidana Baimbetova, Haruki Yonekura, Hamada Rizk, Hirozumi Yamaguchi,
Abstract要約: 歩行者対応行動計画のための大規模言語モデル(LLM)に基づく意思決定フレームワーク。システムは、構造化されたシーンの観察を自然言語による推論のプロンプトに変換し、LLMは歩行者の意図を推測し、リスクを予測し、慎重な戦術的な運転決定を生成する。本研究では, 予期せぬ歩行, ターンバック・クロス, ためらい, 双方向横断など, 複数の歩行者行動シナリオにおけるSUMOの枠組みを評価した。
参考スコア（独自算出の注目度）: 5.343135467323389
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Autonomous Vehicles (AVs) must make reliable decisions in dense urban environments where pedestrian behavior is variable, sometimes abnormal, and often unseen during training. Reinforcement learning (RL)-based AV control systems perform well in structured traffic but struggle to generalize to unpredictable pedestrian interactions and out-of-distribution scenarios. Their reliance on handcrafted rewards and opaque decisions further limits their suitability for safety-critical, pedestrian-rich environments. To address these limitations, we introduce a Large Language Model (LLM)-based decision-making framework for pedestrian-aware behavioral planning. The system converts structured scene observations into natural-language reasoning prompts, enabling the LLM to infer pedestrian intent, anticipate risk, and generate cautious tactical driving decisions. These decisions are executed by a motion planner that ensures smooth, kinematically feasible control. We evaluate the framework in SUMO across multiple pedestrian-interaction scenarios, including unexpected jaywalking, turn-back crossing, hesitation, and bidirectional crossing. In zero-shot evaluation, the LLM-based agent achieves a 68% collision-free success rate, substantially outperforming deep RL baselines (17.7%). With few-shot episodic memory in a single-pedestrian scenario, performance increases to 96.0%, exceeding a custom DQN controller (82.0%). Cross-behavior evaluation further shows that memory derived from turn-back interactions transfers to unseen hesitation and bidirectional crossing scenarios, achieving 82.0% and 90.0% success, respectively. The system consistently initiates earlier responses, maintains wider safety buffers, and produces interpretable, human-aligned decisions.
Abstract（参考訳）: 自律走行車(AV)は、歩行者の行動が変動し、時には異常であり、訓練中にしばしば見つからない密集した都市環境において、信頼できる判断をしなければならない。強化学習(Reinforcement Learning, RL)に基づくAV制御システムは, 構造的トラフィックにおいて良好に機能するが, 予測不能な歩行者インタラクションやアウト・オブ・ディストリビューションシナリオへの一般化に苦慮する。手作りの報酬への依存と不透明な決定により、安全で歩行者に富んだ環境への適合性がさらに制限される。これらの制約に対処するため,歩行者対応行動計画のためのLarge Language Model (LLM)に基づく意思決定フレームワークを導入する。このシステムは、構造化されたシーンの観察を自然言語による推論のプロンプトに変換し、LLMは歩行者の意図を推測し、リスクを予測し、慎重な戦術的な運転決定を生成する。これらの決定は、スムーズでキネマティックに実行可能な制御を保証するモーションプランナーによって実行される。本研究では, 予期せぬ歩行, ターンバック・クロス, ためらい, 双方向横断など, 複数の歩行者行動シナリオにおけるSUMOの枠組みを評価した。ゼロショット評価では、LSMベースのエージェントは68%の衝突のない成功率を達成し、RLベースラインをかなり上回っている(17.7%)。単ペデストリアンのシナリオでは数ショットのエピソードメモリを使用すると、パフォーマンスは96.0%に向上し、カスタムDQNコントローラ(82.0%)を超える。クロスビヘイビア評価により、ターンバック相互作用から得られるメモリは、それぞれ82.0%と90.0%の成功を収める、見知らぬためらうシナリオと双方向の交差シナリオに転送されることが示された。このシステムは、常に早期の応答を開始し、より広範な安全バッファを保持し、解釈可能で人間に整合した決定を生成する。

論文の概要: Pedestrian-Aware LLM-Driven Behavioral Planning for Autonomous Vehicles

関連論文リスト