Fugu-MT 論文翻訳(概要): Large Language Model based Interactive Decision-Making for Autonomous Driving

論文の概要: Large Language Model based Interactive Decision-Making for Autonomous Driving

arxiv url: http://arxiv.org/abs/2604.23513v1
Date: Sun, 26 Apr 2026 03:19:12 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-28 17:12:07.403892
Title: Large Language Model based Interactive Decision-Making for Autonomous Driving
Title（参考訳）: 大規模言語モデルに基づく自律運転のための対話型意思決定
Authors: Xinwei Dong, Jiyang Li, Jiabin Xie, Yang Yi, Tianshang Jia, Shiyu Fang, Ye Tian, Peng Hang,
Abstract要約: 高複雑性の混合交通シナリオでは、既存の自律運転システムは、過度に保守的な振る舞いをデフォルトとする。本稿では,シーン理解と意図認識の相互作用を増強する大規模言語モデルに基づく対話型意思決定フレームワークを提案する。クラスタ駆動シミュレータの実験では、提案手法は安全性、快適性、効率の指標で従来のベースラインを上回っている。
参考スコア（独自算出の注目度）: 9.806333521695466
License: http://creativecommons.org/licenses/by/4.0/
Abstract: In high-conflict mixed-traffic scenarios involving human-driven and autonomous vehicles, most existing autonomous driving systems default to overly conservative behaviors, lack proactive interaction, and consequently suffer from limited public acceptance. To mitigate intent misunderstandings and decision failures, we present a Large Language Model based interactive decision-making framework that augments scene understanding and intent-aware interaction to jointly improve safety and efficiency. The approach uses Object-Process Methodology to semantically model complex multi-vehicle scenes, abstracting low-level perceptual data into objects, processes, and relations, thereby streamlining reasoning over latent causal structure. Building on this representation, the Large Language Model parses both explicit and implicit intents of surrounding agents and, under jointly enforced safety and efficiency constraints, selects candidate maneuvers. We further generate perturbed trajectory candidates via Monte Carlo sampling and evaluate them to obtain an optimized executable trajectory. To foster transparency and coordination with nearby road users, the final decision is translated by the Large Language Model into concise natural-language messages and broadcast through an external Human-Machine Interface, completing a closed loop from scene understanding to action to language. Experiments in a cluster driving simulator demonstrate that the proposed method outperforms traditional baselines across safety, comfort, and efficiency metrics, while a Turing-test-style evaluation indicates a high degree of human-likeness in decision making. Besides, these results suggest that coupling semantic scene abstraction with Large Language Model mediated intent reasoning and language-based eHMI communication offers a practical pathway toward interactive, trustworthy autonomous driving in dense mixed traffic.
Abstract（参考訳）: 人間が運転する自動車と自動運転車の混成交通シナリオでは、既存の自動運転システムは、過度に保守的な振る舞いをしており、積極的相互作用が欠如しており、結果として公共の受け入れが制限されている。意図の誤解と意思決定の失敗を軽減するため,大規模言語モデルに基づく対話型意思決定フレームワークを提案する。このアプローチでは、Object-Process Methodologyを使用して、複雑なマルチサイクルシーンをセマンティックにモデル化し、低レベルの知覚データをオブジェクト、プロセス、関係に抽象化し、潜在因果構造に対する推論を合理化する。この表現に基づいて、Large Language Modelは周辺エージェントの明示的意図と暗黙的意図の両方を解析し、共同で安全と効率の制約を課し、候補の操作を選択する。さらに、モンテカルロサンプリングを用いて摂動軌道候補を生成し、それらを評価して、最適化可能な軌道を得る。近隣の道路利用者との透明性と協調を促進するため、最終決定はLarge Language Modelによって簡潔な自然言語メッセージに変換され、外部のヒューマン・マシン・インタフェースを介して放送され、シーン理解からアクション・トゥ・ランゲージへの閉ループが完了する。クラスタ駆動シミュレータの実験では、提案手法は安全性、快適性、効率の指標で従来のベースラインよりも優れており、チューリングテストスタイルの評価は意思決定において高い人間の類似性を示している。これらの結果は,大規模言語モデルによる意図推論と言語に基づくeHMIコミュニケーションを介するセマンティックシーンの抽象化が,密混合交通における対話的かつ信頼性の高い自律運転への実践的経路となることを示唆している。

論文の概要: Large Language Model based Interactive Decision-Making for Autonomous Driving

関連論文リスト