Fugu-MT 論文翻訳(概要): Enhancing LLM-based Autonomous Driving with Modular Traffic Light and Sign Recognition

論文の概要: Enhancing LLM-based Autonomous Driving with Modular Traffic Light and Sign Recognition

arxiv url: http://arxiv.org/abs/2511.14391v1
Date: Tue, 18 Nov 2025 11:52:52 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-19 16:23:53.084933
Title: Enhancing LLM-based Autonomous Driving with Modular Traffic Light and Sign Recognition
Title（参考訳）: Modular Traffic Lightと符号認識によるLCMによる自律走行の強化
Authors: Fabian Schmidt, Noushiq Mohammed Kayilan Abdul Nazar, Markus Enzweiler, Abhinav Valada,
Abstract要約: 大規模言語モデル(LLM)は、自律運転における意思決定と計画にますます利用されている。 LLMをベースとした自律走行エージェントを明示的な信号と信号認識で拡張するモジュール冗長層であるTLS-Assistを導入する。 LMDriveを14%,BEVDriverを7%,運転性能を14%向上させた。
参考スコア（独自算出の注目度）: 15.4994260281059
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large Language Models (LLMs) are increasingly used for decision-making and planning in autonomous driving, showing promising reasoning capabilities and potential to generalize across diverse traffic situations. However, current LLM-based driving agents lack explicit mechanisms to enforce traffic rules and often struggle to reliably detect small, safety-critical objects such as traffic lights and signs. To address this limitation, we introduce TLS-Assist, a modular redundancy layer that augments LLM-based autonomous driving agents with explicit traffic light and sign recognition. TLS-Assist converts detections into structured natural language messages that are injected into the LLM input, enforcing explicit attention to safety-critical cues. The framework is plug-and-play, model-agnostic, and supports both single-view and multi-view camera setups. We evaluate TLS-Assist in a closed-loop setup on the LangAuto benchmark in CARLA. The results demonstrate relative driving performance improvements of up to 14% over LMDrive and 7% over BEVDriver, while consistently reducing traffic light and sign infractions. We publicly release the code and models on https://github.com/iis-esslingen/TLS-Assist.
Abstract（参考訳）: 大規模言語モデル(LLM)は、自律運転における意思決定と計画にますます使われており、有望な推論能力と様々な交通状況にまたがる一般化の可能性を示している。しかし、現在のLSMベースの運転エージェントは、交通規則を強制する明確なメカニズムが欠如しており、信号機や標識のような小型で安全な物体を確実に検出するのに苦労することが多い。この制限に対処するため,TLS-Assistは,LSMをベースとした自律走行エージェントを明示的な信号と信号認識で拡張するモジュール冗長層である。 TLS-Assistは、検出をLLM入力に注入される構造化された自然言語メッセージに変換する。このフレームワークはプラグアンドプレイで、モデルに依存しず、シングルビューとマルチビューの両方のカメラセットアップをサポートする。 CARLAにおけるLangAutoベンチマークのクローズドループ設定におけるTLS-Assistの評価を行った。その結果、LMDriveよりも最大14%、BEVDriverより7%、運転性能は最大14%向上した。コードとモデルはhttps://github.com/iis-esslingen/TLS-Assist.comで公開しています。

論文の概要: Enhancing LLM-based Autonomous Driving with Modular Traffic Light and Sign Recognition

関連論文リスト