Fugu-MT 論文翻訳(概要): LATS: Large Language Model Assisted Teacher-Student Framework for Multi-Agent Reinforcement Learning in Traffic Signal Control

論文の概要: LATS: Large Language Model Assisted Teacher-Student Framework for Multi-Agent Reinforcement Learning in Traffic Signal Control

arxiv url: http://arxiv.org/abs/2603.24361v1
Date: Wed, 25 Mar 2026 14:42:32 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-26 21:06:11.337314
Title: LATS: Large Language Model Assisted Teacher-Student Framework for Multi-Agent Reinforcement Learning in Traffic Signal Control
Title（参考訳）: LATS:交通信号制御におけるマルチエージェント強化学習を支援する大規模言語モデル
Authors: Yifeng Zhang, Peizhuo Li, Tingguang Zhou, Mingfeng Fan, Guillaume Sartoretti,
Abstract要約: 大規模言語モデル(LLM)とマルチエージェント強化学習(MARL)を統合したLATSという新しい学習パラダイムを提案する。 LLMは意味表現、推論、分析に優れるが、幻覚や推論速度の遅さは意思決定タスクへの直接的な適用を妨げることが多い。訓練された埋め込みLDMは、各交差点のトポロジ構造とトラフィックダイナミクスをキャプチャするリッチなセマンティックな特徴を生成する教師として機能する。
参考スコア（独自算出の注目度）: 12.562136225497937
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Adaptive Traffic Signal Control (ATSC) aims to optimize traffic flow and minimize delays by adjusting traffic lights in real time. Recent advances in Multi-agent Reinforcement Learning (MARL) have shown promise for ATSC, yet existing approaches still suffer from limited representational capacity, often leading to suboptimal performance and poor generalization in complex and dynamic traffic environments. On the other hand, Large Language Models (LLMs) excel at semantic representation, reasoning, and analysis, yet their propensity for hallucination and slow inference speeds often hinder their direct application to decision-making tasks. To address these challenges, we propose a novel learning paradigm named LATS that integrates LLMs and MARL, leveraging the former's strong prior knowledge and inductive abilities to enhance the latter's decision-making process. Specifically, we introduce a plug-and-play teacher-student learning module, where a trained embedding LLM serves as a teacher to generate rich semantic features that capture each intersection's topology structures and traffic dynamics. A much simpler (student) neural network then learns to emulate these features through knowledge distillation in the latent space, enabling the final model to operate independently from the LLM for downstream use in the RL decision-making process. This integration significantly enhances the overall model's representational capacity across diverse traffic scenarios, thus leading to more efficient and generalizable control strategies. Extensive experiments across diverse traffic datasets empirically demonstrate that our method enhances the representation learning capability of RL models, thereby leading to improved overall performance and generalization over both traditional RL and LLM-only approaches. [...]
Abstract（参考訳）: Adaptive Traffic Signal Control (ATSC) は、信号機をリアルタイムで調整することで、交通の流れを最適化し、遅延を最小限にすることを目的としている。近年のMARL(Multi-agent Reinforcement Learning)の進歩はATSCを約束しているが、既存のアプローチは表現能力の限界に悩まされているため、複雑な交通環境や動的交通環境における準最適性能や一般化の低さを招いている。一方、Large Language Models (LLM) は意味表現、推論、分析に優れるが、幻覚や推論速度の遅さは、意思決定タスクへの直接的な適用を妨げることが多い。これらの課題に対処するために,LLMとMARLを統合したLATSという新しい学習パラダイムを提案する。具体的には,各交差点のトポロジ構造と交通動態をキャプチャするリッチなセマンティックな特徴を生成するために,教師として訓練された組込みLDMが機能するプラグイン・アンド・プレイ型学習モジュールを提案する。より単純な(学生の)ニューラルネットワークは、潜伏空間の知識蒸留を通じてこれらの特徴をエミュレートすることを学び、最終モデルはLLMから独立して、RL決定プロセスで下流での使用を可能にする。この統合により、さまざまなトラフィックシナリオにわたるモデル全体の表現能力が大幅に向上し、より効率的で一般化可能な制御戦略がもたらされる。多様なトラフィックデータセットにわたる大規模な実験により、我々の手法はRLモデルの表現学習能力を向上し、従来のRLとLLMのみのアプローチよりも全体的な性能と一般化を向上させることが実証された。 [...]

論文の概要: LATS: Large Language Model Assisted Teacher-Student Framework for Multi-Agent Reinforcement Learning in Traffic Signal Control

関連論文リスト