Fugu-MT 論文翻訳(概要): SARAD: LLM-Based Safety-Aware Hybrid Reinforcement Learning with Collision Prediction for Autonomous Driving

論文の概要: SARAD: LLM-Based Safety-Aware Hybrid Reinforcement Learning with Collision Prediction for Autonomous Driving

arxiv url: http://arxiv.org/abs/2605.28583v1
Date: Wed, 27 May 2026 15:06:34 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-28 17:38:56.140424
Title: SARAD: LLM-Based Safety-Aware Hybrid Reinforcement Learning with Collision Prediction for Autonomous Driving
Title（参考訳）: SARAD: 自動運転のための衝突予測を用いたLLMによる安全対応ハイブリッド強化学習
Authors: Kangyu Wu, Peng Cui, Guoxi Chen, Ya Zhang,
Abstract要約: 従来のDeep Reinforcement Learning (DRL)は、安全でないランダムな探索と緩やかな収束に悩まされている。大規模言語モデル(LLM)は、リアルタイム推論操作に固有の遅延を示す。本稿では,自律走行のためのLLMとDRLを相乗化する安全対応ハイブリッドフレームワークSARADを提案する。
参考スコア（独自算出の注目度）: 22.134329848099142
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Ensuring both safety and efficiency in decision-making for autonomous driving systems remains a fundamental challenge. Traditional Deep Reinforcement Learning (DRL) suffers from unsafe random exploration and slow convergence, while Large Language Models (LLMs) demonstrate inherent latency in real-time inference operations. To address these limitations, this paper proposes SARAD, a novel safety-aware hybrid framework that synergizes LLMs and DRL for autonomous driving. SARAD substitutes the random exploration of DRL with Retrieval-Augmented Generation (RAG)-enhanced, LLM-guided decisions sourced from a dynamic expert knowledge repository. An attention discriminator is proposed to integrate the prior knowledge of LLMs into DRL policy optimization. A collision predictor module, fine-tuned with historical collision data, is further designed to improve vehicle safety. Extensive experiments show that SARAD achieves significant performance improvements in the Highway-Env simulator, validating the effectiveness of the proposed model in autonomous driving.
Abstract（参考訳）: 自動運転システムの意思決定における安全性と効率性の確保は、依然として根本的な課題である。従来のDeep Reinforcement Learning (DRL) は安全でないランダム探索と遅い収束に悩まされ、Large Language Models (LLM) はリアルタイム推論操作に固有の遅延を示す。これらの制約に対処するために,自動走行のためのLLMとDRLを相乗化する新しい安全対応ハイブリッドフレームワークであるSARADを提案する。 SARAD は DRL のランダムな探索に代えて,動的知識リポジトリをベースとしたRAG(Retrieval-Augmented Generation) 強化,LLM 誘導による決定を行う。 LLMの事前知識をDRLポリシー最適化に組み込むために,注意識別器を提案する。衝突予測モジュールは、過去の衝突データで微調整され、車両の安全性を向上させるためにさらに設計されている。大規模な実験により,SARADはハイウェイ-エンヴシミュレータにおいて大幅な性能向上を実現し,提案モデルの有効性を検証した。

論文の概要: SARAD: LLM-Based Safety-Aware Hybrid Reinforcement Learning with Collision Prediction for Autonomous Driving

関連論文リスト