Fugu-MT 論文翻訳(概要): Learning from Human Driving: A Human-in-the-Loop Online Behavior Cloning Framework for Autonomous Driving

論文の概要: Learning from Human Driving: A Human-in-the-Loop Online Behavior Cloning Framework for Autonomous Driving

arxiv url: http://arxiv.org/abs/2606.08170v1
Date: Sat, 06 Jun 2026 13:35:07 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-09 14:42:05.892526
Title: Learning from Human Driving: A Human-in-the-Loop Online Behavior Cloning Framework for Autonomous Driving
Title（参考訳）: 人間運転から学ぶ:自律運転のためのオンライン行動クローンフレームワーク
Authors: Yuhong Shi, Jianyi Liu, Lihang Sun, Li Li, Xudong Dong,
Abstract要約: 本稿では,自動運転のためのHuman-in-the-Loop Online Behavior Cloning frame work (HiL-OBC)を提案する。 HiL-OBCは、LFMのクロスモーダル知覚能力と、人間の専門家の高レベルの運転知能を統合している。提案手法によって最適化された運転ポリシーにより,性能が大幅に向上することを示す。
参考スコア（独自算出の注目度）: 12.991350746930104
License: http://creativecommons.org/licenses/by-sa/4.0/
Abstract: With the evolution of large foundation models (LFMs), data-driven autonomous driving has made significant strides. However, existing paradigms still face severe challenges in complex interaction and long-tail scenarios due to distribution shift and causal confusion. These limitations often result in a lack of human-level decision-making flexibility and safety in extreme conditions. To overcome this limitation, this paper proposes a Human-in-the-Loop Online Behavior Cloning frame work (HiL-OBC) for autonomous driving, which aims to deeply integrate the cross-modal perceptual capabilities of LFMs with the high-level driving intelligence of human experts. Specifically, HiL-OBC deployment is executed through three critical phases: policy initialization with human intervention, latent behavioral modeling with Bayesian policy adaptation, and online deploy ment and updates. Furthermore, we design a Multi-modal Online Behavior Cloning (MOBC) model, which optimizes the base driving policy online through a lightweight network architecture, a takeover trigger mechanism, and a multi-variant loss function, thereby enhancing the system's decision-making robustness in complex environments. We evaluated the HiL-OBC on the LangAuto-Human CARLA benchmark. Experimental results demonstrate that the driving policies optimized via the human-in-the-loop mechanism achieve substantial performance gains: the DS of StructNav, LFG, and LMDrive increased by 47.25%, 31.59%, and 32.12%, respectively, with a simultaneous of various experimental settings and key components highlights the advantages of human-in-the-loop learning in improving decision-making robustness and overall driving performance.
Abstract（参考訳）: 大きな基礎モデル(LFM)の進化に伴い、データ駆動型自動運転は大きな進歩を遂げた。しかし、既存のパラダイムは、分散シフトと因果的混乱のため、複雑な相互作用と長い尾のシナリオにおいて深刻な課題に直面している。これらの制限はしばしば、極端な条件下での人間レベルの意思決定の柔軟性と安全性の欠如をもたらす。この制限を克服するため,自動運転のためのHuman-in-the-Loop Online Behavior Cloning frame work (HiL-OBC)を提案する。具体的には、HiL-OBCデプロイメントは、ヒューマン介入によるポリシー初期化、ベイズポリシー適応による潜伏行動モデリング、オンラインデプロイと更新の3つの重要なフェーズを通じて実行される。さらに,マルチモーダルオンライン行動クローン (MOBC) モデルを設計し,軽量ネットワークアーキテクチャ,テイクオーバトリガ機構,多変量損失関数を通じてオンラインのベースドライビングポリシーを最適化し,複雑な環境におけるシステム決定の堅牢性を高める。我々はLangAuto-Human CARLAベンチマークでHiL-OBCを評価した。 StructNav,LFG,LMDriveのDSは, それぞれ47.25%, 31.59%, 32.12%増加し, 各種実験設定とキーコンポーネントを同時に組み合わせることで, 意思決定の堅牢性と全体の運転性能を向上させる上でのヒューマン・イン・ザ・ループ学習の利点を浮き彫りにした。

論文の概要: Learning from Human Driving: A Human-in-the-Loop Online Behavior Cloning Framework for Autonomous Driving

関連論文リスト