Fugu-MT 論文翻訳(概要): Traffic Sign Recognition in Autonomous Driving: Dataset, Benchmark, and Field Experiment

論文の概要: Traffic Sign Recognition in Autonomous Driving: Dataset, Benchmark, and Field Experiment

arxiv url: http://arxiv.org/abs/2603.23034v1
Date: Tue, 24 Mar 2026 10:11:27 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-25 19:53:37.427773
Title: Traffic Sign Recognition in Autonomous Driving: Dataset, Benchmark, and Field Experiment
Title（参考訳）: 自動運転における交通信号認識:データセット,ベンチマーク,フィールド実験
Authors: Guoyang Zhao, Weiqing Qi, Kai Zhang, Chenguang Zhang, Zeying Gong, Zhihai Bi, Kai Chen, Benshan Ma, Ming Liu, Jun Ma,
Abstract要約: 交通信号認識(TSR)は、自動運転における中核的な認識能力である。本稿では,大規模かつグローバルなトラフィックサインデータセットTS-1Mを提案する。 3つの代表的な学習パラダイムにまたがって統一されたベンチマークを実施します。
参考スコア（独自算出の注目度）: 15.782194207723975
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Traffic Sign Recognition (TSR) is a core perception capability for autonomous driving, where robustness to cross-region variation, long-tailed categories, and semantic ambiguity is essential for reliable real-world deployment. Despite steady progress in recognition accuracy, existing traffic sign datasets and benchmarks offer limited diagnostic insight into how different modeling paradigms behave under these practical challenges. We present TS-1M, a large-scale and globally diverse traffic sign dataset comprising over one million real-world images across 454 standardized categories, together with a diagnostic benchmark designed to analyze model capability boundaries. Beyond standard train-test evaluation, we provide a suite of challenge-oriented settings, including cross-region recognition, rare-class identification, low-clarity robustness, and semantic text understanding, enabling systematic and fine-grained assessment of modern TSR models. Using TS-1M, we conduct a unified benchmark across three representative learning paradigms: classical supervised models, self-supervised pretrained models, and multimodal vision-language models (VLMs). Our analysis reveals consistent paradigm-dependent behaviors, showing that semantic alignment is a key factor for cross-region generalization and rare-category recognition, while purely visual models remain sensitive to appearance shift and data imbalance. Finally, we validate the practical relevance of TS-1M through real-scene autonomous driving experiments, where traffic sign recognition is integrated with semantic reasoning and spatial localization to support map-level decision constraints. Overall, TS-1M establishes a reference-level diagnostic benchmark for TSR and provides principled insights into robust and semantic-aware traffic sign perception. Project page: https://guoyangzhao.github.io/projects/ts1m.
Abstract（参考訳）: 交通信号認識(Traffic Sign Recognition, TSR)は、自律運転における中核的な認識能力である。認識精度は着実に向上しているが、既存のトラフィックサインデータセットとベンチマークは、これらの実践的な課題の下で異なるモデリングパラダイムがどのように振る舞うかについて、限られた診断洞察を提供する。 TS-1Mは,454の標準化カテゴリにまたがる100万以上の実世界の画像からなる大規模かつ多種多様なトラフィックサインデータセットであり,モデル能力境界を解析するための診断ベンチマークである。標準列車試験の評価以外にも、クロスリージョン認識、レアクラス識別、低明度頑健性、意味的テキスト理解など、最新のTSRモデルの体系的かつきめ細かな評価を可能にする、課題指向の一連の設定を提供する。 TS-1Mを用いて、古典的教師付きモデル、自己教師付き事前学習モデル、マルチモーダル視覚言語モデル(VLM)の3つの代表的な学習パラダイムの統一ベンチマークを行う。分析の結果, セマンティックアライメントは地域横断の一般化と希少なカテゴリー認識の鍵となる要因であり, 外観変化やデータの不均衡に対して純粋に視覚モデルが敏感であることが判明した。最後に,交通標識認識と意味論的推論と空間的位置決めを統合し,マップレベルの決定制約をサポートする実環境自律運転実験を通じて,TS-1Mの実用的妥当性を検証する。全体としてTS-1Mは、TSRの基準レベル診断ベンチマークを確立し、ロバストでセマンティックなトラフィックサインの認識に関する原則的な洞察を提供する。プロジェクトページ: https://guoyangzhao.github.io/projects/ts1m。

論文の概要: Traffic Sign Recognition in Autonomous Driving: Dataset, Benchmark, and Field Experiment

関連論文リスト