Fugu-MT 論文翻訳(概要): BMD-45: A Large-Scale CCTV Vehicle Detection Dataset for Urban Traffic in Developing Cities

論文の概要: BMD-45: A Large-Scale CCTV Vehicle Detection Dataset for Urban Traffic in Developing Cities

arxiv url: http://arxiv.org/abs/2604.24419v1
Date: Mon, 27 Apr 2026 12:49:02 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-28 17:12:08.006266
Title: BMD-45: A Large-Scale CCTV Vehicle Detection Dataset for Urban Traffic in Developing Cities
Title（参考訳）: BMD-45:都市開発における都市交通のための大規模CCTV車両検出データセット
Authors: Akash Sharma, Chinmay Mhatre, Sankalp Gawali, Ruthvik Bokkasam, Brij Sharma, Vishwajeet Pattanaik, Punit Rathore, Raghu Krishnapuram, Vijay Gopal Kovvali, Anirban Chakraborty, Yogesh Simmhan,
Abstract要約: CCTVの固定カメラからの車両検出は、インテリジェントトランスポーテーションシステムにとって重要である。既存のベンチマークは主に、エゴ中心の運転視点や制御された航空ビューから捉えた比較的均質で高度に組織化された交通パターンを特徴としている。 BMD-45は、480Kのバウンディングボックスからなる大規模データセットで、3.6K以上のSafe City CCTVカメラから撮影された45K画像に注釈を付けている。 UA-DETRACで微調整されたモデルでは33.6%のmAP@0.50:0.95しか達成できないが、ドメイン上でのトレーニングでは83.8%であった。
参考スコア（独自算出の注目度）: 5.530813453311619
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Robust vehicle detection from fixed CCTV cameras is critical for Intelligent Transportation Systems. Yet existing benchmarks predominantly feature relatively homogeneous, highly organized traffic patterns captured from ego-centric driving perspectives or controlled aerial views. This regional and sensor view bias creates a significant gap. Models trained on datasets such as UA-DETRAC and COCO struggle to generalize to the dense, heterogeneous, disorganized traffic conditions observed in rapidly developing urban centers in emerging economies. To address this limitation, we introduce BMD-45, a large-scale dataset comprising 480K bounding boxes annotated over 45K images captured from over 3.6K operational Safe City CCTV cameras. BMD-45 contains 14 fine-grained vehicle categories, including region-specific modes such as auto-rickshaws and tempo travellers, which are not present in existing benchmarks. The dataset captures real-world deployment challenges, including extreme viewpoint variation, occlusion, and vehicle density . We establish comprehensive baselines using state-of-the-art detectors and reveal a striking domain gap: models fine-tuned on UA-DETRAC achieve only 33.6% mAP@0.50:0.95, compared to 83.8% when trained in-domain on BMD-45, representing a 2.5x improvement that persists even when accounting for novel vehicle classes. This performance gap underscores the critical need for geographically diverse traffic benchmarks and establishes BMD-45 as a baseline for developing robust perception systems in underrepresented urban environments worldwide. The dataset is available at: https://huggingface.co/datasets/iisc-aim/BMD-45.
Abstract（参考訳）: CCTVの固定カメラからのロバストな車両検出は、インテリジェントトランスポーテーションシステムにとって重要である。しかし、既存のベンチマークは主に、エゴ中心の運転視点や制御された空中視点から捉えた比較的均質で、高度に組織化された交通パターンを特徴としている。この地域的およびセンサー的ビューバイアスは、大きなギャップを生み出します。 UA-DETRACやCOCOといったデータセットで訓練されたモデルは、新興国で急速に発展している都市の中心で観測される密集的で異質で非組織的な交通条件に一般化するために苦労している。この制限に対処するため,3.6K以上のSafe City CCTVカメラから取得した45K以上の画像に注釈を付けた480Kのバウンディングボックスからなる大規模データセットであるBMD-45を導入する。 BMD-45には14の細かな車種が含まれており、例えばオートリカーやテンポトラベラーのような地域固有のモードは既存のベンチマークには存在しない。このデータセットは、極端な視点の変化、閉塞、車両密度など、現実世界のデプロイメントの課題をキャプチャする。 UA-DETRACで微調整されたモデルでは33.6% mAP@0.50:0.95しか達成できないが、BMD-45でトレーニングされたドメインでは83.8%に留まり、新しい車種を考慮しても2.5倍の改善が持続する。この性能差は、地理的に多様な交通指標の必要性を浮き彫りにし、世界規模で表現されていない都市環境において、堅牢な認識システムを開発するためのベースラインとしてBMD-45を確立している。データセットは以下の通りである。 https://huggingface.co/datasets/iisc-aim/BMD-45。

論文の概要: BMD-45: A Large-Scale CCTV Vehicle Detection Dataset for Urban Traffic in Developing Cities

関連論文リスト