Fugu-MT 論文翻訳(概要): SynMVCrowd: A Large Synthetic Benchmark for Multi-view Crowd Counting and Localization

論文の概要: SynMVCrowd: A Large Synthetic Benchmark for Multi-view Crowd Counting and Localization

arxiv url: http://arxiv.org/abs/2603.23956v1
Date: Wed, 25 Mar 2026 05:34:24 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-26 21:06:11.146096
Title: SynMVCrowd: A Large Synthetic Benchmark for Multi-view Crowd Counting and Localization
Title（参考訳）: SynMVCrowd: マルチビューのクラウドカウントとローカライゼーションのための大規模なシンセティックベンチマーク
Authors: Qi Zhang, Daijie Chen, Yunfei Gong, Hui Huang,
Abstract要約: 群衆数, カメラビュー, フレームを限定した比較的小さなシーンにおいて, 既存の複数視点の群集カウントと位置決め手法を評価した。マルチビュー・クラウド・カウントとローカライズ・タスクのより実践的な評価と比較を行うため,大規模な総合ベンチマークであるSynMVCrowdを提案する。
参考スコア（独自算出の注目度）: 13.590728974745787
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Existing multi-view crowd counting and localization methods are evaluated under relatively small scenes with limited crowd numbers, camera views, and frames. This makes the evaluation and comparison of existing methods impractical, as small datasets are easily overfit by these methods. To avoid these issues, 3DROM proposes a data augmentation method. Instead, in this paper, we propose a large synthetic benchmark, SynMVCrowd, for more practical evaluation and comparison of multi-view crowd counting and localization tasks. The SynMVCrowd benchmark consists of 50 synthetic scenes with a large number of multi-view frames and camera views and a much larger crowd number (up to 1000), which is more suitable for large-scene multi-view crowd vision tasks. Besides, we propose strong multi-view crowd localization and counting baselines that outperform all comparison methods on the new SynMVCrowd benchmark. Moreover, we prove that better domain transferring multi-view and single-image counting performance could be achieved with the aid of the benchmark on novel new real scenes. As a result, the proposed benchmark could advance the research for multi-view and single-image crowd counting and localization to more practical applications. The codes and datasets are here: https://github.com/zqyq/SynMVCrowd.
Abstract（参考訳）: 群衆数, カメラビュー, フレームを限定した比較的小さなシーンにおいて, 既存の複数視点の群集カウントと位置決め手法を評価した。これにより、これらの手法によって小さなデータセットが容易に過度に適合するため、既存の手法の評価と比較は現実的ではない。これらの問題を避けるため、3DROMはデータ拡張法を提案する。そこで,本稿では,マルチビュー・クラウド・カウントとローカライズ・タスクのより実践的な評価と比較を行うため,大規模な総合ベンチマークであるSynMVCrowdを提案する。 SynMVCrowdベンチマークは、多数のマルチビューフレームとカメラビューを備えた50の合成シーンと、大規模なマルチビューの群衆ビジョンタスクに適したより大きな群衆数(最大1000まで)で構成されている。さらに,新しいSynMVCrowdベンチマークにおいて,全ての比較手法を上回る,強力なマルチビュー・クラウド・ローカライゼーションとベースライン数を提案する。さらに,新しいシーンのベンチマークによって,マルチビューとシングルイメージカウントのパフォーマンスが向上できることを実証した。その結果、提案したベンチマークにより、より実用的なアプリケーションへのマルチビューと単一イメージの群衆カウントとローカライゼーションの研究が進展する可能性がある。コードとデータセットは以下の通りである。

論文の概要: SynMVCrowd: A Large Synthetic Benchmark for Multi-view Crowd Counting and Localization

関連論文リスト