Fugu-MT 論文翻訳(概要): Dense Point-to-Mask Optimization with Reinforced Point Selection for Crowd Instance Segmentation

論文の概要: Dense Point-to-Mask Optimization with Reinforced Point Selection for Crowd Instance Segmentation

arxiv url: http://arxiv.org/abs/2604.01742v1
Date: Thu, 02 Apr 2026 08:02:49 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-03 14:21:10.603152
Title: Dense Point-to-Mask Optimization with Reinforced Point Selection for Crowd Instance Segmentation
Title（参考訳）: クラウド・インスタンス・セグメンテーションのための強化点選択によるDense Point-to-Mask最適化
Authors: Hongru Chen, Jiyang Huang, Jia Wan, Antoni B. Chan,
Abstract要約: 我々はポイントアノテーションから高密度なインスタンスセグメンテーションを生成するためにDPMO(Point-to-Mask Optimization)を提案する。そこで,高密度群集のインスタンスセグメンテーションを予測するために,Reinforced Point Selection (RPS) フレームワークを提案する。上海技術、UCF-QNRF、JHU-CROWD++、NWPU-Crowdデータセット上で、最先端のクラウドインスタンスセグメンテーション性能を実現する。
参考スコア（独自算出の注目度）: 45.48207887391471
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Crowd instance segmentation is a crucial task with a wide range of applications, including surveillance and transportation. Currently, point labels are common in crowd datasets, while region labels (e.g., boxes) are rare and inaccurate. The masks obtained through segmentation help to improve the accuracy of region labels and resolve the correspondence between individual location coordinates and crowd density maps. However, directly applying currently popular large foundation models such as SAM does not yield ideal results in dense crowds. To this end, we first propose Dense Point-to-Mask Optimization (DPMO), which integrates SAM with the Nearest Neighbor Exclusive Circle (NNEC) constraint to generate dense instance segmentation from point annotations. With DPMO and manual correction, we obtain mask annotations from the existing point annotations for traditional crowd datasets. Then, to predict instance segmentation in dense crowds, we propose a Reinforced Point Selection (RPS) framework trained with Group Relative Policy Optimization (GRPO), which selects the best predicted point from a sampling of the initial point prediction. Through extensive experiments, we achieve state-of-the-art crowd instance segmentation performance on ShanghaiTech, UCF-QNRF, JHU-CROWD++, and NWPU-Crowd datasets. Furthermore, we design new loss functions supervised by masks that boost counting performance across different models, demonstrating the significant role of mask annotations in enhancing counting accuracy.
Abstract（参考訳）: クラウドインスタンスのセグメンテーションは、監視や輸送など、幅広いアプリケーションにおいて重要なタスクである。現在、ポイントラベルは群衆データセットで一般的であり、リージョンラベル(例:ボックス)はまれで不正確である。セグメンテーションによって得られたマスクは、地域ラベルの精度を改善し、個々の位置座標と群衆密度マップとの対応を解消するのに役立つ。しかし、SAMのような現在人気の高い大規模基盤モデルを直接適用しても、密集した群衆には理想的な結果が得られない。そこで我々はまず,SAMとNNEC(Nearest Neighbor Exclusive Circle)制約を統合したDense Point-to-Mask Optimization(DPMO)を提案し,ポイントアノテーションから高密度なインスタンスセグメンテーションを生成する。 DPMOと手動補正により,従来のクラウドデータセットに対する既存のポイントアノテーションからマスクアノテーションを得る。そこで我々は,集団相対的政策最適化 (GRPO) で訓練されたReinforced Point Selection (RPS) フレームワークを提案する。大規模な実験により,上海技術,UCF-QNRF,JHU-CROWD++,NWPU-Crowdデータセット上での最先端のクラウドインスタンスセグメンテーション性能を実現する。さらに,マスクによる新たな損失関数を設計し,異なるモデル間でのカウント性能を高めるとともに,カウント精度を高める上でマスクアノテーションが重要な役割を担っていることを示す。

論文の概要: Dense Point-to-Mask Optimization with Reinforced Point Selection for Crowd Instance Segmentation

関連論文リスト