Fugu-MT 論文翻訳(概要): Point2RBox-v3: Self-Bootstrapping from Point Annotations via Integrated Pseudo-Label Refinement and Utilization

論文の概要: Point2RBox-v3: Self-Bootstrapping from Point Annotations via Integrated Pseudo-Label Refinement and Utilization

arxiv url: http://arxiv.org/abs/2509.26281v2
Date: Wed, 08 Oct 2025 03:36:37 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-09 14:21:18.177677
Title: Point2RBox-v3: Self-Bootstrapping from Point Annotations via Integrated Pseudo-Label Refinement and Utilization
Title（参考訳）: ポイント2RBox-v3:統合擬似ラベルリファインメントと利用によるポイントアノテーションからの自己ブートストラップ
Authors: Teng Zhang, Ziqian Fan, Mingxin Liu, Xin Zhang, Xudong Lu, Wentong Li, Yue Zhou, Yi Yu, Xiang Li, Junchi Yan, Xue Yang,
Abstract要約: Point2RBox-v3は、ラベル割り当てに動的擬似ラベルを使用する最初のモデルである。我々のソリューションは、特にオブジェクトサイズやスパースオブジェクトの発生に大きなバリエーションがあるシナリオにおいて、競争性能を提供します。
参考スコア（独自算出の注目度）: 58.42853147118086
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Driven by the growing need for Oriented Object Detection (OOD), learning from point annotations under a weakly-supervised framework has emerged as a promising alternative to costly and laborious manual labeling. In this paper, we discuss two deficiencies in existing point-supervised methods: inefficient utilization and poor quality of pseudo labels. Therefore, we present Point2RBox-v3. At the core are two principles: 1) Progressive Label Assignment (PLA). It dynamically estimates instance sizes in a coarse yet intelligent manner at different stages of the training process, enabling the use of label assignment methods. 2) Prior-Guided Dynamic Mask Loss (PGDM-Loss). It is an enhancement of the Voronoi Watershed Loss from Point2RBox-v2, which overcomes the shortcomings of Watershed in its poor performance in sparse scenes and SAM's poor performance in dense scenes. To our knowledge, Point2RBox-v3 is the first model to employ dynamic pseudo labels for label assignment, and it creatively complements the advantages of SAM model with the watershed algorithm, which achieves excellent performance in both sparse and dense scenes. Our solution gives competitive performance, especially in scenarios with large variations in object size or sparse object occurrences: 66.09%/56.86%/41.28%/46.40%/19.60%/45.96% on DOTA-v1.0/DOTA-v1.5/DOTA-v2.0/DIOR/STAR/RSAR.
Abstract（参考訳）: オブジェクト指向オブジェクト検出(OOD)の必要性が高まる中、弱い教師付きフレームワークの下でポイントアノテーションから学ぶことが、コストと労力のかかる手作業によるラベリングに代わる有望な選択肢として現れました。本稿では,既存の点管理手法における2つの欠陥について論じる:非効率な利用と擬似ラベルの品質の低下。したがって、Point2RBox-v3を示す。中心となる原則は2つある。 1)プログレッシブ・ラベル・アサインメント(PLA) トレーニングプロセスの異なる段階で、粗いがインテリジェントな方法でインスタンスサイズを動的に推定し、ラベル割り当てメソッドの使用を可能にする。 2)プリエントガイド式ダイナミックマスクロス(PGDM-Loss)。ポイント2RBox-v2のヴォロノイ・ウォーターッシュ・ロスの強化であり、華やかなシーンではウォーターッシュの欠点を克服し、密集したシーンではSAMの貧弱なパフォーマンスを克服している。我々の知る限り、Point2RBox-v3はラベル代入に動的擬似ラベルを用いた最初のモデルであり、SAMモデルの利点を創意的に補完し、疎密なシーンと密なシーンの両方で優れたパフォーマンスを実現する。 66.09%/56.86%/41.28%/46.40%/19.60%/45.96% on DOTA-v1.0/DOTA-v1.5/DOTA-v1.5/DIOR/STAR/RSAR。

論文の概要: Point2RBox-v3: Self-Bootstrapping from Point Annotations via Integrated Pseudo-Label Refinement and Utilization

関連論文リスト