Fugu-MT 論文翻訳(概要): SAPNet++: Evolving Point-Prompted Instance Segmentation with Semantic and Spatial Awareness

論文の概要: SAPNet++: Evolving Point-Prompted Instance Segmentation with Semantic and Spatial Awareness

arxiv url: http://arxiv.org/abs/2602.21762v1
Date: Wed, 25 Feb 2026 10:27:14 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-26 18:19:16.795244
Title: SAPNet++: Evolving Point-Prompted Instance Segmentation with Semantic and Spatial Awareness
Title（参考訳）: SAPNet++: 意味的および空間的認識によるポイントプロンプトインスタンスセグメンテーションの進化
Authors: Zhaoyang Wei, Xumeng Han, Xuehui Yu, Xue Yang, Guorong Li, Zhenjun Han, Jianbin Jiao,
Abstract要約: 単点アノテーションは、コスト削減のラベル付けのための視覚的タスクにおいて、ますます顕著になっている。正確にマスクを推定することを目的としたPPISタスクなど、高精度を要するタスクに挑戦する。従来、マスク生成のパラダイムとPPISを実現するための提案された選択は、通常、継承されてきた。これらのモジュールはSAPNet++で終了し、点プロンプトの粒度のあいまいさと境界の不確かさを緩和する。
参考スコア（独自算出の注目度）: 42.34831760820804
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Single-point annotation is increasingly prominent in visual tasks for labeling cost reduction. However, it challenges tasks requiring high precision, such as the point-prompted instance segmentation (PPIS) task, which aims to estimate precise masks using single-point prompts to train a segmentation network. Due to the constraints of point annotations, granularity ambiguity and boundary uncertainty arise the difficulty distinguishing between different levels of detail (eg. whole object vs. parts) and the challenge of precisely delineating object boundaries. Previous works have usually inherited the paradigm of mask generation along with proposal selection to achieve PPIS. However, proposal selection relies solely on category information, failing to resolve the ambiguity of different granularity. Furthermore, mask generators offer only finite discrete solutions that often deviate from actual masks, particularly at boundaries. To address these issues, we propose the Semantic-Aware Point-Prompted Instance Segmentation Network (SAPNet). It integrates Point Distance Guidance and Box Mining Strategy to tackle group and local issues caused by the point's granularity ambiguity. Additionally, we incorporate completeness scores within proposals to add spatial granularity awareness, enhancing multiple instance learning (MIL) in proposal selection termed S-MIL. The Multi-level Affinity Refinement conveys pixel and semantic clues, narrowing boundary uncertainty during mask refinement. These modules culminate in SAPNet++, mitigating point prompt's granularity ambiguity and boundary uncertainty and significantly improving segmentation performance. Extensive experiments on four challenging datasets validate the effectiveness of our methods, highlighting the potential to advance PPIS.
Abstract（参考訳）: 単点アノテーションは、コスト削減のラベル付けのための視覚的タスクにおいて、ますます顕著になっている。しかし, 単一点プロンプトを用いて精度の高いマスクを推定し, セグメンテーションネットワークを訓練することを目的としたPPISタスクなど, 高精度なタスクに挑戦する。点アノテーションの制約により、粒度のあいまいさと境界の不確実性は、異なる詳細レベル(例えば、全体オブジェクト対部分)と、オブジェクト境界を正確に線引きすることの難しさを生じさせる。従来、マスク生成のパラダイムとPPISを実現するための提案された選択は、通常、継承されてきた。しかし、提案の選択はカテゴリ情報にのみ依存しており、異なる粒度のあいまいさを解決できない。さらに、マスク生成器は、実際のマスク、特に境界においてしばしば逸脱する有限個の離散解のみを提供する。これらの問題に対処するため,セマンティック・アウェア・ポイント・プロンプテッド・インスタンス・セグメンテーション・ネットワーク(SAPNet)を提案する。ポイント距離誘導とボックスマイニング戦略を統合して、ポイントの粒度のあいまいさに起因するグループとローカルの問題に取り組む。さらに,提案手法に完全性スコアを組み込んで空間的粒度認識を追加し,S-MILと呼ばれる提案選択における複数インスタンス学習(MIL)を強化する。マルチレベルアフィニティ・リファインメントは画素やセマンティックな手がかりを伝達し、マスクリファインメントにおける境界の不確かさを狭める。これらのモジュールは、SAPNet++で終了し、ポイントプロンプトの粒度のあいまいさと境界の不確かさを緩和し、セグメンテーション性能を大幅に改善した。提案手法の有効性を検証し,PPISを推し進める可能性を強調した。

論文の概要: SAPNet++: Evolving Point-Prompted Instance Segmentation with Semantic and Spatial Awareness

関連論文リスト