Fugu-MT 論文翻訳(概要): SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement

論文の概要: SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement

arxiv url: http://arxiv.org/abs/2502.06756v1
Date: Mon, 10 Feb 2025 18:33:15 GMT
ステータス: 翻訳完了
システム内更新日: 2025-02-11 18:57:51.646864
Title: SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement
Title（参考訳）: SAMRefiner:Universal Mask Refinementのためのセグメンテーションモデル
Authors: Yuqi Lin, Hengjia Li, Wenqi Shao, Zheng Yang, Jun Zhao, Xiaofei He, Ping Luo, Kaipeng Zhang,
Abstract要約: マスク改善タスクにSAMを適用することで,汎用的で効率的なアプローチを提案する。具体的には,SAMの多様な入力プロンプトをマイニングするためのマルチプロンプト掘削手法を提案する。ターゲットデータセット上のジェネリックSAMRefinerのパフォーマンスをさらに向上するため、IoU適応ステップを追加してSAMRefiner++にメソッドを拡張します。
参考スコア（独自算出の注目度）: 40.37217744643069
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: In this paper, we explore a principal way to enhance the quality of widely pre-existing coarse masks, enabling them to serve as reliable training data for segmentation models to reduce the annotation cost. In contrast to prior refinement techniques that are tailored to specific models or tasks in a close-world manner, we propose SAMRefiner, a universal and efficient approach by adapting SAM to the mask refinement task. The core technique of our model is the noise-tolerant prompting scheme. Specifically, we introduce a multi-prompt excavation strategy to mine diverse input prompts for SAM (i.e., distance-guided points, context-aware elastic bounding boxes, and Gaussian-style masks) from initial coarse masks. These prompts can collaborate with each other to mitigate the effect of defects in coarse masks. In particular, considering the difficulty of SAM to handle the multi-object case in semantic segmentation, we introduce a split-then-merge (STM) pipeline. Additionally, we extend our method to SAMRefiner++ by introducing an additional IoU adaption step to further boost the performance of the generic SAMRefiner on the target dataset. This step is self-boosted and requires no additional annotation. The proposed framework is versatile and can flexibly cooperate with existing segmentation methods. We evaluate our mask framework on a wide range of benchmarks under different settings, demonstrating better accuracy and efficiency. SAMRefiner holds significant potential to expedite the evolution of refinement tools. Our code is available at https://github.com/linyq2117/SAMRefiner.
Abstract（参考訳）: 本稿では,既存の粗いマスクの品質を高めるための主要な手法について検討し,セグメンテーションモデルのための信頼性のあるトレーニングデータとして機能し,アノテーションのコストを低減できることを示す。本研究では,マスク精錬作業にSAMを適応させることによる汎用的で効率的なアプローチであるSAMRefinerを提案する。我々のモデルの中核となる技術は、耐雑音性プロンプト方式である。具体的には,初期粗いマスクからSAM(距離誘導点,コンテキスト対応弾性バウンディングボックス,ガウス式マスク)の多様な入力プロンプトを抽出するためのマルチプロンプト掘削手法を提案する。これらのプロンプトは互いに協力して、粗いマスクの欠陥の影響を軽減することができる。特に,セマンティックセグメンテーションにおける多目的ケースの扱いの難しさを考慮し,STMパイプラインを導入する。さらに、ターゲットデータセット上のジェネリックSAMRefinerの性能をさらに向上させるために、IoU適応ステップを追加してSAMRefiner++にメソッドを拡張します。このステップは自己ブーイングされ、追加のアノテーションを必要としない。提案するフレームワークは汎用的で,既存のセグメンテーション手法と柔軟に連携することができる。マスクフレームワークを様々な設定で幅広いベンチマークで評価し、精度と効率性を実証した。 SAMRefinerは、改良ツールの進化を早める大きな可能性を秘めている。私たちのコードはhttps://github.com/linyq2117/SAMRefiner.comから入手可能です。

関連論文リスト

E-SAM: Training-Free Segment Every Entity Model [22.29478489117426]
特有なES能力を示す新しいトレーニングフリーフレームワークであるE-SAMを紹介する。 E-SAMは、以前のESメソッドと比較して最先端のパフォーマンスを実現し、ベンチマークメトリクスで+30.1で大幅に改善されている。
論文参考訳（メタデータ） (2025-03-15T11:41:33Z)
SAQ-SAM: Semantically-Aligned Quantization for Segment Anything Model [9.381558154295012]
本稿では,クリッピング基準として重なり合う注意力を利用した知覚一貫性クリッピングを提案する。また,マスクデコーダのクロスアテンション応答を活用することで,視覚的プロンプトインタラクションを取り入れたPrompt-Aware Reconstructionを提案する。本手法は, セグメンテーションタスクにおいて, ベースラインよりも11.7%高いmAPを実現する。
論文参考訳（メタデータ） (2025-03-09T08:38:32Z)
Promptable Anomaly Segmentation with SAM Through Self-Perception Tuning [63.55145330447408]
異常セグメンテーションのための textbfSelf-textbfPerceptinon textbfTuning (textbfSPT) 法を提案する。 SPT法は, 自己描画型チューニング戦略を取り入れ, 異常マスクの初期粗いドラフトを生成し, 精製処理を行う。
論文参考訳（メタデータ） (2024-11-26T08:33:25Z)
Bridge the Points: Graph-based Few-shot Segment Anything Semantically [79.1519244940518]
プレトレーニング技術の最近の進歩により、視覚基礎モデルの能力が向上した。最近の研究はSAMをFew-shot Semantic segmentation (FSS)に拡張している。本稿では,グラフ解析に基づく簡易かつ効果的な手法を提案する。
論文参考訳（メタデータ） (2024-10-09T15:02:28Z)
Adapting Segment Anything Model for Unseen Object Instance Segmentation [70.60171342436092]
Unseen Object Instance(UOIS)は、非構造環境で動作する自律ロボットにとって不可欠である。 UOISタスクのためのデータ効率のよいソリューションであるUOIS-SAMを提案する。 UOIS-SAMは、(i)HeatmapベースのPrompt Generator(HPG)と(ii)SAMのマスクデコーダに適応する階層識別ネットワーク(HDNet)の2つの重要なコンポーネントを統合する。
論文参考訳（メタデータ） (2024-09-23T19:05:50Z)
AlignSAM: Aligning Segment Anything Model to Open Context via Reinforcement Learning [61.666973416903005]
Segment Anything Model (SAM)は、オープンワールドシナリオにおいて、プロンプトのガイダンスによって、その印象的な一般化機能を実証した。オープンコンテキストにSAMをアライメントするための自動プロンプトのための新しいフレームワークAlignSAMを提案する。
論文参考訳（メタデータ） (2024-06-01T16:21:39Z)
PosSAM: Panoptic Open-vocabulary Segment Anything [58.72494640363136]
PosSAMはオープン・ボキャブラリ・パノプティ・セグメンテーション・モデルであり、Segment Anything Model(SAM)の強みを、エンドツーエンドのフレームワークで視覚ネイティブのCLIPモデルと統合する。本稿では,マスクの質を適応的に向上し,各画像の推論中にオープン語彙分類の性能を高めるマスク対応選択組立アルゴリズムを提案する。
論文参考訳（メタデータ） (2024-03-14T17:55:03Z)
WSI-SAM: Multi-resolution Segment Anything Model (SAM) for histopathology whole-slide images [8.179859593451285]
病理画像の正確なオブジェクト分割機能を備えたWSI-SAM, Segment Anything Model (SAM) を提案する。トレーニングオーバーヘッドを最小限にしながら、トレーニング済みの知識を完全に活用するために、SAMは凍結し、最小限のパラメータしか導入しません。本モデルでは, 膵管癌 in situ (DCIS) セグメンテーションタスクと乳癌転移セグメンテーションタスクにおいて, SAMを4.1, 2.5パーセント上回った。
論文参考訳（メタデータ） (2024-03-14T10:30:43Z)

関連論文リストは本サイト内にある論文のタイトル・アブストラクトから自動的に作成しています。

指定された論文の情報です。
本サイトの運営者は本サイト（すべての情報・翻訳含む）の品質を保証せず、本サイト（すべての情報・翻訳含む）を使用して発生したあらゆる結果について一切の責任を負いません。