Fugu-MT 論文翻訳(概要): Remote SAMsing: From Segment Anything to Segment Everything

論文の概要: Remote SAMsing: From Segment Anything to Segment Everything

arxiv url: http://arxiv.org/abs/2605.00256v1
Date: Thu, 30 Apr 2026 21:44:03 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-04 17:43:28.769837
Title: Remote SAMsing: From Segment Anything to Segment Everything
Title（参考訳）: リモートSAMsing: セグメンテーションからセグメンテーションまで
Authors: Osmar Luiz Ferreira de Carvalho, Osmar Abílio de Carvalho Júnior, Anesmar Olino de Albuquerque, Daniel Guerreiro e Silva,
Abstract要約: SAM2は自然画像上で高品質なゼロショットセグメンテーションを生成する。大きなリモートセンシングシーンに適用すると、2つの問題が浮き彫りになる。 SAM2を変更したり、トレーニングデータを必要とすることなく、両方の問題を解決するオープンソースのパイプラインであるRemote SAMsingを提案する。
参考スコア（独自算出の注目度）: 0.1749935196721634
License: http://creativecommons.org/licenses/by/4.0/
Abstract: SAM2 produces high-quality zero-shot segmentation on natural images, but applying it to large remote sensing scenes exposes two problems: (1) its mask generator faces an inherent quality-coverage trade-off: strict thresholds yield precise masks but leave most of the image unsegmented, while relaxed thresholds increase coverage at the cost of mask quality; and (2) large images must be tiled, fragmenting objects across tile boundaries. We propose Remote SAMsing, an open-source pipeline that solves both problems without modifying SAM2 or requiring training data. For coverage, a multi-pass algorithm runs SAM2 repeatedly on each tile, painting accepted masks black between passes to simplify the scene for the next iteration, and relaxing quality thresholds only when coverage gains stagnate, ensuring that the most precise masks are always captured first. For spatial consistency, contextual padding and a parameter-free best-match merge reconstruct objects fragmented across tile boundaries. Evaluated on seven scenes (5~cm to 4.78~m GSD), the pipeline raises coverage from 30--68\% (single-pass SAM2) to 91--98\%. Ablation experiments quantify the contribution of each component to coverage and detection quality. Per-class evaluation shows that SAM2 transfers well to discrete RS objects (buildings 95\%, cars 82--93\% Det@0.5) with segment boundaries 3--8$\times$ more precise than SLIC and Felzenszwalb baselines. Tile size functions as an implicit scale parameter: reducing it from $1{,}000$ to 250 raises Det@0.5 from 56\% to 85\%, outperforming SAM2's built-in multi-scale mechanism. The pipeline generalizes to MNF false-color imagery without retraining (99.5\% ASA) and scales to production-sized images: a 1.94 billion pixel Potsdam mosaic achieved 97\% coverage without quality degradation.
Abstract（参考訳）: SAM2は、自然画像に対して高品質なゼロショットセグメンテーションを生成するが、大きなリモートセンシングシーンに適用すると、(1)マスクジェネレータが固有の品質・カバートレードオフに直面している: 厳密なしきい値が正確なマスクを出力するが、ほとんどの画像は非セグメンテーションされ、一方、緩和されたしきい値はマスク品質のコストでカバレッジを増大させ、(2)大きな画像はタイル境界を越えて断片化されなければならない。 SAM2を変更したり、トレーニングデータを必要とすることなく、両方の問題を解決するオープンソースのパイプラインであるRemote SAMsingを提案する。カバレッジのために、マルチパスアルゴリズムは各タイル上でSAM2を繰り返し実行し、パス間で受け入れられたマスクを黒く塗って次のイテレーションのシーンを単純化し、カバレッジが停滞した場合にのみ品質閾値を緩和し、最も正確なマスクが最初にキャプチャされることを保証する。空間的整合性のために、コンテキストパディングとパラメータフリーのベストマッチマージはタイルの境界を越えて断片化されたオブジェクトを再構成する。 7つのシーン(5〜cm〜4.78〜m GSD)で評価され、パイプラインは30～68～%(シングルパスSAM2)から91～98～%まで範囲を拡大する。アブレーション実験は、各コンポーネントのカバレッジと検出品質への貢献を定量化する。クラスごとの評価では、SAM2 は SLIC や Felzenszwalb のベースラインよりも正確に3～8$\times$ のセグメント境界を持つ離散RSオブジェクト(ビルディング 95\%、車体 82～93\% Det@0.5)によく移行している。 1{,}000$から250に減らすと、Det@0.5が56\%から85\%に上昇し、SAM2の組み込みマルチスケールメカニズムを上回っます。パイプラインは再トレーニングせずにMNFの偽色画像(99.5\% ASA)に一般化し、プロダクションサイズの画像にスケールする:94億画素のポツダムモザイクは品質劣化なしに97\%のカバレッジを達成した。

論文の概要: Remote SAMsing: From Segment Anything to Segment Everything

関連論文リスト