Fugu-MT 論文翻訳(概要): Tiny-YOLOSAM: Fast Hybrid Image Segmentation

論文の概要: Tiny-YOLOSAM: Fast Hybrid Image Segmentation

arxiv url: http://arxiv.org/abs/2512.22193v1
Date: Sat, 20 Dec 2025 12:28:39 GMT
ステータス: 翻訳完了
システム内更新日: 2025-12-30 22:37:29.873913
Title: Tiny-YOLOSAM: Fast Hybrid Image Segmentation
Title（参考訳）: Tiny-YOLOSAM: 高速ハイブリッドイメージセグメンテーション
Authors: Kenneth Xu, Songhan Wu,
Abstract要約: TinySAMは軽量で蒸留されたSAMで、強力なゼロショットマスクの品質を保っている。 Tiny-YOLOSAMは高速なハイブリッドパイプラインで、最近のYOLO検出器を使って、TinySAMの箱プロンプトを生成する。 COCO val 2017では、ハイブリッドシステムはクラスに依存しない範囲を大幅に改善し(ARは16.4%から77.1%、mIoUは19.2%から67.8%)、エンドツーエンドのランタイムは49.20s/imageから10.39s/image (4.7x)まで減らした。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The Segment Anything Model (SAM) enables promptable, high-quality segmentation but is often too computationally expensive for latency-critical settings. TinySAM is a lightweight, distilled SAM variant that preserves strong zero-shot mask quality, yet its "segment-everything" mode still requires hundreds of prompts and remains slow in practice. We first replicate TinySAM on COCO val2017 using official checkpoints, matching the reported AP within 0.03%, establishing a reliable experimental baseline. Building on this, we propose Tiny-YOLOSAM, a fast hybrid pipeline that uses a recent YOLO detector (YOLOv12) to generate box prompts for TinySAM on salient foreground objects, and supplements uncovered regions with sparse point prompts sampled only where YOLO-guided masks provide no coverage. On COCO val2017, the hybrid system substantially improves class-agnostic coverage (AR from 16.4% to 77.1%, mIoU from 19.2% to 67.8%) while reducing end-to-end runtime from 49.20s/image to 10.39s/image (4.7x) on an Apple M1 Pro CPU. These results suggest detector-guided prompting combined with targeted sparse sampling as an effective alternative to dense "segment-everything" prompting for practical full-scene segmentation.
Abstract（参考訳）: Segment Anything Model (SAM)は、高速で高品質なセグメンテーションを可能にするが、レイテンシクリティカルな設定では計算コストが高すぎることが多い。 TinySAMは軽量で蒸留されたSAMで、強力なゼロショットマスクの品質を保っているが、"セグメンテーション・オールシング"モードは依然として数百のプロンプトを必要とし、実際は遅いままである。我々は最初に公式チェックポイントを使用してCOCO val2017上でTinySAMを複製し、報告されたAPを0.03%以内でマッチングし、信頼性の高い実験ベースラインを確立した。そこで我々は,最新のYOLO検出器(YOLOv12)を用いた高速ハイブリッドパイプラインであるTiny-YOLOSAMを提案する。 COCO val2017では、ハイブリッドシステムはクラスに依存しない範囲を大幅に改善し(ARは16.4%から77.1%、mIoUは19.2%から67.8%)、エンドツーエンドのランタイムは49.20s/imageから10.39s/image (4.7x)まで減らした。これらの結果から, ターゲットスパースサンプリングを併用した検出器誘導プロンプトが, フルシーンセグメンテーションに有効な「セグメンテーション」の代替となることが示唆された。

論文の概要: Tiny-YOLOSAM: Fast Hybrid Image Segmentation

関連論文リスト