Fugu-MT 論文翻訳(概要): Aqua Boundary-Saliency Attention Module for Lightweight Underwater Salient Instance Segmentation Detection Transformer

論文の概要: Aqua Boundary-Saliency Attention Module for Lightweight Underwater Salient Instance Segmentation Detection Transformer

arxiv url: http://arxiv.org/abs/2606.08002v1
Date: Sat, 06 Jun 2026 06:43:01 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-09 14:42:05.611962
Title: Aqua Boundary-Saliency Attention Module for Lightweight Underwater Salient Instance Segmentation Detection Transformer
Title（参考訳）: 軽量水中塩分セグメンテーション検出変圧器のアクアバウンダリサリアンスアテンションモジュール
Authors: M. Fazri Nizar, Julian Supardi, Muhammad Naufal Rachmatullah,
Abstract要約: 水中のインスタンスセグメンテーションは、海洋資源探索、生態モニタリング、水中ロボット知覚のためのピクセルレベルのマスク予測とインスタンスレベルの識別を統合している。本研究では,アクアバウンダリ・サリエンシ・アテンションモジュール(AquaBSAM)を中心に構築された小型検出・変圧器フレームワークLUSIS-DETRを紹介する。 AquaBSAMは、水中境界、コントラスト、減衰、クロマ、暗チャネル、そしてDINOv2でコーティングされたマルチスケールの特徴を、有界残留変調によって埋め込む。
参考スコア（独自算出の注目度）: 0.41292255339309664
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Underwater instance segmentation integrates pixel-level mask prediction and instance-level discrimination for marine resource exploration, ecological monitoring, and underwater robotic perception. Recent prompt-based and auxiliary-modality methods improve mask quality, but their reliance on large foundation models, prompt generation, or extra modality estimation complicates efficient deployment. This work introduces Lightweight Underwater Salient Instance Segmentation Detection Transformer (LUSIS-DETR), a compact detection-transformer framework built around the Aqua Boundary-Saliency Attention Module (AquaBSAM). AquaBSAM embeds underwater boundary, contrast, attenuation, chroma, dark-channel, and center-prior cues into DINOv2-initialized multi-scale features through bounded residual modulation, while auxiliary mask supervision and small-object copy-paste are training-only. Extensive evaluation on four recent underwater instance segmentation datasets, UIIS, UIIS10K, USIS10K, and USIS16K, shows competitively leading performance against previous state-of-the-art works across category-aware and salient-instance protocols. TensorRT half-precision (FP16) benchmarking on an NVIDIA T4 graphics processing unit (GPU) achieves 4.31-6.34 milliseconds (ms) latency, supporting real-time inference under an accessible reproduction setting.
Abstract（参考訳）: 水中のインスタンスセグメンテーションは、海洋資源探索、生態モニタリング、水中ロボット知覚のためのピクセルレベルのマスク予測とインスタンスレベルの識別を統合している。最近のプロンプトベースおよび補助モダリティ手法はマスクの品質を向上させるが、大きな基礎モデル、即時生成、または余剰モダリティ推定に依存するため、効率的な展開が複雑になる。本研究は,アクアバウンダリ・サリエンシ・アテンション・モジュール(AquaBSAM)を中心に構築されたコンパクトな検出・トランスフォーマフレームワークであるLUSIS-DETR(Lightweight Underwater Salient Instance Segmentation Detection Transformer)を紹介する。 AquaBSAMは、水中境界、コントラスト、減衰、クロマ、暗チャネル、および中心優先のキューを、有界残留変調によるDINOv2初期化マルチスケール特徴に埋め込む。最近の4つの水中インスタンスセグメンテーションデータセット(UIIS, UIIS10K, USIS10K, USIS16K)に対する広範囲な評価は、カテゴリ認識およびサージェントインスタンスプロトコル間の従来の最先端の作業に対して競争力のあるパフォーマンスを示している。 TensorRT half-precision (FP16)ベンチマークではNVIDIA T4グラフィックス処理ユニット(GPU)が4.31-6.34ミリ秒(ms)のレイテンシを達成した。

論文の概要: Aqua Boundary-Saliency Attention Module for Lightweight Underwater Salient Instance Segmentation Detection Transformer

関連論文リスト