Fugu-MT Paper Translation (Abstract): MARIS: Marine Open-Vocabulary Instance Segmentation with Geometric Enhancement and Semantic Alignment

Paper overview: MARIS: Marine Open-Vocabulary Instance Segmentation with Geometric Enhancement and Semantic Alignment

arxiv url: http://arxiv.org/abs/2510.15398v2
Date: Thu, 23 Oct 2025 07:18:58 GMT
Status: Translation complete
System update date: 2025-10-25 03:08:11.546673
Title: MARIS: Marine Open-Vocabulary Instance Segmentation with Geometric Enhancement and Semantic Alignment
Title (reference translation): MARIS: Marine Open-Vocabulary Instance Segmentation Combining Geometric Enhancement and Semantic Alignment
Authors: Bingyu Li, Feiyu Wang, Da Zhang, Zhiyuan Zhao, Junyu Gao, Xuelong Li
Abstract summary: We introduce MARIS (Marine Open-Vocabulary Instance Segmentation), a large-scale benchmark for underwater Open-Vocabulary (OV) segmentation. Our framework consistently outperforms existing OV baselines in both In-Domain and Cross-Domain settings.
Reference score (independently computed attention score): 56.88334234553316
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Most existing underwater instance segmentation approaches are constrained by closed-vocabulary prediction, limiting their ability to recognize novel marine categories. To support evaluation, we introduce \textbf{MARIS} (\underline{Mar}ine Open-Vocabulary \underline{I}nstance \underline{S}egmentation), the first large-scale fine-grained benchmark for underwater Open-Vocabulary (OV) segmentation, featuring a limited set of seen categories and diverse unseen categories. Although OV segmentation has shown promise on natural images, our analysis reveals that transfer to underwater scenes suffers from severe visual degradation (e.g., color attenuation) and semantic misalignment caused by a lack of underwater class definitions. To address these issues, we propose a unified framework with two complementary components. The Geometric Prior Enhancement Module (\textbf{GPEM}) leverages stable part-level and structural cues to maintain object consistency under degraded visual conditions. The Semantic Alignment Injection Mechanism (\textbf{SAIM}) enriches language embeddings with domain-specific priors, mitigating semantic ambiguity and improving recognition of unseen categories. Experiments show that our framework consistently outperforms existing OV baselines in both In-Domain and Cross-Domain settings on MARIS, establishing a strong foundation for future underwater perception research.
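Abstract (reference translation): Existing underwater instance segmentation approaches are constrained by closed-vocabulary prediction, which limits their ability to recognize novel marine categories. To support evaluation, we introduce \textbf{MARIS} (\underline{Mar}ine Open-Vocabulary \underline{I}nstance \underline{S}egmentation), a large-scale benchmark for underwater Open-Vocabulary (OV) segmentation. Although OV segmentation is promising on natural images, our analysis shows that transfer to underwater scenes suffers from visual degradation (e.g., color attenuation) and semantic misalignment caused by a lack of underwater class definitions. To address these issues, we propose a unified framework with two complementary components. The Geometric Prior Enhancement Module (\textbf{GPEM}) uses stable part-level and structural cues to maintain object consistency under degraded visual conditions. The Semantic Alignment Injection Mechanism (\textbf{SAIM}) enriches language embeddings with domain-specific priors, mitigating semantic ambiguity and improving recognition of unseen categories. Experiments show that our framework consistently outperforms existing OV baselines in both In-Domain and Cross-Domain settings on MARIS, establishing a strong foundation for future underwater perception research.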

Related papers