Fugu-MT Paper Translation (Summary): SGMA: Semantic-Guided Modality-Aware Segmentation for Remote Sensing with Incomplete Multimodal Data

Paper summary: SGMA: Semantic-Guided Modality-Aware Segmentation for Remote Sensing with Incomplete Multimodal Data

arxiv url: http://arxiv.org/abs/2603.02505v1
Date: Tue, 03 Mar 2026 01:28:21 GMT
Status: Translation complete
Last updated in system: 2026-03-04 21:38:10.594026
Title: SGMA: Semantic-Guided Modality-Aware Segmentation for Remote Sensing with Incomplete Multimodal Data
Title (reference translation): SGMA: Semantic-Guided Modality-Aware Segmentation for Remote Sensing Using Incomplete Multimodal Data
Authors: Lekang Wen, Liang Liao, Jing Xiao, Mi Wang
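Abstract summary: Multimodal semantic segmentation integrates complementary information from diverse sensors for remote sensing Earth observation. Incomplete Multimodal Semantic Segmentation (IMSS) faces three key challenges: multimodal imbalance, in which dominant modalities suppress fragile ones; intra-class variation in scale, shape, and orientation; and cross-modal heterogeneity, in which conflicting cues produce inconsistent semantic responses. The paper proposes the Semantic-Guided Modality-Aware (SGMA) framework, which achieves balanced multimodal learning while reducing intra-class variation and reconciling cross-modal inconsistencies through semantic guidance.
Reference score (attention score computed independently by this site): 31.146366498415784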
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Multimodal semantic segmentation integrates complementary information from diverse sensors for remote sensing Earth observation. However, practical systems often encounter missing modalities due to sensor failures or incomplete coverage, a setting termed Incomplete Multimodal Semantic Segmentation (IMSS). IMSS faces three key challenges: (1) multimodal imbalance, where dominant modalities suppress fragile ones; (2) intra-class variation in scale, shape, and orientation across modalities; and (3) cross-modal heterogeneity with conflicting cues producing inconsistent semantic responses. Existing methods rely on contrastive learning or joint optimization, which risk over-alignment (discarding modality-specific cues) or imbalanced training (favoring robust modalities), while largely overlooking intra-class variation and cross-modal heterogeneity. To address these limitations, we propose the Semantic-Guided Modality-Aware (SGMA) framework, which ensures balanced multimodal learning while reducing intra-class variation and reconciling cross-modal inconsistencies through semantic guidance. SGMA introduces two complementary plug-and-play modules: (1) the Semantic-Guided Fusion (SGF) module, which extracts multi-scale, class-wise semantic prototypes that capture consistent categorical representations across modalities, estimates per-modality robustness based on prototype-feature alignment, and performs adaptive fusion weighted by robustness scores to mitigate intra-class variation and cross-modal heterogeneity; and (2) the Modality-Aware Sampling (MAS) module, which leverages robustness estimations from SGF to dynamically reweight training samples, prioritizing challenging samples from fragile modalities to address modality imbalance. Extensive experiments across multiple datasets and backbones demonstrate that SGMA consistently outperforms state-of-the-art methods, with particularly significant improvements in fragile modalities.
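Abstract (reference translation): Multimodal semantic segmentation integrates complementary information from diverse sensors for remote sensing Earth observation. In practice, however, modalities are often missing because of sensor failures or incomplete coverage, a setting called Incomplete Multimodal Semantic Segmentation (IMSS). IMSS faces three challenges: (1) multimodal imbalance, in which dominant modalities suppress fragile ones; (2) intra-class variation in scale, shape, and orientation; and (3) cross-modal heterogeneity, in which conflicting cues produce inconsistent semantic responses. Existing methods rely on contrastive learning or joint optimization, which risk over-alignment that discards modality-specific cues or imbalanced training that favors robust modalities, while overlooking intra-class variation and cross-modal heterogeneity. To address these limitations, the paper proposes the Semantic-Guided Modality-Aware (SGMA) framework. SGMA introduces two complementary plug-and-play modules: (1) the Semantic-Guided Fusion (SGF) module extracts multi-scale, class-wise semantic prototypes that capture consistent categorical representations across modalities, estimates per-modality robustness from prototype-feature alignment, and performs adaptive fusion weighted by the robustness scores to mitigate intra-class variation and cross-modal heterogeneity; (2) the Modality-Aware Sampling (MAS) module uses the robustness estimates from SGF to dynamically reweight training samples, prioritizing challenging samples from fragile modalities. Extensive experiments across multiple datasets and backbones show that SGMA consistently outperforms state-of-the-art methods, with especially large gains on fragile modalities.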
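
The SGF and MAS steps described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no formulas, so the choices here are assumptions made only for illustration (prototypes as per-class mean features pooled over all modalities, robustness as mean cosine similarity to the class prototype, softmax weighting for fusion and sampling, a single scale instead of multi-scale). The modality names `optical` and `sar` and all function names are hypothetical.

```python
import math

def cosine(u, v):
    """Cosine similarity between two vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm > 0 else 0.0

def shared_prototypes(modal_features, labels, num_classes):
    """Assumed prototype: mean feature per class, pooled over all modalities."""
    dim = len(next(iter(modal_features.values()))[0])
    sums = [[0.0] * dim for _ in range(num_classes)]
    counts = [0] * num_classes
    for feats in modal_features.values():
        for f, y in zip(feats, labels):
            counts[y] += 1
            for d in range(dim):
                sums[y][d] += f[d]
    return [[s / c for s in row] if c else row for row, c in zip(sums, counts)]

def robustness(feats, labels, prototypes):
    """Assumed robustness: mean prototype-feature cosine alignment."""
    sims = [cosine(f, prototypes[y]) for f, y in zip(feats, labels)]
    return sum(sims) / len(sims)

def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def fuse(modal_features, weights):
    """Robustness-weighted sum of per-modality features, pixel by pixel."""
    names = list(modal_features)
    n = len(modal_features[names[0]])
    dim = len(modal_features[names[0]][0])
    return [
        [sum(weights[m] * modal_features[m][i][d] for m in names) for d in range(dim)]
        for i in range(n)
    ]

# Toy data: 4 pixels, 2 classes, 2-D features; "sar" is deliberately noisier.
labels = [0, 0, 1, 1]
modal_features = {
    "optical": [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]],
    "sar": [[1.0, 0.0], [0.0, 1.0], [0.0, 1.0], [1.0, 0.0]],
}

protos = shared_prototypes(modal_features, labels, num_classes=2)
scores = {m: robustness(f, labels, protos) for m, f in modal_features.items()}
names = list(scores)

# SGF-style step: more robust modalities get larger fusion weights.
fusion_w = dict(zip(names, softmax([scores[m] for m in names])))
fused = fuse(modal_features, fusion_w)

# MAS-style step: less robust (fragile) modalities get larger sampling weights.
sampling_w = dict(zip(names, softmax([-scores[m] for m in names])))
```

In this toy setup the noisier modality aligns less well with the shared prototypes, so it receives a smaller fusion weight and a larger sampling weight, which mirrors the balancing idea the abstract describes. The actual weighting functions used in the paper may differ.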

Related papers