Fugu-MT 論文翻訳(概要): CrossEarth-SAR: A SAR-Centric and Billion-Scale Geospatial Foundation Model for Domain Generalizable Semantic Segmentation

論文の概要: CrossEarth-SAR: A SAR-Centric and Billion-Scale Geospatial Foundation Model for Domain Generalizable Semantic Segmentation

arxiv url: http://arxiv.org/abs/2603.12008v1
Date: Thu, 12 Mar 2026 14:52:35 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-13 14:46:26.155127
Title: CrossEarth-SAR: A SAR-Centric and Billion-Scale Geospatial Foundation Model for Domain Generalizable Semantic Segmentation
Title（参考訳）: CrossEarth-SAR: 領域一般化可能なセマンティックセマンティックセグメンテーションのためのSAR中心及び数十億ドル規模の地球空間基盤モデル
Authors: Ziqi Ye, Ziyang Gong, Ning Liao, Xiaoxing Hu, Di Wang, Hongruixuan Chen, Chen Huang, Yiguo He, Yuru Jia, Xiaoxing Wang, Haipeng Wang, Xue Yang, Junchi Yan,
Abstract要約: 我々は、新しい物理誘導型スパース・オブ・エキスパート(MoE)アーキテクチャに基づいて構築された、最初の10億ドル規模のSARビジョン基盤モデルであるCrossEarth-SARを紹介する。大規模な事前トレーニングを容易にするために,公共およびプライベートなSAR画像を統合する弱く完全に教師付きデータセットであるCrossEarth-SAR-200Kを開発した。また、8つの異なる領域ギャップをまたいだ22のサブベンチマークからなるベンチマークスイートを導入し、SAR画像上でのドメイン一般化セマンティックセマンティックセグメンテーションのための最初の統一標準を確立した。
参考スコア（独自算出の注目度）: 51.94680303125737
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Synthetic Aperture Radar (SAR) enables global, all-weather earth observation. However, owing to diverse imaging mechanisms, domain shifts across sensors and regions severely hinder its semantic generalization. To address this, we present CrossEarth-SAR, the first billion-scale SAR vision foundation model built upon a novel physics-guided sparse mixture-of-experts (MoE) architecture incorporating physical descriptors, explicitly designed for cross-domain semantic segmentation. To facilitate large-scale pre-training, we develop CrossEarth-SAR-200K, a weakly and fully supervised dataset that unifies public and private SAR imagery. We also introduce a benchmark suite comprising 22 sub-benchmarks across 8 distinct domain gaps, establishing the first unified standard for domain generalization semantic segmentation on SAR imagery. Extensive experiments demonstrate that CrossEarth-SAR achieves state-of-the-art results on 20 benchmarks, surpassing previous methods by over 10\% mIoU on some benchmarks under multi-gap transfer. All code, benchmark and datasets will be publicly available.
Abstract（参考訳）: 合成開口レーダ(SAR)は地球の全地球観測を可能にする。しかし、多様なイメージング機構のため、センサーや領域間のドメインシフトはセマンティックな一般化を著しく妨げている。そこで本研究では,物理記述子を組み込んだ新しい物理誘導型スパース・ミックス・オブ・エキスパート(MoE)アーキテクチャ上に構築された,最初の10億ドル規模のSARビジョン基盤モデルであるCrossEarth-SARを提案する。大規模な事前トレーニングを容易にするために,公共およびプライベートなSAR画像を統合する弱く完全に教師付きデータセットであるCrossEarth-SAR-200Kを開発した。また、8つの異なる領域ギャップをまたいだ22のサブベンチマークからなるベンチマークスイートを導入し、SAR画像上でのドメイン一般化セマンティックセマンティックセグメンテーションのための最初の統一標準を確立した。大規模な実験により、クロスアース-SARは20のベンチマークで最先端の結果を達成し、マルチギャップ転送中のベンチマークでは、以前の手法を10\% mIoU以上上回る結果となった。すべてのコード、ベンチマーク、データセットが公開される。

論文の概要: CrossEarth-SAR: A SAR-Centric and Billion-Scale Geospatial Foundation Model for Domain Generalizable Semantic Segmentation

関連論文リスト