Fugu-MT 論文翻訳(概要): SFR-Net: Learning Scale-Frustum Representations for Ultra-Wide Area Remote Sensing Image Segmentation

論文の概要: SFR-Net: Learning Scale-Frustum Representations for Ultra-Wide Area Remote Sensing Image Segmentation

arxiv url: http://arxiv.org/abs/2605.25737v1
Date: Mon, 25 May 2026 11:45:50 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-26 19:50:19.84652
Title: SFR-Net: Learning Scale-Frustum Representations for Ultra-Wide Area Remote Sensing Image Segmentation
Title（参考訳）: SFR-Net:超広帯域リモートセンシング画像セグメンテーションのためのスケールフラストレーションの学習
Authors: Chuyu Zhong, Keyan Chen, Qinzhe Yang, Bowen Chen, Zhengxia Zou, Zhenwei Shi,
Abstract要約: 本稿では,UWA(Ultra-wide Area)リモートセンシング画像を対象とした新しいセグメンテーションタスクを提案する。本稿では,UWAセグメンテーションの課題に対処するため,SFR-Net(Scale-Frustum Representation Network)を提案する。 SFR-Netは最先端のパフォーマンスを実現し、mIoUを1.72%改善し、4.29%改善した。
参考スコア（独自算出の注目度）: 35.92745600073289
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Pixel count and geographical coverage are two key characteristics of remote sensing images. Existing remote sensing image segmentation methods typically focus on images with either a small pixel count or a large pixel count but limited geographical coverage. In this paper, we introduce a novel segmentation task targeting ultra-wide area (UWA) remote sensing images, characterized by both a large pixel count and extremely wide geographical coverage. The core challenges of UWA segmentation lie in simultaneously handling ground objects with significantly varying scales and maintaining long-range contextual semantic continuity. To address these challenges, we propose the Scale-Frustum Representation Network (SFR-Net). Inspired by the viewing frustums of remote sensing images captured from different altitudes, we construct scale-frustum representations, enabling unified modeling of ground objects and contextual features at different scales. Furthermore, we design a cascaded cross-scale fusion mechanism to effectively integrate these representations, enhancing local semantic understanding while ensuring long-range contextual continuity. Experimental results on GID and FBPS demonstrate that SFR-Net achieves state-of-the-art performance, improving mIoU by 1.72% and 4.29%, respectively, over the strongest competing methods. In addition, the proposed scale-frustum representations can be integrated into generic segmentation networks to improve both segmentation accuracy and convergence speed. The implementation code will be publicly available at https://github.com/ChuyuZhong/SFR-Net.
Abstract（参考訳）: リモートセンシング画像の2つの重要な特徴は、画素数と地理的カバレッジである。既存のリモートセンシング画像分割法は、通常、小さなピクセル数または大きなピクセル数を持つ画像に焦点を当てるが、地理的範囲は限られている。本稿では,UWA(Ultra-wide Area)リモートセンシング画像を対象とした新たなセグメンテーションタスクを提案する。 UWAセグメンテーションの中核となる課題は、異なるスケールのグラウンドオブジェクトを同時に扱うことと、長距離のコンテキストセマンティックな連続性を維持することである。これらの課題に対処するため,SFR-Net(Scale-Frustum Representation Network)を提案する。異なる高度から捉えたリモートセンシング画像のフラストムの観察に触発されて、スケール・フラストム表現を構築し、異なるスケールで地上オブジェクトとコンテキスト特徴の統一的なモデリングを可能にする。さらに,これらの表現を効果的に統合し,長期的文脈連続性を確保しつつ,局所的意味理解の強化を図るため,ケースケード型クロススケール融合機構を設計する。 GIDとFBPSの実験結果は、SFR-Netが最先端の性能を達成し、mIoUをそれぞれ1.72%改善し、4.29%改善したことを示している。さらに,提案するスケールフラストム表現を汎用セグメンテーションネットワークに統合することで,セグメンテーション精度と収束速度の両方を改善することができる。実装コードはhttps://github.com/ChuyuZhong/SFR-Netで公開される。

論文の概要: SFR-Net: Learning Scale-Frustum Representations for Ultra-Wide Area Remote Sensing Image Segmentation

関連論文リスト