Fugu-MT 論文翻訳(概要): Bin~Wan,G2HFNet: GeoGran-Aware Hierarchical Feature Fusion Network for Salient Object Detection in Optical Remote Sensing Images

論文の概要: Bin~Wan,G2HFNet: GeoGran-Aware Hierarchical Feature Fusion Network for Salient Object Detection in Optical Remote Sensing Images

arxiv url: http://arxiv.org/abs/2603.12680v1
Date: Fri, 13 Mar 2026 05:52:00 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-16 17:38:11.927091
Title: Bin~Wan,G2HFNet: GeoGran-Aware Hierarchical Feature Fusion Network for Salient Object Detection in Optical Remote Sensing Images
Title（参考訳）: Bin~Wan,G2HFNet:光リモートセンシング画像における高次物体検出のためのGeoGran対応階層的特徴融合ネットワーク
Authors: Bin Wan, Runmin Cong, Xiaofei Zhou, Hao Fang, Chengtao Lv, Sam Kwong,
Abstract要約: 既存の手法では、一様注意機構を用いて複数の特徴を単一スケールで抽出するのが一般的である。光リモートセンシング画像における幾何的および粒度をフル活用するGeoGran-Aware grained Feature Fusion Network (G2HFNet)を提案する。
参考スコア（独自算出の注目度）: 76.1829298714382
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Remote sensing images captured from aerial perspectives often exhibit significant scale variations and complex backgrounds, posing challenges for salient object detection (SOD). Existing methods typically extract multi-level features at a single scale using uniform attention mechanisms, leading to suboptimal representations and incomplete detection results. To address these issues, we propose a GeoGran-Aware Hierarchical Feature Fusion Network (G2HFNet) that fully exploits geometric and granular cues in optical remote sensing images. Specifically, G2HFNet adopts Swin Transformer as the backbone to extract multi-level features and integrates three key modules: the multi-scale detail enhancement (MDE) module to handle object scale variations and enrich fine details, the dual-branch geo-gran complementary (DGC) module to jointly capture fine-grained details and positional information in mid-level features, and the deep semantic perception (DSP) module to refine high-level positional cues via self-attention. Additionally, a local-global guidance fusion (LGF) module is introduced to replace traditional convolutions for effective multi-level feature integration. Extensive experiments demonstrate that G2HFNet achieves high-quality saliency maps and significantly improves detection performance in challenging remote sensing scenarios.
Abstract（参考訳）: 空中から捉えたリモートセンシング画像は、しばしば大きなスケールのバリエーションと複雑な背景を示し、SOD(Salient Object Detection)の課題を提起する。既存の手法では、一様注意機構を用いて単一スケールで複数レベルの特徴を抽出し、最適下表現と不完全な検出結果をもたらすのが一般的である。これらの課題に対処するために,光学リモートセンシング画像の幾何学的および粒度をフル活用するGeoGran-Aware Hierarchical Feature Fusion Network (G2HFNet)を提案する。具体的には、G2HFNetはバックボーンとしてSwin Transformerを採用し、オブジェクトスケールの変動を処理し、細部を豊かにするMDE(Multi-scale detail enhancement)モジュール、中程度の機能で微細な詳細と位置情報を共同でキャプチャするDGC(Dual-branch geo-gran complementary)モジュール、自己注意によって高レベルの位置のキューを洗練するためのDSP(Deep semantic perception)モジュールの3つの主要なモジュールを統合している。さらに、ローカル・グローバル誘導融合(LGF)モジュールが導入され、従来の畳み込みを代替し、効果的なマルチレベル機能統合が実現された。大規模な実験により,G2HFNetは高品質なサリエンシマップを実現し,遠隔センシングシナリオにおける検出性能を大幅に向上することが示された。

論文の概要: Bin~Wan,G2HFNet: GeoGran-Aware Hierarchical Feature Fusion Network for Salient Object Detection in Optical Remote Sensing Images

関連論文リスト