Fugu-MT 論文翻訳(概要): SinGeo: Unlock Single Model's Potential for Robust Cross-View Geo-Localization

論文の概要: SinGeo: Unlock Single Model's Potential for Robust Cross-View Geo-Localization

arxiv url: http://arxiv.org/abs/2603.09377v1
Date: Tue, 10 Mar 2026 08:51:52 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-11 15:25:24.177934
Title: SinGeo: Unlock Single Model's Potential for Robust Cross-View Geo-Localization
Title（参考訳）: SinGeo: 単一モデルのロバストなクロスビュージオローカライゼーションの可能性
Authors: Yang Chen, Xieyuanli Chen, Junxiang Li, Jie Tang, Tao Wu,
Abstract要約: SinGeoはシンプルだが強力なフレームワークであり、単一のモデルで堅牢なクロスビューなジオローカライゼーションを実現することができる。 SinGeoは、地上と衛星の両方のブランチにおけるビュー内識別性を向上する二重識別学習アーキテクチャを採用している。
参考スコア（独自算出の注目度）: 25.563713122044337
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Robust cross-view geo-localization (CVGL) remains challenging despite the surge in recent progress. Existing methods still rely on field-of-view (FoV)-specific training paradigms, where models are optimized under a fixed FoV but collapse when tested on unseen FoVs and unknown orientations. This limitation necessitates deploying multiple models to cover diverse variations. Although studies have explored dynamic FoV training by simply randomizing FoVs, they failed to achieve robustness across diverse conditions -- implicitly assuming all FoVs are equally difficult. To address this gap, we present SinGeo, a simple yet powerful framework that enables a single model to realize robust cross-view geo-localization without additional modules or explicit transformations. SinGeo employs a dual discriminative learning architecture that enhances intra-view discriminability within both ground and satellite branches, and is the first to introduce a curriculum learning strategy to achieve robust CVGL. Extensive evaluations on four benchmark datasets reveal that SinGeo sets state-of-the-art (SOTA) results under diverse conditions, and notably outperforms methods specifically trained for extreme FoVs. Beyond superior performance, SinGeo also exhibits cross-architecture transferability. Furthermore, we propose a consistency evaluation method to quantitatively assess model stability under varying views, providing an explainable perspective for understanding and advancing robustness in future CVGL research. Codes will be available upon acceptance.
Abstract（参考訳）: 近年の進歩にもかかわらず、ロバスト・クロスビュー・ジオローカライゼーション (CVGL) はいまだに困難である。既存の手法はまだフィールド・オブ・ビュー(FoV)固有の訓練パラダイムに依存しており、モデルは固定されたFoVの下で最適化されるが、未知のFoVや未知のオリエンテーションでテストすると崩壊する。この制限は、様々なバリエーションをカバーするために複数のモデルをデプロイする必要がある。研究は単純にFoVをランダム化することで動的FoVトレーニングを探求しているが、すべてのFoVが等しく難しいと暗黙的に仮定して、様々な条件で堅牢性を達成できなかった。このギャップに対処するため、SinGeoは単純な強力なフレームワークであり、単一のモデルでモジュールの追加や明示的な変換なしに、堅牢なクロスビューなジオローカライゼーションを実現することができる。 SinGeoは、地上と衛星の両方でビュー内識別性を向上する二重識別学習アーキテクチャを採用し、堅牢なCVGLを実現するためのカリキュラム学習戦略を最初に導入した。 4つのベンチマークデータセットの大規模な評価により、SinGeoは様々な条件下で最新技術(SOTA)結果をセットし、特に極端なFoVのために特別に訓練された方法よりも優れていることが判明した。優れたパフォーマンスに加えて、SinGeoはアーキテクチャ間の転送可能性も示す。さらに,様々な視点でモデル安定性を定量的に評価する一貫性評価手法を提案し,今後のCVGL研究におけるロバストネスの理解と向上のための説明可能な視点を提供する。コードは受理後利用可能。

論文の概要: SinGeo: Unlock Single Model's Potential for Robust Cross-View Geo-Localization

関連論文リスト