Fugu-MT Paper Translation (Abstract): GeoLink: A 3D-Aware Framework Towards Better Generalization in Cross-View Geo-Localization

Paper summary: GeoLink: A 3D-Aware Framework Towards Better Generalization in Cross-View Geo-Localization

arxiv url: http://arxiv.org/abs/2604.13183v1
Date: Tue, 14 Apr 2026 18:06:41 GMT
Status: Translation complete
Last updated in system: 2026-04-16 20:38:32.243832
Title: GeoLink: A 3D-Aware Framework Towards Better Generalization in Cross-View Geo-Localization
Title (reference translation): GeoLink: A 3D Framework Aiming at Generalization in Cross-View Geo-Localization
Authors: Hongyang Zhang, Yinhao Liu, Haitao Zhang, Zhongyi Wen, Shuxian Liang, Xiansheng Hua
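Abstract summary: Generalizable cross-view geo-localization aims to match the same location across views in unseen regions and conditions without GPS supervision. Existing methods mainly rely on 2D correspondence, but they are easily distracted by redundant shared information across views. We propose GeoLink, a 3D-aware semantic-consistent framework for generalizable cross-view geo-localization.
Reference score (independently calculated attention level): 32.57866918679771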
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Generalizable cross-view geo-localization aims to match the same location across views in unseen regions and conditions without GPS supervision. Its core difficulty lies in severe semantic inconsistency caused by viewpoint variation and poor generalization under domain shift. Existing methods mainly rely on 2D correspondence, but they are easily distracted by redundant shared information across views, leading to less transferable representations. To address this, we propose GeoLink, a 3D-aware semantic-consistent framework for generalizable cross-view geo-localization. Specifically, we reconstruct scene point clouds offline from multi-view drone images using VGGT, providing stable structural priors. Based on these 3D anchors, we improve 2D representation learning in two complementary ways. A Geometric-aware Semantic Refinement module mitigates potentially redundant and view-biased dependencies in 2D features under 3D guidance. In addition, a Unified View Relation Distillation module transfers 3D structural relations to 2D features, improving cross-view alignment while preserving a 2D-only inference pipeline. Extensive experiments on multiple benchmarks show that GeoLink consistently outperforms state-of-the-art methods and achieves superior generalization across unseen domains and diverse weather environments.
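Abstract (reference translation): Generalizable cross-view geo-localization aims to match the same location across views of unseen regions and conditions, without GPS supervision. Its core difficulty is the severe semantic inconsistency caused by viewpoint change and by insufficient generalization under domain shift. Existing methods rely mainly on 2D correspondence, but they are easily distracted by redundant information shared between views, which makes the learned representations less transferable. This paper therefore proposes GeoLink, a 3D-aware semantic-consistency framework for generalizable cross-view geo-localization. Specifically, scene point clouds are reconstructed offline from multi-view drone images using VGGT, providing stable structural priors. Building on these 3D anchors, 2D representation learning is improved in two complementary ways. The Geometric-aware Semantic Refinement module mitigates potentially redundant and view-biased dependencies in the 2D features under 3D guidance. In addition, the Unified View Relation Distillation module transfers 3D structural relations to the 2D features, improving cross-view alignment while keeping a 2D-only inference pipeline. Extensive experiments on multiple benchmarks show that GeoLink consistently outperforms state-of-the-art methods and achieves superior generalization across unseen domains and diverse weather environments.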
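The abstract says that 3D structural relations are transferred to 2D features during training while inference stays 2D-only, but it gives no loss formula. The following is a minimal sketch of one common way to do relation distillation, offered as an assumption and not as the authors' method: pairwise cosine similarities among 3D features act as a teacher, the same relations among 2D features act as a student, and the two are compared with a KL divergence. All function names, the temperature value, and the toy vectors are hypothetical.

```python
import math

def cosine_relations(features):
    """Pairwise cosine-similarity matrix for a list of feature vectors."""
    norms = [math.sqrt(sum(x * x for x in f)) or 1.0 for f in features]
    unit = [[x / n for x in f] for f, n in zip(features, norms)]
    return [[sum(a * b for a, b in zip(u, v)) for v in unit] for u in unit]

def row_softmax(matrix, temperature):
    """Turn each row of similarities into a probability distribution."""
    out = []
    for row in matrix:
        m = max(row)  # subtract the row maximum for numerical stability
        exps = [math.exp((x - m) / temperature) for x in row]
        total = sum(exps)
        out.append([e / total for e in exps])
    return out

def relation_distillation_loss(feats_2d, feats_3d, temperature=0.1):
    """Mean KL(teacher || student) over rows of the relation matrices.

    Assumption: the teacher is built from 3D features and the student from
    2D features. Only the N x N relation matrices are compared, so the 2D
    and 3D feature dimensions do not have to match.
    """
    teacher = row_softmax(cosine_relations(feats_3d), temperature)
    student = row_softmax(cosine_relations(feats_2d), temperature)
    kl = 0.0
    for t_row, s_row in zip(teacher, student):
        kl += sum(
            t * math.log(t / max(s, 1e-12))
            for t, s in zip(t_row, s_row)
            if t > 0
        )
    return kl / len(teacher)

# Toy usage: three samples whose 3D features say "0 and 1 are close, 2 is far".
feats_3d = [[1.0, 0.0, 0.0], [1.0, 0.1, 0.0], [0.0, 0.0, 1.0]]
aligned_2d = [[1.0, 0.0], [1.0, 0.1], [0.0, 1.0]]     # same neighbourhood structure
misaligned_2d = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.1]]  # samples 1 and 2 swapped

print(relation_distillation_loss(aligned_2d, feats_3d))
print(relation_distillation_loss(misaligned_2d, feats_3d))
```

Because the 3D features enter only through the training loss in this sketch, nothing 3D would be needed at inference time, which is consistent with the 2D-only inference pipeline described in the abstract.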

Related papers