Fugu-MT 論文翻訳(概要): $A^2$GC: $A$symmetric $A$ggregation with Geometric Constraints for Locally Aggregated Descriptors

論文の概要: $A^2$GC: $A$symmetric $A$ggregation with Geometric Constraints for Locally Aggregated Descriptors

arxiv url: http://arxiv.org/abs/2511.14109v1
Date: Tue, 18 Nov 2025 03:47:48 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-19 16:23:52.918245
Title: $A^2$GC: $A$symmetric $A$ggregation with Geometric Constraints for Locally Aggregated Descriptors
Title（参考訳）: A^2$GC:$A$symmetric $A$ggregation with Geometric Constraints for Locally Aggregated Descriptors
Authors: Zhenyu Li, Tianyi Shang,
Abstract要約: 本稿では,局所的に集約された記述子に対する幾何学的制約を持つ非対称アグリゲーションVPR法($A2$GC-VPR)を提案する。視覚的位置認識における分布の相違に適応する非対称なマッチングを可能にする。
参考スコア（独自算出の注目度）: 6.392844932864485
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Visual Place Recognition (VPR) aims to match query images against a database using visual cues. State-of-the-art methods aggregate features from deep backbones to form global descriptors. Optimal transport-based aggregation methods reformulate feature-to-cluster assignment as a transport problem, but the standard Sinkhorn algorithm symmetrically treats source and target marginals, limiting effectiveness when image features and cluster centers exhibit substantially different distributions. We propose an asymmetric aggregation VPR method with geometric constraints for locally aggregated descriptors, called $A^2$GC-VPR. Our method employs row-column normalization averaging with separate marginal calibration, enabling asymmetric matching that adapts to distributional discrepancies in visual place recognition. Geometric constraints are incorporated through learnable coordinate embeddings, computing compatibility scores fused with feature similarities, thereby promoting spatially proximal features to the same cluster and enhancing spatial awareness. Experimental results on MSLS, NordLand, and Pittsburgh datasets demonstrate superior performance, validating the effectiveness of our approach in improving matching accuracy and robustness.
Abstract（参考訳）: ビジュアルプレース認識(VPR)は、クエリ画像をビジュアルキューを使用してデータベースとマッチングすることを目的としている。 State-of-the-artメソッドは、深いバックボーンから機能を集約してグローバルな記述子を形成する。しかし,Sinkhornアルゴリズムでは,画像特徴とクラスタ中心がほぼ異なる分布を示す場合の有効性を制限し,ソースとターゲットの辺縁を対称的に扱う。 A^2$GC-VPRと呼ばれる局所的に集約された記述子に対する幾何学的制約を持つ非対称アグリゲーションVPR法を提案する。視覚的位置認識における分布の相違に適応する非対称なマッチングを可能にする。幾何学的制約は、学習可能な座標埋め込みによって組み込まれ、特徴的類似性に融合した計算互換性スコアは、同じクラスタに空間的近位性を促進し、空間的認識を高める。 MSLS, NordLand, ピッツバーグのデータセットに対する実験結果から, マッチング精度とロバスト性を改善するためのアプローチの有効性が検証された。

論文の概要: $A^2$GC: $A$symmetric $A$ggregation with Geometric Constraints for Locally Aggregated Descriptors

関連論文リスト