Fugu-MT 論文翻訳(概要): GESS: Multi-cue Guided Local Feature Learning via Geometric and Semantic Synergy

論文の概要: GESS: Multi-cue Guided Local Feature Learning via Geometric and Semantic Synergy

arxiv url: http://arxiv.org/abs/2604.05359v1
Date: Tue, 07 Apr 2026 02:57:26 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-08 17:42:09.590997
Title: GESS: Multi-cue Guided Local Feature Learning via Geometric and Semantic Synergy
Title（参考訳）: GESS:Geometric and Semantic Synergyによるマルチキューローカル特徴学習
Authors: Yang Yi, Xieyuanli Chen, Jinpu Zhang, Hui Shen, Dewen Hu,
Abstract要約: 局所的な特徴の検出と記述はコンピュータビジョンの基本課題である。既存の手法は1つの外観の手がかりをモデリングに頼っており、不安定なキーポイントとディスクリプタ識別性に欠ける。本稿では,意味的および幾何学的手がかりを活用して,検出の堅牢性と記述者の識別性を高めるマルチキューガイド型局所特徴学習フレームワークを提案する。
参考スコア（独自算出の注目度）: 31.32050433924969
License: http://creativecommons.org/licenses/by-sa/4.0/
Abstract: Robust local feature detection and description are foundational tasks in computer vision. Existing methods primarily rely on single appearance cues for modeling, leading to unstable keypoints and insufficient descriptor discriminability. In this paper, we propose a multi-cue guided local feature learning framework that leverages semantic and geometric cues to synergistically enhance detection robustness and descriptor discriminability. Specifically, we construct a joint semantic-normal prediction head and a depth stability prediction head atop a lightweight backbone. The former leverages a shared 3D vector field to deeply couple semantic and normal cues, thereby resolving optimization interference from heterogeneous inconsistencies. The latter quantifies the reliability of local regions from a geometric consistency perspective, providing deterministic guidance for robust keypoint selection. Based on these predictions, we introduce the Semantic-Depth Aware Keypoint (SDAK) mechanism for feature detection. By coupling semantic reliability with depth stability, SDAK reweights keypoint responses to suppress spurious features in unreliable regions. For descriptor construction, we design a Unified Triple-Cue Fusion (UTCF) module, which employs a semantic-scheduled gating mechanism to adaptively inject multi-attribute features, improving descriptor discriminability. Extensive experiments on four benchmarks validate the effectiveness of the proposed framework. The source code and pre-trained model will be available at: https://github.com/yiyscut/GESS.git.
Abstract（参考訳）: 局所的な特徴の検出と記述はコンピュータビジョンの基本課題である。既存の手法は主にモデリングのための単一の外観の手がかりに依存しており、不安定なキーポイントとディスクリプタの識別性に欠ける。本稿では,意味的および幾何学的手がかりを利用して,検出の堅牢性と記述者の識別性を相乗的に向上するマルチキューガイド型局所特徴学習フレームワークを提案する。具体的には,軽量バックボーン上に,共同意味正規予測ヘッドと深度安定性予測ヘッドを構築する。前者は共有3次元ベクトル場を利用して意味論と通常の手がかりを深く結合し、不均一な不整合からの最適化干渉を解消する。後者は、幾何学的整合性の観点から局所領域の信頼性を定量化し、ロバストなキーポイント選択のための決定論的ガイダンスを提供する。これらの予測に基づいて,特徴検出のためのセマンティック・ディープス・アウェア・キーポイント(SDAK)機構を導入する。セマンティック信頼性と深度安定性を結合することにより、SDAKはキーポイント応答を重み付け、信頼できない領域の急激な特徴を抑制する。記述子構築のための一元三重項融合(UTCF)モジュールを設計し,多属性特徴を適応的に注入し,記述子識別性を向上させる。提案手法の有効性を4つのベンチマークで検証した。ソースコードと事前トレーニングされたモデルは、https://github.com/yiyscut/GESS.git.comで利用可能になる。

論文の概要: GESS: Multi-cue Guided Local Feature Learning via Geometric and Semantic Synergy

関連論文リスト