Fugu-MT 論文翻訳(概要): View-Aware Semantic Alignment for Aerial-Ground Person Re-Identification

論文の概要: View-Aware Semantic Alignment for Aerial-Ground Person Re-Identification

arxiv url: http://arxiv.org/abs/2605.18192v1
Date: Mon, 18 May 2026 10:32:26 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-19 17:57:49.399802
Title: View-Aware Semantic Alignment for Aerial-Ground Person Re-Identification
Title（参考訳）: 空中人物再同定のためのビューアウェア・セマンティックアライメント
Authors: Quan Zhang, Zeqiang Cai, Peiming Zhao, Jingze Wu, Cailun Wu, Hongbo Chen, Jianhuang Lai,
Abstract要約: ViSAは、ビュー間のセマンティック一貫性を実現するビュー対応フレームワークである。 ViSAは、挑戦的なCARGOクロスビュープロトコルにおいて、注目すべき10.06%のmAP改善とともに、一貫して優れたパフォーマンスを実現している。
参考スコア（独自算出の注目度）: 43.69772242567068
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Aerial-Ground Person Re-Identification (AGPReID) remains highly challenging due to drastic viewpoint variations between drones and fixed cameras. Existing methods typically follow a view-invariant paradigm, aligning shared features across views to achieve robustness. However, view-invariant inherently enforces part-level alignment, which ignores view-specific cues and discriminative identity information. To this end, this work proposes ViSA (View-aware Semantic Alignment), a view-aware framework that achieves cross-view semantic consistency containing an Expert-driven Token Generation Module (ETGM) and a Dual-branch Local Fusion Module (DLFM). Technically, the former constructs a set of view-aware experts to generate adaptive semantic queries that perceive viewpoint-specific patterns, while the latter leverages graph reasoning to extract and align local regions responsive to different experts. Extensive experiments on three AGPReID benchmarks including AG-ReID.v2, CARGO and LAGPeR demonstrate that ViSA consistently achieves superior performance, with a notable 10.06\% mAP improvement on the challenging CARGO cross-view protocol. The code is available at \href{https://github.com/Cat-Zero/ViSA}{https://github.com/Cat-Zero/ViSA}.
Abstract（参考訳）: AGPReID(Aerial-Ground Person Re-Identification, AGPReID)は、ドローンと固定カメラの劇的な視点の違いにより、依然として非常に困難である。既存のメソッドは通常、ビュー不変のパラダイムに従い、ビュー間で共有された機能を整列して堅牢性を達成する。しかし、ビュー不変性は本来、ビュー固有の手がかりや識別アイデンティティ情報を無視する部分レベルのアライメントを強制する。この目的のために、エキスパート駆動のトークン生成モジュール(ETGM)とデュアルブランチローカルフュージョンモジュール(DLFM)を含む、ビュー間のセマンティック一貫性を実現するビューアウェアフレームワークであるViSA(View-aware Semantic Alignment)を提案する。技術的には、前者はビューアウェアの専門家のセットを構築し、視点固有のパターンを知覚する適応的なセマンティッククエリを生成し、後者はグラフ推論を活用して、異なる専門家に応答するローカルリージョンを抽出し調整する。 AG-ReID.v2, CARGO, LAGPeR を含む3つの AGPReID ベンチマークの大規模な実験は、ViSA が常に優れた性能を発揮することを示した。コードは \href{https://github.com/Cat-Zero/ViSA}{https://github.com/Cat-Zero/ViSA} で公開されている。

論文の概要: View-Aware Semantic Alignment for Aerial-Ground Person Re-Identification

関連論文リスト