Fugu-MT 論文翻訳(概要): What Makes You Unique? Attribute Prompt Composition for Object Re-Identification

論文の概要: What Makes You Unique? Attribute Prompt Composition for Object Re-Identification

arxiv url: http://arxiv.org/abs/2509.18715v1
Date: Tue, 23 Sep 2025 07:03:08 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-24 20:41:27.740174
Title: What Makes You Unique? Attribute Prompt Composition for Object Re-Identification
Title（参考訳）: ユニークなものは何か? 物体再同定のためのプロンプト構成
Authors: Yingquan Wang, Pingping Zhang, Chong Sun, Dong Wang, Huchuan Lu,
Abstract要約: Object Re-IDentificationは、重複しないカメラビューで個人を認識することを目的としている。単一ドメインモデルはドメイン固有の機能に過度に適合する傾向がありますが、クロスドメインモデルは多種多様な正規化戦略に依存します。本稿では,テキストのセマンティクスを利用して識別と一般化を協調的に強化する属性プロンプト合成フレームワークを提案する。
参考スコア（独自算出の注目度）: 70.67907354506278
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Object Re-IDentification (ReID) aims to recognize individuals across non-overlapping camera views. While recent advances have achieved remarkable progress, most existing models are constrained to either single-domain or cross-domain scenarios, limiting their real-world applicability. Single-domain models tend to overfit to domain-specific features, whereas cross-domain models often rely on diverse normalization strategies that may inadvertently suppress identity-specific discriminative cues. To address these limitations, we propose an Attribute Prompt Composition (APC) framework, which exploits textual semantics to jointly enhance discrimination and generalization. Specifically, we design an Attribute Prompt Generator (APG) consisting of a Semantic Attribute Dictionary (SAD) and a Prompt Composition Module (PCM). SAD is an over-complete attribute dictionary to provide rich semantic descriptions, while PCM adaptively composes relevant attributes from SAD to generate discriminative attribute-aware features. In addition, motivated by the strong generalization ability of Vision-Language Models (VLM), we propose a Fast-Slow Training Strategy (FSTS) to balance ReID-specific discrimination and generalizable representation learning. Specifically, FSTS adopts a Fast Update Stream (FUS) to rapidly acquire ReID-specific discriminative knowledge and a Slow Update Stream (SUS) to retain the generalizable knowledge inherited from the pre-trained VLM. Through a mutual interaction, the framework effectively focuses on ReID-relevant features while mitigating overfitting. Extensive experiments on both conventional and Domain Generalized (DG) ReID datasets demonstrate that our framework surpasses state-of-the-art methods, exhibiting superior performances in terms of both discrimination and generalization. The source code is available at https://github.com/AWangYQ/APC.
Abstract（参考訳）: オブジェクト再識別(ReID)は、重複しないカメラビューで個人を認識することを目的としている。最近の進歩は目覚ましい進歩を遂げているが、既存のモデルのほとんどは単一ドメインまたはクロスドメインのシナリオに制約されており、実際の適用範囲が制限されている。単一ドメインモデルはドメイン固有の特徴に過度に適合する傾向があり、一方、クロスドメインモデルは、ID固有の差別的手がかりを必然的に抑制する様々な正規化戦略に依存していることが多い。これらの制約に対処するために,テキスト意味論を利用して識別と一般化を協調的に強化する属性・プロンプト・コンポジション(APC)フレームワークを提案する。具体的には,Attribute Prompt Generator (APG) を,Semantic Attribute Dictionary (SAD) と Prompt Composition Module (PCM) で設計する。 SADは豊富な意味記述を提供するための過剰完全属性辞書であり、PCMはSADから関連属性を適応的に合成し、識別的属性認識機能を生成する。さらに、視覚言語モデル(VLM)の強力な一般化能力により、ReID固有の識別と一般化可能な表現学習のバランスをとるために、FSTS(Fast-Slow Training Strategy)を提案する。具体的には、FSTSはFast Update Stream(FUS)を採用して、ReID固有の識別知識を迅速に取得し、Slow Update Stream(SUS)を使用して、事前訓練されたVLMから受け継がれた一般化可能な知識を保持する。相互の相互作用を通じて、フレームワークは、オーバーフィッティングを緩和しながら、ReID関連機能に効果的にフォーカスする。従来のドメイン一般化(DG)とドメイン一般化(DG)の両方のReIDデータセットに対する大規模な実験により、我々のフレームワークは最先端の手法を超越し、差別と一般化の両面で優れた性能を示した。ソースコードはhttps://github.com/AWangYQ/APCで入手できる。

論文の概要: What Makes You Unique? Attribute Prompt Composition for Object Re-Identification

関連論文リスト