Fugu-MT 論文翻訳(概要): Harnessing Weak Pair Uncertainty for Text-based Person Search

論文の概要: Harnessing Weak Pair Uncertainty for Text-based Person Search

arxiv url: http://arxiv.org/abs/2604.08877v1
Date: Fri, 10 Apr 2026 02:36:57 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-13 17:57:53.640914
Title: Harnessing Weak Pair Uncertainty for Text-based Person Search
Title（参考訳）: テキストに基づく人物検索におけるハーネスング弱視の不確かさ
Authors: Jintao Sun, Zhedong Zheng, Gangyi Ding,
Abstract要約: 本研究では,自然言語による興味ある人物を検索するテキストベースの人物検索について検討する。弱陽性をフル活用するために,画像とテキストのペアの不確かさを明示的に推定する不確実性認識手法を提案する。
参考スコア（独自算出の注目度）: 18.155384388834175
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: In this paper, we study the text-based person search, which is to retrieve the person of interest via natural language description. Prevailing methods usually focus on the strict one-to-one correspondence pair matching between the visual and textual modality, such as contrastive learning. However, such a paradigm unintentionally disregards the weak positive image-text pairs, which are of the same person but the text descriptions are annotated from different views (cameras). To take full use of weak positives, we introduce an uncertainty-aware method to explicitly estimate image-text pair uncertainty, and incorporate the uncertainty into the optimization procedure in a smooth manner. Specifically, our method contains two modules: uncertainty estimation and uncertainty regularization. (1) Uncertainty estimation is to obtain the relative confidence on the given positive pairs; (2) Based on the predicted uncertainty, we propose the uncertainty regularization to adaptively adjust loss weight. Additionally, we introduce a group-wise image-text matching loss to further facilitate the representation space among the weak pairs. Compared with existing methods, the proposed method explicitly prevents the model from pushing away potentially weak positive candidates. Extensive experiments on three widely-used datasets, .e.g, CUHK-PEDES, RSTPReid and ICFG-PEDES, verify the mAP improvement of our method against existing competitive methods +3.06%, +3.55% and +6.94%, respectively.
Abstract（参考訳）: 本稿では,自然言語記述による興味ある人物の検索を目的としたテキストベースの人物検索について検討する。一般的な手法は、対照的な学習のような視覚とテキストのモダリティ間の厳密な1対1の対応ペアにフォーカスする。しかし、このようなパラダイムは、同一人物である弱正のイメージテキストペアを意図せず無視するが、テキスト記述は異なる視点(カメラ)から注釈付けされる。弱陽性をフル活用するために、画像とテキストのペアの不確実性を明示的に推定し、不確実性を最適化手順にスムーズな方法で組み込む不確実性認識手法を導入する。具体的には,不確実性推定と不確実性正則化の2つのモジュールを含む。 1) 不確実性推定は,与えられた正の対に対する相対的な信頼を得るためのものであり,(2)予測された不確実性に基づいて,損失重量を適応的に調整する不確実性正則化を提案する。さらに、弱いペア間の表現空間をさらに促進するために、グループワイドな画像テキストマッチング損失を導入する。既存手法と比較して,提案手法はモデルが潜在的に弱い正の候補を退避させることを明示的に防止する。広く使われている3つのデータセットに対する大規模な実験。例えば、CUHK-PEDES、RSTPReid、ICFG-PEDESは、既存の競合手法+3.06%、+3.55%、+6.94%に対して、我々の手法のmAP改善を検証する。

論文の概要: Harnessing Weak Pair Uncertainty for Text-based Person Search

関連論文リスト