LEAPS: End-to-End One-Step Person Search With Learnable Proposals
- URL: http://arxiv.org/abs/2303.11859v1
- Date: Tue, 21 Mar 2023 13:59:32 GMT
- Title: LEAPS: End-to-End One-Step Person Search With Learnable Proposals
- Authors: Zhiqiang Dong, Jiale Cao, Rao Muhammad Anwer, Jin Xie, Fahad Khan,
Yanwei Pang
- Abstract summary: We propose an end-to-end one-step person search approach with learnable proposals, named LEAPS.
Given a set of sparse and learnable proposals, LEAPS employs a dynamic person search head to directly perform person detection and corresponding re-id feature generation without non-maximum suppression post-processing.
- Score: 50.39493100627476
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We propose an end-to-end one-step person search approach with learnable
proposals, named LEAPS. Given a set of sparse and learnable proposals, LEAPS
employs a dynamic person search head to directly perform person detection and
corresponding re-id feature generation without non-maximum suppression
post-processing. The dynamic person search head comprises a detection head and
a novel flexible re-id head. Our flexible re-id head first employs a dynamic
region-of-interest (RoI) operation to extract discriminative RoI features of
the proposals. Then, it generates re-id features using a plain and a
hierarchical interaction re-id module. To better guide discriminative re-id
feature learning, we introduce a diverse re-id sample matching strategy,
instead of bipartite matching in detection head. Comprehensive experiments
reveal the benefit of the proposed LEAPS, achieving a favorable performance on
two public person search benchmarks: CUHK-SYSU and PRW. When using the same
ResNet50 backbone, our LEAPS obtains a mAP score of 55.0%, outperforming the
best reported results in literature by 1.7%, while achieving around a two-fold
speedup on the challenging PRW dataset. Our source code and models will be
released.
Related papers
- Learning to Rank for Multiple Retrieval-Augmented Models through Iterative Utility Maximization [21.115495457454365]
This paper investigates the design of a unified search engine to serve multiple retrieval-augmented generation (RAG) agents.
We introduce an iterative approach where the search engine generates retrieval results for these RAG agents and gathers feedback on the quality of the retrieved documents during an offline phase.
We adapt this approach to an online setting, allowing the search engine to refine its behavior based on real-time individual agents feedback.
arXiv Detail & Related papers (2024-10-13T17:53:50Z) - Contrastive Transformer Learning with Proximity Data Generation for
Text-Based Person Search [60.626459715780605]
Given a descriptive text query, text-based person search aims to retrieve the best-matched target person from an image gallery.
Such a cross-modal retrieval task is quite challenging due to significant modality gap, fine-grained differences and insufficiency of annotated data.
In this paper, we propose a simple yet effective dual Transformer model for text-based person search.
arXiv Detail & Related papers (2023-11-15T16:26:49Z) - PSDiff: Diffusion Model for Person Search with Iterative and
Collaborative Refinement [59.6260680005195]
We present a novel Person Search framework based on the Diffusion model, PSDiff.
PSDiff formulates the person search as a dual denoising process from noisy boxes and ReID embeddings to ground truths.
Following the new paradigm, we further design a new Collaborative Denoising Layer (CDL) to optimize detection and ReID sub-tasks in an iterative and collaborative way.
arXiv Detail & Related papers (2023-09-20T08:16:39Z) - PSTR: End-to-End One-Step Person Search With Transformers [140.32813648935752]
We propose a one-step transformer-based person search framework, PSTR.
PSS module contains a detection encoder-decoder for person detection along with a discriminative re-id decoder for person re-id.
On the challenging PRW benchmark, PSTR achieves a mean average precision (mAP) score of 56.5%.
arXiv Detail & Related papers (2022-04-07T10:22:33Z) - Making Person Search Enjoy the Merits of Person Re-identification [12.311100923753449]
We propose a faster and stronger one-step person search framework, the Teacher-guided Disentangling Networks (TDN)
The proposed TDN can significantly boost the person search performance by transferring the advanced person Re-ID knowledge to the person search model.
We also propose a Knowledge Transfer Bridge module to bridge the scale gap caused by different input formats between the Re-ID model and one-step person search model.
arXiv Detail & Related papers (2021-08-24T06:00:13Z) - Diverse Knowledge Distillation for End-to-End Person Search [81.4926655119318]
Person search aims to localize and identify a specific person from a gallery of images.
Recent methods can be categorized into two groups, i.e., two-step and end-to-end approaches.
We propose a simple yet strong end-to-end network with diverse knowledge distillation to break the bottleneck.
arXiv Detail & Related papers (2020-12-21T09:04:27Z) - Robust Partial Matching for Person Search in the Wild [70.6661871706788]
This paper proposes an Align-to-Part Network (APNet) for person detection and re-Identification (reID)
APNet refines detected bounding boxes to cover the estimated holistic body regions.
It achieves competitive performance on existing person search benchmarks like CUHK-SYSU and PRW.
arXiv Detail & Related papers (2020-04-20T14:21:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.