PSTR: End-to-End One-Step Person Search With Transformers
- URL: http://arxiv.org/abs/2204.03340v1
- Date: Thu, 7 Apr 2022 10:22:33 GMT
- Title: PSTR: End-to-End One-Step Person Search With Transformers
- Authors: Jiale Cao and Yanwei Pang and Rao Muhammad Anwer and Hisham Cholakkal
and Jin Xie and Mubarak Shah and Fahad Shahbaz Khan
- Abstract summary: We propose a one-step transformer-based person search framework, PSTR.
PSS module contains a detection encoder-decoder for person detection along with a discriminative re-id decoder for person re-id.
On the challenging PRW benchmark, PSTR achieves a mean average precision (mAP) score of 56.5%.
- Score: 140.32813648935752
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We propose a novel one-step transformer-based person search framework, PSTR,
that jointly performs person detection and re-identification (re-id) in a
single architecture. PSTR comprises a person search-specialized (PSS) module
that contains a detection encoder-decoder for person detection along with a
discriminative re-id decoder for person re-id. The discriminative re-id decoder
utilizes a multi-level supervision scheme with a shared decoder for
discriminative re-id feature learning and also comprises a part attention block
to encode relationship between different parts of a person. We further
introduce a simple multi-scale scheme to support re-id across person instances
at different scales. PSTR jointly achieves the diverse objectives of
object-level recognition (detection) and instance-level matching (re-id). To
the best of our knowledge, we are the first to propose an end-to-end one-step
transformer-based person search framework. Experiments are performed on two
popular benchmarks: CUHK-SYSU and PRW. Our extensive ablations reveal the
merits of the proposed contributions. Further, the proposed PSTR sets a new
state-of-the-art on both benchmarks. On the challenging PRW benchmark, PSTR
achieves a mean average precision (mAP) score of 56.5%. The source code is
available at \url{https://github.com/JialeCao001/PSTR}.
Related papers
- PSDiff: Diffusion Model for Person Search with Iterative and
Collaborative Refinement [59.6260680005195]
We present a novel Person Search framework based on the Diffusion model, PSDiff.
PSDiff formulates the person search as a dual denoising process from noisy boxes and ReID embeddings to ground truths.
Following the new paradigm, we further design a new Collaborative Denoising Layer (CDL) to optimize detection and ReID sub-tasks in an iterative and collaborative way.
arXiv Detail & Related papers (2023-09-20T08:16:39Z) - LEAPS: End-to-End One-Step Person Search With Learnable Proposals [50.39493100627476]
We propose an end-to-end one-step person search approach with learnable proposals, named LEAPS.
Given a set of sparse and learnable proposals, LEAPS employs a dynamic person search head to directly perform person detection and corresponding re-id feature generation without non-maximum suppression post-processing.
arXiv Detail & Related papers (2023-03-21T13:59:32Z) - Plug and Play Active Learning for Object Detection [12.50247484568549]
We introduce Plug and Play Active Learning (PPAL) for object detection.
PPAL is a two-stage method comprising uncertainty-based and diversity-based sampling phases.
We benchmark PPAL on the MS-COCO and Pascal VOC datasets using different detector architectures.
arXiv Detail & Related papers (2022-11-21T16:13:23Z) - Sequential Transformer for End-to-End Person Search [4.920657401819193]
Person search aims to simultaneously localize and recognize a target person from realistic and uncropped gallery images.
In this paper, we propose a novel Sequential Transformer (SeqTR) for end-to-end person search to deal with this challenge.
Our SeqTR contains a detection transformer and a novel re-ID transformer that sequentially addresses detection and re-ID tasks.
arXiv Detail & Related papers (2022-11-06T09:32:30Z) - Diverse Part Discovery: Occluded Person Re-identification with
Part-Aware Transformer [95.02123369512384]
Occluded person re-identification (Re-ID) is a challenging task as persons are frequently occluded by various obstacles or other persons.
We propose a novel end-to-end Part-Aware Transformer (PAT) for occluded person Re-ID through diverse part discovery.
arXiv Detail & Related papers (2021-06-08T04:29:07Z) - Anchor-Free Person Search [127.88668724345195]
Person search aims to simultaneously localize and identify a query person from realistic, uncropped images.
Most existing works employ two-stage detectors like Faster-RCNN, yielding encouraging accuracy but with high computational overhead.
We present the Feature-Aligned Person Search Network (AlignPS), the first anchor-free framework to efficiently tackle this challenging task.
arXiv Detail & Related papers (2021-03-22T07:04:29Z) - Diverse Knowledge Distillation for End-to-End Person Search [81.4926655119318]
Person search aims to localize and identify a specific person from a gallery of images.
Recent methods can be categorized into two groups, i.e., two-step and end-to-end approaches.
We propose a simple yet strong end-to-end network with diverse knowledge distillation to break the bottleneck.
arXiv Detail & Related papers (2020-12-21T09:04:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.