OIMNet++: Prototypical Normalization and Localization-aware Learning for
Person Search
- URL: http://arxiv.org/abs/2207.10320v1
- Date: Thu, 21 Jul 2022 06:34:03 GMT
- Title: OIMNet++: Prototypical Normalization and Localization-aware Learning for
Person Search
- Authors: Sanghoon Lee, Youngmin Oh, Donghyeon Baek, Junghyup Lee, Bumsub Ham
- Abstract summary: We address the task of person search, that is, localizing and re-identifying query persons from a set of raw scene images.
Recent approaches are typically built upon OIMNet, a pioneer work on person search, that learns joint person representations for performing both detection and person re-identification tasks.
We introduce a novel normalization layer, dubbed ProtoNorm, that calibrates features from pedestrian proposals, while considering a long-tail distribution of person IDs.
- Score: 34.460973847554364
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We address the task of person search, that is, localizing and re-identifying
query persons from a set of raw scene images. Recent approaches are typically
built upon OIMNet, a pioneer work on person search, that learns joint person
representations for performing both detection and person re-identification
(reID) tasks. To obtain the representations, they extract features from
pedestrian proposals, and then project them on a unit hypersphere with L2
normalization. These methods also incorporate all positive proposals, that
sufficiently overlap with the ground truth, equally to learn person
representations for reID. We have found that 1) the L2 normalization without
considering feature distributions degenerates the discriminative power of
person representations, and 2) positive proposals often also depict background
clutter and person overlaps, which could encode noisy features to person
representations. In this paper, we introduce OIMNet++ that addresses the
aforementioned limitations. To this end, we introduce a novel normalization
layer, dubbed ProtoNorm, that calibrates features from pedestrian proposals,
while considering a long-tail distribution of person IDs, enabling L2
normalized person representations to be discriminative. We also propose a
localization-aware feature learning scheme that encourages better-aligned
proposals to contribute more in learning discriminative representations.
Experimental results and analysis on standard person search benchmarks
demonstrate the effectiveness of OIMNet++.
Related papers
- PSDiff: Diffusion Model for Person Search with Iterative and
Collaborative Refinement [59.6260680005195]
We present a novel Person Search framework based on the Diffusion model, PSDiff.
PSDiff formulates the person search as a dual denoising process from noisy boxes and ReID embeddings to ground truths.
Following the new paradigm, we further design a new Collaborative Denoising Layer (CDL) to optimize detection and ReID sub-tasks in an iterative and collaborative way.
arXiv Detail & Related papers (2023-09-20T08:16:39Z) - Body Part-Based Representation Learning for Occluded Person
Re-Identification [102.27216744301356]
Occluded person re-identification (ReID) is a person retrieval task which aims at matching occluded person images with holistic ones.
Part-based methods have been shown beneficial as they offer fine-grained information and are well suited to represent partially visible human bodies.
We propose BPBreID, a body part-based ReID model for solving the above issues.
arXiv Detail & Related papers (2022-11-07T16:48:41Z) - Global-Local Context Network for Person Search [125.51080862575326]
Person search aims to jointly localize and identify a query person from natural, uncropped images.
We exploit rich context information globally and locally surrounding the target person, which we refer to scene and group context, respectively.
We propose a unified global-local context network (GLCNet) with the intuitive aim of feature enhancement.
arXiv Detail & Related papers (2021-12-05T07:38:53Z) - Exploring Visual Context for Weakly Supervised Person Search [155.46727990750227]
Person search has recently emerged as a challenging task that jointly addresses pedestrian detection and person re-identification.
Existing approaches follow a fully supervised setting where both bounding box and identity annotations are available.
This paper inventively considers weakly supervised person search with only bounding box annotations.
arXiv Detail & Related papers (2021-06-19T14:47:13Z) - Diverse Knowledge Distillation for End-to-End Person Search [81.4926655119318]
Person search aims to localize and identify a specific person from a gallery of images.
Recent methods can be categorized into two groups, i.e., two-step and end-to-end approaches.
We propose a simple yet strong end-to-end network with diverse knowledge distillation to break the bottleneck.
arXiv Detail & Related papers (2020-12-21T09:04:27Z) - A Convolutional Baseline for Person Re-Identification Using Vision and
Language Descriptions [24.794592610444514]
In real-world surveillance scenarios, frequently no visual information will be available about the queried person.
A two stream deep convolutional neural network framework supervised by cross entropy loss is presented.
The learnt visual representations are more robust and perform 22% better during retrieval as compared to a single modality system.
arXiv Detail & Related papers (2020-02-20T10:12:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.