Dynamic Template Initialization for Part-Aware Person Re-ID
- URL: http://arxiv.org/abs/2208.11440v1
- Date: Wed, 24 Aug 2022 11:20:48 GMT
- Title: Dynamic Template Initialization for Part-Aware Person Re-ID
- Authors: Kalana Abeywardena, Shechem Sumanthiran, Sanoojan Baliah, Nadarasar
Bahavan, Nalith Udugampola, Ajith Pasqual, Chamira Edussooriya and Ranga
Rodrigo
- Abstract summary: spatial attention-based Dynamic Part template Initialization module.
Part-level features of the backbone are used to extract the templates of diverse human body parts.
Tests on holistic, occluded, and partial Re-ID task benchmarks.
- Score: 0.640781528166787
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Many of the existing Person Re-identification (Re-ID) approaches depend on
feature maps which are either partitioned to localize parts of a person or
reduced to create a global representation. While part localization has shown
significant success, it uses either na{\i}ve position-based partitions or
static feature templates. These, however, hypothesize the pre-existence of the
parts in a given image or their positions, ignoring the input image-specific
information which limits their usability in challenging scenarios such as Re-ID
with partial occlusions and partial probe images. In this paper, we introduce a
spatial attention-based Dynamic Part Template Initialization module that
dynamically generates part-templates using mid-level semantic features at the
earlier layers of the backbone. Following a self-attention layer, human
part-level features of the backbone are used to extract the templates of
diverse human body parts using a simplified cross-attention scheme which will
then be used to identify and collate representations of various human parts
from semantically rich features, increasing the discriminative ability of the
entire model. We further explore adaptive weighting of part descriptors to
quantify the absence or occlusion of local attributes and suppress the
contribution of the corresponding part descriptors to the matching criteria.
Extensive experiments on holistic, occluded, and partial Re-ID task benchmarks
demonstrate that our proposed architecture is able to achieve competitive
performance. Codes will be included in the supplementary material and will be
made publicly available.
Related papers
- Multi-Stage Spatio-Temporal Aggregation Transformer for Video Person
Re-identification [78.08536797239893]
We propose a novel Multi-Stage Spatial-Temporal Aggregation Transformer (MSTAT) with two novel designed proxy embedding modules.
MSTAT consists of three stages to encode the attribute-associated, the identity-associated, and the attribute-identity-associated information from the video clips.
We show that MSTAT can achieve state-of-the-art accuracies on various standard benchmarks.
arXiv Detail & Related papers (2023-01-02T05:17:31Z) - DiP: Learning Discriminative Implicit Parts for Person Re-Identification [18.89539401924382]
We propose to learn Discriminative implicit Parts (DiPs) which are decoupled from explicit body parts.
We also propose a novel implicit position to give a geometric interpretation for each DiP.
The proposed method achieves state-of-the-art performance on multiple person ReID benchmarks.
arXiv Detail & Related papers (2022-12-24T17:59:01Z) - Framework-agnostic Semantically-aware Global Reasoning for Segmentation [29.69187816377079]
We propose a component that learns to project image features into latent representations and reason between them.
Our design encourages the latent regions to represent semantic concepts by ensuring that the activated regions are spatially disjoint.
Our latent tokens are semantically interpretable and diverse and provide a rich set of features that can be transferred to downstream tasks.
arXiv Detail & Related papers (2022-12-06T21:42:05Z) - Body Part-Based Representation Learning for Occluded Person
Re-Identification [102.27216744301356]
Occluded person re-identification (ReID) is a person retrieval task which aims at matching occluded person images with holistic ones.
Part-based methods have been shown beneficial as they offer fine-grained information and are well suited to represent partially visible human bodies.
We propose BPBreID, a body part-based ReID model for solving the above issues.
arXiv Detail & Related papers (2022-11-07T16:48:41Z) - Part-aware Prototypical Graph Network for One-shot Skeleton-based Action
Recognition [57.86960990337986]
One-shot skeleton-based action recognition poses unique challenges in learning transferable representation from base classes to novel classes.
We propose a part-aware prototypical representation for one-shot skeleton-based action recognition.
We demonstrate the effectiveness of our method on two public skeleton-based action recognition datasets.
arXiv Detail & Related papers (2022-08-19T04:54:56Z) - Dynamic Prototype Mask for Occluded Person Re-Identification [88.7782299372656]
Existing methods mainly address this issue by employing body clues provided by an extra network to distinguish the visible part.
We propose a novel Dynamic Prototype Mask (DPM) based on two self-evident prior knowledge.
Under this condition, the occluded representation could be well aligned in a selected subspace spontaneously.
arXiv Detail & Related papers (2022-07-19T03:31:13Z) - Fine-grained Few-shot Recognition by Deep Object Parsing [43.61794876834115]
We parse a test instance by inferring the K parts, where each part occupies a distinct location in the feature space.
We recognize test instances by comparing its active templates and the relative geometry of its part locations.
arXiv Detail & Related papers (2022-07-14T17:59:05Z) - Fine-Grained Object Classification via Self-Supervised Pose Alignment [42.55938966190932]
We learn a novel graph based object representation to reveal a global configuration of local parts for self-supervised pose alignment across classes.
We evaluate our method on three popular fine-grained object classification benchmarks, consistently achieving the state-of-the-art performance.
arXiv Detail & Related papers (2022-03-30T01:46:19Z) - Calibrated Feature Decomposition for Generalizable Person
Re-Identification [82.64133819313186]
Calibrated Feature Decomposition (CFD) module focuses on improving the generalization capacity for person re-identification.
A calibrated-and-standardized Batch normalization (CSBN) is designed to learn calibrated person representation.
arXiv Detail & Related papers (2021-11-27T17:12:43Z) - Part-aware Prototype Network for Few-shot Semantic Segmentation [50.581647306020095]
We propose a novel few-shot semantic segmentation framework based on the prototype representation.
Our key idea is to decompose the holistic class representation into a set of part-aware prototypes.
We develop a novel graph neural network model to generate and enhance the proposed part-aware prototypes.
arXiv Detail & Related papers (2020-07-13T11:03:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.