UPAR: Unified Pedestrian Attribute Recognition and Person Retrieval
- URL: http://arxiv.org/abs/2209.02522v1
- Date: Tue, 6 Sep 2022 14:20:56 GMT
- Title: UPAR: Unified Pedestrian Attribute Recognition and Person Retrieval
- Authors: Andreas Specker, Mickael Cormier, J\"urgen Beyerer
- Abstract summary: We present UPAR, the Unified Person Attribute Recognition dataset.
It is based on four well-known person attribute recognition datasets: PA100k, PETA, RAPv2, and Market1501.
We unify those datasets by providing 3,3M additional annotations to harmonize 40 important binary attributes over 12 attribute categories.
- Score: 4.6193503399184275
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recognizing soft-biometric pedestrian attributes is essential in video
surveillance and fashion retrieval. Recent works show promising results on
single datasets. Nevertheless, the generalization ability of these methods
under different attribute distributions, viewpoints, varying illumination, and
low resolutions remains rarely understood due to strong biases and varying
attributes in current datasets. To close this gap and support a systematic
investigation, we present UPAR, the Unified Person Attribute Recognition
Dataset. It is based on four well-known person attribute recognition datasets:
PA100K, PETA, RAPv2, and Market1501. We unify those datasets by providing 3,3M
additional annotations to harmonize 40 important binary attributes over 12
attribute categories across the datasets. We thus enable research on
generalizable pedestrian attribute recognition as well as attribute-based
person retrieval for the first time. Due to the vast variance of the image
distribution, pedestrian pose, scale, and occlusion, existing approaches are
greatly challenged both in terms of accuracy and efficiency. Furthermore, we
develop a strong baseline for PAR and attribute-based person retrieval based on
a thorough analysis of regularization methods. Our models achieve
state-of-the-art performance in cross-domain and specialization settings on
PA100k, PETA, RAPv2, Market1501-Attributes, and UPAR. We believe UPAR and our
strong baseline will contribute to the artificial intelligence community and
promote research on large-scale, generalizable attribute recognition systems.
Related papers
- Spatio-Temporal Side Tuning Pre-trained Foundation Models for Video-based Pedestrian Attribute Recognition [58.79807861739438]
Existing pedestrian recognition (PAR) algorithms are mainly developed based on a static image.
We propose to understand human attributes using video frames that can fully use temporal information.
arXiv Detail & Related papers (2024-04-27T14:43:32Z) - Prospector Heads: Generalized Feature Attribution for Large Models & Data [82.02696069543454]
We introduce prospector heads, an efficient and interpretable alternative to explanation-based attribution methods.
We demonstrate how prospector heads enable improved interpretation and discovery of class-specific patterns in input data.
arXiv Detail & Related papers (2024-02-18T23:01:28Z) - SequencePAR: Understanding Pedestrian Attributes via A Sequence
Generation Paradigm [18.53048511206039]
We propose a novel sequence generation paradigm for pedestrian attribute recognition, termed SequencePAR.
It extracts the pedestrian features using a pre-trained CLIP model and embeds the attribute set into query tokens under the guidance of text prompts.
The masked multi-head attention layer is introduced into the decoder module to prevent the model from remembering the next attribute while making attribute predictions during training.
arXiv Detail & Related papers (2023-12-04T05:42:56Z) - A Solution to Co-occurrence Bias: Attributes Disentanglement via Mutual
Information Minimization for Pedestrian Attribute Recognition [10.821982414387525]
We show that current methods can actually suffer in generalizing such fitted attributes interdependencies onto scenes or identities off the dataset distribution.
To render models robust in realistic scenes, we propose the attributes-disentangled feature learning to ensure the recognition of an attribute not inferring on the existence of others.
arXiv Detail & Related papers (2023-07-28T01:34:55Z) - TransFA: Transformer-based Representation for Face Attribute Evaluation [87.09529826340304]
We propose a novel textbftransformer-based representation for textbfattribute evaluation method (textbfTransFA)
The proposed TransFA achieves superior performances compared with state-of-the-art methods.
arXiv Detail & Related papers (2022-07-12T10:58:06Z) - Reinforced Pedestrian Attribute Recognition with Group Optimization
Reward [15.630702608104421]
Two key challenges in Pedestrian Attribute Recognition (PAR) are alignment relations between images and attributes, and imbalanced data distribution.
This paper addresses it as a decision-making task via a reinforcement learning framework.
We employ an agent to recognize each group of attributes, which is trained with Deep Q-learning algorithm.
arXiv Detail & Related papers (2022-05-21T03:38:03Z) - Spatial and Semantic Consistency Regularizations for Pedestrian
Attribute Recognition [50.932864767867365]
We propose a framework that consists of two complementary regularizations to achieve spatial and semantic consistency for each attribute.
Based on the precise attribute locations, we propose a semantic consistency regularization to extract intrinsic and discriminative semantic features.
Results show that the proposed method performs favorably against state-of-the-art methods without increasing parameters.
arXiv Detail & Related papers (2021-09-13T03:36:44Z) - Rethinking of Pedestrian Attribute Recognition: A Reliable Evaluation
under Zero-Shot Pedestrian Identity Setting [48.347987541336146]
We argue that it is time to step back and analyze the status quo of pedestrian attribute recognition.
We formally define and distinguish pedestrian attribute recognition from other similar tasks.
Experiments are conducted on four existing datasets and two proposed datasets to measure progress on pedestrian attribute recognition.
arXiv Detail & Related papers (2021-07-08T03:12:24Z) - Pedestrian Attribute Recognition in Video Surveillance Scenarios Based
on View-attribute Attention Localization [8.807717261983539]
We propose a novel view-attribute localization method based on attention (VALA)
A specific view-attribute is composed by the extracted attribute feature and four view scores which are predicted by view predictor as the confidences for attribute from different views.
Experiments on three wide datasets (RAP, RAPv2, PETA, and PA-100K) demonstrate the effectiveness of our approach compared with state-of-the-art methods.
arXiv Detail & Related papers (2021-06-11T16:09:31Z) - Deep Template Matching for Pedestrian Attribute Recognition with the
Auxiliary Supervision of Attribute-wise Keypoints [33.35677385823819]
Pedestrian Attribute Recognition (PAR) has aroused extensive attention due to its important role in video surveillance scenarios.
Recent works design complicated modules, e.g., attention mechanism and proposal of body parts to localize the attribute corresponding region.
These works prove that localization of attribute specific regions precisely will help in improving performance.
However, these part-information-based methods are still not accurate as well as increasing model complexity.
arXiv Detail & Related papers (2020-11-13T07:52:26Z) - Data Augmentation Imbalance For Imbalanced Attribute Classification [60.71438625139922]
We propose a new re-sampling algorithm called: data augmentation imbalance (DAI) to explicitly enhance the ability to discriminate the fewer attributes.
Our DAI algorithm achieves state-of-the-art results, based on pedestrian attribute datasets.
arXiv Detail & Related papers (2020-04-19T20:43:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.