DROP: Decouple Re-Identification and Human Parsing with Task-specific
Features for Occluded Person Re-identification
- URL: http://arxiv.org/abs/2401.18032v1
- Date: Wed, 31 Jan 2024 17:54:43 GMT
- Title: DROP: Decouple Re-Identification and Human Parsing with Task-specific
Features for Occluded Person Re-identification
- Authors: Shuguang Dou, Xiangyang Jiang, Yuanpeng Tu, Junyao Gao, Zefan Qu,
Qingsong Zhao, Cairong Zhao
- Abstract summary: The paper introduces the Decouple Re-identificatiOn and human Parsing (DROP) method for occluded person re-identification (ReID)
Unlike mainstream approaches using global features for simultaneous multi-task learning of ReID and human parsing, DROP argues that the inferior performance of the former is due to distinct requirements for ReID and human parsing features.
Experimental results highlight the efficacy of DROP, especially achieving a Rank-1 accuracy of 76.8% on Occluded-Duke, surpassing two mainstream methods.
- Score: 15.910080319118498
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The paper introduces the Decouple Re-identificatiOn and human Parsing (DROP)
method for occluded person re-identification (ReID). Unlike mainstream
approaches using global features for simultaneous multi-task learning of ReID
and human parsing, or relying on semantic information for attention guidance,
DROP argues that the inferior performance of the former is due to distinct
granularity requirements for ReID and human parsing features. ReID focuses on
instance part-level differences between pedestrian parts, while human parsing
centers on semantic spatial context, reflecting the internal structure of the
human body. To address this, DROP decouples features for ReID and human
parsing, proposing detail-preserving upsampling to combine varying resolution
feature maps. Parsing-specific features for human parsing are decoupled, and
human position information is exclusively added to the human parsing branch. In
the ReID branch, a part-aware compactness loss is introduced to enhance
instance-level part differences. Experimental results highlight the efficacy of
DROP, especially achieving a Rank-1 accuracy of 76.8% on Occluded-Duke,
surpassing two mainstream methods. The codebase is accessible at
https://github.com/shuguang-52/DROP.
Related papers
- Part-Attention Based Model Make Occluded Person Re-Identification Stronger [1.7648680700685022]
We introduce PAB-ReID, which is a novel ReID model incorporating part-attention mechanisms to tackle the issues effectively.
Firstly, we introduce the human parsing label to guide the generation of more accurate human part attention maps.
In addition, we propose a fine-grained feature focuser for generating fine-grained human local feature representations while suppressing background interference.
arXiv Detail & Related papers (2024-04-04T13:43:11Z) - AiOS: All-in-One-Stage Expressive Human Pose and Shape Estimation [55.179287851188036]
We introduce a novel all-in-one-stage framework, AiOS, for expressive human pose and shape recovery without an additional human detection step.
We first employ a human token to probe a human location in the image and encode global features for each instance.
Then, we introduce a joint-related token to probe the human joint in the image and encoder a fine-grained local feature.
arXiv Detail & Related papers (2024-03-26T17:59:23Z) - Integrating Human Parsing and Pose Network for Human Action Recognition [12.308394270240463]
We introduce human parsing feature map as a novel modality for action recognition.
We propose Integrating Human Parsing and Pose Network (IPP-Net) for action recognition.
IPP-Net is the first to leverage both skeletons and human parsing feature maps dualbranch approach.
arXiv Detail & Related papers (2023-07-16T07:58:29Z) - Body Part-Based Representation Learning for Occluded Person
Re-Identification [102.27216744301356]
Occluded person re-identification (ReID) is a person retrieval task which aims at matching occluded person images with holistic ones.
Part-based methods have been shown beneficial as they offer fine-grained information and are well suited to represent partially visible human bodies.
We propose BPBreID, a body part-based ReID model for solving the above issues.
arXiv Detail & Related papers (2022-11-07T16:48:41Z) - AIParsing: Anchor-free Instance-level Human Parsing [98.80740676794254]
We have designed an instance-level human parsing network which is anchor-free and solvable on a pixel level.
It consists of two simple sub-networks: an anchor-free detection head for bounding box predictions and an edge-guided parsing head for human segmentation.
Our method achieves the best global-level and instance-level performance over state-of-the-art one-stage top-down alternatives.
arXiv Detail & Related papers (2022-07-14T12:19:32Z) - Dual-Stream Reciprocal Disentanglement Learning for Domain Adaption
Person Re-Identification [44.80508095481811]
We propose a novel method named Dual-stream Reciprocal Disentanglement Learning (DRDL), which is quite efficient in learning domain-invariant features.
In DRDL, two encoders are first constructed for id-related and id-unrelated feature extractions, which are respectively measured by their associated classifiers.
Our proposed method is free from image generation, which not only reduces the computational complexity remarkably, but also removes redundant information from id-related features.
arXiv Detail & Related papers (2021-06-26T03:05:23Z) - Differentiable Multi-Granularity Human Representation Learning for
Instance-Aware Human Semantic Parsing [131.97475877877608]
A new bottom-up regime is proposed to learn category-level human semantic segmentation and multi-person pose estimation in a joint and end-to-end manner.
It is a compact, efficient and powerful framework that exploits structural information over different human granularities.
Experiments on three instance-aware human datasets show that our model outperforms other bottom-up alternatives with much more efficient inference.
arXiv Detail & Related papers (2021-03-08T06:55:00Z) - DRG: Dual Relation Graph for Human-Object Interaction Detection [65.50707710054141]
We tackle the challenging problem of human-object interaction (HOI) detection.
Existing methods either recognize the interaction of each human-object pair in isolation or perform joint inference based on complex appearance-based features.
In this paper, we leverage an abstract spatial-semantic representation to describe each human-object pair and aggregate the contextual information of the scene via a dual relation graph.
arXiv Detail & Related papers (2020-08-26T17:59:40Z) - Identity-Guided Human Semantic Parsing for Person Re-Identification [42.705908907250986]
We propose the identity-guided human semantic parsing approach (ISP) to locate both the human body parts and personal belongings at pixel-level for aligned person re-ID.
arXiv Detail & Related papers (2020-07-27T12:12:27Z) - Hierarchical Human Parsing with Typed Part-Relation Reasoning [179.64978033077222]
How to model human structures is the central theme in this task.
We seek to simultaneously exploit the representational capacity of deep graph networks and the hierarchical human structures.
arXiv Detail & Related papers (2020-03-10T16:45:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.