PGGANet: Pose Guided Graph Attention Network for Person
Re-identification
- URL: http://arxiv.org/abs/2111.14411v1
- Date: Mon, 29 Nov 2021 09:47:39 GMT
- Title: PGGANet: Pose Guided Graph Attention Network for Person
Re-identification
- Authors: Zhijun He, Hongbo Zhao, Wenquan Feng
- Abstract summary: Person re-identification (ReID) aims at retrieving a person from images captured by different cameras.
It has been proved that using local features together with global feature of person image could help to give robust feature representations for person retrieval.
We propose a pose guided graph attention network, a multi-branch architecture consisting of one branch for global feature, one branch for mid-granular body features and one branch for fine-granular key point features.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Person re-identification (ReID) aims at retrieving a person from images
captured by different cameras. For deep-learning-based ReID methods, it has
been proved that using local features together with global feature of person
image could help to give robust feature representations for person retrieval.
Human pose information could provide the locations of human skeleton to
effectively guide the network to pay more attention on these key areas and
could also help to reduce the noise distractions from background or occlusions.
However, methods proposed by previous pose-related works might not be able to
fully exploit the benefits of pose information and did not take into
consideration the different contributions of different local features. In this
paper, we propose a pose guided graph attention network, a multi-branch
architecture consisting of one branch for global feature, one branch for
mid-granular body features and one branch for fine-granular key point features.
We use a pre-trained pose estimator to generate the key-point heatmap for local
feature learning and carefully design a graph attention convolution layer to
re-evaluate the contribution weights of extracted local features by modeling
the similarities relations. Experiments results demonstrate the effectiveness
of our approach on discriminative feature learning and we show that our model
achieves state-of-the-art performances on several mainstream evaluation
datasets. We also conduct a plenty of ablation studies and design different
kinds of comparison experiments for our network to prove its effectiveness and
robustness, including holistic datasets, partial datasets, occluded datasets
and cross-domain tests.
Related papers
- Pose-Aided Video-based Person Re-Identification via Recurrent Graph
Convolutional Network [41.861537712563816]
We propose to learn the discriminative pose feature beyond the appearance feature for video retrieval.
To learn the pose feature, we first detect the pedestrian pose in each frame through an off-the-shelf pose detector.
We then exploit a recurrent graph convolutional network (RGCN) to learn the node embeddings of the temporal pose graph.
arXiv Detail & Related papers (2022-09-23T13:20:33Z) - Local-Aware Global Attention Network for Person Re-Identification Based on Body and Hand Images [0.0]
We propose a compound approach for end-to-end discriminative deep feature learning for person Re-Id based on both body and hand images.
The proposed method consistently outperforms existing state-of-the-art methods.
arXiv Detail & Related papers (2022-09-11T09:43:42Z) - Contrastive Learning of Features between Images and LiDAR [18.211513930388417]
This work treats learning cross-modal features as a dense contrastive learning problem.
To learn good features and not lose generality, we developed a variant of widely used PointNet++ architecture for images.
We show that our models indeed learn information from both images as well as LiDAR by visualizing the features.
arXiv Detail & Related papers (2022-06-24T04:35:23Z) - LEAD: Self-Supervised Landmark Estimation by Aligning Distributions of
Feature Similarity [49.84167231111667]
Existing works in self-supervised landmark detection are based on learning dense (pixel-level) feature representations from an image.
We introduce an approach to enhance the learning of dense equivariant representations in a self-supervised fashion.
We show that having such a prior in the feature extractor helps in landmark detection, even under drastically limited number of annotations.
arXiv Detail & Related papers (2022-04-06T17:48:18Z) - Kinship Verification Based on Cross-Generation Feature Interaction
Learning [53.62256887837659]
Kinship verification from facial images has been recognized as an emerging yet challenging technique in computer vision applications.
We propose a novel cross-generation feature interaction learning (CFIL) framework for robust kinship verification.
arXiv Detail & Related papers (2021-09-07T01:50:50Z) - Graph-based Person Signature for Person Re-Identifications [17.181807593574764]
We propose a new method to effectively aggregate detailed person descriptions (attributes labels) and visual features (body parts and global features) into a graph.
The graph is integrated into a multi-branch multi-task framework for person re-identification.
Our approach achieves competitive results among the state of the art and outperforms other attribute-based or mask-guided methods.
arXiv Detail & Related papers (2021-04-14T10:54:36Z) - Joint Learning of Neural Transfer and Architecture Adaptation for Image
Recognition [77.95361323613147]
Current state-of-the-art visual recognition systems rely on pretraining a neural network on a large-scale dataset and finetuning the network weights on a smaller dataset.
In this work, we prove that dynamically adapting network architectures tailored for each domain task along with weight finetuning benefits in both efficiency and effectiveness.
Our method can be easily generalized to an unsupervised paradigm by replacing supernet training with self-supervised learning in the source domain tasks and performing linear evaluation in the downstream tasks.
arXiv Detail & Related papers (2021-03-31T08:15:17Z) - Self-supervised Segmentation via Background Inpainting [96.10971980098196]
We introduce a self-supervised detection and segmentation approach that can work with single images captured by a potentially moving camera.
We exploit a self-supervised loss function that we exploit to train a proposal-based segmentation network.
We apply our method to human detection and segmentation in images that visually depart from those of standard benchmarks and outperform existing self-supervised methods.
arXiv Detail & Related papers (2020-11-11T08:34:40Z) - Ventral-Dorsal Neural Networks: Object Detection via Selective Attention [51.79577908317031]
We propose a new framework called Ventral-Dorsal Networks (VDNets)
Inspired by the structure of the human visual system, we propose the integration of a "Ventral Network" and a "Dorsal Network"
Our experimental results reveal that the proposed method outperforms state-of-the-art object detection approaches.
arXiv Detail & Related papers (2020-05-15T23:57:36Z) - High-Order Information Matters: Learning Relation and Topology for
Occluded Person Re-Identification [84.43394420267794]
We propose a novel framework by learning high-order relation and topology information for discriminative features and robust alignment.
Our framework significantly outperforms state-of-the-art by6.5%mAP scores on Occluded-Duke dataset.
arXiv Detail & Related papers (2020-03-18T12:18:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.