Global-Local Dynamic Feature Alignment Network for Person
Re-Identification
- URL: http://arxiv.org/abs/2109.05759v1
- Date: Mon, 13 Sep 2021 07:53:36 GMT
- Title: Global-Local Dynamic Feature Alignment Network for Person
Re-Identification
- Authors: Zhangqiang Ming and Yong Yang and Xiaoyong Wei and Jianrong Yan and
Xiangkun Wang and Fengjie Wang and Min Zhu
- Abstract summary: We propose a simple and efficient Local Sliding Alignment (LSA) strategy to dynamically align the local features of two images by setting a sliding window on the local stripes of the pedestrian.
LSA can effectively suppress spatial misalignment and does not need to introduce extra supervision information.
We introduce LSA into the local branch of GLDFA-Net to guide the computation of distance metrics, which can further improve the accuracy of the testing phase.
- Score: 5.202841879001503
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The misalignment of human images caused by pedestrian detection bounding box
errors or partial occlusions is one of the main challenges in person
Re-Identification (Re-ID) tasks. Previous local-based methods mainly focus on
learning local features in predefined semantic regions of pedestrians, usually
use local hard alignment methods or introduce auxiliary information such as key
human pose points to match local features. These methods are often not
applicable when large scene differences are encountered. Targeting to solve
these problems, we propose a simple and efficient Local Sliding Alignment (LSA)
strategy to dynamically align the local features of two images by setting a
sliding window on the local stripes of the pedestrian. LSA can effectively
suppress spatial misalignment and does not need to introduce extra supervision
information. Then, we design a Global-Local Dynamic Feature Alignment Network
(GLDFA-Net) framework, which contains both global and local branches. We
introduce LSA into the local branch of GLDFA-Net to guide the computation of
distance metrics, which can further improve the accuracy of the testing phase.
Evaluation experiments on several mainstream evaluation datasets including
Market-1501, DukeMTMC-reID, and CUHK03 show that our method has competitive
accuracy over the several state-of-the-art person Re-ID methods. Additionally,
it achieves 86.1% mAP and 94.8% Rank-1 accuracy on Market1501.
Related papers
- GlocalCLIP: Object-agnostic Global-Local Prompt Learning for Zero-shot Anomaly Detection [5.530212768657544]
We introduce glocal contrastive learning to improve the learning of global and local prompts, effectively detecting abnormal patterns across various domains.
The generalization performance of GlocalCLIP in ZSAD was demonstrated on 15 real-world datasets from both the industrial and medical domains.
arXiv Detail & Related papers (2024-11-09T05:22:13Z) - From Global to Local: Multi-scale Out-of-distribution Detection [129.37607313927458]
Out-of-distribution (OOD) detection aims to detect "unknown" data whose labels have not been seen during the in-distribution (ID) training process.
Recent progress in representation learning gives rise to distance-based OOD detection.
We propose Multi-scale OOD DEtection (MODE), a first framework leveraging both global visual information and local region details.
arXiv Detail & Related papers (2023-08-20T11:56:25Z) - DETR Doesn't Need Multi-Scale or Locality Design [69.56292005230185]
This paper presents an improved DETR detector that maintains a "plain" nature.
It uses a single-scale feature map and global cross-attention calculations without specific locality constraints.
We show that two simple technologies are surprisingly effective within a plain design to compensate for the lack of multi-scale feature maps and locality constraints.
arXiv Detail & Related papers (2023-08-03T17:59:04Z) - Spatial-Aware Token for Weakly Supervised Object Localization [137.0570026552845]
We propose a task-specific spatial-aware token to condition localization in a weakly supervised manner.
Experiments show that the proposed SAT achieves state-of-the-art performance on both CUB-200 and ImageNet, with 98.45% and 73.13% GT-known Loc.
arXiv Detail & Related papers (2023-03-18T15:38:17Z) - Adaptive Local-Component-aware Graph Convolutional Network for One-shot
Skeleton-based Action Recognition [54.23513799338309]
We present an Adaptive Local-Component-aware Graph Convolutional Network for skeleton-based action recognition.
Our method provides a stronger representation than the global embedding and helps our model reach state-of-the-art.
arXiv Detail & Related papers (2022-09-21T02:33:07Z) - Object-Guided Day-Night Visual Localization in Urban Scenes [2.4493299476776778]
The proposed method first detects semantic objects and establishes correspondences of those objects between images.
Experiments on standard urban localization datasets show that OGuL significantly improves localization results with as simple local features as SIFT.
arXiv Detail & Related papers (2022-02-09T13:21:30Z) - PGL: Prior-Guided Local Self-supervised Learning for 3D Medical Image
Segmentation [87.50205728818601]
We propose a PriorGuided Local (PGL) self-supervised model that learns the region-wise local consistency in the latent feature space.
Our PGL model learns the distinctive representations of local regions, and hence is able to retain structural information.
arXiv Detail & Related papers (2020-11-25T11:03:11Z) - Inter-Image Communication for Weakly Supervised Localization [77.2171924626778]
Weakly supervised localization aims at finding target object regions using only image-level supervision.
We propose to leverage pixel-level similarities across different objects for learning more accurate object locations.
Our method achieves the Top-1 localization error rate of 45.17% on the ILSVRC validation set.
arXiv Detail & Related papers (2020-08-12T04:14:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.