Gait Recognition via Effective Global-Local Feature Representation and
Local Temporal Aggregation
- URL: http://arxiv.org/abs/2011.01461v2
- Date: Sat, 14 Aug 2021 07:53:05 GMT
- Title: Gait Recognition via Effective Global-Local Feature Representation and
Local Temporal Aggregation
- Authors: Beibei Lin, Shunli Zhang and Xin Yu
- Abstract summary: Gait recognition is one of the most important biometric technologies and has been applied in many fields.
Recent gait recognition frameworks represent each gait frame by descriptors extracted from either global appearances or local regions of humans.
We propose a novel feature extraction and fusion framework to achieve discriminative feature representations for gait recognition.
- Score: 28.721376937882958
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Gait recognition is one of the most important biometric technologies and has
been applied in many fields. Recent gait recognition frameworks represent each
gait frame by descriptors extracted from either global appearances or local
regions of humans. However, the representations based on global information
often neglect the details of the gait frame, while local region based
descriptors cannot capture the relations among neighboring regions, thus
reducing their discriminativeness. In this paper, we propose a novel feature
extraction and fusion framework to achieve discriminative feature
representations for gait recognition. Towards this goal, we take advantage of
both global visual information and local region details and develop a Global
and Local Feature Extractor (GLFE). Specifically, our GLFE module is composed
of our newly designed multiple global and local convolutional layers (GLConv)
to ensemble global and local features in a principle manner. Furthermore, we
present a novel operation, namely Local Temporal Aggregation (LTA), to further
preserve the spatial information by reducing the temporal resolution to obtain
higher spatial resolution. With the help of our GLFE and LTA, our method
significantly improves the discriminativeness of our visual features, thus
improving the gait recognition performance. Extensive experiments demonstrate
that our proposed method outperforms state-of-the-art gait recognition methods
on two popular datasets.
Related papers
- Swarm Intelligence in Geo-Localization: A Multi-Agent Large Vision-Language Model Collaborative Framework [51.26566634946208]
We introduce smileGeo, a novel visual geo-localization framework.
By inter-agent communication, smileGeo integrates the inherent knowledge of these agents with additional retrieved information.
Results show that our approach significantly outperforms current state-of-the-art methods.
arXiv Detail & Related papers (2024-08-21T03:31:30Z) - Other Tokens Matter: Exploring Global and Local Features of Vision Transformers for Object Re-Identification [63.147482497821166]
We first explore the influence of global and local features of ViT and then propose a novel Global-Local Transformer (GLTrans) for high-performance object Re-ID.
Our proposed method achieves superior performance on four object Re-ID benchmarks.
arXiv Detail & Related papers (2024-04-23T12:42:07Z) - LoDisc: Learning Global-Local Discriminative Features for
Self-Supervised Fine-Grained Visual Recognition [18.442966979622717]
We present to incorporate the subtle local fine-grained feature learning into global self-supervised contrastive learning.
A novel pretext task called Local Discrimination (LoDisc) is proposed to explicitly supervise self-supervised model's focus towards local pivotal regions.
We show that Local Discrimination pretext task can effectively enhance fine-grained clues in important local regions, and the global-local framework further refines the fine-grained feature representations of images.
arXiv Detail & Related papers (2024-03-06T21:36:38Z) - Global Meets Local: Effective Multi-Label Image Classification via
Category-Aware Weak Supervision [37.761378069277676]
This paper builds a unified framework to perform effective noisy-proposal suppression.
We develop a cross-granularity attention module to explore the complementary information between global and local features.
Our framework achieves superior performance over state-of-the-art methods.
arXiv Detail & Related papers (2022-11-23T05:39:17Z) - RING++: Roto-translation Invariant Gram for Global Localization on a
Sparse Scan Map [20.276334172402763]
We propose RING++ which has roto-translation invariant representation for place recognition, and global convergence for both rotation and translation estimation.
With the theoretical guarantee, RING++ is able to address the large viewpoint difference using a lightweight map with sparse scans.
This is the first learning-free framework to address all subtasks of global localization in the sparse scan map.
arXiv Detail & Related papers (2022-10-12T07:49:24Z) - Adaptive Local-Component-aware Graph Convolutional Network for One-shot
Skeleton-based Action Recognition [54.23513799338309]
We present an Adaptive Local-Component-aware Graph Convolutional Network for skeleton-based action recognition.
Our method provides a stronger representation than the global embedding and helps our model reach state-of-the-art.
arXiv Detail & Related papers (2022-09-21T02:33:07Z) - GaitGL: Learning Discriminative Global-Local Feature Representations for
Gait Recognition [29.128348655227416]
GaitGL is a global-local based gait recognition network.
GLCL is a dual-branch structure that consists of a GFR extractor and a mask-based LFR extractor.
Experiments demonstrate GaitGL achieves better performance than state-of-the-art gait recognition methods.
arXiv Detail & Related papers (2022-08-02T11:50:21Z) - Region-Based Semantic Factorization in GANs [67.90498535507106]
We present a highly efficient algorithm to factorize the latent semantics learned by Generative Adversarial Networks (GANs) concerning an arbitrary image region.
Through an appropriately defined generalized Rayleigh quotient, we solve such a problem without any annotations or training.
Experimental results on various state-of-the-art GAN models demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2022-02-19T17:46:02Z) - Video Salient Object Detection via Adaptive Local-Global Refinement [7.723369608197167]
Video salient object detection (VSOD) is an important task in many vision applications.
We propose an adaptive local-global refinement framework for VSOD.
We show that our weighting methodology can further exploit the feature correlations, thus driving the network to learn more discriminative feature representation.
arXiv Detail & Related papers (2021-04-29T14:14:11Z) - Watching You: Global-guided Reciprocal Learning for Video-based Person
Re-identification [82.6971648465279]
We propose a novel Global-guided Reciprocal Learning framework for video-based person Re-ID.
Our approach can achieve better performance than other state-of-the-art approaches.
arXiv Detail & Related papers (2021-03-07T12:27:42Z) - Adversarial Graph Representation Adaptation for Cross-Domain Facial
Expression Recognition [86.25926461936412]
We propose a novel Adrialversa Graph Representation Adaptation (AGRA) framework that unifies graph representation propagation with adversarial learning for cross-domain holistic-local feature co-adaptation.
We conduct extensive and fair experiments on several popular benchmarks and show that the proposed AGRA framework achieves superior performance over previous state-of-the-art methods.
arXiv Detail & Related papers (2020-08-03T13:27:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.