Learning Diverse Features with Part-Level Resolution for Person
Re-Identification
- URL: http://arxiv.org/abs/2001.07442v1
- Date: Tue, 21 Jan 2020 11:01:56 GMT
- Title: Learning Diverse Features with Part-Level Resolution for Person
Re-Identification
- Authors: Ben Xie, Xiaofu Wu, Suofei Zhang, Shiliang Zhao, Ming Li
- Abstract summary: This paper proposes to construct a strong lightweight network architecture, termed PLR-OSNet.
It is based on the idea of Part-Level feature Resolution over the Omni-Scale Network (OSNet) for achieving feature diversity.
It achieves state-of-the-art performance on popular person Re-ID datasets, including Market1501, DukeMTMC-reID and CUHK03.
- Score: 10.940478376944133
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Learning diverse features is key to the success of person re-identification.
Various part-based methods have been extensively proposed for learning local
representations, which, however, are still inferior to the best-performing
methods for person re-identification. This paper proposes to construct a strong
lightweight network architecture, termed PLR-OSNet, based on the idea of
Part-Level feature Resolution over the Omni-Scale Network (OSNet) for achieving
feature diversity. The proposed PLR-OSNet has two branches, one branch for
global feature representation and the other branch for local feature
representation. The local branch employs a uniform partition strategy for
part-level feature resolution but produces only a single identity-prediction
loss, which is in sharp contrast to the existing part-based methods. Empirical
evidence demonstrates that the proposed PLR-OSNet achieves state-of-the-art
performance on popular person Re-ID datasets, including Market1501,
DukeMTMC-reID and CUHK03, despite its small model size.
Related papers
- A Refreshed Similarity-based Upsampler for Direct High-Ratio Feature Upsampling [54.05517338122698]
We propose an explicitly controllable query-key feature alignment from both semantic-aware and detail-aware perspectives.
We also develop a fine-grained neighbor selection strategy on HR features, which is simple yet effective for alleviating mosaic artifacts.
Our proposed ReSFU framework consistently achieves satisfactory performance on different segmentation applications.
arXiv Detail & Related papers (2024-07-02T14:12:21Z) - Learning Unified Representations for Multi-Resolution Face Recognition [0.7378853859331619]
Branch-to-Trunk network (BTNet) is a representation learning method for multi-resolution face recognition.
Our experiments demonstrate strong performance on face recognition benchmarks, both for multi-resolution identity matching and feature aggregation, with much less amount and parameter storage.
arXiv Detail & Related papers (2023-10-14T11:26:43Z) - A Task-aware Dual Similarity Network for Fine-grained Few-shot Learning [19.90385022248391]
Task-aware Dual Similarity Network( TDSNet) proposed to explore global invariant features and discriminative local details.
TDSNet achieves competitive performance by comparing with other state-of-the-art algorithms.
arXiv Detail & Related papers (2022-10-22T04:24:55Z) - Cross-modal Local Shortest Path and Global Enhancement for
Visible-Thermal Person Re-Identification [2.294635424666456]
We propose the Cross-modal Local Shortest Path and Global Enhancement (CM-LSP-GE) modules,a two-stream network based on joint learning of local and global features.
The experimental results on two typical datasets show that our model is obviously superior to the most state-of-the-art methods.
arXiv Detail & Related papers (2022-06-09T10:27:22Z) - LoGG3D-Net: Locally Guided Global Descriptor Learning for 3D Place
Recognition [31.105598103211825]
We show that an additional training signal (local consistency loss) can guide the network to learning local features which are consistent across revisits.
We formulate our approach in an end-to-end trainable architecture called LoGG3D-Net.
arXiv Detail & Related papers (2021-09-17T03:32:43Z) - Spatio-Temporal Representation Factorization for Video-based Person
Re-Identification [55.01276167336187]
We propose Spatio-Temporal Representation Factorization module (STRF) for re-ID.
STRF is a flexible new computational unit that can be used in conjunction with most existing 3D convolutional neural network architectures for re-ID.
We empirically show that STRF improves performance of various existing baseline architectures while demonstrating new state-of-the-art results.
arXiv Detail & Related papers (2021-07-25T19:29:37Z) - Conformer: Local Features Coupling Global Representations for Visual
Recognition [72.9550481476101]
We propose a hybrid network structure, termed Conformer, to take advantage of convolutional operations and self-attention mechanisms for enhanced representation learning.
Experiments show that Conformer, under the comparable parameter complexity, outperforms the visual transformer (DeiT-B) by 2.3% on ImageNet.
arXiv Detail & Related papers (2021-05-09T10:00:03Z) - Region Comparison Network for Interpretable Few-shot Image
Classification [97.97902360117368]
Few-shot image classification has been proposed to effectively use only a limited number of labeled examples to train models for new classes.
We propose a metric learning based method named Region Comparison Network (RCN), which is able to reveal how few-shot learning works.
We also present a new way to generalize the interpretability from the level of tasks to categories.
arXiv Detail & Related papers (2020-09-08T07:29:05Z) - ReMarNet: Conjoint Relation and Margin Learning for Small-Sample Image
Classification [49.87503122462432]
We introduce a novel neural network termed Relation-and-Margin learning Network (ReMarNet)
Our method assembles two networks of different backbones so as to learn the features that can perform excellently in both of the aforementioned two classification mechanisms.
Experiments on four image datasets demonstrate that our approach is effective in learning discriminative features from a small set of labeled samples.
arXiv Detail & Related papers (2020-06-27T13:50:20Z) - ResNeSt: Split-Attention Networks [86.25490825631763]
We present a modularized architecture, which applies the channel-wise attention on different network branches to leverage their success in capturing cross-feature interactions and learning diverse representations.
Our model, named ResNeSt, outperforms EfficientNet in accuracy and latency trade-off on image classification.
arXiv Detail & Related papers (2020-04-19T20:40:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.