Receptive Multi-granularity Representation for Person Re-Identification
- URL: http://arxiv.org/abs/2008.13450v1
- Date: Mon, 31 Aug 2020 09:26:08 GMT
- Title: Receptive Multi-granularity Representation for Person Re-Identification
- Authors: Guanshuo Wang, Yufeng Yuan, Jiwei Li, Shiming Ge, Xi Zhou
- Abstract summary: This paper proposes a receptive multi-granularity learning approach to facilitate stripe-based feature learning.
By two-branch network architecture, different scales of discriminative identity representation can be learned.
Our approach achieves a state-of-the-art accuracy of 96.2%@Rank-1 or 90.0%@mAP on the challenging Market-1501 benchmark.
- Score: 46.99913453669368
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: A key for person re-identification is achieving consistent local details for
discriminative representation across variable environments. Current
stripe-based feature learning approaches have delivered impressive accuracy,
but do not make a proper trade-off between diversity, locality, and robustness,
which easily suffers from part semantic inconsistency for the conflict between
rigid partition and misalignment. This paper proposes a receptive
multi-granularity learning approach to facilitate stripe-based feature
learning. This approach performs local partition on the intermediate
representations to operate receptive region ranges, rather than current
approaches on input images or output features, thus can enhance the
representation of locality while remaining proper local association. Toward
this end, the local partitions are adaptively pooled by using
significance-balanced activations for uniform stripes. Random shifting
augmentation is further introduced for a higher variance of person appearing
regions within bounding boxes to ease misalignment. By two-branch network
architecture, different scales of discriminative identity representation can be
learned. In this way, our model can provide a more comprehensive and efficient
feature representation without larger model storage costs. Extensive
experiments on intra-dataset and cross-dataset evaluations demonstrate the
effectiveness of the proposed approach. Especially, our approach achieves a
state-of-the-art accuracy of 96.2%@Rank-1 or 90.0%@mAP on the challenging
Market-1501 benchmark.
Related papers
- Robust Outlier Rejection for 3D Registration with Variational Bayes [70.98659381852787]
We develop a novel variational non-local network-based outlier rejection framework for robust alignment.
We propose a voting-based inlier searching strategy to cluster the high-quality hypothetical inliers for transformation estimation.
arXiv Detail & Related papers (2023-04-04T03:48:56Z) - Adaptive Local-Component-aware Graph Convolutional Network for One-shot
Skeleton-based Action Recognition [54.23513799338309]
We present an Adaptive Local-Component-aware Graph Convolutional Network for skeleton-based action recognition.
Our method provides a stronger representation than the global embedding and helps our model reach state-of-the-art.
arXiv Detail & Related papers (2022-09-21T02:33:07Z) - Distribution Regularized Self-Supervised Learning for Domain Adaptation
of Semantic Segmentation [3.284878354988896]
This paper proposes a pixel-level distribution regularization scheme (DRSL) for self-supervised domain adaptation of semantic segmentation.
In a typical setting, the classification loss forces the semantic segmentation model to greedily learn the representations that capture inter-class variations.
We capture pixel-level intra-class variations through class-aware multi-modal distribution learning.
arXiv Detail & Related papers (2022-06-20T09:52:49Z) - Feature Diversity Learning with Sample Dropout for Unsupervised Domain
Adaptive Person Re-identification [0.0]
This paper proposes a new approach to learn the feature representation with better generalization ability through limiting noisy pseudo labels.
We put forward a brand-new method referred as to Feature Diversity Learning (FDL) under the classic mutual-teaching architecture.
Experimental results show that our proposed FDL-SD achieves the state-of-the-art performance on multiple benchmark datasets.
arXiv Detail & Related papers (2022-01-25T10:10:48Z) - Towards Fewer Annotations: Active Learning via Region Impurity and
Prediction Uncertainty for Domain Adaptive Semantic Segmentation [19.55572909866489]
We propose a region-based active learning approach for semantic segmentation under a domain shift.
Our algorithm, Active Learning via Region Impurity and Prediction Uncertainty (AL-RIPU), introduces a novel acquisition strategy characterizing the spatial adjacency of image regions.
Our method only requires very few annotations to almost reach the supervised performance and substantially outperforms state-of-the-art methods.
arXiv Detail & Related papers (2021-11-25T06:40:58Z) - Coarse to Fine: Domain Adaptive Crowd Counting via Adversarial Scoring
Network [58.05473757538834]
This paper proposes a novel adversarial scoring network (ASNet) to bridge the gap across domains from coarse to fine granularity.
Three sets of migration experiments show that the proposed methods achieve state-of-the-art counting performance.
arXiv Detail & Related papers (2021-07-27T14:47:24Z) - Weakly supervised segmentation with cross-modality equivariant
constraints [7.757293476741071]
Weakly supervised learning has emerged as an appealing alternative to alleviate the need for large labeled datasets in semantic segmentation.
We present a novel learning strategy that leverages self-supervision in a multi-modal image scenario to significantly enhance original CAMs.
Our approach outperforms relevant recent literature under the same learning conditions.
arXiv Detail & Related papers (2021-04-06T13:14:20Z) - Margin Preserving Self-paced Contrastive Learning Towards Domain
Adaptation for Medical Image Segmentation [51.93711960601973]
We propose a novel margin preserving self-paced contrastive Learning model for cross-modal medical image segmentation.
With the guidance of progressively refined semantic prototypes, a novel margin preserving contrastive loss is proposed to boost the discriminability of embedded representation space.
Experiments on cross-modal cardiac segmentation tasks demonstrate that MPSCL significantly improves semantic segmentation performance.
arXiv Detail & Related papers (2021-03-15T15:23:10Z) - Learning Invariant Representations and Risks for Semi-supervised Domain
Adaptation [109.73983088432364]
We propose the first method that aims to simultaneously learn invariant representations and risks under the setting of semi-supervised domain adaptation (Semi-DA)
We introduce the LIRR algorithm for jointly textbfLearning textbfInvariant textbfRepresentations and textbfRisks.
arXiv Detail & Related papers (2020-10-09T15:42:35Z) - Adaptive Deep Metric Embeddings for Person Re-Identification under
Occlusions [17.911512103472727]
We propose a novel person ReID method, which learns the spatial dependencies between the local regions and extracts the discriminative feature representation of the pedestrian image based on Long Short-Term Memory (LSTM)
The proposed loss enables the deep neural network to adaptively learn discriminative metric embeddings, which significantly improve the capability of recognizing unseen person identities.
arXiv Detail & Related papers (2020-02-07T03:18:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.