Spatial Cascaded Clustering and Weighted Memory for Unsupervised Person
Re-identification
- URL: http://arxiv.org/abs/2403.00261v1
- Date: Fri, 1 Mar 2024 03:52:29 GMT
- Title: Spatial Cascaded Clustering and Weighted Memory for Unsupervised Person
Re-identification
- Authors: Jiahao Hong, Jialong Zuo, Chuchu Han, Ruochen Zheng, Ming Tian,
Changxin Gao, Nong Sang
- Abstract summary: Unsupervised person re-identification (re-ID) methods achieve high performance by leveraging fine-grained local context.
Part-based methods obtain local contexts through horizontal division, which suffer from misalignment due to various human poses.
We introduce the Spatial Cascaded Clustering and Weighted Memory (SCWM) method to address these challenges.
SCWM aims to parse and align more accurate local contexts for different human body parts while allowing the memory module to balance hard example mining and noise suppression.
- Score: 32.95715593278961
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recent unsupervised person re-identification (re-ID) methods achieve high
performance by leveraging fine-grained local context. These methods are
referred to as part-based methods. However, most part-based methods obtain
local contexts through horizontal division, which suffer from misalignment due
to various human poses. Additionally, the misalignment of semantic information
in part features restricts the use of metric learning, thus affecting the
effectiveness of part-based methods. The two issues mentioned above result in
the under-utilization of part features in part-based methods. We introduce the
Spatial Cascaded Clustering and Weighted Memory (SCWM) method to address these
challenges. SCWM aims to parse and align more accurate local contexts for
different human body parts while allowing the memory module to balance hard
example mining and noise suppression. Specifically, we first analyze the
foreground omissions and spatial confusions issues in the previous method.
Then, we propose foreground and space corrections to enhance the completeness
and reasonableness of the human parsing results. Next, we introduce a weighted
memory and utilize two weighting strategies. These strategies address hard
sample mining for global features and enhance noise resistance for part
features, which enables better utilization of both global and part features.
Extensive experiments on Market-1501 and MSMT17 validate the proposed method's
effectiveness over many state-of-the-art methods.
Related papers
- Mixture-of-Noises Enhanced Forgery-Aware Predictor for Multi-Face Manipulation Detection and Localization [52.87635234206178]
This paper proposes a new framework, namely MoNFAP, specifically tailored for multi-face manipulation detection and localization.
The framework incorporates two novel modules: the Forgery-aware Unified Predictor (FUP) Module and the Mixture-of-Noises Module (MNM)
arXiv Detail & Related papers (2024-08-05T08:35:59Z) - Anti-Collapse Loss for Deep Metric Learning Based on Coding Rate Metric [99.19559537966538]
DML aims to learn a discriminative high-dimensional embedding space for downstream tasks like classification, clustering, and retrieval.
To maintain the structure of embedding space and avoid feature collapse, we propose a novel loss function called Anti-Collapse Loss.
Comprehensive experiments on benchmark datasets demonstrate that our proposed method outperforms existing state-of-the-art methods.
arXiv Detail & Related papers (2024-07-03T13:44:20Z) - Robust Zero-Shot Crowd Counting and Localization With Adaptive Resolution SAM [55.93697196726016]
We propose a simple yet effective crowd counting method by utilizing the Segment-Everything-Everywhere Model (SEEM)
We show that SEEM's performance in dense crowd scenes is limited, primarily due to the omission of many persons in high-density areas.
Our proposed method achieves the best unsupervised performance in crowd counting, while also being comparable to some supervised methods.
arXiv Detail & Related papers (2024-02-27T13:55:17Z) - Mitigating the Effect of Incidental Correlations on Part-based Learning [50.682498099720114]
Part-based representations could be more interpretable and generalize better with limited data.
We present two innovative regularization methods for part-based representations.
We exhibit state-of-the-art (SoTA) performance on few-shot learning tasks on benchmark datasets.
arXiv Detail & Related papers (2023-09-30T13:44:48Z) - Region Generation and Assessment Network for Occluded Person
Re-Identification [43.49129366128688]
Person Re-identification (ReID) plays a more and more crucial role in recent years with a wide range of applications.
Most methods tackle such challenges by utilizing external tools to locate body parts or exploiting matching strategies.
We propose a Region Generation and Assessment Network (RGANet) to effectively and efficiently detect the human body regions.
arXiv Detail & Related papers (2023-09-07T08:41:47Z) - A Global to Local Double Embedding Method for Multi-person Pose
Estimation [10.05687757555923]
We present a novel method to simplify the pipeline by implementing person detection and joints detection simultaneously.
We propose a Double Embedding (DE) method to complete the multi-person pose estimation task in a global-to-local way.
We achieve the competitive results on benchmarks MSCOCO, MPII and CrowdPose, demonstrating the effectiveness and generalization ability of our method.
arXiv Detail & Related papers (2021-02-15T03:13:38Z) - Attentional-Biased Stochastic Gradient Descent [74.49926199036481]
We present a provable method (named ABSGD) for addressing the data imbalance or label noise problem in deep learning.
Our method is a simple modification to momentum SGD where we assign an individual importance weight to each sample in the mini-batch.
ABSGD is flexible enough to combine with other robust losses without any additional cost.
arXiv Detail & Related papers (2020-12-13T03:41:52Z) - Parts-Based Articulated Object Localization in Clutter Using Belief
Propagation [6.813222130986094]
We present a generative-discriminative parts-based recognition and localization method for articulated objects in clutter.
We demonstrate the efficacy of our methods in a tabletop environment for recognizing and localizing hand tools in uncluttered and cluttered configurations.
arXiv Detail & Related papers (2020-08-06T21:34:52Z) - Learning to Manipulate Individual Objects in an Image [71.55005356240761]
We describe a method to train a generative model with latent factors that are independent and localized.
This means that perturbing the latent variables affects only local regions of the synthesized image, corresponding to objects.
Unlike other unsupervised generative models, ours enables object-centric manipulation, without requiring object-level annotations.
arXiv Detail & Related papers (2020-04-11T21:50:20Z) - A flexible outlier detector based on a topology given by graph
communities [0.0]
anomaly detection is essential for optimal performance of machine learning methods and statistical predictive models.
Topology is computed using the communities of a weighted graph codifying mutual nearest neighbors in the feature space.
Our approach overall outperforms, both, local and global strategies in multi and single view settings.
arXiv Detail & Related papers (2020-02-18T18:40:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.