Gait Recognition with Mask-based Regularization
- URL: http://arxiv.org/abs/2203.04038v1
- Date: Tue, 8 Mar 2022 12:13:29 GMT
- Title: Gait Recognition with Mask-based Regularization
- Authors: Chuanfu Shen, Beibei Lin, Shunli Zhang, George Q. Huang, Shiqi Yu, Xin
Yu
- Abstract summary: We propose a novel mask-based regularization method named ReverseMask.
By injecting on the feature map, the proposed regularization method helps convolutional architecture learn the discriminative representations.
The plug-and-play Inception-like ReverseMask block is simple and effective to generalize networks.
- Score: 31.901166591272464
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Most gait recognition methods exploit spatial-temporal representations from
static appearances and dynamic walking patterns. However, we observe that many
part-based methods neglect representations at boundaries. In addition, the
phenomenon of overfitting on training data is relatively common in gait
recognition, which is perhaps due to insufficient data and low-informative gait
silhouettes. Motivated by these observations, we propose a novel mask-based
regularization method named ReverseMask. By injecting perturbation on the
feature map, the proposed regularization method helps convolutional
architecture learn the discriminative representations and enhances
generalization. Also, we design an Inception-like ReverseMask Block, which has
three branches composed of a global branch, a feature dropping branch, and a
feature scaling branch. Precisely, the dropping branch can extract fine-grained
representations when partial activations are zero-outed. Meanwhile, the scaling
branch randomly scales the feature map, keeping structural information of
activations and preventing overfitting. The plug-and-play Inception-like
ReverseMask block is simple and effective to generalize networks, and it also
improves the performance of many state-of-the-art methods. Extensive
experiments demonstrate that the ReverseMask regularization help baseline
achieves higher accuracy and better generalization. Moreover, the baseline with
Inception-like Block significantly outperforms state-of-the-art methods on the
two most popular datasets, CASIA-B and OUMVLP. The source code will be
released.
Related papers
- UGMAE: A Unified Framework for Graph Masked Autoencoders [67.75493040186859]
We propose UGMAE, a unified framework for graph masked autoencoders.
We first develop an adaptive feature mask generator to account for the unique significance of nodes.
We then design a ranking-based structure reconstruction objective joint with feature reconstruction to capture holistic graph information.
arXiv Detail & Related papers (2024-02-12T19:39:26Z) - Towards Compact 3D Representations via Point Feature Enhancement Masked
Autoencoders [52.66195794216989]
We propose Point Feature Enhancement Masked Autoencoders (Point-FEMAE) to learn compact 3D representations.
Point-FEMAE consists of a global branch and a local branch to capture latent semantic features.
Our method significantly improves the pre-training efficiency compared to cross-modal alternatives.
arXiv Detail & Related papers (2023-12-17T14:17:05Z) - High-fidelity Pseudo-labels for Boosting Weakly-Supervised Segmentation [17.804090651425955]
Image-level weakly-supervised segmentation (WSSS) reduces the usually vast data annotation cost by surrogate segmentation masks during training.
Our work is based on two techniques for improving CAMs; importance sampling, which is a substitute for GAP, and the feature similarity loss.
We reformulate both techniques based on binomial posteriors of multiple independent binary problems.
This has two benefits; their performance is improved and they become more general, resulting in an add-on method that can boost virtually any WSSS method.
arXiv Detail & Related papers (2023-04-05T17:43:57Z) - Dynamic Prototype Mask for Occluded Person Re-Identification [88.7782299372656]
Existing methods mainly address this issue by employing body clues provided by an extra network to distinguish the visible part.
We propose a novel Dynamic Prototype Mask (DPM) based on two self-evident prior knowledge.
Under this condition, the occluded representation could be well aligned in a selected subspace spontaneously.
arXiv Detail & Related papers (2022-07-19T03:31:13Z) - Self-Supervised Predictive Convolutional Attentive Block for Anomaly
Detection [97.93062818228015]
We propose to integrate the reconstruction-based functionality into a novel self-supervised predictive architectural building block.
Our block is equipped with a loss that minimizes the reconstruction error with respect to the masked area in the receptive field.
We demonstrate the generality of our block by integrating it into several state-of-the-art frameworks for anomaly detection on image and video.
arXiv Detail & Related papers (2021-11-17T13:30:31Z) - Inferring community characteristics in labelled networks [6.85316573653194]
We introduce a new generative model, the feature-first block model (FFBM)
We present a method to efficiently sample from the posterior distribution of the FFBM parameters.
The main advantages of the proposed approach are that the whole feature-space is used automatically, and features can be rank-ordered implicitly according to impact.
arXiv Detail & Related papers (2021-05-28T12:07:10Z) - Image Inpainting by End-to-End Cascaded Refinement with Mask Awareness [66.55719330810547]
Inpainting arbitrary missing regions is challenging because learning valid features for various masked regions is nontrivial.
We propose a novel mask-aware inpainting solution that learns multi-scale features for missing regions in the encoding phase.
Our framework is validated both quantitatively and qualitatively via extensive experiments on three public datasets.
arXiv Detail & Related papers (2021-04-28T13:17:47Z) - The surprising impact of mask-head architecture on novel class
segmentation [27.076315496682444]
We show that the architecture of the mask-head plays a surprisingly important role in generalization to classes for which we do not observe masks during training.
We also show that the choice of mask-head architecture alone can lead to SOTA results on the partially supervised COCO benchmark without the need of specialty modules or losses proposed by prior literature.
arXiv Detail & Related papers (2021-04-01T16:46:37Z) - Robust Person Re-Identification through Contextual Mutual Boosting [77.1976737965566]
We propose the Contextual Mutual Boosting Network (CMBN) to localize pedestrians.
It localizes pedestrians and recalibrates features by effectively exploiting contextual information and statistical inference.
Experiments on the benchmarks demonstrate the superiority of the architecture compared the state-of-the-art.
arXiv Detail & Related papers (2020-09-16T06:33:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.