Prototypical Contrast and Reverse Prediction: Unsupervised Skeleton
Based Action Recognition
- URL: http://arxiv.org/abs/2011.07236v1
- Date: Sat, 14 Nov 2020 08:04:23 GMT
- Title: Prototypical Contrast and Reverse Prediction: Unsupervised Skeleton
Based Action Recognition
- Authors: Shihao Xu, Haocong Rao, Xiping Hu, Bin Hu
- Abstract summary: We propose a novel framework named Prototypical Contrast and Reverse Prediction (PCRP)
PCRP creates reverse sequential prediction to learn low-level information and high-level pattern.
It also devises action prototypes to implicitly encode semantic similarity shared among sequences.
- Score: 12.463955174384457
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, we focus on unsupervised representation learning for
skeleton-based action recognition. Existing approaches usually learn action
representations by sequential prediction but they suffer from the inability to
fully learn semantic information. To address this limitation, we propose a
novel framework named Prototypical Contrast and Reverse Prediction (PCRP),
which not only creates reverse sequential prediction to learn low-level
information (e.g., body posture at every frame) and high-level pattern (e.g.,
motion order), but also devises action prototypes to implicitly encode semantic
similarity shared among sequences. In general, we regard action prototypes as
latent variables and formulate PCRP as an expectation-maximization task.
Specifically, PCRP iteratively runs (1) E-step as determining the distribution
of prototypes by clustering action encoding from the encoder, and (2) M-step as
optimizing the encoder by minimizing the proposed ProtoMAE loss, which helps
simultaneously pull the action encoding closer to its assigned prototype and
perform reverse prediction task. Extensive experiments on N-UCLA, NTU 60, and
NTU 120 dataset present that PCRP outperforms state-of-the-art unsupervised
methods and even achieves superior performance over some of supervised methods.
Codes are available at https://github.com/Mikexu007/PCRP.
Related papers
- Sparse Prototype Network for Explainable Pedestrian Behavior Prediction [60.80524827122901]
We present Sparse Prototype Network (SPN), an explainable method designed to simultaneously predict a pedestrian's future action, trajectory, and pose.
Regularized by mono-semanticity and clustering constraints, the prototypes learn consistent and human-understandable features.
arXiv Detail & Related papers (2024-10-16T03:33:40Z) - Multistep Inverse Is Not All You Need [87.62730694973696]
In real-world control settings, the observation space is often unnecessarily high-dimensional and subject to time-correlated noise.
It is therefore desirable to learn an encoder to map the observation space to a simpler space of control-relevant variables.
We propose a new algorithm, ACDF, which combines multistep-inverse prediction with a latent forward model.
arXiv Detail & Related papers (2024-03-18T16:36:01Z) - Refine, Discriminate and Align: Stealing Encoders via Sample-Wise Prototypes and Multi-Relational Extraction [57.16121098944589]
RDA is a pioneering approach designed to address two primary deficiencies prevalent in previous endeavors aiming at stealing pre-trained encoders.
It is accomplished via a sample-wise prototype, which consolidates the target encoder's representations for a given sample's various perspectives.
For more potent efficacy, we develop a multi-relational extraction loss that trains the surrogate encoder to Discriminate mismatched embedding-prototype pairs.
arXiv Detail & Related papers (2023-12-01T15:03:29Z) - Sub-action Prototype Learning for Point-level Weakly-supervised Temporal
Action Localization [11.777205793663647]
Point-level weakly-supervised temporal action localization (PWTAL) aims to localize actions with only a single timestamp annotation for each action instance.
Existing methods tend to mine dense pseudo labels to alleviate the label sparsity, but overlook the potential sub-action temporal structures, resulting in inferior performance.
We propose a novel sub-action prototype learning framework (SPL-Loc) which comprises Sub-action Prototype Clustering (SPC) and Ordered Prototype Alignment (OPA)
arXiv Detail & Related papers (2023-09-16T17:57:40Z) - Provably Efficient UCB-type Algorithms For Learning Predictive State
Representations [55.00359893021461]
The sequential decision-making problem is statistically learnable if it admits a low-rank structure modeled by predictive state representations (PSRs)
This paper proposes the first known UCB-type approach for PSRs, featuring a novel bonus term that upper bounds the total variation distance between the estimated and true models.
In contrast to existing approaches for PSRs, our UCB-type algorithms enjoy computational tractability, last-iterate guaranteed near-optimal policy, and guaranteed model accuracy.
arXiv Detail & Related papers (2023-07-01T18:35:21Z) - A Prototypical Semantic Decoupling Method via Joint Contrastive Learning
for Few-Shot Name Entity Recognition [24.916377682689955]
Few-shot named entity recognition (NER) aims at identifying named entities based on only few labeled instances.
We propose a Prototypical Semantic Decoupling method via joint Contrastive learning (PSDC) for few-shot NER.
Experimental results on two few-shot NER benchmarks demonstrate that PSDC consistently outperforms the previous SOTA methods in terms of overall performance.
arXiv Detail & Related papers (2023-02-27T09:20:00Z) - Few-Shot Segmentation via Rich Prototype Generation and Recurrent
Prediction Enhancement [12.614578133091168]
We propose a rich prototype generation module (RPGM) and a recurrent prediction enhancement module (RPEM) to reinforce the prototype learning paradigm.
RPGM combines superpixel and K-means clustering to generate rich prototype features with complementary scale relationships.
RPEM utilizes the recurrent mechanism to design a round-way propagation decoder.
arXiv Detail & Related papers (2022-10-03T08:46:52Z) - Concept-level Debugging of Part-Prototype Networks [27.700043183428807]
Part-prototype Networks (ProtoPNets) are concept-based classifiers designed to achieve the same performance as black-box models without compromising transparency.
We propose a concept-level debugger for ProtoPNets in which a human supervisor supplies feedback in the form of what part-prototypes must be forgotten or kept.
arXiv Detail & Related papers (2022-05-31T13:18:51Z) - Consistency Regularization for Deep Face Anti-Spoofing [69.70647782777051]
Face anti-spoofing (FAS) plays a crucial role in securing face recognition systems.
Motivated by this exciting observation, we conjecture that encouraging feature consistency of different views may be a promising way to boost FAS models.
We enhance both Embedding-level and Prediction-level Consistency Regularization (EPCR) in FAS.
arXiv Detail & Related papers (2021-11-24T08:03:48Z) - Prototypical Contrastive Learning of Unsupervised Representations [171.3046900127166]
Prototypical Contrastive Learning (PCL) is an unsupervised representation learning method.
PCL implicitly encodes semantic structures of the data into the learned embedding space.
PCL outperforms state-of-the-art instance-wise contrastive learning methods on multiple benchmarks.
arXiv Detail & Related papers (2020-05-11T09:53:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.