Iterate & Cluster: Iterative Semi-Supervised Action Recognition
- URL: http://arxiv.org/abs/2006.06911v1
- Date: Fri, 12 Jun 2020 02:19:39 GMT
- Title: Iterate & Cluster: Iterative Semi-Supervised Action Recognition
- Authors: Jingyuan Li, Eli Shlizerman
- Abstract summary: Given time sequences of features tracked during movements our system clusters the sequences into actions.
We show that our system can boost recognition performance with only a small percentage of annotations.
- Score: 8.134961550216618
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We propose a novel system for active semi-supervised feature-based action
recognition. Given time sequences of features tracked during movements our
system clusters the sequences into actions. Our system is based on
encoder-decoder unsupervised methods shown to perform clustering by
self-organization of their latent representation through the auto-regression
task. These methods were tested on human action recognition benchmarks and
outperformed non-feature based unsupervised methods and achieved comparable
accuracy to skeleton-based supervised methods. However, such methods rely on
K-Nearest Neighbours (KNN) associating sequences to actions, and general
features with no annotated data would correspond to approximate clusters which
could be further enhanced. Our system proposes an iterative semi-supervised
method to address this challenge and to actively learn the association of
clusters and actions. The method utilizes latent space embedding and clustering
of the unsupervised encoder-decoder to guide the selection of sequences to be
annotated in each iteration. Each iteration, the selection aims to enhance
action recognition accuracy while choosing a small number of sequences for
annotation. We test the approach on human skeleton-based action recognition
benchmarks assuming that only annotations chosen by our method are available
and on mouse movements videos recorded in lab experiments. We show that our
system can boost recognition performance with only a small percentage of
annotations. The system can be used as an interactive annotation tool to guide
labeling efforts for 'in the wild' videos of various objects and actions to
reach robust recognition.
Related papers
- Collaborative Feature-Logits Contrastive Learning for Open-Set Semi-Supervised Object Detection [75.02249869573994]
In open-set scenarios, the unlabeled dataset contains both in-distribution (ID) classes and out-of-distribution (OOD) classes.
Applying semi-supervised detectors in such settings can lead to misclassifying OOD class as ID classes.
We propose a simple yet effective method, termed Collaborative Feature-Logits Detector (CFL-Detector)
arXiv Detail & Related papers (2024-11-20T02:57:35Z) - FinePseudo: Improving Pseudo-Labelling through Temporal-Alignablity for Semi-Supervised Fine-Grained Action Recognition [57.17966905865054]
Real-life applications of action recognition often require a fine-grained understanding of subtle movements.
Existing semi-supervised action recognition has mainly focused on coarse-grained action recognition.
We propose an Alignability-Verification-based Metric learning technique to effectively discriminate between fine-grained action pairs.
arXiv Detail & Related papers (2024-09-02T20:08:06Z) - An Information Compensation Framework for Zero-Shot Skeleton-based Action Recognition [49.45660055499103]
Zero-shot human skeleton-based action recognition aims to construct a model that can recognize actions outside the categories seen during training.
Previous research has focused on aligning sequences' visual and semantic spatial distributions.
We introduce a new loss function sampling method to obtain a tight and robust representation.
arXiv Detail & Related papers (2024-06-02T06:53:01Z) - Proposal-based Temporal Action Localization with Point-level Supervision [29.98225940694062]
Point-level supervised temporal action localization (PTAL) aims at recognizing and localizing actions in untrimmed videos.
We propose a novel method that localizes actions by generating and evaluating action proposals of flexible duration.
Experiments show that our proposed method achieves competitive or superior performance to the state-of-the-art methods.
arXiv Detail & Related papers (2023-10-09T08:27:05Z) - Neighbour Consistency Guided Pseudo-Label Refinement for Unsupervised
Person Re-Identification [80.98291772215154]
Unsupervised person re-identification (ReID) aims at learning discriminative identity features for person retrieval without any annotations.
Recent advances accomplish this task by leveraging clustering-based pseudo labels.
We propose a Neighbour Consistency guided Pseudo Label Refinement framework.
arXiv Detail & Related papers (2022-11-30T09:39:57Z) - Rethinking Clustering-Based Pseudo-Labeling for Unsupervised
Meta-Learning [146.11600461034746]
Method for unsupervised meta-learning, CACTUs, is a clustering-based approach with pseudo-labeling.
This approach is model-agnostic and can be combined with supervised algorithms to learn from unlabeled data.
We prove that the core reason for this is lack of a clustering-friendly property in the embedding space.
arXiv Detail & Related papers (2022-09-27T19:04:36Z) - Sparse Semi-Supervised Action Recognition with Active Learning [10.558951653323286]
Current state-of-the-art methods for skeleton-based action recognition are supervised and rely on labels.
We propose a novel approach for skeleton-based action recognition, called SESAR, that connects these approaches.
Our results outperform standalone skeleton-based supervised, unsupervised with cluster identification, and active-learning methods for action recognition when applied to sparse labeled samples.
arXiv Detail & Related papers (2020-12-03T07:48:31Z) - Self-supervised Video Object Segmentation [76.83567326586162]
The objective of this paper is self-supervised representation learning, with the goal of solving semi-supervised video object segmentation (a.k.a. dense tracking)
We make the following contributions: (i) we propose to improve the existing self-supervised approach, with a simple, yet more effective memory mechanism for long-term correspondence matching; (ii) by augmenting the self-supervised approach with an online adaptation module, our method successfully alleviates tracker drifts caused by spatial-temporal discontinuity; (iv) we demonstrate state-of-the-art results among the self-supervised approaches on DAVIS-2017 and YouTube
arXiv Detail & Related papers (2020-06-22T17:55:59Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.