Timestamp-Supervised Action Segmentation from the Perspective of
Clustering
- URL: http://arxiv.org/abs/2212.11694v2
- Date: Sun, 23 Apr 2023 02:45:46 GMT
- Title: Timestamp-Supervised Action Segmentation from the Perspective of
Clustering
- Authors: Dazhao Du, Enhan Li, Lingyu Si, Fanjiang Xu, Fuchun Sun
- Abstract summary: Most existing methods generate pseudo-labels for all frames in each video to train the segmentation model.
We propose a novel framework from the perspective of clustering, which includes the following two parts.
iterative clustering iteratively propagates the pseudo-labels to the ambiguous intervals by clustering, and thus updates the pseudo-label sequences to train the model.
- Score: 12.661218632080207
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Video action segmentation under timestamp supervision has recently received
much attention due to lower annotation costs. Most existing methods generate
pseudo-labels for all frames in each video to train the segmentation model.
However, these methods suffer from incorrect pseudo-labels, especially for the
semantically unclear frames in the transition region between two consecutive
actions, which we call ambiguous intervals. To address this issue, we propose a
novel framework from the perspective of clustering, which includes the
following two parts. First, pseudo-label ensembling generates incomplete but
high-quality pseudo-label sequences, where the frames in ambiguous intervals
have no pseudo-labels. Second, iterative clustering iteratively propagates the
pseudo-labels to the ambiguous intervals by clustering, and thus updates the
pseudo-label sequences to train the model. We further introduce a clustering
loss, which encourages the features of frames within the same action segment
more compact. Extensive experiments show the effectiveness of our method.
Related papers
- Scribble Hides Class: Promoting Scribble-Based Weakly-Supervised
Semantic Segmentation with Its Class Label [16.745019028033518]
We propose a class-driven scribble promotion network, which utilizes both scribble annotations and pseudo-labels informed by image-level classes and global semantics for supervision.
Experiments on the ScribbleSup dataset with different qualities of scribble annotations outperform all the previous methods, demonstrating the superiority and robustness of our method.
arXiv Detail & Related papers (2024-02-27T14:51:56Z) - Densify Your Labels: Unsupervised Clustering with Bipartite Matching for
Weakly Supervised Point Cloud Segmentation [42.144991202299934]
We propose a weakly supervised semantic segmentation method for point clouds that predicts "per-point" labels from just "whole-scene" annotations.
Our core idea is to propagate the scene-level labels to each point in the point cloud by creating pseudo labels in a conservative way.
We evaluate our method on ScanNet and S3DIS datasets, outperforming state of the art, and demonstrate that we can achieve results comparable to fully supervised methods.
arXiv Detail & Related papers (2023-12-11T19:18:17Z) - Exploiting Completeness and Uncertainty of Pseudo Labels for Weakly
Supervised Video Anomaly Detection [149.23913018423022]
Weakly supervised video anomaly detection aims to identify abnormal events in videos using only video-level labels.
Two-stage self-training methods have achieved significant improvements by self-generating pseudo labels.
We propose an enhancement framework by exploiting completeness and uncertainty properties for effective self-training.
arXiv Detail & Related papers (2022-12-08T05:53:53Z) - A Survey on Label-efficient Deep Segmentation: Bridging the Gap between
Weak Supervision and Dense Prediction [115.9169213834476]
This paper offers a comprehensive review on label-efficient segmentation methods.
We first develop a taxonomy to organize these methods according to the supervision provided by different types of weak labels.
Next, we summarize the existing label-efficient segmentation methods from a unified perspective.
arXiv Detail & Related papers (2022-07-04T06:21:01Z) - Timestamp-Supervised Action Segmentation with Graph Convolutional
Networks [7.696728525672148]
A graph convolutional network is learned to generate dense framewise labels from sparse timestamp labels.
The generated dense framewise labels can then be used to train the segmentation model.
Detailed experiments on four public datasets, including 50 Salads, GTEA, Breakfast, and Desktop Assembly, show that our method is superior to the multi-layer perceptron baseline.
arXiv Detail & Related papers (2022-06-30T05:56:24Z) - Refining Pseudo Labels with Clustering Consensus over Generations for
Unsupervised Object Re-identification [84.72303377833732]
Unsupervised object re-identification targets at learning discriminative representations for object retrieval without any annotations.
We propose to estimate pseudo label similarities between consecutive training generations with clustering consensus and refine pseudo labels with temporally propagated and ensembled pseudo labels.
The proposed pseudo label refinery strategy is simple yet effective and can be seamlessly integrated into existing clustering-based unsupervised re-identification methods.
arXiv Detail & Related papers (2021-06-11T02:42:42Z) - A Closer Look at Self-training for Zero-Label Semantic Segmentation [53.4488444382874]
Being able to segment unseen classes not observed during training is an important technical challenge in deep learning.
Prior zero-label semantic segmentation works approach this task by learning visual-semantic embeddings or generative models.
We propose a consistency regularizer to filter out noisy pseudo-labels by taking the intersections of the pseudo-labels generated from different augmentations of the same image.
arXiv Detail & Related papers (2021-04-21T14:34:33Z) - SegGroup: Seg-Level Supervision for 3D Instance and Semantic
Segmentation [88.22349093672975]
We design a weakly supervised point cloud segmentation algorithm that only requires clicking on one point per instance to indicate its location for annotation.
With over-segmentation for pre-processing, we extend these location annotations into segments as seg-level labels.
We show that our seg-level supervised method (SegGroup) achieves comparable results with the fully annotated point-level supervised methods.
arXiv Detail & Related papers (2020-12-18T13:23:34Z) - Weakly Supervised Temporal Action Localization with Segment-Level Labels [140.68096218667162]
Temporal action localization presents a trade-off between test performance and annotation-time cost.
We introduce a new segment-level supervision setting: segments are labeled when annotators observe actions happening here.
We devise a partial segment loss regarded as a loss sampling to learn integral action parts from labeled segments.
arXiv Detail & Related papers (2020-07-03T10:32:19Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.