Semi-Supervised 2D Human Pose Estimation Driven by Position
Inconsistency Pseudo Label Correction Module
- URL: http://arxiv.org/abs/2303.04346v1
- Date: Wed, 8 Mar 2023 02:57:05 GMT
- Title: Semi-Supervised 2D Human Pose Estimation Driven by Position
Inconsistency Pseudo Label Correction Module
- Authors: Linzhi Huang, Yulong Li, Hongbo Tian, Yue Yang, Xiangang Li, Weihong
Deng, Jieping Ye
- Abstract summary: Previous methods ignored two problems: (i) when conducting interactive training between a large model and a lightweight model, the pseudo labels of the lightweight model are used to guide the large model; (ii) the negative impact of noisy pseudo labels on training.
We propose a semi-supervised 2D human pose estimation framework driven by a position inconsistency pseudo label correction module (SSPCM).
To further improve the performance of the student model, we use the semi-supervised Cut-Occlude based on pseudo keypoint perception to generate harder and more effective samples.
- Score: 74.80776648785897
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this paper, we delve into semi-supervised 2D human pose estimation.
Previous methods ignored two problems: (i) when conducting interactive training
between a large model and a lightweight model, the pseudo labels of the lightweight
model are used to guide the large model; (ii) the negative impact of noisy
pseudo labels on training. Moreover, the labels used for 2D human pose
estimation are relatively complex: keypoint category and keypoint position. To
solve the problems mentioned above, we propose a semi-supervised 2D human pose
estimation framework driven by a position inconsistency pseudo label correction
module (SSPCM). We introduce an additional auxiliary teacher and use the pseudo
labels generated by the two teacher models in different periods to calculate the
inconsistency score and remove outliers. Then, the two teacher models are
updated through interactive training, and the student model is updated using
the pseudo labels generated by two teachers. To further improve the performance
of the student model, we use the semi-supervised Cut-Occlude based on pseudo
keypoint perception to generate harder and more effective samples. In addition,
we also propose a new indoor overhead fisheye human keypoint dataset,
WEPDTOF-Pose. Extensive experiments demonstrate that our method outperforms the
previous best semi-supervised 2D human pose estimation method. We will release
the code and dataset at https://github.com/hlz0606/SSPCM.
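The outlier-removal idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation, and the abstract does not specify how the inconsistency score is computed. The function name `correct_pseudo_labels`, the summed pairwise-distance score, the `num_remove` parameter, and the averaging of the surviving candidates are all assumptions made for illustration. The input is assumed to be a stack of candidate keypoint predictions, for example from the two teachers at different training periods.

```python
import numpy as np


def correct_pseudo_labels(preds, num_remove=1):
    """Hypothetical position-inconsistency correction (illustrative only).

    preds: array of shape (T, K, 2) holding T candidate predictions of
           K keypoints as (x, y) coordinates.
    Returns the corrected pseudo label of shape (K, 2) and the
    per-candidate inconsistency scores of shape (T, K).
    """
    preds = np.asarray(preds, dtype=float)
    num_candidates = preds.shape[0]
    if not 0 <= num_remove < num_candidates:
        raise ValueError("num_remove must be in [0, T)")

    # Pairwise Euclidean distances between candidates, per keypoint: (T, T, K).
    diff = preds[:, None, :, :] - preds[None, :, :, :]
    dist = np.linalg.norm(diff, axis=-1)

    # Assumed score: total distance from each candidate to all the others.
    scores = dist.sum(axis=1)  # (T, K)

    # Drop the num_remove most inconsistent candidates for each keypoint.
    keep = np.ones_like(scores, dtype=bool)
    if num_remove > 0:
        worst = np.argsort(scores, axis=0)[-num_remove:]  # (num_remove, K)
        np.put_along_axis(keep, worst, False, axis=0)

    # Average the remaining candidates to form the corrected pseudo label.
    weights = keep[..., None].astype(float)  # (T, K, 1)
    corrected = (preds * weights).sum(axis=0) / weights.sum(axis=0)
    return corrected, scores


# Four candidates for two keypoints; one candidate per keypoint is an outlier.
candidates = np.array([
    [[10.0, 10.0], [20.0, 20.0]],
    [[10.0, 10.0], [0.0, 0.0]],
    [[10.0, 10.0], [20.0, 20.0]],
    [[50.0, 50.0], [20.0, 20.0]],
])
corrected, scores = correct_pseudo_labels(candidates, num_remove=1)
```

In this toy input the outlying candidate differs per keypoint, which is why the score is computed per keypoint rather than per pose. A heatmap-based variant would need a different distance measure.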
Related papers
- TrajSSL: Trajectory-Enhanced Semi-Supervised 3D Object Detection [59.498894868956306]
Pseudo-labeling approaches to semi-supervised learning adopt a teacher-student framework.
We leverage pre-trained motion-forecasting models to generate object trajectories on pseudo-labeled data.
Our approach improves pseudo-label quality in two distinct manners.
arXiv Detail & Related papers (2024-09-17T05:35:00Z)
- Semi-supervised 2D Human Pose Estimation via Adaptive Keypoint Masking [2.297586471170049]
This paper proposes an adaptive keypoint masking method, which can fully mine the information in the samples and obtain better estimation performance.
The effectiveness of the proposed method is verified on COCO and MPII, outperforming state-of-the-art semi-supervised pose estimation methods by 5.2% and 0.3%, respectively.
arXiv Detail & Related papers (2024-04-23T08:41:50Z)
- Denoising and Selecting Pseudo-Heatmaps for Semi-Supervised Human Pose Estimation [38.97427474379367]
We introduce a denoising scheme to generate reliable pseudo-heatmaps as targets for learning from unlabeled data.
We select the learning targets from these pseudo-heatmaps guided by the estimated cross-student uncertainty.
Our results show that our model outperforms previous state-of-the-art semi-supervised pose estimators.
arXiv Detail & Related papers (2023-09-29T19:17:30Z)
- Optimising 2D Pose Representation: Improve Accuracy, Stability and Generalisability Within Unsupervised 2D-3D Human Pose Estimation [7.294965109944706]
We show that the optimal representation of a 2D pose is that of two independent segments, the torso and legs, with no shared features between each lifting network.
arXiv Detail & Related papers (2022-09-01T17:32:52Z)
- Adapting the Mean Teacher for keypoint-based lung registration under geometric domain shifts [75.51482952586773]
Deep neural networks generally require plenty of labeled training data and are vulnerable to domain shifts between training and test data.
We present a novel approach to geometric domain adaptation for image registration, adapting a model from a labeled source to an unlabeled target domain.
Our method consistently improves on the baseline model by 50%/47% while even matching the accuracy of models trained on target data.
arXiv Detail & Related papers (2022-07-01T12:16:42Z)
- PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision [102.48681650013698]
Existing self-supervised 3D human pose estimation schemes have largely relied on weak supervisions to guide the learning.
We propose a novel self-supervised approach that allows us to explicitly generate 2D-3D pose pairs for augmenting supervision.
This is made possible via introducing a reinforcement-learning-based imitator, which is learned jointly with a pose estimator alongside a pose hallucinator.
arXiv Detail & Related papers (2022-03-29T14:45:53Z)
- X-model: Improving Data Efficiency in Deep Learning with A Minimax Model [78.55482897452417]
We aim at improving data efficiency for both classification and regression setups in deep learning.
To take the power of both worlds, we propose a novel X-model.
X-model plays a minimax game between the feature extractor and task-specific heads.
arXiv Detail & Related papers (2021-10-09T13:56:48Z)
- Learning Heatmap-Style Jigsaw Puzzles Provides Good Pretraining for 2D Human Pose Estimation [19.389708889730834]
We introduce a self-supervised method for pretraining 2D pose estimation networks.
Specifically, we propose the Heatmap-Style Jigsaw Puzzles (HSJP) problem as our pretext task.
We only use images of person instances in MS-COCO, rather than introducing the extra and much larger ImageNet dataset.
With two popular and strong 2D human pose estimators, HRNet and SimpleBaseline, we evaluate mAP score on both MS-COCO validation and test-dev datasets.
arXiv Detail & Related papers (2020-12-13T17:04:29Z)
- DMT: Dynamic Mutual Training for Semi-Supervised Learning [69.17919491907296]
Self-training methods usually rely on single model prediction confidence to filter low-confidence pseudo labels.
We propose mutual training between two different models by a dynamically re-weighted loss function, called Dynamic Mutual Training.
Our experiments show that DMT achieves state-of-the-art performance in both image classification and semantic segmentation.
arXiv Detail & Related papers (2020-04-18T03:12:55Z)
This list is automatically generated from the titles and abstracts of the papers on this site.