PoseTrans: A Simple Yet Effective Pose Transformation Augmentation for
Human Pose Estimation
- URL: http://arxiv.org/abs/2208.07755v1
- Date: Tue, 16 Aug 2022 14:03:01 GMT
- Title: PoseTrans: A Simple Yet Effective Pose Transformation Augmentation for
Human Pose Estimation
- Authors: Wentao Jiang, Sheng Jin, Wentao Liu, Chen Qian, Ping Luo, Si Liu
- Abstract summary: We propose Pose Transformation (PoseTrans) to create new training samples that have diverse poses.
We also propose Pose Clustering Module (PCM) to measure the pose rarity and select the "rarest" poses to help balance the long-tailed distribution.
Our method is efficient and simple to implement, which can be easily integrated into the training pipeline of existing pose estimation models.
- Score: 40.50255017107963
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Human pose estimation aims to accurately estimate a wide variety of human
poses. However, existing datasets often follow a long-tailed distribution that
unusual poses only occupy a small portion, which further leads to the lack of
diversity of rare poses. These issues result in the inferior generalization
ability of current pose estimators. In this paper, we present a simple yet
effective data augmentation method, termed Pose Transformation (PoseTrans), to
alleviate the aforementioned problems. Specifically, we propose Pose
Transformation Module (PTM) to create new training samples that have diverse
poses and adopt a pose discriminator to ensure the plausibility of the
augmented poses. Besides, we propose Pose Clustering Module (PCM) to measure
the pose rarity and select the "rarest" poses to help balance the long-tailed
distribution. Extensive experiments on three benchmark datasets demonstrate the
effectiveness of our method, especially on rare poses. Also, our method is
efficient and simple to implement, which can be easily integrated into the
training pipeline of existing pose estimation models.
Related papers
- Multi-modal Pose Diffuser: A Multimodal Generative Conditional Pose Prior [8.314155285516073]
MOPED is the first method to leverage a novel multi-modal conditional diffusion model as a prior for SMPL pose parameters.
Our method offers powerful unconditional pose generation with the ability to condition on multi-modal inputs such as images and text.
arXiv Detail & Related papers (2024-10-18T15:29:19Z) - Pose-NDF: Modeling Human Pose Manifolds with Neural Distance Fields [47.62275563070933]
We present a continuous model for plausible human poses based on neural distance fields (NDFs)
Pose-NDF learns a manifold of plausible poses as the zero level set of a neural implicit function.
It can be used to generate more diverse poses by random sampling and projection than VAE-based methods.
arXiv Detail & Related papers (2022-07-27T21:46:47Z) - PoseGU: 3D Human Pose Estimation with Novel Human Pose Generator and
Unbiased Learning [36.609189237732394]
3D pose estimation has recently gained substantial interests in computer vision domain.
Existing 3D pose estimation methods have a strong reliance on large size well-annotated 3D pose datasets.
We propose PoseGU, a novel human pose generator that generates diverse poses with access only to a small size of seed samples.
arXiv Detail & Related papers (2022-07-07T23:43:53Z) - PoseAug: A Differentiable Pose Augmentation Framework for 3D Human Pose
Estimation [83.50127973254538]
Existing 3D human pose estimators suffer poor generalization performance to new datasets.
We present PoseAug, a new auto-augmentation framework that learns to augment the available training poses towards a greater diversity.
arXiv Detail & Related papers (2021-05-06T06:57:42Z) - FixMyPose: Pose Correctional Captioning and Retrieval [67.20888060019028]
We introduce a new captioning dataset named FixMyPose to address automated pose correction systems.
We collect descriptions of correcting a "current" pose to look like a "target" pose.
To avoid ML biases, we maintain a balance across characters with diverse demographics.
arXiv Detail & Related papers (2021-04-04T21:45:44Z) - Chasing the Tail in Monocular 3D Human Reconstruction with Prototype
Memory [98.36233875637168]
We propose a prototype memory-augmented network, PM-Net, that effectively improves performances of predicting rare poses.
In this work, we 1) identify and analyze this learning obstacle and 2) propose a prototype memory-augmented network, PM-Net, that effectively improves performances of predicting rare poses.
arXiv Detail & Related papers (2020-12-29T12:57:22Z) - PoNA: Pose-guided Non-local Attention for Human Pose Transfer [105.14398322129024]
We propose a new human pose transfer method using a generative adversarial network (GAN) with simplified cascaded blocks.
Our model generates sharper and more realistic images with rich details, while having fewer parameters and faster speed.
arXiv Detail & Related papers (2020-12-13T12:38:29Z) - Self-Supervised 3D Human Pose Estimation via Part Guided Novel Image
Synthesis [72.34794624243281]
We propose a self-supervised learning framework to disentangle variations from unlabeled video frames.
Our differentiable formalization, bridging the representation gap between the 3D pose and spatial part maps, allows us to operate on videos with diverse camera movements.
arXiv Detail & Related papers (2020-04-09T07:55:01Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.