Unsupervised 3D Human Pose Representation with Viewpoint and Pose
Disentanglement
- URL: http://arxiv.org/abs/2007.07053v2
- Date: Tue, 2 Nov 2021 03:23:43 GMT
- Title: Unsupervised 3D Human Pose Representation with Viewpoint and Pose
Disentanglement
- Authors: Qiang Nie, Ziwei Liu, Yunhui Liu
- Abstract summary: Learning a good 3D human pose representation is important for human pose related tasks.
We propose a novel Siamese denoising autoencoder to learn a 3D pose representation.
Our approach achieves state-of-the-art performance on two inherently different tasks.
- Score: 63.853412753242615
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Learning a good 3D human pose representation is important for human pose
related tasks, e.g. human 3D pose estimation and action recognition. Within all
these problems, preserving the intrinsic pose information and adapting to view
variations are two critical issues. In this work, we propose a novel Siamese
denoising autoencoder to learn a 3D pose representation by disentangling the
pose-dependent and view-dependent feature from the human skeleton data, in a
fully unsupervised manner. These two disentangled features are utilized
together as the representation of the 3D pose. To consider both the kinematic
and geometric dependencies, a sequential bidirectional recursive network
(SeBiReNet) is further proposed to model the human skeleton data. Extensive
experiments demonstrate that the learned representation 1) preserves the
intrinsic information of human pose, 2) shows good transferability across
datasets and tasks. Notably, our approach achieves state-of-the-art performance
on two inherently different tasks: pose denoising and unsupervised action
recognition. Code and models are available at:
\url{https://github.com/NIEQiang001/unsupervised-human-pose.git}
Related papers
- Understanding Pose and Appearance Disentanglement in 3D Human Pose
Estimation [72.50214227616728]
Several methods have proposed to learn image representations in a self-supervised fashion so as to disentangle the appearance information from the pose one.
We study disentanglement from the perspective of the self-supervised network, via diverse image synthesis experiments.
We design an adversarial strategy focusing on generating natural appearance changes of the subject, and against which we could expect a disentangled network to be robust.
arXiv Detail & Related papers (2023-09-20T22:22:21Z) - 3D Human Pose Regression using Graph Convolutional Network [68.8204255655161]
We propose a graph convolutional network named PoseGraphNet for 3D human pose regression from 2D poses.
Our model's performance is close to the state-of-the-art, but with much fewer parameters.
arXiv Detail & Related papers (2021-05-21T14:41:31Z) - Pose2Mesh: Graph Convolutional Network for 3D Human Pose and Mesh
Recovery from a 2D Human Pose [70.23652933572647]
We propose a novel graph convolutional neural network (GraphCNN)-based system that estimates the 3D coordinates of human mesh vertices directly from the 2D human pose.
We show that our Pose2Mesh outperforms the previous 3D human pose and mesh estimation methods on various benchmark datasets.
arXiv Detail & Related papers (2020-08-20T16:01:56Z) - Self-Supervised 3D Human Pose Estimation via Part Guided Novel Image
Synthesis [72.34794624243281]
We propose a self-supervised learning framework to disentangle variations from unlabeled video frames.
Our differentiable formalization, bridging the representation gap between the 3D pose and spatial part maps, allows us to operate on videos with diverse camera movements.
arXiv Detail & Related papers (2020-04-09T07:55:01Z) - Chained Representation Cycling: Learning to Estimate 3D Human Pose and
Shape by Cycling Between Representations [73.11883464562895]
We propose a new architecture that facilitates unsupervised, or lightly supervised, learning.
We demonstrate the method by learning 3D human pose and shape from un-paired and un-annotated images.
While we present results for modeling humans, our formulation is general and can be applied to other vision problems.
arXiv Detail & Related papers (2020-01-06T14:54:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.