RePose: Learning Deep Kinematic Priors for Fast Human Pose Estimation
- URL: http://arxiv.org/abs/2002.03933v1
- Date: Mon, 10 Feb 2020 16:44:45 GMT
- Title: RePose: Learning Deep Kinematic Priors for Fast Human Pose Estimation
- Authors: Hossam Isack, Christian Haene, Cem Keskin, Sofien Bouaziz, Yuri
Boykov, Shahram Izadi and Sameh Khamis
- Abstract summary: We propose a novel efficient and lightweight model for human pose estimation from a single image.
Our model is designed to achieve competitive results at a fraction of the number of parameters and computational cost of various state-of-the-art methods.
- Score: 17.0630180888369
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We propose a novel efficient and lightweight model for human pose estimation
from a single image. Our model is designed to achieve competitive results at a
fraction of the number of parameters and computational cost of various
state-of-the-art methods. To this end, we explicitly incorporate part-based
structural and geometric priors in a hierarchical prediction framework. At the
coarsest resolution, and in a manner similar to classical part-based
approaches, we leverage the kinematic structure of the human body to propagate
convolutional feature updates between the keypoints or body parts. Unlike
classical approaches, we adopt end-to-end training to learn this geometric
prior through feature updates from data. We then propagate the feature
representation at the coarsest resolution up the hierarchy to refine the
predicted pose in a coarse-to-fine fashion. The final network effectively
models the geometric prior and intuition within a lightweight deep neural
network, yielding state-of-the-art results for a model of this size on two
standard datasets, Leeds Sports Pose and MPII Human Pose.
Related papers
- PGAHum: Prior-Guided Geometry and Appearance Learning for High-Fidelity Animatable Human Reconstruction [9.231326291897817]
We introduce PGAHum, a prior-guided geometry and appearance learning framework for high-fidelity animatable human reconstruction.
We thoroughly exploit 3D human priors in three key modules of PGAHum to achieve high-quality geometry reconstruction with intricate details and photorealistic view synthesis on unseen poses.
arXiv Detail & Related papers (2024-04-22T04:22:30Z) - FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects [55.77542145604758]
FoundationPose is a unified foundation model for 6D object pose estimation and tracking.
Our approach can be instantly applied at test-time to a novel object without fine-tuning.
arXiv Detail & Related papers (2023-12-13T18:28:09Z) - GS-Pose: Category-Level Object Pose Estimation via Geometric and
Semantic Correspondence [5.500735640045456]
Category-level pose estimation is a challenging task with many potential applications in computer vision and robotics.
We propose to utilize both geometric and semantic features obtained from a pre-trained foundation model.
This requires significantly less data to train than prior methods since the semantic features are robust to object texture and appearance.
arXiv Detail & Related papers (2023-11-23T02:35:38Z) - Scaling Pre-trained Language Models to Deeper via Parameter-efficient
Architecture [68.13678918660872]
We design a more capable parameter-sharing architecture based on matrix product operator (MPO)
MPO decomposition can reorganize and factorize the information of a parameter matrix into two parts.
Our architecture shares the central tensor across all layers for reducing the model size.
arXiv Detail & Related papers (2023-03-27T02:34:09Z) - STAR: Sparse Transformer-based Action Recognition [61.490243467748314]
This work proposes a novel skeleton-based human action recognition model with sparse attention on the spatial dimension and segmented linear attention on the temporal dimension of data.
Experiments show that our model can achieve comparable performance while utilizing much less trainable parameters and achieve high speed in training and inference.
arXiv Detail & Related papers (2021-07-15T02:53:11Z) - Bridge the Gap Between Model-based and Model-free Human Reconstruction [10.818838437018682]
We present an end-to-end neural network that simultaneously predicts the pixel-aligned implicit surface and the explicit mesh model built by graph convolutional neural network.
Experiments on DeepHuman dataset showed that our approach is effective.
arXiv Detail & Related papers (2021-06-11T11:13:42Z) - Learning to Segment Human Body Parts with Synthetically Trained Deep
Convolutional Networks [58.0240970093372]
This paper presents a new framework for human body part segmentation based on Deep Convolutional Neural Networks trained using only synthetic data.
The proposed approach achieves cutting-edge results without the need of training the models with real annotated data of human body parts.
arXiv Detail & Related papers (2021-02-02T12:26:50Z) - Shape Prior Deformation for Categorical 6D Object Pose and Size
Estimation [62.618227434286]
We present a novel learning approach to recover the 6D poses and sizes of unseen object instances from an RGB-D image.
We propose a deep network to reconstruct the 3D object model by explicitly modeling the deformation from a pre-learned categorical shape prior.
arXiv Detail & Related papers (2020-07-16T16:45:05Z) - Monocular Human Pose and Shape Reconstruction using Part Differentiable
Rendering [53.16864661460889]
Recent works succeed in regression-based methods which estimate parametric models directly through a deep neural network supervised by 3D ground truth.
In this paper, we introduce body segmentation as critical supervision.
To improve the reconstruction with part segmentation, we propose a part-level differentiable part that enables part-based models to be supervised by part segmentation.
arXiv Detail & Related papers (2020-03-24T14:25:46Z) - Hierarchical Kinematic Human Mesh Recovery [30.348060841242628]
We consider the problem of estimating a parametric model of 3D human mesh from a single image.
We propose a new technique for regression of human parametric model that is explicitly informed by the known hierarchical structure.
arXiv Detail & Related papers (2020-03-09T16:15:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.