Iterative Graph Filtering Network for 3D Human Pose Estimation
- URL: http://arxiv.org/abs/2307.16074v2
- Date: Mon, 7 Aug 2023 22:11:33 GMT
- Title: Iterative Graph Filtering Network for 3D Human Pose Estimation
- Authors: Zaedul Islam and A. Ben Hamza
- Abstract summary: Graph convolutional networks (GCNs) have proven to be an effective approach for 3D human pose estimation.
In this paper, we introduce an iterative graph filtering framework for 3D human pose estimation.
Our approach builds upon the idea of iteratively solving graph filtering with Laplacian regularization.
- Score: 5.177947445379688
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Graph convolutional networks (GCNs) have proven to be an effective approach
for 3D human pose estimation. By naturally modeling the skeleton structure of
the human body as a graph, GCNs are able to capture the spatial relationships
between joints and learn an efficient representation of the underlying pose.
However, most GCN-based methods use a shared weight matrix, making it
challenging to accurately capture the different and complex relationships
between joints. In this paper, we introduce an iterative graph filtering
framework for 3D human pose estimation, which aims to predict the 3D joint
positions given a set of 2D joint locations in images. Our approach builds upon
the idea of iteratively solving graph filtering with Laplacian regularization
via the Gauss-Seidel iterative method. Motivated by this iterative solution, we
design a Gauss-Seidel network (GS-Net) architecture, which makes use of weight
and adjacency modulation, skip connection, and a pure convolutional block with
layer normalization. Adjacency modulation facilitates the learning of edges
that go beyond the inherent connections of body joints, resulting in an
adjusted graph structure that reflects the human skeleton, while skip
connections help maintain crucial information from the input layer's initial
features as the network depth increases. We evaluate our proposed model on two
standard benchmark datasets, and compare it with a comprehensive set of strong
baseline methods for 3D human pose estimation. Our experimental results
demonstrate that our approach outperforms the baseline methods on both
datasets, achieving state-of-the-art performance. Furthermore, we conduct
ablation studies to analyze the contributions of different components of our
model architecture and show that the skip connection and adjacency modulation
help improve the model performance.
Related papers
- Spatio-temporal MLP-graph network for 3D human pose estimation [8.267311047244881]
Graph convolutional networks and their variants have shown significant promise in 3D human pose estimation.
We introduce a new weighted Jacobi feature rule obtained through graph filtering with implicit propagation fairing.
We also employ adjacency modulation with the aim of learning meaningful correlations beyond defined between body joints.
arXiv Detail & Related papers (2023-08-29T14:00:55Z) - Regular Splitting Graph Network for 3D Human Pose Estimation [5.177947445379688]
We introduce a higher-order regular splitting graph network (RS-Net) for 2D-to-3D human pose estimation.
Our model achieves superior performance over recent state-of-the-art methods for 3D human pose estimation.
arXiv Detail & Related papers (2023-05-09T22:13:04Z) - Skeleton-Parted Graph Scattering Networks for 3D Human Motion Prediction [120.08257447708503]
Graph convolutional network based methods that model the body-joints' relations, have recently shown great promise in 3D skeleton-based human motion prediction.
We propose a novel skeleton-parted graph scattering network (SPGSN)
SPGSN outperforms state-of-the-art methods by remarkable margins of 13.8%, 9.3% and 2.7% in terms of 3D mean per joint position error (MPJPE) on Human3.6M, CMU Mocap and 3DPW datasets, respectively.
arXiv Detail & Related papers (2022-07-31T05:51:39Z) - Higher-Order Implicit Fairing Networks for 3D Human Pose Estimation [1.1501261942096426]
We introduce a higher-order graph convolutional framework with initial residual connections for 2D-to-3D pose estimation.
Our model is able to capture the long-range dependencies between body joints.
Experiments and ablations studies conducted on two standard benchmarks demonstrate the effectiveness of our model.
arXiv Detail & Related papers (2021-11-01T13:48:55Z) - Graph-Based 3D Multi-Person Pose Estimation Using Multi-View Images [79.70127290464514]
We decompose the task into two stages, i.e. person localization and pose estimation.
And we propose three task-specific graph neural networks for effective message passing.
Our approach achieves state-of-the-art performance on CMU Panoptic and Shelf datasets.
arXiv Detail & Related papers (2021-09-13T11:44:07Z) - Conditional Directed Graph Convolution for 3D Human Pose Estimation [23.376538132362498]
Graph convolutional networks have significantly improved 3D human pose estimation by representing the human skeleton as an undirected graph.
This paper proposes to represent the human skeleton as a directed graph with the joints as nodes and bones as edges that are directed from parent joints to child joints.
arXiv Detail & Related papers (2021-07-16T09:50:40Z) - A hybrid classification-regression approach for 3D hand pose estimation
using graph convolutional networks [1.0152838128195467]
We propose a two-stage GCN-based framework that learns per-pose relationship constraints.
The first phase quantizes the 2D/3D space to classify the joints into 2D/3D blocks based on their locality.
The second stage uses a GCN-based module that uses an adaptative nearest neighbor algorithm to determine joint relationships.
arXiv Detail & Related papers (2021-05-23T10:09:10Z) - 3D Human Pose Regression using Graph Convolutional Network [68.8204255655161]
We propose a graph convolutional network named PoseGraphNet for 3D human pose regression from 2D poses.
Our model's performance is close to the state-of-the-art, but with much fewer parameters.
arXiv Detail & Related papers (2021-05-21T14:41:31Z) - On the spatial attention in Spatio-Temporal Graph Convolutional Networks
for skeleton-based human action recognition [97.14064057840089]
Graphal networks (GCNs) promising performance in skeleton-based human action recognition by modeling a sequence of skeletons as a graph.
Most of the recently proposed G-temporal-based methods improve the performance by learning the graph structure at each layer of the network.
arXiv Detail & Related papers (2020-11-07T19:03:04Z) - Mix Dimension in Poincar\'{e} Geometry for 3D Skeleton-based Action
Recognition [57.98278794950759]
Graph Convolutional Networks (GCNs) have already demonstrated their powerful ability to model the irregular data.
We present a novel spatial-temporal GCN architecture which is defined via the Poincar'e geometry.
We evaluate our method on two current largest scale 3D datasets.
arXiv Detail & Related papers (2020-07-30T18:23:18Z) - Learning 3D Human Shape and Pose from Dense Body Parts [117.46290013548533]
We propose a Decompose-and-aggregate Network (DaNet) to learn 3D human shape and pose from dense correspondences of body parts.
Messages from local streams are aggregated to enhance the robust prediction of the rotation-based poses.
Our method is validated on both indoor and real-world datasets including Human3.6M, UP3D, COCO, and 3DPW.
arXiv Detail & Related papers (2019-12-31T15:09:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.