Regular Splitting Graph Network for 3D Human Pose Estimation
- URL: http://arxiv.org/abs/2305.05785v1
- Date: Tue, 9 May 2023 22:13:04 GMT
- Title: Regular Splitting Graph Network for 3D Human Pose Estimation
- Authors: Tanvir Hassan and A. Ben Hamza
- Abstract summary: We introduce a higher-order regular splitting graph network (RS-Net) for 2D-to-3D human pose estimation.
Our model achieves superior performance over recent state-of-the-art methods for 3D human pose estimation.
- Score: 5.177947445379688
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In human pose estimation methods based on graph convolutional architectures,
the human skeleton is usually modeled as an undirected graph whose nodes are
body joints and edges are connections between neighboring joints. However, most
of these methods tend to focus on learning relationships between body joints of
the skeleton using first-order neighbors, ignoring higher-order neighbors and
hence limiting their ability to exploit relationships between distant joints.
In this paper, we introduce a higher-order regular splitting graph network
(RS-Net) for 2D-to-3D human pose estimation using matrix splitting in
conjunction with weight and adjacency modulation. The core idea is to capture
long-range dependencies between body joints using multi-hop neighborhoods and
also to learn different modulation vectors for different body joints as well as
a modulation matrix added to the adjacency matrix associated to the skeleton.
This learnable modulation matrix helps adjust the graph structure by adding
extra graph edges in an effort to learn additional connections between body
joints. Instead of using a shared weight matrix for all neighboring body
joints, the proposed RS-Net model applies weight unsharing before aggregating
the feature vectors associated to the joints in order to capture the different
relations between them. Experiments and ablations studies performed on two
benchmark datasets demonstrate the effectiveness of our model, achieving
superior performance over recent state-of-the-art methods for 3D human pose
estimation.
Related papers
- 3D Hand Reconstruction via Aggregating Intra and Inter Graphs Guided by
Prior Knowledge for Hand-Object Interaction Scenario [8.364378460776832]
We propose a 3D hand reconstruction network combining the benefits of model-based and model-free approaches to balance accuracy and physical plausibility for hand-object interaction scenario.
Firstly, we present a novel MANO pose parameters regression module from 2D joints directly, which avoids the process of highly nonlinear mapping from abstract image feature.
arXiv Detail & Related papers (2024-03-04T05:11:26Z) - Spatio-temporal MLP-graph network for 3D human pose estimation [8.267311047244881]
Graph convolutional networks and their variants have shown significant promise in 3D human pose estimation.
We introduce a new weighted Jacobi feature rule obtained through graph filtering with implicit propagation fairing.
We also employ adjacency modulation with the aim of learning meaningful correlations beyond defined between body joints.
arXiv Detail & Related papers (2023-08-29T14:00:55Z) - Iterative Graph Filtering Network for 3D Human Pose Estimation [5.177947445379688]
Graph convolutional networks (GCNs) have proven to be an effective approach for 3D human pose estimation.
In this paper, we introduce an iterative graph filtering framework for 3D human pose estimation.
Our approach builds upon the idea of iteratively solving graph filtering with Laplacian regularization.
arXiv Detail & Related papers (2023-07-29T20:46:44Z) - Skeleton-Parted Graph Scattering Networks for 3D Human Motion Prediction [120.08257447708503]
Graph convolutional network based methods that model the body-joints' relations, have recently shown great promise in 3D skeleton-based human motion prediction.
We propose a novel skeleton-parted graph scattering network (SPGSN)
SPGSN outperforms state-of-the-art methods by remarkable margins of 13.8%, 9.3% and 2.7% in terms of 3D mean per joint position error (MPJPE) on Human3.6M, CMU Mocap and 3DPW datasets, respectively.
arXiv Detail & Related papers (2022-07-31T05:51:39Z) - Graph-Based 3D Multi-Person Pose Estimation Using Multi-View Images [79.70127290464514]
We decompose the task into two stages, i.e. person localization and pose estimation.
And we propose three task-specific graph neural networks for effective message passing.
Our approach achieves state-of-the-art performance on CMU Panoptic and Shelf datasets.
arXiv Detail & Related papers (2021-09-13T11:44:07Z) - Conditional Directed Graph Convolution for 3D Human Pose Estimation [23.376538132362498]
Graph convolutional networks have significantly improved 3D human pose estimation by representing the human skeleton as an undirected graph.
This paper proposes to represent the human skeleton as a directed graph with the joints as nodes and bones as edges that are directed from parent joints to child joints.
arXiv Detail & Related papers (2021-07-16T09:50:40Z) - A hybrid classification-regression approach for 3D hand pose estimation
using graph convolutional networks [1.0152838128195467]
We propose a two-stage GCN-based framework that learns per-pose relationship constraints.
The first phase quantizes the 2D/3D space to classify the joints into 2D/3D blocks based on their locality.
The second stage uses a GCN-based module that uses an adaptative nearest neighbor algorithm to determine joint relationships.
arXiv Detail & Related papers (2021-05-23T10:09:10Z) - 3D Human Pose Regression using Graph Convolutional Network [68.8204255655161]
We propose a graph convolutional network named PoseGraphNet for 3D human pose regression from 2D poses.
Our model's performance is close to the state-of-the-art, but with much fewer parameters.
arXiv Detail & Related papers (2021-05-21T14:41:31Z) - Revisiting Skeleton-based Action Recognition [107.08112310075114]
PoseC3D is a new approach to skeleton-based action recognition, which relies on a 3D heatmap instead stack a graph sequence as the base representation of human skeletons.
On four challenging datasets, PoseC3D consistently obtains superior performance, when used alone on skeletons and in combination with the RGB modality.
arXiv Detail & Related papers (2021-04-28T06:32:17Z) - MotioNet: 3D Human Motion Reconstruction from Monocular Video with
Skeleton Consistency [72.82534577726334]
We introduce MotioNet, a deep neural network that directly reconstructs the motion of a 3D human skeleton from monocular video.
Our method is the first data-driven approach that directly outputs a kinematic skeleton, which is a complete, commonly used, motion representation.
arXiv Detail & Related papers (2020-06-22T08:50:09Z) - Disentangling and Unifying Graph Convolutions for Skeleton-Based Action
Recognition [79.33539539956186]
We propose a simple method to disentangle multi-scale graph convolutions and a unified spatial-temporal graph convolutional operator named G3D.
By coupling these proposals, we develop a powerful feature extractor named MS-G3D based on which our model outperforms previous state-of-the-art methods on three large-scale datasets.
arXiv Detail & Related papers (2020-03-31T11:28:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.