Mix Dimension in Poincar\'{e} Geometry for 3D Skeleton-based Action
Recognition
- URL: http://arxiv.org/abs/2007.15678v2
- Date: Mon, 3 Aug 2020 14:19:47 GMT
- Title: Mix Dimension in Poincar\'{e} Geometry for 3D Skeleton-based Action
Recognition
- Authors: Wei Peng and Jingang Shi and Zhaoqiang Xia and Guoying Zhao
- Abstract summary: Graph Convolutional Networks (GCNs) have already demonstrated their powerful ability to model the irregular data.
We present a novel spatial-temporal GCN architecture which is defined via the Poincar'e geometry.
We evaluate our method on two current largest scale 3D datasets.
- Score: 57.98278794950759
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Graph Convolutional Networks (GCNs) have already demonstrated their powerful
ability to model the irregular data, e.g., skeletal data in human action
recognition, providing an exciting new way to fuse rich structural information
for nodes residing in different parts of a graph. In human action recognition,
current works introduce a dynamic graph generation mechanism to better capture
the underlying semantic skeleton connections and thus improves the performance.
In this paper, we provide an orthogonal way to explore the underlying
connections. Instead of introducing an expensive dynamic graph generation
paradigm, we build a more efficient GCN on a Riemann manifold, which we think
is a more suitable space to model the graph data, to make the extracted
representations fit the embedding matrix. Specifically, we present a novel
spatial-temporal GCN (ST-GCN) architecture which is defined via the Poincar\'e
geometry such that it is able to better model the latent anatomy of the
structure data. To further explore the optimal projection dimension in the
Riemann space, we mix different dimensions on the manifold and provide an
efficient way to explore the dimension for each ST-GCN layer. With the final
resulted architecture, we evaluate our method on two current largest scale 3D
datasets, i.e., NTU RGB+D and NTU RGB+D 120. The comparison results show that
the model could achieve a superior performance under any given evaluation
metrics with only 40\% model size when compared with the previous best GCN
method, which proves the effectiveness of our model.
Related papers
- Equivariant Neural Operator Learning with Graphon Convolution [12.059797539633506]
We propose a general architecture that combines the learning coefficient scheme with a residual operator layer for learning mappings between continuous functions in the 3D Euclidean space.
arXiv Detail & Related papers (2023-11-17T23:28:22Z) - Iterative Graph Filtering Network for 3D Human Pose Estimation [5.177947445379688]
Graph convolutional networks (GCNs) have proven to be an effective approach for 3D human pose estimation.
In this paper, we introduce an iterative graph filtering framework for 3D human pose estimation.
Our approach builds upon the idea of iteratively solving graph filtering with Laplacian regularization.
arXiv Detail & Related papers (2023-07-29T20:46:44Z) - Latent Graph Inference using Product Manifolds [0.0]
We generalize the discrete Differentiable Graph Module (dDGM) for latent graph learning.
Our novel approach is tested on a wide range of datasets, and outperforms the original dDGM model.
arXiv Detail & Related papers (2022-11-26T22:13:06Z) - Learned Vertex Descent: A New Direction for 3D Human Model Fitting [64.04726230507258]
We propose a novel optimization-based paradigm for 3D human model fitting on images and scans.
Our approach is able to capture the underlying body of clothed people with very different body shapes, achieving a significant improvement compared to state-of-the-art.
LVD is also applicable to 3D model fitting of humans and hands, for which we show a significant improvement to the SOTA with a much simpler and faster method.
arXiv Detail & Related papers (2022-05-12T17:55:51Z) - ACE-HGNN: Adaptive Curvature Exploration Hyperbolic Graph Neural Network [72.16255675586089]
We propose an Adaptive Curvature Exploration Hyperbolic Graph NeuralNetwork named ACE-HGNN to adaptively learn the optimal curvature according to the input graph and downstream tasks.
Experiments on multiple real-world graph datasets demonstrate a significant and consistent performance improvement in model quality with competitive performance and good generalization ability.
arXiv Detail & Related papers (2021-10-15T07:18:57Z) - Dense Graph Convolutional Neural Networks on 3D Meshes for 3D Object
Segmentation and Classification [0.0]
We present new designs of graph convolutional neural networks (GCNs) on 3D meshes for 3D object classification and segmentation.
We use the faces of the mesh as basic processing units and represent a 3D mesh as a graph where each node corresponds to a face.
arXiv Detail & Related papers (2021-06-30T02:17:16Z) - On the spatial attention in Spatio-Temporal Graph Convolutional Networks
for skeleton-based human action recognition [97.14064057840089]
Graphal networks (GCNs) promising performance in skeleton-based human action recognition by modeling a sequence of skeletons as a graph.
Most of the recently proposed G-temporal-based methods improve the performance by learning the graph structure at each layer of the network.
arXiv Detail & Related papers (2020-11-07T19:03:04Z) - Meta Graph Attention on Heterogeneous Graph with Node-Edge Co-evolution [44.90253939019069]
We present Coevolved Meta Graph Neural Network (CoMGNN), which applies meta graph attention to heterogeneous graphs with co-evolution of edges and node states.
We also propose CoMGNN (ST-CoMGNN) for modelingtemporal patterns on nodes and edges.
arXiv Detail & Related papers (2020-10-09T13:19:39Z) - Learning Local Neighboring Structure for Robust 3D Shape Representation [143.15904669246697]
Representation learning for 3D meshes is important in many computer vision and graphics applications.
We propose a local structure-aware anisotropic convolutional operation (LSA-Conv)
Our model produces significant improvement in 3D shape reconstruction compared to state-of-the-art methods.
arXiv Detail & Related papers (2020-04-21T13:40:03Z) - Disentangling and Unifying Graph Convolutions for Skeleton-Based Action
Recognition [79.33539539956186]
We propose a simple method to disentangle multi-scale graph convolutions and a unified spatial-temporal graph convolutional operator named G3D.
By coupling these proposals, we develop a powerful feature extractor named MS-G3D based on which our model outperforms previous state-of-the-art methods on three large-scale datasets.
arXiv Detail & Related papers (2020-03-31T11:28:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.