Space-Time Domain Tensor Neural Networks: An Application on Human Pose
Classification
- URL: http://arxiv.org/abs/2004.08153v2
- Date: Sun, 18 Oct 2020 17:52:46 GMT
- Title: Space-Time Domain Tensor Neural Networks: An Application on Human Pose
Classification
- Authors: Konstantinos Makantasis, Athanasios Voulodimos, Anastasios Doulamis,
Nikolaos Bakalos, Nikolaos Doulamis
- Abstract summary: We propose a spatially and temporally aware tensor-based neural network for classification of human pose.
Our model is end-to-end trainable and characterized by a small number of trainable parameters.
Experimental evaluation of the proposed model indicates that it can achieve state-of-the-art performance.
- Score: 12.965269872510587
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recent advances in sensing technologies require the design and development of
pattern recognition models capable of processing spatiotemporal data
efficiently. In this study, we propose a spatially and temporally aware
tensor-based neural network for human pose classification using
three-dimensional skeleton data. Our model employs three novel components.
First, an input layer capable of constructing highly discriminative
spatiotemporal features. Second, a tensor fusion operation that produces
compact yet rich representations of the data, and third, a tensor-based neural
network that processes data representations in their original tensor form. Our
model is end-to-end trainable and characterized by a small number of trainable
parameters making it suitable for problems where the annotated data is limited.
Experimental evaluation of the proposed model indicates that it can achieve
state-of-the-art performance.
Related papers
- Equi-GSPR: Equivariant SE(3) Graph Network Model for Sparse Point Cloud Registration [2.814748676983944]
We propose a graph neural network model embedded with a local Spherical Euclidean 3D equivariance property through SE(3) message passing based propagation.
Our model is composed mainly of a descriptor module, equivariant graph layers, match similarity, and the final regression layers.
Experiments conducted on the 3DMatch and KITTI datasets exhibit the compelling and robust performance of our model compared to state-of-the-art approaches.
arXiv Detail & Related papers (2024-10-08T06:48:01Z) - Towards Scalable and Versatile Weight Space Learning [51.78426981947659]
This paper introduces the SANE approach to weight-space learning.
Our method extends the idea of hyper-representations towards sequential processing of subsets of neural network weights.
arXiv Detail & Related papers (2024-06-14T13:12:07Z) - Equivariant Graph Neural Operator for Modeling 3D Dynamics [148.98826858078556]
We propose Equivariant Graph Neural Operator (EGNO) to directly models dynamics as trajectories instead of just next-step prediction.
EGNO explicitly learns the temporal evolution of 3D dynamics where we formulate the dynamics as a function over time and learn neural operators to approximate it.
Comprehensive experiments in multiple domains, including particle simulations, human motion capture, and molecular dynamics, demonstrate the significantly superior performance of EGNO against existing methods.
arXiv Detail & Related papers (2024-01-19T21:50:32Z) - Deception Detection from Linguistic and Physiological Data Streams Using Bimodal Convolutional Neural Networks [19.639533220155965]
This paper explores the application of convolutional neural networks for the purpose of multimodal deception detection.
We use a dataset built by interviewing 104 subjects about two topics, with one truthful and one falsified response from each subject about each topic.
arXiv Detail & Related papers (2023-11-18T02:44:33Z) - Efficient data-driven gap filling of satellite image time series using
deep neural networks with partial convolutions [0.0]
This paper shows how three-dimensional partial convolutions can be used as layers in neural networks to fill gaps in satellite image time series.
To evaluate the approach we apply a U-Net-like model on incomplete time series of quasi-global carbon monoxide observations from the Sentinel-5P satellite.
arXiv Detail & Related papers (2022-08-18T11:32:04Z) - Data-driven emergence of convolutional structure in neural networks [83.4920717252233]
We show how fully-connected neural networks solving a discrimination task can learn a convolutional structure directly from their inputs.
By carefully designing data models, we show that the emergence of this pattern is triggered by the non-Gaussian, higher-order local structure of the inputs.
arXiv Detail & Related papers (2022-02-01T17:11:13Z) - Self-Supervised Graph Representation Learning for Neuronal Morphologies [75.38832711445421]
We present GraphDINO, a data-driven approach to learn low-dimensional representations of 3D neuronal morphologies from unlabeled datasets.
We show, in two different species and across multiple brain areas, that this method yields morphological cell type clusterings on par with manual feature-based classification by experts.
Our method could potentially enable data-driven discovery of novel morphological features and cell types in large-scale datasets.
arXiv Detail & Related papers (2021-12-23T12:17:47Z) - Scene Synthesis via Uncertainty-Driven Attribute Synchronization [52.31834816911887]
This paper introduces a novel neural scene synthesis approach that can capture diverse feature patterns of 3D scenes.
Our method combines the strength of both neural network-based and conventional scene synthesis approaches.
arXiv Detail & Related papers (2021-08-30T19:45:07Z) - STAR: Sparse Transformer-based Action Recognition [61.490243467748314]
This work proposes a novel skeleton-based human action recognition model with sparse attention on the spatial dimension and segmented linear attention on the temporal dimension of data.
Experiments show that our model can achieve comparable performance while utilizing much less trainable parameters and achieve high speed in training and inference.
arXiv Detail & Related papers (2021-07-15T02:53:11Z) - Synthetic Data and Hierarchical Object Detection in Overhead Imagery [0.0]
We develop novel synthetic data generation and augmentation techniques for enhancing low/zero-sample learning in satellite imagery.
To test the effectiveness of synthetic imagery, we employ it in the training of detection models and our two stage model, and evaluate the resulting models on real satellite images.
arXiv Detail & Related papers (2021-01-29T22:52:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.