HP2IFS: Head Pose estimation exploiting Partitioned Iterated Function
Systems
- URL: http://arxiv.org/abs/2003.11536v1
- Date: Wed, 25 Mar 2020 17:56:45 GMT
- Title: HP2IFS: Head Pose estimation exploiting Partitioned Iterated Function
Systems
- Authors: Carmen Bisogni, Michele Nappi, Chiara Pero and Stefano Ricciardi
- Abstract summary: Estimating the actual head orientation from 2D images is a well known problem.
We use fractal coding theory and Partitioned Iterated Systems to extract the fractal code from the input head image.
The proposed PIFS based head pose estimation method provides accurate yaw/pitch/roll angular values.
- Score: 18.402636415604373
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Estimating the actual head orientation from 2D images, with regard to its
three degrees of freedom, is a well known problem that is highly significant
for a large number of applications involving head pose knowledge. Consequently,
this topic has been tackled by a plethora of methods and algorithms the most
part of which exploits neural networks. Machine learning methods, indeed,
achieve accurate head rotation values yet require an adequate training stage
and, to that aim, a relevant number of positive and negative examples. In this
paper we take a different approach to this topic by using fractal coding theory
and particularly Partitioned Iterated Function Systems to extract the fractal
code from the input head image and to compare this representation to the
fractal code of a reference model through Hamming distance. According to
experiments conducted on both the BIWI and the AFLW2000 databases, the proposed
PIFS based head pose estimation method provides accurate yaw/pitch/roll angular
values, with a performance approaching that of state of the art of
machine-learning based algorithms and exceeding most of non-training based
approaches.
Related papers
- Full-range Head Pose Geometric Data Augmentations [2.8358100463599722]
Many head pose estimation (HPE) methods promise the ability to create full-range datasets.
These methods are only accurate within a range of head angles; exceeding this specific range led to significant inaccuracies.
Here, we present methods that accurately infer the correct coordinate system and Euler angles in the correct axis-sequence.
arXiv Detail & Related papers (2024-08-02T20:41:18Z) - A Weighted K-Center Algorithm for Data Subset Selection [70.49696246526199]
Subset selection is a fundamental problem that can play a key role in identifying smaller portions of the training data.
We develop a novel factor 3-approximation algorithm to compute subsets based on the weighted sum of both k-center and uncertainty sampling objective functions.
arXiv Detail & Related papers (2023-12-17T04:41:07Z) - Two Approaches to Supervised Image Segmentation [55.616364225463066]
The present work develops comparison experiments between deep learning and multiset neurons approaches.
The deep learning approach confirmed its potential for performing image segmentation.
The alternative multiset methodology allowed for enhanced accuracy while requiring little computational resources.
arXiv Detail & Related papers (2023-07-19T16:42:52Z) - The Cascaded Forward Algorithm for Neural Network Training [61.06444586991505]
We propose a new learning framework for neural networks, namely Cascaded Forward (CaFo) algorithm, which does not rely on BP optimization as that in FF.
Unlike FF, our framework directly outputs label distributions at each cascaded block, which does not require generation of additional negative samples.
In our framework each block can be trained independently, so it can be easily deployed into parallel acceleration systems.
arXiv Detail & Related papers (2023-03-17T02:01:11Z) - Interactive Segmentation as Gaussian Process Classification [58.44673380545409]
Click-based interactive segmentation (IS) aims to extract the target objects under user interaction.
Most of the current deep learning (DL)-based methods mainly follow the general pipelines of semantic segmentation.
We propose to formulate the IS task as a Gaussian process (GP)-based pixel-wise binary classification model on each image.
arXiv Detail & Related papers (2023-02-28T14:01:01Z) - Improving Point Cloud Based Place Recognition with Ranking-based Loss
and Large Batch Training [1.116812194101501]
The paper presents a simple and effective learning-based method for computing a discriminative 3D point cloud descriptor.
We employ recent advances in image retrieval and propose a modified version of a loss function based on a differentiable average precision approximation.
arXiv Detail & Related papers (2022-03-02T09:29:28Z) - HHP-Net: A light Heteroscedastic neural network for Head Pose estimation
with uncertainty [2.064612766965483]
We introduce a novel method to estimate the head pose of people in single images starting from a small set of head keypoints.
Our model is simple to implement and more efficient with respect to the state of the art.
arXiv Detail & Related papers (2021-11-02T08:55:45Z) - Brain Inspired Object Recognition System [0.0]
Histogram of Oriented Gradients, Local Binary Patterns, and Principal components extracted from target images are used.
A computational theory is first developed by using concepts from the information processing mechanism of the brain.
Experiments are carried out using fifteen publicly available datasets to validate the performance of our proposed model.
arXiv Detail & Related papers (2021-05-15T14:42:17Z) - Self-supervised Geometric Perception [96.89966337518854]
Self-supervised geometric perception is a framework to learn a feature descriptor for correspondence matching without any ground-truth geometric model labels.
We show that SGP achieves state-of-the-art performance that is on-par or superior to the supervised oracles trained using ground-truth labels.
arXiv Detail & Related papers (2021-03-04T15:34:43Z) - MetaSDF: Meta-learning Signed Distance Functions [85.81290552559817]
Generalizing across shapes with neural implicit representations amounts to learning priors over the respective function space.
We formalize learning of a shape space as a meta-learning problem and leverage gradient-based meta-learning algorithms to solve this task.
arXiv Detail & Related papers (2020-06-17T05:14:53Z) - Deep Convolutional Neural Network-based Bernoulli Heatmap for Head Pose
Estimation [11.676928225717337]
This paper proposes a novel Bernoulli heatmap for head pose estimation from a single RGB image.
Our method can achieve the positioning of the head area while estimating the angles of the head.
A deep convolutional neural network (CNN) structure with multiscale representations is adopted to maintain high-resolution information.
arXiv Detail & Related papers (2020-05-24T15:36:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.