On the representation and methodology for wide and short range head pose
estimation
- URL: http://arxiv.org/abs/2401.05807v1
- Date: Thu, 11 Jan 2024 10:20:37 GMT
- Title: On the representation and methodology for wide and short range head pose
estimation
- Authors: Alejandro Cobo and Roberto Valle and Jos\'e M. Buenaposada and Luis
Baumela
- Abstract summary: Head pose estimation (HPE) is a problem of interest in computer vision.
Recent applications require the analysis of faces in the full 360deg rotation range.
Traditional approaches to solve the semi-frontal and profile cases are not directly amenable for the full rotation case.
- Score: 48.87319013701136
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Head pose estimation (HPE) is a problem of interest in computer vision to
improve the performance of face processing tasks in semi-frontal or profile
settings. Recent applications require the analysis of faces in the full
360{\deg} rotation range. Traditional approaches to solve the semi-frontal and
profile cases are not directly amenable for the full rotation case. In this
paper we analyze the methodology for short- and wide-range HPE and discuss
which representations and metrics are adequate for each case. We show that the
popular Euler angles representation is a good choice for short-range HPE, but
not at extreme rotations. However, the Euler angles' gimbal lock problem
prevents them from being used as a valid metric in any setting. We also revisit
the current cross-data set evaluation methodology and note that the lack of
alignment between the reference systems of the training and test data sets
negatively biases the results of all articles in the literature. We introduce a
procedure to quantify this misalignment and a new methodology for cross-data
set HPE that establishes new, more accurate, SOTA for the 300W-LP|Biwi
benchmark. We also propose a generalization of the geodesic angular distance
metric that enables the construction of a loss that controls the contribution
of each training sample to the optimization of the model. Finally, we introduce
a wide range HPE benchmark based on the CMU Panoptic data set.
Related papers
- For A More Comprehensive Evaluation of 6DoF Object Pose Tracking [22.696375341994035]
We contribute a unified benchmark to address the above problems.
For more accurate annotation of YCBV, we propose a multi-view multi-object global pose refinement method.
In experiments, we validate the precision and reliability of the proposed global pose refinement method with a realistic semi-synthesized dataset.
arXiv Detail & Related papers (2023-09-14T15:35:08Z) - Vanishing Point Estimation in Uncalibrated Images with Prior Gravity
Direction [82.72686460985297]
We tackle the problem of estimating a Manhattan frame.
We derive two new 2-line solvers, one of which does not suffer from singularities affecting existing solvers.
We also design a new non-minimal method, running on an arbitrary number of lines, to boost the performance in local optimization.
arXiv Detail & Related papers (2023-08-21T13:03:25Z) - EGformer: Equirectangular Geometry-biased Transformer for 360 Depth
Estimation [20.42460078279734]
Estimating the depths of equirectangular (i.e., 360) images (EIs) is challenging given the distorted 180 x 360 field-of-view.
We propose an equirectangular geometry-biased transformer termed EGformer.
arXiv Detail & Related papers (2023-04-16T15:14:17Z) - CPPF++: Uncertainty-Aware Sim2Real Object Pose Estimation by Vote Aggregation [67.12857074801731]
We introduce a novel method, CPPF++, designed for sim-to-real pose estimation.
To address the challenge posed by vote collision, we propose a novel approach that involves modeling the voting uncertainty.
We incorporate several innovative modules, including noisy pair filtering, online alignment optimization, and a feature ensemble.
arXiv Detail & Related papers (2022-11-24T03:27:00Z) - Zero-Shot Category-Level Object Pose Estimation [24.822189326540105]
We tackle the problem of estimating the pose of novel object categories in a zero-shot manner.
This extends much of the existing literature by removing the need for pose-labelled datasets or category-specific CAD models.
Our method provides a six-fold improvement in average rotation accuracy at 30 degrees.
arXiv Detail & Related papers (2022-04-07T17:58:39Z) - Visual SLAM with Graph-Cut Optimized Multi-Plane Reconstruction [11.215334675788952]
This paper presents a semantic planar SLAM system that improves pose estimation and mapping using cues from an instance planar segmentation network.
While the mainstream approaches are using RGB-D sensors, employing a monocular camera with such a system still faces challenges such as robust data association and precise geometric model fitting.
arXiv Detail & Related papers (2021-08-09T18:16:08Z) - Making Affine Correspondences Work in Camera Geometry Computation [62.7633180470428]
Local features provide region-to-region rather than point-to-point correspondences.
We propose guidelines for effective use of region-to-region matches in the course of a full model estimation pipeline.
Experiments show that affine solvers can achieve accuracy comparable to point-based solvers at faster run-times.
arXiv Detail & Related papers (2020-07-20T12:07:48Z) - PropagationNet: Propagate Points to Curve to Learn Structure Information [79.65125870257009]
We present a novel structure-infused face alignment algorithm based on heatmap regression.
We also propose a Focal Wing Loss for mining and emphasizing the difficult samples under in-the-wild condition.
Our method achieves 4.05% mean error on WFLW, 2.93% mean error on 300W full-set, and 3.71% mean error on COFW.
arXiv Detail & Related papers (2020-06-25T11:08:59Z) - On the Arbitrary-Oriented Object Detection: Classification based
Approaches Revisited [94.5455251250471]
We first show that the boundary problem suffered in existing dominant regression-based rotation detectors, is caused by angular periodicity or corner ordering.
We transform the angular prediction task from a regression problem to a classification one.
For the resulting circularly distributed angle classification problem, we first devise a Circular Smooth Label technique to handle the periodicity of angle and increase the error tolerance to adjacent angles.
arXiv Detail & Related papers (2020-03-12T03:23:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.