Perspective-1-Ellipsoid: Formulation, Analysis and Solutions of the
Camera Pose Estimation Problem from One Ellipse-Ellipsoid Correspondence
- URL: http://arxiv.org/abs/2208.12513v3
- Date: Wed, 14 Jun 2023 12:09:07 GMT
- Title: Perspective-1-Ellipsoid: Formulation, Analysis and Solutions of the
Camera Pose Estimation Problem from One Ellipse-Ellipsoid Correspondence
- Authors: Vincent Gaudilli\`ere, Gilles Simon, Marie-Odile Berger
- Abstract summary: We introduce an ellipsoid-specific theoretical framework and demonstrate its beneficial properties in the context of pose estimation.
We show that the proposed formalism enables to reduce the pose estimation problem to a position or orientation-only estimation problem.
- Score: 1.7188280334580193
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: In computer vision, camera pose estimation from correspondences between 3D
geometric entities and their projections into the image has been a widely
investigated problem. Although most state-of-the-art methods exploit low-level
primitives such as points or lines, the emergence of very effective CNN-based
object detectors in the recent years has paved the way to the use of
higher-level features carrying semantically meaningful information. Pioneering
works in that direction have shown that modelling 3D objects by ellipsoids and
2D detections by ellipses offers a convenient manner to link 2D and 3D data.
However, the mathematical formalism most often used in the related litterature
does not enable to easily distinguish ellipsoids and ellipses from other
quadrics and conics, leading to a loss of specificity potentially detrimental
in some developments. Moreover, the linearization process of the projection
equation creates an over-representation of the camera parameters, also possibly
causing an efficiency loss. In this paper, we therefore introduce an
ellipsoid-specific theoretical framework and demonstrate its beneficial
properties in the context of pose estimation. More precisely, we first show
that the proposed formalism enables to reduce the pose estimation problem to a
position or orientation-only estimation problem in which the remaining unknowns
can be derived in closed-form. Then, we demonstrate that it can be further
reduced to a 1 Degree-of-Freedom (1DoF) problem and provide the analytical
derivations of the pose as a function of that unique scalar unknown. We
illustrate our theoretical considerations by visual examples and include a
discussion on the practical aspects. Finally, we release this paper along with
the corresponding source code in order to contribute towards more efficient
resolutions of ellipsoid-related pose estimation problems.
Related papers
- DVMNet: Computing Relative Pose for Unseen Objects Beyond Hypotheses [59.51874686414509]
Current approaches approximate the continuous pose representation with a large number of discrete pose hypotheses.
We present a Deep Voxel Matching Network (DVMNet) that eliminates the need for pose hypotheses and computes the relative object pose in a single pass.
Our method delivers more accurate relative pose estimates for novel objects at a lower computational cost compared to state-of-the-art methods.
arXiv Detail & Related papers (2024-03-20T15:41:32Z) - GS-Pose: Category-Level Object Pose Estimation via Geometric and
Semantic Correspondence [5.500735640045456]
Category-level pose estimation is a challenging task with many potential applications in computer vision and robotics.
We propose to utilize both geometric and semantic features obtained from a pre-trained foundation model.
This requires significantly less data to train than prior methods since the semantic features are robust to object texture and appearance.
arXiv Detail & Related papers (2023-11-23T02:35:38Z) - GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D
Object Detection [95.8940731298518]
We propose a novel Geometry Uncertainty Propagation Network (GUPNet++)
It models the uncertainty propagation relationship of the geometry projection during training, improving the stability and efficiency of the end-to-end model learning.
Experiments show that the proposed approach not only obtains (state-of-the-art) SOTA performance in image-based monocular 3D detection but also demonstrates superiority in efficacy with a simplified framework.
arXiv Detail & Related papers (2023-10-24T08:45:15Z) - Level Set-Based Camera Pose Estimation From Multiple 2D/3D
Ellipse-Ellipsoid Correspondences [2.016317500787292]
We show that the definition of a cost function characterizing the projection of a 3D object onto a 2D object detection is not straightforward.
We develop an ellipse-ellipse cost based on level sets sampling, demonstrate its nice properties for handling partially visible objects and compare its performance with other common metrics.
arXiv Detail & Related papers (2022-07-16T14:09:54Z) - 3D Magic Mirror: Clothing Reconstruction from a Single Image via a
Causal Perspective [96.65476492200648]
This research aims to study a self-supervised 3D clothing reconstruction method.
It recovers the geometry shape, and texture of human clothing from a single 2D image.
arXiv Detail & Related papers (2022-04-27T17:46:55Z) - RNNPose: Recurrent 6-DoF Object Pose Refinement with Robust
Correspondence Field Estimation and Pose Optimization [46.144194562841435]
We propose a framework based on a recurrent neural network (RNN) for object pose refinement.
The problem is formulated as a non-linear least squares problem based on the estimated correspondence field.
The correspondence field estimation and pose refinement are conducted alternatively in each iteration to recover accurate object poses.
arXiv Detail & Related papers (2022-03-24T06:24:55Z) - Learning Stereopsis from Geometric Synthesis for 6D Object Pose
Estimation [11.999630902627864]
Current monocular-based 6D object pose estimation methods generally achieve less competitive results than RGBD-based methods.
This paper proposes a 3D geometric volume based pose estimation method with a short baseline two-view setting.
Experiments show that our method outperforms state-of-the-art monocular-based methods, and is robust in different objects and scenes.
arXiv Detail & Related papers (2021-09-25T02:55:05Z) - Topology-Preserving 3D Image Segmentation Based On Hyperelastic
Regularization [1.52292571922932]
We propose a novel 3D topology-preserving registration-based segmentation model with the hyperelastic regularization.
Numerical experiments have been carried out on the synthetic and real images, which demonstrate the effectiveness of our proposed model.
arXiv Detail & Related papers (2021-03-31T02:20:46Z) - Calibrated and Partially Calibrated Semi-Generalized Homographies [65.29477277713205]
We propose the first minimal solutions for estimating the semi-generalized homography given a perspective and a generalized camera.
The proposed solvers are stable and efficient as demonstrated by a number of synthetic and real-world experiments.
arXiv Detail & Related papers (2021-03-11T08:56:24Z) - Deep Bingham Networks: Dealing with Uncertainty and Ambiguity in Pose
Estimation [74.76155168705975]
Deep Bingham Networks (DBN) can handle pose-related uncertainties and ambiguities arising in almost all real life applications concerning 3D data.
DBN extends the state of the art direct pose regression networks by (i) a multi-hypotheses prediction head which can yield different distribution modes.
We propose new training strategies so as to avoid mode or posterior collapse during training and to improve numerical stability.
arXiv Detail & Related papers (2020-12-20T19:20:26Z) - Wide-angle Image Rectification: A Survey [86.36118799330802]
wide-angle images contain distortions that violate the assumptions underlying pinhole camera models.
Image rectification, which aims to correct these distortions, can solve these problems.
We present a detailed description and discussion of the camera models used in different approaches.
Next, we review both traditional geometry-based image rectification methods and deep learning-based methods.
arXiv Detail & Related papers (2020-10-30T17:28:40Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.