Generalized Pose-and-Scale Estimation using 4-Point Congruence
Constraints
- URL: http://arxiv.org/abs/2011.13817v1
- Date: Fri, 27 Nov 2020 16:30:19 GMT
- Title: Generalized Pose-and-Scale Estimation using 4-Point Congruence
Constraints
- Authors: Victor Fragoso, Sudipta Sinha
- Abstract summary: gP4Pc is a new method for computing the absolute pose of a generalized camera with unknown internal scale from four corresponding 3D point-and-ray pairs.
Our experiments on real and synthetic datasets, demonstrate that gP4Pc is among the fastest methods in terms of total running time when used within a SACRAN framework.
- Score: 5.063728016437489
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We present gP4Pc, a new method for computing the absolute pose of a
generalized camera with unknown internal scale from four corresponding 3D
point-and-ray pairs. Unlike most pose-and-scale methods, gP4Pc is based on
constraints arising from the congruence of shapes defined by two sets of four
points related by an unknown similarity transformation. By choosing a novel
parametrization for the problem, we derive a system of four quadratic equations
in four scalar variables. The variables represent the distances of 3D points
along the rays from the camera centers. After solving this system via Groebner
basis-based automatic polynomial solvers, we compute the similarity
transformation using an efficient 3D point-point alignment method. We also
propose a specialized variant of our solver for the case of coplanar points,
which is computationally very efficient and about 3x faster than the fastest
existing solver. Our experiments on real and synthetic datasets, demonstrate
that gP4Pc is among the fastest methods in terms of total running time when
used within a RANSAC framework, while achieving competitive numerical
stability, accuracy, and robustness to noise.
Related papers
- Relative pose of three calibrated and partially calibrated cameras from four points using virtual correspondences [56.44647186049448]
We study challenging problems of estimating the relative pose of three cameras.
Our solutions are based on the simple idea of generating one or two additional virtual point correspondences in two views.
arXiv Detail & Related papers (2023-03-28T15:50:48Z) - IKOL: Inverse kinematics optimization layer for 3D human pose and shape
estimation via Gauss-Newton differentiation [44.00115413716392]
This paper presents an inverse kinematic layer (IKOL) for 3D human pose shape estimation.
IKOL has a much over over than most existing regression-based methods.
It provides a more accurate range of 3D human pose estimation.
arXiv Detail & Related papers (2023-02-02T12:43:29Z) - Cross-Sim-NGF: FFT-Based Global Rigid Multimodal Alignment of Image
Volumes using Normalized Gradient Fields [3.584984184069584]
Multimodal image alignment involves finding spatial correspondences between volumes varying in appearance and structure.
We propose a global optimization method for rigid multimodal 3D image alignment, based on a novel efficient algorithm for computing similarity of normalized fields (NGF) in the frequency domain.
arXiv Detail & Related papers (2021-10-19T15:28:37Z) - Calibrated and Partially Calibrated Semi-Generalized Homographies [65.29477277713205]
We propose the first minimal solutions for estimating the semi-generalized homography given a perspective and a generalized camera.
The proposed solvers are stable and efficient as demonstrated by a number of synthetic and real-world experiments.
arXiv Detail & Related papers (2021-03-11T08:56:24Z) - Displacement-Invariant Cost Computation for Efficient Stereo Matching [122.94051630000934]
Deep learning methods have dominated stereo matching leaderboards by yielding unprecedented disparity accuracy.
But their inference time is typically slow, on the order of seconds for a pair of 540p images.
We propose a emphdisplacement-invariant cost module to compute the matching costs without needing a 4D feature volume.
arXiv Detail & Related papers (2020-12-01T23:58:16Z) - Geometric Correspondence Fields: Learned Differentiable Rendering for 3D
Pose Refinement in the Wild [96.09941542587865]
We present a novel 3D pose refinement approach based on differentiable rendering for objects of arbitrary categories in the wild.
In this way, we precisely align 3D models to objects in RGB images which results in significantly improved 3D pose estimates.
We evaluate our approach on the challenging Pix3D dataset and achieve up to 55% relative improvement compared to state-of-the-art refinement methods in multiple metrics.
arXiv Detail & Related papers (2020-07-17T12:34:38Z) - Lightweight Multi-View 3D Pose Estimation through Camera-Disentangled
Representation [57.11299763566534]
We present a solution to recover 3D pose from multi-view images captured with spatially calibrated cameras.
We exploit 3D geometry to fuse input images into a unified latent representation of pose, which is disentangled from camera view-points.
Our architecture then conditions the learned representation on camera projection operators to produce accurate per-view 2d detections.
arXiv Detail & Related papers (2020-04-05T12:52:29Z) - Quaternion Equivariant Capsule Networks for 3D Point Clouds [58.566467950463306]
We present a 3D capsule module for processing point clouds that is equivariant to 3D rotations and translations.
We connect dynamic routing between capsules to the well-known Weiszfeld algorithm.
Based on our operator, we build a capsule network that disentangles geometry from pose.
arXiv Detail & Related papers (2019-12-27T13:51:17Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.