DiffPoseNet: Direct Differentiable Camera Pose Estimation
- URL: http://arxiv.org/abs/2203.11174v1
- Date: Mon, 21 Mar 2022 17:54:30 GMT
- Title: DiffPoseNet: Direct Differentiable Camera Pose Estimation
- Authors: Chethan M. Parameshwara, Gokul Hari, Cornelia Fermüller, Nitin J. Sanket, Yiannis Aloimonos
- Abstract summary: We introduce a network, NFlowNet, for normal flow estimation, which is used to enforce robust and direct constraints.
We perform extensive qualitative and quantitative evaluation of the proposed DiffPoseNet's sensitivity to noise and its generalization across datasets.
- Score: 11.941057800943653
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Current deep neural network approaches for camera pose estimation rely on
scene structure for 3D motion estimation, but this decreases the robustness and
thereby makes cross-dataset generalization difficult. In contrast, classical
approaches to structure from motion estimate 3D motion utilizing optical flow
and then compute depth. Their accuracy, however, depends strongly on the
quality of the optical flow. To avoid this issue, direct methods have been
proposed, which separate 3D motion from depth estimation but compute 3D motion
using only image gradients in the form of normal flow. In this paper, we
introduce a network, NFlowNet, for normal flow estimation, which is used to
enforce robust and direct constraints. In particular, normal flow is used to
estimate relative camera pose based on the cheirality (depth positivity)
constraint. We achieve this by formulating the optimization problem as a
differentiable cheirality layer, which allows for end-to-end learning of camera
pose. We perform extensive qualitative and quantitative evaluation of the
proposed DiffPoseNet's sensitivity to noise and its generalization across
datasets. We compare our approach to existing state-of-the-art methods on
KITTI, TartanAir, and TUM-RGBD datasets.
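The abstract refers to normal flow, the component of image motion along the local image gradient. As a rough illustration only, the sketch below computes the classical gradient-based normal flow from the brightness constancy equation `Ix*u + Iy*v + It = 0`. It is not NFlowNet, which is a learned network; the function name `normal_flow` and the `eps` threshold are assumptions made for this example.

```python
import numpy as np


def normal_flow(frame0, frame1, eps=1e-6):
    """Classical normal flow between two grayscale frames.

    Returns an (H, W, 2) array with the (x, y) components of the flow
    projected onto the image gradient direction. Pixels whose squared
    gradient magnitude is below `eps` are set to zero.
    """
    f0 = np.asarray(frame0, dtype=np.float64)
    f1 = np.asarray(frame1, dtype=np.float64)

    # Spatial derivatives of the averaged frame; np.gradient returns the
    # derivative along axis 0 (rows, y) first, then axis 1 (columns, x).
    Iy, Ix = np.gradient(0.5 * (f0 + f1))
    # Temporal derivative as a simple frame difference.
    It = f1 - f0

    mag2 = Ix ** 2 + Iy ** 2
    valid = mag2 > eps
    # Normal flow vector: -It * grad(I) / |grad(I)|^2.
    scale = np.where(valid, -It / np.where(valid, mag2, 1.0), 0.0)
    return np.stack([scale * Ix, scale * Iy], axis=-1)


# Usage: a horizontal intensity ramp shifted one pixel to the right.
ramp = np.tile(np.arange(8, dtype=float), (6, 1))
flow = normal_flow(ramp, ramp - 1.0)
```

For this ramp the gradient points along x, so the recovered normal flow is one pixel in x and zero in y. In general only the gradient-aligned component is observable from local measurements, which is why direct methods constrain 3D motion from normal flow rather than from full optical flow.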
Related papers
- 3D Equivariant Pose Regression via Direct Wigner-D Harmonics Prediction [50.07071392673984]
Existing methods learn 3D rotations parametrized in the spatial domain using angles or quaternions.
We propose a frequency-domain approach that directly predicts Wigner-D coefficients for 3D rotation regression.
Our method achieves state-of-the-art results on benchmarks such as ModelNet10-SO(3) and PASCAL3D+.
arXiv Detail & Related papers (2024-11-01T12:50:38Z)
- MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion [118.74385965694694]
We present Motion DUSt3R (MonST3R), a novel geometry-first approach that directly estimates per-timestep geometry from dynamic scenes.
By simply estimating a pointmap for each timestep, we can effectively adapt DUSt3R's representation, previously only used for static scenes, to dynamic scenes.
We show that by posing the problem as a fine-tuning task, identifying several suitable datasets, and strategically training the model on this limited data, we can surprisingly enable the model to handle dynamics.
arXiv Detail & Related papers (2024-10-04T18:00:07Z)
- FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent [19.977807508281835]
FlowMap is an end-to-end differentiable method that solves for precise camera poses, camera intrinsics, and per-frame dense depth of a video sequence.
Our method performs per-video gradient-descent minimization of a simple least-squares objective.
We empirically show that camera parameters and dense depth recovered by our method enable photo-realistic novel view synthesis on 360-degree trajectories.
arXiv Detail & Related papers (2024-04-23T17:46:50Z)
- iComMa: Inverting 3D Gaussian Splatting for Camera Pose Estimation via Comparing and Matching [14.737266480464156]
We present a method named iComMa to address the 6D camera pose estimation problem in computer vision.
We propose an efficient method for accurate camera pose estimation by inverting 3D Gaussian Splatting (3DGS).
arXiv Detail & Related papers (2023-12-14T15:31:33Z)
- Shape-Constraint Recurrent Flow for 6D Object Pose Estimation [15.238626453460666]
We propose a shape-constraint recurrent matching framework for 6D object pose estimation.
We first compute a pose-induced flow based on the displacement of 2D reprojection between the initial pose and the currently estimated pose.
We then use this pose-induced flow to construct the correlation map for the following matching iterations.
arXiv Detail & Related papers (2023-06-23T02:36:34Z)
- NIKI: Neural Inverse Kinematics with Invertible Neural Networks for 3D Human Pose and Shape Estimation [53.25973084799954]
We present NIKI (Neural Inverse Kinematics with Invertible Neural Network), which models bi-directional errors.
NIKI can learn from both the forward and inverse processes with invertible networks.
arXiv Detail & Related papers (2023-05-15T12:13:24Z)
- ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the Wild [57.37891682117178]
We present a robust dense indirect structure-from-motion method for videos that is based on dense correspondence from pairwise optical flow.
A novel neural network architecture is proposed for processing irregular point trajectory data.
Experiments on the MPI Sintel dataset show that our system produces significantly more accurate camera trajectories.
arXiv Detail & Related papers (2022-07-19T09:19:45Z)
- RNNPose: Recurrent 6-DoF Object Pose Refinement with Robust Correspondence Field Estimation and Pose Optimization [46.144194562841435]
We propose a framework based on a recurrent neural network (RNN) for object pose refinement.
The problem is formulated as a non-linear least squares problem based on the estimated correspondence field.
The correspondence field estimation and pose refinement are conducted alternately in each iteration to recover accurate object poses.
arXiv Detail & Related papers (2022-03-24T06:24:55Z)
- Optical Flow Estimation from a Single Motion-blurred Image [66.2061278123057]
Motion blur in an image may be of practical interest for fundamental computer vision problems.
We propose a novel framework to estimate optical flow from a single motion-blurred image in an end-to-end manner.
arXiv Detail & Related papers (2021-03-04T12:45:18Z)
- FlowStep3D: Model Unrolling for Self-Supervised Scene Flow Estimation [87.74617110803189]
Estimating the 3D motion of points in a scene, known as scene flow, is a core problem in computer vision.
We present a recurrent architecture that learns a single step of an unrolled iterative alignment procedure for refining scene flow predictions.
arXiv Detail & Related papers (2020-11-19T23:23:48Z)
- Active Depth Estimation: Stability Analysis and its Applications [18.582561853987034]
This paper focuses on the theoretical properties of the Structure-from-Motion (SfM) scheme.
The term incremental stands for estimating the 3D structure of the scene over a chronological sequence of image frames.
By analyzing the convergence of the estimator using the Lyapunov theory, we relax the constraints on the projection of the 3D point in the image plane.
arXiv Detail & Related papers (2020-03-16T12:12:24Z)
This list is automatically generated from the titles and abstracts of the papers in this site.