PnP-Net: A hybrid Perspective-n-Point Network
- URL: http://arxiv.org/abs/2003.04626v1
- Date: Tue, 10 Mar 2020 10:43:14 GMT
- Title: PnP-Net: A hybrid Perspective-n-Point Network
- Authors: Roy Sheffer, Ami Wiesel
- Abstract summary: We consider the robust Perspective-n-Point problem using a hybrid approach that combines deep learning with model based algorithms.
We demonstrate both synthetic parameters and real world data with low computational requirements.
- Score: 2.66512000865131
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We consider the robust Perspective-n-Point (PnP) problem using a hybrid
approach that combines deep learning with model based algorithms. PnP is the
problem of estimating the pose of a calibrated camera given a set of 3D points
in the world and their corresponding 2D projections in the image. In its more
challenging robust version, some of the correspondences may be mismatched and
must be efficiently discarded. Classical solutions address PnP via iterative
robust non-linear least squares method that exploit the problem's geometry but
are either inaccurate or computationally intensive. In contrast, we propose to
combine a deep learning initial phase followed by a model-based fine tuning
phase. This hybrid approach, denoted by PnP-Net, succeeds in estimating the
unknown pose parameters under correspondence errors and noise, with low and
fixed computational complexity requirements. We demonstrate its advantages on
both synthetic data and real world data.
Related papers
- ParaPoint: Learning Global Free-Boundary Surface Parameterization of 3D Point Clouds [52.03819676074455]
ParaPoint is an unsupervised neural learning pipeline for achieving global free-boundary surface parameterization.
This work makes the first attempt to investigate neural point cloud parameterization that pursues both global mappings and free boundaries.
arXiv Detail & Related papers (2024-03-15T14:35:05Z) - Learnable human mesh triangulation for 3D human pose and shape
estimation [6.699132260402631]
The accuracy of joint rotation and shape estimation has received relatively little attention in the skinned multi-person linear model (SMPL)-based human mesh reconstruction from multi-view images.
We propose a two-stage method to resolve the ambiguity of joint rotation and shape reconstruction and the difficulty of network learning.
The proposed method significantly outperforms the previous works in terms of joint rotation and shape estimation, and achieves competitive performance in terms of joint location estimation.
arXiv Detail & Related papers (2022-08-24T01:11:57Z) - Visual SLAM with Graph-Cut Optimized Multi-Plane Reconstruction [11.215334675788952]
This paper presents a semantic planar SLAM system that improves pose estimation and mapping using cues from an instance planar segmentation network.
While the mainstream approaches are using RGB-D sensors, employing a monocular camera with such a system still faces challenges such as robust data association and precise geometric model fitting.
arXiv Detail & Related papers (2021-08-09T18:16:08Z) - Uncertainty-Aware Camera Pose Estimation from Points and Lines [101.03675842534415]
Perspective-n-Point-and-Line (Pn$PL) aims at fast, accurate and robust camera localizations with respect to a 3D model from 2D-3D feature coordinates.
arXiv Detail & Related papers (2021-07-08T15:19:36Z) - Deep Magnification-Flexible Upsampling over 3D Point Clouds [103.09504572409449]
We propose a novel end-to-end learning-based framework to generate dense point clouds.
We first formulate the problem explicitly, which boils down to determining the weights and high-order approximation errors.
Then, we design a lightweight neural network to adaptively learn unified and sorted weights as well as the high-order refinements.
arXiv Detail & Related papers (2020-11-25T14:00:18Z) - Human Body Model Fitting by Learned Gradient Descent [48.79414884222403]
We propose a novel algorithm for the fitting of 3D human shape to images.
We show that this algorithm is fast (avg. 120ms convergence), robust to dataset, and achieves state-of-the-art results on public evaluation datasets.
arXiv Detail & Related papers (2020-08-19T14:26:47Z) - Solving the Blind Perspective-n-Point Problem End-To-End With Robust
Differentiable Geometric Optimization [44.85008070868851]
Blind Perspective-n-Point is the problem estimating the position of a camera relative to a scene.
We propose the first fully end-to-end trainable network for solving the blind geometric problem efficiently globally.
arXiv Detail & Related papers (2020-07-29T06:35:45Z) - Learning 2D-3D Correspondences To Solve The Blind Perspective-n-Point
Problem [98.92148855291363]
This paper proposes a deep CNN model which simultaneously solves for both 6-DoF absolute camera pose 2D--3D correspondences.
Tests on both real and simulated data have shown that our method substantially outperforms existing approaches.
arXiv Detail & Related papers (2020-03-15T04:17:30Z) - Belief Propagation Reloaded: Learning BP-Layers for Labeling Problems [83.98774574197613]
We take one of the simplest inference methods, a truncated max-product Belief propagation, and add what is necessary to make it a proper component of a deep learning model.
This BP-Layer can be used as the final or an intermediate block in convolutional neural networks (CNNs)
The model is applicable to a range of dense prediction problems, is well-trainable and provides parameter-efficient and robust solutions in stereo, optical flow and semantic segmentation.
arXiv Detail & Related papers (2020-03-13T13:11:35Z) - W-PoseNet: Dense Correspondence Regularized Pixel Pair Pose Regression [34.8793946023412]
This paper introduces a novel pose estimation algorithm W-PoseNet.
It densely regresses from input data to 6D pose and also 3D coordinates in model space.
Experiment results on the popular YCB-Video and LineMOD benchmarks show that the proposed W-PoseNet consistently achieves superior performance.
arXiv Detail & Related papers (2019-12-26T15:51:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.