Direct-PoseNet: Absolute Pose Regression with Photometric Consistency
- URL: http://arxiv.org/abs/2104.04073v1
- Date: Thu, 8 Apr 2021 21:10:18 GMT
- Title: Direct-PoseNet: Absolute Pose Regression with Photometric Consistency
- Authors: Shuai Chen, Zirui Wang, Victor Prisacariu
- Abstract summary: We present a relocalization pipeline, which combines an absolute pose regression network with a novel view synthesis based direct matching module.
Our network Direct-PoseNet achieves state-of-the-art performance among all other single-image APR methods on the 7-Scenes benchmark and the LLFF dataset.
- Score: 16.60612964943876
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We present a relocalization pipeline, which combines an absolute pose
regression (APR) network with a novel view synthesis based direct matching
module, offering superior accuracy while maintaining low inference time. Our
contribution is twofold: i) we design a direct matching module that supplies a
photometric supervision signal to refine the pose regression network via
differentiable rendering; ii) we modify the rotation representation from the
classical quaternion to SO(3) in pose regression, removing the need for
balancing rotation and translation loss terms. As a result, our network
Direct-PoseNet achieves state-of-the-art performance among all other
single-image APR methods on the 7-Scenes benchmark and the LLFF dataset.
Related papers
- (Fusionformer):Exploiting the Joint Motion Synergy with Fusion Network
Based On Transformer for 3D Human Pose Estimation [1.52292571922932]
Many previous methods lack the understanding of local joint information.cite8888987considers the temporal relationship of a single joint in this work.
Our proposed textbfFusionformer method introduces a global-temporal self-trajectory module and a cross-temporal self-trajectory module.
The results show an improvement of 2.4% MPJPE and 4.3% P-MPJPE on the Human3.6M dataset.
arXiv Detail & Related papers (2022-10-08T12:22:10Z) - Effective Invertible Arbitrary Image Rescaling [77.46732646918936]
Invertible Neural Networks (INN) are able to increase upscaling accuracy significantly by optimizing the downscaling and upscaling cycle jointly.
A simple and effective invertible arbitrary rescaling network (IARN) is proposed to achieve arbitrary image rescaling by training only one model in this work.
It is shown to achieve a state-of-the-art (SOTA) performance in bidirectional arbitrary rescaling without compromising perceptual quality in LR outputs.
arXiv Detail & Related papers (2022-09-26T22:22:30Z) - DFNet: Enhance Absolute Pose Regression with Direct Feature Matching [16.96571417692014]
We introduce a camera relocalization pipeline that combines absolute pose regression (APR) and direct feature matching.
We show that our method achieves a state-of-the-art accuracy by outperforming existing single-image APR methods by as much as 56%, comparable to 3D structure-based methods.
arXiv Detail & Related papers (2022-04-01T16:39:16Z) - Poseur: Direct Human Pose Regression with Transformers [119.79232258661995]
We propose a direct, regression-based approach to 2D human pose estimation from single images.
Our framework is end-to-end differentiable, and naturally learns to exploit the dependencies between keypoints.
Ours is the first regression-based approach to perform favorably compared to the best heatmap-based pose estimation methods.
arXiv Detail & Related papers (2022-01-19T04:31:57Z) - PR-RRN: Pairwise-Regularized Residual-Recursive Networks for Non-rigid
Structure-from-Motion [58.75694870260649]
PR-RRN is a novel neural-network based method for Non-rigid Structure-from-Motion.
We propose two new pairwise regularizations to further regularize the reconstruction.
Our approach achieves state-of-the-art performance on CMU MOCAP and PASCAL3D+ dataset.
arXiv Detail & Related papers (2021-08-17T08:39:02Z) - 3D Human Pose and Shape Regression with Pyramidal Mesh Alignment
Feedback Loop [128.07841893637337]
Regression-based methods have recently shown promising results in reconstructing human meshes from monocular images.
Minor deviation in parameters may lead to noticeable misalignment between the estimated meshes and image evidences.
We propose a Pyramidal Mesh Alignment Feedback (PyMAF) loop to leverage a feature pyramid and rectify the predicted parameters.
arXiv Detail & Related papers (2021-03-30T17:07:49Z) - Tight-Integration of Feature-Based Relocalization in Monocular Direct
Visual Odometry [49.89611704653707]
We propose a framework for integrating map-based relocalization into online visual odometry.
We integrate image features into Direct Sparse Odometry (DSO) and rely on feature matching to associate online visual odometry with a previously built map.
arXiv Detail & Related papers (2021-02-01T21:41:05Z) - LM-Reloc: Levenberg-Marquardt Based Direct Visual Relocalization [54.77498358487812]
LM-Reloc is a novel approach for visual relocalization based on direct image alignment.
We propose a loss formulation inspired by the classical Levenberg-Marquardt algorithm to train LM-Net.
arXiv Detail & Related papers (2020-10-13T12:15:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.