A Probabilistic Framework for Visual Localization in Ambiguous Scenes
- URL: http://arxiv.org/abs/2301.02086v1
- Date: Thu, 5 Jan 2023 14:46:54 GMT
- Title: A Probabilistic Framework for Visual Localization in Ambiguous Scenes
- Authors: Fereidoon Zangeneh, Leonard Bruns, Amit Dekel, Alessandro Pieropan and
Patric Jensfelt
- Abstract summary: We propose a probabilistic framework that for a given image predicts the arbitrarily shaped posterior distribution of its camera pose.
We do this via a novel formulation of camera pose regression using variational inference, which allows sampling from the predicted distribution.
Our method outperforms existing methods on localization in ambiguous scenes.
- Score: 64.13544430239267
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Visual localization allows autonomous robots to relocalize when losing track
of their pose by matching their current observation with past ones. However,
ambiguous scenes pose a challenge for such systems, as repetitive structures
can be viewed from many distinct, equally likely camera poses, which means it
is not sufficient to produce a single best pose hypothesis. In this work, we
propose a probabilistic framework that for a given image predicts the
arbitrarily shaped posterior distribution of its camera pose. We do this via a
novel formulation of camera pose regression using variational inference, which
allows sampling from the predicted distribution. Our method outperforms
existing methods on localization in ambiguous scenes. Code and data will be
released at https://github.com/efreidun/vapor.
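The abstract's claim that a single best pose hypothesis is insufficient in ambiguous scenes can be illustrated with a toy example. The sketch below is not the paper's method: it assumes a hypothetical 1-D corridor in which two camera positions (-2.0 and 2.0, chosen only for illustration) explain the image equally well, and compares a single averaged estimate with samples drawn from the bimodal distribution.

```python
import random
import statistics

# Hypothetical setup: two equally likely camera positions along one axis.
MODES = (-2.0, 2.0)


def sample_ambiguous_positions(n, modes=MODES, noise=0.1, seed=0):
    """Draw n positions from an equal-weight mixture of 1-D Gaussians."""
    rng = random.Random(seed)
    return [rng.gauss(rng.choice(modes), noise) for _ in range(n)]


def distance_to_nearest_mode(x, modes=MODES):
    """Distance from x to the closest plausible camera position."""
    return min(abs(x - m) for m in modes)


samples = sample_ambiguous_positions(1000)

# A single "best" estimate obtained by averaging lands between the modes,
# far from either plausible position.
mean_estimate = statistics.fmean(samples)
mean_error = distance_to_nearest_mode(mean_estimate)

# Individual samples from the distribution each stay close to one mode.
typical_sample_error = statistics.fmean(
    distance_to_nearest_mode(s) for s in samples
)

print(f"error of averaged estimate: {mean_error:.3f}")
print(f"mean error of samples:      {typical_sample_error:.3f}")
```

In this toy setting the averaged estimate sits roughly midway between the two positions, while each sample stays near one of them, which is the motivation for predicting a distribution that can be sampled.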
Related papers
- Cameras as Rays: Pose Estimation via Ray Diffusion [54.098613859015856]
Estimating camera poses is a fundamental task for 3D reconstruction and remains challenging given sparsely sampled views.
We propose a distributed representation of camera pose that treats a camera as a bundle of rays.
Our proposed methods, both regression- and diffusion-based, demonstrate state-of-the-art performance on camera pose estimation on CO3D.
arXiv Detail & Related papers (2024-02-22T18:59:56Z)
- Probabilistic Triangulation for Uncalibrated Multi-View 3D Human Pose Estimation [22.127170452402332]
This paper presents a novel Probabilistic Triangulation module that can be embedded in a calibrated 3D human pose estimation method.
Our method achieves a trade-off between estimation accuracy and generalizability.
arXiv Detail & Related papers (2023-09-09T11:03:37Z)
- GAUDI: A Neural Architect for Immersive 3D Scene Generation [67.97817314857917]
GAUDI is a generative model capable of capturing the distribution of complex and realistic 3D scenes that can be rendered immersively from a moving camera.
We show that GAUDI obtains state-of-the-art performance in the unconditional generative setting across multiple datasets.
arXiv Detail & Related papers (2022-07-27T19:10:32Z)
- Visual Camera Re-Localization Using Graph Neural Networks and Relative Pose Supervision [31.947525258453584]
Visual re-localization means using a single image as input to estimate the camera's location and orientation relative to a pre-recorded environment.
Our proposed method makes few special assumptions, and is fairly lightweight in training and testing.
We validate the effectiveness of our approach on both standard indoor (7-Scenes) and outdoor (Cambridge Landmarks) camera re-localization benchmarks.
arXiv Detail & Related papers (2021-04-06T14:29:03Z)
- Pose Guided Person Image Generation with Hidden p-Norm Regression [113.41144529452663]
We propose a novel approach to solve the pose guided person image generation task.
Our method estimates a pose-invariant feature matrix for each identity, and uses it to predict the target appearance conditioned on the target pose.
Our method yields competitive performance in all the aforementioned variant scenarios.
arXiv Detail & Related papers (2021-02-19T17:03:54Z)
- Deep Bingham Networks: Dealing with Uncertainty and Ambiguity in Pose Estimation [74.76155168705975]
Deep Bingham Networks (DBN) can handle pose-related uncertainties and ambiguities arising in almost all real life applications concerning 3D data.
DBN extends state-of-the-art direct pose regression networks with a multi-hypotheses prediction head which can yield different distribution modes.
We propose new training strategies so as to avoid mode or posterior collapse during training and to improve numerical stability.
arXiv Detail & Related papers (2020-12-20T19:20:26Z)
- PREGAN: Pose Randomization and Estimation for Weakly Paired Image Style Translation [11.623477199795037]
We propose a weakly-paired setting for the style translation, where the content in the two images is aligned with errors in poses.
PREGAN is validated on both simulated and real-world collected data to show the effectiveness.
arXiv Detail & Related papers (2020-10-31T16:11:11Z)
- 6D Camera Relocalization in Ambiguous Scenes via Continuous Multimodal Inference [67.70859730448473]
We present a multimodal camera relocalization framework that captures ambiguities and uncertainties.
We predict multiple camera pose hypotheses as well as the respective uncertainty for each prediction.
We introduce a new dataset specifically designed to foster camera localization research in ambiguous environments.
arXiv Detail & Related papers (2020-04-09T20:55:06Z)