A Probabilistic Framework for Visual Localization in Ambiguous Scenes
- URL: http://arxiv.org/abs/2301.02086v1
- Date: Thu, 5 Jan 2023 14:46:54 GMT
- Title: A Probabilistic Framework for Visual Localization in Ambiguous Scenes
- Authors: Fereidoon Zangeneh, Leonard Bruns, Amit Dekel, Alessandro Pieropan and
Patric Jensfelt
- Abstract summary: We propose a probabilistic framework that for a given image predicts the arbitrarily shaped posterior distribution of its camera pose.
We do this via a novel formulation of camera pose regression using variational inference, which allows sampling from the predicted distribution.
Our method outperforms existing methods on localization in ambiguous scenes.
- Score: 64.13544430239267
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Visual localization allows autonomous robots to relocalize when losing track
of their pose by matching their current observation with past ones. However,
ambiguous scenes pose a challenge for such systems, as repetitive structures
can be viewed from many distinct, equally likely camera poses, which means it
is not sufficient to produce a single best pose hypothesis. In this work, we
propose a probabilistic framework that for a given image predicts the
arbitrarily shaped posterior distribution of its camera pose. We do this via a
novel formulation of camera pose regression using variational inference, which
allows sampling from the predicted distribution. Our method outperforms
existing methods on localization in ambiguous scenes. Code and data will be
released at https://github.com/efreidun/vapor.
Related papers
- No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images [100.80376573969045]
NoPoSplat is a feed-forward model capable of reconstructing 3D scenes parameterized by 3D Gaussians from multi-view images.
Our model achieves real-time 3D Gaussian reconstruction during inference.
This work makes significant advances in pose-free generalizable 3D reconstruction and demonstrates its applicability to real-world scenarios.
arXiv Detail & Related papers (2024-10-31T17:58:22Z) - Conditional Variational Autoencoders for Probabilistic Pose Regression [45.563533339332615]
We propose a probabilistic method to predict the posterior distribution of camera poses given an observed image.
Our proposed training strategy results in a generative model of camera poses given an image, which can be used to draw samples from the pose posterior distribution.
arXiv Detail & Related papers (2024-10-07T12:43:50Z) - ADen: Adaptive Density Representations for Sparse-view Camera Pose Estimation [17.097170273209333]
Recovering camera poses from a set of images is a foundational task in 3D computer vision.
Recent data-driven approaches aim to directly output camera poses, either through regressing the 6DoF camera poses or formulating rotation as a probability distribution.
We propose ADen to unify the two frameworks by employing a generator and a discriminator.
arXiv Detail & Related papers (2024-08-16T22:45:46Z) - Cameras as Rays: Pose Estimation via Ray Diffusion [54.098613859015856]
Estimating camera poses is a fundamental task for 3D reconstruction and remains challenging given sparsely sampled views.
We propose a distributed representation of camera pose that treats a camera as a bundle of rays.
Our proposed methods, both regression- and diffusion-based, demonstrate state-of-the-art performance on camera pose estimation on CO3D.
arXiv Detail & Related papers (2024-02-22T18:59:56Z) - Probabilistic Triangulation for Uncalibrated Multi-View 3D Human Pose
Estimation [22.127170452402332]
This paper presents a novel Probabilistic Triangulation module that can be embedded in a calibrated 3D human pose estimation method.
Our method achieves a trade-off between estimation accuracy and generalizability.
arXiv Detail & Related papers (2023-09-09T11:03:37Z) - GAUDI: A Neural Architect for Immersive 3D Scene Generation [67.97817314857917]
GAUDI is a generative model capable of capturing the distribution of complex and realistic 3D scenes that can be rendered immersively from a moving camera.
We show that GAUDI obtains state-of-the-art performance in the unconditional generative setting across multiple datasets.
arXiv Detail & Related papers (2022-07-27T19:10:32Z) - Visual Camera Re-Localization Using Graph Neural Networks and Relative
Pose Supervision [31.947525258453584]
Visual re-localization means using a single image as input to estimate the camera's location and orientation relative to a pre-recorded environment.
Our proposed method makes few special assumptions, and is fairly lightweight in training and testing.
We validate the effectiveness of our approach on both standard indoor (7-Scenes) and outdoor (Cambridge Landmarks) camera re-localization benchmarks.
arXiv Detail & Related papers (2021-04-06T14:29:03Z) - 6D Camera Relocalization in Ambiguous Scenes via Continuous Multimodal
Inference [67.70859730448473]
We present a multimodal camera relocalization framework that captures ambiguities and uncertainties.
We predict multiple camera pose hypotheses as well as the respective uncertainty for each prediction.
We introduce a new dataset specifically designed to foster camera localization research in ambiguous environments.
arXiv Detail & Related papers (2020-04-09T20:55:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.