Handbook on Leveraging Lines for Two-View Relative Pose Estimation
- URL: http://arxiv.org/abs/2309.16040v1
- Date: Wed, 27 Sep 2023 21:43:04 GMT
- Title: Handbook on Leveraging Lines for Two-View Relative Pose Estimation
- Authors: Petr Hruby, Shaohui Liu, R\'emi Pautrat, Marc Pollefeys, Daniel Barath
- Abstract summary: We propose an approach for estimating the relative pose between image pairs by jointly exploiting points, lines, and their coincidences in a hybrid manner.
Our hybrid framework combines the advantages of all configurations, enabling robust and accurate estimation in challenging environments.
- Score: 82.72686460985297
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We propose an approach for estimating the relative pose between calibrated
image pairs by jointly exploiting points, lines, and their coincidences in a
hybrid manner. We investigate all possible configurations where these data
modalities can be used together and review the minimal solvers available in the
literature. Our hybrid framework combines the advantages of all configurations,
enabling robust and accurate estimation in challenging environments. In
addition, we design a method for jointly estimating multiple vanishing point
correspondences in two images, and a bundle adjustment that considers all
relevant data modalities. Experiments on various indoor and outdoor datasets
show that our approach outperforms point-based methods, improving
AUC@10$^\circ$ by 1-7 points while running at comparable speeds. The source
code of the solvers and hybrid framework will be made public.
Related papers
- Robust Two-View Geometry Estimation with Implicit Differentiation [2.048226951354646]
We present a novel two-view geometry estimation framework.
It is based on a differentiable robust loss function fitting.
We evaluate our approach on the camera pose estimation task in both outdoor and indoor scenarios.
arXiv Detail & Related papers (2024-10-23T15:51:33Z) - Str-L Pose: Integrating Point and Structured Line for Relative Pose Estimation in Dual-Graph [45.115555973941255]
Relative pose estimation is crucial for various computer vision applications, including Robotic and Autonomous Driving.
We propose a Geometric Correspondence Graph neural network that integrates point features with extra structured line segments.
This integration of matched points and line segments further exploits the geometry constraints and enhances model performance across different environments.
arXiv Detail & Related papers (2024-08-28T12:33:26Z) - Multiway Point Cloud Mosaicking with Diffusion and Global Optimization [74.3802812773891]
We introduce a novel framework for multiway point cloud mosaicking (named Wednesday)
At the core of our approach is ODIN, a learned pairwise registration algorithm that identifies overlaps and refines attention scores.
Tested on four diverse, large-scale datasets, our method state-of-the-art pairwise and rotation registration results by a large margin on all benchmarks.
arXiv Detail & Related papers (2024-03-30T17:29:13Z) - AlignMiF: Geometry-Aligned Multimodal Implicit Field for LiDAR-Camera
Joint Synthesis [98.3959800235485]
Recently, there exist some methods exploring multiple modalities within a single field, aiming to share implicit features from different modalities to enhance reconstruction performance.
In this work, we conduct comprehensive analyses on the multimodal implicit field of LiDAR-camera joint synthesis, revealing the underlying issue lies in the misalignment of different sensors.
We introduce AlignMiF, a geometrically aligned multimodal implicit field with two proposed modules: Geometry-Aware Alignment (GAA) and Shared Geometry Initialization (SGI)
arXiv Detail & Related papers (2024-02-27T13:08:47Z) - 360 Layout Estimation via Orthogonal Planes Disentanglement and Multi-view Geometric Consistency Perception [56.84921040837699]
Existing panoramic layout estimation solutions tend to recover room boundaries from a vertically compressed sequence, yielding imprecise results.
We propose an orthogonal plane disentanglement network (termed DOPNet) to distinguish ambiguous semantics.
We also present an unsupervised adaptation technique tailored for horizon-depth and ratio representations.
Our solution outperforms other SoTA models on both monocular layout estimation and multi-view layout estimation tasks.
arXiv Detail & Related papers (2023-12-26T12:16:03Z) - Unsupervised Manifold Alignment with Joint Multidimensional Scaling [4.683612295430957]
We introduce Joint Multidimensional Scaling, which maps datasets from two different domains to a common low-dimensional Euclidean space.
Our approach integrates Multidimensional Scaling (MDS) and Wasserstein Procrustes analysis into a joint optimization problem.
We demonstrate the effectiveness of our approach in several applications, including joint visualization of two datasets, unsupervised heterogeneous domain adaptation, graph matching, and protein structure alignment.
arXiv Detail & Related papers (2022-07-06T21:02:42Z) - Hybrid Relation Guided Set Matching for Few-shot Action Recognition [51.3308583226322]
We propose a novel Hybrid Relation guided Set Matching (HyRSM) approach that incorporates two key components.
The purpose of the hybrid relation module is to learn task-specific embeddings by fully exploiting associated relations within and cross videos in an episode.
We evaluate HyRSM on six challenging benchmarks, and the experimental results show its superiority over the state-of-the-art methods by a convincing margin.
arXiv Detail & Related papers (2022-04-28T11:43:41Z) - An Adaptive Framework for Learning Unsupervised Depth Completion [59.17364202590475]
We present a method to infer a dense depth map from a color image and associated sparse depth measurements.
We show that regularization and co-visibility are related via the fitness of the model to data and can be unified into a single framework.
arXiv Detail & Related papers (2021-06-06T02:27:55Z) - Improving Calibration in Deep Metric Learning With Cross-Example Softmax [11.014197662964335]
We propose Cross-Example Softmax which combines the properties of top-$k$ and threshold relevancy.
In each iteration, the proposed loss encourages all queries to be closer to their matching images than all queries are to all non-matching images.
This leads to a globally more calibrated similarity metric and makes distance more interpretable as an absolute measure of relevance.
arXiv Detail & Related papers (2020-11-17T18:47:28Z) - Indoor Layout Estimation by 2D LiDAR and Camera Fusion [3.2387553628943535]
This paper presents an algorithm for indoor layout estimation and reconstruction through the fusion of a sequence of captured images and LiDAR data sets.
In the proposed system, a movable platform collects both intensity images and 2D LiDAR information.
arXiv Detail & Related papers (2020-01-15T16:43:35Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.