MSC-VO: Exploiting Manhattan and Structural Constraints for Visual
  Odometry
        - URL: http://arxiv.org/abs/2111.03408v1
- Date: Fri, 5 Nov 2021 11:29:52 GMT
- Title: MSC-VO: Exploiting Manhattan and Structural Constraints for Visual
  Odometry
- Authors: Joan P. Company-Corcoles, Emilio Garcia-Fidalgo, Alberto Ortiz
- Abstract summary: We introduce MSC-VO, an RGB-D -based visual odometry approach that combines both point and line features and leverages, if exist, those structural regularities and the Manhattan axes of the scene.
 MSC-VO is assessed using several public datasets, outperforming other state-of-the-art solutions, and comparing favourably even with some SLAM methods.
- Score: 3.1583465114791105
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract:   Visual odometry algorithms tend to degrade when facing low-textured scenes
-from e.g. human-made environments-, where it is often difficult to find a
sufficient number of point features. Alternative geometrical visual cues, such
as lines, which can often be found within these scenarios, can become
particularly useful. Moreover, these scenarios typically present structural
regularities, such as parallelism or orthogonality, and hold the Manhattan
World assumption. Under these premises, in this work, we introduce MSC-VO, an
RGB-D -based visual odometry approach that combines both point and line
features and leverages, if exist, those structural regularities and the
Manhattan axes of the scene. Within our approach, these structural constraints
are initially used to estimate accurately the 3D position of the extracted
lines. These constraints are also combined next with the estimated Manhattan
axes and the reprojection errors of points and lines to refine the camera pose
by means of local map optimization. Such a combination enables our approach to
operate even in the absence of the aforementioned constraints, allowing the
method to work for a wider variety of scenarios. Furthermore, we propose a
novel multi-view Manhattan axes estimation procedure that mainly relies on line
features. MSC-VO is assessed using several public datasets, outperforming other
state-of-the-art solutions, and comparing favourably even with some SLAM
methods.
 
      
        Related papers
        - Cross-Modal Geometric Hierarchy Fusion: An Implicit-Submap Driven   Framework for Resilient 3D Place Recognition [4.196626042312499]
 We propose a novel framework that redefines 3D place recognition through density-agnostic geometric reasoning.<n>Specifically, we introduce an implicit 3D representation based on elastic points, which is immune to the interference of original scene point cloud density.<n>With the aid of these two types of information, we obtain descriptors that fuse geometric information from both bird's-eye view and 3D segment perspectives.
 arXiv  Detail & Related papers  (2025-06-17T07:04:07Z)
- Robust Incremental Structure-from-Motion with Hybrid Features [73.55745864762703]
 We introduce an incremental Structure-from-Motion (SfM) system that leverages lines and their structured geometric relations.
Our system is consistently more robust and accurate compared to the widely used point-based state of the art in SfM.
 arXiv  Detail & Related papers  (2024-09-29T22:20:32Z)
- GLACE: Global Local Accelerated Coordinate Encoding [66.87005863868181]
 Scene coordinate regression methods are effective in small-scale scenes but face significant challenges in large-scale scenes.
We propose GLACE, which integrates pre-trained global and local encodings and enables SCR to scale to large scenes with only a single small-sized network.
Our method achieves state-of-the-art results on large-scale scenes with a low-map-size model.
 arXiv  Detail & Related papers  (2024-06-06T17:59:50Z)
- 360 Layout Estimation via Orthogonal Planes Disentanglement and   Multi-view Geometric Consistency Perception [56.84921040837699]
 Existing panoramic layout estimation solutions tend to recover room boundaries from a vertically compressed sequence, yielding imprecise results.
We propose an orthogonal plane disentanglement network (termed DOPNet) to distinguish ambiguous semantics.
We also present an unsupervised adaptation technique tailored for horizon-depth and ratio representations.
Our solution outperforms other SoTA models on both monocular layout estimation and multi-view layout estimation tasks.
 arXiv  Detail & Related papers  (2023-12-26T12:16:03Z)
- Neural 3D Scene Reconstruction with the Manhattan-world Assumption [58.90559966227361]
 This paper addresses the challenge of reconstructing 3D indoor scenes from multi-view images.
Planar constraints can be conveniently integrated into the recent implicit neural representation-based reconstruction methods.
The proposed method outperforms previous methods by a large margin on 3D reconstruction quality.
 arXiv  Detail & Related papers  (2022-05-05T17:59:55Z)
- Visual SLAM with Graph-Cut Optimized Multi-Plane Reconstruction [11.215334675788952]
 This paper presents a semantic planar SLAM system that improves pose estimation and mapping using cues from an instance planar segmentation network.
While the mainstream approaches are using RGB-D sensors, employing a monocular camera with such a system still faces challenges such as robust data association and precise geometric model fitting.
 arXiv  Detail & Related papers  (2021-08-09T18:16:08Z)
- Multi-View Optimization of Local Feature Geometry [70.18863787469805]
 We address the problem of refining the geometry of local image features from multiple views without known scene or camera geometry.
Our proposed method naturally complements the traditional feature extraction and matching paradigm.
We show that our method consistently improves the triangulation and camera localization performance for both hand-crafted and learned local features.
 arXiv  Detail & Related papers  (2020-03-18T17:22:11Z)
- From Planes to Corners: Multi-Purpose Primitive Detection in Unorganized
  3D Point Clouds [59.98665358527686]
 We propose a new method for segmentation-free joint estimation of orthogonal planes.
Such unified scene exploration allows for multitudes of applications such as semantic plane detection or local and global scan alignment.
Our experiments demonstrate the validity of our approach in numerous scenarios from wall detection to 6D tracking.
 arXiv  Detail & Related papers  (2020-01-21T06:51:47Z)
- Plane Pair Matching for Efficient 3D View Registration [7.920114031312631]
 We present a novel method to estimate the motion matrix between overlapping pairs of 3D views in the context of indoor scenes.
We use the Manhattan world assumption to introduce lightweight geometric constraints under the form of planes quadri into the problem.
We validate our approach on a toy example and present quantitative experiments on a public RGB-D dataset, comparing against recent state-of-the-art methods.
 arXiv  Detail & Related papers  (2020-01-20T11:15:26Z)
- DeepFactors: Real-Time Probabilistic Dense Monocular SLAM [29.033778410908877]
 We present a SLAM system that unifies methods in a probabilistic framework while still maintaining real-time performance.
This is achieved through the use of a learned compact depth map representation and reformulating three different types of errors.
We evaluate our system on trajectory estimation and depth reconstruction on real-world sequences and present various examples of estimated dense geometry.
 arXiv  Detail & Related papers  (2020-01-14T21:08:51Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.