VOOM: Robust Visual Object Odometry and Mapping using Hierarchical
Landmarks
- URL: http://arxiv.org/abs/2402.13609v2
- Date: Mon, 26 Feb 2024 10:02:57 GMT
- Title: VOOM: Robust Visual Object Odometry and Mapping using Hierarchical
Landmarks
- Authors: Yutong Wang, Chaoyang Jiang, Xieyuanli Chen
- Abstract summary: We propose a Visual Object Odometry and Mapping framework VOOM.
We use high-level objects and low-level points as the hierarchical landmarks in a coarse-to-fine manner.
VOOM outperforms both object-oriented SLAM and feature points SLAM systems in terms of localization.
- Score: 19.789761641342043
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In recent years, object-oriented simultaneous localization and mapping (SLAM)
has attracted increasing attention due to its ability to provide high-level
semantic information while maintaining computational efficiency. Some
researchers have attempted to enhance localization accuracy by integrating the
modeled object residuals into bundle adjustment. However, few have demonstrated
better results than feature-based visual SLAM systems, as the generic coarse
object models, such as cuboids or ellipsoids, are less accurate than feature
points. In this paper, we propose a Visual Object Odometry and Mapping
framework VOOM using high-level objects and low-level points as the
hierarchical landmarks in a coarse-to-fine manner instead of directly using
object residuals in bundle adjustment. Firstly, we introduce an improved
observation model and a novel data association method for dual quadrics,
employed to represent physical objects. It facilitates the creation of a 3D map
that closely reflects reality. Next, we use object information to enhance the
data association of feature points and consequently update the map. In the
visual object odometry backend, the updated map is employed to further optimize
the camera pose and the objects. Meanwhile, local bundle adjustment is
performed utilizing the objects and points-based covisibility graphs in our
visual object mapping process. Experiments show that VOOM outperforms both
object-oriented SLAM and feature points SLAM systems such as ORB-SLAM2 in terms
of localization. The implementation of our method is available at
https://github.com/yutongwangBIT/VOOM.git.
Related papers
- GOReloc: Graph-based Object-Level Relocalization for Visual SLAM [17.608119427712236]
This article introduces a novel method for object-level relocalization of robotic systems.
It determines the pose of a camera sensor by robustly associating the object detections in the current frame with 3D objects in a lightweight object-level map.
arXiv Detail & Related papers (2024-08-15T03:54:33Z) - Semantic Object-level Modeling for Robust Visual Camera Relocalization [14.998133272060695]
We propose a novel method of automatic object-level voxel modeling for accurate ellipsoidal representations of objects.
All of these modules are entirely intergrated into visual SLAM system.
arXiv Detail & Related papers (2024-02-10T13:39:44Z) - Background Activation Suppression for Weakly Supervised Object
Localization and Semantic Segmentation [84.62067728093358]
Weakly supervised object localization and semantic segmentation aim to localize objects using only image-level labels.
New paradigm has emerged by generating a foreground prediction map to achieve pixel-level localization.
This paper presents two astonishing experimental observations on the object localization learning process.
arXiv Detail & Related papers (2023-09-22T15:44:10Z) - 3DRP-Net: 3D Relative Position-aware Network for 3D Visual Grounding [58.924180772480504]
3D visual grounding aims to localize the target object in a 3D point cloud by a free-form language description.
We propose a relation-aware one-stage framework, named 3D Relative Position-aware Network (3-Net)
arXiv Detail & Related papers (2023-07-25T09:33:25Z) - An Object SLAM Framework for Association, Mapping, and High-Level Tasks [12.62957558651032]
We present a comprehensive object SLAM framework that focuses on object-based perception and object-oriented robot tasks.
A range of public datasets and real-world results have been used to evaluate the proposed object SLAM framework for its efficient performance.
arXiv Detail & Related papers (2023-05-12T08:10:14Z) - 3D Video Object Detection with Learnable Object-Centric Global
Optimization [65.68977894460222]
Correspondence-based optimization is the cornerstone for 3D scene reconstruction but is less studied in 3D video object detection.
We propose BA-Det, an end-to-end optimizable object detector with object-centric temporal correspondence learning and featuremetric object bundle adjustment.
arXiv Detail & Related papers (2023-03-27T17:39:39Z) - NeuSE: Neural SE(3)-Equivariant Embedding for Consistent Spatial
Understanding with Objects [53.111397800478294]
We present NeuSE, a novel Neural SE(3)-Equivariant Embedding for objects.
NeuSE serves as a compact point cloud surrogate for complete object models.
Our proposed SLAM paradigm, using NeuSE for object shape and pose characterization, can operate independently or in conjunction with typical SLAM systems.
arXiv Detail & Related papers (2023-03-13T17:30:43Z) - Robust Change Detection Based on Neural Descriptor Fields [53.111397800478294]
We develop an object-level online change detection approach that is robust to partially overlapping observations and noisy localization results.
By associating objects via shape code similarity and comparing local object-neighbor spatial layout, our proposed approach demonstrates robustness to low observation overlap and localization noises.
arXiv Detail & Related papers (2022-08-01T17:45:36Z) - Object-Augmented RGB-D SLAM for Wide-Disparity Relocalisation [3.888848425698769]
We propose a novel object-augmented RGB-D SLAM system that is capable of constructing a consistent object map and performing relocalisation based on centroids of objects in the map.
arXiv Detail & Related papers (2021-08-05T11:02:25Z) - Objects are Different: Flexible Monocular 3D Object Detection [87.82253067302561]
We propose a flexible framework for monocular 3D object detection which explicitly decouples the truncated objects and adaptively combines multiple approaches for object depth estimation.
Experiments demonstrate that our method outperforms the state-of-the-art method by relatively 27% for the moderate level and 30% for the hard level in the test set of KITTI benchmark.
arXiv Detail & Related papers (2021-04-06T07:01:28Z) - OrcVIO: Object residual constrained Visual-Inertial Odometry [18.3130718336919]
This work presents OrcVIO, for visual-inertial odometry tightly coupled with tracking and optimization over structured object models.
The ability of OrcVIO for accurate trajectory estimation and large-scale object-level mapping is evaluated using real data.
arXiv Detail & Related papers (2020-07-29T21:01:37Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.