Object-oriented SLAM using Quadrics and Symmetry Properties for Indoor
Environments
- URL: http://arxiv.org/abs/2004.05303v1
- Date: Sat, 11 Apr 2020 04:15:25 GMT
- Title: Object-oriented SLAM using Quadrics and Symmetry Properties for Indoor
Environments
- Authors: Ziwei Liao, Wei Wang, Xianyu Qi, Xiaoyu Zhang, Lin Xue, Jianzhen Jiao
and Ran Wei
- Abstract summary: This paper proposes a sparse object-level SLAM algorithm based on an RGB-D camera.
A quadric representation is used as a landmark to compactly model objects, including their position, orientation, and occupied space.
Experiments have shown that compared with the state-of-art algorithm, especially on the forward trajectory of mobile robots, the proposed algorithm significantly improves the accuracy and convergence speed of quadric reconstruction.
- Score: 11.069661312755034
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Aiming at the application environment of indoor mobile robots, this paper
proposes a sparse object-level SLAM algorithm based on an RGB-D camera. A
quadric representation is used as a landmark to compactly model objects,
including their position, orientation, and occupied space. The state-of-art
quadric-based SLAM algorithm faces the observability problem caused by the
limited perspective under the plane trajectory of the mobile robot. To solve
the problem, the proposed algorithm fuses both object detection and point cloud
data to estimate the quadric parameters. It finishes the quadric initialization
based on a single frame of RGB-D data, which significantly reduces the
requirements for perspective changes. As objects are often observed locally,
the proposed algorithm uses the symmetrical properties of indoor artificial
objects to estimate the occluded parts to obtain more accurate quadric
parameters. Experiments have shown that compared with the state-of-art
algorithm, especially on the forward trajectory of mobile robots, the proposed
algorithm significantly improves the accuracy and convergence speed of quadric
reconstruction. Finally, we made available an opensource implementation to
replicate the experiments.
Related papers
- RAD: A Dataset and Benchmark for Real-Life Anomaly Detection with Robotic Observations [18.23500204496233]
The Realistic Anomaly Detection dataset (RAD) is the first multi-view RGB-based anomaly detection dataset specifically collected using a real robot arm.
RAD comprises 4765 images across 13 categories and 4 defect types, collected from more than 50 viewpoints.
We propose a data augmentation method to improve the accuracy of pose estimation and facilitate the reconstruction of 3D point clouds.
arXiv Detail & Related papers (2024-10-01T14:05:35Z) - Sparse Color-Code Net: Real-Time RGB-Based 6D Object Pose Estimation on Edge Devices [2.3281513013731145]
Our proposed Color-Code Net ( SCCN) embodies a clear and concise pipeline design to address this requirement.
SCCN performs pixel-level predictions on the target object in the RGB image, utilizing the sparsity of essential object geometry features to speed up the Perspective-n-Point process.
It notably achieves an estimation rate of 19 frames per second (FPS) and 6 FPS on the benchmark LINEMOD dataset and the OcclusionMOD dataset.
arXiv Detail & Related papers (2024-06-05T06:21:48Z) - RGB-based Category-level Object Pose Estimation via Decoupled Metric
Scale Recovery [72.13154206106259]
We propose a novel pipeline that decouples the 6D pose and size estimation to mitigate the influence of imperfect scales on rigid transformations.
Specifically, we leverage a pre-trained monocular estimator to extract local geometric information.
A separate branch is designed to directly recover the metric scale of the object based on category-level statistics.
arXiv Detail & Related papers (2023-09-19T02:20:26Z) - Detecting Rotated Objects as Gaussian Distributions and Its 3-D
Generalization [81.29406957201458]
Existing detection methods commonly use a parameterized bounding box (BBox) to model and detect (horizontal) objects.
We argue that such a mechanism has fundamental limitations in building an effective regression loss for rotation detection.
We propose to model the rotated objects as Gaussian distributions.
We extend our approach from 2-D to 3-D with a tailored algorithm design to handle the heading estimation.
arXiv Detail & Related papers (2022-09-22T07:50:48Z) - IFOR: Iterative Flow Minimization for Robotic Object Rearrangement [92.97142696891727]
IFOR, Iterative Flow Minimization for Robotic Object Rearrangement, is an end-to-end method for the problem of object rearrangement for unknown objects.
We show that our method applies to cluttered scenes, and in the real world, while training only on synthetic data.
arXiv Detail & Related papers (2022-02-01T20:03:56Z) - Accurate and Robust Object-oriented SLAM with 3D Quadric Landmark
Construction in Outdoor Environment [4.881705044039887]
We propose a stereo visual SLAM with a robust quadric landmark representation method.
The proposed system is more robust to observation noise and significantly outperforms current state-of-the-art methods in outdoor environments.
arXiv Detail & Related papers (2021-10-18T02:03:51Z) - Leveraging Spatial and Photometric Context for Calibrated Non-Lambertian
Photometric Stereo [61.6260594326246]
We introduce an efficient fully-convolutional architecture that can leverage both spatial and photometric context simultaneously.
Using separable 4D convolutions and 2D heat-maps reduces the size and makes more efficient.
arXiv Detail & Related papers (2021-03-22T18:06:58Z) - Spatial Attention Improves Iterative 6D Object Pose Estimation [52.365075652976735]
We propose a new method for 6D pose estimation refinement from RGB images.
Our main insight is that after the initial pose estimate, it is important to pay attention to distinct spatial features of the object.
We experimentally show that this approach learns to attend to salient spatial features and learns to ignore occluded parts of the object, leading to better pose estimation across datasets.
arXiv Detail & Related papers (2021-01-05T17:18:52Z) - Nothing But Geometric Constraints: A Model-Free Method for Articulated
Object Pose Estimation [89.82169646672872]
We propose an unsupervised vision-based system to estimate the joint configurations of the robot arm from a sequence of RGB or RGB-D images without knowing the model a priori.
We combine a classical geometric formulation with deep learning and extend the use of epipolar multi-rigid-body constraints to solve this task.
arXiv Detail & Related papers (2020-11-30T20:46:48Z) - Moving object detection for visual odometry in a dynamic environment
based on occlusion accumulation [31.143322364794894]
We propose a moving object detection algorithm that uses RGB-D images.
The proposed algorithm does not require estimating a background model.
We use dense visual odometry (DVO) as a VO method with a bi-square regression weight.
arXiv Detail & Related papers (2020-09-18T11:01:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.