Accurate and Robust Object-oriented SLAM with 3D Quadric Landmark
Construction in Outdoor Environment
- URL: http://arxiv.org/abs/2110.08977v1
- Date: Mon, 18 Oct 2021 02:03:51 GMT
- Title: Accurate and Robust Object-oriented SLAM with 3D Quadric Landmark
Construction in Outdoor Environment
- Authors: Rui Tian, Yunzhou Zhang, Yonghui Feng, Linghao Yang, Zhenzhong Cao,
Sonya Coleman, Dermot Kerr
- Abstract summary: We propose a stereo visual SLAM with a robust quadric landmark representation method.
The proposed system is more robust to observation noise and significantly outperforms current state-of-the-art methods in outdoor environments.
- Score: 4.881705044039887
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Object-oriented SLAM is a popular technology in autonomous driving and
robotics. In this paper, we propose a stereo visual SLAM with a robust quadric
landmark representation method. The system consists of four components,
including deep learning detection, object-oriented data association, dual
quadric landmark initialization and object-based pose optimization.
State-of-the-art quadric-based SLAM algorithms always face observation related
problems and are sensitive to observation noise, which limits their application
in outdoor scenes. To solve this problem, we propose a quadric initialization
method based on the decoupling of the quadric parameters method, which improves
the robustness to observation noise. The sufficient object data association
algorithm and object-oriented optimization with multiple cues enables a highly
accurate object pose estimation that is robust to local observations.
Experimental results show that the proposed system is more robust to
observation noise and significantly outperforms current state-of-the-art
methods in outdoor environments. In addition, the proposed system demonstrates
real-time performance.
Related papers
- ESVO2: Direct Visual-Inertial Odometry with Stereo Event Cameras [33.81592783496106]
Event-based visual odometry aims at solving tracking and mapping sub-problems in parallel.
We build an event-based stereo visual-inertial odometry system on top of our previous direct pipeline Event-based Stereo Visual Odometry.
arXiv Detail & Related papers (2024-10-12T05:35:27Z) - RAD: A Dataset and Benchmark for Real-Life Anomaly Detection with Robotic Observations [18.23500204496233]
The Realistic Anomaly Detection dataset (RAD) is the first multi-view RGB-based anomaly detection dataset specifically collected using a real robot arm.
RAD comprises 4765 images across 13 categories and 4 defect types, collected from more than 50 viewpoints.
We propose a data augmentation method to improve the accuracy of pose estimation and facilitate the reconstruction of 3D point clouds.
arXiv Detail & Related papers (2024-10-01T14:05:35Z) - Graphical Object-Centric Actor-Critic [55.2480439325792]
We propose a novel object-centric reinforcement learning algorithm combining actor-critic and model-based approaches.
We use a transformer encoder to extract object representations and graph neural networks to approximate the dynamics of an environment.
Our algorithm performs better in a visually complex 3D robotic environment and a 2D environment with compositional structure than the state-of-the-art model-free actor-critic algorithm.
arXiv Detail & Related papers (2023-10-26T06:05:12Z) - UniQuadric: A SLAM Backend for Unknown Rigid Object 3D Tracking and
Light-Weight Modeling [7.626461564400769]
We propose a novel SLAM backend that unifies ego-motion tracking, rigid object motion tracking, and modeling.
Our system showcases the potential application of object perception in complex dynamic scenes.
arXiv Detail & Related papers (2023-09-29T07:50:09Z) - DETR4D: Direct Multi-View 3D Object Detection with Sparse Attention [50.11672196146829]
3D object detection with surround-view images is an essential task for autonomous driving.
We propose DETR4D, a Transformer-based framework that explores sparse attention and direct feature query for 3D object detection in multi-view images.
arXiv Detail & Related papers (2022-12-15T14:18:47Z) - CPPF++: Uncertainty-Aware Sim2Real Object Pose Estimation by Vote Aggregation [67.12857074801731]
We introduce a novel method, CPPF++, designed for sim-to-real pose estimation.
To address the challenge posed by vote collision, we propose a novel approach that involves modeling the voting uncertainty.
We incorporate several innovative modules, including noisy pair filtering, online alignment optimization, and a feature ensemble.
arXiv Detail & Related papers (2022-11-24T03:27:00Z) - Neural Motion Fields: Encoding Grasp Trajectories as Implicit Value
Functions [65.84090965167535]
We present Neural Motion Fields, a novel object representation which encodes both object point clouds and the relative task trajectories as an implicit value function parameterized by a neural network.
This object-centric representation models a continuous distribution over the SE(3) space and allows us to perform grasping reactively by leveraging sampling-based MPC to optimize this value function.
arXiv Detail & Related papers (2022-06-29T18:47:05Z) - Object-based Illumination Estimation with Rendering-aware Neural
Networks [56.01734918693844]
We present a scheme for fast environment light estimation from the RGBD appearance of individual objects and their local image areas.
With the estimated lighting, virtual objects can be rendered in AR scenarios with shading that is consistent to the real scene.
arXiv Detail & Related papers (2020-08-06T08:23:19Z) - Robust Ego and Object 6-DoF Motion Estimation and Tracking [5.162070820801102]
This paper proposes a robust solution to achieve accurate estimation and consistent track-ability for dynamic multi-body visual odometry.
A compact and effective framework is proposed leveraging recent advances in semantic instance-level segmentation and accurate optical flow estimation.
A novel formulation, jointly optimizing SE(3) motion and optical flow is introduced that improves the quality of the tracked points and the motion estimation accuracy.
arXiv Detail & Related papers (2020-07-28T05:12:56Z) - Benchmarking Unsupervised Object Representations for Video Sequences [111.81492107649889]
We compare the perceptual abilities of four object-centric approaches: ViMON, OP3, TBA and SCALOR.
Our results suggest that the architectures with unconstrained latent representations learn more powerful representations in terms of object detection, segmentation and tracking.
Our benchmark may provide fruitful guidance towards learning more robust object-centric video representations.
arXiv Detail & Related papers (2020-06-12T09:37:24Z) - Object-oriented SLAM using Quadrics and Symmetry Properties for Indoor
Environments [11.069661312755034]
This paper proposes a sparse object-level SLAM algorithm based on an RGB-D camera.
A quadric representation is used as a landmark to compactly model objects, including their position, orientation, and occupied space.
Experiments have shown that compared with the state-of-art algorithm, especially on the forward trajectory of mobile robots, the proposed algorithm significantly improves the accuracy and convergence speed of quadric reconstruction.
arXiv Detail & Related papers (2020-04-11T04:15:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.