Convex Hull-based Algebraic Constraint for Visual Quadric SLAM
- URL: http://arxiv.org/abs/2503.01254v1
- Date: Mon, 03 Mar 2025 07:30:07 GMT
- Title: Convex Hull-based Algebraic Constraint for Visual Quadric SLAM
- Authors: Xiaolong Yu, Junqiao Zhao, Shuangfu Song, Zhongyang Zhu, Zihan Yuan, Chen Ye, Tiantian Feng,
- Abstract summary: Using quadrics as the object representation has the benefits of both generality closed-form projection between image and world spaces.<n>Although numerous have been proposed for dualc reconstruction, we found that many of them are imprecise and provide minimal improvements to localization.<n>We introduce a concise yet more precise convex hull-based constraint for object landmarks.<n>Experiments on public datasets demonstrate that our approach is applicable to both monocular and RGB-D SLAM.
- Score: 9.855936120653995
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Using Quadrics as the object representation has the benefits of both generality and closed-form projection derivation between image and world spaces. Although numerous constraints have been proposed for dual quadric reconstruction, we found that many of them are imprecise and provide minimal improvements to localization.After scrutinizing the existing constraints, we introduce a concise yet more precise convex hull-based algebraic constraint for object landmarks, which is applied to object reconstruction, frontend pose estimation, and backend bundle adjustment.This constraint is designed to fully leverage precise semantic segmentation, effectively mitigating mismatches between complex-shaped object contours and dual quadrics.Experiments on public datasets demonstrate that our approach is applicable to both monocular and RGB-D SLAM and achieves improved object mapping and localization than existing quadric SLAM methods. The implementation of our method is available at https://github.com/tiev-tongji/convexhull-based-algebraic-constraint.
Related papers
- BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation [58.14071520415005]
This paper presents a general RGB-based approach for object pose estimation, specifically designed to address challenges in sparse-view settings.
To overcome these limitations, we introduce corner points of the object bounding box as an intermediate representation of the object pose.
The 3D object corners can be reliably recovered from sparse input views, while the 2D corner points in the target view are estimated through a novel reference-based point datasets.
arXiv Detail & Related papers (2025-04-10T17:58:35Z) - Pointmap Association and Piecewise-Plane Constraint for Consistent and Compact 3D Gaussian Segmentation Field [24.145668344380766]
CCGS is a method designed to achieve both view consistent 2D segmentation and a compact 3D Gaussian segmentation field.<n> Experimental results on ScanNet and Replica datasets demonstrate that CCGS outperforms existing methods in both 2D panoptic segmentation and 3D Gaussian segmentation.
arXiv Detail & Related papers (2025-02-22T17:23:46Z) - UNOPose: Unseen Object Pose Estimation with an Unposed RGB-D Reference Image [86.7128543480229]
We present a novel approach and benchmark, termed UNOPose, for unseen one-reference-based object pose estimation.
Building upon a coarse-to-fine paradigm, UNOPose constructs an SE(3)-invariant reference frame to standardize object representation.
We recalibrate the weight of each correspondence based on its predicted likelihood of being within the overlapping region.
arXiv Detail & Related papers (2024-11-25T05:36:00Z) - LCP-Fusion: A Neural Implicit SLAM with Enhanced Local Constraints and Computable Prior [22.948360879064758]
LCP-Fusion is a neural implicit SLAM system with enhanced local constraints and computable prior.
We show that our method achieve better localization accuracy and reconstruction consistency than existing RGB-D implicit SLAM.
arXiv Detail & Related papers (2024-11-06T02:05:44Z) - Geometry-aware Reconstruction and Fusion-refined Rendering for Generalizable Neural Radiance Fields [18.474371929572918]
Generalizable NeRF aims to synthesize novel views for unseen scenes.
We introduce an Adaptive Cost Aggregation (ACA) approach to amplify the contribution of consistent pixel pairs.
We observe the two existing decoding strategies excel in different areas, which are complementary.
arXiv Detail & Related papers (2024-04-26T16:46:28Z) - Constrained 6-DoF Grasp Generation on Complex Shapes for Improved Dual-Arm Manipulation [11.048436789482189]
We propose CGDF: Constrained Grasp Diffusion Fields, a diffusion-based grasp generative model that generalizes to objects with arbitrary geometries.
We show that our method can generalize to generate stable grasps on complex objects, especially useful for dual-arm manipulation settings.
arXiv Detail & Related papers (2024-04-06T14:28:01Z) - Distributional Reduction: Unifying Dimensionality Reduction and Clustering with Gromov-Wasserstein [56.62376364594194]
Unsupervised learning aims to capture the underlying structure of potentially large and high-dimensional datasets.
In this work, we revisit these approaches under the lens of optimal transport and exhibit relationships with the Gromov-Wasserstein problem.
This unveils a new general framework, called distributional reduction, that recovers DR and clustering as special cases and allows addressing them jointly within a single optimization problem.
arXiv Detail & Related papers (2024-02-03T19:00:19Z) - Dynamic Hyperbolic Attention Network for Fine Hand-object Reconstruction [76.5549647815413]
We propose the first precise hand-object reconstruction method in hyperbolic space, namely Dynamic Hyperbolic Attention Network (DHANet)
Our method learns mesh features with rich geometry-image multi-modal information and models better hand-object interaction.
arXiv Detail & Related papers (2023-09-06T13:00:10Z) - ReCLIP: A Strong Zero-Shot Baseline for Referring Expression
Comprehension [114.85628613911713]
Large-scale pre-trained models are useful for image classification across domains.
We present ReCLIP, a simple but strong zero-shot baseline that repurposes CLIP, a state-of-the-art large-scale model, for ReC.
arXiv Detail & Related papers (2022-04-12T17:55:38Z) - SO-SLAM: Semantic Object SLAM with Scale Proportional and Symmetrical
Texture Constraints [9.694083816665525]
This paper proposes a novel monocular Semantic Object SLAM (SO-SLAM) system that addresses the introduction of object spatial constraints.
We have verified the performance of the algorithm on the public datasets and an author-recorded mobile robot dataset.
arXiv Detail & Related papers (2021-09-10T13:55:37Z) - Cogradient Descent for Bilinear Optimization [124.45816011848096]
We introduce a Cogradient Descent algorithm (CoGD) to address the bilinear problem.
We solve one variable by considering its coupling relationship with the other, leading to a synchronous gradient descent.
Our algorithm is applied to solve problems with one variable under the sparsity constraint.
arXiv Detail & Related papers (2020-06-16T13:41:54Z) - Dense Non-Rigid Structure from Motion: A Manifold Viewpoint [162.88686222340962]
Non-Rigid Structure-from-Motion (NRSfM) problem aims to recover 3D geometry of a deforming object from its 2D feature correspondences across multiple frames.
We show that our approach significantly improves accuracy, scalability, and robustness against noise.
arXiv Detail & Related papers (2020-06-15T09:15:54Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.