Related papers: BoxGraph: Semantic Place Recognition and Pose Estimation from 3D LiDAR

BoxGraph: Semantic Place Recognition and Pose Estimation from 3D LiDAR

URL: http://arxiv.org/abs/2206.15154v1
Date: Thu, 30 Jun 2022 09:39:08 GMT
Title: BoxGraph: Semantic Place Recognition and Pose Estimation from 3D LiDAR
Authors: Georgi Pramatarov, Daniele De Martini, Matthew Gadd, Paul Newman
Abstract summary: We model 3D point clouds as fully-connected graphs of semantically identified components. Optimal association across graphs allows for full 6-Degree-of-Freedom (DoF) pose estimation and place recognition. This representation is very concise, condensing the size of maps by a factor of 25 against the state-of-the-art.
Score: 22.553026961366005
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: This paper is about extremely robust and lightweight localisation using LiDAR point clouds based on instance segmentation and graph matching. We model 3D point clouds as fully-connected graphs of semantically identified components where each vertex corresponds to an object instance and encodes its shape. Optimal vertex association across graphs allows for full 6-Degree-of-Freedom (DoF) pose estimation and place recognition by measuring similarity. This representation is very concise, condensing the size of maps by a factor of 25 against the state-of-the-art, requiring only 3kB to represent a 1.4MB laser scan. We verify the efficacy of our system on the SemanticKITTI dataset, where we achieve a new state-of-the-art in place recognition, with an average of 88.4% recall at 100% precision where the next closest competitor follows with 64.9%. We also show accurate metric pose estimation performance - estimating 6-DoF pose with median errors of 10 cm and 0.33 deg.

Related papers

Map2Thought: Explicit 3D Spatial Reasoning via Metric Cognitive Maps [35.51348819617679]
Map2Thought is a framework that enables explicit and interpretable spatial reasoning for 3D VLMs.<n>Metric Cognitive Map (Metric-CogMap) and Cognitive Chain-of-Thought (Cog-CoT) are key components of the framework.<n>We show that Map2Thought enables explainable 3D understanding, achieving 59.9% accuracy using only half the supervision.
arXiv Detail & Related papers (2026-01-16T17:02:46Z)
iMatcher: Improve matching in point cloud registration via local-to-global geometric consistency learning [2.3985192761907643]
iMatcher is a framework for feature matching in point cloud registration.<n>It uses both local and global consistency to predict a point-wise matching probability.<n>It achieves state-of-the-art inlier ratios, scoring 95% - 97% on KITTI, 94% - 97% on KITTI-360, and up to 81.1% on 3DMatch.
arXiv Detail & Related papers (2025-09-10T20:25:57Z)
Zero-shot Inexact CAD Model Alignment from a Single Image [53.37898107159792]
A practical approach to infer 3D scene structure from a single image is to retrieve a closely matching 3D model from a database and align it with the object in the image.<n>Existing methods rely on supervised training with images and pose annotations, which limits them to a narrow set of object categories.<n>We propose a weakly supervised 9-DoF alignment method for inexact 3D models that requires no pose annotations and generalizes to unseen categories.
arXiv Detail & Related papers (2025-07-04T04:46:59Z)
Dense 3D Displacement Estimation for Landslide Monitoring via Fusion of TLS Point Clouds and Embedded RGB Images [7.144866519844918]
Landslide monitoring is essential for understanding geohazards and mitigating associated risks.<n>Existing point cloud-based methods typically rely on either geometric or radiometric information.<n>We propose a hierarchical partition-based coarse-to-fine approach that fuses 3D point clouds and co-registered RGB images.
arXiv Detail & Related papers (2025-06-19T12:28:09Z)
On-the-fly Point Feature Representation for Point Clouds Analysis [7.074010861305738]
We propose On-the-fly Point Feature Representation (OPFR), which captures abundant geometric information explicitly through Curve Feature Generator module. We also introduce the Local Reference Constructor module, which approximates the local coordinate systems based on triangle sets. OPFR only requires extra 1.56ms for inference (65x faster than vanilla PFH) and 0.012M more parameters, and it can serve as a versatile plug-and-play module for various backbones.
arXiv Detail & Related papers (2024-07-31T04:57:06Z)
That's My Point: Compact Object-centric LiDAR Pose Estimation for Large-scale Outdoor Localisation [18.26335698291226]
This paper is about 3D pose estimation on LiDAR scans with extremely minimal storage requirements. We achieve this by clustering all points of segmented scans into semantic objects and representing them only with their respective centroid and semantic class. We achieve accurate metric estimates comparable with state-of-the-art methods with almost half the representation size.
arXiv Detail & Related papers (2024-03-07T18:55:30Z)
Whole-body Detection, Recognition and Identification at Altitude and Range [57.445372305202405]
We propose an end-to-end system evaluated on diverse datasets. Our approach involves pre-training the detector on common image datasets and fine-tuning it on BRIAR's complex videos and images. We conduct thorough evaluations under various conditions, such as different ranges and angles in indoor, outdoor, and aerial scenarios.
arXiv Detail & Related papers (2023-11-09T20:20:23Z)
Volumetric Semantically Consistent 3D Panoptic Mapping [77.13446499924977]
We introduce an online 2D-to-3D semantic instance mapping algorithm aimed at generating semantic 3D maps suitable for autonomous agents in unstructured environments. It introduces novel ways of integrating semantic prediction confidence during mapping, producing semantic and instance-consistent 3D regions. The proposed method achieves accuracy superior to the state of the art on public large-scale datasets, improving on a number of widely used metrics.
arXiv Detail & Related papers (2023-09-26T08:03:10Z)
LFM-3D: Learnable Feature Matching Across Wide Baselines Using 3D Signals [9.201550006194994]
Learnable matchers often underperform when there exists only small regions of co-visibility between image pairs. We propose LFM-3D, a Learnable Feature Matching framework that uses models based on graph neural networks. We show that the resulting improved correspondences lead to much higher relative posing accuracy for in-the-wild image pairs.
arXiv Detail & Related papers (2023-03-22T17:46:27Z)
A Large Scale Homography Benchmark [52.55694707744518]
We present a large-scale dataset of Planes in 3D, Pi3D, of roughly 1000 planes observed in 10 000 images from the 1DSfM dataset. We also present HEB, a large-scale homography estimation benchmark leveraging Pi3D.
arXiv Detail & Related papers (2023-02-20T14:18:09Z)
Contour Context: Abstract Structural Distribution for 3D LiDAR Loop Detection and Metric Pose Estimation [31.968749056155467]
This paper proposes a simple, effective, and efficient topological loop closure detection pipeline with accurate 3-DoF metric pose estimation. We interpret the Cartesian birds' eye view (BEV) image projected from 3D LiDAR points as layered distribution of structures. A retrieval key is designed to accelerate the search of a database indexed by layered KD-trees.
arXiv Detail & Related papers (2023-02-13T07:18:24Z)
Lepard: Learning partial point cloud matching in rigid and deformable scenes [73.45277809052928]
Lepard is a Learning based approach for partial point cloud matching for rigid and deformable scenes. For rigid point cloud matching, Lepard sets a new state-of-the-art on the 3DMatch / 3DLoMatch benchmarks with 93.6% / 69.0% registration recall. In deformable cases, Lepard achieves +27.1% / +34.8% higher non-rigid feature matching recall than the prior art on our newly constructed 4DMatch / 4DLoMatch benchmark.
arXiv Detail & Related papers (2021-11-24T16:09:29Z)
Category-Level Metric Scale Object Shape and Pose Estimation [73.92460712829188]
We propose a framework that jointly estimates a metric scale shape and pose from a single RGB image. We validated our method on both synthetic and real-world datasets to evaluate category-level object pose and shape.
arXiv Detail & Related papers (2021-09-01T12:16:46Z)
Semantic Graph Based Place Recognition for 3D Point Clouds [22.608115489674653]
This paper presents a novel semantic graph based approach for place recognition. First, we propose a novel semantic graph representation for the point cloud scenes. We then design a fast and effective graph similarity network to compute the similarity.
arXiv Detail & Related papers (2020-08-26T09:27:26Z)
PerMO: Perceiving More at Once from a Single Image for Autonomous Driving [76.35684439949094]
We present a novel approach to detect, segment, and reconstruct complete textured 3D models of vehicles from a single image. Our approach combines the strengths of deep learning and the elegance of traditional techniques. We have integrated these algorithms with an autonomous driving system.
arXiv Detail & Related papers (2020-07-16T05:02:45Z)

This list is automatically generated from the titles and abstracts of the papers in this site.