GOTPR: General Outdoor Text-based Place Recognition Using Scene Graph Retrieval with OpenStreetMap
- URL: http://arxiv.org/abs/2501.08575v2
- Date: Thu, 22 May 2025 11:48:18 GMT
- Title: GOTPR: General Outdoor Text-based Place Recognition Using Scene Graph Retrieval with OpenStreetMap
- Authors: Donghwi Jung, Keonwoo Kim, Seong-Woo Kim,
- Abstract summary: We propose GOTPR, a robust place recognition method designed for outdoor environments where GPS signals are unavailable.<n>Unlike existing approaches that use point cloud maps, which are large and difficult to store, GOTPR leverages scene graphs generated from text descriptions and maps for place recognition.<n>In city-scale tests, it completed processing within a few seconds, making it highly practical for real-world robotics applications.
- Score: 4.51019574688293
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: We propose GOTPR, a robust place recognition method designed for outdoor environments where GPS signals are unavailable. Unlike existing approaches that use point cloud maps, which are large and difficult to store, GOTPR leverages scene graphs generated from text descriptions and maps for place recognition. This method improves scalability by replacing point clouds with compact data structures, allowing robots to efficiently store and utilize extensive map data. In addition, GOTPR eliminates the need for custom map creation by using publicly available OpenStreetMap data, which provides global spatial information. We evaluated its performance using the KITTI360Pose dataset with corresponding OpenStreetMap data, comparing it to existing point cloud-based place recognition methods. The results show that GOTPR achieves comparable accuracy while significantly reducing storage requirements. In city-scale tests, it completed processing within a few seconds, making it highly practical for real-world robotics applications. More information can be found at https://donghwijung.github.io/GOTPR_page/.
Related papers
- FlexCloud: Direct, Modular Georeferencing and Drift-Correction of Point Cloud Maps [0.7421845364041001]
We propose FlexCloud for an automatic georeferencing of point cloud maps created from SLAM.
Our approach is designed to work modularly with different SLAM methods, utilizing only the generated local point cloud map.
Our approach enables the creation of consistent, globally referenced point cloud maps from data collected by a mobile mapping system.
arXiv Detail & Related papers (2025-02-01T10:56:05Z) - R-SCoRe: Revisiting Scene Coordinate Regression for Robust Large-Scale Visual Localization [66.87005863868181]
We introduce a covisibility graph-based global encoding learning and data augmentation strategy.
We revisit the network architecture and local feature extraction module.
Our method achieves state-of-the-art on challenging large-scale datasets without relying on network ensembles or 3D supervision.
arXiv Detail & Related papers (2025-01-02T18:59:08Z) - MapQaTor: An Extensible Framework for Efficient Annotation of Map-Based QA Datasets [3.3856216159724983]
We introduce MapQaTor, an open-source framework that streamlines the creation of traceable map-based QA datasets.<n>MapQaTor enables seamless integration with any maps API, allowing users to gather and visualize data from diverse sources.
arXiv Detail & Related papers (2024-12-30T15:33:19Z) - PointRegGPT: Boosting 3D Point Cloud Registration using Generative Point-Cloud Pairs for Training [90.06520673092702]
We present PointRegGPT, boosting 3D point cloud registration using generative point-cloud pairs for training.
To our knowledge, this is the first generative approach that explores realistic data generation for indoor point cloud registration.
arXiv Detail & Related papers (2024-07-19T06:29:57Z) - PRISM-TopoMap: Online Topological Mapping with Place Recognition and Scan Matching [42.74395278382559]
This paper introduces PRISM-TopoMap -- a topological mapping method that maintains a graph of locally aligned locations.
The proposed method involves learnable multimodal place recognition paired with the scan matching pipeline for localization and loop closure.
We conduct a broad experimental evaluation of the suggested approach in a range of photo-realistic environments and on a real robot.
arXiv Detail & Related papers (2024-04-02T06:25:16Z) - DUFOMap: Efficient Dynamic Awareness Mapping [3.3580006471376205]
The dynamic nature of the real world is one of the main challenges in robotics.
Current solutions are often applied in post-processing, where parameter tuning allows the user to adjust the setting for a specific dataset.
We propose DUFOMap, a novel dynamic awareness mapping framework designed for efficient online processing.
arXiv Detail & Related papers (2024-03-03T09:07:16Z) - Loopy-SLAM: Dense Neural SLAM with Loop Closures [53.11936461015725]
We introduce Loopy-SLAM that globally optimize poses and the dense 3D model.
We use frame-to-model tracking using a data-driven point-based submap generation method and trigger loop closures online by performing global place recognition.
Evaluation on the synthetic Replica and real-world TUM-RGBD and ScanNet datasets demonstrate competitive or superior performance in tracking, mapping, and rendering accuracy when compared to existing dense neural RGBD SLAM methods.
arXiv Detail & Related papers (2024-02-14T18:18:32Z) - Neural Implicit Dense Semantic SLAM [83.04331351572277]
We propose a novel RGBD vSLAM algorithm that learns a memory-efficient, dense 3D geometry, and semantic segmentation of an indoor scene in an online manner.
Our pipeline combines classical 3D vision-based tracking and loop closing with neural fields-based mapping.
Our proposed algorithm can greatly enhance scene perception and assist with a range of robot control problems.
arXiv Detail & Related papers (2023-04-27T23:03:52Z) - HPointLoc: Point-based Indoor Place Recognition using Synthetic RGB-D
Images [58.720142291102135]
We present a novel dataset named as HPointLoc, specially designed for exploring capabilities of visual place recognition in indoor environment.
The dataset is based on the popular Habitat simulator, in which it is possible to generate indoor scenes using both own sensor data and open datasets.
arXiv Detail & Related papers (2022-12-30T12:20:56Z) - A Map-matching Algorithm with Extraction of Multi-group Information for
Low-frequency Data [9.476212160807549]
This paper designs a new map-matching method to make full use of "Big data"
We sort all data into four groups according to their spatial and temporal distance from the present matching probe.
We use a modified top-K shortest-path method to search the candidate paths within an ellipse region and then use the fused score to infer the path.
arXiv Detail & Related papers (2022-09-18T08:09:17Z) - Robust Self-Tuning Data Association for Geo-Referencing Using Lane Markings [44.4879068879732]
This paper presents a complete pipeline for resolving ambiguities during the data association.
Its core is a robust self-tuning data association that adapts the search area depending on the entropy of the measurements.
We evaluate our method on real data from urban and rural scenarios around the city of Karlsruhe in Germany.
arXiv Detail & Related papers (2022-07-28T12:29:39Z) - Long-term Visual Map Sparsification with Heterogeneous GNN [47.12309045366042]
In this paper, we aim to overcome the environmental changes and reduce the map size at the same time by selecting points that are valuable to future localization.
Inspired by the recent progress in Graph Neural Network(GNN), we propose the first work that models SfM maps as heterogeneous graphs and predicts 3D point importance scores with a GNN.
Two novel supervisions are proposed: 1) a data-fitting term for selecting valuable points to future localization based on training queries; 2) a K-Cover term for selecting sparse points with full map coverage.
arXiv Detail & Related papers (2022-03-29T01:46:12Z) - GANmapper: geographical content filling [0.0]
We present a new method to create spatial data using a generative adversarial network (GAN)
Our contribution uses coarse and widely available geospatial data to create maps of less available features at the finer scale in the built environment.
We employ land use data and road networks as input to generate building footprints, and conduct experiments in 9 cities around the world.
arXiv Detail & Related papers (2021-08-07T05:50:54Z) - Towards Semantic Segmentation of Urban-Scale 3D Point Clouds: A Dataset,
Benchmarks and Challenges [52.624157840253204]
We present an urban-scale photogrammetric point cloud dataset with nearly three billion richly annotated points.
Our dataset consists of large areas from three UK cities, covering about 7.6 km2 of the city landscape.
We evaluate the performance of state-of-the-art algorithms on our dataset and provide a comprehensive analysis of the results.
arXiv Detail & Related papers (2020-09-07T14:47:07Z) - Zero-Shot Multi-View Indoor Localization via Graph Location Networks [66.05980368549928]
indoor localization is a fundamental problem in location-based applications.
We propose a novel neural network based architecture Graph Location Networks (GLN) to perform infrastructure-free, multi-view image based indoor localization.
GLN makes location predictions based on robust location representations extracted from images through message-passing networks.
We introduce a novel zero-shot indoor localization setting and tackle it by extending the proposed GLN to a dedicated zero-shot version.
arXiv Detail & Related papers (2020-08-06T07:36:55Z) - Rethinking Localization Map: Towards Accurate Object Perception with
Self-Enhancement Maps [78.2581910688094]
This work introduces a novel self-enhancement method to harvest accurate object localization maps and object boundaries with only category labels as supervision.
In particular, the proposed Self-Enhancement Maps achieve the state-of-the-art localization accuracy of 54.88% on ILSVRC.
arXiv Detail & Related papers (2020-06-09T12:35:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.