Long-term Visual Map Sparsification with Heterogeneous GNN
- URL: http://arxiv.org/abs/2203.15182v1
- Date: Tue, 29 Mar 2022 01:46:12 GMT
- Title: Long-term Visual Map Sparsification with Heterogeneous GNN
- Authors: Ming-Fang Chang, Yipu Zhao, Rajvi Shah, Jakob J. Engel, Michael Kaess,
and Simon Lucey
- Abstract summary: In this paper, we aim to overcome the environmental changes and reduce the map size at the same time by selecting points that are valuable to future localization.
Inspired by the recent progress in Graph Neural Network(GNN), we propose the first work that models SfM maps as heterogeneous graphs and predicts 3D point importance scores with a GNN.
Two novel supervisions are proposed: 1) a data-fitting term for selecting valuable points to future localization based on training queries; 2) a K-Cover term for selecting sparse points with full map coverage.
- Score: 47.12309045366042
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We address the problem of map sparsification for long-term visual
localization. For map sparsification, a commonly employed assumption is that
the pre-build map and the later captured localization query are consistent.
However, this assumption can be easily violated in the dynamic world.
Additionally, the map size grows as new data accumulate through time, causing
large data overhead in the long term. In this paper, we aim to overcome the
environmental changes and reduce the map size at the same time by selecting
points that are valuable to future localization. Inspired by the recent
progress in Graph Neural Network(GNN), we propose the first work that models
SfM maps as heterogeneous graphs and predicts 3D point importance scores with a
GNN, which enables us to directly exploit the rich information in the SfM map
graph. Two novel supervisions are proposed: 1) a data-fitting term for
selecting valuable points to future localization based on training queries; 2)
a K-Cover term for selecting sparse points with full map coverage. The
experiments show that our method selected map points on stable and widely
visible structures and outperformed baselines in localization performance.
Related papers
- Diffusion-based Data Augmentation for Object Counting Problems [62.63346162144445]
We develop a pipeline that utilizes a diffusion model to generate extensive training data.
We are the first to generate images conditioned on a location dot map with a diffusion model.
Our proposed counting loss for the diffusion model effectively minimizes the discrepancies between the location dot map and the crowd images generated.
arXiv Detail & Related papers (2024-01-25T07:28:22Z) - Grid Cell-Inspired Fragmentation and Recall for Efficient Map Building [29.630483662400444]
We propose and apply the concept of Fragmentation-and-Recall (FARMap) in the mapping of large spaces.
Agents solve the mapping problem by building local maps via a surprisal-based clustering of space.
We demonstrate that FARMap replicates the fragmentation points observed in animal studies.
arXiv Detail & Related papers (2023-07-11T20:40:19Z) - Neural Map Prior for Autonomous Driving [17.198729798817094]
High-definition (HD) semantic maps are crucial in enabling autonomous vehicles to navigate urban environments.
Traditional method of creating offline HD maps involves labor-intensive manual annotation processes.
Recent studies have proposed an alternative approach that generates local maps using online sensor observations.
In this study, we propose Neural Map Prior (NMP), a neural representation of global maps.
arXiv Detail & Related papers (2023-04-17T17:58:40Z) - Efficient Map Sparsification Based on 2D and 3D Discretized Grids [47.22997560184043]
As a map grows larger, more memory is required and localization becomes inefficient.
Previous map sparsification methods add a quadratic term in mixed-integer programming to enforce a uniform distribution of selected landmarks.
In this paper, we formulate map sparsification in an efficient linear form and select uniformly distributed landmarks based on 2D discretized grids.
arXiv Detail & Related papers (2023-03-20T05:49:14Z) - Graph-PCNN: Two Stage Human Pose Estimation with Graph Pose Refinement [54.29252286561449]
We propose a two-stage graph-based and model-agnostic framework, called Graph-PCNN.
In the first stage, heatmap regression network is applied to obtain a rough localization result, and a set of proposal keypoints, called guided points, are sampled.
In the second stage, for each guided point, different visual feature is extracted by the localization.
The relationship between guided points is explored by the graph pose refinement module to get more accurate localization results.
arXiv Detail & Related papers (2020-07-21T04:59:15Z) - Rethinking Localization Map: Towards Accurate Object Perception with
Self-Enhancement Maps [78.2581910688094]
This work introduces a novel self-enhancement method to harvest accurate object localization maps and object boundaries with only category labels as supervision.
In particular, the proposed Self-Enhancement Maps achieve the state-of-the-art localization accuracy of 54.88% on ILSVRC.
arXiv Detail & Related papers (2020-06-09T12:35:55Z) - Structural Temporal Graph Neural Networks for Anomaly Detection in
Dynamic Graphs [54.13919050090926]
We propose an end-to-end structural temporal Graph Neural Network model for detecting anomalous edges in dynamic graphs.
In particular, we first extract the $h$-hop enclosing subgraph centered on the target edge and propose the node labeling function to identify the role of each node in the subgraph.
Based on the extracted features, we utilize Gated recurrent units (GRUs) to capture the temporal information for anomaly detection.
arXiv Detail & Related papers (2020-05-15T09:17:08Z) - GPS-Net: Graph Property Sensing Network for Scene Graph Generation [91.60326359082408]
Scene graph generation (SGG) aims to detect objects in an image along with their pairwise relationships.
GPS-Net fully explores three properties for SGG: edge direction information, the difference in priority between nodes, and the long-tailed distribution of relationships.
GPS-Net achieves state-of-the-art performance on three popular databases: VG, OI, and VRD by significant gains under various settings and metrics.
arXiv Detail & Related papers (2020-03-29T07:22:31Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.