Local positional graphs and attentive local features for a data and   runtime-efficient hierarchical place recognition pipeline
        - URL: http://arxiv.org/abs/2403.10283v1
- Date: Fri, 15 Mar 2024 13:26:39 GMT
- Title: Local positional graphs and attentive local features for a data and   runtime-efficient hierarchical place recognition pipeline
- Authors: Fangming Yuan, Stefan Schubert, Peter Protzel, Peer Neubert, 
- Abstract summary: This paper proposes a runtime and data-efficient hierarchical VPR pipeline that extends existing approaches and presents novel ideas.
First, we propose Local Positional Graphs (LPG), a training-free and runtime-efficient approach to encode spatial context information of local image features.
Second, we present Attentive Local SPED (ATLAS), an extension of our previous local features approach with an attention module.
Third, we present a hierarchical pipeline that exploits hyperdimensional computing to use the same local features as holistic HDC-descriptors for fast candidate selection and for candidate reranking.
- Score: 11.099588962062937
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract:   Large-scale applications of Visual Place Recognition (VPR) require computationally efficient approaches. Further, a well-balanced combination of data-based and training-free approaches can decrease the required amount of training data and effort and can reduce the influence of distribution shifts between the training and application phases. This paper proposes a runtime and data-efficient hierarchical VPR pipeline that extends existing approaches and presents novel ideas. There are three main contributions: First, we propose Local Positional Graphs (LPG), a training-free and runtime-efficient approach to encode spatial context information of local image features. LPG can be combined with existing local feature detectors and descriptors and considerably improves the image-matching quality compared to existing techniques in our experiments. Second, we present Attentive Local SPED (ATLAS), an extension of our previous local features approach with an attention module that improves the feature quality while maintaining high data efficiency. The influence of the proposed modifications is evaluated in an extensive ablation study. Third, we present a hierarchical pipeline that exploits hyperdimensional computing to use the same local features as holistic HDC-descriptors for fast candidate selection and for candidate reranking. We combine all contributions in a runtime and data-efficient VPR pipeline that shows benefits over the state-of-the-art method Patch-NetVLAD on a large collection of standard place recognition datasets with 15$\%$ better performance in VPR accuracy, 54$\times$ faster feature comparison speed, and 55$\times$ less descriptor storage occupancy, making our method promising for real-world high-performance large-scale VPR in changing environments. Code will be made available with publication of this paper. 
 
      
        Related papers
        - SelaVPR++: Towards Seamless Adaptation of Foundation Models for   Efficient Place Recognition [69.58329995485158]
 Recent studies show that the visual place recognition (VPR) method using pre-trained visual foundation models can achieve promising performance.
We propose a novel method to realize seamless adaptation of foundation models to VPR.
In pursuit of higher efficiency and better performance, we propose an extension of the SelaVPR, called SelaVPR++.
 arXiv  Detail & Related papers  (2025-02-23T15:01:09Z)
- VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place   Recognition [23.173085268845384]
 This paper introduces VLAD-BuFF, a self-similarity based feature discounting mechanism to learn Burst-aware features within end-to-end VPR training.
We benchmark our method on 9 public datasets, where VLAD-BuFF sets a new state of the art.
Our method is able to maintain its high recall even for 12x reduced local feature dimensions, thus enabling fast feature aggregation without compromising on recall.
 arXiv  Detail & Related papers  (2024-09-28T09:44:08Z)
- Structured Pruning for Efficient Visual Place Recognition [24.433604332415204]
 Visual Place Recognition (VPR) is fundamental for the global re-localization of robots and devices.
Our work introduces a novel structured pruning method to streamline common VPR architectures.
This dual focus significantly enhances the efficiency of the system, reducing both map and model memory requirements and decreasing feature extraction and retrieval latencies.
 arXiv  Detail & Related papers  (2024-09-12T08:32:25Z)
- FUSELOC: Fusing Global and Local Descriptors to Disambiguate 2D-3D   Matching in Visual Localization [57.59857784298536]
 Direct 2D-3D matching algorithms require significantly less memory but suffer from lower accuracy due to the larger and more ambiguous search space.
We address this ambiguity by fusing local and global descriptors using a weighted average operator within a 2D-3D search framework.
We consistently improve the accuracy over local-only systems and achieve performance close to hierarchical methods while halving memory requirements.
 arXiv  Detail & Related papers  (2024-08-21T23:42:16Z)
- AIR-HLoc: Adaptive Retrieved Images Selection for Efficient Visual   Localisation [8.789742514363777]
 State-of-the-art hierarchical localisation pipelines (HLoc) employ image retrieval (IR) to establish 2D-3D correspondences.
This paper investigates the relationship between global and local descriptors.
We propose an adaptive strategy that adjusts $k$ based on the similarity between the query's global descriptor and those in the database.
 arXiv  Detail & Related papers  (2024-03-27T06:17:21Z)
- Deep Homography Estimation for Visual Place Recognition [49.235432979736395]
 We propose a transformer-based deep homography estimation (DHE) network.
It takes the dense feature map extracted by a backbone network as input and fits homography for fast and learnable geometric verification.
Experiments on benchmark datasets show that our method can outperform several state-of-the-art methods.
 arXiv  Detail & Related papers  (2024-02-25T13:22:17Z)
- Towards Seamless Adaptation of Pre-trained Models for Visual Place   Recognition [72.35438297011176]
 We propose a novel method to realize seamless adaptation of pre-trained models for visual place recognition (VPR)
Specifically, to obtain both global and local features that focus on salient landmarks for discriminating places, we design a hybrid adaptation method.
 Experimental results show that our method outperforms the state-of-the-art methods with less training data and training time.
 arXiv  Detail & Related papers  (2024-02-22T12:55:01Z)
- Optimal Transport Aggregation for Visual Place Recognition [9.192660643226372]
 We introduce SALAD, which reformulates NetVLAD's soft-assignment of local features to clusters as an optimal transport problem.
In SALAD, we consider both feature-to-cluster and cluster-to-feature relations and we also introduce a 'dustbin' cluster, designed to selectively discard features deemed non-informative.
Our single-stage method surpasses single-stage baselines in public VPR datasets, but also surpasses two-stage methods that add a re-ranking with significantly higher cost.
 arXiv  Detail & Related papers  (2023-11-27T15:46:19Z)
- AANet: Aggregation and Alignment Network with Semi-hard Positive Sample
  Mining for Hierarchical Place Recognition [48.043749855085025]
 Visual place recognition (VPR) is one of the research hotspots in robotics, which uses visual information to locate robots.
We present a unified network capable of extracting global features for retrieving candidates via an aggregation module.
We also propose a Semi-hard Positive Sample Mining (ShPSM) strategy to select appropriate hard positive images for training more robust VPR networks.
 arXiv  Detail & Related papers  (2023-10-08T14:46:11Z)
- Local Augmentation for Graph Neural Networks [78.48812244668017]
 We introduce the local augmentation, which enhances node features by its local subgraph structures.
Based on the local augmentation, we further design a novel framework: LA-GNN, which can apply to any GNN models in a plug-and-play manner.
 arXiv  Detail & Related papers  (2021-09-08T18:10:08Z)
- Collaborative Training between Region Proposal Localization and
  Classification for Domain Adaptive Object Detection [121.28769542994664]
 Domain adaptation for object detection tries to adapt the detector from labeled datasets to unlabeled ones for better performance.
In this paper, we are the first to reveal that the region proposal network (RPN) and region proposal classifier(RPC) demonstrate significantly different transferability when facing large domain gap.
 arXiv  Detail & Related papers  (2020-09-17T07:39:52Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.