Improving Visual Place Recognition with Sequence-Matching Receptiveness Prediction
- URL: http://arxiv.org/abs/2503.06840v1
- Date: Mon, 10 Mar 2025 02:01:24 GMT
- Title: Improving Visual Place Recognition with Sequence-Matching Receptiveness Prediction
- Authors: Somayeh Hussaini, Tobias Fischer, Michael Milford,
- Abstract summary: We present a new supervised learning approach that learns to predict the per-frame sequence matching receptiveness (SMR) of VPR techniques.<n>Our approach significantly improves VPR performance across a large range of state-of-the-art and classical VPR techniques.
- Score: 19.577433371468533
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In visual place recognition (VPR), filtering and sequence-based matching approaches can improve performance by integrating temporal information across image sequences, especially in challenging conditions. While these methods are commonly applied, their effects on system behavior can be unpredictable and can actually make performance worse in certain situations. In this work, we present a new supervised learning approach that learns to predict the per-frame sequence matching receptiveness (SMR) of VPR techniques, enabling the system to selectively decide when to trust the output of a sequence matching system. The approach is agnostic to the underlying VPR technique. Our approach predicts SMR-and hence significantly improves VPR performance-across a large range of state-of-the-art and classical VPR techniques (namely CosPlace, MixVPR, EigenPlaces, SALAD, AP-GeM, NetVLAD and SAD), and across three benchmark VPR datasets (Nordland, Oxford RobotCar, and SFU-Mountain). We also provide insights into a complementary approach that uses the predictor to replace discarded matches, as well as ablation studies, including an analysis of the interactions between our SMR predictor and the selected sequence length. We will release our code upon acceptance.
Related papers
- To Match or Not to Match: Revisiting Image Matching for Reliable Visual Place Recognition [4.008780119020479]
We show that modern retrieval systems often reach a point where re-ranking can degrade results, as current VPR datasets are largely saturated.
We propose using image matching as a verification step to assess retrieval confidence, demonstrating that inlier counts can reliably predict when re-ranking is beneficial.
arXiv Detail & Related papers (2025-04-08T15:10:10Z) - SelaVPR++: Towards Seamless Adaptation of Foundation Models for Efficient Place Recognition [69.58329995485158]
Recent studies show that the visual place recognition (VPR) method using pre-trained visual foundation models can achieve promising performance.<n>We propose a novel method to realize seamless adaptation of foundation models to VPR.<n>In pursuit of higher efficiency and better performance, we propose an extension of the SelaVPR, called SelaVPR++.
arXiv Detail & Related papers (2025-02-23T15:01:09Z) - Improving Adversarial Robustness of Masked Autoencoders via Test-time
Frequency-domain Prompting [133.55037976429088]
We investigate the adversarial robustness of vision transformers equipped with BERT pretraining (e.g., BEiT, MAE)
A surprising observation is that MAE has significantly worse adversarial robustness than other BERT pretraining methods.
We propose a simple yet effective way to boost the adversarial robustness of MAE.
arXiv Detail & Related papers (2023-08-20T16:27:17Z) - Unsupervised Quality Prediction for Improved Single-Frame and Weighted
Sequential Visual Place Recognition [20.737660223671003]
We present a new, training-free approach to predicting the likely quality of localization estimates.
We use these predictions to bias a sequence-matching process to produce additional performance gains.
Our system is lightweight, runs in real-time and is agnostic to the underlying VPR technique.
arXiv Detail & Related papers (2023-07-04T03:53:05Z) - A-MuSIC: An Adaptive Ensemble System For Visual Place Recognition In
Changing Environments [22.58641358408613]
Visual place recognition (VPR) is an essential component of robot navigation and localization systems.
No single VPR technique excels in every environmental condition.
adaptive VPR system dubbed Adaptive Multi-Self Identification and Correction (A-MuSIC)
A-MuSIC matches or beats state-of-the-art VPR performance across all tested benchmark datasets.
arXiv Detail & Related papers (2023-03-24T19:25:22Z) - Large-Scale Sequential Learning for Recommender and Engineering Systems [91.3755431537592]
In this thesis, we focus on the design of an automatic algorithms that provide personalized ranking by adapting to the current conditions.
For the former, we propose novel algorithm called SAROS that take into account both kinds of feedback for learning over the sequence of interactions.
The proposed idea of taking into account the neighbour lines shows statistically significant results in comparison with the initial approach for faults detection in power grid.
arXiv Detail & Related papers (2022-05-13T21:09:41Z) - Consistency Regularization for Deep Face Anti-Spoofing [69.70647782777051]
Face anti-spoofing (FAS) plays a crucial role in securing face recognition systems.
Motivated by this exciting observation, we conjecture that encouraging feature consistency of different views may be a promising way to boost FAS models.
We enhance both Embedding-level and Prediction-level Consistency Regularization (EPCR) in FAS.
arXiv Detail & Related papers (2021-11-24T08:03:48Z) - Sequence-Based Filtering for Visual Route-Based Navigation: Analysing
the Benefits, Trade-offs and Design Choices [17.48671856442762]
An emerging trend in Visual Place Recognition (VPR) is the use of sequence-based filtering methods on top of single-frame-based place matching techniques.
This paper conducts an in-depth investigation of the relationship between the performance of single-frame-based place matching techniques and the use of sequence-based filtering on top of those methods.
arXiv Detail & Related papers (2021-03-02T19:24:58Z) - Inter-class Discrepancy Alignment for Face Recognition [55.578063356210144]
We propose a unified framework calledInter-class DiscrepancyAlignment(IDA)
IDA-DAO is used to align the similarity scores considering the discrepancy between the images and its neighbors.
IDA-SSE can provide convincing inter-class neighbors by introducing virtual candidate images generated with GAN.
arXiv Detail & Related papers (2021-03-02T08:20:08Z) - Improving Visual Place Recognition Performance by Maximising
Complementarity [22.37892767050086]
This paper investigates the complementarity of state-of-the-art VPR methods systematically for the first time.
It identifies those combinations which can result in better performance.
Results are presented for eight state-of-the-art VPR methods on ten widely-used VPR datasets.
arXiv Detail & Related papers (2021-02-16T19:18:33Z) - ConvSequential-SLAM: A Sequence-based, Training-less Visual Place
Recognition Technique for Changing Environments [19.437998213418446]
Visual Place Recognition (VPR) is the ability to correctly recall a previously visited place under changing viewpoints and appearances.
We present a new handcrafted VPR technique that achieves state-of-the-art place matching performance under challenging conditions.
arXiv Detail & Related papers (2020-09-28T16:31:29Z) - Evaluating probabilistic classifiers: Reliability diagrams and score
decompositions revisited [68.8204255655161]
We introduce the CORP approach, which generates provably statistically Consistent, Optimally binned, and Reproducible reliability diagrams in an automated way.
Corpor is based on non-parametric isotonic regression and implemented via the Pool-adjacent-violators (PAV) algorithm.
arXiv Detail & Related papers (2020-08-07T08:22:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.