Automatic Vocabulary and Graph Verification for Accurate Loop Closure
Detection
- URL: http://arxiv.org/abs/2107.14611v1
- Date: Fri, 30 Jul 2021 13:19:33 GMT
- Title: Automatic Vocabulary and Graph Verification for Accurate Loop Closure
Detection
- Authors: Haosong Yue and Jinyu Miao and Weihai Chen and Wei Wang and Fanghong
Guo and Zhengguo Li
- Abstract summary: Bag-of-Words (BoW) builds a visual vocabulary to associate features and then detect loops.
We propose a natural convergence criterion based on the comparison between the radii of nodes and the drifts of feature descriptors.
We present a novel topological graph verification method for validating candidate loops.
- Score: 21.862978912891677
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Localizing pre-visited places during long-term simultaneous localization and
mapping, i.e. loop closure detection (LCD), is a crucial technique to correct
accumulated inconsistencies. As one of the most effective and efficient
solutions, Bag-of-Words (BoW) builds a visual vocabulary to associate features
and then detect loops. Most existing approaches that build vocabularies
off-line determine scales of the vocabulary by trial-and-error, which often
results in unreasonable feature association. Moreover, the accuracy of the
algorithm usually declines due to perceptual aliasing, as the BoW-based method
ignores the positions of visual features. To overcome these disadvantages, we
propose a natural convergence criterion based on the comparison between the
radii of nodes and the drifts of feature descriptors, which is then utilized to
build the optimal vocabulary automatically. Furthermore, we present a novel
topological graph verification method for validating candidate loops so that
geometrical positions of the words can be involved with a negligible increase
in complexity, which can significantly improve the accuracy of LCD. Experiments
on various public datasets and comparisons against several state-of-the-art
algorithms verify the performance of our proposed approach.
Related papers
- LM-assisted keyword biasing with Aho-Corasick algorithm for Transducer-based ASR [3.841280537264271]
We propose a light on-the-fly method to improve automatic speech recognition performance.
We combine a bias list of named entities with a word-level n-gram language model with the shallow fusion approach based on the Aho-Corasick string matching algorithm.
We achieve up to 21.6% relative improvement in the general word error rate with no practical difference in the inverse real-time factor.
arXiv Detail & Related papers (2024-09-20T13:53:37Z) - Beyond Levenshtein: Leveraging Multiple Algorithms for Robust Word Error Rate Computations And Granular Error Classifications [5.266869303483375]
The Word Error Rate (WER) is the common measure of accuracy for Automatic Speech Recognition (ASR)
We present a non-destructive, token-based approach using an extended Levenshtein distance algorithm to compute a robust WER.
We also provide an exemplary analysis of derived use cases, such as a punctuation error rate, and a web application for interactive use and visualisation of our implementation.
arXiv Detail & Related papers (2024-08-28T08:14:51Z) - SparseCL: Sparse Contrastive Learning for Contradiction Retrieval [87.02936971689817]
Contradiction retrieval refers to identifying and extracting documents that explicitly disagree with or refute the content of a query.
Existing methods such as similarity search and crossencoder models exhibit significant limitations.
We introduce SparseCL that leverages specially trained sentence embeddings designed to preserve subtle, contradictory nuances between sentences.
arXiv Detail & Related papers (2024-06-15T21:57:03Z) - Bisimulation Learning [55.859538562698496]
We compute finite bisimulations of state transition systems with large, possibly infinite state space.
Our technique yields faster verification results than alternative state-of-the-art tools in practice.
arXiv Detail & Related papers (2024-05-24T17:11:27Z) - SemanticTopoLoop: Semantic Loop Closure With 3D Topological Graph Based
on Quadric-Level Object Map [0.8158530638728501]
Loop closure is one of the crucial components in SLAM.
Traditional appearance-based methods, such as bag-of-words models, are often limited by local 2D features and the volume of training data.
arXiv Detail & Related papers (2023-11-06T02:30:30Z) - Generalized Time Warping Invariant Dictionary Learning for Time Series
Classification and Clustering [8.14208923345076]
The dynamic time warping (DTW) is commonly used for dealing with temporal delays, scaling, transformation, and many other kinds of temporal misalignments issues.
We propose a generalized time warping invariant dictionary learning algorithm in this paper.
The superiority of the proposed method in terms of dictionary learning, classification, and clustering is validated through ten sets of public datasets.
arXiv Detail & Related papers (2023-06-30T14:18:13Z) - Towards View-invariant and Accurate Loop Detection Based on Scene Graph [33.68204049028845]
Loop detection plays a key role in visual SLAM by correcting the accumulated pose drift.
Current semantic-aided loop detection methods face challenges in dealing with ambiguous semantic instances and drastic viewpoint differences.
This paper introduces a novel loop detection method based on an incrementally created scene graph.
arXiv Detail & Related papers (2023-05-24T08:34:43Z) - Finding Dataset Shortcuts with Grammar Induction [85.47127659108637]
We propose to use probabilistic grammars to characterize and discover shortcuts in NLP datasets.
Specifically, we use a context-free grammar to model patterns in sentence classification datasets and use a synchronous context-free grammar to model datasets involving sentence pairs.
The resulting grammars reveal interesting shortcut features in a number of datasets, including both simple and high-level features.
arXiv Detail & Related papers (2022-10-20T19:54:11Z) - Multiway Non-rigid Point Cloud Registration via Learned Functional Map
Synchronization [105.14877281665011]
We present SyNoRiM, a novel way to register multiple non-rigid shapes by synchronizing the maps relating learned functions defined on the point clouds.
We demonstrate via extensive experiments that our method achieves a state-of-the-art performance in registration accuracy.
arXiv Detail & Related papers (2021-11-25T02:37:59Z) - Pseudo-Convolutional Policy Gradient for Sequence-to-Sequence
Lip-Reading [96.48553941812366]
Lip-reading aims to infer the speech content from the lip movement sequence.
Traditional learning process of seq2seq models suffers from two problems.
We propose a novel pseudo-convolutional policy gradient (PCPG) based method to address these two problems.
arXiv Detail & Related papers (2020-03-09T09:12:26Z) - Consistency of a Recurrent Language Model With Respect to Incomplete
Decoding [67.54760086239514]
We study the issue of receiving infinite-length sequences from a recurrent language model.
We propose two remedies which address inconsistency: consistent variants of top-k and nucleus sampling, and a self-terminating recurrent language model.
arXiv Detail & Related papers (2020-02-06T19:56:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.