Aggregating Multiple Bio-Inspired Image Region Classifiers For Effective
And Lightweight Visual Place Recognition
- URL: http://arxiv.org/abs/2312.12995v1
- Date: Wed, 20 Dec 2023 12:57:01 GMT
- Title: Aggregating Multiple Bio-Inspired Image Region Classifiers For Effective
And Lightweight Visual Place Recognition
- Authors: Bruno Arcanjo, Bruno Ferrarini, Maria Fasli, Michael Milford, Klaus D.
McDonald-Maier and Shoaib Ehsan
- Abstract summary: We propose a novel multi-DrosoNet localization system, dubbed RegionDrosoNet, with significantly improved VPR performance.
Our approach relies on specializing distinct groups of DrosoNets on differently sliced partitions of the original image.
We introduce a novel voting module to combine the outputs of all DrosoNets into the final place prediction.
- Score: 22.09628302234166
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Visual place recognition (VPR) enables autonomous systems to localize
themselves within an environment using image information. While VPR techniques
built upon a Convolutional Neural Network (CNN) backbone dominate
state-of-the-art VPR performance, their high computational requirements make
them unsuitable for platforms equipped with low-end hardware. Recently, a
lightweight VPR system based on multiple bio-inspired classifiers, dubbed
DrosoNets, has been proposed, achieving great computational efficiency at the
cost of reduced absolute place retrieval performance. In this work, we propose
a novel multi-DrosoNet localization system, dubbed RegionDrosoNet, with
significantly improved VPR performance, while preserving a low-computational
profile. Our approach relies on specializing distinct groups of DrosoNets on
differently sliced partitions of the original image, increasing extrinsic model
differentiation. Furthermore, we introduce a novel voting module to combine the
outputs of all DrosoNets into the final place prediction which considers
multiple top refence candidates from each DrosoNet. RegionDrosoNet outperforms
other lightweight VPR techniques when dealing with both appearance changes and
viewpoint variations. Moreover, it competes with computationally expensive
methods on some benchmark datasets at a small fraction of their online
inference time.
Related papers
- Task-Oriented Real-time Visual Inference for IoVT Systems: A Co-design Framework of Neural Networks and Edge Deployment [61.20689382879937]
Task-oriented edge computing addresses this by shifting data analysis to the edge.
Existing methods struggle to balance high model performance with low resource consumption.
We propose a novel co-design framework to optimize neural network architecture.
arXiv Detail & Related papers (2024-10-29T19:02:54Z) - ELGC-Net: Efficient Local-Global Context Aggregation for Remote Sensing Change Detection [65.59969454655996]
We propose an efficient change detection framework, ELGC-Net, which leverages rich contextual information to precisely estimate change regions.
Our proposed ELGC-Net sets a new state-of-the-art performance in remote sensing change detection benchmarks.
We also introduce ELGC-Net-LW, a lighter variant with significantly reduced computational complexity, suitable for resource-constrained settings.
arXiv Detail & Related papers (2024-03-26T17:46:25Z) - ClusVPR: Efficient Visual Place Recognition with Clustering-based
Weighted Transformer [13.0858576267115]
We present ClusVPR, a novel approach that tackles the specific issues of redundant information in duplicate regions and representations of small objects.
ClusVPR introduces a unique paradigm called Clustering-based weighted Transformer Network (CWTNet)
We also introduce the optimized-VLAD layer that significantly reduces the number of parameters and enhances model efficiency.
arXiv Detail & Related papers (2023-10-06T09:01:15Z) - Patch-DrosoNet: Classifying Image Partitions With Fly-Inspired Models
For Lightweight Visual Place Recognition [22.58641358408613]
We present a novel training approach for DrosoNet, wherein separate models are trained on distinct regions of a reference image.
We also introduce a convolutional-like prediction method, in which each DrosoNet unit generates a set of place predictions for each portion of the query image.
Our approach significantly improves upon the VPR performance of previous work while maintaining an extremely compact and lightweight algorithm.
arXiv Detail & Related papers (2023-05-09T08:25:49Z) - Ghost-dil-NetVLAD: A Lightweight Neural Network for Visual Place Recognition [3.6249801498927923]
We propose a lightweight weakly supervised end-to-end neural network consisting of a front-ended perception model called GhostCNN and a learnable VLAD layer as a back-end.
To enhance our proposed lightweight model further, we add dilated convolutions to the Ghost module to get features containing more spatial semantic information, improving accuracy.
arXiv Detail & Related papers (2021-12-22T06:05:02Z) - An Efficient and Scalable Collection of Fly-inspired Voting Units for
Visual Place Recognition in Changing Environments [20.485491385050615]
Low-overhead VPR techniques would enable platforms equipped with low-end, cheap hardware.
Our goal is to provide an algorithm of extreme compactness and efficiency while achieving state-of-the-art robustness to appearance changes and small point-of-view variations.
arXiv Detail & Related papers (2021-09-22T19:01:20Z) - STA-VPR: Spatio-temporal Alignment for Visual Place Recognition [17.212503755962757]
We propose an adaptive dynamic time warping algorithm to align local features from the spatial domain while measuring the distance between two images.
A local matching DTW algorithm is applied to perform image sequence matching based on temporal alignment.
The results show that the proposed method significantly improves the CNN-based methods.
arXiv Detail & Related papers (2021-03-25T03:27:42Z) - Adversarial Feature Augmentation and Normalization for Visual
Recognition [109.6834687220478]
Recent advances in computer vision take advantage of adversarial data augmentation to ameliorate the generalization ability of classification models.
Here, we present an effective and efficient alternative that advocates adversarial augmentation on intermediate feature embeddings.
We validate the proposed approach across diverse visual recognition tasks with representative backbone networks.
arXiv Detail & Related papers (2021-03-22T20:36:34Z) - Learning Frequency-aware Dynamic Network for Efficient Super-Resolution [56.98668484450857]
This paper explores a novel frequency-aware dynamic network for dividing the input into multiple parts according to its coefficients in the discrete cosine transform (DCT) domain.
In practice, the high-frequency part will be processed using expensive operations and the lower-frequency part is assigned with cheap operations to relieve the computation burden.
Experiments conducted on benchmark SISR models and datasets show that the frequency-aware dynamic network can be employed for various SISR neural architectures.
arXiv Detail & Related papers (2021-03-15T12:54:26Z) - A Deep-Unfolded Reference-Based RPCA Network For Video
Foreground-Background Separation [86.35434065681925]
This paper proposes a new deep-unfolding-based network design for the problem of Robust Principal Component Analysis (RPCA)
Unlike existing designs, our approach focuses on modeling the temporal correlation between the sparse representations of consecutive video frames.
Experimentation using the moving MNIST dataset shows that the proposed network outperforms a recently proposed state-of-the-art RPCA network in the task of video foreground-background separation.
arXiv Detail & Related papers (2020-10-02T11:40:09Z) - Image Fine-grained Inpainting [89.17316318927621]
We present a one-stage model that utilizes dense combinations of dilated convolutions to obtain larger and more effective receptive fields.
To better train this efficient generator, except for frequently-used VGG feature matching loss, we design a novel self-guided regression loss.
We also employ a discriminator with local and global branches to ensure local-global contents consistency.
arXiv Detail & Related papers (2020-02-07T03:45:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.