Visual Place Recognition with Low-Resolution Images
- URL: http://arxiv.org/abs/2305.05776v1
- Date: Tue, 9 May 2023 21:34:38 GMT
- Title: Visual Place Recognition with Low-Resolution Images
- Authors: Mihnea-Alexandru Tomita, Bruno Ferrarini, Michael Milford, Klaus
McDonald-Maier, Shoaib Ehsan
- Abstract summary: We analyse the effects of image resolution on the accuracy and robustness of handcrafted Visual Place Recognition pipelines.
This paper aims to help academic researchers and companies in the hardware and software industry co-design VPR solutions and expand the use of VPR algorithms in commercial products.
- Score: 17.847661026367767
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Images incorporate a wealth of information from a robot's surroundings. With
the widespread availability of compact cameras, visual information has become
increasingly popular for addressing the localisation problem, which is then
termed as Visual Place Recognition (VPR). While many applications use
high-resolution cameras and high-end systems to achieve optimal place-matching
performance, low-end commercial systems face limitations due to resource
constraints and relatively low-resolution and low-quality cameras. In this
paper, we analyse the effects of image resolution on the accuracy and
robustness of well-established handcrafted VPR pipelines. Handcrafted designs
have low computational demands and can adapt to flexible image resolutions,
making them a suitable approach to scale to any image source and to operate
under resource limitations. This paper aims to help academic researchers and
companies in the hardware and software industry co-design VPR solutions and
expand the use of VPR algorithms in commercial products.
Related papers
- Low-Resolution Self-Attention for Semantic Segmentation [96.81482872022237]
We introduce the Low-Resolution Self-Attention (LRSA) mechanism to capture global context at a significantly reduced computational cost.
Our approach involves computing self-attention in a fixed low-resolution space regardless of the input image's resolution.
We demonstrate the effectiveness of our LRSA approach by building the LRFormer, a vision transformer with an encoder-decoder structure.
arXiv Detail & Related papers (2023-10-08T06:10:09Z) - Super-Resolving Face Image by Facial Parsing Information [52.1267613768555]
Face super-resolution is a technology that transforms a low-resolution face image into the corresponding high-resolution one.
We build a novel parsing map guided face super-resolution network which extracts the face prior from low-resolution face image.
High-resolution features contain more precise spatial information while low-resolution features provide strong contextual information.
arXiv Detail & Related papers (2023-04-06T08:19:03Z) - LSwinSR: UAV Imagery Super-Resolution based on Linear Swin Transformer [7.3817359680010615]
Super-resolution technology is especially beneficial for Unmanned Aerial Vehicles (UAV)
In this paper, for the super-resolution of UAV images, a novel network based on the state-of-the-art Swin Transformer is proposed with better efficiency and competitive accuracy.
arXiv Detail & Related papers (2023-03-17T20:14:10Z) - Improving Performance of Object Detection using the Mechanisms of Visual
Recognition in Humans [0.4297070083645048]
We first track the performance of the state-of-the-art deep object recognition network, Faster- RCNN, as a function of image resolution.
They also show that different spatial frequencies convey different information about the objects in recognition process.
We propose a multi-resolution object recognition framework rather than a single-resolution network.
arXiv Detail & Related papers (2023-01-23T19:09:36Z) - Efficient High-Resolution Deep Learning: A Survey [90.76576712433595]
Cameras in modern devices such as smartphones, satellites and medical equipment are capable of capturing very high resolution images and videos.
Such high-resolution data often need to be processed by deep learning models for cancer detection, automated road navigation, weather prediction, surveillance, optimizing agricultural processes and many other applications.
Using high-resolution images and videos as direct inputs for deep learning models creates many challenges due to their high number of parameters, computation cost, inference latency and GPU memory consumption.
Several works in the literature propose better alternatives in order to deal with the challenges of high-resolution data and improve accuracy and speed while complying with hardware limitations
arXiv Detail & Related papers (2022-07-26T17:13:53Z) - Dynamic Low-Resolution Distillation for Cost-Efficient End-to-End Text
Spotting [49.33891486324731]
We propose a novel cost-efficient Dynamic Low-resolution Distillation (DLD) text spotting framework.
It aims to infer images in different small but recognizable resolutions and achieve a better balance between accuracy and efficiency.
The proposed method can be optimized end-to-end and adopted in any current text spotting framework to improve the practicability.
arXiv Detail & Related papers (2022-07-14T06:49:59Z) - 3D Human Pose, Shape and Texture from Low-Resolution Images and Videos [107.36352212367179]
We propose RSC-Net, which consists of a Resolution-aware network, a Self-supervision loss, and a Contrastive learning scheme.
The proposed method is able to learn 3D body pose and shape across different resolutions with one single model.
We extend the RSC-Net to handle low-resolution videos and apply it to reconstruct textured 3D pedestrians from low-resolution input.
arXiv Detail & Related papers (2021-03-11T06:52:12Z) - Human Pose Estimation on Privacy-Preserving Low-Resolution Depth Images [2.8802646903517957]
Human pose estimation (HPE) is a key building block for developing AI-based context-aware systems inside the operating room (OR)
Being able to solely use low-resolution privacy-preserving images would address these concerns.
We propose an end-to-end solution that integrates a multi-scale super-resolution network with a 2D human pose estimation network.
arXiv Detail & Related papers (2020-07-16T14:03:52Z) - Multi-image Super Resolution of Remotely Sensed Images using Residual
Feature Attention Deep Neural Networks [1.3764085113103222]
The presented research proposes a novel residual attention model (RAMS) that efficiently tackles the multi-image super-resolution task.
We introduce the mechanism of visual feature attention with 3D convolutions in order to obtain an aware data fusion and information extraction.
Our representation learning network makes extensive use of nestled residual connections to let flow redundant low-frequency signals.
arXiv Detail & Related papers (2020-07-06T22:54:02Z) - Learning Enriched Features for Real Image Restoration and Enhancement [166.17296369600774]
convolutional neural networks (CNNs) have achieved dramatic improvements over conventional approaches for image restoration task.
We present a novel architecture with the collective goals of maintaining spatially-precise high-resolution representations through the entire network.
Our approach learns an enriched set of features that combines contextual information from multiple scales, while simultaneously preserving the high-resolution spatial details.
arXiv Detail & Related papers (2020-03-15T11:04:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.