Content-Based Landmark Retrieval Combining Global and Local Features
using Siamese Neural Networks
- URL: http://arxiv.org/abs/2208.04201v1
- Date: Wed, 3 Aug 2022 18:11:36 GMT
- Title: Content-Based Landmark Retrieval Combining Global and Local Features
using Siamese Neural Networks
- Authors: Tianyi Hu, Monika Kwiatkowski, Simon Matern, Olaf Hellwich
- Abstract summary: We present a method for landmark retrieval that utilizes global and local features.
A Siamese network is used for global feature extraction and metric learning, which gives an initial ranking of the landmark search.
We utilize the extracted feature maps from the Siamese architecture as local descriptors, the search results are then further refined using a cosine similarity between local descriptors.
- Score: 3.785123406103385
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In this work, we present a method for landmark retrieval that utilizes global
and local features. A Siamese network is used for global feature extraction and
metric learning, which gives an initial ranking of the landmark search. We
utilize the extracted feature maps from the Siamese architecture as local
descriptors, the search results are then further refined using a cosine
similarity between local descriptors. We conduct a deeper analysis of the
Google Landmark Dataset, which is used for evaluation, and augment the dataset
to handle various intra-class variances. Furthermore, we conduct several
experiments to compare the effects of transfer learning and metric learning, as
well as experiments using other local descriptors. We show that a re-ranking
using local features can improve the search results. We believe that the
proposed local feature extraction using cosine similarity is a simple approach
that can be extended to many other retrieval tasks.
Related papers
- Deep Homography Estimation for Visual Place Recognition [49.235432979736395]
We propose a transformer-based deep homography estimation (DHE) network.
It takes the dense feature map extracted by a backbone network as input and fits homography for fast and learnable geometric verification.
Experiments on benchmark datasets show that our method can outperform several state-of-the-art methods.
arXiv Detail & Related papers (2024-02-25T13:22:17Z) - CLIP-Loc: Multi-modal Landmark Association for Global Localization in
Object-based Maps [0.16492989697868893]
This paper describes a multi-modal data association method for global localization using object-based maps and camera images.
We propose labeling landmarks with natural language descriptions and extracting correspondences based on conceptual similarity with image observations.
arXiv Detail & Related papers (2024-02-08T22:59:12Z) - Geo-Encoder: A Chunk-Argument Bi-Encoder Framework for Chinese
Geographic Re-Ranking [61.60169764507917]
Chinese geographic re-ranking task aims to find the most relevant addresses among retrieved candidates.
We propose an innovative framework, namely Geo-Encoder, to more effectively integrate Chinese geographical semantics into re-ranking pipelines.
arXiv Detail & Related papers (2023-09-04T13:44:50Z) - Local Contrastive Feature learning for Tabular Data [8.93957397187611]
We propose a new local contrastive feature learning framework (LoCL)
In order to create a niche for local learning, we use feature correlations to create a maximum-spanning tree, and break the tree into feature subsets.
Convolutional learning of the features is used to learn latent feature space, regulated by contrastive and reconstruction losses.
arXiv Detail & Related papers (2022-11-19T00:53:41Z) - Capturing Structural Locality in Non-parametric Language Models [85.94669097485992]
We propose a simple yet effective approach for adding locality information into non-parametric language models.
Experiments on two different domains, Java source code and Wikipedia text, demonstrate that locality features improve model efficacy.
arXiv Detail & Related papers (2021-10-06T15:53:38Z) - Connecting Images through Time and Sources: Introducing Low-data,
Heterogeneous Instance Retrieval [3.6526118822907594]
We show that it is not trivial to pick features responding well to a panel of variations and semantic content.
Introducing a new enhanced version of the Alegoria benchmark, we compare descriptors using the detailed annotations.
arXiv Detail & Related papers (2021-03-19T10:54:51Z) - On the use of local structural properties for improving the efficiency
of hierarchical community detection methods [77.34726150561087]
We study how local structural network properties can be used as proxies to improve the efficiency of hierarchical community detection.
We also check the performance impact of network prunings as an ancillary tactic to make hierarchical community detection more efficient.
arXiv Detail & Related papers (2020-09-15T00:16:12Z) - Region Comparison Network for Interpretable Few-shot Image
Classification [97.97902360117368]
Few-shot image classification has been proposed to effectively use only a limited number of labeled examples to train models for new classes.
We propose a metric learning based method named Region Comparison Network (RCN), which is able to reveal how few-shot learning works.
We also present a new way to generalize the interpretability from the level of tasks to categories.
arXiv Detail & Related papers (2020-09-08T07:29:05Z) - DC-NAS: Divide-and-Conquer Neural Architecture Search [108.57785531758076]
We present a divide-and-conquer (DC) approach to effectively and efficiently search deep neural architectures.
We achieve a $75.1%$ top-1 accuracy on the ImageNet dataset, which is higher than that of state-of-the-art methods using the same search space.
arXiv Detail & Related papers (2020-05-29T09:02:16Z) - Learning Local Features with Context Aggregation for Visual Localization [24.167882373322957]
Keypoint detection and description is fundamental yet important in many vision applications.
Most existing methods use detect-then-describe or detect-and-describe strategy to learn local features without considering their context information.
In this paper, we focus on the fusion of low-level textual information and high-level semantic context information to improve the discrimitiveness of local features.
arXiv Detail & Related papers (2020-05-26T17:19:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.