A Semantic Indexing Structure for Image Retrieval
- URL: http://arxiv.org/abs/2109.06583v1
- Date: Tue, 14 Sep 2021 11:12:30 GMT
- Title: A Semantic Indexing Structure for Image Retrieval
- Authors: Ying Wang, Tingzhen Liu, Zepeng Bu, Yuhui Huang, Lizhong Gao, Qiao
Wang
- Abstract summary: We propose a new classification-based indexing structure, called Semantic Indexing Structure (SIS)
SIS uses semantic categories rather than clustering centers to create database partitions.
SIS achieves outstanding performance compared with state-of-the-art models.
- Score: 9.889773269004241
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In large-scale image retrieval, many indexing methods have been proposed to
narrow down the searching scope of retrieval. The features extracted from
images usually are of high dimensions or unfixed sizes due to the existence of
key points. Most of existing index structures suffer from the dimension curse,
the unfixed feature size and/or the loss of semantic similarity. In this paper
a new classification-based indexing structure, called Semantic Indexing
Structure (SIS), is proposed, in which we utilize the semantic categories
rather than clustering centers to create database partitions, such that the
proposed index SIS can be combined with feature extractors without the
restriction of dimensions. Besides, it is observed that the size of each
semantic partition is positively correlated with the semantic distribution of
database. Along this way, we found that when the partition number is normalized
to five, the proposed algorithm performed very well in all the tests. Compared
with state-of-the-art models, SIS achieves outstanding performance.
Related papers
- Accelerating spherical K-means clustering for large-scale sparse document data [0.7366405857677226]
This paper presents an accelerated spherical K-means clustering algorithm for large-scale and high-dimensional sparse document data sets.
We experimentally demonstrate that our algorithm efficiently achieves superior speed performance in large-scale documents compared with algorithms using the state-of-the-art techniques.
arXiv Detail & Related papers (2024-11-18T05:50:58Z) - Vocabulary-free Image Classification and Semantic Segmentation [71.78089106671581]
We introduce the Vocabulary-free Image Classification (VIC) task, which aims to assign a class from an un-constrained language-induced semantic space to an input image without needing a known vocabulary.
VIC is challenging due to the vastness of the semantic space, which contains millions of concepts, including fine-grained categories.
We propose Category Search from External Databases (CaSED), a training-free method that leverages a pre-trained vision-language model and an external database.
arXiv Detail & Related papers (2024-04-16T19:27:21Z) - Generative Retrieval as Multi-Vector Dense Retrieval [71.75503049199897]
Generative retrieval generates identifiers of relevant documents in an end-to-end manner.
Prior work has demonstrated that generative retrieval with atomic identifiers is equivalent to single-vector dense retrieval.
We show that generative retrieval and multi-vector dense retrieval share the same framework for measuring the relevance to a query of a document.
arXiv Detail & Related papers (2024-03-31T13:29:43Z) - Dense X Retrieval: What Retrieval Granularity Should We Use? [56.90827473115201]
Often-overlooked design choice is the retrieval unit in which the corpus is indexed, e.g. document, passage, or sentence.
We introduce a novel retrieval unit, proposition, for dense retrieval.
Experiments reveal that indexing a corpus by fine-grained units such as propositions significantly outperforms passage-level units in retrieval tasks.
arXiv Detail & Related papers (2023-12-11T18:57:35Z) - Integrating Visual and Semantic Similarity Using Hierarchies for Image
Retrieval [0.46040036610482665]
We propose a method for CBIR that captures both visual and semantic similarity using a visual hierarchy.
The hierarchy is constructed by merging classes with overlapping features in the latent space of a deep neural network trained for classification.
Our method achieves superior performance compared to the existing methods on image retrieval.
arXiv Detail & Related papers (2023-08-16T15:23:14Z) - Inverted Semantic-Index for Image Retrieval [3.751222656656264]
inverted indices aim to build finer partitions that produce a concise and accurate candidate list.
In this paper, we replace the clustering method with image classification, during the construction of codebook.
We combine our semantic-index with the product quantization (PQ) so as to alleviate the accuracy loss caused by PQ compression.
arXiv Detail & Related papers (2022-06-25T11:21:56Z) - A Learned Index for Exact Similarity Search in Metric Spaces [25.330353637669386]
LIMS is proposed to use data clustering and pivot-based data transformation techniques to build learned indexes.
Machine learning models are developed to approximate the position of each data record on the disk.
Extensive experiments on real-world and synthetic datasets demonstrate the superiority of LIMS compared with traditional indexes.
arXiv Detail & Related papers (2022-04-21T11:24:55Z) - Generalizing Few-Shot NAS with Gradient Matching [165.5690495295074]
One-Shot methods train one supernet to approximate the performance of every architecture in the search space via weight-sharing.
Few-Shot NAS reduces the level of weight-sharing by splitting the One-Shot supernet into multiple separated sub-supernets.
It significantly outperforms its Few-Shot counterparts while surpassing previous comparable methods in terms of the accuracy of derived architectures.
arXiv Detail & Related papers (2022-03-29T03:06:16Z) - IHashNet: Iris Hashing Network based on efficient multi-index hashing [9.540646692526348]
We propose an iris indexing scheme using real-valued deep iris features binarized to iris bar codes (IBC) compatible with the indexing structure.
For indexing the iris dataset, we have proposed a loss that can transform the binary feature into an improved feature compatible with the Multi-Index Hashing scheme.
arXiv Detail & Related papers (2020-12-07T17:50:57Z) - The Case for Learned Spatial Indexes [62.88514422115702]
We use techniques proposed from a state-of-the art learned multi-dimensional index structure (namely, Flood) to answer spatial range queries.
We show that (i) machine learned search within a partition is faster by 11.79% to 39.51% than binary search when using filtering on one dimension.
We also refine using machine learned indexes is 1.23x to 1.83x times faster than closest competitor which filters on two dimensions.
arXiv Detail & Related papers (2020-08-24T12:09:55Z) - Compact Deep Aggregation for Set Retrieval [87.52470995031997]
We focus on retrieving images containing multiple faces from a large scale dataset of images.
Here the set consists of the face descriptors in each image, and given a query for multiple identities, the goal is then to retrieve, in order, images which contain all the identities.
We show that this compact descriptor has minimal loss of discriminability up to two faces per image, and degrades slowly after that.
arXiv Detail & Related papers (2020-03-26T08:43:15Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.