High-Throughput Vector Similarity Search in Knowledge Graphs
- URL: http://arxiv.org/abs/2304.01926v1
- Date: Tue, 4 Apr 2023 16:19:15 GMT
- Title: High-Throughput Vector Similarity Search in Knowledge Graphs
- Authors: Jason Mohoney, Anil Pacaci, Shihabur Rahman Chowdhury, Ali Mousavi,
Ihab F. Ilyas, Umar Farooq Minhas, Jeffrey Pound, Theodoros Rekatsinas
- Abstract summary: Recent data management systems propose augmenting query processing with online vector similarity search.
We focus on hybrid vector similarity search (hybrid queries for short) where part of the query corresponds to vector similarity search.
We present our system, HQI, for high- throughput batch processing of hybrid queries.
- Score: 17.41683819564348
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: There is an increasing adoption of machine learning for encoding data into
vectors to serve online recommendation and search use cases. As a result,
recent data management systems propose augmenting query processing with online
vector similarity search. In this work, we explore vector similarity search in
the context of Knowledge Graphs (KGs). Motivated by the tasks of finding
related KG queries and entities for past KG query workloads, we focus on hybrid
vector similarity search (hybrid queries for short) where part of the query
corresponds to vector similarity search and part of the query corresponds to
predicates over relational attributes associated with the underlying data
vectors. For example, given past KG queries for a song entity, we want to
construct new queries for new song entities whose vector representations are
close to the vector representation of the entity in the past KG query. But
entities in a KG also have non-vector attributes such as a song associated with
an artist, a genre, and a release date. Therefore, suggested entities must also
satisfy query predicates over non-vector attributes beyond a vector-based
similarity predicate. While these tasks are central to KGs, our contributions
are generally applicable to hybrid queries. In contrast to prior works that
optimize online queries, we focus on enabling efficient batch processing of
past hybrid query workloads. We present our system, HQI, for high-throughput
batch processing of hybrid queries. We introduce a workload-aware vector data
partitioning scheme to tailor the vector index layout to the given workload and
describe a multi-query optimization technique to reduce the overhead of vector
similarity computations. We evaluate our methods on industrial workloads and
demonstrate that HQI yields a 31x improvement in throughput for finding related
KG queries compared to existing hybrid query processing approaches.
Related papers
- Effective Instruction Parsing Plugin for Complex Logical Query Answering on Knowledge Graphs [51.33342412699939]
Knowledge Graph Query Embedding (KGQE) aims to embed First-Order Logic (FOL) queries in a low-dimensional KG space for complex reasoning over incomplete KGs.
Recent studies integrate various external information (such as entity types and relation context) to better capture the logical semantics of FOL queries.
We propose an effective Query Instruction Parsing (QIPP) that captures latent query patterns from code-like query instructions.
arXiv Detail & Related papers (2024-10-27T03:18:52Z) - Operational Advice for Dense and Sparse Retrievers: HNSW, Flat, or Inverted Indexes? [62.57689536630933]
We provide experimental results on the BEIR dataset using the open-source Lucene search library.
Our results provide guidance for today's search practitioner in understanding the design space of dense and sparse retrievers.
arXiv Detail & Related papers (2024-09-10T12:46:23Z) - User Intent Recognition and Semantic Cache Optimization-Based Query Processing Framework using CFLIS and MGR-LAU [0.0]
This work analyzed the informational, navigational, and transactional-based intents in queries for enhanced QP.
For efficient QP, the data is structured using Epanechnikov Kernel-Ordering Points To Identify the Clustering Structure (EK-OPTICS)
The extracted features, detected intents and structured data are inputted to the Multi-head Gated Recurrent Learnable Attention Unit (MGR-LAU)
arXiv Detail & Related papers (2024-06-06T20:28:05Z) - Generative Retrieval as Multi-Vector Dense Retrieval [71.75503049199897]
Generative retrieval generates identifiers of relevant documents in an end-to-end manner.
Prior work has demonstrated that generative retrieval with atomic identifiers is equivalent to single-vector dense retrieval.
We show that generative retrieval and multi-vector dense retrieval share the same framework for measuring the relevance to a query of a document.
arXiv Detail & Related papers (2024-03-31T13:29:43Z) - Efficient Data Access Paths for Mixed Vector-Relational Search [8.80592433569832]
Machine learning and the adoption of data processing methods using vector embeddings sparked a great interest in creating systems for vector data management.
While the predominant approach of vector data management is to use specialized index structures for fast search over the entirety of the vector embeddings, once combined with other (meta)data, the search queries can also become selective on relational attributes.
As using vector indexes differs from traditional relational data access, we revisit and analyze alternative access paths for efficient mixed vector-relational search.
arXiv Detail & Related papers (2024-03-23T11:34:17Z) - LeanVec: Searching vectors faster by making them fit [1.0863382547662974]
We present LeanVec, a framework that combines linear dimensionality reduction with vector quantization to accelerate similarity search on high-dimensional vectors.
We show that LeanVec produces state-of-the-art results, with up to 3.7x improvement in search throughput and up to 4.9x faster index build time.
arXiv Detail & Related papers (2023-12-26T21:14:59Z) - Building Interpretable and Reliable Open Information Retriever for New
Domains Overnight [67.03842581848299]
Information retrieval is a critical component for many down-stream tasks such as open-domain question answering (QA)
We propose an information retrieval pipeline that uses entity/event linking model and query decomposition model to focus more accurately on different information units of the query.
We show that, while being more interpretable and reliable, our proposed pipeline significantly improves passage coverages and denotation accuracies across five IR and QA benchmarks.
arXiv Detail & Related papers (2023-08-09T07:47:17Z) - UniKGQA: Unified Retrieval and Reasoning for Solving Multi-hop Question
Answering Over Knowledge Graph [89.98762327725112]
Multi-hop Question Answering over Knowledge Graph(KGQA) aims to find the answer entities that are multiple hops away from the topic entities mentioned in a natural language question.
We propose UniKGQA, a novel approach for multi-hop KGQA task, by unifying retrieval and reasoning in both model architecture and parameter learning.
arXiv Detail & Related papers (2022-12-02T04:08:09Z) - Navigable Proximity Graph-Driven Native Hybrid Queries with Structured
and Unstructured Constraints [10.842138336245384]
We propose a native hybrid query (NHQ) framework based on proximity graph (PG), which provides the specialized textitcomposite index and joint pruning modules for hybrid queries.
We present two novel navigable PGs with optimized edge selection and routing strategies, which obtain better overall performance than existing PGs.
arXiv Detail & Related papers (2022-03-25T12:02:37Z) - Learning Query Expansion over the Nearest Neighbor Graph [94.80212602202518]
Graph Query Expansion (GQE) is presented, which is learned in a supervised manner and performs aggregation over an extended neighborhood of the query.
The technique achieves state-of-the-art results over known benchmarks.
arXiv Detail & Related papers (2021-12-05T19:48:42Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.