Query Understanding for Natural Language Enterprise Search
- URL: http://arxiv.org/abs/2012.06238v1
- Date: Fri, 11 Dec 2020 10:57:25 GMT
- Title: Query Understanding for Natural Language Enterprise Search
- Authors: Francisco Borges, Georgios Balikas, Marc Brette, Guillaume Kempf,
Arvind Srikantan, Matthieu Landos, Darya Brazouskaya, Qianqian Shi
- Abstract summary: Natural Language Search (NLS) extends the capabilities of search engines that perform keyword search, allowing users to issue queries in a more "natural" language.
We present an NLS system we implemented as part of the Search service of a major CRM platform.
- Score: 0.7363840001905632
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Natural Language Search (NLS) extends the capabilities of search engines that
perform keyword search, allowing users to issue queries in a more "natural"
language. The engine tries to understand the meaning of the queries and to map
the query words to the symbols it supports, such as Persons, Organizations, and Time
Expressions. It then retrieves the information that satisfies the user's
need in different forms, such as an answer, a record, or a list of records. We
present an NLS system we implemented as part of the Search service of a major
CRM platform. The system is currently in production serving thousands of
customers. Our user studies showed that creating dynamic reports with NLS saved
more than 50% of our users' time compared to achieving the same result with
navigational search. We describe the architecture of the system, the
particularities of the CRM domain as well as how they have influenced our
design decisions. Among several submodules of the system we detail the role of
a Deep Learning Named Entity Recognizer. The paper concludes with a discussion
of the lessons learned while developing this product.
Related papers
- QueryBuilder: Human-in-the-Loop Query Development for Information Retrieval [12.543590253664492]
We present a novel, interactive system called QueryBuilder.
It allows a novice, English-speaking user to create queries with a small amount of effort.
It rapidly develops cross-lingual information retrieval queries corresponding to the user's information needs.
arXiv Detail & Related papers (2024-09-07T00:46:58Z)
- UQE: A Query Engine for Unstructured Databases [71.49289088592842]
We investigate the potential of Large Language Models to enable unstructured data analytics.
We propose a new Universal Query Engine (UQE) that directly interrogates and draws insights from unstructured data collections.
arXiv Detail & Related papers (2024-06-23T06:58:55Z)
- STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases [93.96463520716759]
We develop STaRK, a large-scale semi-structured retrieval benchmark on Textual and Relational Knowledge Bases.
Our benchmark covers three domains: product search, academic paper search, and queries in precision medicine.
We design a novel pipeline to synthesize realistic user queries that integrate diverse relational information and complex textual properties.
arXiv Detail & Related papers (2024-04-19T22:54:54Z)
- Enhanced Facet Generation with LLM Editing [5.4327243200369555]
In information retrieval, facet identification of a user query is an important task.
Previous studies can enhance facet prediction by leveraging retrieved documents and related queries obtained through a search engine.
However, there are challenges in extending it to other applications when a search engine operates as part of the model.
arXiv Detail & Related papers (2024-03-25T00:43:44Z)
- Searching, fast and slow, through product catalogs [5.077235981745305]
We present a unified architecture for SKU search that provides both a real-time suggestion system and a lower latency search system.
We show how our system vastly outperforms, in all aspects, the results provided by the default search engine.
arXiv Detail & Related papers (2024-01-01T12:30:46Z)
- Synergistic Interplay between Search and Large Language Models for Information Retrieval [141.18083677333848]
InteR allows RMs to expand knowledge in queries using LLM-generated knowledge collections.
InteR achieves overall superior zero-shot retrieval performance compared to state-of-the-art methods.
arXiv Detail & Related papers (2023-05-12T11:58:15Z)
- DAMO-NLP at SemEval-2023 Task 2: A Unified Retrieval-augmented System for Multilingual Named Entity Recognition [94.90258603217008]
The MultiCoNER II shared task aims to tackle multilingual named entity recognition (NER) in fine-grained and noisy scenarios.
Previous top systems in MultiCoNER I incorporate either knowledge bases or gazetteers.
We propose a unified retrieval-augmented system (U-RaNER) for fine-grained multilingual NER.
arXiv Detail & Related papers (2023-05-05T16:59:26Z)
- Task Oriented Conversational Modelling With Subjective Knowledge [0.0]
DSTC-11 proposes a three-stage pipeline consisting of knowledge-seeking turn detection, knowledge selection, and response generation.
We propose entity retrieval methods that result in more accurate and faster knowledge search.
Preliminary results show a 4% improvement in exact match score on the knowledge selection task.
arXiv Detail & Related papers (2023-03-30T20:23:49Z)
- Graph Enhanced BERT for Query Understanding [55.90334539898102]
Query understanding plays a key role in exploring users' search intents and helping users locate their most desired information.
In recent years, pre-trained language models (PLMs) have advanced various natural language processing tasks.
We propose a novel graph-enhanced pre-training framework, GE-BERT, which can leverage both query content and the query graph.
arXiv Detail & Related papers (2022-04-03T16:50:30Z)
- Conversations with Search Engines: SERP-based Conversational Response Generation [77.1381159789032]
We create a suitable dataset, the Search as a Conversation (SaaC) dataset, for the development of pipelines for conversations with search engines.
We also develop a state-of-the-art pipeline for conversations with search engines, Conversations with Search Engines (CaSE), using this dataset.
CaSE enhances the state-of-the-art by introducing a supporting token identification module and a prior-aware pointer generator.
arXiv Detail & Related papers (2020-04-29T13:07:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.