Unified Generative & Dense Retrieval for Query Rewriting in Sponsored
Search
- URL: http://arxiv.org/abs/2209.05861v2
- Date: Sat, 3 Jun 2023 12:19:43 GMT
- Title: Unified Generative & Dense Retrieval for Query Rewriting in Sponsored
Search
- Authors: Akash Kumar Mohankumar, Bhargav Dodla, Gururaj K, Amit Singh
- Abstract summary: We compare two paradigms for online query rewriting: Generative (NLG) and Dense Retrieval (DR) methods.
We propose CLOVER-Unity, a novel approach that unifies generative and dense retrieval methods in one single model.
- Score: 6.181557214852772
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Sponsored search is a key revenue source for search engines, where
advertisers bid on keywords to target users or search queries of interest.
However, finding relevant keywords for a given query is challenging due to the
large and dynamic keyword space, ambiguous user/advertiser intents, and diverse
possible topics and languages. In this work, we present a comprehensive
comparison between two paradigms for online query rewriting: Generative (NLG)
and Dense Retrieval (DR) methods. We observe that both methods offer
complementary benefits that are additive. As a result, we show that around 40%
of the high-quality keywords retrieved by the two approaches are unique and not
retrieved by the other. To leverage the strengths of both methods, we propose
CLOVER-Unity, a novel approach that unifies generative and dense retrieval
methods in one single model. Through offline experiments, we show that the NLG
and DR components of CLOVER-Unity consistently outperform individually trained
NLG and DR models on public and internal benchmarks. Furthermore, we show that
CLOVER-Unity achieves 9.8% higher good keyword density than the ensemble of two
separate DR and NLG models while reducing computational costs by almost half.
We conduct extensive online A/B experiments on Microsoft Bing in 140+ countries
and achieve improved user engagement, with an average increase in total clicks
by 0.89% and increased revenue by 1.27%. We also share our practical lessons
and optimization tricks for deploying such unified models in production.
Related papers
- Bridging Search and Recommendation in Generative Retrieval: Does One Task Help the Other? [9.215695600542249]
Generative retrieval for search and recommendation is a promising paradigm for retrieving items.
These generative systems can play a crucial role in centralizing a variety of Information Retrieval (IR) tasks in a single model.
This paper investigates whether and when such a unified approach can outperform task-specific models in the IR tasks of search and recommendation.
arXiv Detail & Related papers (2024-10-22T08:49:43Z) - Learning to Rank for Multiple Retrieval-Augmented Models through Iterative Utility Maximization [21.115495457454365]
This paper investigates the design of a unified search engine to serve multiple retrieval-augmented generation (RAG) agents.
We introduce an iterative approach where the search engine generates retrieval results for these RAG agents and gathers feedback on the quality of the retrieved documents during an offline phase.
We adapt this approach to an online setting, allowing the search engine to refine its behavior based on real-time individual agents feedback.
arXiv Detail & Related papers (2024-10-13T17:53:50Z) - ACE: A Generative Cross-Modal Retrieval Framework with Coarse-To-Fine Semantic Modeling [53.97609687516371]
We propose a pioneering generAtive Cross-modal rEtrieval framework (ACE) for end-to-end cross-modal retrieval.
ACE achieves state-of-the-art performance in cross-modal retrieval and outperforms the strong baselines on Recall@1 by 15.27% on average.
arXiv Detail & Related papers (2024-06-25T12:47:04Z) - Unified Embedding Based Personalized Retrieval in Etsy Search [0.206242362470764]
We propose learning a unified embedding model incorporating graph, transformer and term-based embeddings end to end.
Our personalized retrieval model significantly improves the overall search experience, as measured by a 5.58% increase in search purchase rate and a 2.63% increase in site-wide conversion rate.
arXiv Detail & Related papers (2023-06-07T23:24:50Z) - LEAPS: End-to-End One-Step Person Search With Learnable Proposals [50.39493100627476]
We propose an end-to-end one-step person search approach with learnable proposals, named LEAPS.
Given a set of sparse and learnable proposals, LEAPS employs a dynamic person search head to directly perform person detection and corresponding re-id feature generation without non-maximum suppression post-processing.
arXiv Detail & Related papers (2023-03-21T13:59:32Z) - Enriching Relation Extraction with OpenIE [70.52564277675056]
Relation extraction (RE) is a sub-discipline of information extraction (IE)
In this work, we explore how recent approaches for open information extraction (OpenIE) may help to improve the task of RE.
Our experiments over two annotated corpora, KnowledgeNet and FewRel, demonstrate the improved accuracy of our enriched models.
arXiv Detail & Related papers (2022-12-19T11:26:23Z) - Query Expansion Using Contextual Clue Sampling with Language Models [69.51976926838232]
We propose a combination of an effective filtering strategy and fusion of the retrieved documents based on the generation probability of each context.
Our lexical matching based approach achieves a similar top-5/top-20 retrieval accuracy and higher top-100 accuracy compared with the well-established dense retrieval model DPR.
For end-to-end QA, the reader model also benefits from our method and achieves the highest Exact-Match score against several competitive baselines.
arXiv Detail & Related papers (2022-10-13T15:18:04Z) - Multi-Objective Personalized Product Retrieval in Taobao Search [27.994166796745496]
We propose a novel Multi-Objective Personalized Product Retrieval (MOPPR) model with four hierarchical optimization objectives: relevance, exposure, click and purchase.
MOPPR achieves 0.96% transaction and 1.29% GMV improvements in a 28-day online A/B test.
Since the Double-11 shopping festival of 2021, MOPPR has been fully deployed in mobile Taobao search, replacing the previous MGDSPR.
arXiv Detail & Related papers (2022-10-09T05:18:42Z) - Diversity driven Query Rewriting in Search Advertising [1.5289756643078838]
generative retrieval models have been shown to be effective at the task of generating such query rewrites.
We introduce CLOVER, a framework to generate both high-quality and diverse rewrites.
We empirically show the effectiveness of our proposed approach through offline experiments on search queries across geographies spanning three major languages.
arXiv Detail & Related papers (2021-06-07T17:30:45Z) - One-Shot Neural Ensemble Architecture Search by Diversity-Guided Search
Space Shrinking [97.60915598958968]
We propose a one-shot neural ensemble architecture search (NEAS) solution that addresses the two challenges.
For the first challenge, we introduce a novel diversity-based metric to guide search space shrinking.
For the second challenge, we enable a new search dimension to learn layer sharing among different models for efficiency purposes.
arXiv Detail & Related papers (2021-04-01T16:29:49Z) - Diverse Knowledge Distillation for End-to-End Person Search [81.4926655119318]
Person search aims to localize and identify a specific person from a gallery of images.
Recent methods can be categorized into two groups, i.e., two-step and end-to-end approaches.
We propose a simple yet strong end-to-end network with diverse knowledge distillation to break the bottleneck.
arXiv Detail & Related papers (2020-12-21T09:04:27Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.