Transfer Learning for E-commerce Query Product Type Prediction
- URL: http://arxiv.org/abs/2410.07121v1
- Date: Fri, 20 Sep 2024 13:30:04 GMT
- Title: Transfer Learning for E-commerce Query Product Type Prediction
- Authors: Anna Tigunova, Thomas Ricatte, Ghadir Eraisha,
- Abstract summary: We focus on Q2PT prediction in the global multilocale e-commerce markets.
We benchmark the per-locale Q2PT model against the unified one, which shares the training data and model structure across all worldwide stores.
We conduct extensive quantitative and qualitative analysis of Q2PT models on a large-scale e-commerce dataset across 20 worldwide locales.
- Score: 3.092822696545516
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Getting a good understanding of the customer intent is essential in e-commerce search engines. In particular, associating the correct product type to a search query plays a vital role in surfacing correct products to the customers. Query product type classification (Q2PT) is a particularly challenging task because search queries are short and ambiguous, and the number of existing product categories is extremely large, spanning thousands of values. Moreover, international marketplaces face additional challenges, such as language and dialect diversity and cultural differences, which influence the interpretation of the query. In this work we focus on Q2PT prediction in the global multilocale e-commerce markets. The common approach of training Q2PT models for each locale separately shows significant performance drops in low-resource stores. Moreover, this method does not allow for a smooth expansion to a new country, since it requires collecting the data and training a new locale-specific Q2PT model from scratch. To tackle this, we propose to use transfer learning from the high-resource to the low-resource locales, to achieve global parity of Q2PT performance. We benchmark the per-locale Q2PT model against the unified one, which shares the training data and model structure across all worldwide stores. Additionally, we compare locale-aware and locale-agnostic Q2PT models, showing the task dependency on the country-specific traits. We conduct extensive quantitative and qualitative analysis of Q2PT models on a large-scale e-commerce dataset across 20 worldwide locales, which shows that the unified locale-aware Q2PT model has superior performance over the alternatives.
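The difference between a locale-agnostic and a locale-aware unified model can be illustrated with a small sketch. The abstract does not say how locale information is supplied to the model, so the bracketed locale prefix below is an assumption, and the helper name `build_unified_dataset`, the toy queries, and the product type labels are all hypothetical. The sketch only shows how per-locale training data might be pooled into a single training set, with or without a locale marker on each query.

```python
from typing import Dict, List, Tuple

# (query text, product type label); both values are toy data for illustration.
Example = Tuple[str, str]


def build_unified_dataset(
    per_locale: Dict[str, List[Example]], locale_aware: bool
) -> List[Example]:
    """Pool per-locale examples into one training set.

    With locale_aware=True each query is prefixed with a locale marker
    (an assumed mechanism, not taken from the paper); otherwise the
    queries are pooled unchanged.
    """
    unified: List[Example] = []
    for locale in sorted(per_locale):  # sorted for a deterministic order
        for query, product_type in per_locale[locale]:
            text = f"[{locale}] {query}" if locale_aware else query
            unified.append((text, product_type))
    return unified


# Hypothetical data: the same query maps to different product types per locale.
per_locale = {
    "US": [("football", "AMERICAN_FOOTBALL")],
    "UK": [("football", "SOCCER_BALL")],
}

aware = build_unified_dataset(per_locale, locale_aware=True)
agnostic = build_unified_dataset(per_locale, locale_aware=False)
```

In the pooled locale-agnostic set the two examples have identical input text but conflicting labels, whereas the locale marker keeps them distinguishable. This is one way to picture the dependency on country-specific traits that the abstract mentions.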
Related papers
- SEQ+MD: Learning Multi-Task as a SEQuence with Multi-Distribution Data [5.069855142454979]
We propose the SEQ+MD framework, which integrates sequential learning for multi-task learning (MTL) and feature-generated region-mask for multi-distribution input.
We show a strong increase in high-value engagement, including add-to-cart and purchase, while keeping click performance neutral.
Our multi-regional learning module is "plug-and-play" and can be easily adapted to enhance other MTL applications.
arXiv Detail & Related papers (2024-08-23T20:14:27Z) - M2QA: Multi-domain Multilingual Question Answering [63.191474328757366]
Generalization and robustness to input variation are core desiderata of machine learning research.
We introduce M2QA, a multi-domain multilingual question answering benchmark.
M2QA includes 13,500 SQuAD 2.0-style question-answer instances in German, Turkish, and Chinese for the domains of product reviews, news, and creative writing.
arXiv Detail & Related papers (2024-07-01T08:48:49Z) - Datasets for Multilingual Answer Sentence Selection [59.28492975191415]
We introduce new high-quality datasets for AS2 in five European languages (French, German, Italian, Portuguese, and Spanish).
Results indicate that our datasets are pivotal in producing robust and powerful multilingual AS2 models.
arXiv Detail & Related papers (2024-06-14T16:50:29Z) - Solving Price Per Unit Problem Around the World: Formulating Fact Extraction as Question Answering [4.094848360328624]
Price Per Unit (PPU) is essential information for consumers comparing products on e-commerce websites.
We formulate this problem as a question-answering (QA) task rather than a named entity recognition (NER) task for fact extraction.
Our QA approach outperforms rule-based methods by 34.4% in precision and also outperforms the BERT-based fact extraction approach in all stores globally.
arXiv Detail & Related papers (2022-04-12T06:43:48Z) - Towards Best Practices for Training Multilingual Dense Retrieval Models [54.91016739123398]
We focus on the task of monolingual retrieval in a variety of typologically diverse languages using one such design.
Our study is organized as a "best practices" guide for training multilingual dense retrieval models.
arXiv Detail & Related papers (2022-04-05T17:12:53Z) - Delving Deeper into Cross-lingual Visual Question Answering [115.16614806717341]
We show that simple modifications to the standard training setup can substantially reduce the transfer gap to monolingual English performance.
We analyze cross-lingual VQA across different question types of varying complexity for different multilingual multimodal Transformers.
arXiv Detail & Related papers (2022-02-15T18:22:18Z) - Graph-based Multilingual Product Retrieval in E-commerce Search [29.156647795471176]
We introduce a universal end-to-end multilingual retrieval system to serve billion-scale product retrieval for e-commerce search.
We propose a multilingual graph attention based retrieval network by leveraging recent advances in transformer-based multilingual language models.
Our algorithm outperforms the state-of-the-art baselines by 35% recall and 25% mAP on average.
arXiv Detail & Related papers (2021-05-06T21:49:10Z) - Multilingual Answer Sentence Reranking via Automatically Translated Data [97.98885151955467]
We present a study on the design of multilingual Answer Sentence Selection (AS2) models, which are a core component of modern Question Answering (QA) systems.
The main idea is to transfer data, created from one resource rich language, e.g., English, to other languages, less rich in terms of resources.
arXiv Detail & Related papers (2021-02-20T03:52:08Z) - Modeling Household Online Shopping Demand in the U.S.: A Machine Learning Approach and Comparative Investigation between 2009 and 2017 [0.0]
This paper leverages two recent releases of the U.S. National Household Travel Survey (NHTS) data for 2009 and 2017 to develop machine learning (ML) models for predicting household-level online shopping purchases.
Two recent advances in machine learning techniques, namely Shapley value-based feature importance and Accumulated Local Effects plots, are adopted to overcome inherent drawbacks of the popular techniques in current ML modeling.
The models developed and insights gained can be used for online shopping-related freight demand generation and may also be considered for evaluating the potential impact of relevant policies on online shopping demand.
arXiv Detail & Related papers (2021-01-11T03:45:53Z) - Cross-Lingual Low-Resource Set-to-Description Retrieval for Global E-Commerce [83.72476966339103]
Cross-lingual information retrieval is a new task in cross-border e-commerce.
We propose a novel cross-lingual matching network (CLMN) with the enhancement of context-dependent cross-lingual mapping.
Experimental results indicate that our proposed CLMN yields impressive results on the challenging task.
arXiv Detail & Related papers (2020-05-17T08:10:51Z)