SAGE: Structured Attribute Value Generation for Billion-Scale Product
Catalogs
- URL: http://arxiv.org/abs/2309.05920v1
- Date: Tue, 12 Sep 2023 02:24:16 GMT
- Title: SAGE: Structured Attribute Value Generation for Billion-Scale Product
Catalogs
- Authors: Athanasios N. Nikolakopoulos, Swati Kaul, Siva Karthik Gade, Bella
Dubrov, Umit Batur, Suleiman Ali Khan
- Abstract summary: SAGE is a Generative LLM for inferring attribute values for products across world-wide e-Commerce catalogs.
We introduce a novel formulation of the attribute-value prediction problem as a Seq2Seq summarization task.
SAGE is the first method able to tackle all aspects of the attribute-value-prediction task as they arise in practical settings in e-Commerce catalogs.
- Score: 1.1184789007828977
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We introduce SAGE; a Generative LLM for inferring attribute values for
products across world-wide e-Commerce catalogs. We introduce a novel
formulation of the attribute-value prediction problem as a Seq2Seq
summarization task, across languages, product types and target attributes. Our
novel modeling approach lifts the restriction of predicting attribute values
within a pre-specified set of choices, as well as, the requirement that the
sought attribute values need to be explicitly mentioned in the text. SAGE can
infer attribute values even when such values are mentioned implicitly using
periphrastic language, or not-at-all-as is the case for common-sense defaults.
Additionally, SAGE is capable of predicting whether an attribute is
inapplicable for the product at hand, or non-obtainable from the available
information. SAGE is the first method able to tackle all aspects of the
attribute-value-prediction task as they arise in practical settings in
e-Commerce catalogs. A comprehensive set of experiments demonstrates the
effectiveness of the proposed approach, as well as, its superiority against
state-of-the-art competing alternatives. Moreover, our experiments highlight
SAGE's ability to tackle the task of predicting attribute values in zero-shot
setting; thereby, opening up opportunities for significantly reducing the
overall number of labeled examples required for training.
Related papers
- CASA: Class-Agnostic Shared Attributes in Vision-Language Models for Efficient Incremental Object Detection [30.46562066023117]
We propose a novel method utilizing attributes in vision-language foundation models for incremental object detection.
Our method constructs a Class-Agnostic Shared Attribute base (CASA) to capture common semantic information among incremental classes.
Our method adds only 0.7% to parameter storage through parameter-efficient fine-tuning to significantly enhance the scalability and adaptability of our proposed method.
arXiv Detail & Related papers (2024-10-08T08:36:12Z) - EIVEN: Efficient Implicit Attribute Value Extraction using Multimodal LLM [52.016009472409166]
EIVEN is a data- and parameter-efficient generative framework for implicit attribute value extraction.
We introduce a novel Learning-by-Comparison technique to reduce model confusion.
Our experiments reveal that EIVEN significantly outperforms existing methods in extracting implicit attribute values.
arXiv Detail & Related papers (2024-04-13T03:15:56Z) - Enhancing User Intent Capture in Session-Based Recommendation with
Attribute Patterns [77.19390850643944]
We propose the Frequent Attribute Pattern Augmented Transformer (FAPAT)
FAPAT characterizes user intents by building attribute transition graphs and matching attribute patterns.
We demonstrate that FAPAT consistently outperforms state-of-the-art methods by an average of 4.5% across various evaluation metrics.
arXiv Detail & Related papers (2023-12-23T03:28:18Z) - JPAVE: A Generation and Classification-based Model for Joint Product
Attribute Prediction and Value Extraction [59.94977231327573]
We propose a multi-task learning model with value generation/classification and attribute prediction called JPAVE.
Two variants of our model are designed for open-world and closed-world scenarios.
Experimental results on a public dataset demonstrate the superiority of our model compared with strong baselines.
arXiv Detail & Related papers (2023-11-07T18:36:16Z) - ExtractGPT: Exploring the Potential of Large Language Models for Product Attribute Value Extraction [52.14681890859275]
E-commerce platforms require structured product data in the form of attribute-value pairs.
BERT-based extraction methods require large amounts of task-specific training data.
This paper explores using large language models (LLMs) as a more training-data efficient and robust alternative.
arXiv Detail & Related papers (2023-10-19T07:39:00Z) - AE-smnsMLC: Multi-Label Classification with Semantic Matching and
Negative Label Sampling for Product Attribute Value Extraction [42.79022954630978]
Product attribute value extraction plays an important role for many real-world applications in e-Commerce such as product search and recommendation.
Previous methods treat it as a sequence labeling task that needs more annotation for position of values in the product text.
We propose a classification model with semantic matching and negative label sampling for attribute value extraction.
arXiv Detail & Related papers (2023-10-11T02:22:28Z) - A Unified Generative Approach to Product Attribute-Value Identification [6.752749933406399]
We explore a generative approach to the product attribute-value identification (PAVI) task.
We finetune a pre-trained generative model, T5, to decode a set of attribute-value pairs as a target sequence from the given product text.
Experimental results confirm that our generation-based approach outperforms the existing extraction and classification-based methods.
arXiv Detail & Related papers (2023-06-09T00:33:30Z) - OA-Mine: Open-World Attribute Mining for E-Commerce Products with Weak
Supervision [93.26737878221073]
We study the attribute mining problem in an open-world setting to extract novel attributes and their values.
We propose a principled framework that first generates attribute value candidates and then groups them into clusters of attributes.
Our model significantly outperforms strong baselines and can generalize to unseen attributes and product types.
arXiv Detail & Related papers (2022-04-29T04:16:04Z) - Automatic Validation of Textual Attribute Values in E-commerce Catalog
by Learning with Limited Labeled Data [61.789797281676606]
We propose a novel meta-learning latent variable approach, called MetaBridge.
It can learn transferable knowledge from a subset of categories with limited labeled data.
It can capture the uncertainty of never-seen categories with unlabeled data.
arXiv Detail & Related papers (2020-06-15T21:31:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.