JPAVE: A Generation and Classification-based Model for Joint Product
Attribute Prediction and Value Extraction
- URL: http://arxiv.org/abs/2311.04196v1
- Date: Tue, 7 Nov 2023 18:36:16 GMT
- Title: JPAVE: A Generation and Classification-based Model for Joint Product
Attribute Prediction and Value Extraction
- Authors: Zhongfen Deng, Hao Peng, Tao Zhang, Shuaiqi Liu, Wenting Zhao, Yibo
Wang, Philip S. Yu
- Abstract summary: We propose a multi-task learning model with value generation/classification and attribute prediction called JPAVE.
Two variants of our model are designed for open-world and closed-world scenarios.
Experimental results on a public dataset demonstrate the superiority of our model compared with strong baselines.
- Score: 59.94977231327573
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Product attribute value extraction is an important task in e-Commerce which
can help several downstream applications such as product search and
recommendation. Most previous models handle this task using sequence labeling
or question answering method which rely on the sequential position information
of values in the product text and are vulnerable to data discrepancy between
training and testing. This limits their generalization ability to real-world
scenario in which each product can have multiple descriptions across various
shopping platforms with different composition of text and style. They also have
limited zero-shot ability to new values. In this paper, we propose a multi-task
learning model with value generation/classification and attribute prediction
called JPAVE to predict values without the necessity of position information of
values in the text. Furthermore, the copy mechanism in value generator and the
value attention module in value classifier help our model address the data
discrepancy issue by only focusing on the relevant part of input text and
ignoring other information which causes the discrepancy issue such as sentence
structure in the text. Besides, two variants of our model are designed for
open-world and closed-world scenarios. In addition, copy mechanism introduced
in the first variant based on value generation can improve its zero-shot
ability for identifying unseen values. Experimental results on a public dataset
demonstrate the superiority of our model compared with strong baselines and its
generalization ability of predicting new values.
Related papers
- Exploiting Contextual Target Attributes for Target Sentiment
Classification [53.30511968323911]
Existing PTLM-based models for TSC can be categorized into two groups: 1) fine-tuning-based models that adopt PTLM as the context encoder; 2) prompting-based models that transfer the classification task to the text/word generation task.
We present a new perspective of leveraging PTLM for TSC: simultaneously leveraging the merits of both language modeling and explicit target-context interactions via contextual target attributes.
arXiv Detail & Related papers (2023-12-21T11:45:28Z) - AE-smnsMLC: Multi-Label Classification with Semantic Matching and
Negative Label Sampling for Product Attribute Value Extraction [42.79022954630978]
Product attribute value extraction plays an important role for many real-world applications in e-Commerce such as product search and recommendation.
Previous methods treat it as a sequence labeling task that needs more annotation for position of values in the product text.
We propose a classification model with semantic matching and negative label sampling for attribute value extraction.
arXiv Detail & Related papers (2023-10-11T02:22:28Z) - SAGE: Structured Attribute Value Generation for Billion-Scale Product
Catalogs [1.1184789007828977]
SAGE is a Generative LLM for inferring attribute values for products across world-wide e-Commerce catalogs.
We introduce a novel formulation of the attribute-value prediction problem as a Seq2Seq summarization task.
SAGE is the first method able to tackle all aspects of the attribute-value-prediction task as they arise in practical settings in e-Commerce catalogs.
arXiv Detail & Related papers (2023-09-12T02:24:16Z) - Large Language Models are Diverse Role-Players for Summarization
Evaluation [82.31575622685902]
A document summary's quality can be assessed by human annotators on various criteria, both objective ones like grammar and correctness, and subjective ones like informativeness, succinctness, and appeal.
Most of the automatic evaluation methods like BLUE/ROUGE may be not able to adequately capture the above dimensions.
We propose a new evaluation framework based on LLMs, which provides a comprehensive evaluation framework by comparing generated text and reference text from both objective and subjective aspects.
arXiv Detail & Related papers (2023-03-27T10:40:59Z) - Automated Scoring for Reading Comprehension via In-context BERT Tuning [9.135673900486827]
In this paper, we report our (grand prize-winning) solution to the National Assessment of Education Progress (NAEP) automated scoring challenge for reading comprehension.
Our approach, in-context BERT fine-tuning, produces a single shared scoring model for all items with a carefully-designed input structure.
We demonstrate the effectiveness of our approach via local evaluations using the training dataset provided by the challenge.
arXiv Detail & Related papers (2022-05-19T21:16:15Z) - OA-Mine: Open-World Attribute Mining for E-Commerce Products with Weak
Supervision [93.26737878221073]
We study the attribute mining problem in an open-world setting to extract novel attributes and their values.
We propose a principled framework that first generates attribute value candidates and then groups them into clusters of attributes.
Our model significantly outperforms strong baselines and can generalize to unseen attributes and product types.
arXiv Detail & Related papers (2022-04-29T04:16:04Z) - Automatic Validation of Textual Attribute Values in E-commerce Catalog
by Learning with Limited Labeled Data [61.789797281676606]
We propose a novel meta-learning latent variable approach, called MetaBridge.
It can learn transferable knowledge from a subset of categories with limited labeled data.
It can capture the uncertainty of never-seen categories with unlabeled data.
arXiv Detail & Related papers (2020-06-15T21:31:05Z) - Few-Shot Learning for Opinion Summarization [117.70510762845338]
Opinion summarization is the automatic creation of text reflecting subjective information expressed in multiple documents.
In this work, we show that even a handful of summaries is sufficient to bootstrap generation of the summary text.
Our approach substantially outperforms previous extractive and abstractive methods in automatic and human evaluation.
arXiv Detail & Related papers (2020-04-30T15:37:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.