AE-smnsMLC: Multi-Label Classification with Semantic Matching and
Negative Label Sampling for Product Attribute Value Extraction
- URL: http://arxiv.org/abs/2310.07137v1
- Date: Wed, 11 Oct 2023 02:22:28 GMT
- Title: AE-smnsMLC: Multi-Label Classification with Semantic Matching and
Negative Label Sampling for Product Attribute Value Extraction
- Authors: Zhongfen Deng, Wei-Te Chen, Lei Chen, Philip S. Yu
- Abstract summary: Product attribute value extraction plays an important role for many real-world applications in e-Commerce such as product search and recommendation.
Previous methods treat it as a sequence labeling task that needs more annotation for position of values in the product text.
We propose a classification model with semantic matching and negative label sampling for attribute value extraction.
- Score: 42.79022954630978
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Product attribute value extraction plays an important role for many
real-world applications in e-Commerce such as product search and
recommendation. Previous methods treat it as a sequence labeling task that
needs more annotation for position of values in the product text. This limits
their application to real-world scenario in which only attribute values are
weakly-annotated for each product without their position. Moreover, these
methods only use product text (i.e., product title and description) and do not
consider the semantic connection between the multiple attribute values of a
given product and its text, which can help attribute value extraction. In this
paper, we reformulate this task as a multi-label classification task that can
be applied for real-world scenario in which only annotation of attribute values
is available to train models (i.e., annotation of positional information of
attribute values is not available). We propose a classification model with
semantic matching and negative label sampling for attribute value extraction.
Semantic matching aims to capture semantic interactions between attribute
values of a given product and its text. Negative label sampling aims to enhance
the model's ability of distinguishing similar values belonging to the same
attribute. Experimental results on three subsets of a large real-world
e-Commerce dataset demonstrate the effectiveness and superiority of our
proposed model.
Related papers
- EAVE: Efficient Product Attribute Value Extraction via Lightweight Sparse-layer Interaction [94.22610101608332]
We propose an Efficient product Attribute Value Extraction (EAVE) approach via lightweight sparse-layer interaction.
We employ a heavy encoder to separately encode the product context and attribute. The resulting non-interacting heavy representations of the context can be cached and reused for all attributes.
Our method achieves significant efficiency gains with neutral or marginal loss in performance when the context is long and number of attributes is large.
arXiv Detail & Related papers (2024-06-10T23:06:38Z) - EIVEN: Efficient Implicit Attribute Value Extraction using Multimodal LLM [52.016009472409166]
EIVEN is a data- and parameter-efficient generative framework for implicit attribute value extraction.
We introduce a novel Learning-by-Comparison technique to reduce model confusion.
Our experiments reveal that EIVEN significantly outperforms existing methods in extracting implicit attribute values.
arXiv Detail & Related papers (2024-04-13T03:15:56Z) - Using LLMs for the Extraction and Normalization of Product Attribute Values [47.098255866050835]
This paper explores the potential of using large language models (LLMs) to extract and normalize attribute values from product titles and descriptions.
We introduce the Web Data Commons - Product Attribute Value Extraction (WDC-PAVE) benchmark dataset for our experiments.
arXiv Detail & Related papers (2024-03-04T15:39:59Z) - JPAVE: A Generation and Classification-based Model for Joint Product
Attribute Prediction and Value Extraction [59.94977231327573]
We propose a multi-task learning model with value generation/classification and attribute prediction called JPAVE.
Two variants of our model are designed for open-world and closed-world scenarios.
Experimental results on a public dataset demonstrate the superiority of our model compared with strong baselines.
arXiv Detail & Related papers (2023-11-07T18:36:16Z) - A Unified Generative Approach to Product Attribute-Value Identification [6.752749933406399]
We explore a generative approach to the product attribute-value identification (PAVI) task.
We finetune a pre-trained generative model, T5, to decode a set of attribute-value pairs as a target sequence from the given product text.
Experimental results confirm that our generation-based approach outperforms the existing extraction and classification-based methods.
arXiv Detail & Related papers (2023-06-09T00:33:30Z) - Multimodal Joint Attribute Prediction and Value Extraction for
E-commerce Product [40.46223408546036]
Product attribute values are essential in many e-commerce scenarios, such as customer service robots, product recommendations, and product retrieval.
While in the real world, the attribute values of a product are usually incomplete and vary over time, which greatly hinders the practical applications.
We propose a multimodal method to jointly predict product attributes and extract values from textual product descriptions with the help of the product images.
arXiv Detail & Related papers (2020-09-15T15:10:51Z) - Automatic Validation of Textual Attribute Values in E-commerce Catalog
by Learning with Limited Labeled Data [61.789797281676606]
We propose a novel meta-learning latent variable approach, called MetaBridge.
It can learn transferable knowledge from a subset of categories with limited labeled data.
It can capture the uncertainty of never-seen categories with unlabeled data.
arXiv Detail & Related papers (2020-06-15T21:31:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.