OA-Mine: Open-World Attribute Mining for E-Commerce Products with Weak
Supervision
- URL: http://arxiv.org/abs/2204.13874v1
- Date: Fri, 29 Apr 2022 04:16:04 GMT
- Title: OA-Mine: Open-World Attribute Mining for E-Commerce Products with Weak
Supervision
- Authors: Xinyang Zhang, Chenwei Zhang, Xian Li, Xin Luna Dong, Jingbo Shang,
Christos Faloutsos, Jiawei Han
- Abstract summary: We study the attribute mining problem in an open-world setting to extract novel attributes and their values.
We propose a principled framework that first generates attribute value candidates and then groups them into clusters of attributes.
Our model significantly outperforms strong baselines and can generalize to unseen attributes and product types.
- Score: 93.26737878221073
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Automatic extraction of product attributes from their textual descriptions is
essential for online shopper experience. One inherent challenge of this task is
the emerging nature of e-commerce products -- we see new types of products with
their unique set of new attributes constantly. Most prior works on this matter
mine new values for a set of known attributes but cannot handle new attributes
that arose from constantly changing data. In this work, we study the attribute
mining problem in an open-world setting to extract novel attributes and their
values. Instead of providing comprehensive training data, the user only needs
to provide a few examples for a few known attribute types as weak supervision.
We propose a principled framework that first generates attribute value
candidates and then groups them into clusters of attributes. The candidate
generation step probes a pre-trained language model to extract phrases from
product titles. Then, an attribute-aware fine-tuning method optimizes a
multitask objective and shapes the language model representation to be
attribute-discriminative. Finally, we discover new attributes and values
through the self-ensemble of our framework, which handles the open-world
challenge. We run extensive experiments on a large distantly annotated
development set and a gold standard human-annotated test set that we collected.
Our model significantly outperforms strong baselines and can generalize to
unseen attributes and product types.
Related papers
- EIVEN: Efficient Implicit Attribute Value Extraction using Multimodal LLM [52.016009472409166]
EIVEN is a data- and parameter-efficient generative framework for implicit attribute value extraction.
We introduce a novel Learning-by-Comparison technique to reduce model confusion.
Our experiments reveal that EIVEN significantly outperforms existing methods in extracting implicit attribute values.
arXiv Detail & Related papers (2024-04-13T03:15:56Z) - JPAVE: A Generation and Classification-based Model for Joint Product
Attribute Prediction and Value Extraction [59.94977231327573]
We propose a multi-task learning model with value generation/classification and attribute prediction called JPAVE.
Two variants of our model are designed for open-world and closed-world scenarios.
Experimental results on a public dataset demonstrate the superiority of our model compared with strong baselines.
arXiv Detail & Related papers (2023-11-07T18:36:16Z) - SAGE: Structured Attribute Value Generation for Billion-Scale Product
Catalogs [1.1184789007828977]
SAGE is a Generative LLM for inferring attribute values for products across world-wide e-Commerce catalogs.
We introduce a novel formulation of the attribute-value prediction problem as a Seq2Seq summarization task.
SAGE is the first method able to tackle all aspects of the attribute-value-prediction task as they arise in practical settings in e-Commerce catalogs.
arXiv Detail & Related papers (2023-09-12T02:24:16Z) - Large Scale Generative Multimodal Attribute Extraction for E-commerce
Attributes [23.105116746332506]
E-commerce websites (e.g. Amazon) have a plethora of structured and unstructured information (text and images) present on the product pages.
Sellers often either don't label or mislabel values of the attributes (e.g. color, size etc.) for their products.
We present a scalable solution for this problem using textbfMXT, consisting of three key components.
arXiv Detail & Related papers (2023-06-01T06:21:45Z) - Towards Open-World Product Attribute Mining: A Lightly-Supervised
Approach [60.52087154731358]
We present a new task setting for attribute mining on e-commerce products.
We aim to expand the attribute vocabulary of existing seed types, and also to discover any new attribute types automatically.
Our approach surpasses various baselines by 12 F1, expanding attributes of existing types significantly by up to 12 times, and discovering values from 39% new types.
arXiv Detail & Related papers (2023-05-26T11:51:31Z) - MAVE: A Product Dataset for Multi-source Attribute Value Extraction [10.429320377835241]
We introduce MAVE, a new dataset to better facilitate research on product attribute value extraction.
MAVE is composed of a curated set of 2.2 million products from Amazon pages, with 3 million attribute-value annotations across 1257 unique categories.
We propose a novel approach that effectively extracts the attribute value from the multi-source product information.
arXiv Detail & Related papers (2021-12-16T06:48:31Z) - Make an Omelette with Breaking Eggs: Zero-Shot Learning for Novel
Attribute Synthesis [65.74825840440504]
We propose Zero Shot Learning for Attributes (ZSLA), which is the first of its kind to the best of our knowledge.
Our proposed method is able to synthesize the detectors of novel attributes in a zero-shot learning manner.
With using only 32 seen attributes on the Caltech-UCSD Birds-200-2011 dataset, our proposed method is able to synthesize other 207 novel attributes.
arXiv Detail & Related papers (2021-11-28T15:45:54Z) - Automatic Validation of Textual Attribute Values in E-commerce Catalog
by Learning with Limited Labeled Data [61.789797281676606]
We propose a novel meta-learning latent variable approach, called MetaBridge.
It can learn transferable knowledge from a subset of categories with limited labeled data.
It can capture the uncertainty of never-seen categories with unlabeled data.
arXiv Detail & Related papers (2020-06-15T21:31:05Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.