Developing a Component Comment Extractor from Product Reviews on
E-Commerce Sites
- URL: http://arxiv.org/abs/2207.05979v1
- Date: Wed, 13 Jul 2022 06:25:55 GMT
- Title: Developing a Component Comment Extractor from Product Reviews on
E-Commerce Sites
- Authors: Shogo Anda, Masato Kikuchi, Tadachika Ozono
- Abstract summary: We develop a system that identifies and collects component and aspect information of products in sentences.
Our BERT-based classifiers assign labels referring to components and aspects to sentences in reviews.
Our data augmentation method can improve the-F1-measure on insufficient data from 0.66 to 0.76.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Consumers often read product reviews to inform their buying decision, as some
consumers want to know a specific component of a product. However, because
typical sentences on product reviews contain various details, users must
identify sentences about components they want to know amongst the many reviews.
Therefore, we aimed to develop a system that identifies and collects component
and aspect information of products in sentences. Our BERT-based classifiers
assign labels referring to components and aspects to sentences in reviews and
extract sentences with comments on specific components and aspects. We
determined proper labels based for the words identified through pattern
matching from product reviews to create the training data. Because we could not
use the words as labels, we carefully created labels covering the meanings of
the words. However, the training data was imbalanced on component and aspect
pairs. We introduced a data augmentation method using WordNet to reduce the
bias. Our evaluation demonstrates that the system can determine labels for road
bikes using pattern matching, covering more than 88\% of the indicators of
components and aspects on e-commerce sites. Moreover, our data augmentation
method can improve the-F1-measure on insufficient data from 0.66 to 0.76.
Related papers
- A Usage-centric Take on Intent Understanding in E-Commerce [20.648271216249977]
We focus on predicative user intents as "how a customer uses a product"
We identify two weaknesses of FolkScope, the SOTA E-Commerce Intent Graph.
They limit its ability to strongly align user intents with products having the most desirable property.
arXiv Detail & Related papers (2024-02-22T18:09:33Z) - Product Information Extraction using ChatGPT [69.12244027050454]
This paper explores the potential of ChatGPT for extracting attribute/value pairs from product descriptions.
Our results show that ChatGPT achieves a performance similar to a pre-trained language model but requires much smaller amounts of training data and computation for fine-tuning.
arXiv Detail & Related papers (2023-06-23T09:30:01Z) - Description-Enhanced Label Embedding Contrastive Learning for Text
Classification [65.01077813330559]
Self-Supervised Learning (SSL) in model learning process and design a novel self-supervised Relation of Relation (R2) classification task.
Relation of Relation Learning Network (R2-Net) for text classification, in which text classification and R2 classification are treated as optimization targets.
external knowledge from WordNet to obtain multi-aspect descriptions for label semantic learning.
arXiv Detail & Related papers (2023-06-15T02:19:34Z) - Unsupervised Ranking and Aggregation of Label Descriptions for Zero-Shot
Classifiers [8.434227773463022]
In a true zero-shot setup, designing good label descriptions is challenging because no development set is available.
We look at how probabilistic models of repeated rating analysis can be used for selecting the best label descriptions in an unsupervised fashion.
arXiv Detail & Related papers (2022-04-20T14:23:09Z) - Fine-Grained Element Identification in Complaint Text of Internet Fraud [47.249423877146604]
We propose to analyse the complaint text of internet fraud in a fine-grained manner.
Considering the complaint text includes multiple clauses with various functions, we propose to identify the role of each clause and classify them into different types of fraud element.
arXiv Detail & Related papers (2021-08-19T13:33:09Z) - Abstractive Opinion Tagging [65.47649273721679]
In e-commerce, opinion tags refer to a ranked list of tags provided by the e-commerce platform that reflect characteristics of reviews of an item.
Current mechanisms for generating opinion tags rely on either manual or labelling methods, which is time-consuming and ineffective.
We propose an abstractive opinion tagging framework, named AOT-Net, to generate a ranked list of opinion tags given a large number of reviews.
arXiv Detail & Related papers (2021-01-18T05:08:15Z) - Automatic Validation of Textual Attribute Values in E-commerce Catalog
by Learning with Limited Labeled Data [61.789797281676606]
We propose a novel meta-learning latent variable approach, called MetaBridge.
It can learn transferable knowledge from a subset of categories with limited labeled data.
It can capture the uncertainty of never-seen categories with unlabeled data.
arXiv Detail & Related papers (2020-06-15T21:31:05Z) - A Unified Dual-view Model for Review Summarization and Sentiment
Classification with Inconsistency Loss [51.448615489097236]
Acquiring accurate summarization and sentiment from user reviews is an essential component of modern e-commerce platforms.
We propose a novel dual-view model that jointly improves the performance of these two tasks.
Experiment results on four real-world datasets from different domains demonstrate the effectiveness of our model.
arXiv Detail & Related papers (2020-06-02T13:34:11Z) - Sentiment Analysis Using Averaged Weighted Word Vector Features [1.2691047660244332]
We develop two methods that combine different types of word vectors to learn and estimate polarity of reviews.
We apply the methods to several datasets from different domains that are used as standard benchmarks for sentiment analysis.
arXiv Detail & Related papers (2020-02-13T16:30:34Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.