Detecting Spam Reviews on Vietnamese E-commerce Websites
- URL: http://arxiv.org/abs/2207.14636v1
- Date: Wed, 27 Jul 2022 10:37:14 GMT
- Title: Detecting Spam Reviews on Vietnamese E-commerce Websites
- Authors: Co Van Dinh, Son T. Luu and Anh Gia-Tuan Nguyen
- Abstract summary: We propose the dataset called ViSpamReviews, which has a strict annotation procedure for detecting spam reviews on e-commerce platforms.
Our dataset consists of two tasks: the binary classification task for detecting whether a review is a spam or not and the multi-class classification task for identifying the type of spam.
The PhoBERT obtained the highest results on both tasks, 88.93% and 72.17%, respectively, by macro average F1 score.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: The reviews of customers play an essential role in online shopping. People
often refer to reviews or comments of previous customers to decide whether to
buy a new product. Catching up with this behavior, some people create untruths
and illegitimate reviews to hoax customers about the fake quality of products.
These reviews are called spam reviews, which confuse consumers on online
shopping platforms and negatively affect online shopping behaviors. We propose
the dataset called ViSpamReviews, which has a strict annotation procedure for
detecting spam reviews on e-commerce platforms. Our dataset consists of two
tasks: the binary classification task for detecting whether a review is a spam
or not and the multi-class classification task for identifying the type of
spam. The PhoBERT obtained the highest results on both tasks, 88.93% and
72.17%, respectively, by macro average F1 score.
Related papers
- What Matters in Explanations: Towards Explainable Fake Review Detection Focusing on Transformers [45.55363754551388]
Customers' reviews and feedback play crucial role on e-commerce platforms like Amazon, Zalando, and eBay.
There is a prevailing concern that sellers often post fake or spam reviews to deceive potential customers and manipulate their opinions about a product.
We propose an explainable framework for detecting fake reviews with high precision in identifying fraudulent content with explanations.
arXiv Detail & Related papers (2024-07-24T13:26:02Z) - Metadata Integration for Spam Reviews Detection on Vietnamese E-commerce Websites [0.0]
We introduce the ViSpamReviews v2 dataset, which includes metadata of reviews.
We propose a novel approach to simultaneously integrate both textual and categorical attributes into the classification model.
arXiv Detail & Related papers (2024-05-22T02:19:13Z) - On the Role of Reviewer Expertise in Temporal Review Helpfulness
Prediction [5.381004207943597]
Existing methods for identifying helpful reviews primarily focus on review text and ignore the two key factors of (1) who post the reviews and (2) when the reviews are posted.
We introduce a dataset and develop a model that integrates the reviewer's expertise, derived from the past review history, and the temporal dynamics of the reviews to automatically assess review helpfulness.
arXiv Detail & Related papers (2023-02-22T23:41:22Z) - Combat AI With AI: Counteract Machine-Generated Fake Restaurant Reviews
on Social Media [77.34726150561087]
We propose to leverage the high-quality elite Yelp reviews to generate fake reviews from the OpenAI GPT review creator.
We apply the model to predict non-elite reviews and identify the patterns across several dimensions.
We show that social media platforms are continuously challenged by machine-generated fake reviews.
arXiv Detail & Related papers (2023-02-10T19:40:10Z) - A Dataset on Malicious Paper Bidding in Peer Review [84.68308372858755]
Malicious reviewers strategically bid in order to unethically manipulate the paper assignment.
A critical impediment towards creating and evaluating methods to mitigate this issue is the lack of publicly-available data on malicious paper bidding.
We release a novel dataset, collected from a mock conference activity where participants were instructed to bid either honestly or maliciously.
arXiv Detail & Related papers (2022-06-24T20:23:33Z) - Attention-based Bidirectional LSTM for Deceptive Opinion Spam
Classification [0.0]
Fraudulent reviews are deliberately posted on various online review platforms to trick customers to buy, visit or distract against a product or a restaurant.
The work aims at detecting and classifying the reviews as deceptive or truthful.
It involves use of various deep learning techniques for classifying the reviews and an overview of proposed approach.
arXiv Detail & Related papers (2021-12-29T19:02:04Z) - Identifying Hijacked Reviews [21.600163117735857]
Review Hijacking is a new review manipulation tactic in which unethical sellers "hijack" an existing product page.
We propose a framework to generate synthetically labeled data for review hijacking by swapping products and reviews.
We then evaluate the potential of both a Twin LSTM network and BERT sequence pair to distinguish legitimate reviews from hijacked ones using this data.
arXiv Detail & Related papers (2021-07-07T20:43:36Z) - Abstractive Opinion Tagging [65.47649273721679]
In e-commerce, opinion tags refer to a ranked list of tags provided by the e-commerce platform that reflect characteristics of reviews of an item.
Current mechanisms for generating opinion tags rely on either manual or labelling methods, which is time-consuming and ineffective.
We propose an abstractive opinion tagging framework, named AOT-Net, to generate a ranked list of opinion tags given a large number of reviews.
arXiv Detail & Related papers (2021-01-18T05:08:15Z) - Robust Spammer Detection by Nash Reinforcement Learning [64.80986064630025]
We develop a minimax game where the spammers and spam detectors compete with each other on their practical goals.
We show that an optimization algorithm can reliably find an equilibrial detector that can robustly prevent spammers with any mixed spamming strategies from attaining their practical goal.
arXiv Detail & Related papers (2020-06-10T21:18:07Z) - DFraud3- Multi-Component Fraud Detection freeof Cold-start [50.779498955162644]
The Cold-start is a significant problem referring to the failure of a detection system to recognize the authenticity of a new user.
In this paper, we model a review system as a Heterogeneous InformationNetwork (HIN) which enables a unique representation to every component.
HIN with graph induction helps to address the camouflage issue (fraudsterswith genuine reviews) which has shown to be more severe when it is coupled with cold-start, i.e., new fraudsters with genuine first reviews.
arXiv Detail & Related papers (2020-06-10T08:20:13Z) - How do online consumers review negatively? [1.52292571922932]
Using 1, 450, 000 negative reviews from JD.com, the largest B2C platform in China, the behavioral patterns from temporal, perceptional and emotional perspectives are explored.
Consumers from lower levels express more intensive negative feelings, especially on product pricing and seller attitudes.
The value of negative reviews from higher-level consumers is unexpectedly highlighted because of less emotionalization and less biased narration.
arXiv Detail & Related papers (2020-04-28T12:54:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.