ScoreGAN: A Fraud Review Detector based on Multi Task Learning of
Regulated GAN with Data Augmentation
- URL: http://arxiv.org/abs/2006.06561v2
- Date: Wed, 17 Mar 2021 14:18:15 GMT
- Title: ScoreGAN: A Fraud Review Detector based on Multi Task Learning of
Regulated GAN with Data Augmentation
- Authors: Saeedreza Shehnepoor, Roberto Togneri, Wei Liu, Mohammed Bennamoun
- Abstract summary: We propose ScoreGAN for fraud review detection that makes use of both review text and review rating scores in the generation and detection process.
Results show that the proposed framework outperformed the existing state-of-the-art framework, namely FakeGAN, in terms of AP by 7%, and 5% on the Yelp and TripAdvisor datasets.
- Score: 50.779498955162644
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The promising performance of Deep Neural Networks (DNNs) in text
classification, has attracted researchers to use them for fraud review
detection. However, the lack of trusted labeled data has limited the
performance of the current solutions in detecting fraud reviews. The Generative
Adversarial Network (GAN) as a semi-supervised method has demonstrated to be
effective for data augmentation purposes. The state-of-the-art solutions
utilize GANs to overcome the data scarcity problem. However, they fail to
incorporate the behavioral clues in fraud generation. Additionally,
state-of-the-art approaches overlook the possible bot-generated reviews in the
dataset. Finally, they also suffer from a common limitation in scalability and
stability of the GAN, slowing down the training procedure. In this work, we
propose ScoreGAN for fraud review detection that makes use of both review text
and review rating scores in the generation and detection process. Scores are
incorporated through Information Gain Maximization (IGM) into the loss function
for three reasons. One is to generate score-correlated reviews based on the
scores given to the generator. Second, the generated reviews are employed to
train the discriminator, so the discriminator can correctly label the possible
bot-generated reviews through joint representations learned from the
concatenation of GLobal Vector for Word representation (GLoVe) extracted from
the text and the score. Finally, it can be used to improve the stability and
scalability of the GAN. Results show that the proposed framework outperformed
the existing state-of-the-art framework, namely FakeGAN, in terms of AP by 7\%,
and 5\% on the Yelp and TripAdvisor datasets, respectively.
Related papers
- Analytical and Empirical Study of Herding Effects in Recommendation Systems [72.6693986712978]
We study how to manage product ratings via rating aggregation rules and shortlisted representative reviews.
We show that proper recency aware rating aggregation rules can improve the speed of convergence in Amazon and TripAdvisor.
arXiv Detail & Related papers (2024-08-20T14:29:23Z) - Graph Anomaly Detection with Noisy Labels by Reinforcement Learning [13.135788402192215]
We propose a novel framework REGAD, i.e., REinforced Graph Anomaly Detector.
Specifically, we aim to maximize the performance improvement (AUC) of a base detector by cutting noisy edges approximated through the nodes with high-confidence labels.
arXiv Detail & Related papers (2024-07-08T13:41:21Z) - AMRFact: Enhancing Summarization Factuality Evaluation with AMR-Driven Negative Samples Generation [57.8363998797433]
We propose AMRFact, a framework that generates perturbed summaries using Abstract Meaning Representations (AMRs)
Our approach parses factually consistent summaries into AMR graphs and injects controlled factual inconsistencies to create negative examples, allowing for coherent factually inconsistent summaries to be generated with high error-type coverage.
arXiv Detail & Related papers (2023-11-16T02:56:29Z) - Transaction Fraud Detection via an Adaptive Graph Neural Network [64.9428588496749]
We propose an Adaptive Sampling and Aggregation-based Graph Neural Network (ASA-GNN) that learns discriminative representations to improve the performance of transaction fraud detection.
A neighbor sampling strategy is performed to filter noisy nodes and supplement information for fraudulent nodes.
Experiments on three real financial datasets demonstrate that the proposed method ASA-GNN outperforms state-of-the-art ones.
arXiv Detail & Related papers (2023-07-11T07:48:39Z) - Label Information Enhanced Fraud Detection against Low Homophily in
Graphs [24.170070133328277]
We propose GAGA, a novel Group AGgregation enhanced TrAnsformer, to tackle the above challenges.
GAGA outperforms other competitive graph-based fraud detectors by up to 24.39% on two trending public datasets and a real-world industrial dataset from Anonymous.
arXiv Detail & Related papers (2023-02-21T02:42:28Z) - Spatio-Temporal Graph Representation Learning for Fraudster Group
Detection [50.779498955162644]
Companies may hire fraudster groups to write fake reviews to either demote competitors or promote their own businesses.
To detect such groups, a common model is to represent fraudster groups' static networks.
We propose to first capitalize on the effectiveness of the HIN-RNN in both reviewers' representation learning.
arXiv Detail & Related papers (2022-01-07T08:01:38Z) - HIN-RNN: A Graph Representation Learning Neural Network for Fraudster
Group Detection With No Handcrafted Features [42.30892608083864]
We propose the first neural approach, HIN-RNN, a Heterogeneous Information Network (HIN) compatible RNN for fraudster group detection.
HIN-RNN provides a unifying architecture for learning text representation of each reviewer, with the initial vector as the sum of word embeddings of all review written by the same reviewer.
The proposed approach is confirmed to be effective with marked improvement over state-of-the-art approaches on both the Yelp (22% and 12% in terms of recall and F1-value, respectively) and Amazon (4% and 2% in terms of recall and F1-value, respectively
arXiv Detail & Related papers (2021-05-25T01:48:28Z) - DFraud3- Multi-Component Fraud Detection freeof Cold-start [50.779498955162644]
The Cold-start is a significant problem referring to the failure of a detection system to recognize the authenticity of a new user.
In this paper, we model a review system as a Heterogeneous InformationNetwork (HIN) which enables a unique representation to every component.
HIN with graph induction helps to address the camouflage issue (fraudsterswith genuine reviews) which has shown to be more severe when it is coupled with cold-start, i.e., new fraudsters with genuine first reviews.
arXiv Detail & Related papers (2020-06-10T08:20:13Z) - xAI-GAN: Enhancing Generative Adversarial Networks via Explainable AI
Systems [16.360144499713524]
Generative Adversarial Networks (GANs) are a revolutionary class of Deep Neural Networks (DNNs) that have been successfully used to generate realistic images, music, text, and other data.
We propose a new class of GAN that leverages recent advances in explainable AI (xAI) systems to provide a "richer" form of corrective feedback from discriminators to generators.
We observe xAI-GANs provide an improvement of up to 23.18% in the quality of generated images on both MNIST and FMNIST datasets over standard GANs.
arXiv Detail & Related papers (2020-02-24T18:38:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.