Synthetic Data Generation for Fraud Detection using GANs
- URL: http://arxiv.org/abs/2109.12546v1
- Date: Sun, 26 Sep 2021 09:51:44 GMT
- Title: Synthetic Data Generation for Fraud Detection using GANs
- Authors: Charitos Charitou, Simo Dragicevic, Artur d'Avila Garcez
- Abstract summary: Fraud detection related issues face the significant problem of the class imbalance.
In this paper we propose a novel system based on Generative Adrial Networks (GANs) for generating synthetic data.
Our framework Synthetic Data Generation GAN, manages to outperformed density based over-sampling methods.
- Score: 1.1816942730023885
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Detecting money laundering in gambling is becoming increasingly challenging
for the gambling industry as consumers migrate to online channels. Whilst
increasingly stringent regulations have been applied over the years to prevent
money laundering in gambling, despite this, online gambling is still a channel
for criminals to spend proceeds from crime. Complementing online gambling's
growth more concerns are raised to its effects compared with gambling in
traditional, physical formats, as it might introduce higher levels of problem
gambling or fraudulent behaviour due to its nature of immediate interaction
with online gambling experience. However, in most cases the main issue when
organisations try to tackle those areas is the absence of high quality data.
Since fraud detection related issues face the significant problem of the class
imbalance, in this paper we propose a novel system based on Generative
Adversarial Networks (GANs) for generating synthetic data in order to train a
supervised classifier. Our framework Synthetic Data Generation GAN (SDG-GAN),
manages to outperformed density based over-sampling methods and improve the
classification performance of benchmarks datasets and the real world gambling
fraud dataset.
Related papers
- Dynamic Fraud Detection: Integrating Reinforcement Learning into Graph Neural Networks [39.54354926067617]
Graph neural networks are a type of deep learning model that can utilize the interactive relationships within graph structures.
fraudulent activities only account for a very small part of transaction transfers.
fraudsters often disguise their behavior, which can have a negative impact on the final prediction results.
arXiv Detail & Related papers (2024-09-15T23:08:31Z) - Automatic detection of problem-gambling signs from online texts using
large language models [16.418288795462935]
Problem gambling is a major public health concern and is associated with profound psychological distress and economic problems.
There are numerous gambling communities on the internet where users exchange information about games, gambling tactics, as well as gambling-related problems.
Online gambling communities may provide insights into problem-gambling behaviour.
arXiv Detail & Related papers (2023-11-24T13:48:02Z) - Transaction Fraud Detection via an Adaptive Graph Neural Network [64.9428588496749]
We propose an Adaptive Sampling and Aggregation-based Graph Neural Network (ASA-GNN) that learns discriminative representations to improve the performance of transaction fraud detection.
A neighbor sampling strategy is performed to filter noisy nodes and supplement information for fraudulent nodes.
Experiments on three real financial datasets demonstrate that the proposed method ASA-GNN outperforms state-of-the-art ones.
arXiv Detail & Related papers (2023-07-11T07:48:39Z) - Who is Gambling? Finding Cryptocurrency Gamblers Using Multi-modal
Retrieval Methods [46.17004007514548]
We propose a tool termed ETHGamDet to discover gambling behaviors and identify the contracts and addresses involved in gambling.
The tool is able to automatically detect the smart contracts and addresses involved in gambling by scrutinizing the smart contract code and address transaction records.
We present a novel LightGBM model with memory components, which possesses the ability to learn from its own misclassifications.
arXiv Detail & Related papers (2022-11-27T10:07:13Z) - Adapting to Online Label Shift with Provable Guarantees [137.89382409682233]
We formulate and investigate the problem of online label shift.
The non-stationarity and lack of supervision make the problem challenging to be tackled.
Our algorithms enjoy optimal dynamic regret, indicating that performance is competitive with a clairvoyant nature.
arXiv Detail & Related papers (2022-07-05T15:43:14Z) - Modeling Users' Behavior Sequences with Hierarchical Explainable Network
for Cross-domain Fraud Detection [19.262529179023254]
We propose a Hierarchical Explainable Network (HEN) to model users' behavior sequences.
We also propose a transfer framework to tackle the cross-domain fraud detection problem.
Our proposed method is a general transfer framework that could not only be applied upon HEN but also various existing models.
arXiv Detail & Related papers (2022-01-04T06:37:16Z) - Application of Deep Reinforcement Learning to Payment Fraud [0.0]
A typical fraud detection system employs standard supervised learning methods where the focus is on maximizing the fraud recall rate.
We argue that such a formulation can lead to suboptimal solutions.
We formulate fraud detection as a sequential decision-making problem by including the utility within the model in the form of the reward function.
arXiv Detail & Related papers (2021-12-08T11:30:53Z) - Deep Fraud Detection on Non-attributed Graph [61.636677596161235]
Graph Neural Networks (GNNs) have shown solid performance on fraud detection.
labeled data is scarce in large-scale industrial problems, especially for fraud detection.
We propose a novel graph pre-training strategy to leverage more unlabeled data.
arXiv Detail & Related papers (2021-10-04T03:42:09Z) - Relational Graph Neural Networks for Fraud Detection in a Super-App
environment [53.561797148529664]
We propose a framework of relational graph convolutional networks methods for fraudulent behaviour prevention in the financial services of a Super-App.
We use an interpretability algorithm for graph neural networks to determine the most important relations to the classification task of the users.
Our results show that there is an added value when considering models that take advantage of the alternative data of the Super-App and the interactions found in their high connectivity.
arXiv Detail & Related papers (2021-07-29T00:02:06Z) - Fraud Detection using Data-Driven approach [0.0]
The first known online banking service came in 1980.
The ever increasing use of internet banking and a large number of online transactions increased fraudulent behavior also.
This research paper aims to construct an efficient fraud detection model which is adaptive to customer behavior changes.
arXiv Detail & Related papers (2020-09-08T20:58:51Z) - Enhancing Graph Neural Network-based Fraud Detectors against Camouflaged
Fraudsters [78.53851936180348]
We introduce two types of camouflages based on recent empirical studies, i.e., the feature camouflage and the relation camouflage.
Existing GNNs have not addressed these two camouflages, which results in their poor performance in fraud detection problems.
We propose a new model named CAmouflage-REsistant GNN (CARE-GNN) to enhance the GNN aggregation process with three unique modules against camouflages.
arXiv Detail & Related papers (2020-08-19T22:33:12Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.