Generative Pretraining at Scale: Transformer-Based Encoding of
Transactional Behavior for Fraud Detection
- URL: http://arxiv.org/abs/2312.14406v1
- Date: Fri, 22 Dec 2023 03:15:17 GMT
- Title: Generative Pretraining at Scale: Transformer-Based Encoding of
Transactional Behavior for Fraud Detection
- Authors: Ze Yu Zhao (1), Zheng Zhu (1), Guilin Li (1), Wenhan Wang (1), Bo Wang
(1) ((1) Tencent, WeChat Pay)
- Abstract summary: Our model confronts token explosion and reconstructs behavioral sequences, providing a nuanced understanding of transactional behavior.
We integrate a differential convolutional approach to enhance anomaly detection, bolstering the security and efficacy of one of the largest online payment merchants in China.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In this work, we introduce an innovative autoregressive model leveraging
Generative Pretrained Transformer (GPT) architectures, tailored for fraud
detection in payment systems. Our approach innovatively confronts token
explosion and reconstructs behavioral sequences, providing a nuanced
understanding of transactional behavior through temporal and contextual
analysis. Utilizing unsupervised pretraining, our model excels in feature
representation without the need for labeled data. Additionally, we integrate a
differential convolutional approach to enhance anomaly detection, bolstering
the security and efficacy of one of the largest online payment merchants in
China. The scalability and adaptability of our model promise broad
applicability in various transactional contexts.
Related papers
- Ethereum Fraud Detection via Joint Transaction Language Model and Graph Representation Learning [6.378807038086552]
Current fraud detection methods fail to consider the semantic information and similarity patterns within transactions.
We propose TLMG4Eth that combines a transaction language model with graph-based methods to capture semantic, similarity, and structural features of transaction data.
arXiv Detail & Related papers (2024-09-09T07:13:44Z) - Uniting contrastive and generative learning for event sequences models [51.547576949425604]
This study investigates the integration of two self-supervised learning techniques - instance-wise contrastive learning and a generative approach based on restoring masked events in latent space.
Experiments conducted on several public datasets, focusing on sequence classification and next-event type prediction, show that the integrated method achieves superior performance compared to individual approaches.
arXiv Detail & Related papers (2024-08-19T13:47:17Z) - Credit Card Fraud Detection Using Advanced Transformer Model [15.34892016767672]
This study focuses on innovative applications of the latest Transformer models for more robust and precise fraud detection.
We meticulously processed the data sources, balancing the dataset to address the issue of data sparsity significantly.
We conducted performance comparisons with several widely adopted models, including Support Vector Machine (SVM), Random Forest, Neural Network, and Logistic Regression.
arXiv Detail & Related papers (2024-06-06T04:12:57Z) - Utilizing GANs for Fraud Detection: Model Training with Synthetic
Transaction Data [0.0]
This paper explores the application of Generative Adversarial Networks (GANs) in fraud detection.
GANs have shown promise in modeling complex data distributions, making them effective tools for anomaly detection.
The study demonstrates the potential of GANs in enhancing transaction security through deep learning techniques.
arXiv Detail & Related papers (2024-02-15T09:48:20Z) - Towards a Foundation Purchasing Model: Pretrained Generative
Autoregression on Transaction Sequences [0.0]
We present a generative pretraining method that can be used to obtain contextualised embeddings of financial transactions.
We additionally perform large-scale pretraining of an embedding model using a corpus of data from 180 issuing banks containing 5.1 billion transactions.
arXiv Detail & Related papers (2024-01-03T09:32:48Z) - Refined Mechanism Design for Approximately Structured Priors via Active
Regression [50.71772232237571]
We consider the problem of a revenue-maximizing seller with a large number of items for sale to $n$ strategic bidders.
It is well-known that optimal and even approximately-optimal mechanisms for this setting are notoriously difficult to characterize or compute.
arXiv Detail & Related papers (2023-10-11T20:34:17Z) - Towards General Visual-Linguistic Face Forgery Detection [95.73987327101143]
Deepfakes are realistic face manipulations that can pose serious threats to security, privacy, and trust.
Existing methods mostly treat this task as binary classification, which uses digital labels or mask signals to train the detection model.
We propose a novel paradigm named Visual-Linguistic Face Forgery Detection(VLFFD), which uses fine-grained sentence-level prompts as the annotation.
arXiv Detail & Related papers (2023-07-31T10:22:33Z) - Transaction Fraud Detection via an Adaptive Graph Neural Network [64.9428588496749]
We propose an Adaptive Sampling and Aggregation-based Graph Neural Network (ASA-GNN) that learns discriminative representations to improve the performance of transaction fraud detection.
A neighbor sampling strategy is performed to filter noisy nodes and supplement information for fraudulent nodes.
Experiments on three real financial datasets demonstrate that the proposed method ASA-GNN outperforms state-of-the-art ones.
arXiv Detail & Related papers (2023-07-11T07:48:39Z) - Enhancing Multiple Reliability Measures via Nuisance-extended
Information Bottleneck [77.37409441129995]
In practical scenarios where training data is limited, many predictive signals in the data can be rather from some biases in data acquisition.
We consider an adversarial threat model under a mutual information constraint to cover a wider class of perturbations in training.
We propose an autoencoder-based training to implement the objective, as well as practical encoder designs to facilitate the proposed hybrid discriminative-generative training.
arXiv Detail & Related papers (2023-03-24T16:03:21Z) - Modeling Users' Behavior Sequences with Hierarchical Explainable Network
for Cross-domain Fraud Detection [19.262529179023254]
We propose a Hierarchical Explainable Network (HEN) to model users' behavior sequences.
We also propose a transfer framework to tackle the cross-domain fraud detection problem.
Our proposed method is a general transfer framework that could not only be applied upon HEN but also various existing models.
arXiv Detail & Related papers (2022-01-04T06:37:16Z) - A Probabilistic Formulation of Unsupervised Text Style Transfer [128.80213211598752]
We present a deep generative model for unsupervised text style transfer that unifies previously proposed non-generative techniques.
By hypothesizing a parallel latent sequence that generates each observed sequence, our model learns to transform sequences from one domain to another in a completely unsupervised fashion.
arXiv Detail & Related papers (2020-02-10T16:20:49Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.