Corporate Fraud Detection in Rich-yet-Noisy Financial Graph
- URL: http://arxiv.org/abs/2502.19305v1
- Date: Wed, 26 Feb 2025 17:05:54 GMT
- Title: Corporate Fraud Detection in Rich-yet-Noisy Financial Graph
- Authors: Shiqi Wang, Zhibo Zhang, Libing Fang, Cam-Tu Nguyen, Wenzhon Li,
- Abstract summary: Corporate fraud detection aims to automatically recognize companies that conduct wrongful activities such as fraudulent financial statements or illegal insider trading.<n>Previous learning-based methods fail to effectively integrate rich interactions in the company network.<n>We analyze 18-year financial records in China to form three graph datasets with fraud labels.
- Score: 10.493486554714885
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Corporate fraud detection aims to automatically recognize companies that conduct wrongful activities such as fraudulent financial statements or illegal insider trading. Previous learning-based methods fail to effectively integrate rich interactions in the company network. To close this gap, we collect 18-year financial records in China to form three graph datasets with fraud labels. We analyze the characteristics of the financial graphs, highlighting two pronounced issues: (1) information overload: the dominance of (noisy) non-company nodes over company nodes hinders the message-passing process in Graph Convolution Networks (GCN); and (2) hidden fraud: there exists a large percentage of possible undetected violations in the collected data. The hidden fraud problem will introduce noisy labels in the training dataset and compromise fraud detection results. To handle such challenges, we propose a novel graph-based method, namely, Knowledge-enhanced GCN with Robust Two-stage Learning (${\rm KeGCN}_{R}$), which leverages Knowledge Graph Embeddings to mitigate the information overload and effectively learns rich representations. The proposed model adopts a two-stage learning method to enhance robustness against hidden frauds. Extensive experimental results not only confirm the importance of interactions but also show the superiority of ${\rm KeGCN}_{R}$ over a number of strong baselines in terms of fraud detection effectiveness and robustness.
Related papers
- A Label-Free Heterophily-Guided Approach for Unsupervised Graph Fraud Detection [60.09453163562244]
We propose a Heterophily-guided Unsupervised Graph fraud dEtection approach (HUGE) for unsupervised GFD.<n>In the estimation module, we design a novel label-free heterophily metric called HALO, which captures the critical graph properties for GFD.<n>In the alignment-based fraud detection module, we develop a joint-GNN architecture with ranking loss and asymmetric alignment loss.
arXiv Detail & Related papers (2025-02-18T22:07:36Z) - Financial Fraud Detection using Jump-Attentive Graph Neural Networks [0.0]
A significant portion of the financial services sector employs various machine learning algorithms, such as XGBoost, Random Forest, and neural networks, to model transaction data.
We propose a novel algorithm that employs an efficient neighborhood sampling method, effective for camouflage detection and preserving crucial feature information from non-similar nodes.
arXiv Detail & Related papers (2024-11-07T05:12:51Z) - Heterogeneous Graph Auto-Encoder for CreditCard Fraud Detection [0.7864304771129751]
This paper proposes a novel approach for credit card fraud detection using Graph Neural Networks (GNNs) with attention mechanisms applied to heterogeneous graph representations of financial data.
The proposed model outperforms benchmark algorithms such as Graph Sage and FI-GRL, achieving a superior AUC-PR of 0.89 and an F1-score of 0.81.
arXiv Detail & Related papers (2024-10-10T17:05:27Z) - Combating Bilateral Edge Noise for Robust Link Prediction [56.43882298843564]
We propose an information-theory-guided principle, Robust Graph Information Bottleneck (RGIB), to extract reliable supervision signals and avoid representation collapse.
Two instantiations, RGIB-SSL and RGIB-REP, are explored to leverage the merits of different methodologies.
Experiments on six datasets and three GNNs with diverse noisy scenarios verify the effectiveness of our RGIB instantiations.
arXiv Detail & Related papers (2023-11-02T12:47:49Z) - Transaction Fraud Detection via an Adaptive Graph Neural Network [64.9428588496749]
We propose an Adaptive Sampling and Aggregation-based Graph Neural Network (ASA-GNN) that learns discriminative representations to improve the performance of transaction fraud detection.
A neighbor sampling strategy is performed to filter noisy nodes and supplement information for fraudulent nodes.
Experiments on three real financial datasets demonstrate that the proposed method ASA-GNN outperforms state-of-the-art ones.
arXiv Detail & Related papers (2023-07-11T07:48:39Z) - Company-as-Tribe: Company Financial Risk Assessment on Tribe-Style Graph
with Hierarchical Graph Neural Networks [62.94317686301643]
Company financial risk is ubiquitous and early risk assessment for listed companies can avoid considerable losses.
Traditional methods mainly focus on the financial statements of companies and lack the complex relationships among them.
We propose a novel Hierarchical Graph Neural Network (TH-GNN) for Tribe-style graphs via two levels, with the first level to encode the structure pattern of the tribes with contrastive learning, and the second level to diffuse information based on the inter-tribe relations.
arXiv Detail & Related papers (2023-01-31T09:17:13Z) - Deep Fraud Detection on Non-attributed Graph [61.636677596161235]
Graph Neural Networks (GNNs) have shown solid performance on fraud detection.
labeled data is scarce in large-scale industrial problems, especially for fraud detection.
We propose a novel graph pre-training strategy to leverage more unlabeled data.
arXiv Detail & Related papers (2021-10-04T03:42:09Z) - Relational Graph Neural Networks for Fraud Detection in a Super-App
environment [53.561797148529664]
We propose a framework of relational graph convolutional networks methods for fraudulent behaviour prevention in the financial services of a Super-App.
We use an interpretability algorithm for graph neural networks to determine the most important relations to the classification task of the users.
Our results show that there is an added value when considering models that take advantage of the alternative data of the Super-App and the interactions found in their high connectivity.
arXiv Detail & Related papers (2021-07-29T00:02:06Z) - Identifying Linked Fraudulent Activities Using GraphConvolution Network [0.0]
We present a novel approach to identify linked fraudulent activities using Graph Convolution Network (GCN)
GCNs learn similarities between fraudulent nodes to identify clusters of similar attempts and require much smaller dataset to learn.
Our results outperform label propagation community detection and supervised GBTs algorithms in terms of solution quality and time.
arXiv Detail & Related papers (2021-06-05T09:56:08Z) - A Semi-supervised Graph Attentive Network for Financial Fraud Detection [30.645390612737266]
We propose a semi-supervised attentive graph neural network, namedSemiSemiGNN, to utilize the multi-view labeled and unlabeled data for fraud detection.
By utilizing the social relations and the user attributes, our method can achieve a better accuracy compared with the state-of-the-art methods on two tasks.
arXiv Detail & Related papers (2020-02-28T10:35:25Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.