Label Information Enhanced Fraud Detection against Low Homophily in
Graphs
- URL: http://arxiv.org/abs/2302.10407v1
- Date: Tue, 21 Feb 2023 02:42:28 GMT
- Title: Label Information Enhanced Fraud Detection against Low Homophily in
Graphs
- Authors: Yuchen Wang, Jinghui Zhang, Zhengjie Huang, Weibin Li, Shikun Feng,
Ziheng Ma, Yu Sun, Dianhai Yu, Fang Dong, Jiahui Jin, Beilun Wang and Junzhou
Luo
- Abstract summary: We propose GAGA, a novel Group AGgregation enhanced TrAnsformer, to tackle the above challenges.
GAGA outperforms other competitive graph-based fraud detectors by up to 24.39% on two trending public datasets and a real-world industrial dataset from Anonymous.
- Score: 24.170070133328277
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Node classification is a substantial problem in graph-based fraud detection.
Many existing works adopt Graph Neural Networks (GNNs) to enhance fraud
detectors. While promising, currently most GNN-based fraud detectors fail to
generalize to the low homophily setting. Besides, label utilization has been
proved to be significant factor for node classification problem. But we find
they are less effective in fraud detection tasks due to the low homophily in
graphs. In this work, we propose GAGA, a novel Group AGgregation enhanced
TrAnsformer, to tackle the above challenges. Specifically, the group
aggregation provides a portable method to cope with the low homophily issue.
Such an aggregation explicitly integrates the label information to generate
distinguishable neighborhood information. Along with group aggregation, an
attempt towards end-to-end trainable group encoding is proposed which augments
the original feature space with the class labels. Meanwhile, we devise two
additional learnable encodings to recognize the structural and relational
context. Then, we combine the group aggregation and the learnable encodings
into a Transformer encoder to capture the semantic information. Experimental
results clearly show that GAGA outperforms other competitive graph-based fraud
detectors by up to 24.39% on two trending public datasets and a real-world
industrial dataset from Anonymous. Even more, the group aggregation is
demonstrated to outperform other label utilization methods (e.g., C&S,
BoT/UniMP) in the low homophily setting.
Related papers
- LOSS-GAT: Label Propagation and One-Class Semi-Supervised Graph
Attention Network for Fake News Detection [2.6396287656676725]
Loss-GAT is a semi-supervised and one-class approach for fake news detection.
We employ a two-step label propagation algorithm to categorize news into two groups: interest (fake) and non-interest (real)
arXiv Detail & Related papers (2024-02-13T12:02:37Z) - Revisiting Graph-Based Fraud Detection in Sight of Heterophily and Spectrum [26.62679288320554]
Graph-based fraud detection (GFD) can be regarded as a challenging semi-supervised node binary classification task.
This paper proposes a semi-supervised GNN-based fraud detector SEC-GFD.
The comprehensive experimental results on four real-world fraud detection datasets denote that SEC-GFD outperforms other competitive graph-based fraud detectors.
arXiv Detail & Related papers (2023-12-11T15:18:51Z) - CONVERT:Contrastive Graph Clustering with Reliable Augmentation [110.46658439733106]
We propose a novel CONtrastiVe Graph ClustEring network with Reliable AugmenTation (CONVERT)
In our method, the data augmentations are processed by the proposed reversible perturb-recover network.
To further guarantee the reliability of semantics, a novel semantic loss is presented to constrain the network.
arXiv Detail & Related papers (2023-08-17T13:07:09Z) - Graph Anomaly Detection at Group Level: A Topology Pattern Enhanced
Unsupervised Approach [25.383587951822964]
This paper introduces a novel unsupervised framework for a new task called Group-level Graph Anomaly Detection (Gr-GAD)
The proposed framework first employs a variant of Graph AutoEncoder (GAE) to locate anchor nodes that belong to potential anomaly groups by capturing long-range inconsistencies.
The experimental results on both real-world and synthetic datasets demonstrate that the proposed framework shows superior performance in identifying and localizing anomaly groups.
arXiv Detail & Related papers (2023-08-02T10:22:04Z) - Pseudo Contrastive Learning for Graph-based Semi-supervised Learning [67.37572762925836]
Pseudo Labeling is a technique used to improve the performance of Graph Neural Networks (GNNs)
We propose a general framework for GNNs, termed Pseudo Contrastive Learning (PCL)
arXiv Detail & Related papers (2023-02-19T10:34:08Z) - Informative Pseudo-Labeling for Graph Neural Networks with Few Labels [12.83841767562179]
Graph Neural Networks (GNNs) have achieved state-of-the-art results for semi-supervised node classification on graphs.
The challenge of how to effectively learn GNNs with very few labels is still under-explored.
We propose a novel informative pseudo-labeling framework, called InfoGNN, to facilitate learning of GNNs with extremely few labels.
arXiv Detail & Related papers (2022-01-20T01:49:30Z) - Deep Fraud Detection on Non-attributed Graph [61.636677596161235]
Graph Neural Networks (GNNs) have shown solid performance on fraud detection.
labeled data is scarce in large-scale industrial problems, especially for fraud detection.
We propose a novel graph pre-training strategy to leverage more unlabeled data.
arXiv Detail & Related papers (2021-10-04T03:42:09Z) - Unified Robust Training for Graph NeuralNetworks against Label Noise [12.014301020294154]
We propose a new framework, UnionNET, for learning with noisy labels on graphs under a semi-supervised setting.
Our approach provides a unified solution for robustly training GNNs and performing label correction simultaneously.
arXiv Detail & Related papers (2021-03-05T01:17:04Z) - Exploiting Heterogeneous Graph Neural Networks with Latent Worker/Task
Correlation Information for Label Aggregation in Crowdsourcing [72.34616482076572]
Crowdsourcing has attracted much attention for its convenience to collect labels from non-expert workers instead of experts.
We propose a novel framework based on graph neural networks for aggregating crowd labels.
arXiv Detail & Related papers (2020-10-25T10:12:37Z) - On the Equivalence of Decoupled Graph Convolution Network and Label
Propagation [60.34028546202372]
Some work shows that coupling is inferior to decoupling, which supports deep graph propagation better.
Despite effectiveness, the working mechanisms of the decoupled GCN are not well understood.
We propose a new label propagation method named propagation then training Adaptively (PTA), which overcomes the flaws of the decoupled GCN.
arXiv Detail & Related papers (2020-10-23T13:57:39Z) - ScoreGAN: A Fraud Review Detector based on Multi Task Learning of
Regulated GAN with Data Augmentation [50.779498955162644]
We propose ScoreGAN for fraud review detection that makes use of both review text and review rating scores in the generation and detection process.
Results show that the proposed framework outperformed the existing state-of-the-art framework, namely FakeGAN, in terms of AP by 7%, and 5% on the Yelp and TripAdvisor datasets.
arXiv Detail & Related papers (2020-06-11T16:15:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.