Pattern Mining for Anomaly Detection in Graphs: Application to Fraud in
Public Procurement
- URL: http://arxiv.org/abs/2306.10857v1
- Date: Mon, 19 Jun 2023 11:18:55 GMT
- Title: Pattern Mining for Anomaly Detection in Graphs: Application to Fraud in
Public Procurement
- Authors: Lucas Potin (LIA), Rosa Figueiredo (LIA), Vincent Labatut (LIA),
Christine Largeron (LHC)
- Abstract summary: In public procurement, several indicators called red flags are used to estimate fraud risk.
These attributes are very often missing in practice, which prohibits red flags.
In this work, we adopt a graph-based method allowing leveraging relations between contracts, to compensate for the missing attributes.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In the context of public procurement, several indicators called red flags are
used to estimate fraud risk. They are computed according to certain contract
attributes and are therefore dependent on the proper filling of the contract
and award notices. However, these attributes are very often missing in
practice, which prohibits red flags computation. Traditional fraud detection
approaches focus on tabular data only, considering each contract separately,
and are therefore very sensitive to this issue. In this work, we adopt a
graph-based method allowing leveraging relations between contracts, to
compensate for the missing attributes. We propose PANG (Pattern-Based Anomaly
Detection in Graphs), a general supervised framework relying on pattern
extraction to detect anomalous graphs in a collection of attributed graphs.
Notably, it is able to identify induced subgraphs, a type of pattern widely
overlooked in the literature. When benchmarked on standard datasets, its
predictive performance is on par with state-of-the-art methods, with the
additional advantage of being explainable. These experiments also reveal that
induced patterns are more discriminative on certain datasets. When applying
PANG to public procurement data, the prediction is superior to other methods,
and it identifies subgraph patterns that are characteristic of fraud-prone
situations, thereby making it possible to better understand fraudulent
behavior.
Related papers
- Enhancing Fairness in Unsupervised Graph Anomaly Detection through Disentanglement [33.565252991113766]
Graph anomaly detection (GAD) is increasingly crucial in various applications, ranging from financial fraud detection to fake news detection.
Current GAD methods largely overlook the fairness problem, which might result in discriminatory decisions skewed toward certain demographic groups.
We devise a novel DisEntangle-based FairnEss-aware aNomaly Detection framework on the attributed graph, named DEFEND.
Our empirical evaluations on real-world datasets reveal that DEFEND performs effectively in GAD and significantly enhances fairness compared to state-of-the-art baselines.
arXiv Detail & Related papers (2024-06-03T04:48:45Z) - Anomaly Detection by Context Contrasting [57.695202846009714]
Anomaly detection focuses on identifying samples that deviate from the norm.
Recent advances in self-supervised learning have shown great promise in this regard.
We propose Con$$, which learns through context augmentations.
arXiv Detail & Related papers (2024-05-29T07:59:06Z) - Towards Self-Interpretable Graph-Level Anomaly Detection [73.1152604947837]
Graph-level anomaly detection (GLAD) aims to identify graphs that exhibit notable dissimilarity compared to the majority in a collection.
We propose a Self-Interpretable Graph aNomaly dETection model ( SIGNET) that detects anomalous graphs as well as generates informative explanations simultaneously.
arXiv Detail & Related papers (2023-10-25T10:10:07Z) - BOURNE: Bootstrapped Self-supervised Learning Framework for Unified
Graph Anomaly Detection [50.26074811655596]
We propose a novel unified graph anomaly detection framework based on bootstrapped self-supervised learning (named BOURNE)
By swapping the context embeddings between nodes and edges, we enable the mutual detection of node and edge anomalies.
BOURNE can eliminate the need for negative sampling, thereby enhancing its efficiency in handling large graphs.
arXiv Detail & Related papers (2023-07-28T00:44:57Z) - The Devil is in the Conflict: Disentangled Information Graph Neural
Networks for Fraud Detection [17.254383007779616]
We argue that the performance degradation is mainly attributed to the inconsistency between topology and attribute.
We propose a simple and effective method that uses the attention mechanism to adaptively fuse two views.
Our model can significantly outperform stateof-the-art baselines on real-world fraud detection datasets.
arXiv Detail & Related papers (2022-10-22T08:21:49Z) - From Unsupervised to Few-shot Graph Anomaly Detection: A Multi-scale Contrastive Learning Approach [26.973056364587766]
Anomaly detection from graph data is an important data mining task in many applications such as social networks, finance, and e-commerce.
We propose a novel framework, graph ANomaly dEtection framework with Multi-scale cONtrastive lEarning (ANEMONE in short)
By using a graph neural network as a backbone to encode the information from multiple graph scales (views), we learn better representation for nodes in a graph.
arXiv Detail & Related papers (2022-02-11T09:45:11Z) - Applications of Signature Methods to Market Anomaly Detection [1.911678487931003]
We present applications of signature or randomized signature as feature extractors for anomaly detection algorithms.
We show a real life application by using transaction data from the cryptocurrency market.
In this case, we are able to identify pump and dump attempts organized on social networks with F1 scores up to 88%.
arXiv Detail & Related papers (2022-01-07T13:05:43Z) - Deep Fraud Detection on Non-attributed Graph [61.636677596161235]
Graph Neural Networks (GNNs) have shown solid performance on fraud detection.
labeled data is scarce in large-scale industrial problems, especially for fraud detection.
We propose a novel graph pre-training strategy to leverage more unlabeled data.
arXiv Detail & Related papers (2021-10-04T03:42:09Z) - Handling Missing Data with Graph Representation Learning [62.59831675688714]
We propose GRAPE, a graph-based framework for feature imputation as well as label prediction.
Under GRAPE, the feature imputation is formulated as an edge-level prediction task and the label prediction as a node-level prediction task.
Experimental results on nine benchmark datasets show that GRAPE yields 20% lower mean absolute error for imputation tasks and 10% lower for label prediction tasks.
arXiv Detail & Related papers (2020-10-30T17:59:13Z) - Graph topology inference benchmarks for machine learning [16.857405938139525]
We introduce several benchmarks specifically designed to reveal the relative merits and limitations of graph inference methods.
We also contrast some of the most prominent techniques in the literature.
arXiv Detail & Related papers (2020-07-16T09:40:32Z) - Structured Prediction with Partial Labelling through the Infimum Loss [85.4940853372503]
The goal of weak supervision is to enable models to learn using only forms of labelling which are cheaper to collect.
This is a type of incomplete annotation where, for each datapoint, supervision is cast as a set of labels containing the real one.
This paper provides a unified framework based on structured prediction and on the concept of infimum loss to deal with partial labelling.
arXiv Detail & Related papers (2020-03-02T13:59:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.