Machine learning methods to detect money laundering in the Bitcoin
blockchain in the presence of label scarcity
- URL: http://arxiv.org/abs/2005.14635v2
- Date: Tue, 5 Oct 2021 10:33:23 GMT
- Title: Machine learning methods to detect money laundering in the Bitcoin
blockchain in the presence of label scarcity
- Authors: Joana Lorenz, Maria In\^es Silva, David Apar\'icio, Jo\~ao Tiago
Ascens\~ao, Pedro Bizarro
- Abstract summary: We show that existing state-of-the-art solutions using unsupervised anomaly detection methods are inadequate to detect the illicit patterns in a real Bitcoin transaction dataset.
Our proposed active learning solution is capable of matching the performance of a fully supervised baseline by using just 5% of the labels.
- Score: 1.7499351967216341
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Every year, criminals launder billions of dollars acquired from serious
felonies (e.g., terrorism, drug smuggling, or human trafficking) harming
countless people and economies. Cryptocurrencies, in particular, have developed
as a haven for money laundering activity. Machine Learning can be used to
detect these illicit patterns. However, labels are so scarce that traditional
supervised algorithms are inapplicable. Here, we address money laundering
detection assuming minimal access to labels. First, we show that existing
state-of-the-art solutions using unsupervised anomaly detection methods are
inadequate to detect the illicit patterns in a real Bitcoin transaction
dataset. Then, we show that our proposed active learning solution is capable of
matching the performance of a fully supervised baseline by using just 5\% of
the labels. This solution mimics a typical real-life situation in which a
limited number of labels can be acquired through manual annotation by experts.
Related papers
- Transaction Fraud Detection via an Adaptive Graph Neural Network [64.9428588496749]
We propose an Adaptive Sampling and Aggregation-based Graph Neural Network (ASA-GNN) that learns discriminative representations to improve the performance of transaction fraud detection.
A neighbor sampling strategy is performed to filter noisy nodes and supplement information for fraudulent nodes.
Experiments on three real financial datasets demonstrate that the proposed method ASA-GNN outperforms state-of-the-art ones.
arXiv Detail & Related papers (2023-07-11T07:48:39Z) - Chainlet Orbits: Topological Address Embedding for the Bitcoin
Blockchain [15.099255988459602]
Rise of cryptocurrencies like Bitcoin, which enable transactions with a degree of pseudonymity, has led to a surge in various illicit activities.
We introduce an effective solution called Chainlet Orbits to embed Bitcoin addresses by leveraging their topological characteristics in transactions.
Our approach enables the use of interpretable and explainable machine learning models in as little as 15 minutes for most days on the Bitcoin transaction network.
arXiv Detail & Related papers (2023-05-18T21:16:59Z) - Blockchain Large Language Models [65.7726590159576]
This paper presents a dynamic, real-time approach to detecting anomalous blockchain transactions.
The proposed tool, BlockGPT, generates tracing representations of blockchain activity and trains from scratch a large language model to act as a real-time Intrusion Detection System.
arXiv Detail & Related papers (2023-04-25T11:56:18Z) - Did You Train on My Dataset? Towards Public Dataset Protection with
Clean-Label Backdoor Watermarking [54.40184736491652]
We propose a backdoor-based watermarking approach that serves as a general framework for safeguarding public-available data.
By inserting a small number of watermarking samples into the dataset, our approach enables the learning model to implicitly learn a secret function set by defenders.
This hidden function can then be used as a watermark to track down third-party models that use the dataset illegally.
arXiv Detail & Related papers (2023-03-20T21:54:30Z) - Catch Me If You Can: Semi-supervised Graph Learning for Spotting Money
Laundering [0.4159343412286401]
Money laundering is a process where criminals use financial services to move illegal money to untraceable destinations.
It is very crucial to identify such activities accurately and reliably in order to enforce an anti-money laundering (AML)
In this paper, we employ semi-supervised graph learning techniques on graphs of financial transactions in order to identify nodes involved in potential money laundering.
arXiv Detail & Related papers (2023-02-23T09:34:19Z) - Inspection-L: Practical GNN-Based Money Laundering Detection System for
Bitcoin [0.0]
This paper proposes Inspection-L, a graph neural network (GNN) framework based on self-supervised Deep Graph Infomax (DGI), with Random Forest (RF) to detect illicit transactions for Anti-Money laundering (AML)
To the best of our knowledge, our proposal is the first of applying self-supervised GNNs to the problem of AML in Bitcoin.
The proposed method has been evaluated on the Elliptic dataset and shows that our approach outperforms the state-of-the-art in terms of key classification metrics.
arXiv Detail & Related papers (2022-03-20T06:19:18Z) - Fighting Money Laundering with Statistics and Machine Learning [95.42181254494287]
There is little scientific literature on statistical and machine learning methods for anti-money laundering.
We propose a unifying terminology with two central elements: (i) client risk profiling and (ii) suspicious behavior flagging.
arXiv Detail & Related papers (2022-01-11T21:31:18Z) - Deep Fraud Detection on Non-attributed Graph [61.636677596161235]
Graph Neural Networks (GNNs) have shown solid performance on fraud detection.
labeled data is scarce in large-scale industrial problems, especially for fraud detection.
We propose a novel graph pre-training strategy to leverage more unlabeled data.
arXiv Detail & Related papers (2021-10-04T03:42:09Z) - GuiltyWalker: Distance to illicit nodes in the Bitcoin network [1.7550798084784973]
We propose new features based on the structure of the graph and past labels to boost the performance of machine learning methods to detect money laundering.
Our method, GuiltyWalker, performs random walks on the bitcoin transaction graph and computes features based on the distance to illicit transactions.
arXiv Detail & Related papers (2021-02-10T10:29:13Z) - Adversarial Attacks on Linear Contextual Bandits [87.08004581867537]
Malicious agents may have incentives to attack the bandit algorithm to induce it to perform a desired behavior.
We show that a malicious agent can force a linear contextual bandit algorithm to pull any desired arm $T - o(T)$ times over a horizon of $T$ steps.
We also investigate the case when a malicious agent is interested in affecting the behavior of the bandit algorithm in a single context.
arXiv Detail & Related papers (2020-02-10T15:04:09Z) - Characterizing and Detecting Money Laundering Activities on the Bitcoin
Network [8.212945859699406]
We explore the landscape of potential money laundering activities occurring across the Bitcoin network.
Using data collected over three years, we create transaction graphs and provide an analysis on various graph characteristics to differentiate money laundering transactions from regular transactions.
We propose and evaluate a set of classifiers based on four types of graph features to classify money laundering and regular transactions.
arXiv Detail & Related papers (2019-12-27T11:34:41Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.