AMLgentex: Mobilizing Data-Driven Research to Combat Money Laundering
- URL: http://arxiv.org/abs/2506.13989v1
- Date: Tue, 03 Jun 2025 15:28:09 GMT
- Title: AMLgentex: Mobilizing Data-Driven Research to Combat Money Laundering
- Authors: Johan Östman, Edvin Callisen, Anton Chen, Kristiina Ausmees, Emanuel Gårdh, Jovan Zamac, Jolanta Goldsteine, Hugo Wefer, Simon Whelan, Markus Reimegård,
- Abstract summary: Money laundering enables organized crime by allowing illicit funds to enter the legitimate economy.<n>We present AMLGentex, an open-source suite for generating realistic, benchmarking transaction data and detection methods.<n>It enables systematic evaluation of anti-money laundering systems in a controlled environment that captures key real-world challenges.
- Score: 1.8200934978381271
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Money laundering enables organized crime by allowing illicit funds to enter the legitimate economy. Although trillions of dollars are laundered each year, only a small fraction is ever uncovered. This stems from a range of factors, including deliberate evasion by launderers, the rarity of confirmed cases, and the limited visibility each financial institution has into the global transaction network. While several synthetic datasets are available, they fail to model the structural and behavioral complexity of real-world money laundering. In particular, they often overlook partial observability, sparse and uncertain labels, strategic behavior, temporal dynamics, class imbalance, and network-level dependencies. To address these limitations, we present AMLGentex, an open-source suite for generating realistic, configurable transaction data and benchmarking detection methods. It enables systematic evaluation of anti-money laundering (AML) systems in a controlled environment that captures key real-world challenges. We demonstrate how the framework can be used to rigorously evaluate methods under conditions that reflect the complexity of practical AML scenarios.
Related papers
- Are You Getting What You Pay For? Auditing Model Substitution in LLM APIs [60.881609323604685]
Large Language Models (LLMs) accessed via black-box APIs introduce a trust challenge.<n>Users pay for services based on advertised model capabilities.<n> providers may covertly substitute the specified model with a cheaper, lower-quality alternative to reduce operational costs.<n>This lack of transparency undermines fairness, erodes trust, and complicates reliable benchmarking.
arXiv Detail & Related papers (2025-04-07T03:57:41Z) - Deep Learning Approaches for Anti-Money Laundering on Mobile Transactions: Review, Framework, and Directions [51.43521977132062]
Money laundering is a financial crime that obscures the origin of illicit funds.<n>The proliferation of mobile payment platforms and smart IoT devices has significantly complicated anti-money laundering investigations.<n>This paper conducts a comprehensive review of deep learning solutions and the challenges associated with their use in AML.
arXiv Detail & Related papers (2025-03-13T05:19:44Z) - FinTSB: A Comprehensive and Practical Benchmark for Financial Time Series Forecasting [58.70072722290475]
Financial time series (FinTS) record the behavior of human-brain-augmented decision-making.<n>FinTSB is a comprehensive and practical benchmark for financial time series forecasting.
arXiv Detail & Related papers (2025-02-26T05:19:16Z) - Beyond Static Datasets: A Behavior-Driven Entity-Specific Simulation to Overcome Data Scarcity and Train Effective Crypto Anti-Money Laundering Models [0.23020018305241333]
Money laundering is a key crime to be mitigated to also suspend the movement of funds from other illicit activities.<n>It is getting extremely difficult to identify money laundering in crypto transactions owing to many layering strategies available today.<n>In this paper, we propose behavior embedded entity-specific money laundering-like transaction simulation.
arXiv Detail & Related papers (2025-01-01T06:58:05Z) - Advancing Anomaly Detection: Non-Semantic Financial Data Encoding with LLMs [49.57641083688934]
We introduce a novel approach to anomaly detection in financial data using Large Language Models (LLMs) embeddings.
Our experiments demonstrate that LLMs contribute valuable information to anomaly detection as our models outperform the baselines.
arXiv Detail & Related papers (2024-06-05T20:19:09Z) - Starlit: Privacy-Preserving Federated Learning to Enhance Financial
Fraud Detection [2.436659710491562]
Federated Learning (FL) is a data-minimization approach enabling collaborative model training across diverse clients with local data.
State-of-the-art FL solutions to identify fraudulent financial transactions exhibit a subset of the following limitations.
We introduce Starlit, a novel scalable privacy-preserving FL mechanism that overcomes these limitations.
arXiv Detail & Related papers (2024-01-19T15:37:11Z) - Realistic Synthetic Financial Transactions for Anti-Money Laundering
Models [2.3802629107286046]
Money laundering is the movement of illicit funds to conceal their origins.
The UN estimates 2-5% of global GDP or $0.8 - $2.0 trillion dollars are laundered globally each year.
This paper contributes a synthetic financial transaction dataset generator and a set of synthetically generated AML datasets.
arXiv Detail & Related papers (2023-06-22T10:32:51Z) - A Robustness Analysis of Blind Source Separation [91.3755431537592]
Blind source separation (BSS) aims to recover an unobserved signal from its mixture $X=f(S)$ under the condition that the transformation $f$ is invertible but unknown.
We present a general framework for analysing such violations and quantifying their impact on the blind recovery of $S$ from $X$.
We show that a generic BSS-solution in response to general deviations from its defining structural assumptions can be profitably analysed in the form of explicit continuity guarantees.
arXiv Detail & Related papers (2023-03-17T16:30:51Z) - Catch Me If You Can: Semi-supervised Graph Learning for Spotting Money
Laundering [0.4159343412286401]
Money laundering is a process where criminals use financial services to move illegal money to untraceable destinations.
It is very crucial to identify such activities accurately and reliably in order to enforce an anti-money laundering (AML)
In this paper, we employ semi-supervised graph learning techniques on graphs of financial transactions in order to identify nodes involved in potential money laundering.
arXiv Detail & Related papers (2023-02-23T09:34:19Z) - Fighting Money Laundering with Statistics and Machine Learning [95.42181254494287]
There is little scientific literature on statistical and machine learning methods for anti-money laundering.
We propose a unifying terminology with two central elements: (i) client risk profiling and (ii) suspicious behavior flagging.
arXiv Detail & Related papers (2022-01-11T21:31:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.