Fighting Money Laundering with Statistics and Machine Learning
- URL: http://arxiv.org/abs/2201.04207v5
- Date: Tue, 21 Mar 2023 07:13:27 GMT
- Title: Fighting Money Laundering with Statistics and Machine Learning
- Authors: Rasmus Jensen and Alexandros Iosifidis
- Abstract summary: There is little scientific literature on statistical and machine learning methods for anti-money laundering.
We propose a unifying terminology with two central elements: (i) client risk profiling and (ii) suspicious behavior flagging.
- Score: 95.42181254494287
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Money laundering is a profound global problem. Nonetheless, there is little
scientific literature on statistical and machine learning methods for
anti-money laundering. In this paper, we focus on anti-money laundering in
banks and provide an introduction and review of the literature. We propose a
unifying terminology with two central elements: (i) client risk profiling and
(ii) suspicious behavior flagging. We find that client risk profiling is
characterized by diagnostics, i.e., efforts to find and explain risk factors.
On the other hand, suspicious behavior flagging is characterized by
non-disclosed features and hand-crafted risk indices. Finally, we discuss
directions for future research. One major challenge is the need for more public
data sets. This may potentially be addressed by synthetic data generation.
Other possible research directions include semi-supervised and deep learning,
interpretability, and fairness of the results.
Related papers
- Network Analytics for Anti-Money Laundering -- A Systematic Literature Review and Experimental Evaluation [1.7119723306387908]
This paper presents an extensive and systematic review of the literature on network analytics (NA) for anti-money laundering (AML)
We identify and analyse 97 papers in the Web of Science and Scopus databases, resulting in a taxonomy of approaches following the fraud analytics framework of Bockel-Rickermann et al.
The framework is applied on the publicly available Elliptic data set and implements manual feature engineering, random walk-based methods, and deep learning GNNs.
arXiv Detail & Related papers (2024-05-29T08:48:52Z) - Language Models Can Reduce Asymmetry in Information Markets [100.38786498942702]
We introduce an open-source simulated digital marketplace where intelligent agents, powered by language models, buy and sell information on behalf of external participants.
The central mechanism enabling this marketplace is the agents' dual capabilities: they have the capacity to assess the quality of privileged information but also come equipped with the ability to forget.
To perform well, agents must make rational decisions, strategically explore the marketplace through generated sub-queries, and synthesize answers from purchased information.
arXiv Detail & Related papers (2024-03-21T14:48:37Z) - AlphaFin: Benchmarking Financial Analysis with Retrieval-Augmented Stock-Chain Framework [48.3060010653088]
We release AlphaFin datasets, combining traditional research datasets, real-time financial data, and handwritten chain-of-thought (CoT) data.
We then use AlphaFin datasets to benchmark a state-of-the-art method, called Stock-Chain, for effectively tackling the financial analysis task.
arXiv Detail & Related papers (2024-03-19T09:45:33Z) - On the Societal Impact of Open Foundation Models [93.67389739906561]
We focus on open foundation models, defined here as those with broadly available model weights.
We identify five distinctive properties of open foundation models that lead to both their benefits and risks.
arXiv Detail & Related papers (2024-02-27T16:49:53Z) - The Double-Edged Sword of Big Data and Information Technology for the
Disadvantaged: A Cautionary Tale from Open Banking [0.3867363075280543]
Open Banking has ignited a revolution in financial services, opening new opportunities for customer acquisition, management, retention, and risk assessment.
We investigate the dimensions of financial vulnerability (FV), a global concern resulting from COVID-19 and rising inflation.
Using a unique dataset from a UK FinTech lender, we demonstrate the power of fine-grained transaction data while simultaneously cautioning its safe usage.
arXiv Detail & Related papers (2023-07-25T11:07:43Z) - Catch Me If You Can: Semi-supervised Graph Learning for Spotting Money
Laundering [0.4159343412286401]
Money laundering is a process where criminals use financial services to move illegal money to untraceable destinations.
It is very crucial to identify such activities accurately and reliably in order to enforce an anti-money laundering (AML)
In this paper, we employ semi-supervised graph learning techniques on graphs of financial transactions in order to identify nodes involved in potential money laundering.
arXiv Detail & Related papers (2023-02-23T09:34:19Z) - Weakly Supervised Anomaly Detection: A Survey [75.26180038443462]
Anomaly detection (AD) is a crucial task in machine learning with various applications.
We present the first comprehensive survey of weakly supervised anomaly detection (WSAD) methods.
For each setting, we provide formal definitions, key algorithms, and potential future directions.
arXiv Detail & Related papers (2023-02-09T10:27:21Z) - Anti-Money Laundering Alert Optimization Using Machine Learning with
Graphs [0.769672852567215]
Money laundering is a global problem that concerns legitimizing proceeds from serious felonies (1.7-4 trillion euros annually)
We propose a machine learning triage model, which complements the rule-based system and learns to predict the risk of an alert accurately.
We validate our model on a real-world banking dataset and show how the triage model can reduce the number of false positives by 80% while detecting over 90% of true positives.
arXiv Detail & Related papers (2021-12-14T16:12:30Z) - Adverse Media Mining for KYC and ESG Compliance [2.381399746981591]
Adverse media or negative news screening is crucial for the identification of such non-financial risks.
We present an automated system to conduct both real-time and batch search of adverse media for users' queries.
Our scalable, machine-learning driven approach to high-precision, adverse news filtering is based on four perspectives.
arXiv Detail & Related papers (2021-10-22T01:04:16Z) - Exploration and Incentives in Reinforcement Learning [107.42240386544633]
We consider complex exploration problems, where each agent faces the same (but unknown) MDP.
Agents control the choice of policies, whereas an algorithm can only issue recommendations.
We design an algorithm which explores all reachable states in the MDP.
arXiv Detail & Related papers (2021-02-28T00:15:53Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.