Related papers: Fighting crime with Transformers: Empirical analysis of address parsing methods in payment data

Fighting crime with Transformers: Empirical analysis of address parsing methods in payment data

URL: http://arxiv.org/abs/2404.05632v2
Date: Tue, 9 Apr 2024 09:30:46 GMT
Title: Fighting crime with Transformers: Empirical analysis of address parsing methods in payment data
Authors: Haitham Hammami, Louis Baligand, Bojan Petrovski,
Abstract summary: This paper explores the performance of Transformers and Generative Large Language Models (LLM) We show the need for training robust models capable of dealing with real-world noisy transactional data. Our results suggest that a well fine-tuned Transformer model using early-stopping significantly outperforms other approaches.
Score: 0.01499944454332829
License: http://creativecommons.org/licenses/by-sa/4.0/
Abstract: In the financial industry, identifying the location of parties involved in payments is a major challenge in the context of various regulatory requirements. For this purpose address parsing entails extracting fields such as street, postal code, or country from free text message attributes. While payment processing platforms are updating their standards with more structured formats such as SWIFT with ISO 20022, address parsing remains essential for a considerable volume of messages. With the emergence of Transformers and Generative Large Language Models (LLM), we explore the performance of state-of-the-art solutions given the constraint of processing a vast amount of daily data. This paper also aims to show the need for training robust models capable of dealing with real-world noisy transactional data. Our results suggest that a well fine-tuned Transformer model using early-stopping significantly outperforms other approaches. Nevertheless, generative LLMs demonstrate strong zero-shot performance and warrant further investigations.

Related papers

Better with Less: Small Proprietary Models Surpass Large Language Models in Financial Transaction Understanding [1.4125114383423856]
This paper conducts experiments to evaluate three types of Transformer models: pretrained LLMs, fine-tuned LLMs, and small proprietary models developed from scratch.<n>Our findings highlight the importance of model selection based on domain-specific needs.
arXiv Detail & Related papers (2025-09-30T05:23:08Z)
Check Field Detection Agent (CFD-Agent) using Multimodal Large Language and Vision Language Models [7.836288735110501]
We introduce a novel, training-free framework for automated check field detection.<n>Our approach enables zero-shot detection of check components, significantly lowering the barrier to deployment in real-world financial settings.
arXiv Detail & Related papers (2025-09-22T20:43:59Z)
ByteGen: A Tokenizer-Free Generative Model for Orderbook Events in Byte Space [11.523583937607622]
We introduce ByteGen, a novel generative model that operates directly on the raw byte streams of LOB events.<n>Our work is the complete elimination of feature engineering and tokenization, enabling the model to learn market dynamics from its most fundamental representation.<n>ByteGen successfully reproduces key facts of financial markets, generating realistic price distributions, heavy-tailed returns, and bursty event timing.
arXiv Detail & Related papers (2025-08-04T09:48:42Z)
Your Spending Needs Attention: Modeling Financial Habits with Transformers [2.5960274245156922]
This paper investigates using transformer-based representation learning models for transaction data.<n>We propose a new method enabling the use of SSL with transaction data by adapting transformer-based models to handle both textual and structured attributes.
arXiv Detail & Related papers (2025-07-31T05:56:21Z)
Are You Getting What You Pay For? Auditing Model Substitution in LLM APIs [60.881609323604685]
Large Language Models (LLMs) accessed via black-box APIs introduce a trust challenge. Users pay for services based on advertised model capabilities. providers may covertly substitute the specified model with a cheaper, lower-quality alternative to reduce operational costs. This lack of transparency undermines fairness, erodes trust, and complicates reliable benchmarking.
arXiv Detail & Related papers (2025-04-07T03:57:41Z)
Quantifying Qualitative Insights: Leveraging LLMs to Market Predict [0.0]
This study addresses challenges by leveraging daily reports from securities firms to create high-quality contextual information. The reports are segmented into text-based key factors and combined with numerical data, such as price information, to form context sets. A crafted prompt is designed to assign scores to the key factors, converting qualitative insights into quantitative results.
arXiv Detail & Related papers (2024-11-13T07:45:40Z)
Advancing Anomaly Detection: Non-Semantic Financial Data Encoding with LLMs [49.57641083688934]
We introduce a novel approach to anomaly detection in financial data using Large Language Models (LLMs) embeddings. Our experiments demonstrate that LLMs contribute valuable information to anomaly detection as our models outperform the baselines.
arXiv Detail & Related papers (2024-06-05T20:19:09Z)
Towards a Foundation Purchasing Model: Pretrained Generative Autoregression on Transaction Sequences [0.0]
We present a generative pretraining method that can be used to obtain contextualised embeddings of financial transactions. We additionally perform large-scale pretraining of an embedding model using a corpus of data from 180 issuing banks containing 5.1 billion transactions.
arXiv Detail & Related papers (2024-01-03T09:32:48Z)
Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning [54.682106515794864]
offline reinforcement learning (RL) aims to find a near-optimal policy using pre-collected datasets. This paper introduces $textbfLanguage Models for $textbfMo$tion Control ($textbfLaMo$), a general framework based on Decision Transformers to use pre-trained Language Models (LMs) for offline RL. Empirical results indicate $textbfLaMo$ achieves state-of-the-art performance in sparse-reward tasks.
arXiv Detail & Related papers (2023-10-31T16:24:17Z)
Adapting Large Language Models for Content Moderation: Pitfalls in Data Engineering and Supervised Fine-tuning [79.53130089003986]
Large Language Models (LLMs) have become a feasible solution for handling tasks in various domains. In this paper, we introduce how to fine-tune a LLM model that can be privately deployed for content moderation.
arXiv Detail & Related papers (2023-10-05T09:09:44Z)
Simultaneous Machine Translation with Large Language Models [51.470478122113356]
We investigate the possibility of applying Large Language Models to SimulMT tasks. We conducted experiments using the textttLlama2-7b-chat model on nine different languages from the MUST-C dataset. The results show that LLM outperforms dedicated MT models in terms of BLEU and LAAL metrics.
arXiv Detail & Related papers (2023-09-13T04:06:47Z)
Generative AI for End-to-End Limit Order Book Modelling: A Token-Level Autoregressive Generative Model of Message Flow Using a Deep State Space Network [7.54290390842336]
We propose an end-to-end autoregressive generative model that generates tokenized limit order book (LOB) messages. Using NASDAQ equity LOBs, we develop a custom tokenizer for message data, converting groups of successive digits to tokens. Results show promising performance in approximating the data distribution, as evidenced by low model perplexity.
arXiv Detail & Related papers (2023-08-23T09:37:22Z)
Optimizing Non-Autoregressive Transformers with Contrastive Learning [74.46714706658517]
Non-autoregressive Transformers (NATs) reduce the inference latency of Autoregressive Transformers (ATs) by predicting words all at once rather than in sequential order. In this paper, we propose to ease the difficulty of modality learning via sampling from the model distribution instead of the data distribution.
arXiv Detail & Related papers (2023-05-23T04:20:13Z)
Transformer-based Approaches for Legal Text Processing [3.4630926944621643]
We introduce our approaches using Transformer-based models for different problems of the COLIEE 2021 automatic legal text processing competition. We find that Transformer-based pretrained language models can perform well with automated legal text processing problems with appropriate approaches.
arXiv Detail & Related papers (2022-02-13T19:59:15Z)
Bayesian Transformer Language Models for Speech Recognition [59.235405107295655]
State-of-the-art neural language models (LMs) represented by Transformers are highly complex. This paper proposes a full Bayesian learning framework for Transformer LM estimation.
arXiv Detail & Related papers (2021-02-09T10:55:27Z)

This list is automatically generated from the titles and abstracts of the papers in this site.