Deep Semi-Supervised Anomaly Detection for Finding Fraud in the Futures
Market
- URL: http://arxiv.org/abs/2309.00088v1
- Date: Thu, 31 Aug 2023 19:07:50 GMT
- Title: Deep Semi-Supervised Anomaly Detection for Finding Fraud in the Futures
Market
- Authors: Timothy DeLise
- Abstract summary: This research article aims to evaluate the efficacy of a deep semi-supervised anomaly detection technique, called Deep SAD, for detecting fraud in high-frequency financial data.
We use exclusive proprietary limit order book data from the TMX exchange in Montr'eal, with a small set of true labeled instances of fraud, to evaluate Deep SAD.
We show that incorporating a small amount of labeled data into an unsupervised anomaly detection framework can greatly improve its accuracy.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Modern financial electronic exchanges are an exciting and fast-paced
marketplace where billions of dollars change hands every day. They are also
rife with manipulation and fraud. Detecting such activity is a major
undertaking, which has historically been a job reserved exclusively for humans.
Recently, more research and resources have been focused on automating these
processes via machine learning and artificial intelligence. Fraud detection is
overwhelmingly associated with the greater field of anomaly detection, which is
usually performed via unsupervised learning techniques because of the lack of
labeled data needed for supervised learning. However, a small quantity of
labeled data does often exist. This research article aims to evaluate the
efficacy of a deep semi-supervised anomaly detection technique, called Deep
SAD, for detecting fraud in high-frequency financial data. We use exclusive
proprietary limit order book data from the TMX exchange in Montr\'eal, with a
small set of true labeled instances of fraud, to evaluate Deep SAD against its
unsupervised predecessor. We show that incorporating a small amount of labeled
data into an unsupervised anomaly detection framework can greatly improve its
accuracy.
Related papers
- Advancing Anomaly Detection: Non-Semantic Financial Data Encoding with LLMs [49.57641083688934]
We introduce a novel approach to anomaly detection in financial data using Large Language Models (LLMs) embeddings.
Our experiments demonstrate that LLMs contribute valuable information to anomaly detection as our models outperform the baselines.
arXiv Detail & Related papers (2024-06-05T20:19:09Z) - Securing Transactions: A Hybrid Dependable Ensemble Machine Learning
Model using IHT-LR and Grid Search [2.4374097382908477]
We introduce a state-of-the-art hybrid ensemble (ENS) Machine learning (ML) model that intelligently combines multiple algorithms to enhance fraud identification.
Our experiments are conducted on a publicly available credit card dataset comprising 284,807 transactions.
The proposed model achieves impressive accuracy rates of 99.66%, 99.73%, 98.56%, and 99.79%, and a perfect 100% for the DT, RF, KNN, and ENS models, respectively.
arXiv Detail & Related papers (2024-02-22T09:01:42Z) - Weakly Supervised Anomaly Detection via Knowledge-Data Alignment [24.125871437370357]
Anomaly detection plays a pivotal role in numerous web-based applications, including malware detection, anti-money laundering, device failure detection, and network fault analysis.
Weakly Supervised Anomaly Detection (WSAD) has been introduced with a limited number of labeled anomaly samples to enhance model performance.
We introduce a novel framework Knowledge-Data Alignment (KDAlign) to integrate rule knowledge, typically summarized by human experts, to supplement the limited labeled data.
arXiv Detail & Related papers (2024-02-06T07:57:13Z) - Progressing from Anomaly Detection to Automated Log Labeling and
Pioneering Root Cause Analysis [53.24804865821692]
This study introduces a taxonomy for log anomalies and explores automated data labeling to mitigate labeling challenges.
The study envisions a future where root cause analysis follows anomaly detection, unraveling the underlying triggers of anomalies.
arXiv Detail & Related papers (2023-12-22T15:04:20Z) - Weakly Supervised Anomaly Detection: A Survey [75.26180038443462]
Anomaly detection (AD) is a crucial task in machine learning with various applications.
We present the first comprehensive survey of weakly supervised anomaly detection (WSAD) methods.
For each setting, we provide formal definitions, key algorithms, and potential future directions.
arXiv Detail & Related papers (2023-02-09T10:27:21Z) - PULL: Reactive Log Anomaly Detection Based On Iterative PU Learning [58.85063149619348]
We propose PULL, an iterative log analysis method for reactive anomaly detection based on estimated failure time windows.
Our evaluation shows that PULL consistently outperforms ten benchmark baselines across three different datasets.
arXiv Detail & Related papers (2023-01-25T16:34:43Z) - Challenges and Complexities in Machine Learning based Credit Card Fraud
Detection [0.0]
Volume of transactions, uniqueness of frauds and ingenuity of the fraudster are main challenges in detecting frauds.
The advent of machine learning, artificial intelligence and big data has opened up new tools in the fight against frauds.
However, the developments in fraud detection algorithms has been challenging and slow due to the massively unbalanced nature of fraud data.
arXiv Detail & Related papers (2022-08-20T07:53:51Z) - Anomaly Detection via Self-organizing Map [52.542991004752]
Anomaly detection plays a key role in industrial manufacturing for product quality control.
Traditional methods for anomaly detection are rule-based with limited generalization ability.
Recent methods based on supervised deep learning are more powerful but require large-scale annotated datasets for training.
arXiv Detail & Related papers (2021-07-21T06:56:57Z) - Anomaly Detection Based on Selection and Weighting in Latent Space [73.01328671569759]
We propose a novel selection-and-weighting-based anomaly detection framework called SWAD.
Experiments on both benchmark and real-world datasets have shown the effectiveness and superiority of SWAD.
arXiv Detail & Related papers (2021-03-08T10:56:38Z) - Leaking Sensitive Financial Accounting Data in Plain Sight using Deep
Autoencoder Neural Networks [1.9659095632676094]
We introduce a real-world threat model' designed to leak sensitive accounting data.
We show that a deep steganographic process, constituted by three neural networks, can be trained to hide such data in unobtrusive day-to-day' images.
arXiv Detail & Related papers (2020-12-13T17:29:53Z) - Unsupervised anomaly detection for discrete sequence healthcare data [1.2667973028134798]
We propose a machine learning model that automates fraud detection in an unsupervised way.
Two deep learning approaches include LSTM neural network for prediction next patient visit and a seq2seq model.
We use real data on sequences of patients' visits data from Allianz company for the validation.
arXiv Detail & Related papers (2020-07-20T13:42:21Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.