Securing Transactions: A Hybrid Dependable Ensemble Machine Learning
Model using IHT-LR and Grid Search
- URL: http://arxiv.org/abs/2402.14389v1
- Date: Thu, 22 Feb 2024 09:01:42 GMT
- Title: Securing Transactions: A Hybrid Dependable Ensemble Machine Learning
Model using IHT-LR and Grid Search
- Authors: Md. Alamin Talukder, Rakib Hossen, Md Ashraf Uddin, Mohammed Nasir
Uddin and Uzzal Kumar Acharjee
- Abstract summary: We introduce a state-of-the-art hybrid ensemble (ENS) Machine learning (ML) model that intelligently combines multiple algorithms to enhance fraud identification.
Our experiments are conducted on a publicly available credit card dataset comprising 284,807 transactions.
The proposed model achieves impressive accuracy rates of 99.66%, 99.73%, 98.56%, and 99.79%, and a perfect 100% for the DT, RF, KNN, and ENS models, respectively.
- Score: 2.4374097382908477
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Financial institutions and businesses face an ongoing challenge from
fraudulent transactions, prompting the need for effective detection methods.
Detecting credit card fraud is crucial for identifying and preventing
unauthorized transactions.Timely detection of fraud enables investigators to
take swift actions to mitigate further losses. However, the investigation
process is often time-consuming, limiting the number of alerts that can be
thoroughly examined each day. Therefore, the primary objective of a fraud
detection model is to provide accurate alerts while minimizing false alarms and
missed fraud cases. In this paper, we introduce a state-of-the-art hybrid
ensemble (ENS) dependable Machine learning (ML) model that intelligently
combines multiple algorithms with proper weighted optimization using Grid
search, including Decision Tree (DT), Random Forest (RF), K-Nearest Neighbor
(KNN), and Multilayer Perceptron (MLP), to enhance fraud identification. To
address the data imbalance issue, we employ the Instant Hardness Threshold
(IHT) technique in conjunction with Logistic Regression (LR), surpassing
conventional approaches. Our experiments are conducted on a publicly available
credit card dataset comprising 284,807 transactions. The proposed model
achieves impressive accuracy rates of 99.66%, 99.73%, 98.56%, and 99.79%, and a
perfect 100% for the DT, RF, KNN, MLP and ENS models, respectively. The hybrid
ensemble model outperforms existing works, establishing a new benchmark for
detecting fraudulent transactions in high-frequency scenarios. The results
highlight the effectiveness and reliability of our approach, demonstrating
superior performance metrics and showcasing its exceptional potential for
real-world fraud detection applications.
Related papers
- Advancing Anomaly Detection: Non-Semantic Financial Data Encoding with LLMs [49.57641083688934]
We introduce a novel approach to anomaly detection in financial data using Large Language Models (LLMs) embeddings.
Our experiments demonstrate that LLMs contribute valuable information to anomaly detection as our models outperform the baselines.
arXiv Detail & Related papers (2024-06-05T20:19:09Z) - Enhancing Credit Card Fraud Detection A Neural Network and SMOTE Integrated Approach [4.341096233663623]
This research proposes an innovative methodology combining Neural Networks (NN) and Synthet ic Minority Over-sampling Technique (SMOTE) to enhance the detection performance.
The study addresses the inherent imbalance in credit card transaction data, focusing on technical advancements for robust and precise fraud detection.
arXiv Detail & Related papers (2024-02-27T02:26:04Z) - Transaction Fraud Detection via an Adaptive Graph Neural Network [64.9428588496749]
We propose an Adaptive Sampling and Aggregation-based Graph Neural Network (ASA-GNN) that learns discriminative representations to improve the performance of transaction fraud detection.
A neighbor sampling strategy is performed to filter noisy nodes and supplement information for fraudulent nodes.
Experiments on three real financial datasets demonstrate that the proposed method ASA-GNN outperforms state-of-the-art ones.
arXiv Detail & Related papers (2023-07-11T07:48:39Z) - Credit Card Fraud Detection Using Enhanced Random Forest Classifier for
Imbalanced Data [0.8223798883838329]
This paper implements the random forest (RF) algorithm to solve the issue in the hand.
A dataset of credit card transactions was used in this study.
arXiv Detail & Related papers (2023-03-11T22:59:37Z) - Fraud Detection Using Optimized Machine Learning Tools Under Imbalance
Classes [0.304585143845864]
Fraud detection with smart versions of machine learning (ML) tools is essential to assure safety.
We investigate four state-of-the-art ML techniques, namely, logistic regression, decision trees, random forest, and extreme gradient boost.
For phishing website URLs and credit card fraud transaction datasets, the results indicate that extreme gradient boost trained on the original data shows trustworthy performance.
arXiv Detail & Related papers (2022-09-04T15:30:23Z) - Should All Proposals be Treated Equally in Object Detection? [110.27485090952385]
The complexity-precision trade-off of an object detector is a critical problem for resource constrained vision tasks.
It is hypothesized that improved detection efficiency requires a paradigm shift, towards the unequal processing of proposals.
This results in better utilization of available computational budget, enabling higher accuracy for the same FLOPS.
arXiv Detail & Related papers (2022-07-07T18:26:32Z) - Leveraging Unlabeled Data to Predict Out-of-Distribution Performance [63.740181251997306]
Real-world machine learning deployments are characterized by mismatches between the source (training) and target (test) distributions.
In this work, we investigate methods for predicting the target domain accuracy using only labeled source data and unlabeled target data.
We propose Average Thresholded Confidence (ATC), a practical method that learns a threshold on the model's confidence, predicting accuracy as the fraction of unlabeled examples.
arXiv Detail & Related papers (2022-01-11T23:01:12Z) - Bayesian Optimization with Machine Learning Algorithms Towards Anomaly
Detection [66.05992706105224]
In this paper, an effective anomaly detection framework is proposed utilizing Bayesian Optimization technique.
The performance of the considered algorithms is evaluated using the ISCX 2012 dataset.
Experimental results show the effectiveness of the proposed framework in term of accuracy rate, precision, low-false alarm rate, and recall.
arXiv Detail & Related papers (2020-08-05T19:29:35Z) - SADet: Learning An Efficient and Accurate Pedestrian Detector [68.66857832440897]
This paper proposes a series of systematic optimization strategies for the detection pipeline of one-stage detector.
It forms a single shot anchor-based detector (SADet) for efficient and accurate pedestrian detection.
Though structurally simple, it presents state-of-the-art result and real-time speed of $20$ FPS for VGA-resolution images.
arXiv Detail & Related papers (2020-07-26T12:32:38Z) - Towards Credit-Fraud Detection via Sparsely Varying Gaussian
Approximations [0.0]
We propose a credit card fraud detection concept incorporating the uncertainty in our prediction system to ensure better judgment in such a crucial task.
We perform the same with different sets of kernels and the different number of inducing data points to show the best accuracy was obtained.
arXiv Detail & Related papers (2020-07-14T16:56:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.