A Novel Classification Approach for Credit Scoring based on Gaussian
Mixture Models
- URL: http://arxiv.org/abs/2010.13388v1
- Date: Mon, 26 Oct 2020 07:34:27 GMT
- Title: A Novel Classification Approach for Credit Scoring based on Gaussian
Mixture Models
- Authors: Hamidreza Arian, Seyed Mohammad Sina Seyfi, Azin Sharifi
- Abstract summary: This paper introduces a new method for credit scoring based on Gaussian Mixture Models.
Our algorithm classifies consumers into groups which are labeled as positive or negative.
We apply our model with real world databases from Australia, Japan, and Germany.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Credit scoring is a rapidly expanding analytical technique used by banks and
other financial institutions. Academic studies on credit scoring provide a
range of classification techniques used to differentiate between good and bad
borrowers. The main contribution of this paper is to introduce a new method for
credit scoring based on Gaussian Mixture Models. Our algorithm classifies
consumers into groups which are labeled as positive or negative. Labels are
estimated according to the probability associated with each class. We apply our
model with real world databases from Australia, Japan, and Germany. Numerical
results show that not only our model's performance is comparable to others, but
also its flexibility avoids over-fitting even in the absence of standard cross
validation techniques. The framework developed by this paper can provide a
computationally efficient and powerful tool for assessment of consumer default
risk in related financial institutions.
Related papers
- Enhanced Credit Score Prediction Using Ensemble Deep Learning Model [12.85570952381681]
This paper combines high-performance models like XGBoost and LightGBM, already widely used in modern banking systems, with the powerful TabNet model.
We have developed a potent model capable of accurately determining credit score levels by integrating Random Forest, XGBoost, and TabNet, and through the stacking technique in ensemble modeling.
arXiv Detail & Related papers (2024-09-30T21:56:16Z) - Empowering Many, Biasing a Few: Generalist Credit Scoring through Large
Language Models [53.620827459684094]
Large Language Models (LLMs) have great potential for credit scoring tasks, with strong generalization ability across multiple tasks.
We propose the first open-source comprehensive framework for exploring LLMs for credit scoring.
We then propose the first Credit and Risk Assessment Large Language Model (CALM) by instruction tuning, tailored to the nuanced demands of various financial risk assessment tasks.
arXiv Detail & Related papers (2023-10-01T03:50:34Z) - Inclusive FinTech Lending via Contrastive Learning and Domain Adaptation [9.75150920742607]
FinTech lending has played a significant role in facilitating financial inclusion.
There are concerns about the potentially biased algorithmic decision-making during loan screening.
We propose a new Transformer-based sequential loan screening model with self-supervised contrastive learning and domain adaptation.
arXiv Detail & Related papers (2023-05-10T01:11:35Z) - GREAT Score: Global Robustness Evaluation of Adversarial Perturbation using Generative Models [60.48306899271866]
We present a new framework, called GREAT Score, for global robustness evaluation of adversarial perturbation using generative models.
We show high correlation and significantly reduced cost of GREAT Score when compared to the attack-based model ranking on RobustBench.
GREAT Score can be used for remote auditing of privacy-sensitive black-box models.
arXiv Detail & Related papers (2023-04-19T14:58:27Z) - Federated Learning Aggregation: New Robust Algorithms with Guarantees [63.96013144017572]
Federated learning has been recently proposed for distributed model training at the edge.
This paper presents a complete general mathematical convergence analysis to evaluate aggregation strategies in a federated learning framework.
We derive novel aggregation algorithms which are able to modify their model architecture by differentiating client contributions according to the value of their losses.
arXiv Detail & Related papers (2022-05-22T16:37:53Z) - On the combination of graph data for assessing thin-file borrowers'
creditworthiness [0.0]
We introduce a framework to improve credit scoring models by blending several Graph Representation Learning methods.
We validated this framework using a unique dataset that characterizes the relationships and credit history for the entire population of a Latin American country.
In Corporate lending, where the gain is much higher, it confirms that evaluating an unbanked company cannot solely consider its features.
arXiv Detail & Related papers (2021-11-26T18:45:23Z) - Bagging Supervised Autoencoder Classifier for Credit Scoring [3.5977219275318166]
The imbalanced nature of credit scoring datasets, as well as the heterogeneous nature of features in credit scoring datasets, pose difficulties in developing and implementing effective credit scoring models.
We propose the Bagging Supervised Autoencoder (BSAC) that mainly leverages the superior performance of the Supervised Autoencoder.
BSAC also addresses the data imbalance problem by employing a variant of the Bagging process based on the undersampling of the majority class.
arXiv Detail & Related papers (2021-08-12T17:49:08Z) - Robustness Gym: Unifying the NLP Evaluation Landscape [91.80175115162218]
Deep neural networks are often brittle when deployed in real-world systems.
Recent research has focused on testing the robustness of such models.
We propose a solution in the form of Robustness Gym, a simple and evaluation toolkit.
arXiv Detail & Related papers (2021-01-13T02:37:54Z) - Dynamic Ensemble Learning for Credit Scoring: A Comparative Study [3.6503610360564687]
This study attempts to benchmark different dynamic selection approaches for ensemble learning models to accurately estimate the credit scoring task on a large and high-dimensional real-life credit scoring data set.
The results of this study indicate that dynamic selection techniques are able to boost the performance of ensemble models, especially in imbalanced training environments.
arXiv Detail & Related papers (2020-10-18T07:06:02Z) - Provable tradeoffs in adversarially robust classification [96.48180210364893]
We develop and leverage new tools, including recent breakthroughs from probability theory on robust isoperimetry.
Our results reveal fundamental tradeoffs between standard and robust accuracy that grow when data is imbalanced.
arXiv Detail & Related papers (2020-06-09T09:58:19Z) - Super-App Behavioral Patterns in Credit Risk Models: Financial,
Statistical and Regulatory Implications [110.54266632357673]
We present the impact of alternative data that originates from an app-based marketplace, in contrast to traditional bureau data, upon credit scoring models.
Our results, validated across two countries, show that these new sources of data are particularly useful for predicting financial behavior in low-wealth and young individuals.
arXiv Detail & Related papers (2020-05-09T01:32:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.