Enabling Machine Learning Algorithms for Credit Scoring -- Explainable
Artificial Intelligence (XAI) methods for clear understanding complex
predictive models
- URL: http://arxiv.org/abs/2104.06735v1
- Date: Wed, 14 Apr 2021 09:44:04 GMT
- Title: Enabling Machine Learning Algorithms for Credit Scoring -- Explainable
Artificial Intelligence (XAI) methods for clear understanding complex
predictive models
- Authors: Przemys{\l}aw Biecek, Marcin Chlebus, Janusz Gajda, Alicja Gosiewska,
Anna Kozak, Dominik Ogonowski, Jakub Sztachelski, Piotr Wojewnik
- Abstract summary: This paper compares various predictive models (logistic regression, logistic regression with weight of evidence transformations and modern artificial intelligence algorithms) and show that advanced tree based models give best results in prediction of client default.
We also show how to boost advanced models using techniques which allow to interpret them and made them more accessible for credit risk practitioners.
- Score: 2.1723750239223034
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Rapid development of advanced modelling techniques gives an opportunity to
develop tools that are more and more accurate. However as usually, everything
comes with a price and in this case, the price to pay is to loose
interpretability of a model while gaining on its accuracy and precision. For
managers to control and effectively manage credit risk and for regulators to be
convinced with model quality the price to pay is too high. In this paper, we
show how to take credit scoring analytics in to the next level, namely we
present comparison of various predictive models (logistic regression, logistic
regression with weight of evidence transformations and modern artificial
intelligence algorithms) and show that advanced tree based models give best
results in prediction of client default. What is even more important and
valuable we also show how to boost advanced models using techniques which allow
to interpret them and made them more accessible for credit risk practitioners,
resolving the crucial obstacle in widespread deployment of more complex, 'black
box' models like random forests, gradient boosted or extreme gradient boosted
trees. All this will be shown on the large dataset obtained from the Polish
Credit Bureau to which all the banks and most of the lending companies in the
country do report the credit files. In this paper the data from lending
companies were used. The paper then compares state of the art best practices in
credit risk modelling with new advanced modern statistical tools boosted by the
latest developments in the field of interpretability and explainability of
artificial intelligence algorithms. We believe that this is a valuable
contribution when it comes to presentation of different modelling tools but
what is even more important it is showing which methods might be used to get
insight and understanding of AI methods in credit risk context.
Related papers
- Enhanced Credit Score Prediction Using Ensemble Deep Learning Model [12.85570952381681]
This paper combines high-performance models like XGBoost and LightGBM, already widely used in modern banking systems, with the powerful TabNet model.
We have developed a potent model capable of accurately determining credit score levels by integrating Random Forest, XGBoost, and TabNet, and through the stacking technique in ensemble modeling.
arXiv Detail & Related papers (2024-09-30T21:56:16Z) - Knowledge-Aware Parsimony Learning: A Perspective from Relational Graphs [47.6830995661091]
We develop next-generation models in a parsimonious manner, achieving greater potential with simpler models.
The key is to drive models using domain-specific knowledge, such as symbols, logic, and formulas, instead of relying on the scaling law.
This approach allows us to build a framework that uses this knowledge as "building blocks" to achieve parsimony in model design, training, and interpretation.
arXiv Detail & Related papers (2024-06-29T15:52:37Z) - QualEval: Qualitative Evaluation for Model Improvement [82.73561470966658]
We propose QualEval, which augments quantitative scalar metrics with automated qualitative evaluation as a vehicle for model improvement.
QualEval uses a powerful LLM reasoner and our novel flexible linear programming solver to generate human-readable insights.
We demonstrate that leveraging its insights, for example, improves the absolute performance of the Llama 2 model by up to 15% points relative.
arXiv Detail & Related papers (2023-11-06T00:21:44Z) - Beyond Explaining: Opportunities and Challenges of XAI-Based Model
Improvement [75.00655434905417]
Explainable Artificial Intelligence (XAI) is an emerging research field bringing transparency to highly complex machine learning (ML) models.
This paper offers a comprehensive overview over techniques that apply XAI practically for improving various properties of ML models.
We show empirically through experiments on toy and realistic settings how explanations can help improve properties such as model generalization ability or reasoning.
arXiv Detail & Related papers (2022-03-15T15:44:28Z) - Predicting Credit Risk for Unsecured Lending: A Machine Learning
Approach [0.0]
This research paper is to build a contemporary credit scoring model to forecast credit defaults for unsecured lending (credit cards)
Our research indicates that the Light Gradient Boosting Machine (LGBM) model is better equipped to deliver higher learning speeds, better efficiencies and manage larger data volumes.
We expect that deployment of this model will enable better and timely prediction of credit defaults for decision-makers in commercial lending institutions and banks.
arXiv Detail & Related papers (2021-10-05T17:54:56Z) - Federated Learning with Unreliable Clients: Performance Analysis and
Mechanism Design [76.29738151117583]
Federated Learning (FL) has become a promising tool for training effective machine learning models among distributed clients.
However, low quality models could be uploaded to the aggregator server by unreliable clients, leading to a degradation or even a collapse of training.
We model these unreliable behaviors of clients and propose a defensive mechanism to mitigate such a security risk.
arXiv Detail & Related papers (2021-05-10T08:02:27Z) - Beyond Trivial Counterfactual Explanations with Diverse Valuable
Explanations [64.85696493596821]
In computer vision applications, generative counterfactual methods indicate how to perturb a model's input to change its prediction.
We propose a counterfactual method that learns a perturbation in a disentangled latent space that is constrained using a diversity-enforcing loss.
Our model improves the success rate of producing high-quality valuable explanations when compared to previous state-of-the-art methods.
arXiv Detail & Related papers (2021-03-18T12:57:34Z) - Explaining Credit Risk Scoring through Feature Contribution Alignment
with Expert Risk Analysts [1.7778609937758323]
We focus on companies credit scoring and we benchmark different machine learning models.
The aim is to build a model to predict whether a company will experience financial problems in a given time horizon.
We bring light by providing an expert-aligned feature relevance score highlighting the disagreement between a credit risk expert and a model feature attribution explanation.
arXiv Detail & Related papers (2021-03-15T12:59:15Z) - Explanations of Machine Learning predictions: a mandatory step for its
application to Operational Processes [61.20223338508952]
Credit Risk Modelling plays a paramount role.
Recent machine and deep learning techniques have been applied to the task.
We suggest to use LIME technique to tackle the explainability problem in this field.
arXiv Detail & Related papers (2020-12-30T10:27:59Z) - Explainable AI for Interpretable Credit Scoring [0.8379286663107844]
Credit scoring helps financial experts make better decisions regarding whether or not to accept a loan application.
Regulations have added the need for model interpretability to ensure that algorithmic decisions are understandable coherent.
We present a credit scoring model that is both accurate and interpretable.
arXiv Detail & Related papers (2020-12-03T18:44:03Z) - Transparency, Auditability and eXplainability of Machine Learning Models
in Credit Scoring [4.370097023410272]
This paper works out different dimensions that have to be considered for making credit scoring models understandable.
We present an overview of techniques, demonstrate how they can be applied in credit scoring and how results compare to the interpretability of score cards.
arXiv Detail & Related papers (2020-09-28T15:00:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.