On the dynamics of credit history and social interaction features, and
their impact on creditworthiness assessment performance
- URL: http://arxiv.org/abs/2204.06122v1
- Date: Wed, 13 Apr 2022 00:42:27 GMT
- Title: On the dynamics of credit history and social interaction features, and
their impact on creditworthiness assessment performance
- Authors: Ricardo Mu\~noz-Cancino and Cristi\'an Bravo, Sebasti\'an A. R\'ios,
and Manuel Gra\~na
- Abstract summary: This study aims to understand the creditworthiness assessment performance dynamics and how it is influenced by the credit history, repayment behavior, and social network features.
Our research shows that borrowers' history increases performance at a decreasing rate during the first six months and then stabilizes.
The most notable effect on perfomance of social networks features occurs at loan application.
- Score: 3.6748639131154315
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: For more than a half-century, credit risk management has used credit scoring
models in each of its well-defined stages to manage credit risk. Application
scoring is used to decide whether to grant a credit or not, while behavioral
scoring is used mainly for portfolio management and to take preventive actions
in case of default signals. In both cases, network data has recently been shown
to be valuable to increase the predictive power of these models, especially
when the borrower's historical data is scarce or not available. This study aims
to understand the creditworthiness assessment performance dynamics and how it
is influenced by the credit history, repayment behavior, and social network
features. To accomplish this, we introduced a machine learning classification
framework to analyze 97.000 individuals and companies from the moment they
obtained their first loan to 12 months afterward. Our novel and massive dataset
allow us to characterize each borrower according to their credit behavior, and
social and economic relationships. Our research shows that borrowers' history
increases performance at a decreasing rate during the first six months and then
stabilizes. The most notable effect on perfomance of social networks features
occurs at loan application; in personal scoring, this effect prevails a few
months, while in business scoring adds value throughout the study period. These
findings are of great value to improve credit risk management and optimize the
use of traditional information and alternative data sources.
Related papers
- Credit Scores: Performance and Equity [0.0]
We benchmark a widely used credit score against a machine learning model of consumer default.
We find significant misclassification of borrowers, especially those with low scores.
Our model improves predictive accuracy for young, low-income, and minority groups.
arXiv Detail & Related papers (2024-08-30T23:36:02Z) - Empowering Many, Biasing a Few: Generalist Credit Scoring through Large
Language Models [53.620827459684094]
Large Language Models (LLMs) have great potential for credit scoring tasks, with strong generalization ability across multiple tasks.
We propose the first open-source comprehensive framework for exploring LLMs for credit scoring.
We then propose the first Credit and Risk Assessment Large Language Model (CALM) by instruction tuning, tailored to the nuanced demands of various financial risk assessment tasks.
arXiv Detail & Related papers (2023-10-01T03:50:34Z) - Would I have gotten that reward? Long-term credit assignment by
counterfactual contribution analysis [50.926791529605396]
We introduce Counterfactual Contribution Analysis (COCOA), a new family of model-based credit assignment algorithms.
Our algorithms achieve precise credit assignment by measuring the contribution of actions upon obtaining subsequent rewards.
arXiv Detail & Related papers (2023-06-29T09:27:27Z) - Machine Learning Models Evaluation and Feature Importance Analysis on
NPL Dataset [0.0]
We evaluate how different Machine learning models perform on the dataset provided by a private bank in Ethiopia.
XGBoost achieves the highest F1 score on the KMeans SMOTE over-sampled data.
arXiv Detail & Related papers (2022-08-28T17:09:44Z) - On the combination of graph data for assessing thin-file borrowers'
creditworthiness [0.0]
We introduce a framework to improve credit scoring models by blending several Graph Representation Learning methods.
We validated this framework using a unique dataset that characterizes the relationships and credit history for the entire population of a Latin American country.
In Corporate lending, where the gain is much higher, it confirms that evaluating an unbanked company cannot solely consider its features.
arXiv Detail & Related papers (2021-11-26T18:45:23Z) - Feature-Level Fusion of Super-App and Telecommunication Alternative Data
Sources for Credit Card Fraud Detection [106.33204064461802]
We review the effectiveness of a feature-level fusion of super-app customer information, mobile phone line data, and traditional credit risk variables for the early detection of identity theft credit card fraud.
We evaluate our approach over approximately 90,000 users from a credit lender's digital platform database.
arXiv Detail & Related papers (2021-11-05T19:10:35Z) - How Costly is Noise? Data and Disparities in Consumer Credit [0.0]
We show that credit scores are noisier indicators of default risk for historically under-served groups.
We find that equalizing the precision of credit scores can reduce disparities in approval rates and in credit misallocation for disadvantaged groups by approximately half.
arXiv Detail & Related papers (2021-05-17T00:42:26Z) - Explanations of Machine Learning predictions: a mandatory step for its
application to Operational Processes [61.20223338508952]
Credit Risk Modelling plays a paramount role.
Recent machine and deep learning techniques have been applied to the task.
We suggest to use LIME technique to tackle the explainability problem in this field.
arXiv Detail & Related papers (2020-12-30T10:27:59Z) - PCAL: A Privacy-preserving Intelligent Credit Risk Modeling Framework
Based on Adversarial Learning [111.19576084222345]
This paper proposes a framework of Privacy-preserving Credit risk modeling based on Adversarial Learning (PCAL)
PCAL aims to mask the private information inside the original dataset, while maintaining the important utility information for the target prediction task performance.
Results indicate that PCAL can learn an effective, privacy-free representation from user data, providing a solid foundation towards privacy-preserving machine learning for credit risk analysis.
arXiv Detail & Related papers (2020-10-06T07:04:59Z) - Super-App Behavioral Patterns in Credit Risk Models: Financial,
Statistical and Regulatory Implications [110.54266632357673]
We present the impact of alternative data that originates from an app-based marketplace, in contrast to traditional bureau data, upon credit scoring models.
Our results, validated across two countries, show that these new sources of data are particularly useful for predicting financial behavior in low-wealth and young individuals.
arXiv Detail & Related papers (2020-05-09T01:32:03Z) - The Value of Big Data for Credit Scoring: Enhancing Financial Inclusion
using Mobile Phone Data and Social Network Analytics [6.919243767837341]
This paper leverages alternative data sources to enhance both statistical and economic model performance.
A unique combination of datasets, including call-detail records, credit and debit account information of customers is used.
The results have an impact in terms of ethical use of call-detail records, regulatory implications, financial inclusion, as well as data sharing and privacy.
arXiv Detail & Related papers (2020-02-23T16:13:56Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.