Determining Secondary Attributes for Credit Evaluation in P2P Lending
- URL: http://arxiv.org/abs/2006.13921v1
- Date: Mon, 8 Jun 2020 16:12:00 GMT
- Title: Determining Secondary Attributes for Credit Evaluation in P2P Lending
- Authors: Revathi Bhuvaneswari, Antonio Segalini
- Abstract summary: We utilize machine learning classification and clustering algorithms to accurately predict a borrower's creditworthiness.
We achieved 65% F1 and 73% AUC on the LendingClub data while identifying key secondary attributes.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: There has been an increased need for secondary means of credit evaluation by
both traditional banking organizations as well as peer-to-peer lending
entities. This is especially important in the present technological era where
sticking with strict primary credit histories doesn't help distinguish between
a 'good' and a 'bad' borrower, and ends up hurting both the individual borrower
as well as the investor as a whole. We utilized machine learning classification
and clustering algorithms to accurately predict a borrower's creditworthiness
while identifying specific secondary attributes that contribute to this score.
While extensive research has been done in predicting when a loan would be fully
paid, the area of feature selection for lending is relatively new. We achieved
65% F1 and 73% AUC on the LendingClub data while identifying key secondary
attributes.
Related papers
- Credit Risk Meets Large Language Models: Building a Risk Indicator from Loan Descriptions in P2P Lending [1.1970409518725493]
Peer-to-peer (P2P) lending has emerged as a distinctive financing mechanism, linking borrowers with lenders through online platforms.
However, P2P lending faces the challenge of information asymmetry, as lenders often lack sufficient data to assess the creditworthiness of borrowers.
This paper proposes a novel approach to address this issue by leveraging the textual descriptions provided by borrowers during the loan application process.
arXiv Detail & Related papers (2024-01-29T10:11:05Z) - Empowering Many, Biasing a Few: Generalist Credit Scoring through Large
Language Models [53.620827459684094]
Large Language Models (LLMs) have great potential for credit scoring tasks, with strong generalization ability across multiple tasks.
We propose the first open-source comprehensive framework for exploring LLMs for credit scoring.
We then propose the first Credit and Risk Assessment Large Language Model (CALM) by instruction tuning, tailored to the nuanced demands of various financial risk assessment tasks.
arXiv Detail & Related papers (2023-10-01T03:50:34Z) - Machine Learning Models Evaluation and Feature Importance Analysis on
NPL Dataset [0.0]
We evaluate how different Machine learning models perform on the dataset provided by a private bank in Ethiopia.
XGBoost achieves the highest F1 score on the KMeans SMOTE over-sampled data.
arXiv Detail & Related papers (2022-08-28T17:09:44Z) - On the dynamics of credit history and social interaction features, and
their impact on creditworthiness assessment performance [3.6748639131154315]
This study aims to understand the creditworthiness assessment performance dynamics and how it is influenced by the credit history, repayment behavior, and social network features.
Our research shows that borrowers' history increases performance at a decreasing rate during the first six months and then stabilizes.
The most notable effect on perfomance of social networks features occurs at loan application.
arXiv Detail & Related papers (2022-04-13T00:42:27Z) - Selective Credit Assignment [57.41789233550586]
We describe a unified view on temporal-difference algorithms for selective credit assignment.
We present insights into applying weightings to value-based learning and planning algorithms.
arXiv Detail & Related papers (2022-02-20T00:07:57Z) - Feature-Level Fusion of Super-App and Telecommunication Alternative Data
Sources for Credit Card Fraud Detection [106.33204064461802]
We review the effectiveness of a feature-level fusion of super-app customer information, mobile phone line data, and traditional credit risk variables for the early detection of identity theft credit card fraud.
We evaluate our approach over approximately 90,000 users from a credit lender's digital platform database.
arXiv Detail & Related papers (2021-11-05T19:10:35Z) - Bagging Supervised Autoencoder Classifier for Credit Scoring [3.5977219275318166]
The imbalanced nature of credit scoring datasets, as well as the heterogeneous nature of features in credit scoring datasets, pose difficulties in developing and implementing effective credit scoring models.
We propose the Bagging Supervised Autoencoder (BSAC) that mainly leverages the superior performance of the Supervised Autoencoder.
BSAC also addresses the data imbalance problem by employing a variant of the Bagging process based on the undersampling of the majority class.
arXiv Detail & Related papers (2021-08-12T17:49:08Z) - Online Learning of Competitive Equilibria in Exchange Economies [94.24357018178867]
In economics, the sharing of scarce resources among multiple rational agents is a classical problem.
We propose an online learning mechanism to learn agent preferences.
We demonstrate the effectiveness of this mechanism through numerical simulations.
arXiv Detail & Related papers (2021-06-11T21:32:17Z) - Explanations of Machine Learning predictions: a mandatory step for its
application to Operational Processes [61.20223338508952]
Credit Risk Modelling plays a paramount role.
Recent machine and deep learning techniques have been applied to the task.
We suggest to use LIME technique to tackle the explainability problem in this field.
arXiv Detail & Related papers (2020-12-30T10:27:59Z) - Super-App Behavioral Patterns in Credit Risk Models: Financial,
Statistical and Regulatory Implications [110.54266632357673]
We present the impact of alternative data that originates from an app-based marketplace, in contrast to traditional bureau data, upon credit scoring models.
Our results, validated across two countries, show that these new sources of data are particularly useful for predicting financial behavior in low-wealth and young individuals.
arXiv Detail & Related papers (2020-05-09T01:32:03Z) - Predicting Bank Loan Default with Extreme Gradient Boosting [0.0]
We use an Extreme Gradient Boosting algorithm called XGBoost for loan default prediction.
The prediction is based on a loan data from a leading bank taking into consideration data sets from both the loan application and the demographic of the applicant.
arXiv Detail & Related papers (2020-01-18T18:52:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.