Inclusive FinTech Lending via Contrastive Learning and Domain Adaptation
- URL: http://arxiv.org/abs/2305.05827v1
- Date: Wed, 10 May 2023 01:11:35 GMT
- Title: Inclusive FinTech Lending via Contrastive Learning and Domain Adaptation
- Authors: Xiyang Hu, Yan Huang, Beibei Li, Tian Lu
- Abstract summary: FinTech lending has played a significant role in facilitating financial inclusion.
There are concerns about the potentially biased algorithmic decision-making during loan screening.
We propose a new Transformer-based sequential loan screening model with self-supervised contrastive learning and domain adaptation.
- Score: 9.75150920742607
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: FinTech lending (e.g., micro-lending) has played a significant role in
facilitating financial inclusion. It has reduced processing times and costs,
enhanced the user experience, and made it possible for people to obtain loans
who may not have qualified for credit from traditional lenders. However, there
are concerns about the potentially biased algorithmic decision-making during
loan screening. Machine learning algorithms used to evaluate credit quality can
be influenced by representation bias in the training data, as we only have
access to the default outcome labels of approved loan applications, for which
the borrowers' socioeconomic characteristics are better than those of rejected
ones. In this case, the model trained on the labeled data performs well on the
historically approved population, but does not generalize well to borrowers of
low socioeconomic background. In this paper, we investigate the problem of
representation bias in loan screening for a real-world FinTech lending
platform. We propose a new Transformer-based sequential loan screening model
with self-supervised contrastive learning and domain adaptation to tackle this
challenging issue. We use contrastive learning to train our feature extractor
on unapproved (unlabeled) loan applications and use domain adaptation to
generalize the performance of our label predictor. We demonstrate the
effectiveness of our model through extensive experimentation in the real-world
micro-lending setting. Our results show that our model significantly promotes
the inclusiveness of funding decisions, while also improving loan screening
accuracy and profit by 7.10% and 8.95%, respectively. We also show that
incorporating the test data into contrastive learning and domain adaptation and
labeling a small ratio of test data can further boost model performance.
Related papers
- Bank Loan Prediction Using Machine Learning Techniques [0.0]
We have worked on a dataset of 148,670 instances and 37 attributes using machine learning methods.
The best-performing algorithm was AdaBoosting, which achieved an incredible accuracy of 99.99%.
arXiv Detail & Related papers (2024-10-11T15:01:47Z) - Ensuring Equitable Financial Decisions: Leveraging Counterfactual Fairness and Deep Learning for Bias [0.0]
This research paper investigates advanced bias mitigation techniques, with a particular focus on counterfactual fairness in conjunction with data augmentation.
The study looks into how these integrated approaches can lessen gender bias in the financial industry, specifically in loan approval procedures.
arXiv Detail & Related papers (2024-08-27T14:28:06Z) - Advancing Anomaly Detection: Non-Semantic Financial Data Encoding with LLMs [49.57641083688934]
We introduce a novel approach to anomaly detection in financial data using Large Language Models (LLMs) embeddings.
Our experiments demonstrate that LLMs contribute valuable information to anomaly detection as our models outperform the baselines.
arXiv Detail & Related papers (2024-06-05T20:19:09Z) - Fast Model Debias with Machine Unlearning [54.32026474971696]
Deep neural networks might behave in a biased manner in many real-world scenarios.
Existing debiasing methods suffer from high costs in bias labeling or model re-training.
We propose a fast model debiasing framework (FMD) which offers an efficient approach to identify, evaluate and remove biases.
arXiv Detail & Related papers (2023-10-19T08:10:57Z) - Machine Learning Models Evaluation and Feature Importance Analysis on
NPL Dataset [0.0]
We evaluate how different Machine learning models perform on the dataset provided by a private bank in Ethiopia.
XGBoost achieves the highest F1 score on the KMeans SMOTE over-sampled data.
arXiv Detail & Related papers (2022-08-28T17:09:44Z) - Explanations of Machine Learning predictions: a mandatory step for its
application to Operational Processes [61.20223338508952]
Credit Risk Modelling plays a paramount role.
Recent machine and deep learning techniques have been applied to the task.
We suggest to use LIME technique to tackle the explainability problem in this field.
arXiv Detail & Related papers (2020-12-30T10:27:59Z) - Explainable AI for Interpretable Credit Scoring [0.8379286663107844]
Credit scoring helps financial experts make better decisions regarding whether or not to accept a loan application.
Regulations have added the need for model interpretability to ensure that algorithmic decisions are understandable coherent.
We present a credit scoring model that is both accurate and interpretable.
arXiv Detail & Related papers (2020-12-03T18:44:03Z) - A Novel Classification Approach for Credit Scoring based on Gaussian
Mixture Models [0.0]
This paper introduces a new method for credit scoring based on Gaussian Mixture Models.
Our algorithm classifies consumers into groups which are labeled as positive or negative.
We apply our model with real world databases from Australia, Japan, and Germany.
arXiv Detail & Related papers (2020-10-26T07:34:27Z) - Accurate and Robust Feature Importance Estimation under Distribution
Shifts [49.58991359544005]
PRoFILE is a novel feature importance estimation method.
We show significant improvements over state-of-the-art approaches, both in terms of fidelity and robustness.
arXiv Detail & Related papers (2020-09-30T05:29:01Z) - Fairness in Semi-supervised Learning: Unlabeled Data Help to Reduce
Discrimination [53.3082498402884]
A growing specter in the rise of machine learning is whether the decisions made by machine learning models are fair.
We present a framework of fair semi-supervised learning in the pre-processing phase, including pseudo labeling to predict labels for unlabeled data.
A theoretical decomposition analysis of bias, variance and noise highlights the different sources of discrimination and the impact they have on fairness in semi-supervised learning.
arXiv Detail & Related papers (2020-09-25T05:48:56Z) - Super-App Behavioral Patterns in Credit Risk Models: Financial,
Statistical and Regulatory Implications [110.54266632357673]
We present the impact of alternative data that originates from an app-based marketplace, in contrast to traditional bureau data, upon credit scoring models.
Our results, validated across two countries, show that these new sources of data are particularly useful for predicting financial behavior in low-wealth and young individuals.
arXiv Detail & Related papers (2020-05-09T01:32:03Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.