A machine learning workflow to address credit default prediction
- URL: http://arxiv.org/abs/2403.03785v1
- Date: Wed, 6 Mar 2024 15:30:41 GMT
- Title: A machine learning workflow to address credit default prediction
- Authors: Rambod Rahmani, Marco Parola, and Mario G.C.A. Cimino
- Abstract summary: Credit default prediction (CDP) plays a crucial role in assessing the creditworthiness of individuals and businesses.
We propose a workflow-based approach to improve CDP, which refers to the task of assessing the probability that a borrower will default on his or her credit obligations.
- Score: 0.44943951389724796
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Due to the recent increase in interest in Financial Technology (FinTech),
applications like credit default prediction (CDP) are gaining significant
industrial and academic attention. In this regard, CDP plays a crucial role in
assessing the creditworthiness of individuals and businesses, enabling lenders
to make informed decisions regarding loan approvals and risk management. In
this paper, we propose a workflow-based approach to improve CDP, which refers
to the task of assessing the probability that a borrower will default on his or
her credit obligations. The workflow consists of multiple steps, each designed
to leverage the strengths of different techniques featured in machine learning
pipelines and, thus best solve the CDP task. We employ a comprehensive and
systematic approach starting with data preprocessing using Weight of Evidence
encoding, a technique that ensures in a single-shot data scaling by removing
outliers, handling missing values, and making data uniform for models working
with different data types. Next, we train several families of learning models,
introducing ensemble techniques to build more robust models and hyperparameter
optimization via multi-objective genetic algorithms to consider both predictive
accuracy and financial aspects. Our research aims at contributing to the
FinTech industry in providing a tool to move toward more accurate and reliable
credit risk assessment, benefiting both lenders and borrowers.
Related papers
- FinRobot: AI Agent for Equity Research and Valuation with Large Language Models [6.2474959166074955]
This paper presents FinRobot, the first AI agent framework specifically designed for equity research.
FinRobot employs a multi-agent Chain of Thought (CoT) system, integrating both quantitative and qualitative analyses to emulate the comprehensive reasoning of a human analyst.
Unlike existing automated research tools, such as CapitalCube and Wright Reports, FinRobot delivers insights comparable to those produced by major brokerage firms and fundamental research vendors.
arXiv Detail & Related papers (2024-11-13T17:38:07Z) - Bank Loan Prediction Using Machine Learning Techniques [0.0]
We have worked on a dataset of 148,670 instances and 37 attributes using machine learning methods.
The best-performing algorithm was AdaBoosting, which achieved an incredible accuracy of 99.99%.
arXiv Detail & Related papers (2024-10-11T15:01:47Z) - Enhancing the Efficiency and Accuracy of Underlying Asset Reviews in Structured Finance: The Application of Multi-agent Framework [3.022596401099308]
We show that AI can automate the verification of information between loan applications and bank statements effectively.
This research highlights AI's potential to minimize manual errors and streamline due diligence, suggesting a broader application of AI in financial document analysis and risk management.
arXiv Detail & Related papers (2024-05-07T13:09:49Z) - Empowering Many, Biasing a Few: Generalist Credit Scoring through Large
Language Models [53.620827459684094]
Large Language Models (LLMs) have great potential for credit scoring tasks, with strong generalization ability across multiple tasks.
We propose the first open-source comprehensive framework for exploring LLMs for credit scoring.
We then propose the first Credit and Risk Assessment Large Language Model (CALM) by instruction tuning, tailored to the nuanced demands of various financial risk assessment tasks.
arXiv Detail & Related papers (2023-10-01T03:50:34Z) - Can ChatGPT Forecast Stock Price Movements? Return Predictability and Large Language Models [51.3422222472898]
We document the capability of large language models (LLMs) like ChatGPT to predict stock price movements using news headlines.
We develop a theoretical model incorporating information capacity constraints, underreaction, limits-to-arbitrage, and LLMs.
arXiv Detail & Related papers (2023-04-15T19:22:37Z) - Financial Distress Prediction For Small And Medium Enterprises Using
Machine Learning Techniques [5.301137510638804]
Financial Distress Prediction plays a crucial role in the economy by accurately forecasting the number and probability of failing structures.
However, predicting financial distress for Small and Medium Enterprises is challenging due to their inherent ambiguity.
We propose a corporate FCP model that better aligns with industry practice and incorporates the gathering of thin-head component analysis of financial data, corporate governance qualities, and market exchange data with a Relevant Vector Machine.
arXiv Detail & Related papers (2023-02-23T15:58:30Z) - Understanding Information Disclosure from Secure Computation Output: A Study of Average Salary Computation [58.74407460023331]
Quantifying information disclosure about private inputs from observing a function outcome is the subject of this work.
Motivated by the City of Boston gender pay gap studies, in this work we focus on the computation of the average of salaries.
arXiv Detail & Related papers (2022-09-21T15:59:48Z) - Bayesian Bilinear Neural Network for Predicting the Mid-price Dynamics
in Limit-Order Book Markets [84.90242084523565]
Traditional time-series econometric methods often appear incapable of capturing the true complexity of the multi-level interactions driving the price dynamics.
By adopting a state-of-the-art second-order optimization algorithm, we train a Bayesian bilinear neural network with temporal attention.
By addressing the use of predictive distributions to analyze errors and uncertainties associated with the estimated parameters and model forecasts, we thoroughly compare our Bayesian model with traditional ML alternatives.
arXiv Detail & Related papers (2022-03-07T18:59:54Z) - Multi Agent System for Machine Learning Under Uncertainty in Cyber
Physical Manufacturing System [78.60415450507706]
Recent advancements in predictive machine learning has led to its application in various use cases in manufacturing.
Most research focused on maximising predictive accuracy without addressing the uncertainty associated with it.
In this paper, we determine the sources of uncertainty in machine learning and establish the success criteria of a machine learning system to function well under uncertainty.
arXiv Detail & Related papers (2021-07-28T10:28:05Z) - Explanations of Machine Learning predictions: a mandatory step for its
application to Operational Processes [61.20223338508952]
Credit Risk Modelling plays a paramount role.
Recent machine and deep learning techniques have been applied to the task.
We suggest to use LIME technique to tackle the explainability problem in this field.
arXiv Detail & Related papers (2020-12-30T10:27:59Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.