Related papers: Firms Default Prediction with Machine Learning

Firms Default Prediction with Machine Learning

URL: http://arxiv.org/abs/2002.11705v1
Date: Mon, 17 Feb 2020 10:09:35 GMT
Title: Firms Default Prediction with Machine Learning
Authors: Tesi Aliaj and Aris Anagnostopoulos and Stefano Piersanti
Abstract summary: An earlier sign that a company has financial difficulties and may eventually bankrupt is going in emphdefault, loosely speaking. Our study uses, for the first time according to our knowledge, a very large database of granular credit data from the Italian Central Credit Register of Bank of Italy. We find that ensemble techniques and random forest provide the best results, corroborating the findings of Barboza et al.
Score: 3.8415806547786735
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Academics and practitioners have studied over the years models for predicting firms bankruptcy, using statistical and machine-learning approaches. An earlier sign that a company has financial difficulties and may eventually bankrupt is going in \emph{default}, which, loosely speaking means that the company has been having difficulties in repaying its loans towards the banking system. Firms default status is not technically a failure but is very relevant for bank lending policies and often anticipates the failure of the company. Our study uses, for the first time according to our knowledge, a very large database of granular credit data from the Italian Central Credit Register of Bank of Italy that contain information on all Italian companies' past behavior towards the entire Italian banking system to predict their default using machine-learning techniques. Furthermore, we combine these data with other information regarding companies' public balance sheet data. We find that ensemble techniques and random forest provide the best results, corroborating the findings of Barboza et al. (Expert Syst. Appl., 2017).

Related papers

Financial Fraud Identification and Interpretability Study for Listed Companies Based on Convolutional Neural Network [4.504327589607446]
This paper proposes a financial fraud detection framework for Chinese A-share listed companies based on convolutional neural networks (CNNs)<n> Experiments show that the CNN outperforms logistic regression and LightGBM in accuracy, robustness, and early-warning performance.<n>We find that solvency, ratio structure, governance structure, and internal control are general predictors of fraud, while environmental indicators matter mainly in high-pollution industries.
arXiv Detail & Related papers (2025-12-07T04:14:16Z)
Are Foundation Models Useful for Bankruptcy Prediction? [0.0]
We study bankruptcy forecasting using Llama-3.3-70B-Instruct and TabPFN.<n>We provide the first systematic comparison of foundation models against classical machine learning baselines for this task.
arXiv Detail & Related papers (2025-11-20T13:59:18Z)
Enhancing Bankruptcy Prediction of Banks through Advanced Machine Learning Techniques: An Innovative Approach and Analysis [0.0]
Calculating the probability of a bank going bankrupt is one way to ensure the banking system is safe and sound.<n> statistical methods rely on rigid and sometimes irrelevant assumptions, which can result in low forecast accuracy.<n>The proposed innovative machine learning approach help to implement policies that reduce the costs of bankruptcy.
arXiv Detail & Related papers (2025-10-08T10:16:10Z)
Your AI, Not Your View: The Bias of LLMs in Investment Analysis [62.388554963415906]
In finance, Large Language Models (LLMs) face frequent knowledge conflicts arising from discrepancies between their pre-trained parametric knowledge and real-time market data.<n>These conflicts are especially problematic in real-world investment services, where a model's inherent biases can misalign with institutional objectives.<n>We propose an experimental framework to investigate emergent behaviors in such conflict scenarios, offering a quantitative analysis of bias in investment analysis.
arXiv Detail & Related papers (2025-07-28T16:09:38Z)
Company-as-Tribe: Company Financial Risk Assessment on Tribe-Style Graph with Hierarchical Graph Neural Networks [62.94317686301643]
Company financial risk is ubiquitous and early risk assessment for listed companies can avoid considerable losses. Traditional methods mainly focus on the financial statements of companies and lack the complex relationships among them. We propose a novel Hierarchical Graph Neural Network (TH-GNN) for Tribe-style graphs via two levels, with the first level to encode the structure pattern of the tribes with contrastive learning, and the second level to diffuse information based on the inter-tribe relations.
arXiv Detail & Related papers (2023-01-31T09:17:13Z)
Should Bank Stress Tests Be Fair? [1.370633147306388]
We argue that simply pooling data across banks treats banks equally but is subject to two deficiencies. We argue for estimating and then discarding centered bank fixed effects as preferable to simply ignoring differences across banks.
arXiv Detail & Related papers (2022-07-27T06:46:51Z)
Fighting Money Laundering with Statistics and Machine Learning [95.42181254494287]
There is little scientific literature on statistical and machine learning methods for anti-money laundering. We propose a unifying terminology with two central elements: (i) client risk profiling and (ii) suspicious behavior flagging.
arXiv Detail & Related papers (2022-01-11T21:31:18Z)
Solving the Data Sparsity Problem in Predicting the Success of the Startups with Machine Learning Methods [2.939434965353219]
We investigate several machine learning algorithms with a large dataset from Crunchbase. The results suggest that LightGBM and XGBoost perform best and achieve 53.03% and 52.96% F1 scores. These findings have substantial implications on how machine learning methods can help startup companies and investors.
arXiv Detail & Related papers (2021-12-15T09:21:32Z)
Feature-Level Fusion of Super-App and Telecommunication Alternative Data Sources for Credit Card Fraud Detection [106.33204064461802]
We review the effectiveness of a feature-level fusion of super-app customer information, mobile phone line data, and traditional credit risk variables for the early detection of identity theft credit card fraud. We evaluate our approach over approximately 90,000 users from a credit lender's digital platform database.
arXiv Detail & Related papers (2021-11-05T19:10:35Z)
Adversarial Semi-supervised Learning for Corporate Credit Ratings [1.90365714903665]
In this work, we consider the problem of adversarial semi-supervised learning for corporate credit rating. In the first phase, we train a normal rating system via a normal machine-learning algorithm to give unlabeled data pseudo rating level. In the second phase, adversarial semi-supervised learning is applied uniting labeled data and pseudo-labeled data.
arXiv Detail & Related papers (2021-04-04T09:05:53Z)
BeFair: Addressing Fairness in the Banking Sector [54.08949958349055]
We present the initial results of an industrial open innovation project in the banking sector. We propose a general roadmap for fairness in ML and the implementation of a toolkit called BeFair that helps to identify and mitigate bias.
arXiv Detail & Related papers (2021-02-03T16:37:10Z)
PCAL: A Privacy-preserving Intelligent Credit Risk Modeling Framework Based on Adversarial Learning [111.19576084222345]
This paper proposes a framework of Privacy-preserving Credit risk modeling based on Adversarial Learning (PCAL) PCAL aims to mask the private information inside the original dataset, while maintaining the important utility information for the target prediction task performance. Results indicate that PCAL can learn an effective, privacy-free representation from user data, providing a solid foundation towards privacy-preserving machine learning for credit risk analysis.
arXiv Detail & Related papers (2020-10-06T07:04:59Z)
Societal biases reinforcement through machine learning: A credit scoring perspective [38.437384481171804]
This paper aims to analyse whether machine learning and AI ensure that social biases thrive. In this paper, we analyse how social biases are transmitted from the data into banks loan approvals by predicting either the gender or the ethnicity of the customers.
arXiv Detail & Related papers (2020-06-15T12:40:21Z)
Super-App Behavioral Patterns in Credit Risk Models: Financial, Statistical and Regulatory Implications [110.54266632357673]
We present the impact of alternative data that originates from an app-based marketplace, in contrast to traditional bureau data, upon credit scoring models. Our results, validated across two countries, show that these new sources of data are particularly useful for predicting financial behavior in low-wealth and young individuals.
arXiv Detail & Related papers (2020-05-09T01:32:03Z)

This list is automatically generated from the titles and abstracts of the papers in this site.