Related papers: Debiasing Alternative Data for Credit Underwriting Using Causal Inference

Debiasing Alternative Data for Credit Underwriting Using Causal Inference

URL: http://arxiv.org/abs/2410.22382v2
Date: Thu, 31 Oct 2024 17:12:27 GMT
Title: Debiasing Alternative Data for Credit Underwriting Using Causal Inference
Authors: Chris Lam,
Abstract summary: Alternative data provides valuable insights for lenders to evaluate a borrower's creditworthiness. But some forms of alternative data have historically been excluded from credit underwriting because it could act as an illegal proxy for a protected class. We propose a method for applying causal inference to a supervised machine learning model to debias alternative data so that it might be used for credit underwriting.
Score: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Alternative data provides valuable insights for lenders to evaluate a borrower's creditworthiness, which could help expand credit access to underserved groups and lower costs for borrowers. But some forms of alternative data have historically been excluded from credit underwriting because it could act as an illegal proxy for a protected class like race or gender, causing redlining. We propose a method for applying causal inference to a supervised machine learning model to debias alternative data so that it might be used for credit underwriting. We demonstrate how our algorithm can be used against a public credit dataset to improve model accuracy across different racial groups, while providing theoretically robust nondiscrimination guarantees.

Related papers

A Distributionally Robust Optimisation Approach to Fair Credit Scoring [2.8851756275902467]
Credit scoring has been catalogued by the European Commission and the Executive Office of the US President as a high-risk classification task. To address this concern, recent credit scoring research has considered a range of fairness-enhancing techniques.
arXiv Detail & Related papers (2024-02-02T11:43:59Z)
D-BIAS: A Causality-Based Human-in-the-Loop System for Tackling Algorithmic Bias [57.87117733071416]
We propose D-BIAS, a visual interactive tool that embodies human-in-the-loop AI approach for auditing and mitigating social biases. A user can detect the presence of bias against a group by identifying unfair causal relationships in the causal network. For each interaction, say weakening/deleting a biased causal edge, the system uses a novel method to simulate a new (debiased) dataset.
arXiv Detail & Related papers (2022-08-10T03:41:48Z)
The Fairness of Credit Scoring Models [0.0]
In credit markets, screening algorithms aim to discriminate between good-type and bad-type borrowers. This can be unintentional and originate from the training dataset or from the model itself. We show how to formally test the algorithmic fairness of scoring models and how to identify the variables responsible for any lack of fairness.
arXiv Detail & Related papers (2022-05-20T14:20:40Z)
Selective Credit Assignment [57.41789233550586]
We describe a unified view on temporal-difference algorithms for selective credit assignment. We present insights into applying weightings to value-based learning and planning algorithms.
arXiv Detail & Related papers (2022-02-20T00:07:57Z)
Feature-Level Fusion of Super-App and Telecommunication Alternative Data Sources for Credit Card Fraud Detection [106.33204064461802]
We review the effectiveness of a feature-level fusion of super-app customer information, mobile phone line data, and traditional credit risk variables for the early detection of identity theft credit card fraud. We evaluate our approach over approximately 90,000 users from a credit lender's digital platform database.
arXiv Detail & Related papers (2021-11-05T19:10:35Z)
How Costly is Noise? Data and Disparities in Consumer Credit [0.0]
We show that credit scores are noisier indicators of default risk for historically under-served groups. We find that equalizing the precision of credit scores can reduce disparities in approval rates and in credit misallocation for disadvantaged groups by approximately half.
arXiv Detail & Related papers (2021-05-17T00:42:26Z)
Enhancing User' s Income Estimation with Super-App Alternative Data [59.60094442546867]
It compares the performance of these alternative data sources with the performance of industry-accepted bureau income estimators. Ultimately, this paper shows the incentive for financial institutions to seek to incorporate alternative data into constructing their risk profiles.
arXiv Detail & Related papers (2021-04-12T21:34:44Z)
A Novel Classification Approach for Credit Scoring based on Gaussian Mixture Models [0.0]
This paper introduces a new method for credit scoring based on Gaussian Mixture Models. Our algorithm classifies consumers into groups which are labeled as positive or negative. We apply our model with real world databases from Australia, Japan, and Germany.
arXiv Detail & Related papers (2020-10-26T07:34:27Z)
PCAL: A Privacy-preserving Intelligent Credit Risk Modeling Framework Based on Adversarial Learning [111.19576084222345]
This paper proposes a framework of Privacy-preserving Credit risk modeling based on Adversarial Learning (PCAL) PCAL aims to mask the private information inside the original dataset, while maintaining the important utility information for the target prediction task performance. Results indicate that PCAL can learn an effective, privacy-free representation from user data, providing a solid foundation towards privacy-preserving machine learning for credit risk analysis.
arXiv Detail & Related papers (2020-10-06T07:04:59Z)
Super-App Behavioral Patterns in Credit Risk Models: Financial, Statistical and Regulatory Implications [110.54266632357673]
We present the impact of alternative data that originates from an app-based marketplace, in contrast to traditional bureau data, upon credit scoring models. Our results, validated across two countries, show that these new sources of data are particularly useful for predicting financial behavior in low-wealth and young individuals.
arXiv Detail & Related papers (2020-05-09T01:32:03Z)

This list is automatically generated from the titles and abstracts of the papers in this site.