Explainable AI for Comprehensive Risk Assessment for Financial Reports: A Lightweight Hierarchical Transformer Network Approach
- URL: http://arxiv.org/abs/2506.23767v2
- Date: Thu, 03 Jul 2025 03:15:52 GMT
- Title: Explainable AI for Comprehensive Risk Assessment for Financial Reports: A Lightweight Hierarchical Transformer Network Approach
- Authors: Xue Wen Tan, Stanley Kok,
- Abstract summary: Every publicly traded U.S. company files an annual 10-K report containing critical insights into financial health and risk.<n>We propose Tiny eXplainable Risk Assessor (TinyXRA), a transformer-based model that automatically assesses company risk from these reports.<n>TinyXRA incorporates skewness, kurtosis, and the Sortino ratio for more comprehensive risk assessment.
- Score: 1.2200609701777907
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Every publicly traded U.S. company files an annual 10-K report containing critical insights into financial health and risk. We propose Tiny eXplainable Risk Assessor (TinyXRA), a lightweight and explainable transformer-based model that automatically assesses company risk from these reports. Unlike prior work that relies solely on the standard deviation of excess returns (adjusted for the Fama-French model), which indiscriminately penalizes both upside and downside risk, TinyXRA incorporates skewness, kurtosis, and the Sortino ratio for more comprehensive risk assessment. We leverage TinyBERT as our encoder to efficiently process lengthy financial documents, coupled with a novel dynamic, attention-based word cloud mechanism that provides intuitive risk visualization while filtering irrelevant terms. This lightweight design ensures scalable deployment across diverse computing environments with real-time processing capabilities for thousands of financial documents which is essential for production systems with constrained computational resources. We employ triplet loss for risk quartile classification, improving over pairwise loss approaches in existing literature by capturing both the direction and magnitude of risk differences. Our TinyXRA achieves state-of-the-art predictive accuracy across seven test years on a dataset spanning 2013-2024, while providing transparent and interpretable risk assessments. We conduct comprehensive ablation studies to evaluate our contributions and assess model explanations both quantitatively by systematically removing highly attended words and sentences, and qualitatively by examining explanation coherence. The paper concludes with findings, practical implications, limitations, and future research directions. Our code is available at https://github.com/Chen-XueWen/TinyXRA.
Related papers
- Entity-Specific Cyber Risk Assessment using InsurTech Empowered Risk Factors [12.0472507787498]
We develop a novel InsurTech framework that enriches cyber incident data with entity-specific attributes.<n>We apply multiple interpretable ML techniques to identify and cross-validate potential risk factors.<n>The framework generates transparent, entity-specific cyber risk profiles, supporting customized underwriting and proactive cyber risk mitigation.
arXiv Detail & Related papers (2025-07-10T22:04:00Z) - Risk-Averse Certification of Bayesian Neural Networks [70.44969603471903]
We propose a Risk-Averse Certification framework for Bayesian neural networks called RAC-BNN.<n>Our method leverages sampling and optimisation to compute a sound approximation of the output set of a BNN.<n>We validate RAC-BNN on a range of regression and classification benchmarks and compare its performance with a state-of-the-art method.
arXiv Detail & Related papers (2024-11-29T14:22:51Z) - Elevating Software Trust: Unveiling and Quantifying the Risk Landscape [9.428116807615407]
We propose a risk assessment framework called SAFER (Software Analysis Framework for Evaluating Risk)<n>This framework is based on the necessity of a dynamic, data-driven, and adaptable process to quantify security risk in the software supply chain.<n>The results suggest that SAFER mitigates subjectivity and yields dynamic data-driven weights as well as security risk scores.
arXiv Detail & Related papers (2024-08-06T00:50:08Z) - Explainable Risk Classification in Financial Reports [1.2200609701777907]
Every publicly traded company in the US is required to file an annual 10-K financial report, which contains a wealth of information about the company.<n>We propose an explainable deep-learning model, called FinBERT-XRC, that takes a 10-K report as input, and automatically assesses the post-event return volatility risk of its associated company.
arXiv Detail & Related papers (2024-05-03T06:56:47Z) - Language Model Cascades: Token-level uncertainty and beyond [65.38515344964647]
Recent advances in language models (LMs) have led to significant improvements in quality on complex NLP tasks.
Cascading offers a simple strategy to achieve more favorable cost-quality tradeoffs.
We show that incorporating token-level uncertainty through learned post-hoc deferral rules can significantly outperform simple aggregation strategies.
arXiv Detail & Related papers (2024-04-15T21:02:48Z) - On the Societal Impact of Open Foundation Models [93.67389739906561]
We focus on open foundation models, defined here as those with broadly available model weights.
We identify five distinctive properties of open foundation models that lead to both their benefits and risks.
arXiv Detail & Related papers (2024-02-27T16:49:53Z) - Just-In-Time Learning for Operational Risk Assessment in Power Grids [12.939739997360016]
In a grid with a significant share of renewable generation, operators will need additional tools to evaluate the operational risk.
This paper proposes a Just-In-Time Risk Assessment Learning Framework (JITRALF) as an alternative.
JITRALF trains risk surrogates, one for each hour in the day, using Machine Learning (ML) to predict the quantities needed to estimate risk.
arXiv Detail & Related papers (2022-09-26T15:11:27Z) - FisrEbp: Enterprise Bankruptcy Prediction via Fusing its Intra-risk and
Spillover-Risk [4.369823783549928]
We propose a novel method that is equipped with an LSTM-based intra-risk encoder and GNNs-based spillover-risk encoder.
The intra-risk encoder is able to capture enterprise intra-risk using the statistic correlated indicators from the basic business information and litigation information.
The spillover-risk encoder consists of hypergraph neural networks and heterogeneous graph neural networks.
arXiv Detail & Related papers (2022-02-01T04:28:48Z) - Risk Minimization from Adaptively Collected Data: Guarantees for
Supervised and Policy Learning [57.88785630755165]
Empirical risk minimization (ERM) is the workhorse of machine learning, but its model-agnostic guarantees can fail when we use adaptively collected data.
We study a generic importance sampling weighted ERM algorithm for using adaptively collected data to minimize the average of a loss function over a hypothesis class.
For policy learning, we provide rate-optimal regret guarantees that close an open gap in the existing literature whenever exploration decays to zero.
arXiv Detail & Related papers (2021-06-03T09:50:13Z) - Risk-Constrained Thompson Sampling for CVaR Bandits [82.47796318548306]
We consider a popular risk measure in quantitative finance known as the Conditional Value at Risk (CVaR)
We explore the performance of a Thompson Sampling-based algorithm CVaR-TS under this risk measure.
arXiv Detail & Related papers (2020-11-16T15:53:22Z) - Encoded Value-at-Risk: A Predictive Machine for Financial Risk
Management [0.0]
We provide a novel approach for measuring market risk called Encoded Value-at-Risk (Encoded VaR)
Encoded VaR is based on a type of artificial neural network, called Variational Auto-encoders (VAEs)
arXiv Detail & Related papers (2020-11-13T03:25:35Z) - Learning Bounds for Risk-sensitive Learning [86.50262971918276]
In risk-sensitive learning, one aims to find a hypothesis that minimizes a risk-averse (or risk-seeking) measure of loss.
We study the generalization properties of risk-sensitive learning schemes whose optimand is described via optimized certainty equivalents.
arXiv Detail & Related papers (2020-06-15T05:25:02Z) - SAMBA: Safe Model-Based & Active Reinforcement Learning [59.01424351231993]
SAMBA is a framework for safe reinforcement learning that combines aspects from probabilistic modelling, information theory, and statistics.
We evaluate our algorithm on a variety of safe dynamical system benchmarks involving both low and high-dimensional state representations.
We provide intuition as to the effectiveness of the framework by a detailed analysis of our active metrics and safety constraints.
arXiv Detail & Related papers (2020-06-12T10:40:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.