Related papers: Generative AI Enhanced Financial Risk Management Information Retrieval

Generative AI Enhanced Financial Risk Management Information Retrieval

URL: http://arxiv.org/abs/2504.06293v2
Date: Thu, 10 Apr 2025 03:08:59 GMT
Title: Generative AI Enhanced Financial Risk Management Information Retrieval
Authors: Amin Haeri, Jonathan Vitrano, Mahdi Ghelichi,
Abstract summary: RiskData is a dataset curated for finetuning embedding models in risk management.<n>RiskEmbed is a finetuned embedding model designed to improve retrieval accuracy in financial question-answering systems.
Score: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Risk management in finance involves recognizing, evaluating, and addressing financial risks to maintain stability and ensure regulatory compliance. Extracting relevant insights from extensive regulatory documents is a complex challenge requiring advanced retrieval and language models. This paper introduces RiskData, a dataset specifically curated for finetuning embedding models in risk management, and RiskEmbed, a finetuned embedding model designed to improve retrieval accuracy in financial question-answering systems. The dataset is derived from 94 regulatory guidelines published by the Office of the Superintendent of Financial Institutions (OSFI) from 1991 to 2024. We finetune a state-of-the-art sentence BERT embedding model to enhance domain-specific retrieval performance typically for Retrieval-Augmented Generation (RAG) systems. Experimental results demonstrate that RiskEmbed significantly outperforms general-purpose and financial embedding models, achieving substantial improvements in ranking metrics. By open-sourcing both the dataset and the model, we provide a valuable resource for financial institutions and researchers aiming to develop more accurate and efficient risk management AI solutions.

Related papers

Machine Learning based Enterprise Financial Audit Framework and High Risk Identification [6.433444278723668]
This study proposes an AI-driven framework for enterprise financial audits and high-risk identification.<n>Using a dataset from the Big Four accounting firms (EY, PwC, Deloitte, KPMG) from 2020 to 2025, the research examines trends in risk assessment, compliance violations, and fraud detection.<n>The study recommends adopting Random Forest as a core model, enhancing features via engineering, and implementing real-time risk monitoring.
arXiv Detail & Related papers (2025-07-08T00:22:49Z)
FinDER: Financial Dataset for Question Answering and Evaluating Retrieval-Augmented Generation [63.55583665003167]
We present FinDER, an expert-generated dataset tailored for Retrieval-Augmented Generation (RAG) in finance. FinDER focuses on annotating search-relevant evidence by domain experts, offering 5,703 query-evidence-answer triplets. By challenging models to retrieve relevant information from large corpora, FinDER offers a more realistic benchmark for evaluating RAG systems.
arXiv Detail & Related papers (2025-04-22T11:30:13Z)
AI for Climate Finance: Agentic Retrieval and Multi-Step Reasoning for Early Warning System Investments [1.3192560874022086]
This study focuses on a real-world application: tracking EWS investments in the Climate Risk and Early Warning Systems (CREWS) Fund.<n>We analyze 25 MDB project documents and evaluate multiple AI-driven classification methods, including zero-shot and few-shot learning.<n>Our results show that the agent-based RAG approach significantly outperforms other methods, achieving 87% accuracy, 89% precision, and 83% recall.
arXiv Detail & Related papers (2025-04-07T14:11:11Z)
Cross-Asset Risk Management: Integrating LLMs for Real-Time Monitoring of Equity, Fixed Income, and Currency Markets [30.815524322885754]
Large language models (LLMs) have emerged as powerful tools in the field of finance.<n>We introduce a Cross-Asset Risk Management framework that utilizes LLMs to facilitate real-time monitoring of equity, fixed income, and currency markets.
arXiv Detail & Related papers (2025-04-05T22:28:35Z)
Model Risk Management for Generative AI In Financial Institutions [6.995717424201032]
The success of OpenAI's ChatGPT in 2023 has spurred financial enterprises into exploring Generative AI applications.<n>This paper outlines the key aspects for model risk management of generative AI model with a special emphasis on additional practices required in model validation.
arXiv Detail & Related papers (2025-03-19T19:52:29Z)
Towards Trustworthy Retrieval Augmented Generation for Large Language Models: A Survey [92.36487127683053]
Retrieval-Augmented Generation (RAG) is an advanced technique designed to address the challenges of Artificial Intelligence-Generated Content (AIGC) RAG provides reliable and up-to-date external knowledge, reduces hallucinations, and ensures relevant context across a wide range of tasks. Despite RAG's success and potential, recent studies have shown that the RAG paradigm also introduces new risks, including privacy concerns, adversarial attacks, and accountability issues.
arXiv Detail & Related papers (2025-02-08T06:50:47Z)
Leveraging Generative Adversarial Networks for Addressing Data Imbalance in Financial Market Supervision [5.864973298916232]
This study explores the application of generative adversarial networks in financial market supervision.<n>The data generated by GAN has significant advantages in dealing with imbalance problems and improving the prediction accuracy of the model.
arXiv Detail & Related papers (2024-12-04T08:06:47Z)
Predicting Liquidity Coverage Ratio with Gated Recurrent Units: A Deep Learning Model for Risk Management [5.864973298916232]
This paper proposes a liquidity coverage ratio (LCR) prediction model based on the gated recurrent unit (GRU) network to help financial institutions manage their liquidity risk more effectively. By utilizing the GRU network in deep learning technology, the model can automatically learn complex patterns from historical data and accurately predict LCR for a period of time in the future.
arXiv Detail & Related papers (2024-10-24T23:43:50Z)
Trustworthiness in Retrieval-Augmented Generation Systems: A Survey [59.26328612791924]
Retrieval-Augmented Generation (RAG) has quickly grown into a pivotal paradigm in the development of Large Language Models (LLMs) We propose a unified framework that assesses the trustworthiness of RAG systems across six key dimensions: factuality, robustness, fairness, transparency, accountability, and privacy.
arXiv Detail & Related papers (2024-09-16T09:06:44Z)
EARBench: Towards Evaluating Physical Risk Awareness for Task Planning of Foundation Model-based Embodied AI Agents [53.717918131568936]
Embodied artificial intelligence (EAI) integrates advanced AI models into physical entities for real-world interaction. Foundation models as the "brain" of EAI agents for high-level task planning have shown promising results. However, the deployment of these agents in physical environments presents significant safety challenges. This study introduces EARBench, a novel framework for automated physical risk assessment in EAI scenarios.
arXiv Detail & Related papers (2024-08-08T13:19:37Z)
C-RAG: Certified Generation Risks for Retrieval-Augmented Language Models [57.10361282229501]
We propose C-RAG, the first framework to certify generation risks for RAG models. Specifically, we provide conformal risk analysis for RAG models and certify an upper confidence bound of generation risks. We prove that RAG achieves a lower conformal generation risk than that of a single LLM when the quality of the retrieval model and transformer is non-trivial.
arXiv Detail & Related papers (2024-02-05T16:46:16Z)
Explanations of Machine Learning predictions: a mandatory step for its application to Operational Processes [61.20223338508952]
Credit Risk Modelling plays a paramount role. Recent machine and deep learning techniques have been applied to the task. We suggest to use LIME technique to tackle the explainability problem in this field.
arXiv Detail & Related papers (2020-12-30T10:27:59Z)

This list is automatically generated from the titles and abstracts of the papers in this site.