Fugu-MT 論文翻訳(概要): VeritasFi: An Adaptable, Multi-tiered RAG Framework for Multi-modal Financial Question Answering

論文の概要: VeritasFi: An Adaptable, Multi-tiered RAG Framework for Multi-modal Financial Question Answering

arxiv url: http://arxiv.org/abs/2510.10828v1
Date: Sun, 12 Oct 2025 22:45:24 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-14 18:06:30.123901
Title: VeritasFi: An Adaptable, Multi-tiered RAG Framework for Multi-modal Financial Question Answering
Title（参考訳）: VeritasFi: マルチモーダル財務質問応答のための適応型多層RAGフレームワーク
Authors: Zhenghan Tai, Hanwei Wu, Qingchen Hu, Jijun Chi, Hailin He, Lei Ding, Tung Sum Thomas Kwok, Bohuai Xiao, Yuchen Hua, Suyuchen Wang, Peng Lu, Muzhi Li, Yihong Wu, Liheng Ma, Jerry Huang, Jiayi Zhang, Gonghao Zhang, Chaolong Jiang, Jingrui Tian, Sicheng Lyu, Zeyu Li, Boyu Han, Fengran Mo, Xinyue Yu, Yufei Cui, Ling Zhou, Xinyu Wang,
Abstract要約: 金融セクターにおける質問回答 (QA) には, RAG (Retrieval-Augmented Generation) がますます重要になっている。 We present VeritasFi, a innovative hybrid RAG framework that with a multi-modal preprocessing pipeline。提案した設計を統合することで、VeritasFiは金融RAGシステムの適応性と堅牢性を大幅に向上する画期的なフレームワークを提供する。
参考スコア（独自算出の注目度）: 31.334698403426245
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Retrieval-Augmented Generation (RAG) is becoming increasingly essential for Question Answering (QA) in the financial sector, where accurate and contextually grounded insights from complex public disclosures are crucial. However, existing financial RAG systems face two significant challenges: (1) they struggle to process heterogeneous data formats, such as text, tables, and figures; and (2) they encounter difficulties in balancing general-domain applicability with company-specific adaptation. To overcome these challenges, we present VeritasFi, an innovative hybrid RAG framework that incorporates a multi-modal preprocessing pipeline alongside a cutting-edge two-stage training strategy for its re-ranking component. VeritasFi enhances financial QA through three key innovations: (1) A multi-modal preprocessing pipeline that seamlessly transforms heterogeneous data into a coherent, machine-readable format. (2) A tripartite hybrid retrieval engine that operates in parallel, combining deep multi-path retrieval over a semantically indexed document corpus, real-time data acquisition through tool utilization, and an expert-curated memory bank for high-frequency questions, ensuring comprehensive scope, accuracy, and efficiency. (3) A two-stage training strategy for the document re-ranker, which initially constructs a general, domain-specific model using anonymized data, followed by rapid fine-tuning on company-specific data for targeted applications. By integrating our proposed designs, VeritasFi presents a groundbreaking framework that greatly enhances the adaptability and robustness of financial RAG systems, providing a scalable solution for both general-domain and company-specific QA tasks. Code accompanying this work is available at https://github.com/simplew4y/VeritasFi.git.
Abstract（参考訳）: 複雑な公開情報からの正確かつ文脈的に根ざした洞察が不可欠である金融セクターにおける質問回答(QA)には、レトリーバル強化世代(RAG)が欠かせないものになりつつある。しかし,既存の金融RAGシステムでは,(1)テキスト,表,図などの異種データフォーマットの処理に苦慮し,(2)企業固有の適応と一般ドメイン適用性のバランスをとるのが困難である,という2つの大きな課題に直面している。これらの課題を克服するために、私たちはVeritasFiという革新的なハイブリッドなRAGフレームワークを紹介します。 VeritasFiは、1)異種データをシームレスにコヒーレントでマシン可読なフォーマットに変換するマルチモーダル前処理パイプライン。 2) セマンティックインデックス化された文書コーパス上での深いマルチパス検索, ツール利用によるリアルタイムデータ取得, および高頻度質問のための専門家計算メモリバンクを併用し, 包括的範囲, 精度, 効率の確保を図る。 (3) 匿名化データを用いた汎用ドメイン固有モデルの構築と,対象アプリケーションを対象とした企業固有データの迅速な微調整を行う文書再ランカの2段階トレーニング戦略について検討した。提案した設計を統合することで、VeritasFiは、金融RAGシステムの適応性と堅牢性を大幅に向上し、汎用ドメインと企業固有のQAタスクの両方にスケーラブルなソリューションを提供する、画期的なフレームワークを提供する。この作業に伴うコードはhttps://github.com/simplew4y/VeritasFi.git.comで公開されている。

論文の概要: VeritasFi: An Adaptable, Multi-tiered RAG Framework for Multi-modal Financial Question Answering

関連論文リスト