Fugu-MT 論文翻訳(概要): Compliance-to-Code: Enhancing Financial Compliance Checking via Code Generation

論文の概要: Compliance-to-Code: Enhancing Financial Compliance Checking via Code Generation

arxiv url: http://arxiv.org/abs/2505.19804v2
Date: Mon, 09 Jun 2025 07:23:25 GMT
ステータス: 翻訳完了
システム内更新日: 2025-06-10 21:10:47.003531
Title: Compliance-to-Code: Enhancing Financial Compliance Checking via Code Generation
Title（参考訳）: コンプライアンス・トゥ・コード:コード生成による財務コンプライアンスチェックの強化
Authors: Siyuan Li, Jian Chen, Rui Yao, Xuming Hu, Peilin Zhou, Weihua Qiu, Simin Zhang, Chucheng Dong, Zhiyao Li, Qipeng Xie, Zixuan Yuan,
Abstract要約: 金融規制コンプライアンスに特化した中国初の大規模データセットであるCompliance-to-Codeを提示する。 10のカテゴリにわたる361の規則から1,159の注釈付き節をカバーし、各節は規則関係とともに4つの論理的要素、条件、制約、文脈情報からなるモジュール構造になっている。自動監査を容易にするために,決定論的Pythonコードマッピング,詳細なコード推論,コード説明を提供する。
参考スコア（独自算出の注目度）: 36.166087396386445
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Nowadays, regulatory compliance has become a cornerstone of corporate governance, ensuring adherence to systematic legal frameworks. At its core, financial regulations often comprise highly intricate provisions, layered logical structures, and numerous exceptions, which inevitably result in labor-intensive or comprehension challenges. To mitigate this, recent Regulatory Technology (RegTech) and Large Language Models (LLMs) have gained significant attention in automating the conversion of regulatory text into executable compliance logic. However, their performance remains suboptimal particularly when applied to Chinese-language financial regulations, due to three key limitations: (1) incomplete domain-specific knowledge representation, (2) insufficient hierarchical reasoning capabilities, and (3) failure to maintain temporal and logical coherence. One promising solution is to develop a domain specific and code-oriented datasets for model training. Existing datasets such as LexGLUE, LegalBench, and CODE-ACCORD are often English-focused, domain-mismatched, or lack fine-grained granularity for compliance code generation. To fill these gaps, we present Compliance-to-Code, the first large-scale Chinese dataset dedicated to financial regulatory compliance. Covering 1,159 annotated clauses from 361 regulations across ten categories, each clause is modularly structured with four logical elements-subject, condition, constraint, and contextual information-along with regulation relations. We provide deterministic Python code mappings, detailed code reasoning, and code explanations to facilitate automated auditing. To demonstrate utility, we present FinCheck: a pipeline for regulation structuring, code generation, and report generation.
Abstract（参考訳）: 今日では、規制の遵守が企業統治の基盤となり、体系的な法的枠組みの遵守が保証されている。金融規制の中核は、しばしば高度に複雑な規定、階層化された論理構造、そして多くの例外で構成されており、必然的に労働集約的または理解的課題をもたらす。これを軽減するために、最近のRegTechとLarge Language Models(LLMs)は、規制テキストを実行可能なコンプライアンスロジックに変換する自動化において、大きな注目を集めている。しかし,1) ドメイン固有知識表現の不完全化,(2) 階層的推論能力の不足,(3) 時間的・論理的一貫性の維持に失敗したこと,の3つの主要な制約により,中国語の金融規制に適用された場合,その性能は依然として最適以下である。有望な解決策の1つは、モデルトレーニングのためのドメイン固有かつコード指向のデータセットを開発することである。 LexGLUE、LegalBench、CODE-ACCORDといった既存のデータセットは、しばしば英語にフォーカスし、ドメインミスマッチし、コンプライアンスコード生成のきめ細かい粒度を欠いている。これらのギャップを埋めるために、金融規制コンプライアンスに特化した中国初の大規模データセットであるCompliance-to-Codeを紹介します。 10のカテゴリにわたる361の規則から1,159の注釈付き節をカバーし、各節は規則関係とともに4つの論理的要素、条件、制約、文脈情報からなるモジュール構造になっている。自動監査を容易にするために,決定論的Pythonコードマッピング,詳細なコード推論,コード説明を提供する。実用性を示すために、FinCheck: 構造化、コード生成、レポート生成のためのパイプラインを提供する。

論文の概要: Compliance-to-Code: Enhancing Financial Compliance Checking via Code Generation

関連論文リスト