Fugu-MT 論文翻訳(概要): TULIP: Adapting Open-Source Large Language Models for Underrepresented Languages and Specialized Financial Tasks

論文の概要: TULIP: Adapting Open-Source Large Language Models for Underrepresented Languages and Specialized Financial Tasks

arxiv url: http://arxiv.org/abs/2508.16243v1
Date: Fri, 22 Aug 2025 09:23:10 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-25 16:42:36.336224
Title: TULIP: Adapting Open-Source Large Language Models for Underrepresented Languages and Specialized Financial Tasks
Title（参考訳）: TULIP: 未表現言語へのオープンソースの大規模言語モデルの適用と財務業務の特化
Authors: İrem Demirtaş, Burak Payzun, Seçil Arslan,
Abstract要約: Llama 3.1 8B と Qwen 2.5 7B をドメインおよび言語適応に適用する T モデルを提案する。 5段階の開発パイプラインには、データ収集、継続的な事前トレーニング、ベンチマーク設計、合成データ生成、教師付き微調整が含まれる。
参考スコア（独自算出の注目度）: 0.19116784879310023
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Thanks to the growing popularity of large language models over the years, there is great potential for their applications in finance. Despite the exceptional performance of larger proprietary models, which are presented as black-box solutions through APIs, smaller models that can be hosted on-premise present opportunities for adaptability and privacy. Especially in cases where the management of sensitive information and application of domain knowledge is important, like finance, enhancing the capabilities of smaller models becomes crucial, notably for underrepresented languages. In this work, we introduce TULIP models, which adapt Llama 3.1 8B and Qwen 2.5 7B for domain and language adaptation, focusing on financial Turkish use cases. The five-stage development pipeline involves data collection, continual pre-training (CPT), benchmark design, synthetic data generation and supervised fine-tuning (SFT). The results show that the capabilities of the models can be enhanced to effectively accomplish targeted tasks in this specific domain and language.
Abstract（参考訳）: 長年にわたって大きな言語モデルの人気が高まってきたため、金融分野での彼らの応用には大きな可能性がある。 APIを通じてブラックボックスソリューションとして提示される、大規模なプロプライエタリなモデルの例外的なパフォーマンスにもかかわらず、オンプレミスでホスト可能な小さなモデルは、適応性とプライバシの機会を提供する。特に、金融など、機密情報の管理やドメイン知識の適用が重要である場合、特に表現不足言語において、より小さなモデルの能力を高めることが重要となる。本研究では,Llama 3.1 8B と Qwen 2.5 7B をドメインおよび言語適応に適用した TULIP モデルを提案する。 5段階の開発パイプラインには、データ収集、継続事前トレーニング(CPT)、ベンチマーク設計、合成データ生成、教師付き微調整(SFT)が含まれる。その結果、この特定のドメインと言語におけるターゲットタスクを効果的に達成するために、モデルの能力を拡張できることが判明した。

論文の概要: TULIP: Adapting Open-Source Large Language Models for Underrepresented Languages and Specialized Financial Tasks

関連論文リスト