Fugu-MT 論文翻訳(概要): Arrows of Math Reasoning Data Synthesis for Large Language Models: Diversity, Complexity and Correctness

論文の概要: Arrows of Math Reasoning Data Synthesis for Large Language Models: Diversity, Complexity and Correctness

arxiv url: http://arxiv.org/abs/2508.18824v1
Date: Tue, 26 Aug 2025 09:01:50 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-27 17:42:38.767245
Title: Arrows of Math Reasoning Data Synthesis for Large Language Models: Diversity, Complexity and Correctness
Title（参考訳）: 大規模言語モデルのための数学推論データの合成:多様性、複雑さ、正確性
Authors: Sirui Chen, Changxin Tian, Binbin Hu, Kunlong Chen, Ziqi Liu, Zhiqiang Zhang, Jun Zhou,
Abstract要約: 本稿では,多様性,複雑性,正確性を保証した高品質な数学的コーパスを生成するプログラム支援合成フレームワークを提案する。このフレームワークは、数学的知識システムとドメイン固有のツールを統合して実行可能なプログラムを作成する。我々は1230万の問題を解決した。
参考スコア（独自算出の注目度）: 37.03639650111288
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Enhancing the mathematical reasoning of large language models (LLMs) demands high-quality training data, yet conventional methods face critical challenges in scalability, cost, and data reliability. To address these limitations, we propose a novel program-assisted synthesis framework that systematically generates a high-quality mathematical corpus with guaranteed diversity, complexity, and correctness. This framework integrates mathematical knowledge systems and domain-specific tools to create executable programs. These programs are then translated into natural language problem-solution pairs and vetted by a bilateral validation mechanism that verifies solution correctness against program outputs and ensures program-problem consistency. We have generated 12.3 million such problem-solving triples. Experiments demonstrate that models fine-tuned on our data significantly improve their inference capabilities, achieving state-of-the-art performance on several benchmark datasets and showcasing the effectiveness of our synthesis approach.
Abstract（参考訳）: 大規模言語モデル(LLM)の数学的推論を強化するには、高品質なトレーニングデータが必要であるが、従来の手法はスケーラビリティ、コスト、データの信頼性において重要な課題に直面している。これらの制約に対処するために,多様性,複雑性,正確性を保証した高品質な数学的コーパスを体系的に生成する,プログラム支援型合成フレームワークを提案する。このフレームワークは、数学的知識システムとドメイン固有のツールを統合して実行可能なプログラムを作成する。これらのプログラムは、自然言語の問題解決ペアに変換され、プログラム出力に対する解の正当性を検証し、プログラムプロブレムの整合性を保証する二元検証メカニズムによって検証される。我々は1230万の問題を解決した。実験により、データに基づいて微調整されたモデルは、推論能力を大幅に向上し、いくつかのベンチマークデータセット上で最先端のパフォーマンスを達成し、合成アプローチの有効性を示す。

論文の概要: Arrows of Math Reasoning Data Synthesis for Large Language Models: Diversity, Complexity and Correctness

関連論文リスト