Fugu-MT 論文翻訳(概要): MathAgent: Adversarial Evolution of Constraint Graphs for Mathematical Reasoning Data Synthesis

論文の概要: MathAgent: Adversarial Evolution of Constraint Graphs for Mathematical Reasoning Data Synthesis

arxiv url: http://arxiv.org/abs/2604.11188v1
Date: Mon, 13 Apr 2026 08:48:12 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-14 20:13:16.437185
Title: MathAgent: Adversarial Evolution of Constraint Graphs for Mathematical Reasoning Data Synthesis
Title（参考訳）: MathAgent: 数学的推論データ合成のための制約グラフの逆進化
Authors: Zixiong Yu, Jun Rao, Guhan Chen, Songtao Tian, Bohan Li, Jiansheng Wei, Min Zhang, Xiaojun Meng,
Abstract要約: 本稿では、教師なし最適化問題としてデータ合成を定式化する階層型フレームワークを提案する。立法者は、問題の制約をコードする構造化された世代図を逆向きに進化させ、執行者はこれらの仕様をさまざまな自然言語シナリオにインスタンス化する。 Qwen, Llama, Mistral, Gemmaの各シリーズの合計10モデルを用いて行った実験により, 本手法が顕著な結果が得られることを示した。
参考スコア（独自算出の注目度）: 26.328617109421327
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Synthesizing high-quality mathematical reasoning data without human priors remains a significant challenge. Current approaches typically rely on seed data mutation or simple prompt engineering, often suffering from mode collapse and limited logical complexity. This paper proposes a hierarchical synthesis framework that formulates data synthesis as an unsupervised optimization problem over a constraint graph followed by semantic instantiation, rather than treating it as a direct text generation task. We introduce a Legislator-Executor paradigm: The Legislator adversarially evolves structured generation blueprints encoding the constraints of the problem, while the Executor instantiates these specifications into diverse natural language scenarios. This decoupling of skeleton design from linguistic realization enables a prioritized focus on constructing complex and diverse logical structures, thereby guiding high-quality data synthesis. Experiments conducted on a total of 10 models across the Qwen, Llama, Mistral, and Gemma series demonstrate that our method achieves notable results: models fine-tuned on 1K synthesized samples outperform widely-used datasets of comparable scale (LIMO, s1K) across eight mathematical benchmarks, exhibiting superior out-of-distribution generalization.
Abstract（参考訳）: 人間の先入観のない高品質な数学的推論データを合成することは、依然として大きな課題である。現在のアプローチは通常、シードデータ変異や単純なプロンプトエンジニアリングに依存しており、しばしばモード崩壊と限定的な論理的複雑さに悩まされている。本稿では,データ合成を直接テキスト生成タスクとして扱うのではなく,制約グラフに続きセマンティックインスタンス化による教師なし最適化問題として定式化する階層型合成フレームワークを提案する。立法者は、問題の制約をコードする構造化された世代図を逆向きに進化させ、執行者はこれらの仕様をさまざまな自然言語シナリオにインスタンス化する。この言語的実現からスケルトン設計を分離することで、複雑で多様な論理構造の構築に重点を置き、高品質なデータ合成を導くことができる。 Qwen, Llama, Mistral, Gemma シリーズの合計10モデルを用いて行った実験により,本手法は顕著な結果が得られた。

論文の概要: MathAgent: Adversarial Evolution of Constraint Graphs for Mathematical Reasoning Data Synthesis

関連論文リスト