Fugu-MT 論文翻訳(概要): Steerable Instruction Following Coding Data Synthesis with Actor-Parametric Schema Co-Evolution

論文の概要: Steerable Instruction Following Coding Data Synthesis with Actor-Parametric Schema Co-Evolution

arxiv url: http://arxiv.org/abs/2604.16322v1
Date: Fri, 27 Feb 2026 21:31:41 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-04 02:32:13.884557
Title: Steerable Instruction Following Coding Data Synthesis with Actor-Parametric Schema Co-Evolution
Title（参考訳）: Actor-Parametric Schema共進化による符号化データ合成の安定的指導
Authors: Tinglin Huang, Bo Chen, Xiao Zhang, Kai Shen, Rex Ying,
Abstract要約: IFCodeEvolveは,大規模命令ペアプログラミングデータを合成するフレームワークである。 IFCodeEvolveはベースモデルの性能を大幅に向上させ,我々の32BモデルはプロプライエタリなSOTAモデルと同等であることを示す。我々はまた、ソリューションと堅牢なASTベースの検証を備えた総合的人間認証ベンチマークIFCodeBenchにも貢献する。
参考スコア（独自算出の注目度）: 30.919665619254
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Interpreting and following human instructions is a critical capability of large language models (LLMs) in automatic programming. However, synthesizing large-scale instruction-paired coding data remains largely unexplored and is particularly challenging when ensuring logical compatibility among multiple constraints. In this study, we propose IFCodeEvolve, an actor-schema co-evolution framework for instruction following coding data generation. By representing instructions as parametric function schema, we construct a library that covers the vast instruction space via dynamic constraint instantiation. Building upon this, Monte Carlo Tree Search (MCTS) sampler is applied to efficiently navigate this space, utilizing actor model feedback as a dynamic termination signal. Furthermore, to progressively explore challenging problems, we introduce a co-evolving paradigm that iteratively advances both the actor model and the schema library, via schema composition and mutation, based on sampler statistics. Empirical results demonstrate that IFCodeEvolve significantly boosts base model performance, with our 32B model achieving parity with proprietary SOTA models. Additionally, we contribute IFCodeBench, a comprehensive human-verified benchmark equipped with solutions and robust AST-based verification.
Abstract（参考訳）: 人間の指示を解釈し従うことは、自動プログラミングにおける大規模言語モデル(LLM)の重要な能力である。しかし、大規模命令対ペア符号化データの合成は未解決のままであり、複数の制約間の論理的整合性を確保することは特に困難である。本研究では,アクタ-スキーマ共進化フレームワークIFCodeEvolveを提案する。命令をパラメトリック関数スキーマとして表現することにより、動的制約インスタンス化によって広大な命令空間をカバーするライブラリを構築する。これに基づいて、モンテカルロ木探索(MCTS)サンプリング装置を用いて、アクターモデルフィードバックを動的終端信号として利用して、この空間を効率的にナビゲートする。さらに,課題を段階的に探求するために,サンプル統計に基づくスキーマ構成と突然変異を用いてアクターモデルとスキーマライブラリの両方を反復的に進化させる共進化パラダイムを導入する。その結果、IFCodeEvolveはベースモデルの性能を大幅に向上させ、我々の32BモデルはプロプライエタリなSOTAモデルと同等であることがわかった。さらに、ソリューションと堅牢なASTベースの検証を備えた総合的人間検証ベンチマークIFCodeBenchを寄贈する。

論文の概要: Steerable Instruction Following Coding Data Synthesis with Actor-Parametric Schema Co-Evolution

関連論文リスト