Fugu-MT 論文翻訳(概要): Beyond Quantity: Trajectory Diversity Scaling for Code Agents

論文の概要: Beyond Quantity: Trajectory Diversity Scaling for Code Agents

arxiv url: http://arxiv.org/abs/2602.03219v1
Date: Tue, 03 Feb 2026 07:43:03 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-04 18:37:15.315938
Title: Beyond Quantity: Trajectory Diversity Scaling for Code Agents
Title（参考訳）: 量を超える: コードエージェントのための軌道上の多様性のスケーリング
Authors: Guhong Chen, Chenghao Sun, Cheng Fu, Qiyao Wang, Zhihong Huang, Chaopeng Wei, Guangxu Chen, Feiteng Fang, Ahmadreza Argha, Bing Zhao, Xander Xu, Qi Han, Hamid Alinejad-Rokny, Qiang Qu, Binhua Li, Shiwen Ni, Min Yang, Hu Wei, Yongbin Li,
Abstract要約: Trajectory Diversity Scalingは、コードエージェントのためのデータ合成フレームワークである。 TDScalingは、(1)実際のサービスの論理的依存関係をキャプチャするBusiness Clusterメカニズム、(2)軌道コヒーレンスを強制するブループリント駆動のマルチエージェントパラダイム、(3)ロングテールシナリオを指向する適応的な進化メカニズムの4つの革新を統合しています。
参考スコア（独自算出の注目度）: 51.71414642763219
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: As code large language models (LLMs) evolve into tool-interactive agents via the Model Context Protocol (MCP), their generalization is increasingly limited by low-quality synthetic data and the diminishing returns of quantity scaling. Moreover, quantity-centric scaling exhibits an early bottleneck that underutilizes trajectory data. We propose TDScaling, a Trajectory Diversity Scaling-based data synthesis framework for code agents that scales performance through diversity rather than raw volume. Under a fixed training budget, increasing trajectory diversity yields larger gains than adding more trajectories, improving the performance-cost trade-off for agent training. TDScaling integrates four innovations: (1) a Business Cluster mechanism that captures real-service logical dependencies; (2) a blueprint-driven multi-agent paradigm that enforces trajectory coherence; (3) an adaptive evolution mechanism that steers synthesis toward long-tail scenarios using Domain Entropy, Reasoning Mode Entropy, and Cumulative Action Complexity to prevent mode collapse; and (4) a sandboxed code tool that mitigates catastrophic forgetting of intrinsic coding capabilities. Experiments on general tool-use benchmarks (BFCL, tau^2-Bench) and code agent tasks (RebenchT, CodeCI, BIRD) demonstrate a win-win outcome: TDScaling improves both tool-use generalization and inherent coding proficiency. We plan to release the full codebase and the synthesized dataset (including 30,000+ tool clusters) upon publication.
Abstract（参考訳）: コード大言語モデル (LLM) が Model Context Protocol (MCP) を介してツール間エージェントへと進化するにつれて、それらの一般化は低品質な合成データと量スケーリングのリターンの低下によってますます制限される。さらに、量中心のスケーリングは、トラジェクトリデータを不活用する初期のボトルネックを示す。本稿では,TDScalingを提案する。TDScalingは,生のボリュームではなく,多様性を通じてパフォーマンスをスケールするコードエージェントのための,トラジェクティブ・ダイバーシティ・スケーリングに基づくデータ合成フレームワークである。一定の訓練予算の下では、軌道の多様性の増大は、より多くの軌道を追加するよりも大きな利得をもたらし、エージェントトレーニングのパフォーマンスコストのトレードオフを改善する。 TDScalingは、(1)実際のサービスの論理的依存関係をキャプチャするBusiness Clusterメカニズム、(2)軌道コヒーレンスを強制するブループリント駆動のマルチエージェントパラダイム、(3)ドメインのエントロピー、推論モードのエントロピー、および累積アクションの複雑さによって、モード崩壊を防止し、(4)固有のコーディング機能の破滅的な忘れを緩和するサンドボックスコードツール、の4つのイノベーションを統合している。一般的なツール利用ベンチマーク(BFCL, tau^2-Bench)とコードエージェントタスク(RebenchT, CodeCI, BIRD)の実験では、勝利が示されている。完全なコードベースと合成データセット(30,000以上のツールクラスタを含む)を公開時にリリースする予定です。

論文の概要: Beyond Quantity: Trajectory Diversity Scaling for Code Agents

関連論文リスト