Fugu-MT 論文翻訳(概要): MetaAgent-X : Breaking the Ceiling of Automatic Multi-Agent Systems via End-to-End Reinforcement Learning

論文の概要: MetaAgent-X : Breaking the Ceiling of Automatic Multi-Agent Systems via End-to-End Reinforcement Learning

arxiv url: http://arxiv.org/abs/2605.14212v1
Date: Thu, 14 May 2026 00:11:27 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-15 21:45:34.547942
Title: MetaAgent-X : Breaking the Ceiling of Automatic Multi-Agent Systems via End-to-End Reinforcement Learning
Title（参考訳）: MetaAgent-X : エンドツーエンド強化学習による自動マルチエージェントシステムのシーリング
Authors: Yaolun Zhang, Yujie Zhao, Nan Wang, Yiran Wu, Jiayu Chang, Yizhao Chen, Qingyun Wu, Jishen Zhao, Huazheng Wang,
Abstract要約: 自動MAS設計と実行を協調的に最適化する,エンドツーエンドの強化学習フレームワークであるMetaAgent-Xを紹介する。 MetaAgent-Xは既存のMASベースラインを一貫して上回り、最大21.7%のアップを達成した。これらの結果は、自己設計および自己実行エージェントモデルを構築するための実践パラダイムとして、エンドツーエンドのトレーニング可能な自動MASを確立する。
参考スコア（独自算出の注目度）: 26.35314976395182
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Automatic multi-agent systems aim to instantiate agent workflows without relying on manually designed or fixed orchestration. However, existing automatic MAS approaches remain only partially adaptive: they either perform training-free test-time search or optimize the meta-level designer while keeping downstream execution agents frozen, which creating a frozen-executor ceiling and leaving the end-to-end training of self-designing and self-executing agentic models unexplored. To address this, we introduce MetaAgent-X, an end-to-end reinforcement learning framework that jointly optimizes automatic MAS design and execution. MetaAgent-X enables script-based MAS generation, execution rollout collection, and credit assignment for both designer and executor trajectories. To support stable and scalable optimization, we propose Executor Designer Hierarchical Rollout and Stagewise Co-evolution to improve training stability and expose the dynamics of designer-executor co-evolution. MetaAgent-X consistently outperforms existing automatic MAS baselines, achieving up to 21.7% gains. Comprehensive ablations show that both designer and executor improve throughout training, and that effective automatic MAS learning follows a stagewise co-evolution process. These results establish end-to-end trainable automatic MAS as a practical paradigm for building self-designing and self-executing agentic models.
Abstract（参考訳）: 自動マルチエージェントシステムは、手動で設計または固定されたオーケストレーションに頼ることなく、エージェントワークフローをインスタンス化する。しかし、既存の自動MASアプローチは、トレーニング不要なテストタイムサーチを実行するか、下流の実行エージェントを凍結させながらメタレベルのデザイナを最適化する。そこで本稿では,自動MAS設計と実行を協調的に最適化する,エンドツーエンドの強化学習フレームワークであるMetaAgent-Xを紹介する。 MetaAgent-Xは、スクリプトベースのMAS生成、実行ロールアウトコレクション、デザイナとエグゼクタの両方に対するクレジット割り当てを可能にする。安定かつスケーラブルな最適化を支援するために,設計者/実行者共同進化のダイナミクスを明らかにするために,実行者設計者の階層的ロールアウトと段階的共進化を提案する。 MetaAgent-Xは既存のMASベースラインを一貫して上回り、21.7%のゲインを達成している。総合的な説明は、デザイナーと実行担当者の両方がトレーニングを通して改善し、効果的な自動MAS学習は段階的に共進化プロセスに従うことを示している。これらの結果は、自己設計および自己実行エージェントモデルを構築するための実践パラダイムとして、エンドツーエンドのトレーニング可能な自動MASを確立する。

論文の概要: MetaAgent-X : Breaking the Ceiling of Automatic Multi-Agent Systems via End-to-End Reinforcement Learning

関連論文リスト