Fugu-MT 論文翻訳(概要): $Agent^2$: An Agent-Generates-Agent Framework for Reinforcement Learning Automation

論文の概要: $Agent^2$: An Agent-Generates-Agent Framework for Reinforcement Learning Automation

arxiv url: http://arxiv.org/abs/2509.13368v2
Date: Tue, 30 Sep 2025 05:06:12 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-01 17:09:04.108813
Title: $Agent^2$: An Agent-Generates-Agent Framework for Reinforcement Learning Automation
Title（参考訳）: $Agent^2$:強化学習自動化のためのエージェント生成エージェントフレームワーク
Authors: Yuan Wei, Xiaohan Shan, Ran Miao, Jianmin Li,
Abstract要約: 強化学習(RL)エージェント開発は伝統的にかなりの専門知識と反復的な努力を必要とする。本稿では,完全自動RLエージェント設計のための LLM 駆動型エージェント生成エージェントフレームワークである Agent$2$ を紹介する。 Agent$2$は、自然言語のタスク記述と環境コードを人間の介入なしに実行可能なRLソリューションに変換する。
参考スコア（独自算出の注目度）: 5.325886106098561
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Reinforcement learning (RL) agent development traditionally requires substantial expertise and iterative effort, often leading to high failure rates and limited accessibility. This paper introduces Agent$^2$, an LLM-driven agent-generates-agent framework for fully automated RL agent design. Agent$^2$ autonomously translates natural language task descriptions and environment code into executable RL solutions without human intervention. The framework adopts a dual-agent architecture: a Generator Agent that analyzes tasks and designs agents, and a Target Agent that is automatically generated and executed. To better support automation, RL development is decomposed into two stages, MDP modeling and algorithmic optimization, facilitating targeted and effective agent generation. Built on the Model Context Protocol, Agent$^2$ provides a unified framework for standardized agent creation across diverse environments and algorithms, incorporating adaptive training management and intelligent feedback analysis for continuous refinement. Extensive experiments on benchmarks including MuJoCo, MetaDrive, MPE, and SMAC show that Agent$^2$ outperforms manually designed baselines across all tasks, achieving up to 55\% performance improvement with consistent average gains. By enabling a closed-loop, end-to-end automation pipeline, this work advances a new paradigm in which agents can design and optimize other agents, underscoring the potential of agent-generates-agent systems for automated AI development.
Abstract（参考訳）: 強化学習(RL)エージェント開発は伝統的にかなりの専門知識と反復的な努力を必要とし、しばしば高い失敗率と限られたアクセシビリティをもたらす。本稿では,完全自動RLエージェント設計のための LLM 駆動エージェント生成エージェントフレームワークである Agent$2$ を紹介する。 Agent$2$は、自然言語のタスク記述と環境コードを人間の介入なしに実行可能なRLソリューションに変換する。このフレームワークでは、タスクを分析してエージェントを設計するGenerator Agentと、自動生成して実行されるTarget Agentという、デュアルエージェントアーキテクチャを採用している。自動化を向上するために、RL開発はMDPモデリングとアルゴリズム最適化という2つの段階に分けられ、ターゲットと効果的なエージェント生成を容易にする。 Model Context Protocolをベースに構築されたAgens$^2$は、さまざまな環境やアルゴリズムにわたる標準化されたエージェント作成のための統一されたフレームワークを提供し、適応的なトレーニング管理と、継続的改善のためのインテリジェントなフィードバック分析を取り入れている。 MuJoCo、MetaDrive、MPE、SMACなどのベンチマークに関する大規模な実験は、Agent$^2$がすべてのタスクで手動で設計されたベースラインより優れており、一貫した平均ゲインで最大55倍のパフォーマンス改善を実現していることを示している。クローズドループでエンドツーエンドの自動化パイプラインを有効にすることで、エージェントが他のエージェントを設計、最適化できる新しいパラダイムを推進し、エージェント生成エージェントシステムのAI自動開発の可能性を強調します。

論文の概要: $Agent^2$: An Agent-Generates-Agent Framework for Reinforcement Learning Automation

関連論文リスト