Fugu-MT 論文翻訳(概要): Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

論文の概要: Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

arxiv url: http://arxiv.org/abs/2604.18292v1
Date: Mon, 20 Apr 2026 14:01:10 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-21 21:52:52.919756
Title: Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence
Title（参考訳）: エージェントワールド:汎用エージェントインテリジェンスを進化させるための実世界環境のスケーリング
Authors: Guanting Dong, Junting Lu, Junjie Huang, Wanjun Zhong, Longxiang Liu, Shijue Huang, Zhenyu Li, Yang Zhao, Xiaoshuai Song, Xiaoxi Li, Jiajie Jin, Yutao Zhu, Hanbin Wang, Fangyu Lei, Qinyu Luo, Mingyang Chen, Zehui Chen, Jiazhan Feng, Ji-Rong Wen, Zhicheng Dou,
Abstract要約: textbfAgent-Worldは、スケーラブルな環境を通じて汎用エージェントインテリジェンスを前進させるための自己進化型トレーニングアリーナである。 Agent-World-8Bと14Bは、強力なプロプライエタリなモデルと環境スケーリングベースラインを一貫して上回っている。
参考スコア（独自算出の注目度）: 122.8626256816085
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large language models are increasingly expected to serve as general-purpose agents that interact with external, stateful tool environments. The Model Context Protocol (MCP) and broader agent skills offer a unified interface for connecting agents with scalable real-world services, but training robust agents remains limited by the lack of realistic environments and principled mechanisms for life-long learning. In this paper, we present \textbf{Agent-World}, a self-evolving training arena for advancing general agent intelligence through scalable environments. Agent-World has two main components: (1) Agentic Environment-Task Discovery, which autonomously explores topic-aligned databases and executable tool ecosystems from thousands of real-world environment themes and synthesizes verifiable tasks with controllable difficulty; and (2) Continuous Self-Evolving Agent Training, which combines multi-environment reinforcement learning with a self-evolving agent arena that automatically identifies capability gaps through dynamic task synthesis and drives targeted learning, enabling the co-evolution of agent policies and environments. Across 23 challenging agent benchmarks, Agent-World-8B and 14B consistently outperforms strong proprietary models and environment scaling baselines. Further analyses reveal scaling trends in relation to environment diversity and self-evolution rounds, offering insights for building general agent intelligence.
Abstract（参考訳）: 大規模言語モデルは、外部のステートフルなツール環境と対話する汎用エージェントとして、ますます期待されている。 Model Context Protocol(MCP)とより広範なエージェントスキルは、エージェントをスケーラブルな現実世界のサービスに接続するための統一されたインターフェースを提供するが、現実的な環境の欠如と生涯学習の原則的なメカニズムによって、堅牢なエージェントのトレーニングは制限されている。本稿では、スケーラブルな環境を通した汎用エージェントインテリジェンスを促進するための自己進化型トレーニングアリーナである「textbf{Agent-World}」を紹介する。エージェント・ワールドには2つの主要な要素がある: 1) エージェント環境-タスク発見(Agentic Environment-Task Discovery) エージェント環境-タスク発見(Agentic Environment-Task Discovery) エージェント環境-タスク発見(Agentic Environment-Task Discovery) 数千の現実世界環境からトピック整合したデータベースと実行可能なツールエコシステムを自律的に探索し、検証可能なタスクを制御可能な難易度で合成し、(2) 多環境強化学習と自己進化エージェントアリーナを組み合わせ、動的タスク合成を通じて機能ギャップを自動的に識別し、ターゲット学習を推進し、エージェントポリシーと環境の共進化を可能にする。 23の挑戦的なエージェントベンチマーク、Agent-World-8Bと14Bは、強力なプロプライエタリなモデルと環境スケーリングベースラインを一貫して上回っている。さらなる分析では、環境多様性と自己進化ラウンドに関連するスケーリングトレンドを明らかにし、汎用エージェントインテリジェンスを構築するための洞察を提供する。

論文の概要: Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

関連論文リスト