Fugu-MT 論文翻訳(概要): Scalable Environments Drive Generalizable Agents

論文の概要: Scalable Environments Drive Generalizable Agents

arxiv url: http://arxiv.org/abs/2605.18181v1
Date: Mon, 18 May 2026 10:24:05 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-19 17:57:49.394588
Title: Scalable Environments Drive Generalizable Agents
Title（参考訳）: スケーラブル環境は汎用エージェントを駆動する
Authors: Jiayi Zhang, Fanqi Kong, Guibin Zhang, Maojia Song, Zhaoyang Yu, Jianhao Ruan, Jinyu Xiang, Bang Liu, Chenglin Wu, Yuyu Luo,
Abstract要約: 一般化可能なエージェントは、トレーニングディストリビューションを超えて、多様なタスクや見つからない環境に適応する必要がある。現在のスケーリングプラクティスは、より経験やより広範なタスクを、一定のインタラクションルールの下で収集することに集中しています。本稿では,主要な成果物によるトラジェクティブスケーリング,タスクスケーリング,環境スケーリングを分離した統合分類法を提案する。
参考スコア（独自算出の注目度）: 42.76199937252846
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Generalizable agents should adapt to diverse tasks and unseen environments beyond their training distribution. This position paper argues that such generalization requires environment scaling: expanding the distribution of executable rule-sets that agents interact with, rather than only increasing trajectories or tasks within fixed benchmarks. Current scaling practices largely focus on collecting more experience or broader task sets under fixed interaction rules, leaving agents brittle when underlying interfaces, dynamics, observations, or feedback signals change. The core challenge is therefore a world-level distribution shift: agents need systematic exposure to environments with meaningfully different executable rule-sets. To clarify this challenge, we propose a unified taxonomy that separates trajectory scaling, task scaling, and environment scaling by their primary deliverables and by what changes in the executable rule-set. Building on this taxonomy, we synthesize construction paradigms for scalable environments, contrasting programmatic generators that prioritize controllability and verifiability with generative world models that offer broader coverage and open-endedness. We further outline how environment scaling can be coupled with stateful learning mechanisms, emphasizing learned update rules for cross-environment adaptation. We conclude by discussing alternative perspectives and argue that scalable environments provide the essential substrate for measurable and controllable progress toward robust general agents.
Abstract（参考訳）: 一般化可能なエージェントは、トレーニングディストリビューションを超えて、多様なタスクや見つからない環境に適応する必要がある。固定ベンチマーク内での軌道やタスクの増加だけでなく、エージェントが相互作用する実行可能なルールセットの分布の拡大である。現在のスケーリングプラクティスは,インターフェースやダイナミクス,観察,あるいはフィードバック信号の変更によってエージェントが不安定になるような,より経験やより広範なタスクセットの収集に重点を置いています。エージェントは、有意義に異なる実行可能なルールセットを持つ環境に、体系的に露出する必要がある。この課題を明らかにするため、我々は、軌道スケーリング、タスクスケーリング、環境スケーリングを主要な成果物と実行可能なルールセットの変化によって分離する統合分類法を提案する。この分類に基づいて、我々はスケーラブルな環境の構築パラダイムを合成し、制御性と検証性を優先するプログラム的ジェネレータと、より広範なカバレッジとオープンディペンデンスを提供する生成的世界モデルとを対比する。さらに、環境スケーリングがステートフルな学習メカニズムとどのように結合するかを概説し、クロス環境適応のための学習された更新ルールを強調した。我々は、代替的な視点について議論し、スケーラブルな環境は、堅牢な汎用エージェントに対する測定可能かつ制御可能な進歩に不可欠な基盤を提供する、と論じる。

論文の概要: Scalable Environments Drive Generalizable Agents

関連論文リスト