Fugu-MT Paper Translation (Summary): Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

Paper summary: Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

arxiv url: http://arxiv.org/abs/2604.22748v1
Date: Fri, 24 Apr 2026 17:48:47 GMT
Status: Translation complete
Last updated in system: 2026-04-27 15:36:26.549007
Title: Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond
Title (reference translation): Agentic World Modeling: Foundations, Capabilities, Laws, and More
Authors: Meng Chu, Xuan Billy Zhang, Kevin Qinghong Lin, Lingdong Kong, Jize Zhang, Teng Tu, Weijian Ma, Ziqi Huang, Senqiao Yang, Wei Huang, Yeying Jin, Zhefan Rao, Jinhui Ye, Xinyu Lin, Xichen Zhang, Qisheng Hu, Shuai Yang, Leyang Shen, Wei Chow, Yifei Dong, Fengyi Wu, Quanyu Long, Bin Xia, Shaozuo Yu, Mingkang Zhu, Wenhu Zhang, Jiehui Huang, Haokun Gui, Haoxuan Che, Long Chen, Qifeng Chen, Wenxuan Zhang, Wenya Wang, Xiaojuan Qi, Yang Deng, Yanwei Li, Mike Zheng Shou, Zhi-Qi Cheng, See-Kiong Ng, Ziwei Liu, Philip Torr, Jiaya Jia
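Abstract summary: We introduce a "levels x laws" taxonomy organized along two axes. The first defines three capability levels: L1 Predictor, which learns one-step local transition operators; L2 Simulator, which composes them into multi-step, action-conditioned rollouts that respect domain laws; and L3 Evolver, which autonomously revises its own model when predictions fail against new evidence. We synthesize over 400 works and summarize more than 100 representative systems spanning model-based reinforcement learning, video generation, web and GUI agents, multi-agent social simulation, and AI-driven scientific discovery.
Reference score (independently computed attention score): 209.35045331678043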
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: As AI systems move from generating text to accomplishing goals through sustained interaction, the ability to model environment dynamics becomes a central bottleneck. Agents that manipulate objects, navigate software, coordinate with others, or design experiments require predictive environment models, yet the term world model carries different meanings across research communities. We introduce a "levels x laws" taxonomy organized along two axes. The first defines three capability levels: L1 Predictor, which learns one-step local transition operators; L2 Simulator, which composes them into multi-step, action-conditioned rollouts that respect domain laws; and L3 Evolver, which autonomously revises its own model when predictions fail against new evidence. The second identifies four governing-law regimes: physical, digital, social, and scientific. These regimes determine what constraints a world model must satisfy and where it is most likely to fail. Using this framework, we synthesize over 400 works and summarize more than 100 representative systems spanning model-based reinforcement learning, video generation, web and GUI agents, multi-agent social simulation, and AI-driven scientific discovery. We analyze methods, failure modes, and evaluation practices across level-regime pairs, propose decision-centric evaluation principles and a minimal reproducible evaluation package, and outline architectural guidance, open problems, and governance challenges. The resulting roadmap connects previously isolated communities and charts a path from passive next-step prediction toward world models that can simulate, and ultimately reshape, the environments in which agents operate.
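Abstract (reference translation): As AI systems shift from generating text to achieving goals through sustained interaction, the ability to model environment dynamics becomes a central bottleneck. Agents that manipulate objects, navigate software, coordinate with others, or design experiments need predictive environment models, yet the term "world model" means different things across research communities. We introduce a "levels x laws" taxonomy organized along two axes. The first defines three capability levels: L1 Predictor, which learns one-step local transition operators; L2 Simulator, which composes them into multi-step, action-conditioned rollouts that respect domain laws; and L3 Evolver, which autonomously revises its own model when predictions fail against new evidence. The second identifies four governing-law regimes: physical, digital, social, and scientific. These regimes determine which constraints a world model must satisfy and where it is most likely to fail. Using this framework, we synthesize over 400 works and summarize more than 100 representative systems spanning model-based reinforcement learning, video generation, web and GUI agents, multi-agent social simulation, and AI-driven scientific discovery. We analyze methods, failure modes, and evaluation practices across level-regime pairs, propose decision-centric evaluation principles and a minimal reproducible evaluation package, and outline architectural guidance, open problems, and governance challenges. The resulting roadmap connects previously isolated communities and charts a path from passive next-step prediction toward world models that can simulate, and ultimately reshape, the environments in which agents operate.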
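
The three capability levels named in the abstract (L1 Predictor, L2 Simulator, L3 Evolver) can be illustrated with a toy sketch. This is not the paper's method or code: the 1-D grid world, the class names, the `LIMIT` bound standing in for a "domain law", and the revision rule are all assumptions made up for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List

# Hypothetical toy world: the state is an integer position, actions are
# strings, and the only assumed "domain law" is 0 <= position <= LIMIT.
LIMIT = 5


@dataclass
class Predictor:
    """L1: one-step local transition operator (a per-action offset table)."""
    delta: Dict[str, int] = field(
        default_factory=lambda: {"left": -1, "right": 1}
    )

    def step(self, state: int, action: str) -> int:
        # Unknown actions are predicted to leave the state unchanged.
        return state + self.delta.get(action, 0)


@dataclass
class Simulator:
    """L2: composes one-step predictions into a multi-step rollout."""
    predictor: Predictor

    def rollout(self, state: int, actions: List[str]) -> List[int]:
        trajectory = [state]
        for action in actions:
            state = self.predictor.step(state, action)
            state = max(0, min(LIMIT, state))  # enforce the assumed law
            trajectory.append(state)
        return trajectory


@dataclass
class Evolver:
    """L3: revises the model when a prediction fails against evidence."""
    simulator: Simulator

    def observe(self, state: int, action: str, observed_next: int) -> bool:
        predicted = self.simulator.rollout(state, [action])[-1]
        if predicted == observed_next:
            return False  # prediction held, so nothing is revised
        # Naive revision: overwrite the offset so it matches the evidence.
        self.simulator.predictor.delta[action] = observed_next - state
        return True


predictor = Predictor()
simulator = Simulator(predictor)
evolver = Evolver(simulator)

before = simulator.rollout(0, ["right", "right", "left"])
revised = evolver.observe(0, "right", 2)  # evidence: "right" moves 2 cells
after = simulator.rollout(0, ["right", "right", "left"])
print(before, revised, after)
```

In this sketch the levels nest: the simulator only calls the predictor, and the evolver only changes the predictor's table, so a revision triggered by one failed prediction changes every later rollout. The revision rule is deliberately naive; for example, it would learn a wrong offset from evidence gathered at the boundary.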


Related papers