Fugu-MT 論文翻訳(概要): FutureWorld: A Live Environment for Training Predictive Agents with Real-World Outcome Rewards

論文の概要: FutureWorld: A Live Environment for Training Predictive Agents with Real-World Outcome Rewards

arxiv url: http://arxiv.org/abs/2604.26733v1
Date: Wed, 29 Apr 2026 14:34:45 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-30 15:59:36.442649
Title: FutureWorld: A Live Environment for Training Predictive Agents with Real-World Outcome Rewards
Title（参考訳）: FutureWorld: リアルなアウトカムリワードのある予測エージェントをトレーニングするためのライブ環境
Authors: Zhixin Han, Yanzhi Zhang, Chuyang Wei, Maohang Gao, Xiawei Yue, Kefei Chen, Yu Zhuang, Haoxiang Guan, Jiyan He, Jian Li, Yitong Duan, Yu Shi, Mengting Hu, Shuxin Zheng,
Abstract要約: ライブ・フューチャー・予測(Live Future Prediction)とは、現実の事象が展開する前に予測を行うタスクである。本稿では,予測,結果実現,パラメータ更新の間のトレーニングループを閉鎖するエージェント強化学習環境であるFutureWorldを紹介する。
参考スコア（独自算出の注目度）: 20.541743597851177
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Live future prediction refers to the task of making predictions about real-world events before they unfold. This task is increasingly studied using large language model-based agent systems, and it is important for building agents that can continually learn from real-world. Just as interactive environments have often driven progress in agents, advancing live future prediction naturally motivates viewing it as a learning environment. Prior works have explored future prediction from several different parts, but have generally not framed it as a unified learning environment. This task is appealing for learning because it can provide a large number of prediction questions grounded in diverse real-world events, while preventing answer leakage. To leverage the advantages of live future prediction, we present FutureWorld, a live agentic reinforcement learning environment that closes the training loop between prediction, outcome realization, and parameters update. In our environment, we take three open-source base models and train them for consecutive days. The results show that training is effective. Furthermore, we build a daily benchmark based on the environment and evaluate several frontier agents on it to establish performance baselines for current agent systems.
Abstract（参考訳）: ライブ・フューチャー・予測(Live Future Prediction)とは、現実の事象が展開する前に予測を行うタスクである。このタスクは、大規模言語モデルに基づくエージェントシステムを用いて、ますます研究され、現実世界から継続的に学習できるエージェントを構築することが重要である。対話的な環境がエージェントの進歩を後押しするのと同じように、ライブ未来予測が自然に学習環境と見なす動機となる。以前の研究は、いくつかの異なる部分から将来の予測を探求してきたが、一般的には、それを統一的な学習環境とみなすことはなかった。このタスクは、さまざまな現実世界のイベントに根ざした多くの予測質問を提供すると同時に、回答の漏洩を防止できるため、学習にアピールする。将来予測の利点を活用するために,予測,結果実現,パラメータ更新の間のトレーニングループを閉鎖するエージェント強化学習環境であるFutureWorldを提案する。私たちの環境では、3つのオープンソースベースモデルを連続してトレーニングします。その結果,トレーニングが効果的であることが示唆された。さらに、環境に基づく日次ベンチマークを構築し、その上で複数のフロンティアエージェントを評価し、現在のエージェントシステムのパフォーマンスベースラインを確立する。

論文の概要: FutureWorld: A Live Environment for Training Predictive Agents with Real-World Outcome Rewards

関連論文リスト