Fugu-MT Paper Translation (Summary): Synthetic Computers at Scale for Long-Horizon Productivity Simulation

Paper summary: Synthetic Computers at Scale for Long-Horizon Productivity Simulation

arxiv url: http://arxiv.org/abs/2604.28181v1
Date: Thu, 30 Apr 2026 17:58:02 GMT
Status: Translation complete
System update date: 2026-05-01 16:31:54.246632
Title: Synthetic Computers at Scale for Long-Horizon Productivity Simulation
Title (reference translation): Large-Scale Synthetic Computers for Long-Horizon Productivity Simulation
Authors: Tao Ge, Baolin Peng, Hao Cheng, Jianfeng Gao
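Abstract summary: This paper introduces Synthetic Computers at Scale, a scalable methodology for building user-specific computer environments. In preliminary experiments, 1,000 synthetic computers are created and long-horizon simulations are run on them. These simulations produce rich experiential learning signals, whose effectiveness is validated by significant improvements in agent performance.
Reference score (independently computed attention score): 47.31865037664483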
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Realistic long-horizon productivity work is strongly conditioned on user-specific computer environments, where much of the work context is stored and organized through directory structures and content-rich artifacts. To scale synthetic data creation for such productivity scenarios, we introduce Synthetic Computers at Scale, a scalable methodology for creating such environments with realistic folder hierarchies and content-rich artifacts (e.g., documents, spreadsheets, and presentations). Conditioned on each synthetic computer, we run long-horizon simulations: one agent creates productivity objectives that are specific to the computer's user and require multiple professional deliverables and about a month of human work; another agent then acts as that user and keeps working across the computer -- for example, navigating the filesystem for grounding, coordinating with simulated collaborators, and producing professional artifacts -- until these objectives are completed. In preliminary experiments, we create 1,000 synthetic computers and run long-horizon simulations on them; each run requires over 8 hours of agent runtime and spans more than 2,000 turns on average. These simulations produce rich experiential learning signals, whose effectiveness is validated by significant improvements in agent performance on both in-domain and out-of-domain productivity evaluations. Given that personas are abundant at billion scale, this methodology can in principle scale to millions or even billions of synthetic user worlds with sufficient compute, enabling broader coverage of diverse professions, roles, contexts, environments, and productivity needs. We argue that scalable synthetic computer creation, together with at-scale simulations, is highly promising as a foundational substrate for agent self-improvement and agentic reinforcement learning in long-horizon productivity scenarios.
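Abstract (reference translation): Realistic long-horizon productivity work is strongly conditioned on user-specific computer environments, where much of the work context is stored and organized through directory structures and content-rich artifacts. To scale synthetic data creation for such productivity scenarios, we introduce Synthetic Computers at Scale, a scalable methodology for creating such environments with realistic folder hierarchies and content-rich artifacts (documents, spreadsheets, presentations, and so on). One agent creates productivity objectives that are specific to the computer's user and that require multiple professional deliverables and about a month of human work. In preliminary experiments, we create 1,000 synthetic computers and run long-horizon simulations on them. These simulations produce rich experiential learning signals, validated by significant improvements in agent performance on both in-domain and out-of-domain productivity evaluations. Given that personas are abundant at billion scale, this methodology can in principle scale, with sufficient compute, to millions or even billions of synthetic user worlds, broadly covering diverse professions, roles, contexts, environments, and productivity needs. We argue that scalable synthetic computer creation, together with large-scale simulation, is highly promising as a foundational substrate for agent self-improvement and agentic reinforcement learning in long-horizon productivity scenarios.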
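
To give a sense of the scale implied by the preliminary experiments, the figures quoted in the abstract (1,000 synthetic computers, over 8 hours of agent runtime per run, more than 2,000 turns per run on average) can be multiplied out. The sketch below is only a back-of-envelope estimate: it assumes exactly one simulation run per synthetic computer, which the abstract does not state, and it treats the quoted figures as floors, so the results are rough lower bounds. The function and constant names are illustrative and do not come from the paper.

```python
# Back-of-envelope scale estimate from the figures quoted in the abstract.
# Assumption (not stated in the abstract): one simulation run per computer.

NUM_COMPUTERS = 1_000          # synthetic computers in the preliminary experiments
MIN_HOURS_PER_RUN = 8          # "over 8 hours of agent runtime" per run
MIN_AVG_TURNS_PER_RUN = 2_000  # "more than 2,000 turns on average" per run


def aggregate_lower_bounds(num_computers, hours_per_run, turns_per_run):
    """Return (total agent-hours, total turns) across all runs."""
    total_hours = num_computers * hours_per_run
    total_turns = num_computers * turns_per_run
    return total_hours, total_turns


total_hours, total_turns = aggregate_lower_bounds(
    NUM_COMPUTERS, MIN_HOURS_PER_RUN, MIN_AVG_TURNS_PER_RUN
)
print(f"At least {total_hours:,} agent-hours and {total_turns:,} turns in total")
```

Under these assumptions the preliminary experiments amount to at least 8,000 agent-hours and 2,000,000 turns in aggregate.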

Related papers