Fugu-MT 論文翻訳(概要): EgoCS-400K: An Egocentric Gameplay Dataset for World Models

論文の概要: EgoCS-400K: An Egocentric Gameplay Dataset for World Models

arxiv url: http://arxiv.org/abs/2606.18180v1
Date: Tue, 16 Jun 2026 17:13:58 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-17 17:15:32.567003
Title: EgoCS-400K: An Egocentric Gameplay Dataset for World Models
Title（参考訳）: EgoCS-400K:世界モデルのためのエゴセントリックなゲームプレイデータセット
Authors: Rongjin Guo, Dong Liang, Yuhao Liu, Fang Liu, Tianyu Huang, Gerhard P. Hancke, Rynson W. H. Lau,
Abstract要約: EgoCS-400Kは,世界モデルのための大規模リプレイ・グラウンド型Egocentric Counter-Strikeデータセットである。パブリックプロのCSとCS2のマッチデモから作られ、人間のゲームプレイの軌跡を保存している。 EgoCS-400Kには、1000以上の試合と4万回以上のラウンドから40万人以上の個人ビデオと1万時間のゲームプレイが含まれている。
参考スコア（独自算出の注目度）: 53.9389499966106
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The shift from video generation to interactive world modeling places new demands on data: beyond captioned videos, world models require temporally aligned video-action-language trajectories grounded in the actions, camera motion, states, and events that drive future scene changes. However, such data is difficult to obtain at scale. Web video datasets offer broad visual coverage but lack executable actions and reliable states; robotic datasets provide action and state supervision but are costly and limited in scene diversity; and existing simulators often lack large-scale human-driven interaction trajectories. In this paper, we introduce EgoCS-400K, a large-scale replay-grounded egocentric Counter-Strike dataset for world models, built from public professional CS and CS2 match demos that preserve human gameplay trajectories and enable parsing, replaying, rendering, and temporal alignment. We extract player states, view directions, movements, keyboard/button inputs, view-angle changes, weapon usage, game events, and round-level context, and render clean first-person videos from the same trajectories. EgoCS-400K contains over 400,000 first-person videos and 10,000 hours of gameplay from more than 1,000 matches and 40,000 rounds, covering 13 maps and 10 player viewpoints per round. It supports a range of interactive visual modeling tasks, including action-conditioned future prediction, state- and event-aware scene rollout, replay-grounded captioning, and agent egocentric action understanding. By connecting visual observations with human actions, camera motion, game states, and events at scale, EgoCS-400K serves as a practical bridge between passive web videos, controllable game simulation, and costly real-world embodied data.
Abstract（参考訳）: ビデオ生成からインタラクティブな世界モデリングへのシフトは、キャプション付きビデオを超えて、世界モデルは、アクション、カメラモーション、状態、将来のシーン変更を駆動するイベントに、時間的に整列されたビデオアクション言語トラジェクトリを必要とする。しかし、そのようなデータは大規模に入手することは困難である。 Webビデオデータセットは幅広い視覚的カバレッジを提供するが、実行可能なアクションや信頼性のある状態は欠く; ロボットデータセットはアクションと状態の監視を提供するが、コストがかかり、シーンの多様性が制限される; 既存のシミュレータは、大規模な人間主導のインタラクション軌跡を欠いていることが多い。本稿では,ヒトの遊戯軌跡を保存し,解析,再生,レンダリング,時間的アライメントを可能にするCSとCS2のマッチングデモから構築した,世界モデルのための大規模リプレイ・グラウンド・エゴセントリック・カウンタ・ストライクデータセットであるEgoCS-400Kを紹介する。プレイヤーの状態、方向、動き、キーボード/ボタン入力、ビューアングルの変更、武器使用、ゲームイベント、ラウンドレベルのコンテキストを抽出し、同じ軌跡からクリーンなファーストパーソンビデオを描画する。 EgoCS-400Kには、1000試合以上から40万人以上の個人ビデオと1万時間のゲームプレイがあり、4万ラウンドで13の地図と10のプレイヤー視点をカバーしている。アクション条件付き将来の予測、状態とイベント対応のシーンロールアウト、リプレイグラウンドのキャプション、エージェントのエゴセントリックなアクション理解など、インタラクティブなビジュアルモデリングタスクをサポートする。 EgoCS-400Kは、人間の行動、カメラの動き、ゲーム状態、大規模イベントと視覚的観察を結びつけることで、受動的ウェブビデオ、制御可能なゲームシミュレーション、そしてコストのかかる実世界のエンボディドデータの間の実践的なブリッジとして機能する。

論文の概要: EgoCS-400K: An Egocentric Gameplay Dataset for World Models

関連論文リスト