Fugu-MT 論文翻訳(概要): Cutscene Agent: An LLM Agent Framework for Automated 3D Cutscene Generation

論文の概要: Cutscene Agent: An LLM Agent Framework for Automated 3D Cutscene Generation

arxiv url: http://arxiv.org/abs/2604.25318v1
Date: Tue, 28 Apr 2026 07:28:14 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-29 16:49:17.758821
Title: Cutscene Agent: An LLM Agent Framework for Automated 3D Cutscene Generation
Title（参考訳）: Cutscene Agent: 自動3Dカットセン生成のためのLLMエージェントフレームワーク
Authors: Lanshan He, Haozhou Pang, Qi Gan, Xin Shen, Ziwei Zhang, Yibo Liu, Gang Fang, Bo Liu, Kai Sheng, Shengfeng Zeng, Chaofan Li, Zhen Hui, Keer Zhou, Lan Zhou, Shujun Dai,
Abstract要約: Cutscene Agentは、エンドツーエンドのCutscene自動生成のためのエージェントフレームワークである。フレームワークには3つのコントリビューションがある。モデルコンテキストプロトコル(MCP)上に構築されたCutscene Toolkit。 LLMエージェントとゲームエンジンの双方向統合。監督エージェントは、アニメーション、撮影撮影、音響デザインのスペシャリストを編成し、視覚的推論フィードバックループによって、知覚駆動の洗練のために強化する。
参考スコア（独自算出の注目度）: 13.671638376402377
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Cutscenes are carefully choreographed cinematic sequences embedded in video games and interactive media, serving as the primary vehicle for narrative delivery, character development, and emotional engagement. Producing cutscenes is inherently complex: it demands seamless coordination across screenwriting, cinematography, character animation, voice acting, and technical direction, often requiring days to weeks of collaborative effort from multidisciplinary teams to produce minutes of polished content. In this work, we present Cutscene Agent, an LLM agent framework for automated end-to-end cutscene generation. The framework makes three contributions: (1)~a Cutscene Toolkit built on the Model Context Protocol (MCP) that establishes \emph{bidirectional} integration between LLM agents and the game engine -- agents not only invoke engine operations but continuously observe real-time scene state, enabling closed-loop generation of editable engine-native cinematic assets; (2)~a multi-agent system where a director agent orchestrates specialist subagents for animation, cinematography, and sound design, augmented by a visual reasoning feedback loop for perception-driven refinement; and (3)~CutsceneBench, a hierarchical evaluation benchmark for cutscene generation. Unlike typical tool-use benchmarks that evaluate short, isolated function calls, cutscene generation requires long-horizon, multi-step orchestration of dozens of interdependent tool invocations with strict ordering constraints -- a capability dimension that existing benchmarks do not cover. We evaluate a range of LLMs on CutsceneBench and analyze their performance across this challenging task.
Abstract（参考訳）: カットシーンは、ビデオゲームやインタラクティブメディアに埋め込まれた、慎重に振付された映画シーケンスであり、物語の配信、キャラクター開発、感情的なエンゲージメントの主要な手段として機能する。スクリーンライティング、シネマグラフィー、キャラクターアニメーション、声優、そして技術的な方向性をシームレスに調整する必要がある。本研究では, エンド・ツー・エンドのカットシーン自動生成のためのLDMエージェントフレームワークであるCutscene Agentを提案する。 1 - A Cutscene Toolkit built on the Model Context Protocol (MCP) that establisheds \emph{bidirectional} integration with LLM agent and the game engine -- agent agent invoke engine operations but only continuous real-time scene state, allowing closed-loop generation of editingable engine-native cinematic assets; (2) - A multi-agent system which a director agent orchestrates specialist subagents for animation, cinematography, and sound design, augmented by a visual reasoning feedback loop for perception-driven refinement; (3) CutsceneBench, ahierarchical evaluation benchmark for cutscene generation。短い、孤立した関数呼び出しを評価する一般的なツール使用ベンチマークとは異なり、カットスーン生成には、厳密な順序制約を持つ数十の相互依存ツール呼び出しの長期的、複数ステップのオーケストレーションが必要です。我々は,CutsceneBench 上での LLM の範囲を評価し,この課題にまたがる性能を解析する。

論文の概要: Cutscene Agent: An LLM Agent Framework for Automated 3D Cutscene Generation

関連論文リスト