Fugu-MT 論文翻訳(概要): SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations

論文の概要: SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations

arxiv url: http://arxiv.org/abs/2512.05905v1
Date: Fri, 05 Dec 2025 17:38:55 GMT
ステータス: 翻訳完了
システム内更新日: 2025-12-13 22:40:57.115722
Title: SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations
Title（参考訳）: SCAIL: 3D-Consistent Pose Representationのインコンテキスト学習によるStudio-Grade文字アニメーションの実現
Authors: Wenhao Yan, Sheng Ye, Zhuoyi Yang, Jiayan Teng, ZhenHui Dong, Kairui Wen, Xiaotao Gu, Yong-Jin Liu, Jie Tang,
Abstract要約: 既存のアプローチでは、駆動ビデオから参照画像への動きを転送することができるが、しばしば野生のシナリオにおける構造的忠実さと時間的一貫性を維持できない。 textbfSCAIL (textbfStudio-grade textbfCharacter textbfAnimation via textbfIn-context textbfLL)は、2つの重要なイノベーションからこれらの課題に対処するために設計されたフレームワークである。
参考スコア（独自算出の注目度）: 31.524598864213996
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Achieving character animation that meets studio-grade production standards remains challenging despite recent progress. Existing approaches can transfer motion from a driving video to a reference image, but often fail to preserve structural fidelity and temporal consistency in wild scenarios involving complex motion and cross-identity animations. In this work, we present \textbf{SCAIL} (\textbf{S}tudio-grade \textbf{C}haracter \textbf{A}nimation via \textbf{I}n-context \textbf{L}earning), a framework designed to address these challenges from two key innovations. First, we propose a novel 3D pose representation, providing a more robust and flexible motion signal. Second, we introduce a full-context pose injection mechanism within a diffusion-transformer architecture, enabling effective spatio-temporal reasoning over full motion sequences. To align with studio-level requirements, we develop a curated data pipeline ensuring both diversity and quality, and establish a comprehensive benchmark for systematic evaluation. Experiments show that \textbf{SCAIL} achieves state-of-the-art performance and advances character animation toward studio-grade reliability and realism.
Abstract（参考訳）: 近年の進歩にもかかわらず、スタジオグレードのプロダクション標準を満たすキャラクターアニメーションの達成は困難である。既存のアプローチでは、駆動ビデオから参照画像への動きを転送することができるが、複雑な動きやクロスアイデンティティのアニメーションを含む野生のシナリオにおいて、構造的忠実さと時間的一貫性を維持できない場合が多い。本稿では,2つの重要なイノベーションからこれらの課題に対処するために設計されたフレームワークである \textbf{SCAIL} (\textbf{S}tudio-grade \textbf{C}haracter \textbf{A}nimation via \textbf{I}n-context \textbf{L}earning)を提案する。まず,より頑健で柔軟な動作信号を提供する新しい3次元ポーズ表現を提案する。第2に、拡散変換器アーキテクチャにおいて、フルコンテキストのポーズ注入機構を導入し、フルモーションシーケンスに対する効果的な時空間推論を可能にする。スタジオレベルの要件に合わせて,多様性と品質を両立させるキュレートされたデータパイプラインを開発し,システム評価のための総合的なベンチマークを確立する。実験により, <textbf{SCAIL} は最先端のパフォーマンスを実現し, スタジオグレードの信頼性とリアリズムに向けてキャラクタアニメーションを進化させることが示された。

論文の概要: SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations

関連論文リスト