Fugu-MT 論文翻訳(概要): AniMatrix: An Anime Video Generation Model that Thinks in Art, Not Physics

論文の概要: AniMatrix: An Anime Video Generation Model that Thinks in Art, Not Physics

arxiv url: http://arxiv.org/abs/2605.03652v2
Date: Thu, 07 May 2026 12:35:51 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-08 22:27:11.293681
Title: AniMatrix: An Anime Video Generation Model that Thinks in Art, Not Physics
Title（参考訳）: AniMatrix: 物理ではなく芸術を思い浮かべるアニメビデオ生成モデル
Authors: Tencent HY Team,
Abstract要約: AniMatrixは、物理的正確性ではなく芸術性を再定義するビデオ生成モデルである。 AniMatrixは、二重チャネルの条件付け機構と3段階の遷移によって、物理的な正確さよりも芸術性を目標としている。プロのアニメーターによる5つの制作次元を持つアニメ固有の人間評価において、AniMatrixは5つ中4つにランクインしている。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Video generation models internalize physical realism as their prior. Anime deliberately violates physics: smears, impact frames, chibi shifts; and its thousands of coexisting artistic conventions yield no single "physics of anime" a model can absorb. Physics-biased models therefore flatten the artistry that defines the medium or collapse under its stylistic variance. We present AniMatrix, a video generation model that targets artistic rather than physical correctness through a dual-channel conditioning mechanism and a three-step transition: redefine correctness, override the physics prior, and distinguish art from failure. First, a Production Knowledge System encodes anime as a structured taxonomy of controllable production variables (Style, Motion, Camera, VFX), and AniCaption infers these variables from pixels as directorial directives. A trainable tag encoder preserves the field-value structure of this taxonomy while a frozen T5 encoder handles free-form narrative; dual-path injection (cross-attention for fine-grained control, AdaLN modulation for global enforcement) ensures categorical directives are never diluted by open-ended text. Second, a style-motion-deformation curriculum transitions the model from near-physical motion to full anime expressiveness. Third, deformation-aware preference optimization with a domain-specific reward model separates intentional artistry from pathological collapse. On an anime-specific human evaluation with five production dimensions scored by professional animators, AniMatrix ranks first on four of five, with the largest gains over Seedance-Pro 1.0 on Prompt Understanding (+0.70, +22.4 percent) and Artistic Motion (+0.55, +16.9 percent). We are preparing accompanying resources for public release to support reproducibility and follow-up research.
Abstract（参考訳）: ビデオ生成モデルは、物理リアリズムをそれ以前のものとして内部化する。アニメは、スミア、衝撃フレーム、チビのシフトといった物理学に意図的に違反し、その数千の共存する芸術的慣習は、モデルが吸収できる唯一の「アニメの物理学」を生み出しない。したがって、物理バイアスのあるモデルは、その様式的な分散の下で媒体や崩壊を定義する芸術性をフラットにする。 AniMatrixは、二重チャネル条件付け機構と3段階の遷移により、物理的正確性よりも芸術性を目標とするビデオ生成モデルである。まず、生産知識システムは、制御可能な生産変数(スタイル、モーション、カメラ、VFX)の構造化分類としてアニメを符号化し、AniCaptionはこれらの変数を画素からディレクトリディレクティブとして推論する。訓練可能なタグエンコーダは、この分類のフィールド値構造を保ち、凍結されたT5エンコーダは、自由形式の物語を扱う。第二に、スタイル・モーション・デフォーメーションのカリキュラムは、モデルをほぼ物理運動から完全なアニメ表現に移行させる。第三に、ドメイン固有の報酬モデルによる変形認識の好みの最適化は、意図的な芸術性を病理的な崩壊から切り離す。アニマトリクスはプロのアニメーターによる5つの制作次元を持つアニメ固有の人間評価において、プロンプト理解(+0.70, +22.4%)とアーティスティック・モーション(+0.55, +16.9%)でSeedance-Pro 1.0の4位にランクインしている。再現性およびフォローアップ研究を支援するため、公募に付随する資源を準備している。

論文の概要: AniMatrix: An Anime Video Generation Model that Thinks in Art, Not Physics

関連論文リスト