Fugu-MT 論文翻訳(概要): AnimationBench: Are Video Models Good at Character-Centric Animation?

論文の概要: AnimationBench: Are Video Models Good at Character-Centric Animation?

arxiv url: http://arxiv.org/abs/2604.15299v1
Date: Thu, 16 Apr 2026 17:57:08 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-17 21:29:32.045094
Title: AnimationBench: Are Video Models Good at Character-Centric Animation?
Title（参考訳）: AnimationBench: ビデオモデルはキャラクター中心のアニメーションに優れているか?
Authors: Leyi Wu, Pengjun Fang, Kai Sun, Yazhou Xing, Yinwei Wu, Songsong Wang, Ziqi Huang, Dan Zhou, Yingqing He, Ying-Cong Chen, Qifeng Chen,
Abstract要約: AnimationBenchはアニメーション画像とビデオの生成を評価するための最初の体系的なベンチマークである。アニメーションの基本原則とIP保存を計測可能な評価次元に運用する。リアリズム指向のベンチマークで見過ごされるアニメーション固有の品質差を公開する。
参考スコア（独自算出の注目度）: 67.15372653996833
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Video generation has advanced rapidly, with recent methods producing increasingly convincing animated results. However, existing benchmarks-largely designed for realistic videos-struggle to evaluate animation-style generation with its stylized appearance, exaggerated motion, and character-centric consistency. Moreover, they also rely on fixed prompt sets and rigid pipelines, offering limited flexibility for open-domain content and custom evaluation needs. To address this gap, we introduce AnimationBench, the first systematic benchmark for evaluating animation image-to-video generation. AnimationBench operationalizes the Twelve Basic Principles of Animation and IP Preservation into measurable evaluation dimensions, together with Broader Quality Dimensions including semantic consistency, motion rationality, and camera motion consistency. The benchmark supports both a standardized close-set evaluation for reproducible comparison and a flexible open-set evaluation for diagnostic analysis, and leverages visual-language models for scalable assessment. Extensive experiments show that AnimationBench aligns well with human judgment and exposes animation-specific quality differences overlooked by realism-oriented benchmarks, leading to more informative and discriminative evaluation of state-of-the-art I2V models.
Abstract（参考訳）: ビデオ生成は急速に進歩し、最近の手法はますます説得力のあるアニメーション結果を生み出している。しかし、既存のベンチマークは、アニメーションスタイルの生成をスタイル化された外観、誇張された動き、キャラクター中心の一貫性で評価するために、リアルなビデオストルグルのために設計されている。さらに、固定プロンプトセットや堅固なパイプラインにも依存しており、オープンドメインコンテンツやカスタム評価ニーズに対する柔軟性が制限されている。このギャップに対処するために,アニメーション画像から映像への生成を評価するための最初の体系的ベンチマークであるAnimationBenchを紹介する。 AnimationBenchは、セマンティック一貫性、モーション合理性、カメラモーション整合性を含むブロード品質次元とともに、アニメーションとIP保存の12の基本原則を計測可能な評価次元に運用する。このベンチマークは再現性比較のための標準化されたクローズセット評価と診断分析のための柔軟なオープンセット評価の両方をサポートし、スケーラブルな評価のために視覚言語モデルを活用する。広範囲な実験により、アニメーションベンチは人間の判断とよく一致し、リアリズム指向のベンチマークで見過ごされるアニメーション特有の品質差を露呈し、最先端のI2Vモデルのより情報的で差別的な評価をもたらすことが示されている。

論文の概要: AnimationBench: Are Video Models Good at Character-Centric Animation?

関連論文リスト