Fugu-MT 論文翻訳(概要): AnimateAnyMesh++: A Flexible 4D Foundation Model for High-Fidelity Text-Driven Mesh Animation

論文の概要: AnimateAnyMesh++: A Flexible 4D Foundation Model for High-Fidelity Text-Driven Mesh Animation

arxiv url: http://arxiv.org/abs/2604.26917v1
Date: Wed, 29 Apr 2026 17:27:40 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-30 15:59:36.519101
Title: AnimateAnyMesh++: A Flexible 4D Foundation Model for High-Fidelity Text-Driven Mesh Animation
Title（参考訳）: AnimateAnyMesh++: 高忠実なテキスト駆動メッシュアニメーションのためのフレキシブルな4Dファンデーションモデル
Authors: Zijie Wu, Chaohui Yu, Fan Wang, Xiang Bai,
Abstract要約: AnimateAnyMesh++は任意の3Dメッシュのテキスト駆動アニメーションのためのフィードフォワードフレームワークである。 AnimateAnyMesh++はセマンティックに正確で時間的にコヒーレントなメッシュアニメーションを数秒で生成する。
参考スコア（独自算出の注目度）: 60.958967260016976
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Recent advances in 4D content generation have attracted increasing attention, yet creating high-quality animated 3D models remains challenging due to the complexity of modeling spatio-temporal distributions and the scarcity of 4D training data. We present AnimateAnyMesh++, a feed-forward framework for text-driven animation of arbitrary 3D meshes with substantial upgrades in data, architecture, and generative capability. First, we expand the DyMesh-XL dataset by mining dynamic content from Objaverse-XL, increasing the number of unique identities from 60K to 300K and substantially broadening category and motion diversity. Second, we redesign DyMeshVAE-Flex with power-law topology-aware attention and vertex-normal enhanced features, which significantly improves trajectory reconstruction, local geometry preservation, and mitigates trajectory-sticking artifacts. Third, we introduce architectural changes to both DyMeshVAE-Flex and the rectified-flow (RF) generator to support variable-length sequence training and generation, enabling longer animations while preserving reconstruction fidelity. Extensive experiments demonstrate that AnimateAnyMesh++ generates semantically accurate and temporally coherent mesh animations within seconds, surpassing prior approaches in quality and efficiency. The enlarged DyMesh-XL, the upgraded DyMeshVAE-Flex, and variable-length RF together deliver consistent gains across benchmarks and in-the-wild meshes. We will release code, models, and the expanded DyMesh-XL upon acceptance of this manuscript to facilitate research in 4D content creation.
Abstract（参考訳）: 近年の4Dコンテンツ生成の進歩が注目されているが、時空間分布のモデリングの複雑さと4Dトレーニングデータの不足により、高品質な3Dモデルを作成することは依然として困難である。 AnimateAnyMesh++は、任意の3Dメッシュのテキスト駆動アニメーションのためのフィードフォワードフレームワークで、データ、アーキテクチャ、生成機能を大幅にアップグレードする。まず、Objaverse-XLから動的コンテンツをマイニングすることでDyMesh-XLデータセットを拡張し、60Kから300Kまでのユニークなアイデンティティの数を増やし、カテゴリと動きの多様性を大幅に広げる。第二に,我々はDyMeshVAE-Flexを再設計し,トポロジを意識したトポロジと頂点正規化機能を導入し,トラジェクトリ再構成,局所的な地形保存,トラジェクトリスティックアーティファクトの軽減を実現した。第3に、DyMeshVAE-Flexと修正フロー(RF)ジェネレータの両方にアーキテクチャ変更を導入し、可変長シーケンスのトレーニングと生成をサポートし、再現性を維持しながらより長いアニメーションを可能にする。大規模な実験では、AnimateAnyMesh++が数秒でセマンティックに正確で時間的にコヒーレントなメッシュアニメーションを生成し、品質と効率の従来のアプローチを上回ることが示されている。拡張されたDyMesh-XL、アップグレードされたDyMeshVAE-Flex、可変長RFは、ベンチマークとWildメッシュ間で一貫した利得を提供する。この原稿を受理し、4Dコンテンツ作成の研究を促進するため、コード、モデル、拡張されたDyMesh-XLをリリースする。

論文の概要: AnimateAnyMesh++: A Flexible 4D Foundation Model for High-Fidelity Text-Driven Mesh Animation

関連論文リスト