Fugu-MT 論文翻訳(概要): MotionDreamer: Universal Skeletal Motion Generation for 3D Rigged Shapes

論文の概要: MotionDreamer: Universal Skeletal Motion Generation for 3D Rigged Shapes

arxiv url: http://arxiv.org/abs/2606.01518v1
Date: Mon, 01 Jun 2026 00:42:31 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-02 21:34:29.76007
Title: MotionDreamer: Universal Skeletal Motion Generation for 3D Rigged Shapes
Title（参考訳）: MotionDreamer: 3D Rigged Shapesのためのユニバーサル骨格モーションジェネレーション
Authors: Ye Tao, Yuxin Yao, Kendong Liu, Dapeng Wu, Junhui Hou,
Abstract要約: MotionDreamerは、2Dビデオガイダンスからカテゴリに依存しない骨格アニメーションを生成するための拡散ベースのフレームワークである。本研究では,2次元視覚運動キューとヘテロジニアスな3次元骨格構造とのギャップを埋める構造的意味注入機構を提案する。提案手法は既存の手法を著しく上回り、堅牢で効率的な4Dアセット生成のための最先端のベンチマークを新たに設定する。
参考スコア（独自算出の注目度）: 52.45449797329548
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Motion generation for rigged shapes is vital for scalable 4D asset production. However, template-based methods are limited by specific topologies and fail to generalize across diverse morphologies. Conversely, per-case optimization is computationally expensive, susceptible to local optima, and highly sensitive to viewpoint-induced ambiguities. In this paper, we present MotionDreamer, a diffusion-based framework designed for category-agnostic skeletal animation generation from 2D video guidance. To overcome the scarcity of high-quality training data, we have curated a large-scale dynamic dataset comprising approximately 20,000 diverse 3D models, each featuring complete textures, skeletal rigging, and a wide array of comprehensive animation sequences. To bridge the kinematic gap between 2D visual motion cues and heterogeneous 3D skeletal structures, we propose a structural-semantic injection mechanism. Our model integrates texture and semantic attributes directly into skeletal joint representations. This allows it to map perceived visual dynamics to specific joint hierarchies and their functional roles. This enables MotionDreamer to synthesize high-fidelity animations that maintain anatomical consistency across a vast range of unseen categories, from existing biological species to fantastical beings. Extensive experiments demonstrate that our approach significantly outperforms existing methods, setting a new state-of-the-art benchmark for robust and efficient 4D asset generation. The code will be made publicly available upon acceptance.
Abstract（参考訳）: 剛体形状の運動生成はスケーラブルな4Dアセット生産に不可欠である。しかし、テンプレートベースの手法は特定のトポロジーによって制限され、様々な形態をまたいだ一般化に失敗する。逆に、ケースごとの最適化は計算コストが高く、局所的な最適性に影響を受けやすく、視点によるあいまいさに非常に敏感である。本稿では,2次元ビデオガイダンスからカテゴリに依存しない骨格アニメーションを生成するための拡散型フレームワークであるMotionDreamerを提案する。高品質なトレーニングデータの不足を克服するため,約2万の多様な3Dモデルからなる大規模動的データセットをキュレートした。 2次元視覚運動キューとヘテロジニアスな3次元骨格構造との間の運動的ギャップを埋めるために,構造意味注入機構を提案する。我々のモデルは、テクスチャとセマンティック属性を直接骨格関節表現に統合する。これにより、知覚された視覚力学を特定の関節階層とその機能的役割にマッピングすることができる。これにより、MotionDreamerは、既存の生物種から幻想的な生物まで、さまざまな未確認カテゴリーにわたる解剖学的一貫性を維持する高忠実なアニメーションを合成することができる。大規模な実験により、我々の手法は既存の手法よりも大幅に優れており、ロバストで効率的な4Dアセット生成のための新しい最先端のベンチマークが設定されている。コードは受理時に公開されます。

論文の概要: MotionDreamer: Universal Skeletal Motion Generation for 3D Rigged Shapes

関連論文リスト