Fugu-MT 論文翻訳(概要): Motion Marionette: Rethinking Rigid Motion Transfer via Prior Guidance

論文の概要: Motion Marionette: Rethinking Rigid Motion Transfer via Prior Guidance

arxiv url: http://arxiv.org/abs/2511.19909v1
Date: Tue, 25 Nov 2025 04:34:42 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-26 17:37:04.269525
Title: Motion Marionette: Rethinking Rigid Motion Transfer via Prior Guidance
Title（参考訳）: Motion Marionette: プライオリエンスによるリジッドモーショントランスファーの再考
Authors: Haoxuan Wang, Jiachen Tao, Junyi Wu, Gaowen Liu, Ramana Rao Kompella, Yan Yan,
Abstract要約: Motion Marionetteは、モノクルソースビデオからシングルビューターゲット画像への厳格なモーション転送のためのフレームワークである。動画から動き軌跡を抽出して空間時間(SpaT)を予め構築する。結果の速度場は、効率的なビデオ制作に柔軟に利用することができる。
参考スコア（独自算出の注目度）: 26.642143303176997
License: http://creativecommons.org/licenses/by/4.0/
Abstract: We present Motion Marionette, a zero-shot framework for rigid motion transfer from monocular source videos to single-view target images. Previous works typically employ geometric, generative, or simulation priors to guide the transfer process, but these external priors introduce auxiliary constraints that lead to trade-offs between generalizability and temporal consistency. To address these limitations, we propose guiding the motion transfer process through an internal prior that exclusively captures the spatial-temporal transformations and is shared between the source video and any transferred target video. Specifically, we first lift both the source video and the target image into a unified 3D representation space. Motion trajectories are then extracted from the source video to construct a spatial-temporal (SpaT) prior that is independent of object geometry and semantics, encoding relative spatial variations over time. This prior is further integrated with the target object to synthesize a controllable velocity field, which is subsequently refined using Position-Based Dynamics to mitigate artifacts and enhance visual coherence. The resulting velocity field can be flexibly employed for efficient video production. Empirical results demonstrate that Motion Marionette generalizes across diverse objects, produces temporally consistent videos that align well with the source motion, and supports controllable video generation.
Abstract（参考訳）: モノクロ映像からシングルビュー対象画像への剛性移動のためのゼロショットフレームワークであるMotion Marionetteを提案する。従来の研究では、幾何学的、生成的、あるいはシミュレーションに先立って転送プロセスを導くのが一般的だったが、これらの外部の先行は、一般化可能性と時間的一貫性の間のトレードオフをもたらす補助的な制約を導入する。これらの制約に対処するために、我々は、空間的時間的変換のみをキャプチャし、ソースビデオと転送対象のビデオ間で共有する、内部の事前を通したモーション転送プロセスの導出を提案する。具体的には、まず、ソースビデオとターゲット画像の両方を統一された3D表現空間に引き上げる。モーショントラジェクトリはソースビデオから抽出され、オブジェクトの幾何学や意味論に依存しない空間時間(SpaT)を事前に構築し、時間とともに相対的な空間変動を符号化する。この前者は、制御可能な速度場を合成するためにターゲットオブジェクトとさらに統合され、その後、位置ベースダイナミクスを使用して、アーティファクトを緩和し、視覚的コヒーレンスを高めるために洗練される。結果の速度場は、効率的なビデオ制作に柔軟に利用することができる。実験の結果,Motion Marionetteは様々な物体をまたいで一般化し,時間的に一貫した映像を生成し,映像生成を制御可能であることが示された。

論文の概要: Motion Marionette: Rethinking Rigid Motion Transfer via Prior Guidance

関連論文リスト