Fugu-MT 論文翻訳(概要): Gaussian See, Gaussian Do: Semantic 3D Motion Transfer from Multiview Video

論文の概要: Gaussian See, Gaussian Do: Semantic 3D Motion Transfer from Multiview Video

arxiv url: http://arxiv.org/abs/2511.14848v1
Date: Tue, 18 Nov 2025 19:02:50 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-20 15:51:28.495203
Title: Gaussian See, Gaussian Do: Semantic 3D Motion Transfer from Multiview Video
Title（参考訳）: Gaussian See, Gaussian Do:Semantic 3D Motion Transfer from Multiview Video (英語)
Authors: Yarin Bekor, Gal Michael Harari, Or Perel, Or Litany,
Abstract要約: マルチビュー映像からの3D動画のセマンティック・モーション・トランスファーのための新しい手法を提案する。条件インバージョンにより映像から動き埋め込みを抽出し,レンダリングフレームに適用し,結果の動画を用いて動的3次元ガウススプラッティング再構成を監督する。セマンティックな3次元動き伝達のための最初のベンチマークを確立し, 適応ベースラインよりも優れた動きの忠実度と構造的整合性を示す。
参考スコア（独自算出の注目度）: 15.994811723477973
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: We present Gaussian See, Gaussian Do, a novel approach for semantic 3D motion transfer from multiview video. Our method enables rig-free, cross-category motion transfer between objects with semantically meaningful correspondence. Building on implicit motion transfer techniques, we extract motion embeddings from source videos via condition inversion, apply them to rendered frames of static target shapes, and use the resulting videos to supervise dynamic 3D Gaussian Splatting reconstruction. Our approach introduces an anchor-based view-aware motion embedding mechanism, ensuring cross-view consistency and accelerating convergence, along with a robust 4D reconstruction pipeline that consolidates noisy supervision videos. We establish the first benchmark for semantic 3D motion transfer and demonstrate superior motion fidelity and structural consistency compared to adapted baselines. Code and data for this paper available at https://gsgd-motiontransfer.github.io/
Abstract（参考訳）: マルチビュー映像からの3D動画のセマンティック・モーション・トランスファーのための新しいアプローチであるGaussian See,Gaussian Doを提示する。本手法は,意味的に意味のある対応を持つオブジェクト間のリグフリーなクロスカテゴリ動作転送を可能にする。暗黙のモーショントランスファー技術に基づいて、条件インバージョンによりソースビデオから動き埋め込みを抽出し、静的なターゲット形状の描画フレームに適用し、その結果の動画を用いて動的3Dガウス分割再構成を監督する。提案手法では, アンカーをベースとしたモーション埋め込み機構を導入し, クロスビューの整合性を確保し, 収束の促進を図るとともに, ノイズの多い監視映像を集約する頑健な4D再構成パイプラインを提案する。セマンティックな3次元動き伝達のための最初のベンチマークを確立し, 適応ベースラインよりも優れた動きの忠実度と構造的整合性を示す。この論文のコードとデータはhttps://gsgd-motiontransfer.github.io/で公開されている。

論文の概要: Gaussian See, Gaussian Do: Semantic 3D Motion Transfer from Multiview Video

関連論文リスト