Fugu-MT 論文翻訳(概要): Vista4D: Video Reshooting with 4D Point Clouds

論文の概要: Vista4D: Video Reshooting with 4D Point Clouds

arxiv url: http://arxiv.org/abs/2604.21915v1
Date: Thu, 23 Apr 2026 17:57:28 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-24 14:40:06.816519
Title: Vista4D: Video Reshooting with 4D Point Clouds
Title（参考訳）: Vista4D: 4Dポイントクラウドを使ったビデオリシューティング
Authors: Kuan Heng Lin, Zhizheng Liu, Pablo Salamanca, Yash Kant, Ryan Burgert, Yuancheng Xu, Koichi Namekata, Yiwei Zhao, Bolei Zhou, Micah Goldblum, Paul Debevec, Ning Yu,
Abstract要約: Vista4Dは、入力されたビデオとターゲットカメラを4Dポイントクラウドにグラウンドするビデオリシューティングフレームワークである。現状のベースラインと比較して、4Dの一貫性、カメラ制御、視覚的品質が改善された。本手法は,動的シーン展開や4次元シーン再構成といった実世界の応用に一般化する。
参考スコア（独自算出の注目度）: 60.22650781486302
License: http://creativecommons.org/licenses/by/4.0/
Abstract: We present Vista4D, a robust and flexible video reshooting framework that grounds the input video and target cameras in a 4D point cloud. Specifically, given an input video, our method re-synthesizes the scene with the same dynamics from a different camera trajectory and viewpoint. Existing video reshooting methods often struggle with depth estimation artifacts of real-world dynamic videos, while also failing to preserve content appearance and failing to maintain precise camera control for challenging new trajectories. We build a 4D-grounded point cloud representation with static pixel segmentation and 4D reconstruction to explicitly preserve seen content and provide rich camera signals, and we train with reconstructed multiview dynamic data for robustness against point cloud artifacts during real-world inference. Our results demonstrate improved 4D consistency, camera control, and visual quality compared to state-of-the-art baselines under a variety of videos and camera paths. Moreover, our method generalizes to real-world applications such as dynamic scene expansion and 4D scene recomposition. See our project page for results, code, and models: https://eyeline-labs.github.io/Vista4D
Abstract（参考訳）: 入力ビデオとターゲットカメラを4Dポイントのクラウドに接地する、堅牢で柔軟なビデオリシューティングフレームワークであるVista4Dを紹介する。具体的には、入力映像が与えられた場合、異なるカメラの軌跡と視点から同じダイナミックスでシーンを再合成する。既存のビデオ再撮影手法は、実世界のダイナミックビデオの深さ推定アーチファクトに苦労するが、コンテンツの外観を保たず、新しいトラジェクトリに挑戦するための正確なカメラ制御の維持に失敗する。静的なピクセルセグメンテーションと4D再構成を併用した4Dグラウンドのポイントクラウド表現を構築し,目に見えるコンテンツを明示的に保存し,リッチなカメラ信号を提供する。その結果、様々なビデオやカメラパスにおける最先端のベースラインと比較して、4Dの一貫性、カメラ制御、視覚的品質が改善された。さらに,本手法は動的シーン展開や4次元シーン再構成といった実世界の応用に一般化する。結果、コード、モデルについては、プロジェクトページを参照してください。

論文の概要: Vista4D: Video Reshooting with 4D Point Clouds

関連論文リスト