Fugu-MT 論文翻訳(概要): Voyaging into Unbounded Dynamic Scenes from a Single View

論文の概要: Voyaging into Unbounded Dynamic Scenes from a Single View

arxiv url: http://arxiv.org/abs/2507.04183v1
Date: Sat, 05 Jul 2025 22:49:25 GMT
ステータス: 翻訳完了
システム内更新日: 2025-07-08 15:46:35.055973
Title: Voyaging into Unbounded Dynamic Scenes from a Single View
Title（参考訳）: 単一視点からの非有界なダイナミックシーンへのヴォイジング
Authors: Fengrui Tian, Tianjiao Ding, Jinqi Luo, Hancheng Min, René Vidal,
Abstract要約: そこで本稿では,動的シーン生成を動的コンテンツのシーン露光プロセスとして再構成するDynamicVoyagerを提案する。我々は、この部分的な映像を新しい視点でレンダリングし、点雲からの光コンテキストで映像を映し出し、3D一貫した動きを生成する。実験により、我々のモデルは、フライスルーカメラに沿って一貫した動きで、境界のないシーンを生成できることが示されている。
参考スコア（独自算出の注目度）: 31.85867311855001
License: http://creativecommons.org/licenses/by/4.0/
Abstract: This paper studies the problem of generating an unbounded dynamic scene from a single view, which has wide applications in augmented/virtual reality and robotics. Since the scene is changing over time, different generated views need to be consistent with the underlying 3D motions. While previous works learn such consistency by training from multiple views, the generated scene regions are bounded to be close to the training views with limited camera movements. To address this issue, we propose DynamicVoyager that reformulates the dynamic scene generation as a scene outpainting process for new dynamic content. As 2D outpainting models can hardly generate 3D consistent motions from only 2D pixels at a single view, we consider pixels as rays to enrich the pixel input with the ray context, so that the 3D motion consistency can be learned from the ray information. More specifically, we first map the single-view video input to a dynamic point cloud with the estimated video depths. Then we render the partial video at a novel view and outpaint the video with ray contexts from the point cloud to generate 3D consistent motions. We employ the outpainted video to update the point cloud, which is used for scene outpainting from future novel views. Experiments show that our model is able to generate unbounded scenes with consistent motions along fly-through cameras, and the generated contents can be controlled with scene prompts.
Abstract（参考訳）: 本稿では,拡張現実・仮想現実感・ロボット工学に広く応用されている,単一の視点から非有界なダイナミックシーンを生成する問題について考察する。シーンは時間とともに変化しているので、異なる生成されたビューは、基礎となる3Dモーションと一致する必要がある。過去の研究では、複数の視点からトレーニングすることで、このような一貫性を学習する一方で、生成されたシーン領域は、カメラの動きに制限のあるトレーニングビューに近いように制限されている。この問題に対処するため,我々は動的シーン生成を動的コンテンツのシーン出力プロセスとして再構成するDynamicVoyagerを提案する。 2Dアウトパインティングモデルでは、単一の視点で2Dピクセルのみから3D一貫した動きを生成できないため、画素を光線情報から3D一貫した動きを学習できるように、画素を光線入力をリッチ化するための光線とみなす。より具体的には、単視点ビデオ入力を推定されたビデオ深度で動的ポイントクラウドにマッピングする。そして、その部分的な映像を新しい視点でレンダリングし、点雲からの光コンテキストで映像を映し出し、3D一貫した動きを生成する。アウトペイントされたビデオを用いてポイントクラウドを更新する。実験により,本モデルでは,フライスルーカメラに沿って一貫した動きを持つ非有界シーンを生成でき,生成したコンテンツはシーンプロンプトで制御可能であることが示された。

論文の概要: Voyaging into Unbounded Dynamic Scenes from a Single View

関連論文リスト