Fugu-MT 論文翻訳(概要): Dynamic View Synthesis from Small Camera Motion Videos

論文の概要: Dynamic View Synthesis from Small Camera Motion Videos

arxiv url: http://arxiv.org/abs/2506.23153v1
Date: Sun, 29 Jun 2025 09:17:55 GMT
ステータス: 翻訳完了
システム内更新日: 2025-07-01 21:27:53.748859
Title: Dynamic View Synthesis from Small Camera Motion Videos
Title（参考訳）: 小型カメラモーションビデオからのダイナミックビュー合成
Authors: Huiqiang Sun, Xingyi Li, Juewen Peng, Liao Shen, Zhiguo Cao, Ke Xian, Guosheng Lin,
Abstract要約: 分布に基づく奥行き正規化に基づく動的3ドルDシーンのための新しいビュー合成を提案する。また、光線に沿った物体境界がほぼゼロとなる前に空間点の体積密度を強制する制約を導入し、我々のモデルがシーンの正確な幾何学を学べるようにした。我々は,小さなカメラモーション入力によるシーン表現におけるアプローチの有効性を実証するために,広範囲な実験を行い,その結果を最先端の手法と比較した。
参考スコア（独自算出の注目度）: 56.359460602781304
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Novel view synthesis for dynamic $3$D scenes poses a significant challenge. Many notable efforts use NeRF-based approaches to address this task and yield impressive results. However, these methods rely heavily on sufficient motion parallax in the input images or videos. When the camera motion range becomes limited or even stationary (i.e., small camera motion), existing methods encounter two primary challenges: incorrect representation of scene geometry and inaccurate estimation of camera parameters. These challenges make prior methods struggle to produce satisfactory results or even become invalid. To address the first challenge, we propose a novel Distribution-based Depth Regularization (DDR) that ensures the rendering weight distribution to align with the true distribution. Specifically, unlike previous methods that use depth loss to calculate the error of the expectation, we calculate the expectation of the error by using Gumbel-softmax to differentiably sample points from discrete rendering weight distribution. Additionally, we introduce constraints that enforce the volume density of spatial points before the object boundary along the ray to be near zero, ensuring that our model learns the correct geometry of the scene. To demystify the DDR, we further propose a visualization tool that enables observing the scene geometry representation at the rendering weight level. For the second challenge, we incorporate camera parameter learning during training to enhance the robustness of our model to camera parameters. We conduct extensive experiments to demonstrate the effectiveness of our approach in representing scenes with small camera motion input, and our results compare favorably to state-of-the-art methods.
Abstract（参考訳）: ダイナミックな3ドル(約3,300円)のシーンのための新しいビュー合成は、大きな課題となる。多くの注目すべき取り組みは、この課題に対処し、印象的な結果を得るためにNeRFベースのアプローチを使用している。しかし、これらの手法は入力画像やビデオの運動パララックスに大きく依存している。カメラの動き範囲が制限されたり、静止状態になったりした場合(すなわち、小さなカメラの動き)、既存の手法では、シーン形状の誤った表現とカメラパラメータの不正確な推定の2つの主要な課題に直面する。これらの課題は、事前の手法が満足のいく結果を生み出すのに苦労したり、無効になったりする。最初の課題に対処するため,本研究では,真の分布と整合するレンダリング重み分布を確実にするDDR(Distributed-based Depth Regularization)を提案する。具体的には、予測誤差を計算するために奥行き損失を用いた従来の手法とは異なり、Gumbel-softmax を用いて離散的な描画重み分布から点を微分して誤差の予測を計算する。さらに、光線に沿った物体境界がゼロに近い前に空間点の体積密度を強制する制約を導入し、我々のモデルがシーンの正確な幾何学を学べるようにした。さらに,DDRをデミスティフィケートするために,レンダリングウェイトレベルでシーンの幾何学的表現を観察できる可視化ツールを提案する。第2の課題として、トレーニング中にカメラパラメータ学習を導入し、カメラパラメータに対するモデルの堅牢性を高める。我々は,小さなカメラモーション入力によるシーン表現におけるアプローチの有効性を実証するために,広範囲な実験を行い,その結果を最先端の手法と比較した。

論文の概要: Dynamic View Synthesis from Small Camera Motion Videos

関連論文リスト