Fugu-MT 論文翻訳(概要): DeblurNVS: Geometric Latent Diffusion for Novel View Synthesis from Sparse Motion-Blurred Images

論文の概要: DeblurNVS: Geometric Latent Diffusion for Novel View Synthesis from Sparse Motion-Blurred Images

arxiv url: http://arxiv.org/abs/2606.01315v1
Date: Sun, 31 May 2026 16:04:55 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-02 21:34:29.578531
Title: DeblurNVS: Geometric Latent Diffusion for Novel View Synthesis from Sparse Motion-Blurred Images
Title（参考訳）: DeblurNVS:Sparse Motion-Blurred画像からの新しいビュー合成のための幾何学的潜在拡散
Authors: Changyue Shi, Wangbo Yu, Chaoran Feng, Li Yuan,
Abstract要約: 新しいビュー合成(NVS)はコンピュータビジョンとグラフィックスの基本的な問題である。ニューラルレイディアンスフィールド(NeRF),3次元ガウススプラッティング(3DGS)および生成ビュー合成の進歩により,その品質が大幅に向上した。スパース・モーション・ブルード画像から直接高忠実度ノベルビューを実現するための新しいフレームワークであるDeNVSを提案する。
参考スコア（独自算出の注目度）: 22.61939963916488
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Novel view synthesis (NVS) is a fundamental problem in computer vision and graphics. Recent advances in neural radiance fields (NeRF), 3D Gaussian Splatting (3DGS), and generative view synthesis have substantially improved its quality. Yet most methods still rely on clean observations, where image structures and cross-view geometric cues are well preserved. Motion blur breaks this assumption by corrupting local details and weakening multi-view correspondences. Such blur commonly arises from camera shake, scene motion, or finite exposure in practical capture. Blur-aware NVS methods address this degradation by modeling image formation, but their reliance on costly per-scene optimization limits efficient and generalizable sparse-view synthesis. To address this, we propose DeblurNVS, a novel framework for synthesizing high-fidelity novel views directly from sparse motion-blurred images, without requiring per-scene optimization. DeblurNVS restores the intermediate geometric representations needed for multi-view reasoning, enabling blurred inputs to recover reliable structure and correspondence cues. The restored representations are then combined with target camera information to synthesize the target-view representation and reconstruct a sharp RGB novel view. To enable the large-scale training, we construct a motion-blurred NVS dataset from DL3DV-10K using interpolation-based finite-exposure blur synthesis. Extensive experiments demonstrate that DeblurNVS outperforms existing baselines on synthetic motion-blur benchmarks and generalizes to real motion-blurred scenes, producing perceptually sharper and structurally more stable novel views while avoiding costly per-scene optimization. Project page: https://github.com/PKU-YuanGroup/DeblurNVS.
Abstract（参考訳）: 新しいビュー合成(NVS)はコンピュータビジョンとグラフィックスの基本的な問題である。ニューラルレイディアンスフィールド(NeRF),3次元ガウススプラッティング(3DGS)および生成ビュー合成の進歩により,その品質が大幅に向上した。しかし、ほとんどの手法は、画像構造や断面幾何学的手がかりがよく保存されているクリーンな観察に依存している。動きのぼかしはこの仮定を破り、局所的な詳細を破損させ、多視点対応を弱める。このようなぼやけは、カメラの揺れ、シーンの動き、あるいは実用的なキャプチャにおける有限露出から生じることが多い。 Blur-aware NVS法は画像形成をモデル化することでこの劣化に対処するが、コストのかかるシーンごとの最適化に依存するため、効率的で一般化可能なスパースビュー合成が可能である。そこで本稿では, シーンごとの最適化を必要とせず, スパース映像から直接高忠実度新規ビューを合成する新しいフレームワークであるDeblurNVSを提案する。 DeblurNVSは、マルチビュー推論に必要な中間幾何学的表現を復元し、ぼやけた入力で信頼性の高い構造と対応の手がかりを復元する。復元された表現は、ターゲットカメラ情報と組み合わせて、ターゲットビュー表現を合成し、シャープなRGBノベルビューを再構成する。大規模トレーニングを実現するために,補間に基づく有限露光ブラー合成を用いて,DL3DV-10Kから動きブルーのNVSデータセットを構築する。大規模な実験により、DeblurNVSは、合成モーションブルーのベンチマークで既存のベースラインを上回り、実際のモーションブルーのシーンに一般化し、コストの高いシーンごとの最適化を回避しつつ、知覚的にシャープで構造的により安定した新しいビューを生み出すことが示されている。プロジェクトページ:https://github.com/PKU-YuanGroup/DeblurNVS

論文の概要: DeblurNVS: Geometric Latent Diffusion for Novel View Synthesis from Sparse Motion-Blurred Images

関連論文リスト