Fugu-MT 論文翻訳(概要): Leveling3D: Leveling Up 3D Reconstruction with Feed-Forward 3D Gaussian Splatting and Geometry-Aware Generation

論文の概要: Leveling3D: Leveling Up 3D Reconstruction with Feed-Forward 3D Gaussian Splatting and Geometry-Aware Generation

arxiv url: http://arxiv.org/abs/2603.16211v1
Date: Tue, 17 Mar 2026 07:40:45 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-18 17:42:07.155668
Title: Leveling3D: Leveling Up 3D Reconstruction with Feed-Forward 3D Gaussian Splatting and Geometry-Aware Generation
Title（参考訳）: Leveling 3D: Leveling up 3D Reconstruction with Feed-forward 3D Gaussian Splatting and Geometry-Aware Generation
Authors: Yiming Huang, Baixiang Huang, Beilei Cui, Chi Kit Ng, Long Bai, Hongliang Ren,
Abstract要約: 本稿では, フィードフォワード3次元再構成と幾何一貫性生成を統合した新しいパイプラインであるLeveling3Dを紹介する。我々は,新規ビュー合成や深度推定などのタスクを含む,公開データセット上でのSOTA性能を実現する。
参考スコア（独自算出の注目度）: 15.735997729565987
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Feed-forward 3D reconstruction has revolutionized 3D vision, providing a powerful baseline for downstream tasks such as novel-view synthesis with 3D Gaussian Splatting. Previous works explore fixing the corrupted rendering results with a diffusion model. However, they lack geometric concern and fail at filling the missing area on the extrapolated view. In this work, we introduce Leveling3D, a novel pipeline that integrates feed-forward 3D reconstruction with geometrical-consistent generation to enable holistic simultaneous reconstruction and generation. We propose a geometry-aware leveling adapter, a lightweight technique that aligns internal knowledge in the diffusion model with the geometry prior from the feed-forward model. The leveling adapter enables generation on the artifact area of the extrapolated novel views caused by underconstrained regions of the 3D representation. Specifically, to learn a more diverse distributed generation, we introduce the palette filtering strategy for training, and a test-time masking refinement to prevent messy boundaries along the fixing regions. More importantly, the enhanced extrapolated novel views from Leveling3D could be used as the inputs for feed-forward 3DGS, leveling up the 3D reconstruction. We achieve SOTA performance on public datasets, including tasks such as novel-view synthesis and depth estimation.
Abstract（参考訳）: フィードフォワード3D再構成は3D視覚に革命をもたらし、3Dガウススプラッティングを用いたノベルビュー合成のような下流タスクの強力なベースラインを提供する。従来の研究は、拡散モデルを用いて、劣化したレンダリング結果の修正を検討していた。しかし、幾何学的な懸念がなく、外挿されたビューの欠落した領域を埋めることに失敗した。本研究では, フィードフォワード3次元再構成と幾何整合生成を統合し, 全体的同時再構成と生成を可能にする新しいパイプラインであるLeveling3Dを紹介する。本稿では,拡散モデルにおける内部知識と,フィードフォワードモデルに先行する幾何学的知識を整合させる軽量な手法である幾何対応レベリングアダプタを提案する。レベル付けアダプタは、3D表現の制約の少ない領域によって引き起こされる外挿された新規ビューのアーティファクト領域の生成を可能にする。具体的には、より多様な分散世代を学習するために、トレーニングのためのパレットフィルタリング戦略と、固定領域の乱れを防止するためのテストタイムマスキング改善を導入する。さらに重要なことに、Leveling3Dの強化された外挿された新しいビューはフィードフォワード3DGSの入力として使用することができ、3D再構成のレベルアップを実現した。我々は,新規ビュー合成や深度推定などのタスクを含む,公開データセット上でのSOTA性能を実現する。

論文の概要: Leveling3D: Leveling Up 3D Reconstruction with Feed-Forward 3D Gaussian Splatting and Geometry-Aware Generation

関連論文リスト