Fugu-MT 論文翻訳(概要): One2Scene: Geometric Consistent Explorable 3D Scene Generation from a Single Image

論文の概要: One2Scene: Geometric Consistent Explorable 3D Scene Generation from a Single Image

arxiv url: http://arxiv.org/abs/2602.19766v1
Date: Mon, 23 Feb 2026 12:15:54 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-24 17:42:02.803539
Title: One2Scene: Geometric Consistent Explorable 3D Scene Generation from a Single Image
Title（参考訳）: One2Scene: 単一画像からの幾何学的一貫性のある探索可能な3Dシーン生成
Authors: Pengfei Wang, Liyi Chen, Zhiyuan Ma, Yanjun Guo, Guowen Zhang, Lei Zhang,
Abstract要約: textbf2Sceneは、不適切な問題を3つのトラクタブルなサブタスクに分解する効果的なフレームワークである。 One2Sceneは、パノラマ深度推定、フィードフォワード360再構成、探索可能な3Dシーン生成において、最先端の手法を大幅に上回っている。
参考スコア（独自算出の注目度）: 13.717660892718099
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Generating explorable 3D scenes from a single image is a highly challenging problem in 3D vision. Existing methods struggle to support free exploration, often producing severe geometric distortions and noisy artifacts when the viewpoint moves far from the original perspective. We introduce \textbf{One2Scene}, an effective framework that decomposes this ill-posed problem into three tractable sub-tasks to enable immersive explorable scene generation. We first use a panorama generator to produce anchor views from a single input image as initialization. Then, we lift these 2D anchors into an explicit 3D geometric scaffold via a generalizable, feed-forward Gaussian Splatting network. Instead of treating the panorama as a single image for reconstruction, we project it into multiple sparse anchor views and reformulate the reconstruction task as multi-view stereo matching, which allows us to leverage robust geometric priors learned from large-scale multi-view datasets. A bidirectional feature fusion module is used to enforce cross-view consistency, yielding an efficient and geometrically reliable scaffold. Finally, the scaffold serves as a strong prior for a novel view generator to produce photorealistic and geometrically accurate views at arbitrary cameras. By explicitly conditioning on a 3D-consistent scaffold to perform reconstruction, One2Scene works stably under large camera motions, supporting immersive scene exploration. Extensive experiments show that One2Scene substantially outperforms state-of-the-art methods in panorama depth estimation, feed-forward 360° reconstruction, and explorable 3D scene generation. Code and models will be released.
Abstract（参考訳）: 単一の画像から探索可能な3Dシーンを生成することは、3Dビジョンにおいて非常に難しい問題である。既存の手法は自由な探検を支援するのに苦労し、視点が本来の視点から遠く離れると、しばしば厳しい幾何学的歪みやノイズの多い人工物を生み出す。我々は,この不適切な問題を3つの抽出可能なサブタスクに分解して,没入可能な探索可能なシーン生成を可能にする,効果的なフレームワークである「textbf{One2Scene}」を紹介した。まず,パノラマ生成器を用いて1つの入力画像からアンカービューを初期化する。そして、これらの2次元アンカーを、一般化可能なフィードフォワードガウススプラッティングネットワークを介して、明示的な3次元幾何学的足場に持ち上げる。パノラマを再構成のための単一のイメージとして扱う代わりに、複数のスパースアンカービューに投影し、再構成タスクをマルチビューステレオマッチングとして再構成することで、大規模マルチビューデータセットから得られた堅牢な幾何学的事前情報を活用することができる。双方向のフィーチャーフュージョンモジュールは、効率的で幾何学的に信頼性の高い足場をもたらす、クロスビューの一貫性を強制するために使用される。最後に、足場は、任意のカメラでフォトリアリスティックで幾何学的に正確なビューを生成する新しいビュージェネレータの強力な先駆けとなる。 One2Sceneは3D一貫性のある足場を明示的に調整することで、大きなカメラの動きの下で安定して動作し、没入型シーン探索をサポートする。大規模な実験により、One2Sceneはパノラマ深度推定、フィードフォワード360度再構成、探索可能な3Dシーン生成において、最先端の手法を大幅に上回っていることが示された。コードとモデルはリリースされる。

論文の概要: One2Scene: Geometric Consistent Explorable 3D Scene Generation from a Single Image

関連論文リスト