Fugu-MT Paper Translation (Abstract): FreeScale: Scaling 3D Scenes via Certainty-Aware Free-View Generation

Paper overview: FreeScale: Scaling 3D Scenes via Certainty-Aware Free-View Generation

arxiv url: http://arxiv.org/abs/2604.10512v1
Date: Sun, 12 Apr 2026 08:00:53 GMT
Status: Translation complete
Last updated in system: 2026-04-14 20:13:16.0661
Title: FreeScale: Scaling 3D Scenes via Certainty-Aware Free-View Generation
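Title (reference translation): FreeScale: Scaling 3D Scenes via Certainty-Aware Free-View Generation
Authors: Chenhan Jiang, Yu Chen, Qingwen Zhang, Jifei Song, Songcen Xu, Dit-Yan Yeung, Jiankang Deng
Abstract summary: FreeScale is a framework that transforms limited real-world image sequences into a scalable source of high-quality training data. The authors demonstrate its effectiveness by scaling up the training of feedforward NVS models, achieving a notable gain of 2.7 dB in PSNR. The work provides a practical and powerful data generation engine to overcome a fundamental bottleneck in 3D vision.
Reference score (independently computed attention score): 75.74617373156902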
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: The development of generalizable Novel View Synthesis (NVS) models is critically limited by the scarcity of large-scale training data featuring diverse and precise camera trajectories. While real-world captures are photorealistic, they are typically sparse and discrete. Conversely, synthetic data scales but suffers from a domain gap and often lacks realistic semantics. We introduce FreeScale, a novel framework that leverages the power of scene reconstruction to transform limited real-world image sequences into a scalable source of high-quality training data. Our key insight is that an imperfect reconstructed scene serves as a rich geometric proxy, but naively sampling from it amplifies artifacts. To this end, we propose a certainty-aware free-view sampling strategy identifying novel viewpoints that are both semantically meaningful and minimally affected by reconstruction errors. We demonstrate FreeScale's effectiveness by scaling up the training of feedforward NVS models, achieving a notable gain of 2.7 dB in PSNR on challenging out-of-distribution benchmarks. Furthermore, we show that the generated data can actively enhance per-scene 3D Gaussian Splatting optimization, leading to consistent improvements across multiple datasets. Our work provides a practical and powerful data generation engine to overcome a fundamental bottleneck in 3D vision. Project page: https://mvp-ai-lab.github.io/FreeScale.
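Abstract (reference translation): The development of generalizable Novel View Synthesis (NVS) models is severely limited by the shortage of large-scale training data with diverse and accurate camera trajectories. Real-world captures are photorealistic, but they are usually sparse and discrete. Synthetic data, by contrast, scales but suffers from a domain gap and often lacks realistic semantics. We introduce FreeScale, a new framework that uses the power of scene reconstruction to transform limited real-world image sequences into a scalable source of high-quality training data. Our key insight is that an imperfect reconstructed scene serves as a rich geometric proxy, but sampling from it naively amplifies artifacts. We therefore propose a certainty-aware free-view sampling strategy that identifies novel viewpoints that are semantically meaningful and minimally affected by reconstruction errors. We demonstrate the effectiveness of FreeScale by scaling up the training of feedforward NVS models, achieving a notable gain of 2.7 dB in PSNR on challenging out-of-distribution benchmarks. We further show that the generated data can actively enhance per-scene 3D Gaussian Splatting optimization, yielding consistent improvements across multiple datasets. Our work provides a practical and powerful data generation engine to overcome a fundamental bottleneck in 3D vision. Project page: https://mvp-ai-lab.github.io/FreeScale.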
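
To give the reported 2.7 dB PSNR gain a concrete reading, the sketch below uses the standard definition PSNR = 10 * log10(MAX^2 / MSE) to convert a PSNR difference into the corresponding reduction in mean squared error. It is not taken from the paper: the function names and the baseline MSE value are hypothetical, chosen only for illustration, and images are assumed to be scaled to [0, 1].

```python
import math


def psnr(mse: float, max_value: float = 1.0) -> float:
    """Return PSNR in dB for a mean squared error and a peak signal value."""
    if mse <= 0:
        raise ValueError("mse must be positive")
    return 10.0 * math.log10(max_value ** 2 / mse)


def mse_ratio_for_gain(gain_db: float) -> float:
    """Return the factor by which MSE shrinks for a PSNR gain of gain_db."""
    return 10.0 ** (gain_db / 10.0)


# Hypothetical baseline error, for illustration only (not a value from the paper).
baseline_mse = 0.004

# MSE that would correspond to a model scoring 2.7 dB higher than the baseline.
improved_mse = baseline_mse / mse_ratio_for_gain(2.7)

# Recover the gain from the two MSE values as a consistency check.
gain = psnr(improved_mse) - psnr(baseline_mse)

print(f"MSE shrinks by a factor of {mse_ratio_for_gain(2.7):.3f}")
print(f"PSNR gain recovered from the two MSE values: {gain:.2f} dB")
```

Under this definition, a 2.7 dB gain corresponds to an MSE roughly 1.86 times smaller, regardless of the baseline value chosen.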


Related papers