Fugu-MT 論文翻訳(概要): Two Experts Are Better Than One Generalist: Decoupling Geometry and Appearance for Feed-Forward 3D Gaussian Splatting

論文の概要: Two Experts Are Better Than One Generalist: Decoupling Geometry and Appearance for Feed-Forward 3D Gaussian Splatting

arxiv url: http://arxiv.org/abs/2603.21064v1
Date: Sun, 22 Mar 2026 05:14:38 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-24 19:11:39.220494
Title: Two Experts Are Better Than One Generalist: Decoupling Geometry and Appearance for Feed-Forward 3D Gaussian Splatting
Title（参考訳）: 2人のスペシャリストが1人のジェネラリストより優れている:フィードフォワード3Dガウスプラッティングの幾何学と外観の分離
Authors: Hwasik Jeong, Seungryong Lee, Gyeongjin Kang, Seungkwon Yang, Xiangyu Sun, Seungtae Nam, Eunbyung Park,
Abstract要約: 本稿では,ポーズフリーフィードフォワード3DGSフレームワークである2Xplatを紹介する。専用の幾何学の専門家が最初にカメラのポーズを予測し、3Dガウスを合成する強力な外見の専門家に明示的に渡される。その概念的単純さは先行研究で大半が過小評価されているにもかかわらず、提案手法は極めて効果的であることが証明されている。
参考スコア（独自算出の注目度）: 22.824154073395878
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Pose-free feed-forward 3D Gaussian Splatting (3DGS) has opened a new frontier for rapid 3D modeling, enabling high-quality Gaussian representations to be generated from uncalibrated multi-view images in a single forward pass. The dominant approach in this space adopts unified monolithic architectures, often built on geometry-centric 3D foundation models, to jointly estimate camera poses and synthesize 3DGS representations within a single network. While architecturally streamlined, such "all-in-one" designs may be suboptimal for high-fidelity 3DGS generation, as they entangle geometric reasoning and appearance modeling within a shared representation. In this work, we introduce 2Xplat, a pose-free feed-forward 3DGS framework based on a two-expert design that explicitly separates geometry estimation from Gaussian generation. A dedicated geometry expert first predicts camera poses, which are then explicitly passed to a powerful appearance expert that synthesizes 3D Gaussians. Despite its conceptual simplicity, being largely underexplored in prior works, the proposed approach proves highly effective. In fewer than 5K training iterations, the proposed two-experts pipeline substantially outperforms prior pose-free feed-forward 3DGS approaches and achieves performance on par with state-of-the-art posed methods. These results challenge the prevailing unified paradigm and suggest the potential advantages of modular design principles for complex 3D geometric estimation and appearance synthesis tasks.
Abstract（参考訳）: 高速な3次元モデリングのための新しいフロンティアを3DGS(Pose-free feed-forward 3D Gaussian Splatting)で公開した。この領域における支配的なアプローチは、しばしば幾何学中心の3D基礎モデルに基づいて構築される統一的なモノリシックアーキテクチャを採用し、カメラのポーズを共同で推定し、単一のネットワーク内で3DGS表現を合成する。アーキテクチャ的に合理化されているが、このようなオールインワンの設計は、幾何学的推論と外観モデリングを共有表現内で絡み合わせるため、高忠実な3DGS生成に最適である。本研究では,ポーズフリーフィードフォワード3DGSフレームワークである2Xplatを紹介する。専用の幾何学の専門家が最初にカメラのポーズを予測し、3Dガウスを合成する強力な外見の専門家に明示的に渡される。その概念的単純さは先行研究で大半が過小評価されているにもかかわらず、提案手法は極めて効果的であることが証明されている。 5Kのトレーニングイテレーション未満では、提案された2専門家パイプラインは、ポーズなしフィードフォワード3DGSアプローチよりも大幅に優れており、最先端の提案手法と同等のパフォーマンスを実現している。これらの結果は、一般的な統一パラダイムに挑戦し、複雑な3次元幾何推定および外観合成タスクのためのモジュラー設計原則の潜在的な利点を示唆している。

論文の概要: Two Experts Are Better Than One Generalist: Decoupling Geometry and Appearance for Feed-Forward 3D Gaussian Splatting

関連論文リスト