Fugu-MT 論文翻訳(概要): SSGaussian: Semantic-Aware and Structure-Preserving 3D Style Transfer

論文の概要: SSGaussian: Semantic-Aware and Structure-Preserving 3D Style Transfer

arxiv url: http://arxiv.org/abs/2509.04379v1
Date: Thu, 04 Sep 2025 16:40:44 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-05 20:21:10.226229
Title: SSGaussian: Semantic-Aware and Structure-Preserving 3D Style Transfer
Title（参考訳）: SSGaussian:Semantic-Aware and Structure-Preserving 3D Style Transfer
Authors: Jimin Xu, Bosheng Qin, Tao Jin, Zhou Zhao, Zhenhui Ye, Jun Yu, Fei Wu,
Abstract要約: 本研究では,事前学習した2次元拡散モデルから事前知識を統合する新しい3次元移動パイプラインを提案する。パイプラインは2つの重要なステージで構成されています。 2つ目は、スタイリングされたキービュー間のインスタンスレベルの一貫性を効果的に活用し、それを3D表現に転送する、インスタンスレベルのスタイル転送である。
参考スコア（独自算出の注目度）: 57.723850794113055
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Recent advancements in neural representations, such as Neural Radiance Fields and 3D Gaussian Splatting, have increased interest in applying style transfer to 3D scenes. While existing methods can transfer style patterns onto 3D-consistent neural representations, they struggle to effectively extract and transfer high-level style semantics from the reference style image. Additionally, the stylized results often lack structural clarity and separation, making it difficult to distinguish between different instances or objects within the 3D scene. To address these limitations, we propose a novel 3D style transfer pipeline that effectively integrates prior knowledge from pretrained 2D diffusion models. Our pipeline consists of two key stages: First, we leverage diffusion priors to generate stylized renderings of key viewpoints. Then, we transfer the stylized key views onto the 3D representation. This process incorporates two innovative designs. The first is cross-view style alignment, which inserts cross-view attention into the last upsampling block of the UNet, allowing feature interactions across multiple key views. This ensures that the diffusion model generates stylized key views that maintain both style fidelity and instance-level consistency. The second is instance-level style transfer, which effectively leverages instance-level consistency across stylized key views and transfers it onto the 3D representation. This results in a more structured, visually coherent, and artistically enriched stylization. Extensive qualitative and quantitative experiments demonstrate that our 3D style transfer pipeline significantly outperforms state-of-the-art methods across a wide range of scenes, from forward-facing to challenging 360-degree environments. Visit our project page https://jm-xu.github.io/SSGaussian for immersive visualization.
Abstract（参考訳）: ニューラル・ラディアンス・フィールドや3Dガウス・スプラッティングといった最近の神経表現の進歩は、3Dシーンへのスタイル・トランスファーへの関心を高めている。既存の手法では、スタイルパターンを3D一貫性のあるニューラル表現に転送することができるが、参照スタイルイメージからハイレベルなスタイルセマンティクスを効果的に抽出し、転送することは困難である。さらに、スタイリングされた結果には構造的明瞭さや分離が欠如していることが多く、3Dシーン内の異なるインスタンスやオブジェクトの区別が難しい。これらの制約に対処するために,事前学習された2次元拡散モデルから事前知識を効果的に統合する新しい3Dスタイルのトランスファーパイプラインを提案する。パイプラインは2つの重要なステージで構成されています。次に,スタイル化されたキービューを3D表現に転送する。このプロセスには2つの革新的な設計が含まれている。ひとつはクロスビュースタイルのアライメントで、複数のキービューにまたがる機能インタラクションを可能にするUNetの最後のアップサンプリングブロックに、クロスビューの注意を挿入する。これにより、拡散モデルは、スタイルの忠実さとインスタンスレベルの一貫性の両方を維持する、スタイル化されたキービューを生成する。 2つ目は、スタイリングされたキービュー間のインスタンスレベルの一貫性を効果的に活用し、それを3D表現に転送する、インスタンスレベルのスタイル転送である。この結果、より構造化され、視覚的に一貫性があり、芸術的に豊かなスタイル化がもたらされる。大規模な定性的および定量的実験により、我々の3Dスタイルのトランスファーパイプラインは、前方から360度環境まで幅広い場面で最先端の手法を大幅に上回っていることが示された。没入型視覚化のためのプロジェクトページ https://jm-xu.github.io/SSGaussian をご覧ください。

論文の概要: SSGaussian: Semantic-Aware and Structure-Preserving 3D Style Transfer

関連論文リスト