Fugu-MT 論文翻訳(概要): AnyStyle: Single-Pass Multimodal Stylization for 3D Gaussian Splatting

論文の概要: AnyStyle: Single-Pass Multimodal Stylization for 3D Gaussian Splatting

arxiv url: http://arxiv.org/abs/2602.04043v1
Date: Tue, 03 Feb 2026 22:19:58 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-05 19:45:11.29276
Title: AnyStyle: Single-Pass Multimodal Stylization for 3D Gaussian Splatting
Title（参考訳）: AnyStyle:3次元ガウス平滑化のためのシングルパスマルチモーダルスティル化
Authors: Joanna Kaleta, Bartosz Świrta, Kacper Kania, Przemysław Spurek, Marek Kowalski,
Abstract要約: フィードフォワード3D再構成・スタイリングフレームワークであるAnyStyleを導入し,マルチモーダルコンディショニングによるポーズフリー・ゼロショットスタイリングを実現する。本手法はテキスト入力と視覚入力の両方をサポートし,自然言語記述や参照画像を用いてシーンの外観を制御できる。実験により、AnyStyleは、高品質な幾何再構成を保ちつつ、フィードフォワードスタイル化手法よりもスタイル制御性を向上させることが示された。
参考スコア（独自算出の注目度）: 3.8078651836376007
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The growing demand for rapid and scalable 3D asset creation has driven interest in feed-forward 3D reconstruction methods, with 3D Gaussian Splatting (3DGS) emerging as an effective scene representation. While recent approaches have demonstrated pose-free reconstruction from unposed image collections, integrating stylization or appearance control into such pipelines remains underexplored. Existing attempts largely rely on image-based conditioning, which limits both controllability and flexibility. In this work, we introduce AnyStyle, a feed-forward 3D reconstruction and stylization framework that enables pose-free, zero-shot stylization through multimodal conditioning. Our method supports both textual and visual style inputs, allowing users to control the scene appearance using natural language descriptions or reference images. We propose a modular stylization architecture that requires only minimal architectural modifications and can be integrated into existing feed-forward 3D reconstruction backbones. Experiments demonstrate that AnyStyle improves style controllability over prior feed-forward stylization methods while preserving high-quality geometric reconstruction. A user study further confirms that AnyStyle achieves superior stylization quality compared to an existing state-of-the-art approach. Repository: https://github.com/joaxkal/AnyStyle.
Abstract（参考訳）: 高速でスケーラブルな3Dアセット作成への需要が高まり、フィードフォワード3D再構築手法への関心が高まり、3Dガウススプラッティング(3DGS)が効果的なシーン表現として登場した。最近のアプローチでは、ポーズなしのイメージコレクションからの復元が実証されているが、スタイリゼーションや外観制御をそのようなパイプラインに組み込むことは、まだ未定である。既存の試みは、制御性と柔軟性の両方を制限するイメージベースの条件付けに大きく依存している。本研究では,フィードフォワード3D再構成・スタイリングフレームワークであるAnyStyleを紹介し,マルチモーダルコンディショニングによるポーズフリー・ゼロショットスタイリングを実現する。本手法はテキスト入力と視覚入力の両方をサポートし,自然言語記述や参照画像を用いてシーンの外観を制御できる。本稿では,最小限のアーキテクチャ変更しか必要とせず,既存のフィードフォワード3D再構築バックボーンに統合可能なモジュール型スタイリングアーキテクチャを提案する。実験により、AnyStyleは、高品質な幾何再構成を保ちつつ、フィードフォワードスタイル化手法よりもスタイル制御性を向上させることが示された。ユーザ調査では、AnyStyleが既存の最先端アプローチに比べて優れたスタイリゼーション品質を達成していることも確認されている。リポジトリ: https://github.com/joaxkal/AnyStyle

論文の概要: AnyStyle: Single-Pass Multimodal Stylization for 3D Gaussian Splatting

関連論文リスト