Fugu-MT 論文翻訳(概要): Wonder3D++: Cross-domain Diffusion for High-fidelity 3D Generation from a Single Image

論文の概要: Wonder3D++: Cross-domain Diffusion for High-fidelity 3D Generation from a Single Image

arxiv url: http://arxiv.org/abs/2511.01767v1
Date: Mon, 03 Nov 2025 17:24:18 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-05 16:37:27.348116
Title: Wonder3D++: Cross-domain Diffusion for High-fidelity 3D Generation from a Single Image
Title（参考訳）: Wonder3D++: 単一画像からの高忠実度3D生成のためのクロスドメイン拡散
Authors: Yuxiao Yang, Xiao-Xiao Long, Zhiyang Dou, Cheng Lin, Yuan Liu, Qingsong Yan, Yuexin Ma, Haoqian Wang, Zhiqiang Wu, Wei Yin,
Abstract要約: 単一ビュー画像から高忠実なテクスチャメッシュを効率的に生成する新しい手法である textbfWonder3D++ を導入する。マルチビュー正規写像と対応するカラー画像を生成するクロスドメイン拡散モデルを提案する。最後に,多視点2次元表現から高品質な表面を粗い方法でわずか3ドル程度で駆動するカスケード3次元メッシュ抽出アルゴリズムを提案する。
参考スコア（独自算出の注目度）: 68.55613894952177
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: In this work, we introduce \textbf{Wonder3D++}, a novel method for efficiently generating high-fidelity textured meshes from single-view images. Recent methods based on Score Distillation Sampling (SDS) have shown the potential to recover 3D geometry from 2D diffusion priors, but they typically suffer from time-consuming per-shape optimization and inconsistent geometry. In contrast, certain works directly produce 3D information via fast network inferences, but their results are often of low quality and lack geometric details. To holistically improve the quality, consistency, and efficiency of single-view reconstruction tasks, we propose a cross-domain diffusion model that generates multi-view normal maps and the corresponding color images. To ensure the consistency of generation, we employ a multi-view cross-domain attention mechanism that facilitates information exchange across views and modalities. Lastly, we introduce a cascaded 3D mesh extraction algorithm that drives high-quality surfaces from the multi-view 2D representations in only about $3$ minute in a coarse-to-fine manner. Our extensive evaluations demonstrate that our method achieves high-quality reconstruction results, robust generalization, and good efficiency compared to prior works. Code available at https://github.com/xxlong0/Wonder3D/tree/Wonder3D_Plus.
Abstract（参考訳）: 本研究では,単一視点画像から高忠実度テクスチャメッシュを効率よく生成する新しい手法である \textbf{Wonder3D++} を紹介する。 SDS(Score Distillation Sampling)に基づく最近の手法は、2次元拡散前の3次元幾何を復元する可能性を示しているが、それらは通常、形状ごとの最適化と矛盾した幾何に時間を要する。対照的に、特定の作品は高速なネットワーク推論によって直接3D情報を生成するが、その結果は品質が低く、幾何学的詳細が欠けていることが多い。単一視点再構成タスクの品質,一貫性,効率性を均一に向上するために,多視点正規写像と対応するカラー画像を生成するクロスドメイン拡散モデルを提案する。生成の一貫性を確保するために、ビューとモダリティ間の情報交換を容易にする多視点クロスドメインアテンション機構を用いる。最後に,多視点2次元表現から高品質な表面を粗い方法でわずか3ドル程度で駆動するカスケード3次元メッシュ抽出アルゴリズムを提案する。提案手法は, 従来の方法と比較して, 高品質な再構築, 堅牢な一般化, 優れた効率性を実現していることを示す。コードはhttps://github.com/xxlong0/Wonder3D/tree/Wonder3D_Plusで公開されている。

論文の概要: Wonder3D++: Cross-domain Diffusion for High-fidelity 3D Generation from a Single Image

関連論文リスト