Fugu-MT Paper Translation (Abstract): StableIntrinsic: Detail-preserving One-step Diffusion Model for Multi-view Material Estimation

Paper overview: StableIntrinsic: Detail-preserving One-step Diffusion Model for Multi-view Material Estimation

arxiv url: http://arxiv.org/abs/2508.19789v1
Date: Wed, 27 Aug 2025 11:15:55 GMT
Status: Translation complete
Last updated in system: 2025-08-28 19:07:41.607514
Title: StableIntrinsic: Detail-preserving One-step Diffusion Model for Multi-view Material Estimation
Title (reference translation): Stable Intrinsic: A Detail-Preserving One-Step Diffusion Model for Multi-View Material Estimation
Authors: Xiuchao Wu, Pengfei Zhu, Jiangjing Lyu, Xinguo Liu, Jie Guo, Yanwen Guo, Weiwei Xu, Chengfei Lyu
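Abstract summary: This paper introduces StableIntrinsic, a one-step diffusion model for multi-view material estimation. To address the over-smoothing problem in one-step diffusion, StableIntrinsic applies losses in pixel space. It also introduces a Detail Injection Network (DIN) to eliminate the detail loss caused by VAE encoding.
Reference score (independently computed attention score): 36.79338202811421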
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Recovering material information from images has been extensively studied in computer graphics and vision. Recent works in material estimation leverage diffusion models and show promising results. However, these diffusion-based methods adopt a multi-step denoising strategy, which makes each estimation time-consuming. Such stochastic inference also conflicts with the deterministic material estimation task, leading to high-variance estimates. In this paper, we introduce StableIntrinsic, a one-step diffusion model for multi-view material estimation that can produce high-quality material parameters with low variance. To address the over-smoothing problem in one-step diffusion, StableIntrinsic applies losses in pixel space, with each loss designed based on the properties of the material. Additionally, StableIntrinsic introduces a Detail Injection Network (DIN) to eliminate the detail loss caused by VAE encoding, while further enhancing the sharpness of the material predictions. The experimental results indicate that our method surpasses the current state-of-the-art techniques, achieving a $9.9\%$ improvement in the Peak Signal-to-Noise Ratio (PSNR) of albedo and reducing the Mean Square Error (MSE) for metallic and roughness by $44.4\%$ and $60.0\%$, respectively.
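Abstract (reference translation): Methods for recovering material information from images have been studied extensively in computer graphics and vision. Recent work on material estimation leverages diffusion models, with promising results. However, these diffusion-based methods adopt a multi-step denoising strategy that makes each estimation time-consuming. Such stochastic inference also conflicts with the deterministic material estimation task and yields high-variance estimates. This paper introduces StableIntrinsic, a one-step diffusion model for multi-view material estimation. To address the over-smoothing problem in one-step diffusion, StableIntrinsic applies losses in pixel space, with each loss designed according to the properties of the material. In addition, StableIntrinsic introduces a Detail Injection Network (DIN) to eliminate the detail loss caused by VAE encoding and to further sharpen the material predictions. Experiments show that the method surpasses the current state of the art, improving the Peak Signal-to-Noise Ratio (PSNR) of albedo by 9.9% and reducing the Mean Square Error (MSE) for metallic and roughness by 44.4% and 60.0%, respectively.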
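
The abstract reports its results as a PSNR improvement for albedo and relative MSE reductions for metallic and roughness. As a reading aid, the following is a minimal sketch of how these two metrics and a relative change are commonly computed. It is not the authors' evaluation code: the function names `mse`, `psnr`, and `relative_change`, the assumption that map values lie in [0, 1], and all toy numbers are hypothetical.

```python
import math


def mse(pred, target):
    """Mean squared error between two equally sized sequences of floats."""
    if len(pred) != len(target) or not pred:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)


def psnr(pred, target, max_value=1.0):
    """PSNR in dB, assuming values lie in [0, max_value] (an assumption here)."""
    err = mse(pred, target)
    if err == 0:
        return math.inf  # identical inputs: PSNR is unbounded
    return 10.0 * math.log10(max_value ** 2 / err)


def relative_change(baseline, ours):
    """Signed relative change of `ours` with respect to `baseline`."""
    return (ours - baseline) / baseline


# Toy flattened maps with made-up values, not data from the paper.
target = [0.0, 0.5, 1.0, 0.5]
pred = [0.1, 0.5, 0.9, 0.5]

print(f"MSE:  {mse(pred, target):.4f}")
print(f"PSNR: {psnr(pred, target):.2f} dB")

# A hypothetical baseline MSE of 0.010 against 0.004 is a 60% reduction.
print(f"Relative MSE change: {relative_change(0.010, 0.004):+.1%}")
```

Higher PSNR and lower MSE both indicate a closer match to the reference map, which is why the abstract states one result as an improvement and the other two as reductions.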

List of related papers