Fugu-MT 論文翻訳(概要): Light of Normals: Unified Feature Representation for Universal Photometric Stereo

論文の概要: Light of Normals: Unified Feature Representation for Universal Photometric Stereo

arxiv url: http://arxiv.org/abs/2506.18882v4
Date: Sat, 04 Oct 2025 15:23:33 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-07 14:28:10.190901
Title: Light of Normals: Unified Feature Representation for Universal Photometric Stereo
Title（参考訳）: 普通人の光:ユニバーサル測光ステレオのための統一された特徴表現
Authors: Hong Li, Houyuan Chen, Chongjie Ye, Zhaoxi Chen, Bohan Li, Shaocong Xu, Xianda Guo, Xuhui Liu, Yikai Wang, Baochang Zhang, Satoshi Ikehata, Boxin Shi, Anyi Rao, Hao Zhao,
Abstract要約: 現在のエンコーダは、照明と通常の情報が切り離されていることを保証できない。 i) 点、方向、環境光を集約する光アライメント監督機能を備えた光レジスタトークン。また,PS-Verseも導入した。PS-Verseは,幾何学的複雑性と照明の多様性を指標とした大規模合成データセットである。
参考スコア（独自算出の注目度）: 69.95514862547174
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Universal photometric stereo (PS) is defined by two factors: it must (i) operate under arbitrary, unknown lighting conditions and (ii) avoid reliance on specific illumination models. Despite progress (e.g., SDM UniPS), two challenges remain. First, current encoders cannot guarantee that illumination and normal information are decoupled. To enforce decoupling, we introduce LINO UniPS with two key components: (i) Light Register Tokens with light alignment supervision to aggregate point, direction, and environment lights; (ii) Interleaved Attention Block featuring global cross-image attention that takes all lighting conditions together so the encoder can factor out lighting while retaining normal-related evidence. Second, high-frequency geometric details are easily lost. We address this with (i) a Wavelet-based Dual-branch Architecture and (ii) a Normal-gradient Perception Loss. These techniques yield a unified feature space in which lighting is explicitly represented by register tokens, while normal details are preserved via wavelet branch. We further introduce PS-Verse, a large-scale synthetic dataset graded by geometric complexity and lighting diversity, and adopt curriculum training from simple to complex scenes. Extensive experiments show new state-of-the-art results on public benchmarks (e.g., DiLiGenT, Luces), stronger generalization to real materials, and improved efficiency; ablations confirm that Light Register Tokens + Interleaved Attention Block drive better feature decoupling, while Wavelet-based Dual-branch Architecture + Normal-gradient Perception Loss recover finer details.
Abstract（参考訳）: ユニバーサル測光ステレオ(英: Universal Photometric stereo、PS)は、2つの因子によって定義される。 (i)任意の未知の照明条件下で動作し、 (ii)特定の照明モデルへの依存を避けること。進歩(SDM UniPSなど)にもかかわらず、2つの課題が残っている。第一に、現在のエンコーダは照明と通常の情報が切り離されていることを保証できない。疎結合を強制するために、LINO UniPSと2つの重要なコンポーネントを紹介します。一点、方向及び環境灯を集積する配光監督付灯台 2 インターリーブド注意ブロックは、すべての照明条件をまとめて、正常な証拠を保持しつつ、照明を消し去ることができる。第二に、高周波の幾何学的詳細は容易に失われる。私たちはこの問題に対処します (i)ウェーブレットに基づくデュアルブランチアーキテクチャと (ii)正常段階の知覚喪失。これらの技術は、ライトがレジスタトークンで明示的に表現されるような統一された特徴空間を生み出し、通常の詳細はウェーブレットブランチを介して保存される。さらに、PS-Verseは、幾何学的複雑さと照明の多様性によって評価された大規模な合成データセットであり、簡単なシーンから複雑なシーンまで、カリキュラムのトレーニングを取り入れている。大規模な実験では、公開ベンチマーク(例:DiLiGenT, Luces)における新しい最先端の結果、実際の材料へのより強力な一般化、効率の向上、Light Register Tokens + Interleaved Attention Blockによる機能分離の促進、WaveletベースのDual-branch Architecture + Normal-gradient Perception Lossによる細部の改善などが示されている。

論文の概要: Light of Normals: Unified Feature Representation for Universal Photometric Stereo

関連論文リスト