Fugu-MT Paper Translation (Abstract): Geo-ID: Test-Time Geometric Consensus for Cross-View Consistent Intrinsics

Paper overview: Geo-ID: Test-Time Geometric Consensus for Cross-View Consistent Intrinsics

arxiv url: http://arxiv.org/abs/2603.13859v1
Date: Sat, 14 Mar 2026 09:36:27 GMT
Status: Translation complete
Last updated in system: 2026-03-17 16:19:35.452545
Title: Geo-ID: Test-Time Geometric Consensus for Cross-View Consistent Intrinsics
Authors: Alara Dirik, Stefanos Zafeiriou
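Abstract summary: Intrinsic image decomposition aims to estimate physically based rendering parameters such as albedo, roughness, and metallicity from images. Video-based models can improve cross-frame consistency but require dense, ordered sequences and substantial compute. We propose Geo-ID, a novel test-time framework that repurposes pretrained single-view intrinsic predictors to produce cross-view consistent decompositions.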
Reference score (independently computed attention measure): 37.614964138575935
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Intrinsic image decomposition aims to estimate physically based rendering (PBR) parameters such as albedo, roughness, and metallicity from images. While recent methods achieve strong single-view predictions, applying them independently to multiple views of the same scene often yields inconsistent estimates, limiting their use in downstream applications such as editable neural scenes and 3D reconstruction. Video-based models can improve cross-frame consistency but require dense, ordered sequences and substantial compute, limiting their applicability to sparse, unordered image collections. We propose Geo-ID, a novel test-time framework that repurposes pretrained single-view intrinsic predictors to produce cross-view consistent decompositions by coupling independent per-view predictions through sparse geometric correspondences that form uncertainty-aware consensus targets. Geo-ID is model-agnostic, requires no retraining or inverse rendering, and applies directly to off-the-shelf intrinsic predictors. Experiments on synthetic benchmarks and real-world scenes demonstrate substantial improvements in cross-view intrinsic consistency as the number of views increases, while maintaining comparable single-view decomposition performance. We further show that the resulting consistent intrinsics enable coherent appearance editing and relighting in downstream neural scene representations.
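The abstract does not spell out how the uncertainty-aware consensus targets are formed. As a rough illustration only, the sketch below fuses per-view predictions at matched points with inverse-variance weighting, a standard fusion rule swapped in here as an assumption and not confirmed to be the authors' method. The function name `consensus_targets`, the array shapes, and the use of a per-view variance as the uncertainty are all hypothetical.

```python
import numpy as np


def consensus_targets(preds, variances, eps=1e-8):
    """Fuse per-view predictions at matched points (hypothetical helper).

    preds:     (V, N, C) finite intrinsic values (e.g. albedo RGB) sampled
               at N correspondences in each of V views.
    variances: (V, N) per-view uncertainty; np.inf marks a point that is
               not visible in that view. Each point is assumed to be
               visible in at least one view.
    Returns an (N, C) array of consensus targets.
    """
    preds = np.asarray(preds, dtype=float)
    variances = np.asarray(variances, dtype=float)
    # Inverse-variance weights: confident views count more, and an
    # infinite variance yields a weight of exactly zero.
    weights = 1.0 / (variances + eps)
    # Normalize over the view axis so the weights for each point sum to 1.
    weights = weights / weights.sum(axis=0, keepdims=True)
    # Weighted average over views, broadcast across the channel axis.
    return (weights[..., None] * preds).sum(axis=0)


# Two views, one matched point, one channel. The second view is three
# times less certain, so the target is pulled toward the first view.
preds = np.array([[[0.2]], [[0.8]]])
variances = np.array([[1.0], [3.0]])
target = consensus_targets(preds, variances)
```

In a test-time setup along the lines the abstract describes, targets like these could serve as the values each view's prediction is nudged toward at the matched pixels. How Geo-ID actually applies them is not stated in the abstract.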


Related papers