Fugu-MT 論文翻訳(概要): RPG360: Robust 360 Depth Estimation with Perspective Foundation Models and Graph Optimization

論文の概要: RPG360: Robust 360 Depth Estimation with Perspective Foundation Models and Graph Optimization

arxiv url: http://arxiv.org/abs/2509.23991v1
Date: Sun, 28 Sep 2025 17:33:12 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-30 22:32:19.575132
Title: RPG360: Robust 360 Depth Estimation with Perspective Foundation Models and Graph Optimization
Title（参考訳）: RPG360: パースペクティブファンデーションモデルとグラフ最適化によるロバスト360深さ推定
Authors: Dongki Jung, Jaehoon Choi, Yonghan Lee, Dinesh Manocha,
Abstract要約: RPG360は、トレーニング不要のロバストな360度モノクル深度推定法である。グラフに基づく最適化を用いた新しい深度スケールアライメント手法を提案する。提案手法は,Matterport3D,Stanford2D3D,360Locなど,多様なデータセットにまたがる優れた性能を実現する。
参考スコア（独自算出の注目度）: 48.99932182976206
License: http://creativecommons.org/licenses/by/4.0/
Abstract: The increasing use of 360 images across various domains has emphasized the need for robust depth estimation techniques tailored for omnidirectional images. However, obtaining large-scale labeled datasets for 360 depth estimation remains a significant challenge. In this paper, we propose RPG360, a training-free robust 360 monocular depth estimation method that leverages perspective foundation models and graph optimization. Our approach converts 360 images into six-face cubemap representations, where a perspective foundation model is employed to estimate depth and surface normals. To address depth scale inconsistencies across different faces of the cubemap, we introduce a novel depth scale alignment technique using graph-based optimization, which parameterizes the predicted depth and normal maps while incorporating an additional per-face scale parameter. This optimization ensures depth scale consistency across the six-face cubemap while preserving 3D structural integrity. Furthermore, as foundation models exhibit inherent robustness in zero-shot settings, our method achieves superior performance across diverse datasets, including Matterport3D, Stanford2D3D, and 360Loc. We also demonstrate the versatility of our depth estimation approach by validating its benefits in downstream tasks such as feature matching 3.2 ~ 5.4% and Structure from Motion 0.2 ~ 9.7% in AUC@5.
Abstract（参考訳）: 様々な領域にまたがる360度画像の利用の増加は、全方位画像に適した頑健な深度推定技術の必要性を強調している。しかし、360度深度推定のための大規模ラベル付きデータセットを取得することは大きな課題である。本稿では,視点基礎モデルとグラフ最適化を利用したトレーニング不要なロバストな360度モノクロ深度推定法であるRPG360を提案する。提案手法では,360度画像を6面の立方体図表現に変換する。立方体マップの異なる面にまたがる深度スケールの不整合に対処するために,グラフベース最適化を用いた新しい深度スケールアライメント手法を導入する。この最適化により、6面の立方体マップの奥行きスケールの整合性が確保され、3次元構造的整合性が維持される。さらに,ファウンデーションモデルはゼロショット設定に固有のロバスト性を示すため,Matterport3D,Stanford2D3D,360Locなど,さまざまなデータセットで優れたパフォーマンスを実現する。また,特徴マッチング3.2～5.4%,動作構造0.2～9.7%のAUC@5.7%といった下流タスクにおいて,その利点を検証することで,深度推定手法の汎用性を実証した。

論文の概要: RPG360: Robust 360 Depth Estimation with Perspective Foundation Models and Graph Optimization

関連論文リスト