Fugu-MT 論文翻訳(概要): XYZCylinder: Feedforward Reconstruction for Driving Scenes Based on A Unified Cylinder Lifting Method

論文の概要: XYZCylinder: Feedforward Reconstruction for Driving Scenes Based on A Unified Cylinder Lifting Method

arxiv url: http://arxiv.org/abs/2510.07856v1
Date: Thu, 09 Oct 2025 06:58:03 GMT
ステータス: 翻訳完了
システム内更新日: 2025-10-10 17:54:14.916136
Title: XYZCylinder: Feedforward Reconstruction for Driving Scenes Based on A Unified Cylinder Lifting Method
Title（参考訳）: XYZシリンダ:統一シリンダリフティング法による運転シーンのフィードフォワード再構成
Authors: Haochen Yu, Qiankun Liu, Hongyuan Liu, Jianfei Jiang, Juntao Lyu, Jiansheng Chen, Huimin Ma,
Abstract要約: 統一シリンダリフト法に基づくフィードフォワードモデルである textbfXYZ Cylinder を提案する。具体的には、視点に依存した空間対応の学習を避けるため、UCCM(Unified Cylinder Camera Modeling)戦略を設計する。再構成精度を向上させるために,新たに設計されたCylinder Plane Feature Groupに基づく複数の専用モジュールを用いたハイブリッド表現を提案する。
参考スコア（独自算出の注目度）: 27.213339282749885
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Recently, more attention has been paid to feedforward reconstruction paradigms, which mainly learn a fixed view transformation implicitly and reconstruct the scene with a single representation. However, their generalization capability and reconstruction accuracy are still limited while reconstructing driving scenes, which results from two aspects: (1) The fixed view transformation fails when the camera configuration changes, limiting the generalization capability across different driving scenes equipped with different camera configurations. (2) The small overlapping regions between sparse views of the $360^\circ$ panorama and the complexity of driving scenes increase the learning difficulty, reducing the reconstruction accuracy. To handle these difficulties, we propose \textbf{XYZCylinder}, a feedforward model based on a unified cylinder lifting method which involves camera modeling and feature lifting. Specifically, to improve the generalization capability, we design a Unified Cylinder Camera Modeling (UCCM) strategy, which avoids the learning of viewpoint-dependent spatial correspondence and unifies different camera configurations with adjustable parameters. To improve the reconstruction accuracy, we propose a hybrid representation with several dedicated modules based on newly designed Cylinder Plane Feature Group (CPFG) to lift 2D image features to 3D space. Experimental results show that XYZCylinder achieves state-of-the-art performance under different evaluation settings, and can be generalized to other driving scenes in a zero-shot manner. Project page: \href{https://yuyuyu223.github.io/XYZCYlinder-projectpage/}{here}.
Abstract（参考訳）: 近年,固定ビュー変換を暗黙的に学習し,単一の表現でシーンを再構築するフィードフォワード再構築パラダイムに注目が集まっている。しかし,その一般化能力と再現精度は,(1)カメラ構成が変化すると固定ビュー変換が失敗し,異なるカメラ構成の異なる運転シーンにまたがる一般化能力が制限されるという2つの側面から生じる。 2) パノラマ360^\circ$パノラマのスパースビューと運転シーンの複雑さの間の小さな重なり合う領域は学習困難を増大させ,再現精度を低下させる。これらの問題に対処するために,カメラモデリングと特徴持ち上げを含む統一シリンダー昇降法に基づくフィードフォワードモデルである「textbf{XYZCylinder}」を提案する。具体的には、一般化能力を改善するために、視点に依存した空間対応の学習を回避し、調整可能なパラメータで異なるカメラ構成を統一する統一シリンダカメラモデリング(UCCM)戦略を設計する。再構成精度を向上させるために,新たに設計されたCylinder Plane Feature Group (CPFG) に基づく複数の専用モジュールを用いたハイブリッド表現を提案する。実験結果から,XYZCylinderは異なる評価条件下での最先端性能を実現し,ゼロショット方式で他の運転シーンに一般化可能であることが示された。プロジェクトページ: \href{https://yuyu223.github.io/XYZCYlinder-projectpage/}{here}。

論文の概要: XYZCylinder: Feedforward Reconstruction for Driving Scenes Based on A Unified Cylinder Lifting Method

関連論文リスト