Fugu-MT 論文翻訳(概要): Cross-Vehicle 3D Geometric Consistency for Self-Supervised Surround Depth Estimation on Articulated Vehicles

論文の概要: Cross-Vehicle 3D Geometric Consistency for Self-Supervised Surround Depth Estimation on Articulated Vehicles

arxiv url: http://arxiv.org/abs/2604.02639v1
Date: Fri, 03 Apr 2026 02:10:11 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-06 17:20:24.276184
Title: Cross-Vehicle 3D Geometric Consistency for Self-Supervised Surround Depth Estimation on Articulated Vehicles
Title（参考訳）: 人工車両の自己監督下地深度推定のためのクロスサイクル3次元幾何学的整合性
Authors: Weimin Liu, Jiyuan Qiu, Wenjun Wang, Joshua H. Meng,
Abstract要約: そこで本研究では,車体の周囲視深度推定のための自己教師型フレームワークを提案する。構造コヒーレンスを改善するために,多視点空間拡張戦略とクロスビュー表面正規制約を導入する。提案手法を検証するために,その上で収集したデータセットを用いて車載実験プラットフォームを構築した。
参考スコア（独自算出の注目度）: 3.512352303010361
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Surround depth estimation provides a cost-effective alternative to LiDAR for 3D perception in autonomous driving. While recent self-supervised methods explore multi-camera settings to improve scale awareness and scene coverage, they are primarily designed for passenger vehicles and rarely consider articulated vehicles or robotics platforms. The articulated structure introduces complex cross-segment geometry and motion coupling, making consistent depth reasoning across views more challenging. In this work, we propose \textbf{ArticuSurDepth}, a self-supervised framework for surround-view depth estimation on articulated vehicles that enhances depth learning through cross-view and cross-vehicle geometric consistency guided by structural priors from vision foundation model. Specifically, we introduce multi-view spatial context enrichment strategy and a cross-view surface normal constraint to improve structural coherence across spatial and temporal contexts. We further incorporate camera height regularization with ground plane-awareness to encourage metric depth estimation, together with cross-vehicle pose consistency that bridges motion estimation between articulated segments. To validate our proposed method, an articulated vehicle experiment platform was established with a dataset collected over it. Experiment results demonstrate state-of-the-art (SoTA) performance of depth estimation on our self-collected dataset as well as on DDAD, nuScenes, and KITTI benchmarks.
Abstract（参考訳）: 周深度推定は、自動運転における3次元認識のためのLiDARに代わる費用対効果を提供する。最近の自己監督的手法は、スケールの認識とシーンのカバレッジを改善するためにマルチカメラ設定を探索するが、それらは主に旅客車両用に設計されており、関節車やロボットプラットフォームをめったに考慮しない。明瞭な構造は複雑な断面幾何学と運動結合を導入し、ビュー間の一貫した深度推論をより困難にする。本研究では,視覚基盤モデルから導かれる構造的先行性によって導かれるクロスビューおよびクロス車両の幾何的整合性を通じて深度学習を向上する,関節車上でのサラウンドビュー深度推定のための自己教師型フレームワークである「textbf{ArticuSurDepth}」を提案する。具体的には、空間的および時間的コンテキスト間の構造的コヒーレンスを改善するために、多視点空間的エンリッチメント戦略とクロスビュー表面正規制約を導入する。さらに、地上面認識によるカメラ高さの正則化を取り入れ、距離深度推定を奨励し、車間ポーズの一貫性を保ち、セグメント間の動き推定を橋渡しする。提案手法を検証するために,その上で収集したデータセットを用いて車載実験プラットフォームを構築した。実験結果は、DDAD、nuScenes、KITTIベンチマークと同様に、自己コンパイルしたデータセットの深さ推定の最先端(SoTA)性能を示す。

論文の概要: Cross-Vehicle 3D Geometric Consistency for Self-Supervised Surround Depth Estimation on Articulated Vehicles

関連論文リスト